AI Archives

AI ◇ Cloud ◇ Enterprise

NetApp Expands Google Cloud Collaboration for Sovereign, Air-Gapped Deployments

by Harold Fritts on April 16, 2026

NetApp announced an expanded collaboration with Google Cloud, formalized through a four-year enterprise agreement to accelerate the deployment of NetApp storage within Google Distributed Cloud (GDC) Air-Gapped environments. Delivered with World Wide Technology (WWT), the offering targets sovereign cloud use cases that require strict data residency, security, and operational isolation. The joint solution integrates NetApp’s

AI ◇ Enterprise

Supermicro JumpStart Review: H14 with AMD Instinct MI350X

by Brian Beeler on April 13, 2026

Supermicro’s JumpStart program has established itself as one of the more useful tools in the pre-purchase evaluation toolkit for AI infrastructure. Rather than a scripted demo in a shared environment, JumpStart gives qualified users free, time-boxed, bare-metal access to real production servers via SSH, IPMI, and VNC, enabling them to run workloads on actual hardware.

AI ◇ Enterprise

AMD Instinct MI355X Achieves MLPerf Inference v6.0 Gains with Over 1 Million Tokens per Second and Supports Scalable ROCm Stack

by Harold Fritts on April 2, 2026

AMD has released its MLPerf Inference v6.0 results, positioning the Instinct MI355X GPU as a scalable inference platform across single-node, multinode, and heterogeneous deployments. The submission extends beyond incremental gains by adding new workloads, demonstrating cluster-scale throughput exceeding 1 million tokens per second, and validating reproducibility across a growing partner ecosystem. CDNA 4 Architecture Targets

AI ◇ Enterprise

NVIDIA Sets MLPerf Inference v6.0 Records with Blackwell Ultra Platform

by Harold Fritts on April 1, 2026

NVIDIA has published results for MLPerf Inference v6.0, highlighting system-level gains driven by tight co-design across hardware, software, and models. The company positions inference throughput and token economics as the primary metrics for AI factory performance, moving beyond peak accelerator specifications to measured output under real workloads. In this round, systems built on NVIDIA Blackwell

AI ◇ Enterprise

WEKA Integrates NeuralMesh with NVIDIA STX to Address AI Inference Memory Bottlenecks

by Harold Fritts on March 25, 2026

WEKA announced integration of its NeuralMesh platform with the NVIDIA STX reference architecture, positioning its Augmented Memory Grid as a core component for next-generation AI infrastructure. The combined solution targets one of the primary constraints in large-scale inference environments: memory limitations that impact performance, cost, and scalability. Running on NeuralMesh, WEKA’s Augmented Memory Grid extends

AI ◇ Enterprise

WEKA Announces General Availability of NeuralMesh AIDP

by Harold Fritts on March 25, 2026

WEKA has announced the general availability of its NeuralMesh AI Data Platform, an enterprise-focused, composable infrastructure designed for AI factory deployments. Built on the NVIDIA AI Data Platform reference architecture, NeuralMesh AIDP provides an integrated stack that delivers AI-ready data to production environments, with a focus on accelerating time-to-deployment for large-scale AI applications. The platform

AI ◇ Enterprise

IBM and NVIDIA Announce Expanded Partnership to Operationalize Enterprise AI

by Harold Fritts on March 23, 2026

At GTC 2026, IBM and NVIDIA announced a significant expansion of their more than decade-long partnership, focusing on moving AI from pilot phases to full-scale production. The collaboration targets several critical bottlenecks in enterprise AI adoption, including GPU-native data analytics, intelligent document processing, and infrastructure for regulated environments. The joint effort aims to provide a

AI ◇ Enterprise ◇ Enterprise Storage

VDURA Introduces RDMA and Context-Aware Tiering for AI Data Platforms at GTC 2026

by Harold Fritts on March 23, 2026

During GTC 2026, VDURA showcased updates to its Data Platform that improve GPU utilization and storage efficiency in AI environments. The announcement includes the general availability of Remote Direct Memory Access (RDMA), a preview of its Context-Aware Tiering technology, and validated infrastructure setups based on AMD EPYC Turin CPUs and NVIDIA ConnectX-7 networking. The updates

AI ◇ Enterprise

NVIDIA Groq 3 LPX: Everything we know

by Divyansh Jain on March 18, 2026

The LPU, or Language Processing Unit, is a custom AI inference accelerator designed and built by Groq, Inc. Founded in 2016 by Jonathan Ross, a former Google engineer credited as one of the original inventors of the TPU, Groq spent years developing a deterministic, software-defined processor architecture from the ground up. Unlike GPUs, which rely

AI ◇ Enterprise

HPE Expands NVIDIA AI Computing Portfolio with Scalable Private Cloud AI and Blackwell GPU Integration

by Harold Fritts on March 18, 2026

HPE has announced a significant expansion of the NVIDIA AI Computing by HPE portfolio, introducing integrated systems designed to scale enterprise AI deployments while maintaining security and governance. The update focuses on co-engineered, validated architectures intended to accelerate time-to-value for AI inferencing and model development. HPE CEO Antonio Neri and NVIDIA CEO Jensen Huang positioned