WEKA Maximizes Token Output With Lower Cost Per Token on NVIDIA BlueField-4 STX
NVIDIA's BlueField-4 STX boosts token throughput up to 6.5x and doubles data ingestion speed to reduce AI inference costs and improve GPU utilization.
- On Tuesday, March 17, 2026, at GTC 2026, NVIDIA unveiled the BlueField-4 STX reference architecture, promising 5x token throughput and 4x energy efficiency compared to traditional CPU-based storage for AI workloads.
- Ian Buck, NVIDIA's vice president of hyperscale and high-performance computing, noted that traditional data centers lack responsiveness for agentic AI; STX offloads storage data paths from the CPU to eliminate bottlenecks.
- WEKA announced its NeuralMesh software now integrates with NVIDIA STX, with Firmus Chief Technology Officer Daniel Kearney stating the integration delivers "up to 6.5x higher tokens per second and 4x faster TTFT at scale."
- Major storage partners including IBM and Dell Technologies are co-designing STX-based infrastructure, with platforms expected to arrive in the second half of 2026 from their vendor relationships.
- NVIDIA positions STX as the reference standard for storage infrastructure serving agentic AI, expecting the architecture to support most enterprise AI deployments within the next two to three years.
22 Articles
22 Articles
WEKA Maximizes Token Output With Lower Cost Per Token on NVIDIA BlueField-4 STX
NeuralMesh and Augmented Memory Grid Integration with NVIDIA STX Increases Token Production by 6.5x in the Same GPU Footprint, Slashing Cost of Inference for AI-Driven Organizations
Nvidia BlueField-4 STX adds a context memory layer to storage to close the agentic AI throughput gap
When an AI agent loses context mid-task because traditional storage can't keep pace with inference, it is not a model problem — it is a storage problem. At GTC 2026, Nvidia announced BlueField-4 STX, a modular reference architecture that inserts a dedicated context memory layer between GPUs and traditional storage, claiming 5x the token throughput, 4x the energy efficiency and 2x the data ingestion speed of conventional CPU-based storage.The bot…
Nvidia Unveils BlueField-4 STX Storage for Agentic AI at GTC 2026
Nvidia Unveils BlueField-4 STX Storage for Agentic AI at GTC 2026 nvidia Unveils BlueField-4 STX Storage for Agentic AI at GTC 2026 Editor’s note: This article synthesizes Nvidia’s established technologies (BlueField DPUs, DOCA, GPUDirect Storage, NVMe-oF) and industry best practices to explain the role a platform like ”BlueField-4 STX Storage” can play in agentic AI data pipelines. For precise specifications from GTC 2026, please refer …
Supermicro Among First to Unveil NVIDIA BlueField-4 STX Storage Server to Improve AI Inference Performance
Supermicro, Inc., a Total IT Solution Provider for AI, Cloud, Storage, and 5G/Edge, today unveiled one of the industry's first context memory (CMX) storage server as part of NVIDIA STX reference architecture announced at NVIDIA GTC 2026. STX is a new modular reference architecture from NVIDIA which is designed to accelerate the full lifecycle of AI. "Supermicro continues to be first to market with new rack scale architectures designed to exceed…
Nvidia GTC 2026: Nvidia Launches BlueField-4 STX Storage Architecture With Broad Industry Adoption
Summary: New Nvidia STX reference architecture provides up to 5x token throughput and up to 4x energy efficiency with 2x faster data ingestion Early adopters of STX for context memory storage include CoreWeave, Crusoe, IREN, Lambda, Mistral AI, Nebius, Oracle Cloud Infrastructure (OCI) and Vultr Storage providers and manufacturing partners are building infrastructure using Nvidia […] The post Nvidia GTC 2026: Nvidia Launches BlueField-4 STX Stor…
Coverage Details
Bias Distribution
- 88% of the sources are Center
Factuality
To view factuality data please Upgrade to Premium









