IBM Launches Granite 4.0 Hybrid AI Models With Lower Memory and Hardware Costs
2 Articles
2 Articles
IBM Launches Granite 4.0 Hybrid AI Models With Lower Memory and Hardware Costs
IBM has released Granite 4.0, its latest family of open large language models (LLMs), featuring a hybrid Mamba/transformer architecture to reduce memory requirements and hardware costs. The company announced the launch on October 2, 2025. According to IBM, Granite 4.0 models can run on significantly cheaper GPUs while maintaining performance. “Granite 4.0 features a new hybrid Mamba/transformer architecture that greatly reduces memory requiremen…
IBM Released new Granite 4.0 Models with a Novel Hybrid Mamba-2/Transformer Architecture: Drastically Reducing Memory Use without Sacrificing Performance
IBM just released Granite 4.0, an open-source LLM family that swaps monolithic Transformers for a hybrid Mamba-2/Transformer stack to cut serving memory while keeping quality. Sizes span a 3B dense “Micro,” a 3B hybrid “H-Micro,” a 7B hybrid MoE “H-Tiny” (~1B active), and a 32B hybrid MoE “H-Small” (~9B active). The models are Apache-2.0, cryptographically signed, and—per IBM—the first open models covered by an accredited ISO/IEC 42001:2023 AI m…
Coverage Details
Bias Distribution
- There is no tracked Bias information for the sources covering this story.
Factuality
To view factuality data please Upgrade to Premium