LLM Comparison
DiffusionGemma vs Granite 4.0 Micro
Side-by-side specs, pricing & capabilities · Updated June 2026
| Spec | DiffusionGemma | Granite 4.0 Micro |
|---|---|---|
| Organization | Google DeepMind | IBM |
| OpenTools Score | 38 | |
| Family | Gemma | Granite |
| Status | Current | Current |
| Release Date | Jun 2026 | Oct 2025 |
| Context Window | 256K tokens | 131K tokens |
| Input Price | Free | $0.02/M tokens |
| Output Price | Free | $0.11/M tokens |
| Pricing Notes | Open-weights model under Apache 2.0; API pricing depends on the host or local infrastructure used. | — |
| Capabilities | text, vision, code, reasoning, local-inference | text, code |
| Max Output | 256 tokens | — |
| API Identifier | google/diffusiongemma-26b-a4b-it | ibm-granite/granite-4.0-h-micro |
| Benchmarks | | |
| MMLU Pro | 77.6 (official Google model card) | — |
| GPQA Diamond | 73.2 (official Google model card) | — |
| LiveCodeBench v6 | 69.1 (official Google model card) | — |
| MMMLU | 81.5 (official Google model card) | — |
| HLE no tools | 11 (official Google model card) | — |
Cost Calculator
Enter your expected monthly token usage to compare costs.
| Model | Input | Output | Total / mo | vs Best |
|---|---|---|---|---|
| DiffusionGemma (cheapest) | $0.00 | $0.00 | $0.00 | — |
| Granite 4.0 Micro | $0.02 | $0.06 | $0.07 | +0% |
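The arithmetic behind a calculator like this can be sketched in a few lines of Python. This is a minimal illustration, not the site's actual implementation: the function names `monthly_cost` and `vs_best` are hypothetical, the prices are copied from the spec table, and the usage of 1M input and 500K output tokens per month is an assumption chosen only for the example.

```python
from decimal import Decimal

# Per-million-token prices in USD, copied from the comparison table.
# DiffusionGemma is listed as free (open weights); hosting costs are not modeled.
PRICES = {
    "DiffusionGemma": (Decimal("0.00"), Decimal("0.00")),
    "Granite 4.0 Micro": (Decimal("0.02"), Decimal("0.11")),
}

MILLION = Decimal(1_000_000)


def monthly_cost(model, input_tokens, output_tokens):
    """Return (input_cost, output_cost, total) in USD for one month of usage."""
    input_price, output_price = PRICES[model]
    input_cost = input_price * Decimal(input_tokens) / MILLION
    output_cost = output_price * Decimal(output_tokens) / MILLION
    return input_cost, output_cost, input_cost + output_cost


def vs_best(total, best_total):
    """Percent premium over the cheapest model, or None if the cheapest is $0."""
    if best_total == 0:
        return None  # a percentage over a zero baseline is undefined
    return (total - best_total) / best_total * 100


# Assumed usage (hypothetical): 1M input and 500K output tokens per month.
usage = {"input_tokens": 1_000_000, "output_tokens": 500_000}
totals = {name: monthly_cost(name, **usage)[2] for name in PRICES}
best = min(totals.values())
for name, total in totals.items():
    print(name, f"${total:.3f}", vs_best(total, best))
```

`Decimal` is used instead of floats so that fractional-cent amounts are not distorted by binary rounding. When the cheapest option costs $0.00, a relative "vs Best" figure has no meaningful value, so the sketch returns `None` rather than a percentage.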
DiffusionGemma
DiffusionGemma is Google DeepMind’s experimental open-weights text-diffusion model based on Gemma 4 26B A4B. It uses discrete diffusion and parallel canvas denoising to trade some benchmark quality for much faster local generation on dedicated GPUs.
IBM
Granite 4.0 Micro
Granite 4.0 Micro is a large language model from IBM. It supports a context window of up to 131,000 tokens and is available from $0.02 per million input tokens.