LLM Comparison
DiffusionGemma vs Olmo 3.1 32B Instruct
Side-by-side specs, pricing & capabilities · Updated June 2026
| | DiffusionGemma | Olmo 3.1 32B Instruct |
|---|---|---|
| Organization | Google DeepMind | AllenAI |
| OpenTools Score | 38 | — |
| Family | Gemma | Olmo |
| Status | Current | Current |
| Release Date | Jun 2026 | Jan 2026 |
| Context Window | 256K tokens | 66K tokens |
| Input Price | Free | $0.20/M tokens |
| Output Price | Free | $0.60/M tokens |
| Pricing Notes | Open-weights model under Apache 2.0; API pricing depends on the host or local infrastructure used. | — |
| Capabilities | text, vision, code, reasoning, local-inference | text, code |
| Max Output | 256 tokens | — |
| API Identifier | google/diffusiongemma-26b-a4b-it | allenai/olmo-3.1-32b-instruct |
| Benchmarks | | |
| MMLU Pro | 77.6 (official Google model card) | — |
| GPQA Diamond | 73.2 (official Google model card) | — |
| LiveCodeBench v6 | 69.1 (official Google model card) | — |
| MMMLU | 81.5 (official Google model card) | — |
| HLE no tools | 11 (official Google model card) | — |
Cost Calculator
Enter your expected monthly token usage to compare costs.
| Model | Input | Output | Total / mo | vs Best |
|---|---|---|---|---|
| DiffusionGemma (cheapest) | $0.00 | $0.00 | $0.00 | — |
| Olmo 3.1 32B Instruct | $0.20 | $0.30 | $0.50 | +0% |
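
The arithmetic behind the calculator can be sketched in a few lines of Python. This is a minimal illustration, not the site's actual implementation: the function names are hypothetical, and the usage of 1M input tokens and 500K output tokens per month is an assumption, chosen because it reproduces the Olmo row above at the listed $0.20/M and $0.60/M prices. DiffusionGemma is priced at $0.00 here because it is listed as free; per the pricing notes, real hosting or hardware costs are not included. A percentage difference against a $0.00 baseline is undefined, so the sketch returns `None` in that case rather than a number.

```python
TOKENS_PER_MILLION = 1_000_000


def monthly_cost(input_tokens, output_tokens, input_price_per_m, output_price_per_m):
    """Return (input_cost, output_cost, total) in USD, rounded to cents.

    Prices are expressed in USD per million tokens, as in the table above.
    """
    input_cost = input_tokens / TOKENS_PER_MILLION * input_price_per_m
    output_cost = output_tokens / TOKENS_PER_MILLION * output_price_per_m
    total = input_cost + output_cost
    return round(input_cost, 2), round(output_cost, 2), round(total, 2)


def pct_vs_best(total, best_total):
    """Percentage by which `total` exceeds the cheapest total.

    Returns None when the cheapest total is zero, because the ratio
    is undefined in that case.
    """
    if best_total == 0:
        return None
    return (total - best_total) / best_total * 100


# Assumed monthly usage (not stated on the page): 1M input, 500K output tokens.
usage = {"input_tokens": 1_000_000, "output_tokens": 500_000}

olmo = monthly_cost(**usage, input_price_per_m=0.20, output_price_per_m=0.60)
gemma = monthly_cost(**usage, input_price_per_m=0.00, output_price_per_m=0.00)

print(olmo)                           # (0.2, 0.3, 0.5)
print(gemma)                          # (0.0, 0.0, 0.0)
print(pct_vs_best(olmo[2], gemma[2])) # None: baseline is $0.00
```

With a free baseline, an absolute difference (here $0.50 per month) is the more meaningful comparison than a percentage.
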
DiffusionGemma
DiffusionGemma is Google DeepMind’s experimental open-weights text-diffusion model based on Gemma 4 26B A4B. It uses discrete diffusion and parallel canvas denoising to trade some benchmark quality for much faster local generation on dedicated GPUs.
AllenAI
Olmo 3.1 32B Instruct
Olmo 3.1 32B Instruct is a large language model from AllenAI. It supports a context window of up to 65,536 tokens and is available from $0.20 per million input tokens.