LLM Comparison
DiffusionGemma vs MiMo-V2-Flash
Side-by-side specs, pricing & capabilities · Updated June 2026
| Spec | DiffusionGemma | MiMo-V2-Flash |
|---|---|---|
| Organization | Google DeepMind | Xiaomi |
| OpenTools Score | 38 | |
| Family | Gemma | MiMo |
| Status | Current | Current |
| Release Date | Jun 2026 | Dec 2025 |
| Context Window | 256K tokens | 262K tokens |
| Input Price | Free | $0.09/M tokens |
| Output Price | Free | $0.29/M tokens |
| Pricing Notes | Open-weights model under Apache 2.0; API pricing depends on the host or local infrastructure used. | Cache read: $0.0450/M tokens |
| Capabilities | text, vision, code, reasoning, local-inference | text, code |
| Max Output | 256 tokens | 66K tokens |
| API Identifier | google/diffusiongemma-26b-a4b-it | xiaomi/mimo-v2-flash |
| Benchmarks | | |
| MMLU Pro | 77.6 (official Google model card) | — |
| GPQA Diamond | 73.2 (official Google model card) | — |
| LiveCodeBench v6 | 69.1 (official Google model card) | — |
| MMMLU | 81.5 (official Google model card) | — |
| HLE no tools | 11 (official Google model card) | — |
Cost Calculator
Enter your expected monthly token usage to compare costs.
| Model | Input | Output | Total / mo | vs Best |
|---|---|---|---|---|
| DiffusionGemma (cheapest) | $0.00 | $0.00 | $0.00 | — |
| MiMo-V2-Flash | $0.09 | $0.15 | $0.24 | +$0.24 |
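The arithmetic behind this table can be sketched in a few lines of Python. This is a minimal illustration, not the site's calculator: the `Pricing` class and the `monthly_cost` and `compare` helpers are hypothetical names, and the usage figures (1M input tokens, 500K output tokens per month) are an assumption chosen because they roughly reproduce the MiMo-V2-Flash row above. The per-token prices come from the spec table. The $0.00 for DiffusionGemma covers token fees only; hosting or local hardware costs are not modelled.

```python
from dataclasses import dataclass

TOKENS_PER_MILLION = 1_000_000


@dataclass(frozen=True)
class Pricing:
    """Per-million-token list prices in USD (hypothetical helper type)."""
    name: str
    input_per_m: float
    output_per_m: float


# Prices taken from the spec table; DiffusionGemma is listed as free
# (open weights), which ignores whatever the host or hardware costs.
MODELS = [
    Pricing("DiffusionGemma", 0.0, 0.0),
    Pricing("MiMo-V2-Flash", 0.09, 0.29),
]


def monthly_cost(pricing, input_tokens, output_tokens):
    """Return (input_cost, output_cost, total) in USD for one month."""
    input_cost = pricing.input_per_m * input_tokens / TOKENS_PER_MILLION
    output_cost = pricing.output_per_m * output_tokens / TOKENS_PER_MILLION
    return input_cost, output_cost, input_cost + output_cost


def compare(input_tokens, output_tokens):
    """Return one (name, input_cost, output_cost, total) row per model."""
    return [
        (m.name, *monthly_cost(m, input_tokens, output_tokens))
        for m in MODELS
    ]


if __name__ == "__main__":
    # Assumed usage: 1M input and 500K output tokens per month.
    for name, inp, out, total in compare(1_000_000, 500_000):
        print(f"{name}: input ${inp:.3f}, output ${out:.3f}, total ${total:.3f}")
```

Under the assumed usage, MiMo-V2-Flash comes to about $0.235 per month before rounding, in line with the $0.24 shown in the table.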
DiffusionGemma
DiffusionGemma is Google DeepMind’s experimental open-weights text-diffusion model based on Gemma 4 26B A4B. It uses discrete diffusion and parallel canvas denoising to trade some benchmark quality for much faster local generation on dedicated GPUs.
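To give a rough intuition for what parallel denoising of a token canvas means, here is a toy Python sketch. It is not DiffusionGemma's algorithm: it shows a generic masked-denoising loop in which the canvas starts fully masked and several positions are committed per step, so a sequence of N tokens is filled in fewer than N steps. The `toy_denoiser` function is a hypothetical stand-in for a real model; it simply proposes the known target token with a random confidence score.

```python
import random

MASK = "_"


def toy_denoiser(canvas, target, rng):
    """Hypothetical stand-in for a model: for every masked slot, propose
    a token (here simply the target token) with a random confidence."""
    return {
        i: (target[i], rng.random())
        for i, tok in enumerate(canvas)
        if tok == MASK
    }


def parallel_denoise(target, steps, seed=0):
    """Fill a fully masked canvas in at most `steps` rounds, committing
    the most confident proposals for several positions per round."""
    rng = random.Random(seed)
    canvas = [MASK] * len(target)
    history = [list(canvas)]
    for step in range(steps):
        proposals = toy_denoiser(canvas, target, rng)
        if not proposals:
            break  # nothing left to unmask
        remaining_steps = steps - step
        # Ceiling division: unmask enough slots to finish on schedule.
        k = -(-len(proposals) // remaining_steps)
        most_confident = sorted(
            proposals, key=lambda i: proposals[i][1], reverse=True
        )[:k]
        for i in most_confident:
            canvas[i] = proposals[i][0]
        history.append(list(canvas))
    return canvas, history


if __name__ == "__main__":
    tokens = "the quick brown fox jumps over lazy dogs".split()
    final, history = parallel_denoise(tokens, steps=4)
    for snapshot in history:
        print(" ".join(snapshot))
```

In this toy run, eight tokens are produced in four rounds rather than eight sequential steps. A real model's quality and speed depend on how well it predicts many positions at once, which is the trade-off the description above refers to.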
Xiaomi
MiMo-V2-Flash
MiMo-V2-Flash is a large language model from Xiaomi. It supports a context window of up to 262,144 tokens and is available from $0.09/M input tokens.