LLM Comparison
DiffusionGemma vs Hermes 4 70B
Side-by-side specs, pricing & capabilities · Updated June 2026
Add to comparison
2/6 modelsSame tier:
| Organization | ||
| OpenTools Score | 38 | |
| Family | Gemma | Hermes |
| Status | Current | Current |
| Release Date | Jun 2026 | Aug 2025 |
| Context Window | 256K tokens | 131K tokens |
| Input Price | Free | $0.13/M tokens |
| Output Price | Free | $0.40/M tokens |
| Pricing Notes | Open-weights model under Apache 2.0; API pricing depends on the host or local infrastructure used. | — |
| Capabilities | textvisioncodereasoninglocal-inference | textcode |
| Max Output | 256 tokens | — |
| API Identifier | google/diffusiongemma-26b-a4b-it | nousresearch/hermes-4-70b |
| Benchmarks | ||
| MMLU Pro | 77.6official-google-model-card | — |
| GPQA Diamond | 73.2official-google-model-card | — |
| LiveCodeBench v6 | 69.1official-google-model-card | — |
| MMMLU | 81.5official-google-model-card | — |
| HLE no tools | 11official-google-model-card | — |
| View DiffusionGemma | View Hermes 4 70B | |
Cost Calculator
Enter your expected monthly token usage to compare costs.
| Model | Input | Output | Total / mo | vs Best |
|---|---|---|---|---|
| DiffusionGemmaCheapest | $0.00 | $0.00 | $0.00 | — |
| Hermes 4 70B | $0.13 | $0.20 | $0.33 | +0% |
DiffusionGemma
DiffusionGemma is Google DeepMind’s experimental open-weights text-diffusion model based on Gemma 4 26B A4B. It uses discrete diffusion and parallel canvas denoising to trade some benchmark quality for much faster local generation on dedicated GPUs.
Nous Research
Hermes 4 70B
Hermes 4 70B is a large language model from Nous Research. Supports up to 131,072 token context window. Available from $0.13/M input tokens.
More Comparisons
Looking for more AI models?
Browse All LLMs