LLM Comparison
DiffusionGemma vs ERNIE 4.5 21B A3B Thinking
Side-by-side specs, pricing & capabilities · Updated June 2026
Add to comparison
2/6 modelsSame tier:
| Organization | ||
| OpenTools Score | 38 | |
| Family | Gemma | ERNIE |
| Status | Current | Current |
| Release Date | Jun 2026 | Oct 2025 |
| Context Window | 256K tokens | 131K tokens |
| Input Price | Free | $0.07/M tokens |
| Output Price | Free | $0.28/M tokens |
| Pricing Notes | Open-weights model under Apache 2.0; API pricing depends on the host or local infrastructure used. | — |
| Capabilities | textvisioncodereasoninglocal-inference | textcodeextended-thinking |
| Max Output | 256 tokens | 66K tokens |
| API Identifier | google/diffusiongemma-26b-a4b-it | baidu/ernie-4.5-21b-a3b-thinking |
| Benchmarks | ||
| MMLU Pro | 77.6official-google-model-card | — |
| GPQA Diamond | 73.2official-google-model-card | — |
| LiveCodeBench v6 | 69.1official-google-model-card | — |
| MMMLU | 81.5official-google-model-card | — |
| HLE no tools | 11official-google-model-card | — |
| View DiffusionGemma | View ERNIE 4.5 21B A3B Thinking | |
Cost Calculator
Enter your expected monthly token usage to compare costs.
| Model | Input | Output | Total / mo | vs Best |
|---|---|---|---|---|
| DiffusionGemmaCheapest | $0.00 | $0.00 | $0.00 | — |
| ERNIE 4.5 21B A3B Thinking | $0.07 | $0.14 | $0.21 | +0% |
DiffusionGemma
DiffusionGemma is Google DeepMind’s experimental open-weights text-diffusion model based on Gemma 4 26B A4B. It uses discrete diffusion and parallel canvas denoising to trade some benchmark quality for much faster local generation on dedicated GPUs.
Baidu
ERNIE 4.5 21B A3B Thinking
ERNIE 4.5 21B A3B Thinking is a large language model from Baidu. Supports up to 131,072 token context window. Available from $0.07/M input tokens.
More Comparisons
Looking for more AI models?
Browse All LLMs