LLM Comparison
DiffusionGemma vs Step 3.5 Flash
Side-by-side specs, pricing & capabilities · Updated June 2026
| | DiffusionGemma | Step 3.5 Flash |
|---|---|---|
| Organization | Google DeepMind | StepFun |
| OpenTools Score | 38 | |
| Family | Gemma | Step |
| Status | Current | Current |
| Release Date | Jun 2026 | Jan 2026 |
| Context Window | 256K tokens | 262K tokens |
| Input Price | Free | $0.10/M tokens |
| Output Price | Free | $0.30/M tokens |
| Pricing Notes | Open-weights model under Apache 2.0; API pricing depends on the host or local infrastructure used. | — |
| Capabilities | text, vision, code, reasoning, local-inference | text, code |
| Max Output | 256 tokens | 66K tokens |
| API Identifier | google/diffusiongemma-26b-a4b-it | stepfun/step-3.5-flash |
| Benchmarks | | |
| MMLU Pro | 77.6 (official Google model card) | — |
| GPQA Diamond | 73.2 (official Google model card) | — |
| LiveCodeBench v6 | 69.1 (official Google model card) | — |
| MMMLU | 81.5 (official Google model card) | — |
| HLE no tools | 11 (official Google model card) | — |
Cost Calculator
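Enter your expected monthly token usage to compare costs. The figures below correspond to 1M input tokens and 0.5M output tokens per month, as implied by the listed per-million prices.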
| Model | Input | Output | Total / mo | vs Best |
|---|---|---|---|---|
| DiffusionGemma (cheapest) | $0.00 | $0.00 | $0.00 | — |
| Step 3.5 Flash | $0.10 | $0.15 | $0.25 | n/a (cheapest is $0.00) |
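
The calculator's arithmetic can be sketched in a few lines of Python. This is a minimal illustration rather than the site's actual implementation: the names `Pricing`, `monthly_cost`, `vs_best`, and `compare` are hypothetical, the prices are the ones listed in the spec table, and the usage of 1M input and 0.5M output tokens is inferred from the Step 3.5 Flash row.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Pricing:
    name: str
    input_per_m: float   # USD per million input tokens
    output_per_m: float  # USD per million output tokens


def monthly_cost(p, input_tokens, output_tokens):
    """Return (input cost, output cost, total) in USD for one month of usage."""
    input_cost = p.input_per_m * input_tokens / 1_000_000
    output_cost = p.output_per_m * output_tokens / 1_000_000
    return input_cost, output_cost, input_cost + output_cost


def vs_best(total, best):
    """Percent above the cheapest total; None when the cheapest is $0 (undefined)."""
    if best == 0:
        return None
    return (total - best) / best * 100


# Prices as listed in the spec table. "Free" is modeled as 0.0 and does not
# include any hosting or hardware cost for running the open weights.
MODELS = [
    Pricing("DiffusionGemma", 0.00, 0.00),
    Pricing("Step 3.5 Flash", 0.10, 0.30),
]


def compare(input_tokens, output_tokens):
    """Build one row per model: (name, input cost, output cost, total, vs best)."""
    rows = [(m.name, *monthly_cost(m, input_tokens, output_tokens)) for m in MODELS]
    best = min(row[3] for row in rows)
    return [(name, i, o, t, vs_best(t, best)) for name, i, o, t in rows]


# Assumed usage: 1M input and 0.5M output tokens per month.
for name, i, o, t, delta in compare(1_000_000, 500_000):
    delta_text = "n/a" if delta is None else f"+{delta:.0f}%"
    print(f"{name}: ${i:.2f} + ${o:.2f} = ${t:.2f} ({delta_text})")
```

Because the cheapest option is listed at $0.00, a percentage difference against it is undefined, so the sketch returns `None` in that case instead of a number. As the pricing notes say, the real cost of DiffusionGemma depends on the host or local infrastructure used.
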
DiffusionGemma
DiffusionGemma is Google DeepMind’s experimental open-weights text-diffusion model based on Gemma 4 26B A4B. It uses discrete diffusion and parallel canvas denoising to trade some benchmark quality for much faster local generation on dedicated GPUs.
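
To give a feel for what parallel denoising means, here is a toy sketch of a generic confidence-based unmasking loop of the kind used by masked discrete diffusion models in general. It is not DiffusionGemma's sampler, which this page does not describe: `toy_denoiser` is a hypothetical stand-in that already knows the target text and invents its confidence scores, whereas a real model would predict both with a neural network.

```python
MASK = "_"


def toy_denoiser(canvas, target):
    """Hypothetical stand-in for a model: propose a token and a confidence
    for every masked slot on the canvas."""
    proposals = {}
    for i, tok in enumerate(canvas):
        if tok == MASK:
            # Made-up confidence: slots next to already filled tokens score higher.
            neighbours = sum(
                1
                for j in (i - 1, i + 1)
                if 0 <= j < len(canvas) and canvas[j] != MASK
            )
            proposals[i] = (target[i], 0.5 + 0.25 * neighbours)
    return proposals


def denoise(target, tokens_per_step):
    """Fill a fully masked canvas, committing several tokens per step."""
    canvas = [MASK] * len(target)
    steps = 0
    while MASK in canvas:
        proposals = toy_denoiser(canvas, target)
        # Rank masked slots by confidence (ties broken by position) and
        # commit the top ones in the same step.
        ranked = sorted(proposals, key=lambda i: (-proposals[i][1], i))
        for i in ranked[:tokens_per_step]:
            canvas[i] = proposals[i][0]
        steps += 1
    return canvas, steps


sentence = "the quick brown fox jumps over the lazy dog".split()
for k in (1, 3):
    _, steps = denoise(sentence, tokens_per_step=k)
    print(f"{k} token(s) per step: {steps} steps")
```

Committing one token per step needs as many steps as there are tokens, much like one-token-at-a-time decoding, while committing several per step cuts the number of model calls. In a real model, filling more slots per step generally leaves less context for each prediction, which is one plausible source of the quality-for-speed trade-off described above.
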
StepFun
Step 3.5 Flash
Step 3.5 Flash is a large language model from StepFun. It supports a context window of up to 262,144 tokens, and pricing starts at $0.10 per million input tokens.