LLM Comparison

DiffusionGemma vs Granite 4.0 Micro

Side-by-side specs, pricing & capabilities · Updated July 2026

	DiffusionGemma	Granite 4.0 Micro
Organization	Google	IBM
OpenTools Score	38
Family	Gemma	Granite
Status	Current	Current
Release Date	Jun 2026	Oct 2025
Context Window	256K tokens	131K tokens
Input Price	Free	$0.02/M tokens
Output Price	Free	$0.11/M tokens
Pricing Notes	Open-weights model under Apache 2.0; API pricing depends on the host or local infrastructure used.	—
Capabilities	text, vision, code, reasoning, local-inference	text, code
Max Output	256 tokens	—
API Identifier	`google/diffusiongemma-26b-a4b-it`	`ibm-granite/granite-4.0-h-micro`
Benchmarks
MMLU Pro	77.6 (source: official Google model card)	—
GPQA Diamond	73.2 (source: official Google model card)	—
LiveCodeBench v6	69.1 (source: official Google model card)	—
MMMLU	81.5 (source: official Google model card)	—
HLE no tools	11 (source: official Google model card)	—

Cost Calculator

Enter your expected monthly token usage to compare costs.

Model	Input	Output	Total / mo	vs Best
DiffusionGemma (cheapest)	$0.00	$0.00	$0.00	—
Granite 4.0 Micro	$0.02	$0.06	$0.07	+0%
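
The arithmetic behind a table like this can be sketched in a few lines of Python. This is a minimal illustration, not the site's actual calculator: the names `Pricing`, `monthly_cost`, and `pct_over_best` are hypothetical, and the token volumes in the usage section are made-up inputs, not the page's defaults. Prices are the per-million-token figures listed above. DiffusionGemma's $0.00 reflects only its "Free" list price and excludes any hosting or local infrastructure cost.

```python
from dataclasses import dataclass
from typing import Optional


@dataclass(frozen=True)
class Pricing:
    """Listed prices in USD per one million tokens (hypothetical helper type)."""
    name: str
    input_per_million: float
    output_per_million: float


def monthly_cost(p: Pricing, input_tokens: int, output_tokens: int) -> float:
    """Monthly cost in USD: tokens / 1M * price per 1M, summed over input and output."""
    input_cost = input_tokens / 1_000_000 * p.input_per_million
    output_cost = output_tokens / 1_000_000 * p.output_per_million
    return input_cost + output_cost


def pct_over_best(cost: float, best: float) -> Optional[float]:
    """Percentage above the cheapest option, or None when the cheapest is free
    (a relative difference against $0 is undefined)."""
    if best == 0:
        return None
    return (cost - best) / best * 100


# Prices taken from the comparison table above.
MODELS = [
    Pricing("DiffusionGemma", 0.0, 0.0),
    Pricing("Granite 4.0 Micro", 0.02, 0.11),
]

if __name__ == "__main__":
    # Assumed usage for illustration only: 1M input and 500K output tokens per month.
    input_tokens, output_tokens = 1_000_000, 500_000
    costs = {m.name: monthly_cost(m, input_tokens, output_tokens) for m in MODELS}
    best = min(costs.values())
    for name, cost in costs.items():
        delta = pct_over_best(cost, best)
        delta_text = "n/a" if delta is None else f"+{delta:.0f}%"
        print(f"{name}: ${cost:.3f} / mo ({delta_text} vs best)")
```

Because the cheapest option here costs $0.00, the sketch reports the relative difference as not applicable rather than as a percentage.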

Google

DiffusionGemma

DiffusionGemma is Google DeepMind’s experimental open-weights text-diffusion model based on Gemma 4 26B A4B. It uses discrete diffusion and parallel canvas denoising to trade some benchmark quality for much faster local generation on dedicated GPUs.

IBM

Granite 4.0 Micro

Granite 4.0 Micro is a large language model from IBM. It supports a context window of up to 131,000 tokens and is priced from $0.02 per million input tokens and $0.11 per million output tokens.