LLM Comparison

MiniCPM vs Nemotron 3 Super

Side-by-side specs, pricing & capabilities · Updated June 2026

	MiniCPM	Nemotron 3 Super
Organization	OpenBMB	NVIDIA
OpenTools Score	21	78.1
Family	MiniCPM	Nemotron
Status	Current	Current
Release Date	—	Mar 2026
Context Window	128K tokens	262K tokens
Input Price	Free	$0.09/M tokens
Output Price	Free	$0.45/M tokens
Pricing Notes	Open-weight GitHub and Hugging Face model family. There is no fixed vendor API price; runtime cost depends on the host, hardware, or inference provider.	—
Capabilities	text, code, reasoning, local-inference	text, code
Training Cutoff	Not publicly specified	—
Max Output	33K tokens	—
API Identifier	`OpenBMB/MiniCPM`	`nvidia/nemotron-3-super-120b-a12b`
Benchmarks
MiniCPM-SALA standard benchmark average	76.53 (source: official GitHub README)	—
MiniCPM-SALA long-context average	38.97 (source: official GitHub README)	—
MiniCPM-SALA 2048K extrapolation score	81.6 (source: official GitHub README)	—
MiniCPM4.1 reasoning decoding speedup	3x (source: official GitHub README)	—
MiniCPM4 Jetson AGX Orin decoding speedup vs Qwen3-8B	7x (source: official GitHub README)	—

Cost Calculator

Estimated monthly cost for each model at the listed per-token prices. The figures below are consistent with a usage of about 1M input tokens and 500K output tokens per month.

Model	Input	Output	Total / mo	vs Best
MiniCPM (cheapest)	$0.00	$0.00	$0.00	—
Nemotron 3 Super	$0.09	$0.23	$0.32	+$0.32
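
The arithmetic behind this table can be sketched as follows, assuming cost scales linearly with token count at the per-million prices listed above. The names `Pricing` and `monthly_cost` are illustrative, not part of any vendor API, and the usage of 1M input and 500K output tokens is an assumption inferred from the Nemotron 3 Super row. MiniCPM is priced at zero here only because there is no fixed vendor API price; self-hosting still incurs hardware or provider costs that this sketch ignores.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Pricing:
    """Per-million-token prices in USD (illustrative container, not a vendor type)."""
    name: str
    input_per_million: float
    output_per_million: float


def monthly_cost(pricing: Pricing, input_tokens: int, output_tokens: int):
    """Return (input_cost, output_cost, total) in USD for one month of usage."""
    input_cost = pricing.input_per_million * input_tokens / 1_000_000
    output_cost = pricing.output_per_million * output_tokens / 1_000_000
    return input_cost, output_cost, input_cost + output_cost


# Prices copied from the comparison table above.
MODELS = [
    Pricing("MiniCPM", 0.0, 0.0),            # open-weight; no fixed API price
    Pricing("Nemotron 3 Super", 0.09, 0.45),
]

if __name__ == "__main__":
    # Assumed usage: 1M input tokens and 500K output tokens per month.
    input_tokens, output_tokens = 1_000_000, 500_000
    totals = {}
    for model in MODELS:
        i, o, t = monthly_cost(model, input_tokens, output_tokens)
        totals[model.name] = t
        print(f"{model.name}: input ${i:.3f}, output ${o:.3f}, total ${t:.3f}")

    # A percentage "vs best" is undefined when the cheapest total is $0.00,
    # so report the absolute difference instead.
    best = min(totals.values())
    for name, total in totals.items():
        print(f"{name}: +${total - best:.3f} vs cheapest")
```

With these inputs the Nemotron 3 Super total comes to roughly $0.315 per month, which agrees with the table once the values are rounded to cents.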

OpenBMB

MiniCPM

MiniCPM is OpenBMB’s ultra-efficient open language-model family for edge and end-device deployment. The MiniCPM4 and MiniCPM4.1 lines focus on fast local reasoning, while MiniCPM-SALA extends the family toward sparse/linear attention and million-token context research.

NVIDIA

Nemotron 3 Super

Nemotron 3 Super is a large language model from NVIDIA. It supports a context window of up to 262,144 tokens, and pricing starts at $0.09 per million input tokens.