LLM Comparison
GPT-5.5 vs Claude Sonnet 4.6
Side-by-side specs, pricing & capabilities · Updated April 2026
Price vs Intelligence
Add to comparison
2/6 modelsSame tier:
| Organization | ||
| OpenTools Score | 90 5.1 | 52 5.7 |
| Family | GPT | Claude |
| Status | Current | Current |
| Release Date | Apr 2026 | Feb 2026 |
| Context Window | 1.1M tokens | 1.0M tokens |
| Input Price | $5.00/M tokens | $3.00/M tokens |
| Output Price | $30.00/M tokens | $15.00/M tokens |
| Pricing Notes | Cached input: $0.50/M tokens. Long context (>272K tokens): 2x input, 1.5x output. Batch API: 50% discount. Priority: 2.5x standard. | Cache read: $0.3000/M tokens |
| Capabilities | textvisioncodetool-useextended-thinkingcomputer-useweb-search | textvisioncodetool-use |
| Training Cutoff | December 2025 | — |
| Max Output | 128K tokens | 128K tokens |
| API Identifier | openai/gpt-5.5 | anthropic/claude-sonnet-4.6 |
| Benchmarks | ||
| MMLU | 92.4openai | 86.7anthropic |
| GPQA Diamond | 93.6openai | 89.9anthropic |
| ARC-AGI-2 | 85openai | — |
| Terminal-Bench 2.0 | 82.7openai | 51anthropic |
| SWE-bench Pro | 58.6openai | — |
| OSWorld-Verified | 78.7openai | — |
| BrowseComp | 84.4openai | — |
| MMMU-Pro | 81.2openai | — |
| FrontierMath Tier 4 | 35.4openai | — |
| HLE (with tools) | 52.2openai | — |
| GDPval | 84.9openai | — |
| Toolathlon | 55.6openai | — |
| CyberGym | 81.8openai | — |
| MRCR v2 512K-1M | 74openai | — |
| SWE-bench Verified | — | 72.5anthropic |
| MATH 500 | — | 54.2anthropic |
| LiveCodeBench | — | 41.5anthropic |
| Berkeley Function Calling | — | 88.3anthropic |
| HLE | — | 30openrouter |
| GPQA-AA Elo | — | 1674artificial-analysis |
| View GPT-5.5 | View Claude Sonnet 4.6 | |
Cost Calculator
Enter your expected monthly token usage to compare costs.
| Model | Input | Output | Total / mo | vs Best |
|---|---|---|---|---|
| Claude Sonnet 4.6Cheapest | $3.00 | $7.50 | $10.50 | — |
| GPT-5.5 | $5.00 | $15.00 | $20.00 | +90% |
OpenAI
GPT-5.5
GPT-5.5 is OpenAI's smartest and most intuitive model, built for agentic work like coding, research, and data analysis. It matches GPT-5.4 per-token latency while delivering higher intelligence with significantly fewer tokens. Supports a 1,050,000 token context window and five reasoning effort levels (none through xhigh).
Anthropic
Claude Sonnet 4.6
Claude Sonnet 4.6 is a multimodal llm from Anthropic. Supports up to 1,000,000 token context window. Achieves 88.0% on MMLU. Available from $3.00/M input tokens.
More Comparisons
Looking for more AI models?
Browse All LLMs