Anthropic's Claude Opus 4.5 Achieves Record 80.9% Coding Accuracy, Beats OpenAI and Google While Slashing Prices 67%
Summary
Anthropic's new Claude Opus 4.5 shatters coding performance records with 80.9% accuracy on industry benchmarks, outperforming OpenAI's GPT-5.1 and Google's Gemini 3 while dramatically cutting costs by 67% and introducing revolutionary 'effort' controls for developers.
Key Points
- Anthropic launches Claude Opus 4.5, achieving the highest SWE-Bench Verified accuracy score of 80.9% and surpassing OpenAI's GPT-5.1-Codex-Max and Google's Gemini 3 in coding performance
- The new model features significantly reduced pricing at $5 per million input tokens and $25 per million output tokens, down from $15/$75, and includes an 'effort' parameter allowing developers to control computational resources
- Opus 4.5 becomes Anthropic's best model for computer use cases with Chrome extension access for Claude Max subscribers, while new Claude Developer Platform updates include upgraded plan mode and desktop app support