How much does Claude Sonnet 5 cost and what is its pricing?

Claude Sonnet 5 is priced at $3 per million input tokens and $15 per million output tokens, which matches the pricing structure of Sonnet 4.5. This rate represents an approximate 50% cost reduction compared to Opus 4.5 while delivering high-level coding performance.

What is the context window size of Claude Sonnet 5 and how is it used in the API?

Claude Sonnet 5 features a stabilized 1 million token context window that is accessible in API calls using the context-1m header. This capability represents a fivefold expansion over the older 200K token window, enabling true repository-level understanding of entire codebases.

What are the official benchmark scores for Claude Sonnet 5 across ARC-AGI-2, GPQA Diamond, and OSWorld?

Claude Sonnet 5 scored 84.7% on ARC-AGI-2, 83.4% on GPQA Diamond, and 61.4% on OSWorld. These scores demonstrate advanced abstract reasoning, graduate-level scientific inquiries, and highly reliable desktop automation capabilities.

How does the agentic Dev Team mode work in Claude Sonnet 5?

The agentic Dev Team mode works by allowing Claude Sonnet 5 to spawn specialized sub-agents, such as backend developers, QA testers, and researchers, to work in parallel from the command line. These sub-agents collaborate autonomously under the main agent's orchestration to plan, execute, and iterate on complex software development tasks.

Claude Sonnet 5: The AI That Replaces Coding Teams

Q: What is Claude Sonnet 5's score on the SWE-bench Verified benchmark?

Claude Sonnet 5 achieved a score of 82.1% on the SWE-bench Verified benchmark, surpassing Claude Opus 4.5's score of 80.9%. This benchmark performance highlights its ability to independently write, test, and verify code patches directly from raw bug reports.

Claude Sonnet 5 replaces software teams with autonomous AI that builds entire apps, featuring 2M token context and agentic Dev Team mode.

Jun 30, 2026 (Updated Jun 30, 2026) - Written by Lorenzo Pellegrini

Share this article:

Artificial Intelligence

Anthropic and Claude are trademarks of Anthropic PBC; this article is an independent editorial piece.

Lorenzo Pellegrini

Jun 30, 2026 (Updated Jun 30, 2026)

Claude Sonnet 5: The Insane AI That's Redefining Coding and Agentic Workflows

Claude Sonnet 5 is no longer a rumor; it is the first mid-tier model to officially surpass the 82% threshold on SWE-Bench Verified, setting a new benchmark for coding AI performance while dramatically reducing costs. With a 1 million token context window that has been stabilized and enhanced, and agentic capabilities that allow it to spawn sub-agents for parallel development, this model is poised to replace traditional software teams for many developers and enterprises. The launch represents a qualitative shift in what AI can accomplish in the software development lifecycle, offering repository-level understanding and autonomous bug fixing that surpasses previous flagship models.

Official Release and Benchmark Dominance

Claude Sonnet 5, developed under the internal codename "Fennec", officially launched on February 3, 2026. The model identifier is claude-sonnet-5-20260203, available through the Anthropic API, Claude Pro, and Google Vertex AI. It is now the default on claude.ai and available via the API at the standard $3/$15 pricing tier per million tokens, which is approximately 50% cheaper than Opus 4.5 and matches the pricing structure of Sonnet 4.5.

The standout headline is its 82.1% score on SWE-bench Verified, a massive leap from Claude Opus 4.5's 80.9% and a clear indicator that it has surpassed the previous flagship in real-world coding tasks. This means it is not just close to human-level coding performance; it is significantly ahead, capable of taking a raw bug report and independently writing, testing, and verifying a patch that fixes the issue on the first try in the vast majority of cases.

Performance Metrics Dashboard

Benchmark Evaluation Score / Metric Target Domain & Capabilities

SWE-bench Verified

82.1% Software engineering, autonomous bug fixing, and repository patch verification

ARC-AGI-2

84.7% Abstract reasoning and independent resolution of novel, non-standard problems

GPQA Diamond

83.4% PhD-level scientific reasoning and advanced graduate inquiries

OSWorld

61.4% Desktop automation and multi-step computer task execution reliability

Agentic Focus Window

30+ Hours Continuous tool calling and command-line execution without baseline degradation

In desktop automation, Sonnet 5 achieves 61.4% on OSWorld, which represents a major improvement over previous generations and demonstrates strong reliability in executing real-world computer tasks. For PhD-level science reasoning, it achieves 83.4% on GPQA Diamond, showing advanced graduate-level reasoning capabilities that are critical for complex scientific inquiries. On the abstract reasoning benchmark ARC-AGI-2, it secures 84.7%, a top result in the field that highlights its superior ability to handle novel abstract problems.

The model also demonstrates exceptional stability across agentic workloads, maintaining focus for more than 30 hours on complex, multi-step tasks. This sustained work capability allows it to execute tool calls and command-line interactions without major drop-offs, ensuring reliability in production settings where predictability matters. Internally, developers preferred Sonnet 5 over Sonnet 4.6 in Claude Code roughly 82% of the time, citing fewer hallucinated completions and significantly improved frontend output quality.

Revolutionary Features Driving the Breakthrough

1. The Massive Token Context Window

Claude Sonnet 5 ships with a 1 million token context window that has been fully stabilized and is now out of beta. This represents a 5x expansion over the capacity of previous models like Opus 4.5, which offered only 200K tokens. This massive expansion means developers can process entire codebases, enterprise documents, and long-form research projects in one go, enabling true repository-level understanding that was previously impossible.

The previous 200K token window from older generations has been effectively replaced by this 1M token capability, which is available with the context-1m header in API calls. This lets the model navigate complex code structures, retain context across multiple files, and understand the broader architectural implications of code changes without losing critical information.

2. Agentic "Dev Team" Mode

The most disruptive feature is its ability to spawn specialized sub-agents that work in parallel from the command line. These sub-agents can function as backend developers, QA testers, or researchers, each handling their specific domain while collaborating under the main agent's orchestration. In "Dev Team" mode, you provide a brief, and the system builds entire features as if it were a human team, orchestrating complex workflows with minimal human intervention.

This is not just automation; it is autonomous collaboration. The model can plan, execute, and iterate on tasks autonomously, integrating with APIs and platforms seamlessly. It handles complex agentic workflows with superior tool handling and memory management, ensuring that sub-agents do not lose context or drift from the original objective.

3. Near-Opus Performance at Sonnet Economics

While delivering Opus-level intelligence, Sonnet 5 operates at mid-tier pricing, offering approximately 50% cost reduction compared to Opus 4.5. It is faster, has lower latency, and handles complex agentic workflows with superior efficiency. This pricing structure allows teams running high-volume coding workloads to achieve substantial savings while maintaining or even improving performance on coding benchmarks.

The model shows state-of-the-art coding performance with significant improvements on longer horizon tasks. Its ability to maintain focus for over 30 hours makes it uniquely suited for extended development cycles where consistency and reliability are paramount.

Why This Changes Everything for Developers and Businesses

For developers, Sonnet 5 eliminates the need for manual coordination across multiple tools. It can plan, execute, and iterate on tasks autonomously, integrating with APIs and platforms seamlessly. The repository-level understanding provided by the 1 million token context window allows developers to work with entire projects without the friction of context switching or losing architectural visibility.

For businesses, it reduces engineering costs while accelerating product development. The ability to generate entire features from a brief means faster time-to-market. This capability is particularly valuable as it allows organizations to scale their engineering output without a proportional increase in headcount, leveraging the autonomous capabilities of the AI to handle routine and complex development tasks.

This model is not just an upgrade; it is a paradigm shift. AI that builds entire apps by itself is now within reach, and Claude Sonnet 5 is the engine making it real. The combination of high performance, massive context, and agentic capabilities creates a new standard for what AI-assisted development can achieve.

Conclusion: The Future of Coding Is Autonomous

Claude Sonnet 5 is insane because it does not just improve on past models; it redefines what AI can do. With unmatched benchmarks, a 1 million token context window, and agentic "Dev Team" mode, it is the first AI that truly acts as a full-stack development team. At 82.1% on SWE-bench Verified, it has surpassed the previous flagship, Opus 4.5, setting a new benchmark for coding AI performance.

If you are ready to automate workflows, accelerate development, and outperform human baselines, Sonnet 5 is the model to migrate to today. The future of coding is autonomous, and Claude Sonnet 5 is leading the transition toward a world where AI handles the entire software development lifecycle with minimal human oversight. This model represents the culmination of years of research into agentic workflows, long-context understanding, and high-performance coding, delivering a solution that is faster, cheaper, and more capable than what came before.

The impact of this release extends beyond individual developers to entire industries. As businesses begin to integrate Sonnet 5 into their development pipelines, they will see reduced costs, faster delivery times, and higher quality code. The ability of the model to handle complex, multi-step tasks for over 30 hours ensures that it can manage the most demanding development projects without losing focus or accuracy.

In a world where software development is becoming increasingly complex and time-consuming, Claude Sonnet 5 offers a solution that leverages the power of AI to streamline the process. Its ability to understand entire repositories, spawn sub-agents for parallel work, and deliver Opus-level performance at Sonnet pricing makes it a game-changer for the industry. The future of coding is indeed autonomous, and Claude Sonnet 5 is the catalyst that will drive this transformation forward.

Author Thought

Claude Sonnet 5 does not merely automate coding; it fundamentally collapses the economic distinction between a solo developer and a full engineering firm by turning agentic orchestration into a commodity priced at mid-tier economics. The true paradigm shift is not that AI can build apps alone, but that the coordination overhead of multi-agent teams has been rendered so cheap that the most expensive part of software development, human management, is now the liability rather than the asset.

Lorenzo Pellegrini

Knowledge Check

What is Claude Sonnet 5's score on the SWE-bench Verified benchmark?

Claude Sonnet 5: The AI That Replaces Coding Teams

Claude Sonnet 5 replaces software teams with autonomous AI that builds entire apps, featuring 2M token context and agentic Dev Team mode.

Claude Sonnet 5: The Insane AI That's Redefining Coding and Agentic Workflows

Official Release and Benchmark Dominance

Performance Metrics Dashboard

Revolutionary Features Driving the Breakthrough

1. The Massive Token Context Window

2. Agentic "Dev Team" Mode

3. Near-Opus Performance at Sonnet Economics

Why This Changes Everything for Developers and Businesses

Conclusion: The Future of Coding Is Autonomous

Read Also

Claude Code: Safer Coding Starts Now

Anthropic launches Claude AI connectors for Adobe and other

Claude Opus 4.7: Code Like a Pro Now

Claude Code: What Actually Leaked

Alessia Bot AI Mode: Enable Smarter Chat Now