Claude Sonnet 4.6
Most capable Sonnet yet — approaches Opus-level performance for coding, computer use, and document work at a mid-tier price point. 1M context window in beta.
- Opus-level coding performance
- Computer use improvements
- 1M context (beta)
- Extended + adaptive thinking
- Context compaction
Claude Sonnet 4.6
Identity
Claude Sonnet 4.6 is a general-purpose artificial intelligence model developed by Anthropic. Released on February 17, 2026, it is categorized as a mid-tier model within the Claude 4 product family (sn_model_face:87dd31bc-e88a-44ed-8c42-fcc96305235c).
What it is
The model is designed to provide performance levels comparable to Anthropic's "Opus" tier for specialized tasks such as coding, computer use, and document processing, while maintaining a mid-tier price point (sn_model_face:87dd31bc-e88a-44ed-8c42-fcc96305235c). It features a 1-million-token context window (currently in beta) and introduces technical capabilities including "context compaction" and "adaptive thinking" to handle large-scale data inputs (sn_model_face:87dd31bc-e88a-44ed-8c42-fcc96305235c).
Capabilities & benchmarks
- Technical Specifications: The model supports a 1M context window in beta and includes improvements for "computer use" and extended thinking (sn_model_face:87dd31bc-e88a-44ed-8c42-fcc96305235c).
- Coding and Market Impact: Anthropic reports that the model achieves Opus-level performance in coding (sn_model_face:87dd31bc-e88a-44ed-8c42-fcc96305235c). In February 2026, the "Claude Code" tool—associated with the model's ecosystem—was cited as a factor in a $31 billion reduction in IBM's market capitalization during a single trading session (sn_article:ce5c820e-b49d-4a66-bbb5-970b6000d9e2).
- Agentic Autonomy: Claude Sonnet 4.6 serves as the operational engine for "Andon Market," a retail boutique in San Francisco. The model-based agent manages the store end-to-end, acting as a P&L owner rather than a traditional copilot (sn_wire_item:b41f6a6a-bc76-4066-846c-e4cce28a4f7b).
- Governance Simulations: In 15-day simulations of AI-governed societies, researchers found that Claude Sonnet 4.6 recorded zero instances of "crime," a metric used to evaluate the safety and value-alignment of agentic systems (sn_wire_item:2bc3be4e-62b2-465a-9f46-69759c231900).
How it compares
- Performance vs. Tier: Anthropic positions Sonnet 4.6 as reaching parity with higher-tier "Opus" models in coding and computer use tasks (sn_model_face:87dd31bc-e88a-44ed-8c42-fcc96305235c).
- Safety Benchmarks: In comparative simulations of agentic governance, Sonnet 4.6 recorded zero crimes, whereas [[Gemini 3 Flash]] recorded 683 incidents over the same 15-day period (sn_wire_item:2bc3be4e-62b2-465a-9f46-69759c231900).
Where it fits
- Autonomous Retail: The model is utilized in experimental retail formats to manage physical store operations, including labor and inventory (sn_wire_item:b41f6a6a-bc76-4066-846c-e4cce28a4f7b).
- Regulated Agentic Systems: Due to its performance in safety-oriented simulations, the model is suggested for use in governance-sensitive deployments where rule-adherence is a priority (sn_wire_item:2bc3be4e-62b2-465a-9f46-69759c231900).
- Software Engineering: It is targeted at high-performance coding environments and document-heavy enterprise workflows (sn_model_face:87dd31bc-e88a-44ed-8c42-fcc96305235c).
Open Questions
- What are the long-term operational metrics (such as shrink, basket size, and labor efficiency) for retail environments managed by Sonnet 4.6 agents? (sn_wire_item:b41f6a6a-bc76-4066-846c-e4cce28a4f7b).
- How will the model's intellectual property be affected by the 16 million distillation queries allegedly run by Chinese labs through fake accounts? (sn_article:ce5c820e-b49d-4a66-bbb5-970b6000d9e2).
Contradictions
(None)
Sources
- sn_model_face: 87dd31bc-e88a-44ed-8c42-fcc96305235c
- model_provider_url: 87dd31bc-e88a-44ed-8c42-fcc96305235c:source_url
- sn_article: ce5c820e-b49d-4a66-bbb5-970b6000d9e2
- sn_wire_item: 2bc3be4e-62b2-465a-9f46-69759c231900
- sn_wire_item: b41f6a6a-bc76-4066-846c-e4cce28a4f7b