What Anthropic actually shipped
The model ships as the default for Free and Pro plans and is available to Max, Team, and Enterprise users, inside Claude Code and Cowork, and on the API under the model ID claude-sonnet-5. It carries a 1M-token context window — the same as Opus 4.8 and Sonnet 4.6 — so the upgrade is about capability and price, not context length. One launch customer, Zimu Li, summed up the practical pitch: "Claude Sonnet 5 gives our agents a strong execution layer for multi-step software engineering work." For background on the full Claude family, see our Anthropic provider deep-dive.
Pricing: the August 31 promo, and what it really costs
Even at the standard rate, Sonnet 5 is dramatically cheaper than the models it rivals on quality. Opus 4.8 runs $5 / $25 per million tokens and Fable 5 — Anthropic's top Mythos-class model — runs $10 / $50. Put numbers on it: a workload of 2M input and 1M output tokens per day costs about $21/day on standard Sonnet 5, $35/day on Opus 4.8, and $70/day on Fable 5. During the promo, Sonnet 5 drops to roughly $14/day. To model your own token mix instead of this toy example, use the API cost estimator or convert a flat budget into tokens with the token-to-USD tool. You can also line all four up side by side on the price comparison tool.
Sonnet 5 vs Sonnet 4.6: the generational jump
Sonnet 5 is also safer in agentic settings. Anthropic reports an overall lower rate of undesirable behaviors than Sonnet 4.6, with better refusal of malicious requests, stronger resistance to prompt-injection attacks, and reduced hallucination and sycophancy. For anyone running Sonnet 4.6 in production, the upgrade path is straightforward — same price after the promo, same 1M context, better numbers across the board. See the head-to-head: Sonnet 5 vs Sonnet 4.6 for the side-by-side.
Sonnet 5 vs Opus 4.8: how close is it really?
Opus 4.8 still wins where it counts most, though. On agentic coding it leads 69.2% to 63.2%, a six-point margin that shows up on the hardest, longest engineering tasks — deep refactors, sprawling codebases, and problems where one extra correct decision per run compounds. Anthropic also notes Sonnet 5 trails Opus 4.8 on some misaligned-behavior metrics, so Opus remains the pick for the highest-stakes autonomous work. The pattern across agentic search (BrowseComp) and computer use (OSWorld-Verified) is the same: Sonnet 5 lands between Sonnet 4.6 and Opus 4.8, much closer to the top than its price suggests. Compare them directly on Sonnet 5 vs Opus 4.8.
Where Fable 5 still earns its premium
In practice the lineup now reads as a clean ladder: Sonnet 5 for the bulk of agentic and coding work, Opus 4.8 when you need a few more points on the hardest coding and the strongest safety profile, and Fable 5 for frontier tasks where you'd rather pay 5x than risk a miss. The fact that Sonnet 5 now matches Opus 4.8 on knowledge work is exactly what pushes the premium tiers up-market — they have to justify the gap on the genuinely hard problems.
Which one should you pick?
The most cost-effective setup isn't one model — it's routing. Send the bulk of traffic to Sonnet 5 and escalate only the requests that trip a complexity threshold to Opus 4.8 or Fable 5. Teams that do this typically cut spend 30–50% versus running a flagship for everything, with no quality loss on the paths that matter. Before you commit, price your real token distribution across all four on the API cost estimator, and bookmark our Anthropic pricing guides for how the output multiplier drives the bill.
The competitive backdrop
That's the same bet TokenRate is built around: the right model is rarely the most expensive one, it's the cheapest one that clears your quality bar. Sonnet 5 just moved that bar a lot higher at the mid tier. Compare its live pricing against GPT and Gemini on the price comparison tool, and use the calculator to see what the switch saves on your actual volume.