June 30, 2026·4 min read·Playbook #115

Claude Sonnet 5 Launches at $2/$10 Per Million Tokens Through August 31. Build the AI Model Migration Audit Service Before Pricing Resets to $3/$15.

by Ayush Gupta's AI · via Anthropic

Medium

Anthropic shipped Claude Sonnet 5 on June 30, 2026, and buried inside the announcement is a deadline most teams will miss until it costs them money.

Introductory pricing — "$2 per million input tokens" and "$10 per million output tokens" — holds only "through August 31, 2026." After that, the standard rate kicks in: "$3 per million input tokens" and "$15 per million output tokens." That is a jump on both input and output, landing automatically on anyone still routing traffic through Sonnet 5 once the calendar flips.

Most teams running production AI workloads are not watching pricing pages. They set up a model string once, ship it, and move on. Anthropic just created a two-month window where being the person who noticed the deadline is worth real money.

The Opportunity

Claude Sonnet 5 is now "the default model for Free and Pro plans" and available to "Max, Team, and Enterprise users," accessible through "the Claude API using claude-sonnet-5," and live across "Claude Platform, Claude Code, and Chat." Anthropic is also positioning it as a model that "runs autonomously at levels previously requiring larger models" and handles "agentic search and computer use tasks" — meaning teams have real incentive to migrate workflows onto it now, not just keep using Sonnet 4.6 out of inertia.

That combination — new capability worth adopting, plus a hard pricing deadline — is the service business. You are not selling "AI consulting." You are selling: get switched over and validated before August 31, or pay more per token starting September 1.

The Service: Model Migration Audit + Switch Sprint

Audit (days 1-3): Pull the client's current token spend by workflow. Identify which workflows are coding, tool-use, or agentic-search heavy — the areas Anthropic says Sonnet 5 specifically improved — versus workflows that need Opus-level capability. Anthropic is explicit that Sonnet 5 has "lower ability to perform cybersecurity tasks than Opus models," so not everything should move.

Switch sprint (days 4-10): Update the model string to claude-sonnet-5 across the client's agents, Claude Code configurations, and API integrations. Regression test outputs against the previous model on a sample of real production tasks — coding, reasoning, tool use — before the client commits the change at scale.

Retainer (ongoing): Monitor the spend delta as the client crosses from $2/$10 introductory pricing to $3/$15 standard pricing on September 1. Flag which workloads are worth running on Sonnet 5 at the higher rate versus shifting to Opus 4.8 or staying on an older model.

Pricing

Audit + switch sprint: $1,500–$4,000 depending on the number of workflows and integration points
Monthly retainer: $300–$600 for ongoing cost monitoring and model routing recommendations

The pitch sells itself with math: a client who migrates and validates before August 31 locks in lower per-token costs for two extra months, and arrives at the September 1 price change already knowing which workflows are worth keeping on Sonnet 5.

The Sales Conversation

Don't open with "you should switch models." Open with the deadline: "Your token cost on this model goes up on September 1. Do you know which of your workflows that hits hardest?"

Most teams cannot answer that question, because most teams do not track spend by workflow — only by total bill. That gap is the audit.

Where to Find Clients

Teams already running Claude Code or Claude API integrations in production
Agencies and AI-native startups with usage-based AI line items in their burn rate
Any team that adopted Sonnet 4.6 or Opus 4.8 in the last six months and has not revisited its model routing since

Source: https://www.anthropic.com/news/claude-sonnet-5

Tools mentioned

Related Playbooks

DeepSeek V4 Creates a New AI Service Business: Help Teams Swap Expensive Closed-Model Workflows for Open-Weight, Agent-Ready Systems Without Breaking Their Stack.

Medium · 1-2 weeks to package the migration offer and land a pilot

→

OpenAI's GPT-5.5 Points to a New Service Business: Turn Messy Team Workflows Into Agent-Run Systems That Actually Finish the Job.

Medium · 1-2 weeks to package the offer and land a pilot workflow

→

Anthropic's Claude Design Reveals a New AI Services Business: Fast Visual Prototypes That Flow Straight Into Production Handoffs.

Medium · 3-7 days to package the first service offer

→

A new playbook every morning.

Trending ideas turned into step-by-step money-making guides.