Hire Laravel developers with AI expertise at $20/hr. Get started in 48 hours.

Claude Opus 4.6 adds adaptive thinking, 128K output, compaction API, and more

Published on by

Claude Opus 4.6 adds adaptive thinking, 128K output, compaction API, and more image

Anthropic released Claude Opus 4.6 with adaptive thinking, doubled output tokens (128K), a new Compaction API for long conversations, and data residency controls. The release also brings the effort parameter and fine-grained tool streaming to general availability.

  • Adaptive thinking mode
  • 128K max output tokens (up from 64K)
  • Effort parameter GA with new max level
  • Compaction API (beta) for server-side context summarization
  • Fine-grained tool streaming GA
  • Data residency controls via inference_geo

What's New

Adaptive Thinking Mode

A new thinking: {type: "adaptive"} mode lets Claude decide when and how much to think based on the problem. At the default high effort level, Claude will almost always think. At lower effort levels, it may skip thinking for simpler problems. This replaces the previous budget_tokens approach, which is now deprecated.

response = client.messages.create(
model="claude-opus-4-6",
max_tokens=16000,
thinking={"type": "adaptive"},
messages=[{"role": "user", "content": "Solve this complex problem..."}]
)

Adaptive thinking also automatically enables interleaved thinking, removing the need for the interleaved-thinking-2025-05-14 beta header.

128K Output Tokens

Opus 4.6 supports up to 128K output tokens, double the previous 64K limit. This allows for longer thinking budgets and more detailed responses. The SDKs require streaming for requests with large max_tokens values to avoid HTTP timeouts.

Effort Parameter GA

The effort parameter no longer requires a beta header. A new max effort level provides the highest capability on Opus 4.6. Combine it with adaptive thinking for cost-quality tradeoffs.

Compaction API (Beta)

A new server-side context summarization feature that enables long-running conversations. When context approaches the window limit, the API automatically summarizes earlier parts of the conversation instead of truncating.

Fine-Grained Tool Streaming GA

Fine-grained tool streaming is now generally available on all models and platforms with no beta header required.

Data Residency Controls

A new inference_geo parameter lets you specify where model inference runs — "global" (default) or "us". US-only inference is priced at 1.1x on Opus 4.6 and newer models.

Breaking Changes

Prefill removal: Prefilling assistant messages is not supported on Opus 4.6. Requests with prefilled assistant messages return a 400 error. Use structured outputs or system prompt instructions instead.

output_format renamed: The output_format parameter has moved to output_config.format. The old parameter still works but is deprecated.

# Before
response = client.messages.create(
output_format={"type": "json_schema", "schema": {...}},
...
)
 
# After
response = client.messages.create(
output_config={"format": {"type": "json_schema", "schema": {...}}},
...
)

Deprecations

  • thinking: {type: "enabled", budget_tokens: N} — use adaptive thinking instead
  • interleaved-thinking-2025-05-14 beta header — no longer needed with adaptive thinking
  • output_format — use output_config.format

References

Paul Redmond photo

Staff writer at Laravel News. Full stack web developer and author.

Filed in:
Cube

Laravel Newsletter

Join 40k+ other developers and never miss out on new tips, tutorials, and more.

image
Laravel Cloud

Easily create and manage your servers and deploy your Laravel applications in seconds.

Visit Laravel Cloud
Tinkerwell logo

Tinkerwell

The must-have code runner for Laravel developers. Tinker with AI, autocompletion and instant feedback on local and production environments.

Tinkerwell
Get expert guidance in a few days with a Laravel code review logo

Get expert guidance in a few days with a Laravel code review

Expert code review! Get clear, practical feedback from two Laravel devs with 10+ years of experience helping teams build better apps.

Get expert guidance in a few days with a Laravel code review
PhpStorm logo

PhpStorm

The go-to PHP IDE with extensive out-of-the-box support for Laravel and its ecosystem.

PhpStorm
Laravel Cloud logo

Laravel Cloud

Easily create and manage your servers and deploy your Laravel applications in seconds.

Laravel Cloud
Acquaint Softtech logo

Acquaint Softtech

Acquaint Softtech offers AI-ready Laravel developers who onboard in 48 hours at $3000/Month with no lengthy sales process and a 100 percent money-back guarantee.

Acquaint Softtech
Kirschbaum logo

Kirschbaum

Providing innovation and stability to ensure your web application succeeds.

Kirschbaum
Shift logo

Shift

Running an old Laravel version? Instant, automated Laravel upgrades and code modernization to keep your applications fresh.

Shift
Harpoon: Next generation time tracking and invoicing logo

Harpoon: Next generation time tracking and invoicing

The next generation time-tracking and billing software that helps your agency plan and forecast a profitable future.

Harpoon: Next generation time tracking and invoicing
Lucky Media logo

Lucky Media

Get Lucky Now - the ideal choice for Laravel Development, with over a decade of experience!

Lucky Media
SaaSykit: Laravel SaaS Starter Kit logo

SaaSykit: Laravel SaaS Starter Kit

SaaSykit is a Multi-tenant Laravel SaaS Starter Kit that comes with all features required to run a modern SaaS. Payments, Beautiful Checkout, Admin Panel, User dashboard, Auth, Ready Components, Stats, Blog, Docs and more.

SaaSykit: Laravel SaaS Starter Kit
MongoDB logo

MongoDB

Enhance your PHP applications with the powerful integration of MongoDB and Laravel, empowering developers to build applications with ease and efficiency. Support transactional, search, analytics and mobile use cases while using the familiar Eloquent APIs. Discover how MongoDB's flexible, modern database can transform your Laravel applications.

MongoDB

The latest

View all →
Laravel Cloud Adds Path Blocking to Prevent Bots From Waking Hibernated Apps image

Laravel Cloud Adds Path Blocking to Prevent Bots From Waking Hibernated Apps

Read article
Making Laravel MongoDB Operations Idempotent: Safe Retries for Financial Transactions image

Making Laravel MongoDB Operations Idempotent: Safe Retries for Financial Transactions

Read article
FormRequest Strict Mode and Queue Job Inspection in Laravel 13.4.0 image

FormRequest Strict Mode and Queue Job Inspection in Laravel 13.4.0

Read article
Pretty PHP Info: A Modern Replacement for `phpinfo()` image

Pretty PHP Info: A Modern Replacement for `phpinfo()`

Read article
Laracon US 2026 Announced image

Laracon US 2026 Announced

Read article
Laravel QuickBooks MCP Server: Connect QuickBooks Online to AI Clients image

Laravel QuickBooks MCP Server: Connect QuickBooks Online to AI Clients

Read article