Claude Opus 4.6 adds adaptive thinking, 128K output, compaction API, and more

Anthropic released Claude Opus 4.6 with adaptive thinking, doubled output tokens (128K), a new Compaction API for long conversations, and data residency controls. The release also brings the effort parameter and fine-grained tool streaming to general availability.

  • Adaptive thinking mode
  • 128K max output tokens (up from 64K)
  • Effort parameter GA with new max level
  • Compaction API (beta) for server-side context summarization
  • Fine-grained tool streaming GA
  • Data residency controls via inference_geo

What's New

Adaptive Thinking Mode

A new thinking: {type: "adaptive"} mode lets Claude decide when and how much to think based on the problem. At the default high effort level, Claude will almost always think. At lower effort levels, it may skip thinking for simpler problems. This replaces the previous budget_tokens approach, which is now deprecated.

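import anthropic

client = anthropic.Anthropic()  # reads ANTHROPIC_API_KEY from the environment

response = client.messages.create(
    model="claude-opus-4-6",
    max_tokens=16000,
    thinking={"type": "adaptive"},  # let Claude decide when and how much to think
    messages=[{"role": "user", "content": "Solve this complex problem..."}],
)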

Adaptive thinking also automatically enables interleaved thinking, removing the need for the interleaved-thinking-2025-05-14 beta header.

128K Output Tokens

Opus 4.6 supports up to 128K output tokens, double the previous 64K limit. This allows for longer thinking budgets and more detailed responses. The SDKs require streaming for requests with large max_tokens values to avoid HTTP timeouts.
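
Since large max_tokens values call for streaming, a request that asks for the full output budget might look like the sketch below. It assumes the official anthropic Python SDK and its client.messages.stream(...) helper; the function name stream_long_response is hypothetical, and 128000 is an assumed numeric value for the "128K" limit.

```python
def stream_long_response(client, prompt, max_tokens=128_000):
    """Stream a long response and return the concatenated text.

    `client` is expected to be an `anthropic.Anthropic()` instance.
    128_000 is an assumed numeric value for the article's "128K" limit.
    """
    chunks = []
    with client.messages.stream(
        model="claude-opus-4-6",
        max_tokens=max_tokens,
        thinking={"type": "adaptive"},
        messages=[{"role": "user", "content": prompt}],
    ) as stream:
        # text_stream yields text deltas as they arrive, so the request
        # does not wait for the whole response before returning data.
        for text in stream.text_stream:
            chunks.append(text)
    return "".join(chunks)
```

Call it with a real client, for example stream_long_response(anthropic.Anthropic(), "Write a detailed report..."). Passing the client in keeps the function easy to exercise with a stub.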

Effort Parameter GA

The effort parameter no longer requires a beta header. A new max effort level provides the highest capability on Opus 4.6. Combined with adaptive thinking, the effort level lets you trade response quality against cost.
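
The article does not show where the effort setting goes in a request. The sketch below assumes it is passed as output_config={"effort": ...} and that the accepted levels are low, medium, high, and max. Only high (the default) and max are named above, so treat the other names, the placement, and the helper build_request as assumptions to check against the API reference.

```python
# Assumed level names: only "high" (the default) and "max" appear in the article.
EFFORT_LEVELS = ("low", "medium", "high", "max")


def build_request(prompt, effort="high"):
    """Build keyword arguments for client.messages.create with an effort level."""
    if effort not in EFFORT_LEVELS:
        raise ValueError(f"effort must be one of {EFFORT_LEVELS}, got {effort!r}")
    return {
        "model": "claude-opus-4-6",
        "max_tokens": 16000,
        "thinking": {"type": "adaptive"},
        # Assumption: the effort level is carried inside output_config.
        "output_config": {"effort": effort},
        "messages": [{"role": "user", "content": prompt}],
    }
```

A request at the new top level would then be client.messages.create(**build_request("Solve this complex problem...", effort="max")).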

Compaction API (Beta)

The Compaction API is a new server-side context summarization feature for long-running conversations. When the context approaches the window limit, the API automatically summarizes earlier parts of the conversation instead of truncating them.
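
The article does not show the request shape for compaction, so the sketch below is not the API. It is a client-side illustration of the idea only: once the history grows past a limit, older turns are replaced by a summary rather than dropped. The summarize callable, the word-count token estimate, and the keep_recent cutoff are all hypothetical.

```python
def estimate_tokens(message):
    """Very rough token estimate: one token per whitespace-separated word."""
    return len(message["content"].split())


def compact(messages, limit, summarize, keep_recent=2):
    """Replace older messages with a single summary once `limit` is exceeded.

    `summarize` is a hypothetical callable that takes a list of messages
    and returns a summary string. Message content is assumed to be a string.
    """
    total = sum(estimate_tokens(m) for m in messages)
    split = len(messages) - keep_recent
    if total <= limit or split <= 0:
        return list(messages)  # nothing to compact
    older, recent = messages[:split], messages[split:]
    summary = {
        "role": "user",
        "content": "Summary of earlier conversation: " + summarize(older),
    }
    # Truncation would return only `recent`; compaction keeps a summary too.
    return [summary] + recent
```

With the server-side feature this bookkeeping happens inside the API, so the client does not have to track a limit or produce its own summaries.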

Fine-Grained Tool Streaming GA

Fine-grained tool streaming is now generally available on all models and platforms with no beta header required.

Data Residency Controls

A new inference_geo parameter lets you specify where model inference runs — "global" (default) or "us". US-only inference is priced at 1.1x on Opus 4.6 and newer models.
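
A small sketch of how the two options and the price difference could be handled in client code. It assumes inference_geo is a top-level request field, which the article implies but does not show; the helper names are hypothetical, and the 1.1x multiplier comes from the article and applies to Opus 4.6 and newer models.

```python
ALLOWED_GEOS = ("global", "us")
US_ONLY_MULTIPLIER = 1.1  # from the article: US-only inference is priced at 1.1x


def with_inference_geo(request, geo="global"):
    """Return a copy of the request kwargs with inference_geo set.

    Assumption: inference_geo is a top-level field of the request.
    """
    if geo not in ALLOWED_GEOS:
        raise ValueError(f"inference_geo must be one of {ALLOWED_GEOS}, got {geo!r}")
    return {**request, "inference_geo": geo}


def estimated_cost(base_cost, geo="global"):
    """Scale a cost estimate for the chosen inference location."""
    return base_cost * US_ONLY_MULTIPLIER if geo == "us" else base_cost
```

For example, a workload estimated at 10.00 under global routing would come to roughly 11.00 with US-only inference.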

Breaking Changes

Prefill removal: Prefilling assistant messages is not supported on Opus 4.6. Requests with prefilled assistant messages return a 400 error. Use structured outputs or system prompt instructions instead.
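
One way to migrate existing prefill code is to drop the trailing assistant turn and restate it as a system prompt instruction. The helper below is a hypothetical sketch of that approach; it assumes message content is a plain string, and the instruction wording is only an example.

```python
def remove_prefill(messages, system=""):
    """Drop a trailing assistant prefill and restate it as a system instruction.

    Returns (messages, system). Inputs are not modified.
    """
    if not messages or messages[-1]["role"] != "assistant":
        return list(messages), system  # no prefill present
    prefill = messages[-1]["content"]
    instruction = f"Begin your response with exactly: {prefill}"
    new_system = f"{system}\n\n{instruction}" if system else instruction
    return list(messages[:-1]), new_system
```

The returned values can be passed as the messages and system arguments of client.messages.create. For strict JSON output, structured outputs are likely the better replacement.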

output_format renamed: The output_format parameter has moved to output_config.format. The old parameter still works but is deprecated.

# Before
response = client.messages.create(
    output_format={"type": "json_schema", "schema": {...}},
    ...
)

# After
response = client.messages.create(
    output_config={"format": {"type": "json_schema", "schema": {...}}},
    ...
)
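
If many call sites still pass output_format, a small shim can rewrite the keyword arguments before they reach the SDK. The helper name migrate_output_format is hypothetical; it is a sketch of the rename described above and nothing more.

```python
def migrate_output_format(kwargs):
    """Return a copy of request kwargs with output_format moved to output_config.format."""
    migrated = dict(kwargs)
    if "output_format" in migrated:
        fmt = migrated.pop("output_format")
        # Copy any existing output_config so the caller's dict is untouched.
        output_config = dict(migrated.get("output_config", {}))
        # An explicit output_config.format, if already present, wins.
        output_config.setdefault("format", fmt)
        migrated["output_config"] = output_config
    return migrated
```

Usage would be client.messages.create(**migrate_output_format(old_kwargs)), where old_kwargs is whatever the existing code already builds.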

Deprecations

  • thinking: {type: "enabled", budget_tokens: N} — use adaptive thinking instead
  • interleaved-thinking-2025-05-14 beta header — no longer needed with adaptive thinking
  • output_format — use output_config.format
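
The first two deprecations can be handled mechanically in existing request-building code. The sketch below uses hypothetical helper names and assumes beta flags are sent as a comma-separated anthropic-beta header value. Dropping budget_tokens means thinking depth is no longer capped by the request, so review each call site rather than applying this blindly.

```python
DEPRECATED_BETA = "interleaved-thinking-2025-05-14"


def migrate_thinking(kwargs):
    """Return a copy of request kwargs with enabled/budget_tokens thinking made adaptive."""
    migrated = dict(kwargs)
    thinking = migrated.get("thinking")
    if isinstance(thinking, dict) and thinking.get("type") == "enabled":
        # Adaptive mode takes no budget; the model decides how much to think.
        migrated["thinking"] = {"type": "adaptive"}
    return migrated


def strip_deprecated_beta(header_value):
    """Remove the interleaved-thinking flag from a comma-separated beta header value."""
    flags = [flag.strip() for flag in header_value.split(",") if flag.strip()]
    return ",".join(flag for flag in flags if flag != DEPRECATED_BETA)
```

If strip_deprecated_beta returns an empty string, the header can be omitted entirely.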

Paul Redmond

Staff writer at Laravel News. Full stack web developer and author.
