Opus 4.6 Fast Mode available on AI Gateway
Mirrored from Vercel — AI for archival readability. Support the source by reading on the original site.
Fast mode support for Claude Opus 4.6 is now available on AI Gateway.
Fast mode is a premium high-speed option that delivers 2.5x faster output token speeds with the same model intelligence. This is an early, experimental feature.
Fast mode's increased output token speeds enable new use cases, especially for human-in-the-loop workflows. Run large coding tasks without needing to context switch and get planning results without extended waits.
To enable fast mode, pass speed: 'fast' in the anthropic provider options in AI SDK:
You can use fast mode with Claude Code via AI Gateway by setting the CLAUDE_CODE_SKIP_FAST_MODE_ORG_CHECK variable in your shell configuration file or in ~/.claude/settings.json.
Try fast mode directly in the AI Gateway playground for Opus 4.6.
Fast mode is priced at 6x standard Opus rates.
Standard | Fast Mode |
|---|---|
Input: $5 / 1M tokens Output: $25 / 1M tokens | Input: $30 / 1M tokens Output: $150 / 1M tokens |
All standard pricing multipliers (e.g., prompt caching) apply on top of these rates.
Discussion (0)
Sign in to join the discussion. Free account, 30 seconds — email code or GitHub.
Sign in →No comments yet. Sign in and be the first to say something.