Is AI21 Labs Down?
Check live AI21 Labs status — Jamba long-context model (256k tokens), Jurassic-2, and enterprise API. See recent incidents and set up free email alerts.
AI21 Labs — live status
Updated every 5 minutes. Full incident history at prismix.dev/service/ai21.
Quick check: is AI21 Labs down right now?
- Prismix: prismix.dev/service/ai21 — live status + 30-day uptime + incidents.
- AI21 Labs status: status.ai21.com, the official AI21 Labs status page for the API and Jamba model.
- API call:
  curl https://prismix.dev/api/v1/statuses | jq '.services[] | select(.id=="ai21")'
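The same check can be scripted without jq. The following is a minimal Python sketch, assuming only what the curl command above implies: that the endpoint returns JSON with a top-level services array whose entries carry an id field. The function names find_service and fetch_service are illustrative, and no other response fields are assumed.

```python
import json
import urllib.request

# URL taken from the curl command above.
STATUS_URL = "https://prismix.dev/api/v1/statuses"


def find_service(payload, service_id):
    """Return the entry in payload["services"] whose "id" matches, or None.

    Mirrors the jq filter: .services[] | select(.id == service_id)
    """
    for service in payload.get("services", []):
        if service.get("id") == service_id:
            return service
    return None


def fetch_service(service_id, url=STATUS_URL, timeout=10):
    """Download the status document and pick out a single service entry."""
    with urllib.request.urlopen(url, timeout=timeout) as response:
        payload = json.load(response)
    return find_service(payload, service_id)
```

Calling fetch_service("ai21") performs the network request and returns the matching entry, or None if the id is not present in the response.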
Set up free email alerts for AI21 Labs
1. Sign in: go to prismix.dev/sign-in and use email OTP or GitHub sign-in.
2. Star AI21 Labs: on prismix.dev/service/ai21, click the ☆ star icon.
3. Alerts are live: you'll get an email within minutes of any status change.
Common causes of "AI21 Labs not working"
If Prismix shows AI21 Labs as "Operational" but you're having issues:
- Jamba 256k context window truncating at 128k: the context_length parameter may not be respected if you're calling an older Jamba model variant. Verify that you are using jamba-1.6-large or newer and that your API plan supports the full 256k context window. Older model slugs cap at 128k regardless of what you pass.
- Jurassic-2 API endpoint deprecated: AI21 Labs has sunset the Jurassic-2 (j2-*) endpoints in favor of the Jamba API. If you're receiving 404 or 410 errors on /v1/complete, migrate to the chat completions endpoint under the Jamba model family.
- API key invalid (must use a Studio key, not a playground key): AI21 Labs has two separate key namespaces: keys generated in the AI21 Studio console (studio.ai21.com) and keys from the older playground. Only Studio API keys work with the production API. Regenerate your key from the Studio dashboard and update your environment variables.
- chat_only mode returning non-chat format: if you set chat_only: true but the response is still in completion format, you may be hitting an older cached model version. Explicitly pass model: "jamba-1.6-large" and use the /v1/chat/completions endpoint rather than the legacy completion path.
- Rate limit 429 on free tier (5 req/min): the AI21 Labs free tier enforces a hard limit of 5 requests per minute. Burst traffic from parallel application calls will reliably hit this limit. Implement request queuing with a 13-second minimum interval between calls (60 s / 5 requests = 12 s, plus a 1-second margin), or upgrade to a paid plan for higher rate limits.
- Streaming response not finishing (connection reset before final token): streaming via the Jamba API can be interrupted if the client closes the connection before receiving the stop token, or if a network proxy buffers the entire stream before forwarding. Ensure your HTTP client has streaming enabled (e.g., stream=True in Python) and set a generous read timeout of at least 120 seconds for long outputs.
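The request queuing suggested for the free-tier rate limit can be sketched as a small throttle that spaces calls at least 13 seconds apart. This is an illustrative Python sketch, not an AI21 SDK feature: the class name MinIntervalThrottle, its wait method, and the injectable clock and sleep parameters are all assumptions made for the example.

```python
import threading
import time


class MinIntervalThrottle:
    """Space successive calls at least min_interval seconds apart.

    Hypothetical helper: the 13-second default follows the interval
    suggested above for a 5 requests/minute limit.
    """

    def __init__(self, min_interval=13.0, clock=time.monotonic, sleep=time.sleep):
        self.min_interval = min_interval
        self._clock = clock      # injectable so the logic can be tested without waiting
        self._sleep = sleep
        self._last_call = None   # scheduled time of the previous call, if any
        self._lock = threading.Lock()  # serializes parallel callers into a queue

    def wait(self):
        """Block until the next call is allowed; return the seconds slept."""
        with self._lock:
            now = self._clock()
            delay = 0.0
            if self._last_call is not None:
                elapsed = now - self._last_call
                delay = max(0.0, self.min_interval - elapsed)
            if delay > 0:
                self._sleep(delay)
            self._last_call = now + delay
            return delay
```

Create one shared throttle and call its wait() method immediately before each API request. Because the lock is held while sleeping, parallel callers are released one at a time, which is the queuing behavior the rate-limit item describes.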
Stop manually checking — get alerts instead
Star AI21 Labs on Prismix and get emailed the moment status changes. Free, no credit card.
Monitor related AI model APIs?
Full status dashboard: prismix.dev/status