Rate limits
Per-key request limits and per-plan concurrency.
Request rate limit
Each API key is limited to 60 requests per minute (sliding window). When you exceed
it, the API responds with 429 rate_limited and these headers:
| Header | Meaning |
|---|---|
Retry-After | Seconds to wait before retrying. |
X-RateLimit-Limit | The limit per window. |
X-RateLimit-Remaining | Requests remaining in the window. |
X-RateLimit-Reset | Unix time (seconds) when the window resets. |
Back off and retry after Retry-After seconds.
Concurrency limit
The number of in-flight transcriptions (pending or processing) per account is
capped by your plan. Creating a job beyond the cap returns 429. Wait for jobs to finish
or upgrade your plan to raise the limit.
| Plan | Concurrent transcriptions |
|---|---|
| Free | 3 |
| Pro | 25 |