Rate limits

Request rate limit

Each API key is limited to 60 requests per minute (sliding window). When you exceed it, the API responds with 429 rate_limited and these headers:

Header	Meaning
`Retry-After`	Seconds to wait before retrying.
`X-RateLimit-Limit`	The limit per window.
`X-RateLimit-Remaining`	Requests remaining in the window.
`X-RateLimit-Reset`	Unix time (seconds) when the window resets.

Back off and retry after Retry-After seconds.

The number of in-flight transcriptions (pending or processing) per account is capped by your plan. Creating a job beyond the cap returns 429. Wait for jobs to finish or upgrade your plan to raise the limit.

Plan	Concurrent transcriptions
Free	3
Pro	25

Rate limits

Request rate limit

Concurrency limit

On this page