API requests are rate limited per account to ensure system stability.
Generation limits
You can create up to 10 generations per 60 seconds, with only 1 concurrent generation allowed.Endpoint limits
60 requests per minute.
60 requests per minute.
30 requests per minute.
10 requests per minute.
1 concurrent request maximum.
Handling rate limits
| Implementation | Description |
|---|---|
| Monitor remaining requests. | Check the X-RateLimit-Remaining header to track your usage. |
| Implement exponential backoff. | When rate limited (429 status), wait before retrying with increasing delays. |
| Use reset timestamp. | Check X-RateLimit-Reset to know exactly when to retry. |
| Queue requests. | Implement client-side queuing to stay within rate limits. |
| Log request IDs. | Always log X-Request-ID for debugging failed or rate-limited request. |