Rate Limits

INFER applies rate limits to ensure fair usage and protect the network.

Default Limits

Every API response includes rate limit headers:

Header	Description
`X-RateLimit-Limit`	Maximum requests in the current window
`X-RateLimit-Remaining`	Requests remaining in the current window
`X-RateLimit-Reset`	Unix timestamp when the window resets

If you exceed the rate limit, you’ll receive a 429 Too Many Requests response:


{
  "error": {
    "code": "RATE_LIMITED",
    "message": "Rate limit exceeded. Try again in 30 seconds."
  }
}

The Retry-After header indicates how many seconds to wait.