Rate Limits

Rate limiting configuration, quota management, and throttling policies for API consumers.

The Rate Limits rail manages API consumption controls:

  • Rate limit policies — Define requests-per-second/minute/hour limits per tenant/API key
  • Quota management — Track and enforce monthly API call quotas per subscription tier
  • Burst handling — Configure burst allowances above base rate limits
  • Throttling — Graceful degradation with 429 responses and Retry-After headers
  • Rate limit analytics — Monitor consumption patterns and near-limit alerts
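
On the client side, the throttling behavior described above (429 responses with a Retry-After header) could be handled roughly as follows. This is a minimal sketch, not the service's documented client: the function name `retry_delay_seconds` and the one-second fallback are assumptions, and it accepts both forms of Retry-After that HTTP permits (a number of seconds or an HTTP date).

```python
from datetime import datetime, timezone
from email.utils import parsedate_to_datetime
from typing import Mapping, Optional


def retry_delay_seconds(
    status: int,
    headers: Mapping[str, str],
    now: Optional[datetime] = None,
    default: float = 1.0,  # assumed fallback when the header is missing or unparsable
) -> Optional[float]:
    """Return how long to wait before retrying, or None if the request was not throttled."""
    if status != 429:
        return None

    # Real HTTP header maps are usually case-insensitive; a plain mapping is used here.
    value = headers.get("Retry-After")
    if value is None:
        return default

    value = value.strip()
    if value.isdigit():
        # Delay-seconds form, e.g. "Retry-After: 30"
        return float(value)

    try:
        # HTTP-date form, e.g. "Retry-After: Wed, 21 Oct 2015 07:28:00 GMT"
        retry_at = parsedate_to_datetime(value)
    except (TypeError, ValueError):
        return default

    if retry_at.tzinfo is None:
        retry_at = retry_at.replace(tzinfo=timezone.utc)
    current = now or datetime.now(timezone.utc)
    return max(0.0, (retry_at - current).total_seconds())
```

A caller would sleep for the returned number of seconds before re-sending the request, ideally with a cap on the total number of retries.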

| Concept | Description |
|---|---|
| Token Bucket | Algorithm used for rate limiting with burst support |
| Sliding Window | Time-based rate limit window that slides with each request |
| Quota | Total allowed API calls within a billing period |
| Burst | Short-term allowance above the sustained rate limit |
| Retry-After | HTTP header indicating when the client may retry |
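
To make the token bucket and sliding window concepts concrete, here is a minimal in-memory sketch of each. It illustrates the general algorithms only; this page does not describe how the service implements them, and the class names, parameters, and injectable `clock` are assumptions made for the example.

```python
import time
from collections import deque
from typing import Callable, Deque


class TokenBucket:
    """Token bucket: `rate` tokens are added per second, up to `capacity` (the burst size)."""

    def __init__(self, rate: float, capacity: float,
                 clock: Callable[[], float] = time.monotonic) -> None:
        self.rate = rate
        self.capacity = capacity
        self._clock = clock
        self._tokens = capacity  # start full so an initial burst is allowed
        self._last = clock()

    def allow(self, cost: float = 1.0) -> bool:
        now = self._clock()
        # Refill in proportion to elapsed time, never exceeding capacity.
        self._tokens = min(self.capacity, self._tokens + (now - self._last) * self.rate)
        self._last = now
        if self._tokens >= cost:
            self._tokens -= cost
            return True
        return False  # caller would respond with 429


class SlidingWindowLimiter:
    """Sliding window log: at most `limit` requests in any `window_seconds` span."""

    def __init__(self, limit: int, window_seconds: float,
                 clock: Callable[[], float] = time.monotonic) -> None:
        self.limit = limit
        self.window = window_seconds
        self._clock = clock
        self._stamps: Deque[float] = deque()

    def allow(self) -> bool:
        now = self._clock()
        # Drop timestamps that have fallen out of the window.
        while self._stamps and now - self._stamps[0] >= self.window:
            self._stamps.popleft()
        if len(self._stamps) < self.limit:
            self._stamps.append(now)
            return True
        return False
```

For example, `TokenBucket(rate=5, capacity=10)` sustains 5 requests per second while tolerating a burst of up to 10; the burst size of 10 is an illustrative value, not a documented setting.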

| Tier | Requests/Second | Requests/Minute | Monthly Quota |
|---|---|---|---|
| Free | 5 | 100 | 10,000 |
| Professional | 50 | 1,000 | 500,000 |
| Enterprise | 500 | 10,000 | Unlimited |
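
The tier limits above could be represented in code along these lines. This is a sketch under stated assumptions: the numbers come from the table, but the names `TierLimits`, `TIERS`, `quota_remaining`, and `near_limit`, and the 80% alert threshold, are hypothetical and not part of the documented API.

```python
from dataclasses import dataclass
from typing import Dict, Optional


@dataclass(frozen=True)
class TierLimits:
    per_second: int
    per_minute: int
    monthly_quota: Optional[int]  # None represents "Unlimited"


# Values taken from the tier table; the keys are illustrative identifiers.
TIERS: Dict[str, TierLimits] = {
    "free": TierLimits(per_second=5, per_minute=100, monthly_quota=10_000),
    "professional": TierLimits(per_second=50, per_minute=1_000, monthly_quota=500_000),
    "enterprise": TierLimits(per_second=500, per_minute=10_000, monthly_quota=None),
}


def quota_remaining(tier: str, used_this_month: int) -> Optional[int]:
    """Return the calls left in the billing period, or None if the tier is unlimited."""
    quota = TIERS[tier].monthly_quota
    if quota is None:
        return None
    return max(0, quota - used_this_month)


def near_limit(tier: str, used_this_month: int, threshold: float = 0.8) -> bool:
    """Return True once usage reaches `threshold` of the monthly quota (assumed 80%)."""
    quota = TIERS[tier].monthly_quota
    if quota is None:
        return False
    return used_this_month / quota >= threshold
```

Note that the per-minute limits are lower than 60 times the per-second limits, so the per-second value acts as a short-term ceiling while the per-minute value bounds sustained traffic.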

Refer to the API Explorer for interactive endpoint documentation.

All endpoints require authentication via Bearer token or API key.