Rate Limits

Rate limiting configuration, quota management, and throttling policies for API consumers.

Overview

The Rate Limits rail manages API consumption controls:

Rate limit policies — Define requests-per-second/minute/hour limits per tenant/API key
Quota management — Track and enforce monthly API call quotas per subscription tier
Burst handling — Configure burst allowances above base rate limits
Throttling — Graceful degradation with 429 responses and Retry-After headers
Rate limit analytics — Monitor consumption patterns and near-limit alerts

Concept	Description
Token Bucket	Algorithm used for rate limiting with burst support
Sliding Window	Time-based rate limit window that slides with each request
Quota	Total allowed API calls within a billing period
Burst	Short-term allowance above the sustained rate limit
Retry-After	HTTP header indicating when the client may retry

Tier	Requests/Second	Requests/Minute	Monthly Quota
Free	5	100	10,000
Professional	50	1,000	500,000
Enterprise	500	10,000	Unlimited

Refer to the API Explorer for interactive endpoint documentation.

All endpoints require authentication via Bearer token or API key.