Rate Limits
Rate limiting configuration, quota management, and throttling policies for API consumers.
Overview
The Rate Limits rail manages API consumption controls:
- Rate limit policies — Define requests-per-second/minute/hour limits per tenant/API key
- Quota management — Track and enforce monthly API call quotas per subscription tier
- Burst handling — Configure burst allowances above base rate limits
- Throttling — Graceful degradation with 429 responses and Retry-After headers
- Rate limit analytics — Monitor consumption patterns and near-limit alerts
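
As an illustration of the throttling behaviour listed above, the following client-side sketch shows one way to honour a 429 response and its Retry-After header. It assumes the header carries either a number of seconds or an HTTP date, which are the two forms the HTTP specification allows; the function names, the `send` callable, and the one-second fallback are hypothetical and not part of this API.

```python
import time
from datetime import datetime, timezone
from email.utils import parsedate_to_datetime


def retry_after_seconds(header_value, now=None, default=1.0):
    """Convert a Retry-After header value into a wait time in seconds."""
    if header_value is None:
        return default  # assumed fallback when the header is missing
    value = header_value.strip()
    if value.isdigit():
        # Form 1: a non-negative number of seconds.
        return float(value)
    try:
        # Form 2: an HTTP date such as "Wed, 21 Oct 2015 07:28:00 GMT".
        retry_at = parsedate_to_datetime(value)
    except (TypeError, ValueError):
        return default
    if retry_at.tzinfo is None:
        retry_at = retry_at.replace(tzinfo=timezone.utc)
    now = now or datetime.now(timezone.utc)
    return max(0.0, (retry_at - now).total_seconds())


def call_with_retry(send, max_attempts=3, sleep=time.sleep):
    """Call send() until it stops returning 429 or attempts run out.

    `send` is a hypothetical callable returning (status_code, headers).
    """
    status = None
    for attempt in range(max_attempts):
        status, headers = send()
        if status != 429:
            return status
        if attempt < max_attempts - 1:
            sleep(retry_after_seconds(headers.get("Retry-After")))
    return status
```

Passing `sleep` and `now` as parameters keeps the waiting logic testable without real delays.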
Key Concepts
| Concept | Description |
|---|---|
| Token Bucket | Algorithm used for rate limiting with burst support |
| Sliding Window | Time-based rate limit window that slides with each request |
| Quota | Total allowed API calls within a billing period |
| Burst | Short-term allowance above the sustained rate limit |
| Retry-After | HTTP header indicating when the client may retry |
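
The two algorithms in the table can be sketched as follows. This is a minimal, single-process illustration and not the service's actual implementation; the class names, parameters, and the injectable `clock` are assumptions made for the example.

```python
import time
from collections import deque


class TokenBucket:
    """Token bucket: refills at `rate` tokens/second, up to `capacity`.

    `capacity` above the sustained rate is what allows a burst.
    """

    def __init__(self, rate, capacity, clock=time.monotonic):
        self.rate = rate
        self.capacity = capacity
        self.clock = clock
        self.tokens = float(capacity)  # start with a full bucket
        self.updated = clock()

    def allow(self, cost=1):
        now = self.clock()
        # Add the tokens earned since the last call, capped at capacity.
        self.tokens = min(self.capacity,
                          self.tokens + (now - self.updated) * self.rate)
        self.updated = now
        if self.tokens >= cost:
            self.tokens -= cost
            return True
        return False


class SlidingWindowLimiter:
    """Sliding window log: at most `limit` requests in any `window` seconds."""

    def __init__(self, limit, window, clock=time.monotonic):
        self.limit = limit
        self.window = window
        self.clock = clock
        self.hits = deque()  # timestamps of accepted requests

    def allow(self):
        now = self.clock()
        # Drop timestamps that have slid out of the window.
        while self.hits and now - self.hits[0] >= self.window:
            self.hits.popleft()
        if len(self.hits) < self.limit:
            self.hits.append(now)
            return True
        return False
```

For example, `TokenBucket(rate=5, capacity=10)` would sustain 5 requests per second while tolerating a burst of up to 10 requests from a full bucket.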
Default Limits
| Tier | Requests/Second | Requests/Minute | Monthly Quota |
|---|---|---|---|
| Free | 5 | 100 | 10,000 |
| Professional | 50 | 1,000 | 500,000 |
| Enterprise | 500 | 10,000 | Unlimited |
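
To make the relationship between the rate limits and the monthly quota concrete, the sketch below encodes the table as data and computes the remaining quota for a tier. The values come from the table; the names `TierLimits`, `DEFAULT_LIMITS`, and both helper functions are hypothetical, and representing "Unlimited" as `None` is an assumption of the example.

```python
from dataclasses import dataclass
from typing import Optional


@dataclass(frozen=True)
class TierLimits:
    per_second: int
    per_minute: int
    monthly_quota: Optional[int]  # None stands for "Unlimited"


# Values copied from the Default Limits table.
DEFAULT_LIMITS = {
    "free": TierLimits(5, 100, 10_000),
    "professional": TierLimits(50, 1_000, 500_000),
    "enterprise": TierLimits(500, 10_000, None),
}


def remaining_quota(tier, calls_used):
    """Calls left in the billing period, or None if the tier is unlimited."""
    quota = DEFAULT_LIMITS[tier].monthly_quota
    if quota is None:
        return None
    return max(0, quota - calls_used)


def minutes_to_exhaust_quota(tier):
    """Minutes of traffic at the full per-minute limit before the quota runs out."""
    limits = DEFAULT_LIMITS[tier]
    if limits.monthly_quota is None:
        return None
    return limits.monthly_quota / limits.per_minute
```

By this arithmetic, a Free-tier client sending a constant 100 requests per minute would use its 10,000-call quota in 100 minutes, so staying within the rate limits does not by itself keep a client within its quota.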
Endpoints
Refer to the API Explorer for interactive endpoint documentation.
Authentication
All endpoints require authentication via Bearer token or API key.
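
A minimal sketch of building the request headers for either credential type is shown below. The `Authorization: Bearer` form is the standard scheme for bearer tokens; the `X-API-Key` header name is only an assumption, since this page does not state which header carries the API key, so check the API Explorer for the actual name.

```python
def auth_headers(bearer_token=None, api_key=None, api_key_header="X-API-Key"):
    """Return HTTP headers for exactly one of the two credential types.

    `api_key_header` defaults to a hypothetical name; replace it with the
    header the API actually expects.
    """
    if bearer_token and api_key:
        raise ValueError("provide a bearer token or an API key, not both")
    if bearer_token:
        return {"Authorization": f"Bearer {bearer_token}"}
    if api_key:
        return {api_key_header: api_key}
    raise ValueError("a bearer token or an API key is required")
```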