Skip to main content

Rate Limits

Real200 implements rate limiting on API requests to ensure service stability and fairness.

Rate Limit Metrics​

MetricDescription
RPM (Requests Per Minute)Maximum requests per minute
TPM (Tokens Per Minute)Maximum tokens per minute

Default Limits​

User TypeRPMTPM
Free Users6050,000
Paid Users600500,000
EnterpriseContact for customContact for custom

:::tip Increase Limits

To increase rate limit thresholds, contact support or submit a request in the console.

:::

Rate Limit Response​

When limits are exceeded, the API returns 429 Too Many Requests:

{
"error": {
"message": "Request rate limit exceeded (RPM: 60). Please retry later.",
"type": "rate_limit_exceeded",
"code": "rate_limit_exceeded"
}
}

Retry Strategy​

Recommended Exponential Backoff strategy:

import time
import random

def call_with_retry(func, max_retries=3):
for attempt in range(max_retries):
try:
return func()
except RateLimitError:
if attempt == max_retries - 1:
raise
wait = (2 ** attempt) + random.uniform(0, 1)
time.sleep(wait)