Operations

Rate limits

Two dimensions: requests per minute (smooth-out) and tokens per month (plan quota). Both return 429 when exceeded.

Per-plan limits

PlanRequests / minTokens / monthMax API keys
Free60100,0002
Pro ($19/mo)30010,000,00010
Team ($49/mo)1,000100,000,000Unlimited
EnterpriseCustomUnlimitedUnlimited

Requests per minute

Rate limiting is applied per API key using a sliding-window counter. When exceeded, you get 429 Too Many Requests with a Retry-After header indicating how many seconds to wait.

Monthly token quota

The quota counts output tokens from your compressed responses (i.e., what mintoken saved you from paying for at the provider, but counts against your mintoken plan). When approaching the limit, every response carries:

HeaderValue
X-Mintoken-Tokens-UsedCurrent cumulative token usage this billing cycle.
X-Mintoken-Tokens-LimitYour plan's monthly limit.

When you exceed the quota, subsequent requests return 429 token quota exceeded until the next billing cycle starts.

Zero-spend protection
Mintoken never auto-charges you for overage. Hitting your limit fails fast — upgrade in the dashboard to raise the ceiling. This is by design: no surprise end-of-month bills.

Request size

Every plan has a per-request body-size cap to protect the proxy. Exceeded: 413 Payload Too Large.

PlanMax request body
Free512 KB
Pro2 MB
Team8 MB
EnterpriseNegotiable