Most APIs on the Monica API Platform are rate limited. This document lists the default limits for each endpoint.

If these limits are too low for your needs, please contact us to request a higher limit.

You can check your actual rate limits anytime on the limits page in the dashboard.

Chat

| Model | RPM | TPM |
| --- | --- | --- |
| Steiner-preview | 10 | 150,000 |
| gpt-4o | 100 | 100,000 |
| gpt-4o-mini | 500 | 2,000,000 |
| o1-preview | 10 | 30,000 |
| o1-mini | 50 | 300,000 |
| Claude 3.5 Sonnet | 100 | 100,000 |
| Claude 3.5 Haiku | 100 | 100,000 |
| Claude 3 Opus | 100 | 100,000 |
| Claude 3 Haiku | 500 | 2,000,000 |
| Gemini 1.5 Pro 002 | 100 | 100,000 |
| Gemini 1.5 Flash 002 | 500 | 2,000,000 |
| Grok Beta | 50 | 300,000 |
| Llama 3.1 405B Instruct | 50 | 300,000 |
| Llama 3.3 70B Instruct | 50 | 300,000 |
| DeepSeek V2.5 | 50 | 300,000 |
| Qwen 2.5 72B | 50 | 300,000 |
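
The Chat limits above pair a per-minute request cap with a much larger second figure, which this sketch assumes is a per-minute token budget. One way a client might stay under both is a sliding 60-second window kept on the client side. The class name `MinuteWindowLimiter` and its methods are hypothetical and not part of the Monica API; the numbers in the usage note are the gpt-4o defaults from the table. The platform may count requests and tokens differently, so treat this as a conservative client-side guard rather than a mirror of the server's accounting.

```python
import time
from collections import deque


class MinuteWindowLimiter:
    """Hypothetical client-side limiter: caps calls and tokens per rolling 60 s."""

    def __init__(self, rpm, tpm, clock=time.monotonic):
        self.rpm = rpm                # assumed request budget per minute
        self.tpm = tpm                # assumed token budget per minute
        self.clock = clock            # injectable so the logic can be tested
        self.events = deque()         # (timestamp, tokens) of recent calls
        self.tokens_in_window = 0

    def _evict(self, now):
        # Drop calls that are at least 60 seconds old.
        while self.events and now - self.events[0][0] >= 60.0:
            _, tokens = self.events.popleft()
            self.tokens_in_window -= tokens

    def wait_time(self, tokens):
        """Seconds until a call costing `tokens` fits inside both budgets."""
        if tokens > self.tpm:
            raise ValueError("request is larger than the per-minute token budget")
        now = self.clock()
        self._evict(now)
        need_requests = len(self.events) + 1 - self.rpm
        need_tokens = self.tokens_in_window + tokens - self.tpm
        wait = 0.0
        freed_requests = 0
        freed_tokens = 0
        # Walk the oldest calls until enough of them will have expired.
        for ts, used in self.events:
            if freed_requests >= need_requests and freed_tokens >= need_tokens:
                break
            freed_requests += 1
            freed_tokens += used
            wait = ts + 60.0 - now
        return max(wait, 0.0)

    def record(self, tokens):
        """Register a call that is being sent now."""
        self.events.append((self.clock(), tokens))
        self.tokens_in_window += tokens

    def acquire(self, tokens):
        """Block until the call fits, then record it."""
        delay = self.wait_time(tokens)
        while delay > 0:
            time.sleep(delay)
            delay = self.wait_time(tokens)
        self.record(tokens)
```

A caller would create `MinuteWindowLimiter(rpm=100, tpm=100_000)` for gpt-4o and call `acquire(estimated_tokens)` before each request. The token estimate is the caller's own; the sketch does not count tokens itself.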

Image generations

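| Model | RPM | Concurrent Requests |
| --- | --- | --- |
| Stable Diffusion XL 1.0 | 10 | 8 |
| Stable Diffusion 3 Large | 10 | 8 |
| Stable Diffusion 3.5 Large | 10 | 8 |
| Flux Schnell 1.0 | 10 | 8 |
| Flux Dev 1.0 | 10 | 8 |
| Flux Pro 1.0 | 10 | 8 |
| DALL·E 3 | 10 | 8 |
| Playground V2.5 | 10 | 8 |
| Ideogram V2 | 10 | 8 |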
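
The image endpoints above combine a per-minute cap with a cap on requests in flight. A minimal sketch of honoring both on the client side follows: a semaphore bounds concurrency, and request starts are spaced evenly at 60/RPM seconds. The `ConcurrencyGate` class is hypothetical and makes no Monica API call itself. Even spacing is a deliberately conservative choice made for this sketch; the platform may well allow bursts within a minute.

```python
import threading
import time
from concurrent.futures import ThreadPoolExecutor


class ConcurrencyGate:
    """Hypothetical helper: bounds in-flight calls and paces their start times."""

    def __init__(self, max_concurrent, rpm, clock=time.monotonic, sleep=time.sleep):
        self._slots = threading.BoundedSemaphore(max_concurrent)
        self._interval = 60.0 / rpm       # seconds between request starts
        self._lock = threading.Lock()
        self._next_start = float("-inf")  # earliest allowed start of the next call
        self._clock = clock
        self._sleep = sleep

    def run(self, fn, *args, **kwargs):
        # Hold one concurrency slot for the whole call, including any pacing wait.
        with self._slots:
            with self._lock:
                now = self._clock()
                start_at = max(now, self._next_start)
                self._next_start = start_at + self._interval
            delay = start_at - now
            if delay > 0:
                self._sleep(delay)
            return fn(*args, **kwargs)


def run_all(gate, fn, items, workers=8):
    """Run fn over items through the gate; results keep the input order."""
    with ThreadPoolExecutor(max_workers=workers) as pool:
        return list(pool.map(lambda item: gate.run(fn, item), items))
```

With the defaults listed above, a caller might build `ConcurrencyGate(max_concurrent=8, rpm=10)` and pass its own request function to `run_all`. At 10 RPM the pacing interval is 6 seconds, so the concurrency cap of 8 only comes into play when individual generations take longer than that.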

Text tools

| API | RPM | Concurrent Requests |
| --- | --- | --- |
| Humanize | 10 | 8 |

Image tools

| API | RPM | Concurrent Requests |
| --- | --- | --- |
| Upscale | 10 | 8 |
| Remove Object | 10 | 8 |