Rate limits
Most APIs on the Monica API Platform are rate limited. This document lists the default limits for all endpoints.
If these limits are too low for your needs, please contact us to request a higher limit.
You can check the limits currently applied to your account at any time on the limits page in the dashboard.
Chat
Model | Requests per minute (RPM) | Tokens per minute (TPM) |
---|---|---|
Steiner-preview | 10 | 150,000 |
gpt-4o | 100 | 100,000 |
gpt-4o-mini | 500 | 2,000,000 |
o1-preview | 10 | 30,000 |
o1-mini | 50 | 300,000 |
Claude 3.5 Sonnet | 100 | 100,000 |
Claude 3.5 Haiku | 100 | 100,000 |
Claude 3 Opus | 100 | 100,000 |
Claude 3 Haiku | 500 | 2,000,000 |
Gemini 1.5 Pro 002 | 100 | 100,000 |
Gemini 1.5 Flash 002 | 500 | 2,000,000 |
Grok Beta | 50 | 300,000 |
Llama 3.1 405B Instruct | 50 | 300,000 |
Llama 3.3 70B Instruct | 50 | 300,000 |
DeepSeek V2.5 | 50 | 300,000 |
Qwen 2.5 72B | 50 | 300,000 |
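To stay under both per-minute limits for a chat model, a client can track its own usage before sending each request. The following is a minimal sketch, not an official client: the `MinuteBudget` class and its method names are hypothetical, and it assumes a sliding 60-second window and that you can estimate the token count of each request. This document does not state how the platform measures its window or counts tokens.

```python
import time
from collections import deque


class MinuteBudget:
    """Client-side sliding-window check for per-minute request and token limits.

    Hypothetical helper: the 60 s sliding window is an assumption, not
    documented platform behavior.
    """

    def __init__(self, rpm, tpm, clock=time.monotonic):
        self.rpm = rpm          # max requests per minute
        self.tpm = tpm          # max tokens per minute
        self.clock = clock      # injectable clock, useful for testing
        self.events = deque()   # (timestamp, tokens) for requests in the window

    def _prune(self, now):
        # Drop requests that are 60 seconds old or older.
        while self.events and now - self.events[0][0] >= 60.0:
            self.events.popleft()

    def try_acquire(self, tokens):
        """Record the request and return True if it fits both budgets.

        Returns False, and records nothing, if either limit would be exceeded.
        """
        now = self.clock()
        self._prune(now)
        used_tokens = sum(t for _, t in self.events)
        if len(self.events) >= self.rpm or used_tokens + tokens > self.tpm:
            return False
        self.events.append((now, tokens))
        return True
```

For example, `MinuteBudget(rpm=10, tpm=30_000)` mirrors the o1-preview defaults in the table; call `try_acquire(estimated_tokens)` before each request and wait or queue the request when it returns `False`.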
Image generations
Model | RPM | Concurrent Requests |
---|---|---|
Stable Diffusion XL 1.0 | 10 | 8 |
Stable Diffusion 3 Large | 10 | 8 |
Stable Diffusion 3.5 Large | 10 | 8 |
Flux Schnell 1.0 | 10 | 8 |
Flux Dev 1.0 | 10 | 8 |
Flux Pro 1.0 | 10 | 8 |
DALL·E 3 | 10 | 8 |
Playground V2.5 | 10 | 8 |
Ideogram V2 | 10 | 8 |
Text tools
API | RPM | Concurrent Requests |
---|---|---|
Humanize | 10 | 8 |
Image tools
API | RPM | Concurrent Requests |
---|---|---|
Upscale | 10 | 8 |
Remove Object | 10 | 8 |
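For the endpoints with a Concurrent Requests limit, one way to respect it on the client side is to cap the number of in-flight calls with a semaphore. The sketch below assumes an asyncio-based client; `run_bounded` is a hypothetical helper name, and each job stands in for whatever coroutine actually calls the API.

```python
import asyncio

MAX_CONCURRENT = 8  # default Concurrent Requests value listed in the tables


async def run_bounded(jobs, limit=MAX_CONCURRENT):
    """Run zero-argument coroutine functions with at most `limit` in flight.

    Results are returned in the same order as `jobs`.
    """
    sem = asyncio.Semaphore(limit)

    async def guarded(job):
        # Wait for a free slot before starting the job.
        async with sem:
            return await job()

    return await asyncio.gather(*(guarded(job) for job in jobs))
```

This only bounds concurrency. It does not enforce the RPM limit, so a batch of fast requests can still exceed 10 requests per minute unless it is also paced.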