Using the API
Rate limits
Most APIs in Monica API Platform have rate limits, and this document lists the default rate limits for all endpoints.
If these limits are too low for your needs, please contact us to request a higher limit.
Chat
Model | RPM | TPM |
---|---|---|
gpt-4o | 100 | 100,000 |
gpt-4o-mini | 500 | 2,000,000 |
o1-preview | 10 | 30,000 |
o1-mini | 50 | 300,000 |
Claude 3.5 Sonnet | 100 | 100,000 |
Claude 3.5 Haiku | 100 | 100,000 |
Claude 3 Opus | 100 | 100,000 |
Claude 3 Haiku | 500 | 2,000,000 |
Gemini 1.5 Pro 002 | 100 | 100,000 |
Gemini 1.5 Flash 002 | 500 | 2,000,000 |
Grok Beta | 50 | 300,000 |
Llama 3.1 405B Instruct | 50 | 300,000 |
Llama 3.3 70B Instruct | 50 | 300,000 |
DeepSeek V2.5 | 50 | 300,000 |
Qwen 2.5 72B | 50 | 300,000 |
other model | 100 | 100,000 |
Image generations
Model | RPM | Concurrent Requests |
---|---|---|
Stable Diffusion XL 1.0 | 10 | 8 |
Stable Diffusion 3 Large | 10 | 8 |
Stable Diffusion 3.5 Large | 10 | 8 |
Flux Schnell 1.0 | 10 | 8 |
Flux Dev 1.0 | 10 | 8 |
Flux Pro 1.0 | 10 | 8 |
DALL·E 3 | 10 | 8 |
Playground V2.5 | 10 | 8 |
Ideogram V2 | 10 | 8 |
Image tools
API | RPM | Concurrent Requests |
---|---|---|
Upscale | 10 | 8 |
Remove Object | 10 | 8 |