Skip to main content

Documentation Index

Fetch the complete documentation index at: https://platform.monica.im/docs/llms.txt

Use this file to discover all available pages before exploring further.

Gemini is Google’s most capable AI model family, designed to be multimodal, efficient, and highly capable at a wide variety of tasks.

Using the API

Use Chat completion API to call Gemini models. See the pricing page for pricing details.

Gemini models comparison

FeatureGemini 3 FlashGemini 2.5 Pro PreviewGemini 2.5 FlashGemini 2.5 Flash-LiteGemini 2.0 FlashGemini 1.5 Pro
DescriptionNext-generation fast-response model for low-latency, high-throughput use casesMost advanced Gemini modelVersatile and fastLightweight and efficientCost-effective for simple tasksPrevious generation advanced model
Best forHigh-throughput applications and real-time interactionsComplex tasks requiring deep understandingHigh-throughput applicationsQuick responses and efficient processingBudget-friendly tasksComplex reasoning and analysis
Vision capabilityYesYesNoNoNoYes
Context window1.05M tokens1M tokens1M tokens1M tokens1M tokens1M tokens
API model keygemini-3-flash-previewgemini-2.5-progemini-2.5-flashgemini-2.5-flash-litegemini-2.0-flash-001
gemini-2.0-flash-thinking-exp-01-21
gemini-2.0-flash-exp
gemini-1.5-pro-preview-0409
gemini-1.5-pro

Important notes

Long context handling

  • All models support up to 1M tokens context window
  • Different pricing tiers apply based on input length:
    • Standard rate for inputs less than 128k tokens
    • Higher rate for inputs more than 128k tokens
  • Please check the pricing page for detailed pricing information

Vision capabilities

  • Gemini 2.5 Pro Preview and Gemini 1.5 Pro support multimodal inputs including images
  • Can analyze complex visual information and provide detailed descriptions
  • Vision support is not available in Flash or Flash-Lite models

Model strengths

  • Gemini 3 Flash:
    • Next-generation fast-response model
    • Designed for low-latency, high-throughput use cases
    • Reliable performance for real-time interactions
    • Supports vision input
  • Gemini 2.5 Pro Preview:
    • Advanced reasoning and analysis
    • Multimodal understanding
    • Complex task handling
    • Code generation and analysis
  • Gemini 2.5 Flash:
    • Fast response times
    • Cost-effective for simple tasks
    • Efficient token usage
    • Good for high-throughput applications
  • Gemini 2.5 Flash-Lite:
    • Lightweight and efficient
    • Ideal for quick responses
    • Budget-friendly for simple tasks
  • Gemini 2.0 Flash:
    • Cost-effective for basic tasks
    • Reliable for lightweight applications
    • Efficient token usage
  • Gemini 1.5 Pro:
    • Previous generation advanced model
    • Multimodal capabilities
    • Complex reasoning and analysis

Official documentation