Gemini is Google’s most capable AI model family, designed to be multimodal, efficient, and highly capable at a wide variety of tasks.

Using the API

Use Chat completion API to call Gemini models. See the pricing page for pricing details.

Gemini models comparison

FeatureGemini 1.5 Pro 002Gemini 1.5 Flash 002
DescriptionMost capable Gemini modelFast and efficient model
Best forComplex tasks requiring deep understandingQuick responses and efficient processing
Vision capabilityYesNo
Context window1M tokens1M tokens
API model keygemini-1.5-pro-002gemini-1.5-flash-002

Important notes

Long context handling

  • Both models support up to 1M tokens context window
  • Different pricing tiers apply based on input length:
    • Standard rate for inputs less than 128k tokens
    • Higher rate for inputs more than 128k tokens
  • Please check the pricing page for detailed pricing information

Vision capabilities

  • Gemini 1.5 Pro supports multimodal inputs including images
  • Can analyze complex visual information and provide detailed descriptions
  • Vision support is not available in Flash model

Model strengths

  • Gemini 1.5 Pro 002:

    • Advanced reasoning and analysis
    • Multimodal understanding
    • Complex task handling
    • Code generation and analysis
  • Gemini 1.5 Flash 002:

    • Fast response times
    • Cost-effective for simple tasks
    • Efficient token usage
    • Good for high-throughput applications

Official documentation