> ## Documentation Index
> Fetch the complete documentation index at: https://platform.monica.im/docs/llms.txt
> Use this file to discover all available pages before exploring further.

# Google models

[Gemini](https://ai.google.dev/gemini-api/docs) is Google's most capable AI model family, designed to be multimodal, efficient, and highly capable at a wide variety of tasks.

## Using the API

Use [Chat completion](/en/chat/chat) API to call Gemini models. See the [pricing page](/en/models-and-pricing#google-models) for pricing details.

## Gemini models comparison

| Feature           | Gemini 3 Flash                                                                 | Gemini 2.5 Pro Preview                     | Gemini 2.5 Flash             | Gemini 2.5 Flash-Lite                    | Gemini 2.0 Flash                                                                              | Gemini 1.5 Pro                                      |
| ----------------- | ------------------------------------------------------------------------------ | ------------------------------------------ | ---------------------------- | ---------------------------------------- | --------------------------------------------------------------------------------------------- | --------------------------------------------------- |
| Description       | Next-generation fast-response model for low-latency, high-throughput use cases | Most advanced Gemini model                 | Versatile and fast           | Lightweight and efficient                | Cost-effective for simple tasks                                                               | Previous generation advanced model                  |
| Best for          | High-throughput applications and real-time interactions                        | Complex tasks requiring deep understanding | High-throughput applications | Quick responses and efficient processing | Budget-friendly tasks                                                                         | Complex reasoning and analysis                      |
| Vision capability | Yes                                                                            | Yes                                        | No                           | No                                       | No                                                                                            | Yes                                                 |
| Context window    | 1.05M tokens                                                                   | 1M tokens                                  | 1M tokens                    | 1M tokens                                | 1M tokens                                                                                     | 1M tokens                                           |
| API model key     | `gemini-3-flash-preview`                                                       | `gemini-2.5-pro`                           | `gemini-2.5-flash`           | `gemini-2.5-flash-lite`                  | `gemini-2.0-flash-001`<br />`gemini-2.0-flash-thinking-exp-01-21`<br />`gemini-2.0-flash-exp` | `gemini-1.5-pro-preview-0409`<br />`gemini-1.5-pro` |

## Important notes

### Long context handling

* All models support up to 1M tokens context window
* Different pricing tiers apply based on input length:
  * Standard rate for inputs less than 128k tokens
  * Higher rate for inputs more than 128k tokens
* Please check the [pricing page](/en/models-and-pricing#google-models) for detailed pricing information

### Vision capabilities

* Gemini 2.5 Pro Preview and Gemini 1.5 Pro support multimodal inputs including images
* Can analyze complex visual information and provide detailed descriptions
* Vision support is not available in Flash or Flash-Lite models

### Model strengths

* **Gemini 3 Flash**:
  * Next-generation fast-response model
  * Designed for low-latency, high-throughput use cases
  * Reliable performance for real-time interactions
  * Supports vision input

* **Gemini 2.5 Pro Preview**:
  * Advanced reasoning and analysis
  * Multimodal understanding
  * Complex task handling
  * Code generation and analysis

* **Gemini 2.5 Flash**:
  * Fast response times
  * Cost-effective for simple tasks
  * Efficient token usage
  * Good for high-throughput applications

* **Gemini 2.5 Flash-Lite**:
  * Lightweight and efficient
  * Ideal for quick responses
  * Budget-friendly for simple tasks

* **Gemini 2.0 Flash**:
  * Cost-effective for basic tasks
  * Reliable for lightweight applications
  * Efficient token usage

* **Gemini 1.5 Pro**:
  * Previous generation advanced model
  * Multimodal capabilities
  * Complex reasoning and analysis

## Official documentation

* [Gemini API Overview](https://ai.google.dev/docs/gemini_api_overview)
* [Vision Guide](https://ai.google.dev/docs/multimodal_concepts)
* [Safety & Responsible AI](https://ai.google.dev/docs/safety_guidance)
