Gemini 2.0 Flash Lite (Vertex)

Model details

Model: gemini-2.0-flash-lite
Provider: google-vertex
API: google-vertex
Base URL: https://{location}-aiplatform.googleapis.com
Input: text, image
Reasoning: Yes
Context window: 1,048,576
Max tokens: 65,536
Cost / million input: $0.075
Cost / million output: $0.3
Cost / million cache read: $0.01875
Cost / million cache write: $0

Model config JSON

{
  "providers": {
    "google-vertex": {
      "apiKey": "YOUR_API_KEY",
      "models": [
        {
          "id": "gemini-2.0-flash-lite",
          "name": "Gemini 2.0 Flash Lite (Vertex)",
          "reasoning": true,
          "input": [
            "text",
            "image"
          ],
          "contextWindow": 1048576,
          "maxTokens": 65536,
          "cost": {
            "input": 0.075,
            "output": 0.3,
            "cacheRead": 0.01875,
            "cacheWrite": 0
          }
        }
      ],
      "api": "google-vertex",
      "baseUrl": "https://{location}-aiplatform.googleapis.com"
    }
  }
}

Also available from other providers