Gemini 1.5 Flash-8B (Vertex)

Model details

Model: gemini-1.5-flash-8b
Provider: google-vertex
API: google-vertex
Base URL: https://{location}-aiplatform.googleapis.com
Input: text, image
Reasoning: No
Context window: 1,000,000 tokens
Max tokens: 8,192
Cost / million input tokens: $0.0375
Cost / million output tokens: $0.15
Cost / million cache read tokens: $0.01
Cost / million cache write tokens: $0
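
As a rough illustration of how the per-million-token rates listed above translate into a per-request cost, the sketch below multiplies token counts by those prices. The helper name `estimate_cost_usd` is hypothetical, and the billing model it assumes (cached input tokens are charged at the cache-read rate instead of the regular input rate) is an assumption for illustration, not something this page states.

```python
# Rates in USD per million tokens, copied from the values listed above.
RATES = {"input": 0.0375, "output": 0.15, "cacheRead": 0.01, "cacheWrite": 0.0}


def estimate_cost_usd(input_tokens, output_tokens, cached_tokens=0):
    """Hypothetical helper: estimate the cost of one request in USD.

    Assumption: cached_tokens is a subset of input_tokens and is billed at
    the cache-read rate rather than the regular input rate.
    """
    if cached_tokens > input_tokens:
        raise ValueError("cached_tokens cannot exceed input_tokens")
    fresh_tokens = input_tokens - cached_tokens
    total = (
        fresh_tokens * RATES["input"]
        + cached_tokens * RATES["cacheRead"]
        + output_tokens * RATES["output"]
    )
    # Rates are quoted per million tokens, so scale back down.
    return total / 1_000_000
```

Under these assumptions, a call such as `estimate_cost_usd(200_000, 2_000)` prices 200,000 uncached input tokens and 2,000 output tokens; actual invoices may differ depending on how the provider meters requests.
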
Model config JSON
{
  "providers": {
    "google-vertex": {
      "apiKey": "YOUR_API_KEY",
      "models": [
        {
          "id": "gemini-1.5-flash-8b",
          "name": "Gemini 1.5 Flash-8B (Vertex)",
          "reasoning": false,
          "input": [
            "text",
            "image"
          ],
          "contextWindow": 1000000,
          "maxTokens": 8192,
          "cost": {
            "input": 0.0375,
            "output": 0.15,
            "cacheRead": 0.01,
            "cacheWrite": 0
          }
        }
      ],
      "api": "google-vertex",
      "baseUrl": "https://{location}-aiplatform.googleapis.com"
    }
  }
}
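
The `baseUrl` in the config above contains a `{location}` placeholder that has to be filled in before the URL can be used. The following is a minimal sketch of reading such a config and resolving that placeholder; the helper names `resolve_base_url` and `find_model` are hypothetical, and `us-central1` is used only as an example region value, not one named on this page.

```python
import json

# Trimmed copy of the config above; only the fields used here are kept.
CONFIG = json.loads("""
{
  "providers": {
    "google-vertex": {
      "models": [
        {"id": "gemini-1.5-flash-8b", "contextWindow": 1000000, "maxTokens": 8192}
      ],
      "api": "google-vertex",
      "baseUrl": "https://{location}-aiplatform.googleapis.com"
    }
  }
}
""")


def resolve_base_url(config, provider, location):
    """Hypothetical helper: substitute a region into the provider's baseUrl."""
    template = config["providers"][provider]["baseUrl"]
    return template.replace("{location}", location)


def find_model(config, provider, model_id):
    """Hypothetical helper: return the model entry with this id, or None."""
    for model in config["providers"][provider]["models"]:
        if model["id"] == model_id:
            return model
    return None


if __name__ == "__main__":
    # "us-central1" is an example region chosen for illustration.
    print(resolve_base_url(CONFIG, "google-vertex", "us-central1"))
    print(find_model(CONFIG, "google-vertex", "gemini-1.5-flash-8b")["maxTokens"])
```

Plain string replacement is used instead of `str.format` so that any other braces that might appear in a URL template are left untouched.
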