- Model
- gemini-2.0-flash-lite
- Provider
- google-vertex
- API
- google-vertex
- Base URL
- https://{location}-aiplatform.googleapis.com
- Input
- text, image
- Reasoning
- Yes
- Context window
- 1,048,576
- Max tokens
- 65,536
- Cost / million input
- $0.075
- Cost / million output
- $0.3
- Cost / million cache read
- $0.01875
- Cost / million cache write
- $0
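The per-million prices above translate into a per-request cost by simple proportion. The following is a minimal sketch, not part of any official SDK: `estimate_cost` and `PRICES_PER_MILLION` are hypothetical names, and the sketch assumes cached tokens are counted separately from regular input tokens.

```python
# Prices in USD per million tokens, copied from the list above.
PRICES_PER_MILLION = {
    "input": 0.075,
    "output": 0.3,
    "cacheRead": 0.01875,
    "cacheWrite": 0.0,
}


def estimate_cost(input_tokens=0, output_tokens=0,
                  cache_read_tokens=0, cache_write_tokens=0):
    """Estimate the USD cost of one request (hypothetical helper).

    Assumption: each token is billed in exactly one category.
    """
    usage = {
        "input": input_tokens,
        "output": output_tokens,
        "cacheRead": cache_read_tokens,
        "cacheWrite": cache_write_tokens,
    }
    # cost = price_per_million * tokens / 1,000,000, summed over categories
    return sum(PRICES_PER_MILLION[kind] * count / 1_000_000
               for kind, count in usage.items())


# Example: 200,000 input tokens and 10,000 output tokens.
cost = estimate_cost(input_tokens=200_000, output_tokens=10_000)
print(f"${cost:.6f}")  # prints $0.018000
```

In this example the input side contributes $0.015 and the output side $0.003, so prompt size dominates the bill for long-context requests.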
Model config JSON
{
  "providers": {
    "google-vertex": {
      "apiKey": "YOUR_API_KEY",
      "models": [
        {
          "id": "gemini-2.0-flash-lite",
          "name": "Gemini 2.0 Flash Lite (Vertex)",
          "reasoning": true,
          "input": [
            "text",
            "image"
          ],
          "contextWindow": 1048576,
          "maxTokens": 65536,
          "cost": {
            "input": 0.075,
            "output": 0.3,
            "cacheRead": 0.01875,
            "cacheWrite": 0
          }
        }
      ],
      "api": "google-vertex",
      "baseUrl": "https://{location}-aiplatform.googleapis.com"
    }
  }
}
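The `baseUrl` in this config contains a `{location}` placeholder that has to be filled in before any request is made. The sketch below shows one way a client might read such a config, look up the model entry, and resolve the URL. The helper names `find_model` and `resolve_base_url` are hypothetical, the embedded config is a trimmed copy of the JSON above, and `us-central1` is only an example region.

```python
import json

# Trimmed copy of the config above; only the fields used here are kept.
CONFIG_JSON = """
{
  "providers": {
    "google-vertex": {
      "models": [
        {"id": "gemini-2.0-flash-lite", "contextWindow": 1048576, "maxTokens": 65536}
      ],
      "api": "google-vertex",
      "baseUrl": "https://{location}-aiplatform.googleapis.com"
    }
  }
}
"""


def find_model(config, provider, model_id):
    """Return the model entry with the given id (hypothetical helper)."""
    for model in config["providers"][provider]["models"]:
        if model["id"] == model_id:
            return model
    raise KeyError(f"model {model_id!r} not found for provider {provider!r}")


def resolve_base_url(config, provider, location):
    """Substitute the {location} placeholder in the provider's baseUrl."""
    template = config["providers"][provider]["baseUrl"]
    return template.replace("{location}", location)


config = json.loads(CONFIG_JSON)
model = find_model(config, "google-vertex", "gemini-2.0-flash-lite")
base_url = resolve_base_url(config, "google-vertex", "us-central1")  # example region

print(model["contextWindow"])  # prints 1048576
print(base_url)                # prints https://us-central1-aiplatform.googleapis.com
```

Using `str.replace` rather than `str.format` avoids errors if the URL template ever contains other braces.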