- Model: accounts/fireworks/routers/glm-5p1-fast
- Provider: fireworks
- API: anthropic-messages
- Base URL: https://api.fireworks.ai/inference
- Input: text
- Reasoning: Yes
- Context window: 202,800
- Max tokens: 131,072
- Cost / million input: $2.8
- Cost / million output: $8.8
- Cost / million cache read: $0.52
- Cost / million cache write: $0
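To make the per-million pricing concrete, here is a minimal sketch of a cost estimate for a single request. The function name `estimate_cost` and its parameters are hypothetical. It also assumes that cached tokens are billed at the cache rates rather than the regular input rate, and that `input_tokens` counts only uncached input. Neither assumption is confirmed by this listing.

```python
# Prices in USD per million tokens, copied from the listing above.
PRICE_PER_MILLION = {
    "input": 2.8,
    "output": 8.8,
    "cache_read": 0.52,
    "cache_write": 0.0,
}


def estimate_cost(input_tokens, output_tokens,
                  cache_read_tokens=0, cache_write_tokens=0):
    """Return an estimated USD cost for one request.

    Assumption (not from the listing): input_tokens excludes any tokens
    that are billed at the cache-read or cache-write rate.
    """
    counts = {
        "input": input_tokens,
        "output": output_tokens,
        "cache_read": cache_read_tokens,
        "cache_write": cache_write_tokens,
    }
    total = sum(counts[kind] * PRICE_PER_MILLION[kind] for kind in counts)
    return total / 1_000_000


if __name__ == "__main__":
    # Example: 10,000 uncached input tokens and 1,000 output tokens.
    print(f"${estimate_cost(10_000, 1_000):.4f}")
```

Under these assumptions, a request with 10,000 input tokens and 1,000 output tokens costs a little under four cents.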
Model config JSON
{
  "providers": {
    "fireworks": {
      "apiKey": "YOUR_API_KEY",
      "models": [
        {
          "id": "accounts/fireworks/routers/glm-5p1-fast",
          "name": "GLM 5.1 Fast",
          "reasoning": true,
          "input": [
            "text"
          ],
          "contextWindow": 202800,
          "maxTokens": 131072,
          "cost": {
            "input": 2.8,
            "output": 8.8,
            "cacheRead": 0.52,
            "cacheWrite": 0
          },
          "compat": {
            "sendSessionAffinityHeaders": true,
            "supportsEagerToolInputStreaming": false,
            "supportsCacheControlOnTools": false,
            "supportsLongCacheRetention": false
          }
        }
      ],
      "api": "anthropic-messages",
      "baseUrl": "https://api.fireworks.ai/inference"
    }
  }
}
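Before pasting a config like this into a tool, it can help to confirm that it parses and that the numbers are internally consistent. The following is a rough sketch only: `check_model` and its rules are hypothetical sanity checks, not a schema required by any particular client, and the embedded config is a trimmed copy of the one above.

```python
import json

# Trimmed copy of the config above; replace YOUR_API_KEY before real use.
CONFIG_TEXT = """
{
  "providers": {
    "fireworks": {
      "apiKey": "YOUR_API_KEY",
      "models": [
        {
          "id": "accounts/fireworks/routers/glm-5p1-fast",
          "name": "GLM 5.1 Fast",
          "reasoning": true,
          "input": ["text"],
          "contextWindow": 202800,
          "maxTokens": 131072,
          "cost": {"input": 2.8, "output": 8.8, "cacheRead": 0.52, "cacheWrite": 0}
        }
      ],
      "api": "anthropic-messages",
      "baseUrl": "https://api.fireworks.ai/inference"
    }
  }
}
"""


def check_model(model):
    """Return a list of human-readable problems found in one model entry.

    These rules are illustrative assumptions, not an official schema.
    """
    problems = []
    if model["maxTokens"] > model["contextWindow"]:
        problems.append("maxTokens exceeds contextWindow")
    for key in ("input", "output", "cacheRead", "cacheWrite"):
        price = model["cost"].get(key)
        if price is None or price < 0:
            problems.append(f"cost.{key} is missing or negative")
    return problems


config = json.loads(CONFIG_TEXT)
for provider_name, provider in config["providers"].items():
    for model in provider["models"]:
        issues = check_model(model)
        status = "ok" if not issues else "; ".join(issues)
        print(f"{provider_name}/{model['name']}: {status}")
```

An empty list from `check_model` only means the entry passed these particular checks; it says nothing about whether the provider accepts the values.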