GLM 5.1 Fast

Model details

Model: accounts/fireworks/routers/glm-5p1-fast
Provider: fireworks
API: anthropic-messages
Base URL: https://api.fireworks.ai/inference
Input: text
Reasoning: Yes
Context window: 202,800 tokens
Max tokens: 131,072
Cost per million input tokens: $2.80
Cost per million output tokens: $8.80
Cost per million cache-read tokens: $0.52
Cost per million cache-write tokens: $0
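
As a rough illustration of how the per-million-token prices listed above translate into a per-request cost, here is a minimal Python sketch. The helper name `estimate_cost` and its parameters are hypothetical, and the sketch assumes that `input_tokens` counts only uncached prompt tokens, with cached tokens billed separately at the cache-read rate; actual billing rules may differ.

```python
# Prices in USD per million tokens, copied from the model details above.
PRICE_PER_MILLION = {
    "input": 2.8,
    "output": 8.8,
    "cache_read": 0.52,
    "cache_write": 0.0,
}


def estimate_cost(input_tokens, output_tokens,
                  cache_read_tokens=0, cache_write_tokens=0):
    """Estimate the USD cost of one request (hypothetical helper).

    Assumption: input_tokens excludes cached tokens, which are
    billed at the cache-read rate instead of the input rate.
    """
    usage = {
        "input": input_tokens,
        "output": output_tokens,
        "cache_read": cache_read_tokens,
        "cache_write": cache_write_tokens,
    }
    # Multiply each token count by its price, then scale from
    # "per million tokens" down to "per token".
    total = sum(usage[kind] * PRICE_PER_MILLION[kind] for kind in usage)
    return total / 1_000_000


if __name__ == "__main__":
    # 10,000 uncached prompt tokens and 2,000 generated tokens.
    print(f"${estimate_cost(10_000, 2_000):.4f}")
```

Under these assumptions, a request with 10,000 input tokens and 2,000 output tokens comes to roughly $0.0456.
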
Model config JSON
{
  "providers": {
    "fireworks": {
      "apiKey": "YOUR_API_KEY",
      "models": [
        {
          "id": "accounts/fireworks/routers/glm-5p1-fast",
          "name": "GLM 5.1 Fast",
          "reasoning": true,
          "input": [
            "text"
          ],
          "contextWindow": 202800,
          "maxTokens": 131072,
          "cost": {
            "input": 2.8,
            "output": 8.8,
            "cacheRead": 0.52,
            "cacheWrite": 0
          },
          "compat": {
            "sendSessionAffinityHeaders": true,
            "supportsEagerToolInputStreaming": false,
            "supportsCacheControlOnTools": false,
            "supportsLongCacheRetention": false
          }
        }
      ],
      "api": "anthropic-messages",
      "baseUrl": "https://api.fireworks.ai/inference"
    }
  }
}
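
The config above can be read programmatically. The following Python sketch parses a trimmed copy of it, looks up the model entry by `id`, and clamps a requested output length to the model's limits. The helper names `find_model` and `clamp_output_tokens` are hypothetical, and the sketch assumes that prompt and output tokens share the same `contextWindow` budget, which the source does not state explicitly.

```python
import json

# Trimmed copy of the config shown above (only the fields used here).
CONFIG_JSON = """
{
  "providers": {
    "fireworks": {
      "models": [
        {
          "id": "accounts/fireworks/routers/glm-5p1-fast",
          "name": "GLM 5.1 Fast",
          "contextWindow": 202800,
          "maxTokens": 131072
        }
      ],
      "api": "anthropic-messages",
      "baseUrl": "https://api.fireworks.ai/inference"
    }
  }
}
"""


def find_model(config, provider, model_id):
    """Return the model entry with the given id (hypothetical helper)."""
    for model in config["providers"][provider]["models"]:
        if model["id"] == model_id:
            return model
    raise KeyError(f"model {model_id!r} not found for provider {provider!r}")


def clamp_output_tokens(model, prompt_tokens, requested):
    """Limit a requested output length to what the model can produce.

    Assumption: prompt and output tokens share one context window.
    """
    remaining = model["contextWindow"] - prompt_tokens
    if remaining <= 0:
        raise ValueError("prompt does not fit in the context window")
    return min(requested, model["maxTokens"], remaining)


config = json.loads(CONFIG_JSON)
model = find_model(config, "fireworks", "accounts/fireworks/routers/glm-5p1-fast")

if __name__ == "__main__":
    # A short prompt is limited by maxTokens; a long one by the window.
    print(clamp_output_tokens(model, 1_000, 200_000))
    print(clamp_output_tokens(model, 100_000, 200_000))
```

With a 1,000-token prompt the limit is `maxTokens`; with a 100,000-token prompt the remaining context window becomes the tighter bound.
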