- Model
workers-ai/@cf/zai-org/glm-4.7-flash- Provider
cloudflare-ai-gateway- API
openai-completions- Base URL
https://gateway.ai.cloudflare.com/v1/{CLOUDFLARE_ACCOUNT_ID}/{CLOUDFLARE_GATEWAY_ID}/compat- Input
- text
- Reasoning
- Yes
- Context window
- 131,072
- Max tokens
- 131,072
- Cost / million input
- $0.06
- Cost / million output
- $0.4
- Cost / million cache read
- $0
- Cost / million cache write
- $0
Model config JSON
{
"providers": {
"cloudflare-ai-gateway": {
"apiKey": "YOUR_API_KEY",
"models": [
{
"id": "workers-ai/@cf/zai-org/glm-4.7-flash",
"name": "GLM-4.7-Flash",
"reasoning": true,
"input": [
"text"
],
"contextWindow": 131072,
"maxTokens": 131072,
"cost": {
"input": 0.06,
"output": 0.4,
"cacheRead": 0,
"cacheWrite": 0
},
"compat": {
"sendSessionAffinityHeaders": true
}
}
],
"api": "openai-completions",
"baseUrl": "https://gateway.ai.cloudflare.com/v1/{CLOUDFLARE_ACCOUNT_ID}/{CLOUDFLARE_GATEWAY_ID}/compat"
}
}
}