Nemotron 3 Super 120B

Models

Model details

Nemotron 3 Super 120B

Model
workers-ai/@cf/nvidia/nemotron-3-120b-a12b
Provider
cloudflare-ai-gateway
API
openai-completions
Base URL
https://gateway.ai.cloudflare.com/v1/{CLOUDFLARE_ACCOUNT_ID}/{CLOUDFLARE_GATEWAY_ID}/compat
Input
text
Reasoning
Yes
Context window
256,000
Max tokens
256,000
Cost / million input
$0.5
Cost / million output
$1.5
Cost / million cache read
$0
Cost / million cache write
$0
Model config JSON
{
  "providers": {
    "cloudflare-ai-gateway": {
      "apiKey": "YOUR_API_KEY",
      "models": [
        {
          "id": "workers-ai/@cf/nvidia/nemotron-3-120b-a12b",
          "name": "Nemotron 3 Super 120B",
          "reasoning": true,
          "input": [
            "text"
          ],
          "contextWindow": 256000,
          "maxTokens": 256000,
          "cost": {
            "input": 0.5,
            "output": 1.5,
            "cacheRead": 0,
            "cacheWrite": 0
          },
          "compat": {
            "sendSessionAffinityHeaders": true
          }
        }
      ],
      "api": "openai-completions",
      "baseUrl": "https://gateway.ai.cloudflare.com/v1/{CLOUDFLARE_ACCOUNT_ID}/{CLOUDFLARE_GATEWAY_ID}/compat"
    }
  }
}