Nemotron 3 Ultra

Models

Model details

Model
nvidia/nemotron-3-ultra-550b-a55b
Provider
vercel-ai-gateway
API
anthropic-messages
Base URL
https://ai-gateway.vercel.sh
Input
text
Reasoning
Yes
Context window
1,000,000
Max tokens
65,000
Cost / million input
$0.6
Cost / million output
$2.4
Cost / million cache read
$0.12
Cost / million cache write
$0
Model config JSON
{
  "providers": {
    "vercel-ai-gateway": {
      "apiKey": "YOUR_API_KEY",
      "models": [
        {
          "id": "nvidia/nemotron-3-ultra-550b-a55b",
          "name": "Nemotron 3 Ultra",
          "reasoning": true,
          "input": [
            "text"
          ],
          "contextWindow": 1000000,
          "maxTokens": 65000,
          "cost": {
            "input": 0.6,
            "output": 2.4,
            "cacheRead": 0.12,
            "cacheWrite": 0
          }
        }
      ],
      "api": "anthropic-messages",
      "baseUrl": "https://ai-gateway.vercel.sh"
    }
  }
}