Llama 3.3 Nemotron Super 49B v1

Models

Model details

Model
nvidia/llama-3.3-nemotron-super-49b-v1
Provider
nvidia
API
openai-completions
Base URL
https://integrate.api.nvidia.com/v1
Input
text
Reasoning
Yes
Context window
131,072
Max tokens
131,072
Cost / million input
$0
Cost / million output
$0
Cost / million cache read
$0
Cost / million cache write
$0
Model config JSON
{
  "providers": {
    "nvidia": {
      "apiKey": "YOUR_API_KEY",
      "models": [
        {
          "id": "nvidia/llama-3.3-nemotron-super-49b-v1",
          "name": "Llama 3.3 Nemotron Super 49B v1",
          "reasoning": true,
          "input": [
            "text"
          ],
          "contextWindow": 131072,
          "maxTokens": 131072,
          "cost": {
            "input": 0,
            "output": 0,
            "cacheRead": 0,
            "cacheWrite": 0
          },
          "compat": {
            "supportsStore": false,
            "supportsDeveloperRole": false,
            "supportsReasoningEffort": false,
            "maxTokensField": "max_tokens",
            "supportsStrictMode": false,
            "supportsLongCacheRetention": false
          }
        }
      ],
      "api": "openai-completions",
      "baseUrl": "https://integrate.api.nvidia.com/v1"
    }
  }
}