NVIDIA: Llama 3.1 Nemotron 70B Instruct

Model details

NVIDIA: Llama 3.1 Nemotron 70B Instruct

Model: nvidia/llama-3.1-nemotron-70b-instruct
Provider: openrouter
API: openai-completions
Base URL: https://openrouter.ai/api/v1
Input: text
Reasoning: No
Context window: 131,072
Max tokens: 16,384
Cost / million input: $1.2
Cost / million output: $1.2
Cost / million cache read: $0
Cost / million cache write: $0

Model config JSON

{
  "providers": {
    "openrouter": {
      "apiKey": "YOUR_API_KEY",
      "models": [
        {
          "id": "nvidia/llama-3.1-nemotron-70b-instruct",
          "name": "NVIDIA: Llama 3.1 Nemotron 70B Instruct",
          "reasoning": false,
          "input": [
            "text"
          ],
          "contextWindow": 131072,
          "maxTokens": 16384,
          "cost": {
            "input": 1.2,
            "output": 1.2,
            "cacheRead": 0,
            "cacheWrite": 0
          }
        }
      ],
      "api": "openai-completions",
      "baseUrl": "https://openrouter.ai/api/v1"
    }
  }
}

NVIDIA: Llama 3.1 Nemotron 70B Instruct

Also available from other providers