- Model
nvidia/llama-3.1-nemotron-70b-instruct- Provider
openrouter- API
openai-completions- Base URL
https://openrouter.ai/api/v1- Input
- text
- Reasoning
- No
- Context window
- 131,072
- Max tokens
- 16,384
- Cost / million input
- $1.2
- Cost / million output
- $1.2
- Cost / million cache read
- $0
- Cost / million cache write
- $0
Model config JSON
{
"providers": {
"openrouter": {
"apiKey": "YOUR_API_KEY",
"models": [
{
"id": "nvidia/llama-3.1-nemotron-70b-instruct",
"name": "NVIDIA: Llama 3.1 Nemotron 70B Instruct",
"reasoning": false,
"input": [
"text"
],
"contextWindow": 131072,
"maxTokens": 16384,
"cost": {
"input": 1.2,
"output": 1.2,
"cacheRead": 0,
"cacheWrite": 0
}
}
],
"api": "openai-completions",
"baseUrl": "https://openrouter.ai/api/v1"
}
}
}