- Model
nvidia/nemotron-3-ultra-550b-a55b- Provider
vercel-ai-gateway- API
anthropic-messages- Base URL
https://ai-gateway.vercel.sh- Input
- text
- Reasoning
- Yes
- Context window
- 262,144
- Max tokens
- 65,000
- Cost / million input
- $0.5
- Cost / million output
- $2.5
- Cost / million cache read
- $0.15
- Cost / million cache write
- $0
Model config JSON
{
"providers": {
"vercel-ai-gateway": {
"apiKey": "YOUR_API_KEY",
"models": [
{
"id": "nvidia/nemotron-3-ultra-550b-a55b",
"name": "Nemotron 3 Ultra",
"reasoning": true,
"input": [
"text"
],
"contextWindow": 262144,
"maxTokens": 65000,
"cost": {
"input": 0.5,
"output": 2.5,
"cacheRead": 0.15,
"cacheWrite": 0
}
}
],
"api": "anthropic-messages",
"baseUrl": "https://ai-gateway.vercel.sh"
}
}
}