- Model
stepfun-ai/step-3.5-flash- Provider
nvidia- API
openai-completions- Base URL
https://integrate.api.nvidia.com/v1- Input
- text
- Reasoning
- Yes
- Context window
- 256,000
- Max tokens
- 16,384
- Cost / million input
- $0
- Cost / million output
- $0
- Cost / million cache read
- $0
- Cost / million cache write
- $0
Model config JSON
{
"providers": {
"nvidia": {
"apiKey": "YOUR_API_KEY",
"models": [
{
"id": "stepfun-ai/step-3.5-flash",
"name": "Step 3.5 Flash",
"reasoning": true,
"input": [
"text"
],
"contextWindow": 256000,
"maxTokens": 16384,
"cost": {
"input": 0,
"output": 0,
"cacheRead": 0,
"cacheWrite": 0
},
"compat": {
"supportsStore": false,
"supportsDeveloperRole": false,
"supportsReasoningEffort": false,
"maxTokensField": "max_tokens",
"supportsStrictMode": false,
"supportsLongCacheRetention": false
}
}
],
"api": "openai-completions",
"baseUrl": "https://integrate.api.nvidia.com/v1"
}
}
}