nvidia

NVIDIA: Nemotron 3 Super

Nemotron 3 Super is a text-in, text-out model from NVIDIA with a 1-million-token context window, tool use, and reasoning support. Structured output support is unconfirmed, and the model does not advertise a completion-token ceiling in available documentation. It is not free, and open-weight availability is currently unlisted. At $0.09 per million input tokens and $0.45 per million output tokens, pricing sits at the lower end of reasoning-capable models, which may attract cost-conscious teams running long-context or agentic workloads. The practical caveat is significant: there is no independent benchmark coverage to validate actual performance, so buyers have no third-party reference point for quality. Teams that can run their own evals and tolerate some uncertainty may find the price worth exploring, but anyone who needs verified performance data before committing should wait until external scores are available.

Quality Score
99/100
price + capability + benchmarks
Input Price
$0.09
per 1M tokens
Output Price
$0.45
per 1M tokens
Context Window
1,000,000
tokens
Model ID
nvidia/nemotron-3-super-120b-a12b
Vendor
nvidia
Tokenizer
Other
Input Modalities
text
Output Modalities
text
Max Output
default
Tool Calling
✓ supported
Structured Output
✓ supported
Reasoning Mode
✓ supported
Vision
text only
Audio
no
Moderated
no

Similar models