NVIDIA: Nemotron 3 Super
Nemotron 3 Super is a text-in, text-out model from NVIDIA with a 1-million-token context window, tool calling, and a reasoning mode. The listing marks structured output as supported, though that support is not independently confirmed, and available documentation advertises no completion-token ceiling. The model is not free, and open-weight availability is unlisted. At $0.09 per million input tokens and $0.45 per million output tokens, it is priced at the low end for reasoning-capable models, which may appeal to cost-conscious teams running long-context or agentic workloads. The main caveat is that no independent benchmark coverage exists, so buyers have no third-party reference point for quality. Teams that can run their own evals and tolerate some uncertainty may find the price worth exploring; anyone who needs verified performance data before committing should wait for external scores.
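To make the pricing concrete, the sketch below estimates the cost of a single request from the two per-million-token rates quoted above. The function name `estimate_cost_usd` and the example token counts are illustrative assumptions, not part of any NVIDIA or provider API, and real bills may differ if a provider counts reasoning tokens or applies other adjustments.

```python
# Rates quoted in the listing, in US dollars per one million tokens.
INPUT_USD_PER_MTOK = 0.09
OUTPUT_USD_PER_MTOK = 0.45


def estimate_cost_usd(input_tokens: int, output_tokens: int) -> float:
    """Estimate the cost of one request in US dollars.

    Hypothetical helper: it applies the listed rates linearly and ignores
    any provider-specific billing rules.
    """
    if input_tokens < 0 or output_tokens < 0:
        raise ValueError("token counts must be non-negative")
    total = (input_tokens * INPUT_USD_PER_MTOK
             + output_tokens * OUTPUT_USD_PER_MTOK)
    return total / 1_000_000


if __name__ == "__main__":
    # Assumed workload: a prompt that fills the full 1M-token context
    # and produces a 10,000-token answer.
    cost = estimate_cost_usd(1_000_000, 10_000)
    print(f"Estimated cost: ${cost:.4f}")
```

Under these assumptions a full-context prompt costs about nine cents in input tokens, so output length and request volume are likely to dominate the bill for agentic workloads that loop many times.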
- Model ID: nvidia/nemotron-3-super-120b-a12b
- Vendor: nvidia
- Tokenizer: Other
- Input Modalities: text
- Output Modalities: text
- Max Output: default
- Tool Calling: ✓ supported
- Structured Output: ✓ supported
- Reasoning Mode: ✓ supported
- Vision: text only
- Audio: no
- Moderated: no