nvidia

NVIDIA: Nemotron 3 Super

Nemotron 3 Super is a text-in, text-out model from NVIDIA with a 1-million-token context window, tool use, and reasoning support. Structured output support is unconfirmed, and the model does not advertise a completion-token ceiling in available documentation. It is not free, and open-weight availability is currently unlisted. At $0.09 per million input tokens and $0.45 per million output tokens, pricing sits at the lower end of reasoning-capable models, which may attract cost-conscious teams running long-context or agentic workloads. The practical caveat is significant: there is no independent benchmark coverage to validate actual performance, so buyers have no third-party reference point for quality. Teams that can run their own evals and tolerate some uncertainty may find the price worth exploring, but anyone who needs verified performance data before committing should wait until external scores are available.

Query via API → View on nvidia → Estimate cost

Quality Score

99/100

price + capability + benchmarks

Input Price

$0.09

per 1M tokens

Output Price

$0.45

per 1M tokens

Context Window

1,000,000

tokens

Model ID: nvidia/nemotron-3-super-120b-a12b
Vendor: nvidia
Tokenizer: Other
Input Modalities: text
Output Modalities: text
Max Output: default
Tool Calling: ✓ supported
Structured Output: ✓ supported
Reasoning Mode: ✓ supported
Vision: text only
Audio: no
Moderated: no

Similar models

nvidia

NVIDIA: Nemotron 3 Super

Similar models

NVIDIA: Nemotron 3 Nano 30B A3B

NVIDIA: Nemotron 3 Nano Omni (free)

NVIDIA: Nemotron 3 Ultra

NVIDIA: Nemotron 3 Super (free)

NVIDIA: Llama 3.3 Nemotron Super 49B V1.5

NVIDIA: Nemotron Nano 12B 2 VL (free)