Company
NVIDIA
Nemotron and related inference-oriented open models, often tied closely to deployment runtimes.
Series
OpenReasoning Nemotron
OpenReasoning Nemotron 32B
Largest Nemotron checkpoint in this batch, a reasoning-focused model that keeps the memory profile of a plain dense Qwen2.5-style architecture.
32.5B dense • 131,072 context • 8 KV heads
OpenReasoning Nemotron 14B
Mid-sized dense Nemotron checkpoint for users who want stronger reasoning than the 7B model without taking on the deployment cost of the 32B.
14.7B dense • 131,072 context • 8 KV heads
OpenReasoning Nemotron 7B
Reasoning-tuned dense Nemotron checkpoint that tracks the familiar Qwen2.5 7B memory shape while targeting stronger math and code performance.
7.6B dense • 131,072 context • 4 KV heads
OpenReasoning Nemotron 1.5B
Small dense Nemotron reasoning model built on the Qwen2.5 1.5B geometry, aimed at strong math and code behavior on modest hardware.
1.5B dense • 32,768 context • 2 KV heads
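The context length and KV-head count on each card are the main inputs to a KV-cache size estimate, so a rough calculation may help when comparing the four checkpoints. The sketch below is illustrative only: the layer counts and the head dimension of 128 are not listed on this page and are assumed from the published Qwen2.5 configurations, the 2-byte value size assumes an FP16/BF16 cache, and the function and dictionary names are hypothetical.

```python
# Rough KV-cache estimate for one sequence at full context length.
# ASSUMPTIONS (not stated on this page): layer counts and head_dim follow the
# published Qwen2.5 configs; the cache stores 2-byte (FP16/BF16) values.

HEAD_DIM = 128  # assumed per-head dimension

# KV heads and context come from the cards above; num_layers is assumed.
MODELS = {
    "OpenReasoning Nemotron 32B": dict(num_layers=64, num_kv_heads=8, context_len=131_072),
    "OpenReasoning Nemotron 14B": dict(num_layers=48, num_kv_heads=8, context_len=131_072),
    "OpenReasoning Nemotron 7B": dict(num_layers=28, num_kv_heads=4, context_len=131_072),
    "OpenReasoning Nemotron 1.5B": dict(num_layers=28, num_kv_heads=2, context_len=32_768),
}


def kv_cache_bytes(num_layers, num_kv_heads, head_dim, context_len, bytes_per_value=2):
    """Bytes needed to cache keys and values for one sequence."""
    # The leading factor of 2 covers the separate key and value tensors.
    return 2 * num_layers * num_kv_heads * head_dim * context_len * bytes_per_value


def kv_cache_gib(name):
    """KV-cache size in GiB for a model in MODELS at its full context."""
    cfg = MODELS[name]
    return kv_cache_bytes(head_dim=HEAD_DIM, **cfg) / 2**30


if __name__ == "__main__":
    for name in MODELS:
        print(f"{name}: {kv_cache_gib(name):.3f} GiB per full-context sequence")
```

Under these assumptions the estimate scales linearly with both context length and KV-head count, which is why the 1.5B card, with 2 KV heads and a 32,768-token context, lands far below the larger models. Model weights, activations, and runtime overhead are not included.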