Company

NVIDIA

Nemotron and related inference-oriented open models, often tied closely to deployment runtimes.

Start here

Latest model

Nemotron

Largest Nemotron checkpoint in this batch, intended as a serious reasoning model that still follows a plain dense Qwen2.5-style memory profile.

32.5B dense • 131,072 context • 8 KV heads

Series

Largest Nemotron checkpoint in this batch, intended as a serious reasoning model that still follows a plain dense Qwen2.5-style memory profile.

32.5B dense • 131,072 context • 8 KV heads

Mid-sized dense Nemotron checkpoint for users who want stronger reasoning behavior than 7B without stepping straight into 32B deployment territory.

14.7B dense • 131,072 context • 8 KV heads

Reasoning-tuned dense Nemotron checkpoint that tracks the familiar Qwen2.5 7B memory shape while targeting stronger math and code performance.

7.6B dense • 131,072 context • 4 KV heads

Small dense Nemotron reasoning model built on the Qwen2.5 1.5B geometry, aimed at strong math and code behavior on modest hardware.

1.5B dense • 32,768 context • 2 KV heads