NVIDIA has released Nemotron Nano 9B V2, a small 9B reasoning model that scores 43 on the Artificial Analysis Intelligence Index, the highest yet for <10B models Nemotron 9B V2 is the first Nemotron model pre-trained by @NVIDIA. Previous Nemotron models have been developed by post-training on Meta Llama models. Architecture & Training: The model uses a hybrid Mamba-Transformer architecture. NVIDIA pre-trained a 12B parameter base model and applied post-training with a range of techniques including RLHF and GRPO. The final 9B size was pruned from this model and re-trained with the base model as a teacher. Small-model frontier: with only 9B parameters, Nemotron Nano 9B V2 is placed ahead of Llama 4 Maverick on our leaderboard, equal to Solar Pro 2 with reasoning and trails just behind gpt-oss-20B (high). Along with this model, NVIDIA released a 6.6-trillion token subset of their pre-training data for public use on @huggingface Key model details: ➤ 128k token context window ➤ Supports reasoning and non-reasoning modes (with ‘/no_think’ settings in the system prompt) ➤ Released under the NVIDIA Open Model License, and not additionally covered by Meta’s Llama license like prior Nemotron models - this means that there is no limitation on use by large companies or requirement to keep ‘Nemotron’ in the name of derivative models ➤ No serverless inference providers are yet serving the model, but it is available now on Hugging Face for local inference or self-deployment See below for our full analysis and key announcement links from NVIDIA 👇

Aug 27, 2025 · 12:47 AM UTC

21
54
520