NVIDIA-NeMo

NeMo Framework is NVIDIA's GPU-accelerated, end-to-end training framework for large language models (LLMs), multi-modal models, and speech models. It enables seamless scaling of training workloads (both pretraining and post-training) from a single GPU to thousand-node clusters, for both 🤗 Hugging Face/PyTorch and Megatron models. This GitHub organization hosts a suite of libraries and recipe collections to help users train models end to end.

NeMo Framework is also a part of the NVIDIA NeMo software suite for managing the AI agent lifecycle.

Latest 📣 announcements and 🗣️ discussions

🐳 NeMo AutoModel

🔬 NeMo RL

💬 NeMo Speech

More to come; stay tuned!

Getting Started

| | Installation | Checkpoint Conversion HF<>Megatron | LLM example recipes and scripts | VLM example recipes and scripts |
| --- | --- | --- | --- | --- |
| Under 1,000 GPUs | NeMo AutoModel, NeMo RL | No need | Pre-training, SFT, LoRA, DPO, GRPO | SFT, LoRA, GRPO |
| Over 1,000 GPUs | NeMo Megatron-Bridge, NeMo RL | Conversion | Pretrain, SFT, and LoRA; DPO with megatron_cfg; GRPO with megatron_cfg | SFT, LoRA; GRPO with Megatron config |

Repo organization under NeMo Framework

Summary of key functionalities and container strategy of each repo

Visit the individual repos to find out more 🔍, raise 🐛, contribute ✍️ and participate in discussion forums 🗣️!

| Repo | Key Functionality & Documentation Link | Training Loop | Training Backends | Inference Backends | Model Coverage | Container |
| --- | --- | --- | --- | --- | --- | --- |
| NeMo Megatron-Bridge | Pretraining, LoRA, SFT | PyT native loop | Megatron-core | NA | LLM & VLM | NeMo Framework Container |
| NeMo AutoModel | Pretraining, LoRA, SFT | PyT native loop | PyTorch | NA | LLM, VLM, Omni, VFM | NeMo AutoModel Container |
| Previous NeMo (will be repurposed to focus on Speech) | Pretraining, SFT | PyTorch Lightning loop | Megatron-core & PyTorch | RIVA | Speech | NA |
| NeMo RL | SFT, RL | PyT native loop | Megatron-core & PyTorch | vLLM | LLM, VLM | NeMo RL Container |
| NeMo Gym | RL environments; integrates with RL frameworks | NA | NA | NA | NA | NeMo RL Container (WIP) |
| NeMo Aligner (deprecated) | SFT, RL | PyT Lightning loop | Megatron-core | TRTLLM | LLM | NA |
| NeMo Curator | Data curation | NA | NA | NA | Agnostic | NeMo Curator Container |
| NeMo Evaluator | Model evaluation | NA | NA | NA | Agnostic | NeMo Framework Container |
| NeMo Export-Deploy | Export to production | NA | NA | vLLM, TRT, TRTLLM, ONNX | Agnostic | NeMo Framework Container |
| NeMo Run | Experiment launcher | NA | NA | NA | Agnostic | NeMo Framework Container |
| NeMo Guardrails | Guardrail model responses | NA | NA | NA | NA | NA |
| NeMo Skills | Reference pipelines for SDG & Eval | NA | NA | NA | Agnostic | NA |
| NeMo Emerging Optimizers | Collection of optimizers | NA | Agnostic | NA | NA | NA |
| NeMo DFM (WIP) | Diffusion foundation model training | PyT native loop | Megatron-core and PyTorch | PyTorch | VFM, Diffusion | TBD |
| NeMotron | Developer asset hub for Nemotron models | NA | NA | NA | Nemotron models | NA |
| NeMo Data-Designer | Synthetic data generation toolkit | NA | NA | NA | NA | NA |

Table 1. NeMo Framework Repos

Diagram illustrating the repos under NeMo Framework (WIP)


Figure 1. NeMo Framework Repo Overview

Some background motivation and historical context

The NeMo GitHub org and its repo collection were created to address the following problems:

  • Need for composability: the previous NeMo repo is monolithic and encompasses too many things, making it hard for users to find what they need; container size is also an issue. Breaking the monolithic repo into a series of functionally focused repos makes code easier to discover.
  • Need for customizability: the previous NeMo uses PyTorch Lightning as the default trainer loop, which provides some out-of-the-box functionality but makes customization hard. NeMo Megatron-Bridge, NeMo AutoModel, and NeMo RL have adopted PyTorch-native custom loops to improve flexibility and ease of use for developers.
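To make the customizability point concrete, here is a schematic, framework-free sketch of the "custom training loop" pattern: the user owns the forward pass, loss, gradient, and parameter update directly, so any stage can be modified without fighting a trainer abstraction. The toy linear model and hand-derived gradients below are illustrative stand-ins only, not NeMo or PyTorch APIs.

```python
# Minimal custom training loop: fit y = w*x + b by per-sample gradient
# descent on squared error. Every step (forward, gradient, update) is
# explicit user code -- the property the PyTorch-native loops aim for.

def train(data, lr=0.05, epochs=200):
    w, b = 0.0, 0.0                      # model parameters
    for _ in range(epochs):
        for x, y in data:
            pred = w * x + b             # forward pass
            grad = 2.0 * (pred - y)      # d(squared error)/d(pred)
            w -= lr * grad * x           # "backward" + optimizer step,
            b -= lr * grad               # fully under user control
    return w, b

# Fit y = 2x + 1 from a few samples; w and b approach 2 and 1.
samples = [(x, 2.0 * x + 1.0) for x in (0.0, 1.0, 2.0, 3.0)]
w, b = train(samples)
```

In a real PyTorch-native loop the same skeleton holds, with the manual gradient replaced by `loss.backward()` and the update by an optimizer step; the point is that the loop body stays ordinary, editable code.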

License

Apache 2.0 licensed with third-party attributions documented in each repository.
