Open and efficient multimodal models for agentic AI.
NVIDIA Nemotron™ is a family of open models, datasets, and technologies that empower you to build efficient, accurate, and specialized agentic AI systems. Designed for advanced reasoning, coding, visual understanding, agentic tasks, safety, and information retrieval, Nemotron models are openly available and integrated across the AI ecosystem so they can be deployed anywhere—from edge to cloud.
With transparent training data and broad platform support, Nemotron makes it easier to create and deploy trustworthy, high-performance AI agents.
Learn how open-source AI technology like Nemotron provides the transparency and trust businesses need to successfully adopt AI.
NVIDIA Nemotron open models, datasets, and recipes unlock developers to build the most efficient and accurate specialized agentic AI to run anywhere.
Video
Hear from Bryan Catanzaro, VP of applied deep learning research at NVIDIA, as he shares the vision behind Nemotron and why open technologies are essential for building trusted, enterprise-ready AI.
NVIDIA’s open data and optimization techniques ensure powerful, transparent, and adaptable models for developers and enterprises. Models and training data are published openly on Hugging Face.
Through the pruning of larger models, the Nemotron family is optimized for top compute efficiency, using NVIDIA TensorRT™-LLM to deliver higher throughput and on-or-off reasoning capabilities.
Built on popular open reasoning models for their exceptional knowledge, post-trained with high-quality training data, and aligned to reason like humans, Nemotron models achieve the highest accuracy on leading benchmarks.
The Nemotron model family, available as optimized NVIDIA NIM™ microservices, offers peak inference performance and flexible deployment options, ensuring superior security, privacy, and portability.
Nemotron models excel in a range of agentic AI tasks, including reasoning,vision,retrieval-augmented generation (RAG), and safety.Research models are also available for experimentation.
Select from a range of Nemotron reasoning models—Nanoprovides superior accuracy for PC and edge devices,Superoffers the highest accuracy and throughput to run on a single NVIDIA Tensor Core GPU, andUltradelivers the best accuracy for complex systems optimized for multi-GPU data centers.
Nemotron models deliver industry-leading extraction, embedding, and reranking capabilities for building retrieval pipelines that connect your enterprise data to agentic systems to provide accurate, real-time business insights.
NVIDIA Nemotron Safety Guard models provide real-time protection against harmful content, off-topic drift, and jailbreak attempts. They add a multilingual content safety layer, enhancing moderation and ensuring cultural alignment.
Start building and optimizing AI agents with NVIDIA NeMo™ for custom agentic AI, NVIDIA NIM for fast, enterprise-ready deployment, and NVIDIA Blueprints for accelerating development with customizable reference workflows.
Get started with easy-to-use API endpoints for NIM, powered by DGX™ Cloud.
Talk to an NVIDIA AI specialist about moving generative AI pilots to production with the security, API stability, and support that comes with NVIDIA AI Enterprise.
Learn how Nemotron accelerates innovation, empowers developers, and shapes the future of AI.
Learn how access to Nemotron’s model weights, datasets, and training recipes enabled deeper evaluation, what ServiceNow discovered about visual Q&A accuracy, and why openness matters for continuous improvement in multimodal AI.
See how an LLM with AI reasoning capabilities thinks outside the box to come up with a solution to a wedding seating chart while navigating family dynamics and guest preferences.
NVIDIA Nemotron models aren't just open, but truly open source. NVIDIA publishes the training datasets, techniques, and model weights so the open-source community can benefit from our learnings and use these resources to create their own models.
The NVIDIA Open Model License is a permissive license that allows users to use, modify, distribute, and commercially deploy the models and derivatives without crediting NVIDIA, to encourage innovation and further development of generative AI.
Yes, you can download and run NVIDIA Nemotron models fromHugging Face for free in production.
NVIDIA also offers Nemotron models as NVIDIA NIM microservices for secure, scalable deployment, which requires an NVIDIA AI Enterprise license. You can try the Nemotron models and download the NIM microservices frombuild.nvidia.com.
Yes, NVIDIA is committed to publishing more Nemotron models, datasets, and techniques to enable open-source ecosystems.
NVIDIA Nemotron models are built on top of frontier open models, making it possible to build better models faster. Additionally, NVIDIA publishes the model weights, training datasets, and training techniques so the developer community can use these different parts of Nemotron to train their own models.
Yes. NVIDIA built the Llama Nemotron models on top of the Llama model family using NVIDIA’s open datasets and advanced techniques, such as Neural Architecture Search (NAS). The Llama Nemotron models inherit the parent Llama model license.
NVIDIA provides a variety of tools, such as NVIDIA Dynamo, TensorRT-LLM, and NIM, to run Nemotron models at scale in production. You can also use popular open-source libraries, such as SGLang and vLLM.
Use the right tools and technologies to take NVIDIA Nemotron models from development to production.
Talk to an NVIDIA product specialist about moving from pilot to production with the security, API stability, and support that comes withNVIDIA AI Enterprise.
Get the latest agentic AI news, technologies, breakthroughs, and more sent straight to your inbox.