pimlock/GenerativeAIExamplesPublic

forked fromNVIDIA/GenerativeAIExamples

NotificationsYou must be signed in to change notification settings
Fork0
Star0

Generative AI reference workflows optimized for accelerated infrastructure and microservice architecture.

License

Apache-2.0 license

0 stars 921 forks Branches Tags Activity

Star

Notifications

You must be signed in to change notification settings

Branches Tags

Folders and files

Name		Name	Last commit message	Last commit date
Latest commit History 157 Commits
RAG		RAG
community		community
docs		docs
finetuning		finetuning
industries/healthcare		industries/healthcare
llama_3.3_nemotron_super_49B		llama_3.3_nemotron_super_49B
nemo		nemo
nemotron/VLM/llama_3.1_nemotron_nano_VL_8B		nemotron/VLM/llama_3.1_nemotron_nano_VL_8B
vision_workflows		vision_workflows
.dockerignore		.dockerignore
.gitattributes		.gitattributes
.gitignore		.gitignore
.gitmodules		.gitmodules
.pre-commit-config.yaml		.pre-commit-config.yaml
CHANGELOG.md		CHANGELOG.md
LICENSE.DATA		LICENSE.DATA
LICENSE.md		LICENSE.md
README.md		README.md
SECURITY.md		SECURITY.md

Repository files navigation

NVIDIA Generative AI Examples

This repository is a starting point for developers looking to integrate with the NVIDIA software ecosystem to speed up their generative AI systems. Whether you are building RAG pipelines, agentic workflows, or fine-tuning models, this repository will help you integrate NVIDIA, seamlessly and natively, with your development stack.

What's New?

Data Flywheel

This tutorial demonstrates an end-to-end Data Flywheel implementation that uses NVIDIA NeMo Microservices. It features a tool-calling workflow with the NVIDIA NeMo Datastore, NeMo Entity Store, NeMo Customizer, NeMo Evaluator, NeMo Guardrails microservices, and NVIDIA NIMs.

Tool Calling Fine-tuning, Inference, and Evaluation with NVIDIA NeMo Microservices and NIMs

Knowledge Graph RAG

This example implements a GPU-accelerated pipeline for creating and querying knowledge graphs using RAG by leveraging NIM microservices and the RAPIDS ecosystem to process large-scale datasets efficiently.

Knowledge Graphs for RAG with NVIDIA AI Foundation Models and Endpoints

Agentic Workflows with Llama 3.1

Build an Agentic RAG Pipeline with Llama 3.1 and NVIDIA NeMo Retriever NIM microservices [Blog,Notebook]
NVIDIA Morpheus, NIM microservices, and RAG pipelines integrated to create LLM-based agent pipelines

RAG with Local NIM Deployment and LangChain

Tips for Building a RAG Pipeline with NVIDIA AI LangChain AI Endpoints by Amit Bleiweiss. [Blog,Notebook]

For more information, refer to theGenerative AI Example releases.

Vision NIM Workflows

A collection of Jupyter notebooks, sample code and reference applications built with Vision NIMs.

To pull the vision NIM workflows, clone this repository recursively:

git clone https://github.com/nvidia/GenerativeAIExamples --recurse-submodules

The workflows will then be located atGenerativeAIExamples/vision_workflows

Follow the links below to learn more:

Try it Now!

Experience NVIDIA RAG Pipelines with just a few steps!

Get your NVIDIA API key.
1. Go to theNVIDIA API Catalog.
2. Select any model.
3. ClickGet API Key.
4. Run:
```
export NVIDIA_API_KEY=nvapi-...
```

Clone the repository.

git clone https://github.com/nvidia/GenerativeAIExamples.git

Build and run the basic RAG pipeline.

cd GenerativeAIExamples/RAG/examples/basic_rag/langchain/docker compose up -d --build

Go tohttps://localhost:8090/ and submit queries to the sample RAG Playground.
Stop containers when done.
```
docker compose down
```

Data Flywheel

AData Flywheel is a self-reinforcing cycle where user interactions generate data that improves AI models or products, leading to better outcomes that attract more users and further enhance data quality. This feedback loop relies on continuous data processing, model refinement, and guardrails to ensure accuracy and compliance while compounding value over time. Real-world applications range from personalized customer experiences to operational systems like inventory management, where improved predictions drive efficiency and growth.

Tool-Calling Notebooks

Tool calling empowers Large Language Models (LLMs) to integrate with external APIs, execute dynamic workflows, and retrieve real-time data beyond their training scope. The NVIDIA NeMo microservices platform offers a modular infrastructure for deploying AI pipelines that includes fine-tuning, evaluation, inference, and guardrail enforcement—across Kubernetes clusters in cloud or on-premises environments.

This end-to-endtutorial demonstrates how to leverage NeMo Microservices to customizeLlama-3.2-1B-Instruct by using thexLAM function-calling dataset, assess its accuracy, and implement safety constraints to govern its behavior.

RAG

RAG Notebooks

NVIDIA has first-class support for popular generative AI developer frameworks likeLangChain,LlamaIndex, andHaystack. These end-to-end notebooks show how to integrate NIM microservices using your preferred generative AI development framework.

Use thesenotebooks to learn about the LangChain and LlamaIndex connectors.

LangChain Notebooks

LlamaIndex Notebooks

Basic RAG with LlamaIndex Integration

RAG Examples

By default, these end-to-endexamples use preview NIM endpoints onNVIDIA API Catalog. Alternatively, you can run any of the exampleson premises.

Basic RAG Examples

Advanced RAG Examples

RAG Tools

Example tools and tutorials to enhance LLM development and productivity when using NVIDIA RAG pipelines.

RAG Projects

NVIDIA Tokkio LLM-RAG: Use Tokkio to add avatar animation for RAG responses.
Hybrid RAG Project on AI Workbench: Run an NVIDIA AI Workbench example project for RAG.

Documentation

Getting Started

Prerequisites

How To's

Reference

Community

We're posting these examples on GitHub to support the NVIDIA LLM community and facilitate feedback.We invite contributions! Open a GitHub issue or pull request! Seecontributing Check out thecommunity examples and notebooks.

About

Generative AI reference workflows optimized for accelerated infrastructure and microservice architecture.

Releases

No releases published

Packages

No packages published

Languages

Jupyter Notebook74.4%
Python22.9%
JavaScript1.0%
HTML0.4%
Shell0.4%
CSS0.3%
Other0.6%

Movatterモバイル変換

License

pimlock/GenerativeAIExamples

Folders and files

Latest commit

History

Repository files navigation

NVIDIA Generative AI Examples

Table of Contents

What's New?

Data Flywheel

Knowledge Graph RAG

Agentic Workflows with Llama 3.1

RAG with Local NIM Deployment and LangChain

Vision NIM Workflows

Try it Now!

Data Flywheel

Tool-Calling Notebooks

RAG

RAG Notebooks

LangChain Notebooks

LlamaIndex Notebooks

RAG Examples

Basic RAG Examples

Advanced RAG Examples

RAG Tools

RAG Projects

Documentation

Getting Started

How To's

Reference

Community

About

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases

Packages0

Languages

Packages