NVIDIA Generative AI Examples

This repository is a starting point for developers looking to integrate with the NVIDIA software ecosystem to speed up their generative AI systems. Whether you are building RAG pipelines, agentic workflows, or fine-tuning models, this repository will help you integrate NVIDIA, seamlessly and natively, with your development stack.

What's New?

Data Flywheel

This tutorial demonstrates an end-to-end Data Flywheel implementation that uses NVIDIA NeMo Microservices. It features a tool-calling workflow with the NVIDIA NeMo Datastore, NeMo Entity Store, NeMo Customizer, NeMo Evaluator, NeMo Guardrails microservices, and NVIDIA NIMs.

Tool Calling Fine-tuning, Inference, and Evaluation with NVIDIA NeMo Microservices and NIMs

Knowledge Graph RAG

This example implements a GPU-accelerated pipeline for creating and querying knowledge graphs using RAG by leveraging NIM microservices and the RAPIDS ecosystem to process large-scale datasets efficiently.

Knowledge Graphs for RAG with NVIDIA AI Foundation Models and Endpoints

Agentic Workflows with Llama 3.1

Build an Agentic RAG Pipeline with Llama 3.1 and NVIDIA NeMo Retriever NIM microservices [Blog, Notebook]
NVIDIA Morpheus, NIM microservices, and RAG pipelines integrated to create LLM-based agent pipelines

RAG with Local NIM Deployment and LangChain

Tips for Building a RAG Pipeline with NVIDIA AI LangChain AI Endpoints by Amit Bleiweiss. [Blog, Notebook]

For more information, refer to the Generative AI Example releases.

Vision NIM Workflows

A collection of Jupyter notebooks, sample code and reference applications built with Vision NIMs.

To pull the vision NIM workflows, clone this repository recursively:

git clone https://github.com/nvidia/GenerativeAIExamples --recurse-submodules

The workflows will then be located at GenerativeAIExamples/vision_workflows

Follow the links below to learn more:

Try it Now!

Experience NVIDIA RAG Pipelines with just a few steps!

Get your NVIDIA API key.
1. Go to the NVIDIA API Catalog.
2. Select any model.
3. Click Get API Key.
4. Run:
```
export NVIDIA_API_KEY=nvapi-...
```

Clone the repository.

git clone https://github.com/nvidia/GenerativeAIExamples.git

Build and run the basic RAG pipeline.

cd GenerativeAIExamples/RAG/examples/basic_rag/langchain/
docker compose up -d --build

Go to https://localhost:8090/ and submit queries to the sample RAG Playground.
Stop containers when done.
```
docker compose down
```

Data Flywheel

A Data Flywheel is a self-reinforcing cycle where user interactions generate data that improves AI models or products, leading to better outcomes that attract more users and further enhance data quality. This feedback loop relies on continuous data processing, model refinement, and guardrails to ensure accuracy and compliance while compounding value over time. Real-world applications range from personalized customer experiences to operational systems like inventory management, where improved predictions drive efficiency and growth.

Tool-Calling Notebooks

Tool calling empowers Large Language Models (LLMs) to integrate with external APIs, execute dynamic workflows, and retrieve real-time data beyond their training scope. The NVIDIA NeMo microservices platform offers a modular infrastructure for deploying AI pipelines that includes fine-tuning, evaluation, inference, and guardrail enforcement—across Kubernetes clusters in cloud or on-premises environments.

This end-to-end tutorial demonstrates how to leverage NeMo Microservices to customize Llama-3.2-1B-Instruct by using the xLAM function-calling dataset, assess its accuracy, and implement safety constraints to govern its behavior.

RAG

RAG Notebooks

NVIDIA has first-class support for popular generative AI developer frameworks like LangChain, LlamaIndex, and Haystack. These end-to-end notebooks show how to integrate NIM microservices using your preferred generative AI development framework.

Use these notebooks to learn about the LangChain and LlamaIndex connectors.

LangChain Notebooks

LlamaIndex Notebooks

Basic RAG with LlamaIndex Integration

RAG Examples

By default, these end-to-end examples use preview NIM endpoints on NVIDIA API Catalog. Alternatively, you can run any of the examples on premises.

Basic RAG Examples

Advanced RAG Examples

RAG Tools

Example tools and tutorials to enhance LLM development and productivity when using NVIDIA RAG pipelines.

RAG Projects

NVIDIA Tokkio LLM-RAG: Use Tokkio to add avatar animation for RAG responses.
Hybrid RAG Project on AI Workbench: Run an NVIDIA AI Workbench example project for RAG.

Documentation

Getting Started

Prerequisites

How To's

Reference

Community

We're posting these examples on GitHub to support the NVIDIA LLM community and facilitate feedback. We invite contributions! Open a GitHub issue or pull request! See contributing Check out the community examples and notebooks.

Name		Name	Last commit message	Last commit date
Latest commit History 147 Commits
RAG		RAG
community		community
docs		docs
finetuning		finetuning
industries/healthcare		industries/healthcare
llama_3.3_nemotron_super_49B		llama_3.3_nemotron_super_49B
nemo		nemo
vision_workflows		vision_workflows
.dockerignore		.dockerignore
.gitattributes		.gitattributes
.gitignore		.gitignore
.gitmodules		.gitmodules
.pre-commit-config.yaml		.pre-commit-config.yaml
CHANGELOG.md		CHANGELOG.md
LICENSE.DATA		LICENSE.DATA
LICENSE.md		LICENSE.md
README.md		README.md
SECURITY.md		SECURITY.md

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Licenses found

Repository files navigation

NVIDIA Generative AI Examples

Table of Contents

What's New?

Data Flywheel

Knowledge Graph RAG

Agentic Workflows with Llama 3.1

RAG with Local NIM Deployment and LangChain

Vision NIM Workflows

Try it Now!

Data Flywheel

Tool-Calling Notebooks

RAG

RAG Notebooks

LangChain Notebooks

LlamaIndex Notebooks

RAG Examples

Basic RAG Examples

Advanced RAG Examples

RAG Tools

RAG Projects

Documentation

Getting Started

How To's

Reference

Community

About

Licenses found

Releases 8

Contributors 57

Languages

License

Licenses found

NVIDIA/GenerativeAIExamples

Folders and files

Latest commit

History

Repository files navigation

NVIDIA Generative AI Examples

Table of Contents

What's New?

Data Flywheel

Knowledge Graph RAG

Agentic Workflows with Llama 3.1

RAG with Local NIM Deployment and LangChain

Vision NIM Workflows

Try it Now!

Data Flywheel

Tool-Calling Notebooks

RAG

RAG Notebooks

LangChain Notebooks

LlamaIndex Notebooks

RAG Examples

Basic RAG Examples

Advanced RAG Examples

RAG Tools

RAG Projects

Documentation

Getting Started

How To's

Reference

Community

About

Topics

Resources

License

Licenses found

Security policy

Stars

Watchers

Forks

Releases 8

Contributors 57

Languages