Generative AI Examples is a collection of GenAI examples such as ChatQnA, Copilot, which illustrate the pipeline capabilities of the Open Platform for Enterprise AI (OPEA) project.

opea-project/GenAIExamples


Introduction

GenAIExamples is designed to give developers an easy entry into generative AI, featuring microservice-based samples that simplify deploying, testing, and scaling GenAI applications. All examples are fully compatible with both Docker and Kubernetes, and they support a wide range of hardware platforms, including Gaudi, Xeon, AMD EPYC CPUs, and AMD Instinct GPUs, as well as other hardware such as NVIDIA GPUs, ensuring flexibility and efficiency for your GenAI adoption.

Architecture

GenAIComps is a service-based tool that includes microservice components such as llm, embedding, reranking, and so on. Using these components, various examples in GenAIExamples can be constructed, including ChatQnA, DocSum, etc.

GenAIInfra is part of the OPEA containerization and cloud-native suite and enables quick and efficient deployment of GenAIExamples in the cloud.

GenAIEval measures service performance metrics such as throughput, latency, and accuracy for GenAIExamples. This feature helps users compare performance across various hardware configurations easily.

Use Cases

Below are some highlighted GenAI use cases across various application scenarios:

Scenario              | Use Case
Question Answering    | ChatQnA ✨: Chatbot with Retrieval Augmented Generation (RAG).
Question Answering    | VisualQnA ✨: Visual Question-answering.
Image Generation      | Text2Image ✨: Text-to-image generation.
Content Summarization | DocSum: Document Summarization Application.
Code Generation       | CodeGen: Gen-AI Powered Code Generator.
Information Retrieval | DocIndexRetriever: Document Retrieval with Retrieval Augmented Generation (RAG).
Fine-tuning           | InstructionTuning: Application of Instruction Tuning.

For the full list of available use cases and their supported deployment types, please refer here.

Documentation

The GenAIExamples documentation contains a comprehensive guide on all available examples, including architecture, deployment guides, and more. Information on GenAIComps, GenAIInfra, and GenAIEval can also be found there.

Getting Started

GenAIExamples offers flexible deployment options that cater to different user needs, enabling efficient use and deployment in various environments. Three primary methods are presently used to do this: Python startup, Docker Compose, and Kubernetes.

Users can choose the most suitable approach based on ease of setup, scalability needs, and the environment in which they are operating.
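As an illustration of the Docker Compose path, the commands below sketch a ChatQnA deployment on Xeon. The repository path, the set_env.sh script, and the HUGGINGFACEHUB_API_TOKEN variable follow the ChatQnA example at the time of writing, but directory layout and required variables can differ between examples and releases, so treat this as a sketch rather than a verbatim recipe; the example's own README is authoritative.

```shell
# Clone the examples repository and enter the ChatQnA Xeon compose directory
# (path is an assumption based on the current repository layout)
git clone https://github.com/opea-project/GenAIExamples.git
cd GenAIExamples/ChatQnA/docker_compose/intel/cpu/xeon

# Export the settings the services expect; check set_env.sh for the full list
export host_ip=$(hostname -I | awk '{print $1}')
export HUGGINGFACEHUB_API_TOKEN=<your-hugging-face-token>
source set_env.sh

# Pull the released images and start the pipeline in the background
docker compose up -d
```

Kubernetes users would instead follow the Helm chart or GMC instructions linked below; the overall flow (set environment, deploy, query the service endpoint) is the same.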

Deployment Guide

Deployment is based on released Docker images by default; check the docker image list for detailed information. You can also build your own images by following the instructions.

Prerequisite

  • For Docker Compose-based deployment, you should have Docker Compose installed. Refer to docker compose install for more information.

  • For Kubernetes-based deployment, you can use Helm or GMC-based deployment.

    • You should have a Kubernetes cluster ready for use. If not, you can refer to k8s install to deploy one.
    • (Optional) You should have Helm (version >= 3.15) installed if you want to deploy with Helm charts. Refer to the Helm Installation Guide for more information.
    • (Optional) You should have GMC installed in your Kubernetes cluster if you want to try GMC. Refer to GMC install for more information.
  • Recommended Hardware Reference

    Based on different deployment model sizes and performance requirements, you may choose different hardware platforms or cloud instances. Here are some of the reference platforms:

Use Case     | Deployment model                    | Reference Configuration                                                 | Hardware access/instances
Xeon         | Intel/neural-chat-7b-v3-3           | 64 vCPUs, 365 GB disk, 100 GB RAM, and Ubuntu 24.04                     | Intel Tiber Developer Cloud
Gaudi        | Intel/neural-chat-7b-v3-3           | 1 or 2 Gaudi Cards, 16 vCPUs, 365 GB disk, 100 GB RAM, and Ubuntu 24.04 | Intel Tiber Developer Cloud
Xeon (AWS)   | Intel/neural-chat-7b-v3-3           | 64 vCPUs, 100 GB disk, 64 GB RAM, and Ubuntu 24.04                      | AWS Cloud (e.g., c7i.16xlarge)
AMD EPYC     | meta-llama/Meta-Llama-3-8B-Instruct | 64 vCPUs, 100 GB disk, 256 GB RAM, and Ubuntu 24.04                     | Google Cloud Platform, Microsoft Azure, AWS
AMD Instinct | meta-llama/Llama-3.1-405B           | GPU: 8× MI300X, 1536 GB vRAM, and Ubuntu 24.04                          | AMD Developer Cloud, Oracle Cloud Infrastructure, Microsoft Azure
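A quick way to confirm the prerequisites above are in place is to check the tool versions locally. A minimal sketch, assuming the standard docker, kubectl, and helm CLIs are on your PATH:

```shell
# Docker Compose v2 is invoked as a docker subcommand
docker compose version

# For Kubernetes-based deployment: check the client and cluster reachability
kubectl version
kubectl get nodes

# Helm chart deployment requires Helm >= 3.15
helm version --short
```

If any command is missing or the version is too old, follow the corresponding install link above before proceeding.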

Deploy Examples

Note: Check the sample guides first for your use case. If one is not available, refer to the table below:

Use Case          | Docker Compose Deployment on Xeon | Docker Compose Deployment on Gaudi | Docker Compose Deployment on AMD EPYC | Docker Compose Deployment on ROCm | Kubernetes with Helm Charts | Kubernetes with GMC
ChatQnA           | Xeon Instructions                 | Gaudi Instructions                 | EPYC Instructions                     | ROCm Instructions                 | ChatQnA with Helm Charts    | ChatQnA with GMC
CodeGen           | Xeon Instructions                 | Gaudi Instructions                 | EPYC Instructions                     | ROCm Instructions                 | CodeGen with Helm Charts    | CodeGen with GMC
CodeTrans         | Xeon Instructions                 | Gaudi Instructions                 | EPYC Instructions                     | ROCm Instructions                 | CodeTrans with Helm Charts  | CodeTrans with GMC
DocSum            | Xeon Instructions                 | Gaudi Instructions                 | EPYC Instructions                     | ROCm Instructions                 | DocSum with Helm Charts     | DocSum with GMC
SearchQnA         | Xeon Instructions                 | Gaudi Instructions                 | EPYC Instructions                     | Not Supported                     | SearchQnA with Helm Charts  | SearchQnA with GMC
Translation       | Xeon Instructions                 | Gaudi Instructions                 | EPYC Instructions                     | ROCm Instructions                 | Not Supported               | Translation with GMC
AudioQnA          | Xeon Instructions                 | Gaudi Instructions                 | EPYC Instructions                     | ROCm Instructions                 | AudioQnA with Helm Charts   | AudioQnA with GMC
VisualQnA         | Xeon Instructions                 | Gaudi Instructions                 | EPYC Instructions                     | ROCm Instructions                 | VisualQnA with Helm Charts  | VisualQnA with GMC
MultimodalQnA     | Xeon Instructions                 | Gaudi Instructions                 | EPYC Instructions                     | ROCm Instructions                 | Not Supported               | Not Supported
ProductivitySuite | Xeon Instructions                 | Not Supported                      | EPYC Instructions                     | Not Supported                     | Not Supported               | Not Supported
Text2Image        | Xeon Instructions                 | Gaudi Instructions                 | EPYC Instructions                     | Not Supported                     | Text2Image with Helm Charts | Not Supported
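Once an example is up, you can smoke-test it over its REST endpoint. The snippet below assumes a ChatQnA megaservice listening on port 8888 with a /v1/chatqna route, which matches the ChatQnA documentation at the time of writing; the host, port, and payload shape vary by example and release, so check the example's deployment guide for the authoritative endpoint.

```shell
# Ask the deployed ChatQnA pipeline a question
# (host_ip, port, and payload are assumptions; see the example's README)
curl http://${host_ip}:8888/v1/chatqna \
    -H "Content-Type: application/json" \
    -d '{"messages": "What is OPEA?"}'
```

A streamed or JSON answer from the pipeline indicates the microservices are wired together correctly.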

Supported Examples

Check here for detailed information on supported examples, models, hardware, and more.

Validated Configurations

Check here for the validated configurations of GenAIExamples, including hardware and software versions that have been tested for each release.

Contributing to OPEA

Welcome to the OPEA open-source community! We are thrilled to have you here and excited about the potential contributions you can bring to the OPEA platform. Whether you are fixing bugs, adding new GenAI components, improving documentation, or sharing your unique use cases, your contributions are invaluable.

Together, we can make OPEA the go-to platform for enterprise AI solutions. Let's work together to push the boundaries of what's possible and create a future where AI is accessible, efficient, and impactful for everyone.

Please check the Contributing guidelines for a detailed guide on how to contribute a GenAI component and all the ways you can contribute!

Thank you for being a part of this journey. We can't wait to see what we can achieve together!
