NotificationsYou must be signed in to change notification settings
Fork352
Star6.6k

Commit4ceda37

authored

pgml chat opensourceai (#1238)

1 parentf7401b8 commit4ceda37Copy full SHA for 4ceda37

File tree

5 files changed

+838

-451

lines changed

pgml-apps/pgml-chat
- .env.template
- README.md
- pgml_chat
  - main.py
- poetry.lock
- pyproject.toml

5 files changed

+838

-451

lines changed

`‎pgml-apps/pgml-chat/.env.template‎`

Lines changed: 2 additions & 1 deletion

Original file line number	Diff line number	Diff line change
`@@ -3,4 +3,5 @@ DATABASE_URL=<POSTGRES_DATABASE_URL starts with postgres://>`
`3`	`3`
`4`	`4`	`SLACK_BOT_TOKEN=<SLACK_BOT_TOKEN>`
`5`	`5`	`SLACK_APP_TOKEN=<SLACK_APP_TOKEN>`
`6`		`-DISCORD_BOT_TOKEN=<DISCORD_BOT_TOKEN>`
	`6`	`+DISCORD_BOT_TOKEN=<DISCORD_BOT_TOKEN>`
	`7`	`+SYSTEM_PROMPT_TEMPLATE=<SYSTEM PROMPT FOR CHAT COMPLETION MODEL. Check prompts.md file for examples>`

`‎pgml-apps/pgml-chat/README.md‎`

Lines changed: 41 additions & 10 deletions

Original file line number	Diff line number	Diff line change
`@@ -3,7 +3,7 @@ A command line tool to build and deploy a _knowledge based_ chatbot using Po`
`3`	`3`
`4`	`4`	`There are two stages in building a knowledge based chatbot:`
`5`	`5`	`- Build a knowledge base by ingesting documents, chunking documents, generating embeddings and indexing these embeddings for fast query`
`6`		`-- Generate responses to user queries by retrieving relevant documents and generating responses using OpenAI API`
	`6`	`+- Generate responses to user queries by retrieving relevant documents and generating responses using OpenAIand[OpenSourceAIAPI](https://postgresml.org/docs/introduction/apis/client-sdks/opensourceai)`
`7`	`7`
`8`	`8`	`This tool automates the above two stages and provides a command line interface to build and deploy a knowledge based chatbot.`
`9`	`9`
`@@ -12,7 +12,7 @@ Before you begin, make sure you have the following:`
`12`	`12`
`13`	`13`	`- PostgresML Database: Sign up for a free[GPU-powered database](https://postgresml.org/signup)`
`14`	`14`	`- Python version >=3.8`
`15`		`-- OpenAI API key`
	`15`	`+-(Optional)OpenAI API key`
`16`	`16`
`17`	`17`
`18`	`18`	`#Getting started`
`@@ -30,24 +30,24 @@ wget https://raw.githubusercontent.com/postgresml/postgresml/master/pgml-apps/pg`
`30`	`30`	```
`31`	`31`	3. Copy the template file to`.env`
`32`	`32`
`33`		`-4. Update environment variables with yourOpenAI API key andPostgresML database credentials.`
	`33`	`+4. Update environment variables with yourPostgresML database credentials andOpenAI API key (optional).`
`34`	`34`	```bash
`35`		`-OPENAI_API_KEY=<OPENAI_API_KEY>`
`36`	`35`	`DATABASE_URL=<POSTGRES_DATABASE_URL starts with postgres://>`
	`36`	`+OPENAI_API_KEY=<OPENAI_API_KEY># Optional`
`37`	`37`	```
`38`	`38`
`39`	`39`	`#Usage`
`40`	`40`	`You can get help on the command line interface by running:`
`41`	`41`
`42`	`42`	```bash
`43`	`43`	`(pgml-bot-builder-py3.9) pgml-chat % pgml-chat % pgml-chat --help`
`44`		`-usage: pgml-chat [-h] --collection_name COLLECTION_NAME [--root_dir ROOT_DIR] [--stage {ingest,chat}] [--chat_interface {cli,slack,discord}]`
`45`		`- [--chat_history CHAT_HISTORY] [--bot_name BOT_NAME] [--bot_language BOT_LANGUAGE] [--bot_topic BOT_TOPIC]`
`46`		`- [--bot_topic_primary_language BOT_TOPIC_PRIMARY_LANGUAGE] [--bot_persona BOT_PERSONA]`
	`44`	`+usage: pgml-chat [-h] --collection_name COLLECTION_NAME [--root_dir ROOT_DIR] [--stage {ingest,chat}] [--chat_interface {cli,slack,discord}] [--chat_history CHAT_HISTORY] [--bot_name BOT_NAME]`
	`45`	`+ [--bot_language BOT_LANGUAGE] [--bot_topic BOT_TOPIC] [--bot_topic_primary_language BOT_TOPIC_PRIMARY_LANGUAGE] [--bot_persona BOT_PERSONA]`
	`46`	`+ [--chat_completion_model CHAT_COMPLETION_MODEL] [--max_tokens MAX_TOKENS] [--vector_recall_limit VECTOR_RECALL_LIMIT]`
`47`	`47`
`48`	`48`	`PostgresML Chatbot Builder`
`49`	`49`
`50`		`-optional arguments:`
	`50`	`+options:`
`51`	`51`	`-h, --help show thishelp message andexit`
`52`	`52`	`--collection_name COLLECTION_NAME`
`53`	`53`	`Name of the collection (schema) to store the datain PostgresML database (default: None)`
`@@ -57,16 +57,21 @@ optional arguments:`
`57`	`57`	`--chat_interface {cli,slack,discord}`
`58`	`58`	`Chat interface to use (default: cli)`
`59`	`59`	`--chat_history CHAT_HISTORY`
`60`		`- Number of messages fromhistory usedfor generating response (default:1)`
	`60`	`+ Number of messages fromhistory usedfor generating response (default:0)`
`61`	`61`	`--bot_name BOT_NAME Name of the bot (default: PgBot)`
`62`	`62`	`--bot_language BOT_LANGUAGE`
`63`	`63`	`Language of the bot (default: English)`
`64`	`64`	`--bot_topic BOT_TOPIC`
`65`	`65`	`Topic of the bot (default: PostgresML)`
`66`	`66`	`--bot_topic_primary_language BOT_TOPIC_PRIMARY_LANGUAGE`
`67`		`- Primary programming language of the topic (default: )`
	`67`	`+ Primary programming language of the topic (default:SQL)`
`68`	`68`	`--bot_persona BOT_PERSONA`
`69`	`69`	`Persona of the bot (default: Engineer)`
	`70`	`+ --chat_completion_model CHAT_COMPLETION_MODEL`
	`71`	`+ --max_tokens MAX_TOKENS`
	`72`	`+ Maximum number of tokens to generate (default: 256)`
	`73`	`+ --vector_recall_limit VECTOR_RECALL_LIMIT`
	`74`	`+ Maximum number of documents to retrieve from vector recall (default: 1)`
`70`	`75`	```
`71`	`76`	`##Ingest`
`72`	`77`	`In this step, we ingest documents, chunk documents, generate embeddings and index these embeddings for fast query.`
`@@ -146,6 +151,32 @@ Once the discord app is running, you can interact with the chatbot on Discord as`
`146`	`151`
`147`	`152`	`![Discord Chatbot](./images/discord_screenshot.png)`
`148`	`153`
	`154`	`+# Prompt Engineering`
	`155`	`+In addition to relevant context retrieved from vector search, system prompt to generate accurate responses with minimum hallucinations requires prompt engineering.`
	`156`	`+Different chat completion models require different system prompts. Since the prompts including the context are long, they suffer fromlostin the middle problem describedin [this paper](https://arxiv.org/pdf/2307.03172.pdf). Below are some of the prompts that we have usedfor different chat completion models.`
	`157`	`+`
	`158`	`+## Default prompt (GPT-3.5 and open source models)`
	`159`	+```text
	`160`	`+Use the following pieces of context to answer the question at the end.`
	`161`	`+If you don't know the answer, just say that you don't know, don't try to make up an answer.`
	`162`	`+Use three sentences maximum and keep the answer as concise as possible.`
	`163`	`+Always say "thanks for asking!" at the end of the answer.`
	`164`	+```
	`165`	`+`
	`166`	`+## GPT-4 System prompt`
	`167`	+```text
	`168`	`+You are an assistant to answer questions about {topic}.\`
	`169`	`+Your name is {name}. You speak like {persona} in {language}. Use the given list of documents to answer user's question.\`
	`170`	`+Use the conversationhistoryif it is applicable to answer the question.\n Use the following steps:\n \`
	`171`	`+1. Identifyif the user input is really a question.\n \`
	`172`	`+2. If the user input is not related to the {topic}then respond that it is not related to the {topic}.\n \`
	`173`	`+3. If the user input is related to the {topic}then first identify relevant documents from the list of documents.\n \`
	`174`	`+4. If the documents that you found relevant have information to completely and accurately answers the questionthen respond with the answer.\n \`
	`175`	`+5. If the documents that you found relevant have code snippetsthen respond with the code snippets.\n \`
	`176`	`+6. Most importantly, don't make up code snippets that are not present in the documents.\n \`
	`177`	`+7. If the user input is generic like Cool, Thanks, Hello, etc. then respond with a generic answer. \n"`
	`178`	+```
	`179`	`+`
`149`	`180`	`# Developer Guide`
`150`	`181`
`151`	`182`	`1. Clone this repository, start a poetry shell and install dependencies`

0 commit comments

Comments

(0)

Movatterモバイル変換

Navigation Menu

Search code, repositories, users, issues, pull requests...

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Commit4ceda37

File tree

5 files changed

5 files changed

`‎pgml-apps/pgml-chat/.env.template‎`

`‎pgml-apps/pgml-chat/README.md‎`

0 commit comments