randomchristiancoder/ai-templatePublic template

forked fromJordan-Gilliam/ai-template

NotificationsYou must be signed in to change notification settings
Fork0
Star0

Mercury - Train your own custom GPT. Chat with any file, or website.

License

Unlicense license

0 stars 68 forks Branches Tags Activity

Star

Notifications

You must be signed in to change notification settings

Branches Tags

Folders and files

Name		Name	Last commit message	Last commit date
Latest commit History 79 Commits
.github/ISSUE_TEMPLATE		.github/ISSUE_TEMPLATE
components		components
config		config
hooks		hooks
lib		lib
pages		pages
public		public
styles		styles
types		types
.editorconfig		.editorconfig
.env.example		.env.example
.eslintignore		.eslintignore
.eslintrc.json		.eslintrc.json
.gitignore		.gitignore
.prettierignore		.prettierignore
LICENSE		LICENSE
README.md		README.md
next-env.d.ts		next-env.d.ts
next.config.mjs		next.config.mjs
package-lock.json		package-lock.json
package.json		package.json
postcss.config.js		postcss.config.js
prettier.config.js		prettier.config.js
tailwind.config.js		tailwind.config.js
tsconfig.json		tsconfig.json

Repository files navigation

Mercury

Chat with any Document or Website

Train your own custom GPT

Train on specific websites that you define
Train on documents you upload
Builds on dialog with Chat History
Cites sources
Perplexity style UI

Supported Files

.pdf
.docx
.md
.txt
.png
.jpg
.html
.json

Coming Soon

.csv
.pptx
notion
next 13 app dir
vercel ai sdk

Train

1. Upload:`/api/embed-file`

file is uploaded -> cleaned to plain text, and split into 1000-character documents.
OpenAI's embedding API is used to generate embeddings for each document using the "text-embedding-ada-002" model.
The embeddings are stored in a Pinecone namespace.

2. Scrape:`/api/embed-webpage`

Web pages are scraped usingcheerio, cleaned to plain text, and split into 1000-character documents.
OpenAI's embedding API is used to generate embeddings for each document using the "text-embedding-ada-002" model.
The embeddings are stored in a Pinecone namespace.

Query

Responding to queries:`/api/query`

A single embedding is generated from the user prompt.
The embedding is used to perform a similarity search against the vector database.
The results of the similarity search are used to construct a prompt for GPT-3.
The GTP-3 response is then streamed back to the user.

Getting Started

1. Clone Repo and Install Deps

To create a new project based on this template usingdegit:

npx degit https://github.com/Jordan-Gilliam/ai-template ai-template

cd ai-templatecode.

install dependencies

npm i

2. Set-up Pinecone

Visitpinecone to create a free tier account and from the dashboard.
Create a new Pinecone Index with Dimensions1536eg:

Copy your API key
Record your Environment name ex:us-central1-gcp
Record your index name ex:mercury

3. Set-up OpenAi API

Visitopenai to create and copy your API key

You can find this in the OpenAI web portal underAPI Keys

4. Open the`.env.local` file and configure your environment

cp .env.example .env.local

# OpenAIOPENAI_API_KEY="sk-xxxxxxxxxxxxxxxxxxxxxxxxxxxxxx"# PineconePINECONE_API_KEY="xxxxxxxx-xxxx-xxxx-xxxx-xxxxxxxxxx"PINECONE_ENVIRONMENT="us-central1-gcp"PINECONE_INDEX_NAME="mercury"

5. Start the app

npm run dev

Openhttp://localhost:3000 in your browser to view the app.

Template Features

OpenAI API (for generating embeddings and GPT-3 responses)
Pinecone
Nextjs API Routes (Edge runtime) - streaming
Tailwind CSS
Fonts with@next/font
Icons fromLucide
Dark mode withnext-themes
Radix UI Primitives
Automatic import sorting with@ianvs/prettier-plugin-sort-imports

Inspiration:

🍴 Huge thanks to@gannonh and@mayooear for their fantastic work that helped inspire this template.

How embeddings work:

ChatGPT is a great tool for answering general questions, but it falls short when it comes to answering domain-specific questions as it often makes up answers to fill its knowledge gaps and doesn't cite sources. To solve this issue, this starter app uses embeddings coupled with vector search. This app shows how OpenAI's GPT-3 API can be used to create conversational interfaces for domain-specific knowledge.

Embeddings are vectors of floating-point numbers that represent the "relatedness" of text strings. They are very useful for tasks like ranking search results, clustering, and classification. In text embeddings, a high cosine similarity between two embedding vectors indicates that the corresponding text strings are highly related.

This app uses embeddings to generate a vector representation of a document and then uses vector search to find the most similar documents to the query. The results of the vector search are then used to construct a prompt for GPT-3, which generates a response. The response is then streamed back to the user.

About

Mercury - Train your own custom GPT. Chat with any file, or website.

Releases

No releases published

Packages

No packages published

Languages

TypeScript96.2%
CSS2.3%
JavaScript1.5%

Movatterモバイル変換

Navigation Menu

Search code, repositories, users, issues, pull requests...

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

License

Uh oh!

Folders and files

Latest commit

History

Repository files navigation

Mercury

Chat with any Document or Website

Supported Files

Coming Soon

Train

1. Upload:`/api/embed-file`

2. Scrape:`/api/embed-webpage`

Query

Responding to queries:`/api/query`

Getting Started

1. Clone Repo and Install Deps

2. Set-up Pinecone

3. Set-up OpenAi API

4. Open the`.env.local` file and configure your environment

5. Start the app

Template Features

Inspiration:

How embeddings work:

About

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases

Packages

Languages

Movatterモバイル変換

License

randomchristiancoder/ai-template

Folders and files

Latest commit

History

Repository files navigation

Mercury

Chat with any Document or Website

Supported Files

Coming Soon

Train

1. Upload:/api/embed-file

2. Scrape:/api/embed-webpage

Query

Responding to queries:/api/query

Getting Started

1. Clone Repo and Install Deps

2. Set-up Pinecone

3. Set-up OpenAi API

4. Open the.env.local file and configure your environment

5. Start the app

Template Features

Inspiration:

How embeddings work:

About

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases

Packages0

Languages

1. Upload:`/api/embed-file`

2. Scrape:`/api/embed-webpage`

Responding to queries:`/api/query`

4. Open the`.env.local` file and configure your environment

Packages