Commit 3049db3

move guides under pgml

1 parent: ba4b3a7

File tree

23 files changed: +44 −44 lines

pgml-cms/blog/generating-llm-embeddings-with-open-source-models-in-postgresml.md (1 addition, 1 deletion)

```diff
@@ -120,7 +120,7 @@ LIMIT 5;
 
 ## Generating embeddings from natural language text
 
-PostgresML provides a simple interface to generate embeddings from text in your database. You can use the [`pgml.embed`](https://postgresml.org/docs/guides/transformers/embeddings) function to generate embeddings for a column of text. The function takes a transformer name and a text value. The transformer will automatically be downloaded and cached on your connection process for reuse. You can see a list of potential good candidate models to generate embeddings on the [Massive Text Embedding Benchmark leaderboard](https://huggingface.co/spaces/mteb/leaderboard).
+PostgresML provides a simple interface to generate embeddings from text in your database. You can use the [`pgml.embed`](https://postgresml.org/docs/open-source/pgml/guides/transformers/embeddings) function to generate embeddings for a column of text. The function takes a transformer name and a text value. The transformer will automatically be downloaded and cached on your connection process for reuse. You can see a list of potential good candidate models to generate embeddings on the [Massive Text Embedding Benchmark leaderboard](https://huggingface.co/spaces/mteb/leaderboard).
 
 Since our corpus of documents (movie reviews) are all relatively short and similar in style, we don't need a large model. [`Alibaba-NLP/gte-base-en-v1.5`](https://huggingface.co/Alibaba-NLP/gte-base-en-v1.5) will be a good first attempt. The great thing about PostgresML is you can always regenerate your embeddings later to experiment with different embedding models.
```
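The `pgml.embed` call this hunk documents takes a transformer name and a text value. A minimal sketch of how it is invoked, against a database with the pgml extension installed (the `movie_reviews` table and `review_body` column are hypothetical, chosen to match the blog's movie-review corpus):

```sql
-- Generate one embedding with the model named in the diff;
-- the result is a vector of floats.
SELECT pgml.embed('Alibaba-NLP/gte-base-en-v1.5', 'This movie was great!') AS embedding;

-- Embedding a column of text, as the quoted paragraph describes
-- (movie_reviews.review_body is an illustrative table/column name):
SELECT pgml.embed('Alibaba-NLP/gte-base-en-v1.5', review_body) AS embedding
FROM movie_reviews
LIMIT 5;
```

On first use the model is downloaded and cached per connection process, so the first call is slower than subsequent ones.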

pgml-cms/blog/introducing-the-openai-switch-kit-move-from-closed-to-open-source-ai-in-minutes.md (1 addition, 1 deletion)

```diff
@@ -210,7 +210,7 @@ We have truncated the output to two items
 
 !!!
 
-We also have asynchronous versions of the create and `create_stream` functions, respectively named `create_async` and `create_stream_async`. Check out [our documentation](https://postgresml.org/docs/guides/opensourceai) for a complete guide to the open-source AI SDK, including guides on how to specify custom models.
+We also have asynchronous versions of the create and `create_stream` functions, respectively named `create_async` and `create_stream_async`. Check out [our documentation](https://postgresml.org/docs/open-source/pgml/guides/opensourceai) for a complete guide to the open-source AI SDK, including guides on how to specify custom models.
 
 PostgresML is free and open source. To run the above examples yourself, [create an account](https://postgresml.org/signup), install korvus, and get running!
```

pgml-cms/blog/semantic-search-in-postgres-in-15-minutes.md (1 addition, 1 deletion)

```diff
@@ -152,7 +152,7 @@ SELECT '[1,2,3]'::vector <=> '[2,3,4]'::vector;
 
 !!!
 
-Other distance functions have similar formulas and provide convenient operators to use as well. It may be worth testing other operators to see which performs better for your use case. For more information on the other distance functions, take a look at our [Embeddings guide](https://postgresml.org/docs/guides/embeddings/vector-similarity).
+Other distance functions have similar formulas and provide convenient operators to use as well. It may be worth testing other operators to see which performs better for your use case. For more information on the other distance functions, take a look at our [Embeddings guide](https://postgresml.org/docs/open-source/pgml/guides/embeddings/vector-similarity).
 
 Going back to our search example, we can compute the cosine distance between our query embedding and our documents:
```
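The hunk header quotes pgvector's cosine-distance operator (`<=>`). As a sanity check, the same computation can be done outside the database; a minimal sketch in plain Python, using the vectors `[1,2,3]` and `[2,3,4]` from the quoted SQL:

```python
import math

def cosine_distance(a, b):
    """Cosine distance as computed by pgvector's <=> operator: 1 - cosine similarity."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(x * x for x in b))
    return 1.0 - dot / (norm_a * norm_b)

# Same vectors as the SQL example: SELECT '[1,2,3]'::vector <=> '[2,3,4]'::vector;
print(cosine_distance([1, 2, 3], [2, 3, 4]))  # small positive number, ~0.007
```

A distance near 0 means the vectors point in nearly the same direction; identical directions give exactly 0.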

pgml-cms/docs/SUMMARY.md (13 additions, 13 deletions)

```diff
@@ -72,20 +72,20 @@
 
 ## Guides
 
-* [Embeddings](guides/embeddings/README.md)
-  * [In-database Generation](guides/embeddings/in-database-generation.md)
-  * [Dimensionality Reduction](guides/embeddings/dimensionality-reduction.md)
-  * [Aggregation](guides/embeddings/vector-aggregation.md)
-  * [Similarity](guides/embeddings/vector-similarity.md)
-  * [Normalization](guides/embeddings/vector-normalization.md)
-* [Search](guides/improve-search-results-with-machine-learning.md)
-* [Chatbots](guides/chatbots/README.md)
+* [Embeddings](open-source/pgml/guides/embeddings/README.md)
+  * [In-database Generation](open-source/pgml/guides/embeddings/in-database-generation.md)
+  * [Dimensionality Reduction](open-source/pgml/guides/embeddings/dimensionality-reduction.md)
+  * [Aggregation](open-source/pgml/guides/embeddings/vector-aggregation.md)
+  * [Similarity](open-source/pgml/guides/embeddings/vector-similarity.md)
+  * [Normalization](open-source/pgml/guides/embeddings/vector-normalization.md)
+* [Search](open-source/pgml/guides/improve-search-results-with-machine-learning.md)
+* [Chatbots](open-source/pgml/guides/chatbots/README.md)
   * [Example Application](use-cases/chatbots.md)
-* [Supervised Learning](guides/supervised-learning.md)
-* [Unified RAG](guides/unified-rag.md)
-* [OpenSourceAI](guides/opensourceai.md)
-* [Natural Language Processing](guides/natural-language-processing.md)
-* [Vector database](guides/vector-database.md)
+* [Supervised Learning](open-source/pgml/guides/supervised-learning.md)
+* [Unified RAG](open-source/pgml/guides/unified-rag.md)
+* [OpenSourceAI](open-source/pgml/guides/opensourceai.md)
+* [Natural Language Processing](open-source/pgml/guides/natural-language-processing.md)
+* [Vector database](open-source/pgml/guides/vector-database.md)
 
 ## Resources
```

pgml-cms/docs/guides/chatbots/README.md renamed to pgml-cms/docs/open-source/pgml/guides/chatbots/README.md (3 additions, 3 deletions)

````diff
@@ -108,11 +108,11 @@ What does an `embedding` look like? `Embeddings` are just vectors (for our use c
 embedding_1 = embed("King") # embed returns something like [0.11, -0.32, 0.46, ...]
 ```
 
-<figure><img src="../../.gitbook/assets/embedding_king.png" alt=""><figcaption><p>The flow of word -> token -> embedding</p></figcaption></figure>
+<figure><img src="../../../../.gitbook/assets/embedding_king.png" alt=""><figcaption><p>The flow of word -> token -> embedding</p></figcaption></figure>
 
 `Embeddings` aren't limited to words, we have models that can embed entire sentences.
 
-<figure><img src="../../.gitbook/assets/embeddings_tokens.png" alt=""><figcaption><p>The flow of sentence -> tokens -> embedding</p></figcaption></figure>
+<figure><img src="../../../../.gitbook/assets/embeddings_tokens.png" alt=""><figcaption><p>The flow of sentence -> tokens -> embedding</p></figcaption></figure>
 
 Why do we care about `embeddings`? `Embeddings` have a very interesting property. Words and sentences that have close [semantic similarity](https://en.wikipedia.org/wiki/Semantic\_similarity) sit closer to one another in vector space than words and sentences that do not have close semantic similarity.
 
@@ -157,7 +157,7 @@ print(context)
 
 There is a lot going on with this, let's check out this diagram and step through it.
 
-<figure><img src="../../.gitbook/assets/chatbot_flow.png" alt=""><figcaption><p>The flow of taking a document, splitting it into chunks, embedding those chunks, and then retrieving a chunk based off of a user's query</p></figcaption></figure>
+<figure><img src="../../../../.gitbook/assets/chatbot_flow.png" alt=""><figcaption><p>The flow of taking a document, splitting it into chunks, embedding those chunks, and then retrieving a chunk based off of a user's query</p></figcaption></figure>
 
 Step 1: We take the document and split it into chunks. Chunks are typically a paragraph or two in size. There are many ways to split documents into chunks, for more information check out [this guide](https://www.pinecone.io/learn/chunking-strategies/).
````
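The chunk-and-retrieve flow that the figure caption and Step 1 describe can be sketched end to end. This is an illustrative toy, not PostgresML's implementation: `embed` here is a fake deterministic word-count embedding (a real system would use a transformer model), and the chunker is a naive paragraph split:

```python
import hashlib
import math

def embed(text):
    """Toy stand-in for a real embedding model: hash words into an 8-dim vector."""
    vec = [0.0] * 8
    for word in text.lower().split():
        h = int(hashlib.md5(word.encode()).hexdigest(), 16)
        vec[h % 8] += 1.0
    return vec

def cosine_similarity(a, b):
    dot = sum(x * y for x, y in zip(a, b))
    na = math.sqrt(sum(x * x for x in a)) or 1.0
    nb = math.sqrt(sum(x * x for x in b)) or 1.0
    return dot / (na * nb)

# Step 1: split the document into chunks (here: one chunk per paragraph).
document = "Postgres stores data.\n\nKings and queens are royalty."
chunks = [c.strip() for c in document.split("\n\n") if c.strip()]

# Step 2: embed every chunk, keeping the text alongside its vector.
index = [(chunk, embed(chunk)) for chunk in chunks]

# Step 3: embed the user's query and retrieve the closest chunk.
query = "queens and kings are royalty."
best = max(index, key=lambda pair: cosine_similarity(embed(query), pair[1]))
print(best[0])  # prints the chunk whose embedding best matches the query
```

Because this toy embedding ignores word order, the reordered query still matches the royalty chunk exactly; real sentence embeddings capture far more than word overlap, but the retrieval loop is the same shape.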

pgml-cms/docs/guides/embeddings/README.md renamed to pgml-cms/docs/open-source/pgml/guides/embeddings/README.md (1 addition, 1 deletion)

```diff
@@ -39,7 +39,7 @@ Vectors can be stored in the native Postgres [`ARRAY[]`](https://www.postgresql.
 
 !!! warning
 
-Other cloud providers claim to offer embeddings "inside the database", but [benchmarks](../../resources/benchmarks/mindsdb-vs-postgresml.md) show that they are orders of magnitude slower than PostgresML. The reason is they don't actually run inside the database with hardware acceleration. They are thin wrapper functions that make network calls to remote service providers. PostgresML is the only cloud that puts GPU hardware in the database for full acceleration, and it shows.
+Other cloud providers claim to offer embeddings "inside the database", but [benchmarks](../../../../resources/benchmarks/mindsdb-vs-postgresml.md) show that they are orders of magnitude slower than PostgresML. The reason is they don't actually run inside the database with hardware acceleration. They are thin wrapper functions that make network calls to remote service providers. PostgresML is the only cloud that puts GPU hardware in the database for full acceleration, and it shows.
 
 !!!
```

File renamed without changes.
File renamed without changes.
File renamed without changes.
File renamed without changes.

0 commit comments

