
Commit cc5d24e

Update docs for serverless v2 (#1485)

1 parent b44f381 · commit cc5d24e

57 files changed: 212 additions, 227 deletions


packages/pgml-rds-proxy/README.md

Lines changed: 1 addition & 1 deletion

@@ -76,7 +76,7 @@ SELECT
 FROM
     dblink(
         'postgresml',
-        'SELECT * FROM pgml.embed(''intfloat/e5-small'', ''embed this text'') AS embedding'
+        'SELECT * FROM pgml.embed(''Alibaba-NLP/gte-base-en-v1.5'', ''embed this text'') AS embedding'
     ) AS t1(embedding real[386]);
 ```

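For readers following along, here is a minimal Python sketch of issuing the updated query through the proxy. It assumes psycopg2, an illustrative DSN, and the `postgresml` dblink connection configured earlier in the README:

```python
import psycopg2

# Call pgml.embed on a remote PostgresML database through dblink, as in the
# README above. The DSN is illustrative; 'postgresml' is the dblink connection
# name configured earlier in the README.
conn = psycopg2.connect("postgres://user:pass@my-rds-host:5432/postgres")
with conn.cursor() as cur:
    cur.execute(
        """
        SELECT *
        FROM dblink(
            'postgresml',
            'SELECT * FROM pgml.embed(''Alibaba-NLP/gte-base-en-v1.5'', ''embed this text'') AS embedding'
        ) AS t1(embedding real[386]);
        """
    )
    print(cur.fetchone()[0][:5])  # first few values of the embedding array
```
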
pgml-apps/pgml-chat/pgml_chat/main.py

Lines changed: 3 additions & 4 deletions

@@ -123,7 +123,7 @@ def handler(signum, frame):
         "--chat_completion_model",
         dest="chat_completion_model",
         type=str,
-        default="HuggingFaceH4/zephyr-7b-beta",
+        default="meta-llama/Meta-Llama-3-8B-Instruct",
     )

     parser.add_argument(

@@ -195,9 +195,8 @@ def handler(signum, frame):
     )

     splitter = Splitter(splitter_name, splitter_params)
-    model_name = "hkunlp/instructor-xl"
-    model_embedding_instruction = "Represent the %s document for retrieval: " % (bot_topic)
-    model_params = {"instruction": model_embedding_instruction}
+    model_name = "Alibaba-NLP/gte-base-en-v1.5"
+    model_params = {}

     model = Model(model_name, "pgml", model_params)
     pipeline = Pipeline(args.collection_name + "_pipeline", model, splitter)

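In context, the new defaults assemble like this; a minimal sketch assuming the pre-1.0 `pgml` Python SDK that pgml-chat uses, with an illustrative collection name and splitter settings:

```python
from pgml import Model, Pipeline, Splitter

# Splitter settings are illustrative; pgml-chat reads its own from arguments.
splitter = Splitter("recursive_character", {"chunk_size": 1500, "chunk_overlap": 40})

# The gte model takes no instruction parameter, so model_params is now empty.
model = Model("Alibaba-NLP/gte-base-en-v1.5", "pgml", {})

# "my_collection" is illustrative; pgml-chat derives this from --collection_name.
pipeline = Pipeline("my_collection_pipeline", model, splitter)
```
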
pgml-cms/blog/generating-llm-embeddings-with-open-source-models-in-postgresml.md

Lines changed: 13 additions & 7 deletions

@@ -122,14 +122,14 @@ LIMIT 5;

 PostgresML provides a simple interface to generate embeddings from text in your database. You can use the [`pgml.embed`](https://postgresml.org/docs/guides/transformers/embeddings) function to generate embeddings for a column of text. The function takes a transformer name and a text value. The transformer will automatically be downloaded and cached on your connection process for reuse. You can see a list of good candidate models to generate embeddings on the [Massive Text Embedding Benchmark leaderboard](https://huggingface.co/spaces/mteb/leaderboard).

-Since our corpus of documents (movie reviews) is all relatively short and similar in style, we don't need a large model. [`intfloat/e5-small`](https://huggingface.co/intfloat/e5-small) will be a good first attempt. The great thing about PostgresML is you can always regenerate your embeddings later to experiment with different embedding models.
+Since our corpus of documents (movie reviews) is all relatively short and similar in style, we don't need a large model. [`Alibaba-NLP/gte-base-en-v1.5`](https://huggingface.co/Alibaba-NLP/gte-base-en-v1.5) will be a good first attempt. The great thing about PostgresML is you can always regenerate your embeddings later to experiment with different embedding models.

-It takes a couple of minutes to download and cache the `intfloat/e5-small` model to generate the first embedding. After that, it's pretty fast.
+It takes a couple of minutes to download and cache the `Alibaba-NLP/gte-base-en-v1.5` model to generate the first embedding. After that, it's pretty fast.

 Note how we prefix the text we want to embed with either `passage:` or `query:`; the e5 model requires us to prefix our data with `passage:` if we're generating embeddings for our corpus and `query:` if we want to find semantically similar content.

 ```postgresql
-SELECT pgml.embed('intfloat/e5-small', 'passage: hi mom');
+SELECT pgml.embed('Alibaba-NLP/gte-base-en-v1.5', 'passage: hi mom');
 ```

 This is a pretty powerful function, because we can pass any arbitrary text to any open source model, and it will generate an embedding for us. We can benchmark how long it takes to generate an embedding for a single review, using client-side timings in Postgres:

@@ -147,7 +147,7 @@ Aside from using this function with strings passed from a client, we can use it
 ```postgresql
 SELECT
     review_body,
-    pgml.embed('intfloat/e5-small', 'passage: ' || review_body)
+    pgml.embed('Alibaba-NLP/gte-base-en-v1.5', 'passage: ' || review_body)
 FROM pgml.amazon_us_reviews
 LIMIT 1;
 ```

@@ -171,7 +171,7 @@ Time to generate an embedding increases with the length of the input text, and v
 ```postgresql
 SELECT
     review_body,
-    pgml.embed('intfloat/e5-small', 'passage: ' || review_body) AS embedding
+    pgml.embed('Alibaba-NLP/gte-base-en-v1.5', 'passage: ' || review_body) AS embedding
 FROM pgml.amazon_us_reviews
 LIMIT 1000;
 ```

@@ -190,7 +190,7 @@ We can also do a quick sanity check to make sure we're really getting value out
 SELECT
     review_body,
     pgml.embed(
-        'intfloat/e5-small',
+        'Alibaba-NLP/gte-base-en-v1.5',
         'passage: ' || review_body,
         '{"device": "cpu"}'
     ) AS embedding

@@ -224,6 +224,12 @@ You can also find embedding models that outperform OpenAI's `text-embedding-ada-

 The current leading model is `hkunlp/instructor-xl`. Instructor models take an additional `instruction` parameter which includes context for the embeddings use case, similar to prompts before text generation tasks.

+!!! note
+
+"Alibaba-NLP/gte-base-en-v1.5" surpassed the quality of instructor-xl and should be used instead, but we've left this documentation available for existing users.
+
+!!!
+
 Instructions can provide a "classification" or "topic" for the text:

 #### Classification

@@ -325,7 +331,7 @@ BEGIN
 UPDATE pgml.amazon_us_reviews
 SET review_embedding_e5_large = pgml.embed(
-    'intfloat/e5-large',
+    'Alibaba-NLP/gte-base-en-v1.5',
     'passage: ' || review_body
 )
 WHERE id BETWEEN i AND i + 10

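The last hunk above regenerates embeddings in small batches. A client-side sketch of the same pattern, assuming psycopg2 and the `pgml.amazon_us_reviews` table from the post; the DSN and batch size are illustrative, and the column name is kept from the diff (its vector dimension must match the model's output):

```python
import psycopg2

# Batched re-embedding, mirroring UPDATE ... WHERE id BETWEEN i AND i + 10
# from the diff above. DSN and batch size are illustrative.
conn = psycopg2.connect("postgres://user:pass@host:5432/db")
conn.autocommit = True  # commit each batch so progress survives interruption

BATCH = 10
with conn.cursor() as cur:
    cur.execute("SELECT min(id), max(id) FROM pgml.amazon_us_reviews")
    lo, hi = cur.fetchone()
    for i in range(lo, hi + 1, BATCH + 1):
        cur.execute(
            """
            UPDATE pgml.amazon_us_reviews
            SET review_embedding_e5_large = pgml.embed(
                'Alibaba-NLP/gte-base-en-v1.5',
                'passage: ' || review_body
            )
            WHERE id BETWEEN %s AND %s
            """,
            (i, i + BATCH),
        )
```
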
pgml-cms/blog/introducing-the-openai-switch-kit-move-from-closed-to-open-source-ai-in-minutes.md

Lines changed: 4 additions & 4 deletions

@@ -44,7 +44,7 @@ The Switch Kit is an open-source AI SDK that provides a drop-in replacement for
 const pgml = require("pgml");
 const client = pgml.newOpenSourceAI();
 const results = client.chat_completions_create(
-  "HuggingFaceH4/zephyr-7b-beta",
+  "meta-llama/Meta-Llama-3-8B-Instruct",
   [
     {
       role: "system",

@@ -65,7 +65,7 @@ console.log(results);
 import pgml
 client = pgml.OpenSourceAI()
 results = client.chat_completions_create(
-    "HuggingFaceH4/zephyr-7b-beta",
+    "meta-llama/Meta-Llama-3-8B-Instruct",
     [
         {
             "role": "system",

@@ -96,7 +96,7 @@ print(results)
   ],
   "created": 1701291672,
   "id": "abf042d2-9159-49cb-9fd3-eef16feb246c",
-  "model": "HuggingFaceH4/zephyr-7b-beta",
+  "model": "meta-llama/Meta-Llama-3-8B-Instruct",
   "object": "chat.completion",
   "system_fingerprint": "eecec9d4-c28b-5a27-f90b-66c3fb6cee46",
   "usage": {

@@ -113,7 +113,7 @@ We don't charge per token, so OpenAI “usage” metrics are not particularly re

 !!!

-The above is an example using our open-source AI SDK with zephyr-7b-beta, an incredibly popular and highly efficient 7 billion parameter model.
+The above is an example using our open-source AI SDK with Meta-Llama-3-8B-Instruct, an incredibly popular and highly efficient 8 billion parameter model.

 Notice there is a near one-to-one relation between the parameters and return type of OpenAI's `chat.completions.create` and our `chat_completion_create`.

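Assembled from the hunks above, the updated Python example runs end to end like this; the SDK call is exactly the one shown in the diff, and the messages are illustrative, in the spirit of the post:

```python
import pgml

# Drop-in replacement for the OpenAI client, per the Switch Kit post.
client = pgml.OpenSourceAI()
results = client.chat_completions_create(
    "meta-llama/Meta-Llama-3-8B-Instruct",
    [
        {
            "role": "system",
            "content": "You are a friendly chatbot who always responds in the style of a pirate.",
        },
        {
            "role": "user",
            "content": "How many helicopters can a human eat in one sitting?",
        },
    ],
)
print(results)
```
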
pgml-cms/blog/llm-based-pipelines-with-postgresml-and-dbt-data-build-tool.md

Lines changed: 2 additions & 2 deletions

@@ -119,7 +119,7 @@ vars:
   splitter_name: "recursive_character"
   splitter_parameters: {"chunk_size": 100, "chunk_overlap": 20}
   task: "embedding"
-  model_name: "intfloat/e5-base"
+  model_name: "intfloat/e5-small-v2"
   query_string: 'Lorem ipsum 3'
   limit: 2
 ```

@@ -129,7 +129,7 @@ Here's a summary of the key parameters:
 * `splitter_name`: Specifies the name of the splitter, set as "recursive_character".
 * `splitter_parameters`: Defines the parameters for the splitter, such as a chunk size of 100 and a chunk overlap of 20.
 * `task`: Indicates the task being performed, specified as "embedding".
-* `model_name`: Specifies the name of the model to be used, set as "intfloat/e5-base".
+* `model_name`: Specifies the name of the model to be used, set as "intfloat/e5-small-v2".
 * `query_string`: Provides a query string, set as 'Lorem ipsum 3'.
 * `limit`: Specifies a limit of 2, indicating the maximum number of results to be processed.

pgml-cms/blog/personalize-embedding-results-with-application-data-in-your-database.md

Lines changed: 2 additions & 2 deletions

@@ -137,7 +137,7 @@ We can find a customer that our embeddings model feels is close to the sentiment
 ```postgresql
 WITH request AS (
   SELECT pgml.embed(
-    'intfloat/e5-large',
+    'Alibaba-NLP/gte-base-en-v1.5',
     'query: I love all Star Wars, but Empire Strikes Back is particularly amazing'
   )::vector(1024) AS embedding
 )

@@ -214,7 +214,7 @@ Now we can write our personalized SQL query. It's nearly the same as our query f
 -- create a request embedding on the fly
 WITH request AS (
   SELECT pgml.embed(
-    'intfloat/e5-large',
+    'Alibaba-NLP/gte-base-en-v1.5',
     'query: Best 1980''s scifi movie'
   )::vector(1024) AS embedding
 ),

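A minimal client-side sketch of generating the request embedding above, assuming psycopg2; the DSN is illustrative. Note that gte-base-en-v1.5 produces 768-dimensional embeddings, so the sketch casts to `vector(768)`; the posts' `vector(1024)` casts date from the 1024-dimensional e5-large model:

```python
import psycopg2

# Generate a request embedding on the fly, as in the CTEs above.
# DSN is illustrative. gte-base-en-v1.5 emits 768 dimensions, hence vector(768).
conn = psycopg2.connect("postgres://user:pass@host:5432/db")
with conn.cursor() as cur:
    cur.execute(
        "SELECT pgml.embed(%s, %s)::vector(768)",
        (
            "Alibaba-NLP/gte-base-en-v1.5",
            "query: I love all Star Wars, but Empire Strikes Back is particularly amazing",
        ),
    )
    embedding = cur.fetchone()[0]  # returned as a '[...]' string by pgvector
```
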
pgml-cms/blog/pgml-chat-a-command-line-tool-for-deploying-low-latency-knowledge-based-chatbots-part-i.md

Lines changed: 2 additions & 4 deletions

@@ -127,9 +127,7 @@ cp .env.template .env
 ```bash
 OPENAI_API_KEY=<OPENAI_API_KEY>
 DATABASE_URL=<POSTGRES_DATABASE_URL starts with postgres://>
-MODEL=hkunlp/instructor-xl
-MODEL_PARAMS={"instruction": "Represent the document for retrieval:"}
-QUERY_PARAMS={"instruction": "Represent the question for retrieving supporting documents:"}
+MODEL=Alibaba-NLP/gte-base-en-v1.5
 SYSTEM_PROMPT=<> # System prompt used for OpenAI chat completion
 BASE_PROMPT=<> # Base prompt used for OpenAI chat completion for each turn
 SLACK_BOT_TOKEN=<SLACK_BOT_TOKEN> # Slack bot token to run Slack chat service

@@ -332,7 +330,7 @@ Once the discord app is running, you can interact with the chatbot on Discord as

 ### PostgresML vs. Hugging Face + Pinecone

-To evaluate query latency, we performed an experiment with 10,000 Wikipedia documents from the SQuAD dataset. Embeddings were generated using the intfloat/e5-large model.
+To evaluate query latency, we performed an experiment with 10,000 Wikipedia documents from the SQuAD dataset. Embeddings were generated using the Alibaba-NLP/gte-base-en-v1.5 model.

 For PostgresML, we used a GPU-powered serverless database running on NVIDIA A10G GPUs with the client in the us-west-2 region. For Hugging Face, we used their inference API endpoint running on NVIDIA A10G GPUs in the us-east-1 region and a client in the same us-east-1 region. Pinecone was used as the vector search index for the Hugging Face embeddings.

pgml-cms/blog/speeding-up-vector-recall-5x-with-hnsw.md

Lines changed: 2 additions & 2 deletions

@@ -45,7 +45,7 @@ Let's run that query again:
 ```postgresql
 WITH request AS (
   SELECT pgml.embed(
-    'intfloat/e5-large',
+    'Alibaba-NLP/gte-base-en-v1.5',
     'query: Best 1980''s scifi movie'
   )::vector(1024) AS embedding
 )

@@ -100,7 +100,7 @@ Now let's try the query again utilizing the new HNSW index we created.
 ```postgresql
 WITH request AS (
   SELECT pgml.embed(
-    'intfloat/e5-large',
+    'Alibaba-NLP/gte-base-en-v1.5',
     'query: Best 1980''s scifi movie'
   )::vector(1024) AS embedding
 )

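The queries above rely on an HNSW index built earlier in the post. For reference, a sketch of creating one with pgvector (0.5.0+) from Python; the DSN and index name are illustrative, and `vector_cosine_ops` assumes cosine distance is the search metric:

```python
import psycopg2

# Build an HNSW index on the embedding column used by the queries above.
# DSN and index name are illustrative; requires pgvector 0.5.0 or newer.
conn = psycopg2.connect("postgres://user:pass@host:5432/db")
conn.autocommit = True  # CREATE INDEX CONCURRENTLY cannot run in a transaction
with conn.cursor() as cur:
    cur.execute(
        """
        CREATE INDEX CONCURRENTLY review_embedding_hnsw_idx
        ON pgml.amazon_us_reviews
        USING hnsw (review_embedding_e5_large vector_cosine_ops)
        """
    )
```
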
pgml-cms/blog/the-1.0-sdk-is-here.md

Lines changed: 2 additions & 2 deletions

@@ -50,7 +50,7 @@ const pipeline = pgml.newPipeline("my_pipeline", {
   text: {
     splitter: { model: "recursive_character" },
     semantic_search: {
-      model: "intfloat/e5-small",
+      model: "Alibaba-NLP/gte-base-en-v1.5",
     },
   },
 });

@@ -90,7 +90,7 @@ pipeline = Pipeline(
     "text": {
         "splitter": {"model": "recursive_character"},
         "semantic_search": {
-            "model": "intfloat/e5-small",
+            "model": "Alibaba-NLP/gte-base-en-v1.5",
         },
     },
 },

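A runnable sketch of the updated 1.0 pipeline in Python, assuming the async `pgml` 1.0 SDK and its `Collection.add_pipeline` registration call; the collection name is illustrative:

```python
import asyncio
from pgml import Collection, Pipeline

async def main():
    # Collection name is illustrative; the pipeline schema is from the diff above.
    collection = Collection("my_collection")
    pipeline = Pipeline(
        "my_pipeline",
        {
            "text": {
                "splitter": {"model": "recursive_character"},
                "semantic_search": {
                    "model": "Alibaba-NLP/gte-base-en-v1.5",
                },
            },
        },
    )
    # Registers the pipeline so documents added to the collection are
    # split and embedded automatically (1.0 SDK, async API).
    await collection.add_pipeline(pipeline)

asyncio.run(main())
```
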
pgml-cms/blog/tuning-vector-recall-while-generating-query-embeddings-in-the-database.md

Lines changed: 6 additions & 6 deletions

@@ -124,7 +124,7 @@ We'll start with semantic search. Given a user query, e.g. "Best 1980's scifi mo
 ```postgresql
 WITH request AS (
   SELECT pgml.embed(
-    'intfloat/e5-large',
+    'Alibaba-NLP/gte-base-en-v1.5',
     'query: Best 1980''s scifi movie'
   )::vector(1024) AS embedding
 )

@@ -171,7 +171,7 @@ Generating a query plan more quickly and only computing the values once, may mak
 There's some good stuff happening in those query results, so let's break it down:

 * **It's fast** - We're able to generate a request embedding on the fly with a state-of-the-art model, and search 5M reviews in 152ms, including fetching the results back to the client 😍. You can't even generate an embedding from OpenAI's API in that time, much less search 5M reviews in some other database with it.
-* **It's good** - The `review_body` results are very similar to the "Best 1980's scifi movie" request text. We're using the `intfloat/e5-large` open source embedding model, which outperforms OpenAI's `text-embedding-ada-002` in most [quality benchmarks](https://huggingface.co/spaces/mteb/leaderboard).
+* **It's good** - The `review_body` results are very similar to the "Best 1980's scifi movie" request text. We're using the `Alibaba-NLP/gte-base-en-v1.5` open source embedding model, which outperforms OpenAI's `text-embedding-ada-002` in most [quality benchmarks](https://huggingface.co/spaces/mteb/leaderboard).
   * Qualitatively: the embeddings understand our request for `scifi` being equivalent to `Sci-Fi`, `sci-fi`, `SciFi`, and `sci fi`, as well as `1980's` matching `80s` and `80's`, and being close to `seventies` (last place). We didn't have to configure any of this, and the most enthusiastic review for "best" is at the top while the least enthusiastic is at the bottom, so the model has appropriately captured "sentiment".
   * Quantitatively: the `cosine_similarity` of all results is high and tight, 0.90-0.95 on a scale from -1:1. We can be confident we recalled very similar results from our 5M candidates, even though it would take 485 times as long to check all of them directly.
 * **It's reliable** - The model is stored in the database, so we don't need to worry about managing a separate service. If you repeat this query over and over, the timings will be extremely consistent, because we don't have to deal with things like random network congestion.

@@ -254,7 +254,7 @@ Now we can quickly search for movies by what people have said about them:
 ```postgresql
 WITH request AS (
   SELECT pgml.embed(
-    'intfloat/e5-large',
+    'Alibaba-NLP/gte-base-en-v1.5',
     'Best 1980''s scifi movie'
   )::vector(1024) AS embedding
 )

@@ -312,7 +312,7 @@ SET ivfflat.probes = 300;
 ```postgresql
 WITH request AS (
   SELECT pgml.embed(
-    'intfloat/e5-large',
+    'Alibaba-NLP/gte-base-en-v1.5',
     'Best 1980''s scifi movie'
   )::vector(1024) AS embedding
 )

@@ -401,7 +401,7 @@ SET ivfflat.probes = 1;
 ```postgresql
 WITH request AS (
   SELECT pgml.embed(
-    'intfloat/e5-large',
+    'Alibaba-NLP/gte-base-en-v1.5',
     'query: Best 1980''s scifi movie'
   )::vector(1024) AS embedding
 )

@@ -457,7 +457,7 @@ SQL is a very expressive language that can handle a lot of complexity. To keep t
 -- create a request embedding on the fly
 WITH request AS (
   SELECT pgml.embed(
-    'intfloat/e5-large',
+    'Alibaba-NLP/gte-base-en-v1.5',
     'query: Best 1980''s scifi movie'
   )::vector(1024) AS embedding
 ),

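To close the loop, a hedged sketch of the request CTE driving a recall query, since the surrounding SELECT isn't part of these hunks. The table and column names and the pgvector cosine-distance operator (`<=>`) are illustrative, and the embedding column's dimension must match the model's 768-dimensional output:

```python
import psycopg2

# Request-embedding CTE plus a vector recall query, following the pattern in
# the hunks above. Table/column names and the <=> operator are illustrative.
conn = psycopg2.connect("postgres://user:pass@host:5432/db")
with conn.cursor() as cur:
    cur.execute(
        """
        WITH request AS (
            SELECT pgml.embed(
                'Alibaba-NLP/gte-base-en-v1.5',
                'query: Best 1980''s scifi movie'
            )::vector(768) AS embedding
        )
        SELECT
            review_body,
            1 - (review_embedding <=> (SELECT embedding FROM request)) AS cosine_similarity
        FROM reviews
        ORDER BY review_embedding <=> (SELECT embedding FROM request)
        LIMIT 5
        """
    )
    for review_body, cosine_similarity in cur.fetchall():
        print(round(cosine_similarity, 3), review_body[:80])
```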