Commit b88ef63

Browse files

authored
Added new examples for JavaScript (#953)
1 parent dc27921 commit b88ef63

File tree

9 files changed
+193 −18 lines changed
Lines changed: 10 additions & 4 deletions

```diff
@@ -1,7 +1,13 @@
-## Javascript Examples
+## Examples
 
-Here we have a set of examples of different use cases of the pgml javascript SDK.
+### [Semantic Search](./semantic_search.js)
+This is a basic example to perform semantic search on a collection of documents. Embeddings are created using the `intfloat/e5-small` model. The results are documents semantically similar to the query. Finally, the collection is archived.
 
-## Examples:
+### [Question Answering](./question_answering.js)
+This is an example to find documents relevant to a question from the collection of documents. The query is passed to vector search to retrieve documents that match closely in the embedding space. A score is returned with each search result.
 
-1. [Getting Started](./getting-started/) - Simple project that uses the pgml SDK to create a collection, add a pipeline, upsert documents, and run a vector search on the collection.
+### [Question Answering using Instructor Model](./question_answering_instructor.js)
+In this example, we will use the `hkunlp/instructor-base` model to build text embeddings instead of the default `intfloat/e5-small` model.
+
+### [Extractive Question Answering](./extractive_question_answering.js)
+In this example, we will show how to use the `vector_recall` result as a `context` for a HuggingFace question answering model. We will use `Builtins.transform()` to run the model on the database.
```
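As a reference for the Extractive Question Answering example added in this commit: `Builtins.transform("question-answering", ...)` receives JSON-encoded `{question, context}` strings. A minimal sketch of the payload it builds (the question and context values are taken from the example file; the rest is illustration only):

```javascript
const query = "What is the best tool for machine learning?";
const context = "PostgresML is the best tool for machine learning applications!";

// The transform call takes an array of JSON-encoded question/context pairs;
// this is the string the example passes in.
const payload = JSON.stringify({ question: query, context: context });
console.log(payload);
```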
Lines changed: 62 additions & 0 deletions

```javascript
const pgml = require("pgml");
require("dotenv").config();

pgml.js_init_logger();

const main = async () => {
  // Initialize the collection
  const collection = pgml.newCollection("my_javascript_eqa_collection_2");

  // Add a pipeline
  const model = pgml.newModel();
  const splitter = pgml.newSplitter();
  const pipeline = pgml.newPipeline(
    "my_javascript_eqa_pipeline_1",
    model,
    splitter,
  );
  await collection.add_pipeline(pipeline);

  // Upsert documents; these documents are automatically split into chunks and embedded by our pipeline
  const documents = [
    {
      id: "Document One",
      text: "PostgresML is the best tool for machine learning applications!",
    },
    {
      id: "Document Two",
      text: "PostgresML is open source and available to everyone!",
    },
  ];
  await collection.upsert_documents(documents);

  const query = "What is the best tool for machine learning?";

  // Perform vector search
  const queryResults = await collection
    .query()
    .vector_recall(query, pipeline)
    .limit(1)
    .fetch_all();

  // Construct context from results
  const context = queryResults
    .map((result) => {
      return result[1];
    })
    .join("\n");

  // Query for answer
  const builtins = pgml.newBuiltins();
  const answer = await builtins.transform("question-answering", [
    JSON.stringify({ question: query, context: context }),
  ]);

  // Archive the collection
  await collection.archive();
  return answer;
};

main().then((results) => {
  console.log("Question answer: \n", results);
});
```
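The context-construction step above operates on search-result tuples whose second element is the chunk text. A standalone sketch of that step, using hypothetical `vector_recall` results (the similarity scores and metadata here are made up for illustration):

```javascript
// Hypothetical fetch_all() results: [similarity, text, metadata] tuples.
const queryResults = [
  [0.91, "PostgresML is the best tool for machine learning applications!", { id: "Document One" }],
  [0.47, "PostgresML is open source and available to everyone!", { id: "Document Two" }],
];

// Keep each result's text (index 1) and join into a single context string.
const context = queryResults.map((result) => result[1]).join("\n");
console.log(context);
```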

pgml-sdks/rust/pgml/javascript/examples/getting-started/README.md

Lines changed: 0 additions & 12 deletions
This file was deleted.
Lines changed: 55 additions & 0 deletions

```javascript
const pgml = require("pgml");
require("dotenv").config();

const main = async () => {
  // Initialize the collection
  const collection = pgml.newCollection("my_javascript_qa_collection");

  // Add a pipeline
  const model = pgml.newModel();
  const splitter = pgml.newSplitter();
  const pipeline = pgml.newPipeline(
    "my_javascript_qa_pipeline",
    model,
    splitter,
  );
  await collection.add_pipeline(pipeline);

  // Upsert documents; these documents are automatically split into chunks and embedded by our pipeline
  const documents = [
    {
      id: "Document One",
      text: "PostgresML is the best tool for machine learning applications!",
    },
    {
      id: "Document Two",
      text: "PostgresML is open source and available to everyone!",
    },
  ];
  await collection.upsert_documents(documents);

  // Perform vector search
  const queryResults = await collection
    .query()
    .vector_recall("What is the best tool for machine learning?", pipeline)
    .limit(1)
    .fetch_all();

  // Convert the results to an array of objects
  const results = queryResults.map((result) => {
    const [similarity, text, metadata] = result;
    return {
      similarity,
      text,
      metadata,
    };
  });

  // Archive the collection
  await collection.archive();
  return results;
};

main().then((results) => {
  console.log("Vector search Results: \n", results);
});
```
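The "convert the results to an array of objects" step above destructures each `[similarity, text, metadata]` tuple into a labeled object. A standalone sketch with a hypothetical result (the score and metadata are invented for illustration):

```javascript
// Hypothetical fetch_all() result: one [similarity, text, metadata] tuple.
const queryResults = [
  [0.87, "PostgresML is the best tool for machine learning applications!", { id: "Document One" }],
];

// Destructure each tuple into a labeled object, as the example does.
const results = queryResults.map((result) => {
  const [similarity, text, metadata] = result;
  return { similarity, text, metadata };
});
console.log(results);
```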
Lines changed: 60 additions & 0 deletions

```javascript
const pgml = require("pgml");
require("dotenv").config();

const main = async () => {
  // Initialize the collection
  const collection = pgml.newCollection("my_javascript_qai_collection");

  // Add a pipeline
  const model = pgml.newModel("hkunlp/instructor-base", "pgml", {
    instruction: "Represent the Wikipedia document for retrieval: ",
  });
  const splitter = pgml.newSplitter();
  const pipeline = pgml.newPipeline(
    "my_javascript_qai_pipeline",
    model,
    splitter,
  );
  await collection.add_pipeline(pipeline);

  // Upsert documents; these documents are automatically split into chunks and embedded by our pipeline
  const documents = [
    {
      id: "Document One",
      text: "PostgresML is the best tool for machine learning applications!",
    },
    {
      id: "Document Two",
      text: "PostgresML is open source and available to everyone!",
    },
  ];
  await collection.upsert_documents(documents);

  // Perform vector search
  const queryResults = await collection
    .query()
    .vector_recall("What is the best tool for machine learning?", pipeline, {
      instruction:
        "Represent the Wikipedia question for retrieving supporting documents: ",
    })
    .limit(1)
    .fetch_all();

  // Convert the results to an array of objects
  const results = queryResults.map((result) => {
    const [similarity, text, metadata] = result;
    return {
      similarity,
      text,
      metadata,
    };
  });

  // Archive the collection
  await collection.archive();
  return results;
};

main().then((results) => {
  console.log("Vector search Results: \n", results);
});
```
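Instructor-style models pair a task instruction with the raw text before embedding, which is why the example above passes one instruction when indexing documents and a different one at query time. A conceptual sketch of those inputs (illustration only, not the SDK's internal code):

```javascript
// The two instructions used in the example above: one for the document
// side (indexing) and one for the query side (retrieval).
const docInstruction = "Represent the Wikipedia document for retrieval: ";
const queryInstruction =
  "Represent the Wikipedia question for retrieving supporting documents: ";

// Conceptually, instruction and text are combined into a single model input.
const docInput = docInstruction + "PostgresML is open source and available to everyone!";
const queryInput = queryInstruction + "What is the best tool for machine learning?";
console.log(docInput);
console.log(queryInput);
```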

pgml-sdks/rust/pgml/javascript/examples/getting-started/index.js renamed to pgml-sdks/rust/pgml/javascript/examples/semantic_search.js

Lines changed: 5 additions & 1 deletion

```diff
@@ -27,7 +27,10 @@ const main = async () => {
   // Perform vector search
   const queryResults = await collection
     .query()
-    .vector_recall("Some user query that will match document one first", pipeline)
+    .vector_recall(
+      "Some user query that will match document one first",
+      pipeline,
+    )
     .limit(2)
     .fetch_all();
 
@@ -41,6 +44,7 @@ const main = async () => {
     };
   });
 
+  // Archive the collection
   await collection.archive();
   return results;
 };
```

pgml-sdks/rust/pgml/python/examples/README.md

Lines changed: 1 addition & 1 deletion

```diff
@@ -1,7 +1,7 @@
 ## Examples
 
 ### [Semantic Search](./semantic_search.py)
-This is a basic example to perform semantic search on a collection of documents. It loads the Quora dataset, creates a collection in a PostgreSQL database, upserts documents, generates chunks and embeddings, and then performs a vector search on a query. Embeddings are created using the `intfloat/e5-small` model. The results areare semantically similar documemts to the query. Finally, the collection is archived.
+This is a basic example to perform semantic search on a collection of documents. It loads the Quora dataset, creates a collection in a PostgreSQL database, upserts documents, generates chunks and embeddings, and then performs a vector search on a query. Embeddings are created using the `intfloat/e5-small` model. The results are semantically similar documents to the query. Finally, the collection is archived.
 
 ### [Question Answering](./question_answering.py)
 This is an example to find documents relevant to a question from the collection of documents. It loads the Stanford Question Answering Dataset (SQuAD) into the database and generates chunks and embeddings. The query is passed to vector search to retrieve documents that match closely in the embedding space. A score is returned with each search result.
```
