Readme update in progress #586
Merged
Commits
53 commits
0a33546 Readme update in progress (santiatpml)
c006523 Updated hugs emoji (santiatpml)
ba8d050 Readme added dashboard image (santiatpml)
ae520e6 Getting started in progress (santiatpml)
d822e43 Getting started in progress (santiatpml)
a7e9ce4 Added notebooks image (santiatpml)
7c8b982 Updated dashboard image and some edits (santiatpml)
3ae4024 Added protobuf for finbert support and text-classification readme in … (santiatpml)
0daba37 Using sql instead of json for highlighting (santiatpml)
0e51c29 update dependencies (#588) (montanalow)
3e06339 Updates to text-classification (santiatpml)
755580a First version of text classification (santiatpml)
345eb79 Added grammatical correctness (santiatpml)
b6cfcdd Added zero-shot classification (santiatpml)
d025f12 readme for token classification (santiatpml)
91557e3 Moved results from sql to json (santiatpml)
4ffae4e Images for different tasks (santiatpml)
4f21192 Updated table of contents (santiatpml)
db9523c Update to 0.7.4 (#591)
e02eaff fix for np.float32 serialization (#589) (santiatpml)
8c3ee5e Readme update in progress (santiatpml)
b6476eb Updated hugs emoji (santiatpml)
5a03402 Readme added dashboard image (santiatpml)
970b7be Getting started in progress (santiatpml)
3938ba5 Getting started in progress (santiatpml)
7edfbf4 Added notebooks image (santiatpml)
cb9b2d4 Updated dashboard image and some edits (santiatpml)
2f33c43 Added protobuf for finbert support and text-classification readme in … (santiatpml)
47e0cea Using sql instead of json for highlighting (santiatpml)
ad16887 Updates to text-classification (santiatpml)
8721ce8 First version of text classification (santiatpml)
daf045c Added grammatical correctness (santiatpml)
5749330 Added zero-shot classification (santiatpml)
a2bcd1d readme for token classification (santiatpml)
6c3a98c Moved results from sql to json (santiatpml)
760b520 Images for different tasks (santiatpml)
fca5ef2 Updated table of contents (santiatpml)
c347f9b Documentation for more tasks (santiatpml)
a1ef779 Updated with more tasks (santiatpml)
f94cc3c Expanded text generation section (santiatpml)
f8891c2 Removed Table QA from toc (santiatpml)
8381fe8 Text2text generation (santiatpml)
592fc59 Added fill mask section (santiatpml)
42a6541 Started Vector DB section (santiatpml)
c728d7e First version of vector databases (santiatpml)
3ee5b8c Reset docker compose and docker local to original (santiatpml)
c9596a7 Update README.md (santiatpml)
bd197a6 Update README.md (santiatpml)
629ffe0 Update README.md (santiatpml)
a3f45c9 Update README.md (santiatpml)
d2bd901 Update README.md (santiatpml)
0016d07 Update README.md (santiatpml)
27e1029 Updated tagline (santiatpml)
856 changes: 811 additions & 45 deletions README.md
Large diffs are not rendered by default.
Binary file added pgml-docs/docs/images/dashboard.png
Binary file added pgml-docs/docs/images/fill-mask.png
Binary file added pgml-docs/docs/images/notebooks.png
Binary file added pgml-docs/docs/images/question-answering.png
Binary file added pgml-docs/docs/images/sentence-similarity.png
Binary file added pgml-docs/docs/images/summarization.png
Binary file added pgml-docs/docs/images/table-question-answering.png
Binary file added pgml-docs/docs/images/text-classification.png
Binary file added pgml-docs/docs/images/text-generation.png
Binary file added pgml-docs/docs/images/text-to-text-generation.png
Binary file added pgml-docs/docs/images/token-classification.png
Binary file added pgml-docs/docs/images/translation.png
Binary file added pgml-docs/docs/images/zero-shot-classification.png
571 changes: 317 additions & 254 deletions pgml-extension/Cargo.lock
Large diffs are not rendered by default.
6 changes: 3 additions & 3 deletions pgml-extension/Cargo.toml
2 changes: 1 addition & 1 deletion pgml-extension/Dockerfile
90 changes: 90 additions & 0 deletions pgml-extension/examples/finetune.sql
| Original file line number | Diff line number | Diff line change |
|---|---|---|
| @@ -0,0 +1,90 @@ | ||
| -- Exit on error (psql) | ||
| \set ON_ERROR_STOP true | ||
| \timing on | ||
| SELECT pgml.load_dataset('kde4', kwargs => '{"lang1": "en", "lang2": "es"}'); | ||
| CREATE OR REPLACE VIEW kde4_en_to_es AS | ||
| SELECT translation->>'en' AS "en", translation->>'es' AS "es" | ||
| FROM pgml.kde4 | ||
| LIMIT 10; | ||
| SELECT pgml.tune( | ||
| 'Translate English to Spanish', | ||
| task => 'translation', | ||
| relation_name => 'kde4_en_to_es', | ||
| y_column_name => 'es', -- translate into spanish | ||
| model_name => 'Helsinki-NLP/opus-mt-en-es', | ||
| hyperparams => '{ | ||
| "learning_rate": 2e-5, | ||
| "per_device_train_batch_size": 16, | ||
| "per_device_eval_batch_size": 16, | ||
| "num_train_epochs": 1, | ||
| "weight_decay": 0.01, | ||
| "max_length": 128 | ||
| }', | ||
| test_size => 0.5, | ||
| test_sampling => 'last' | ||
| ); | ||
| SELECT pgml.load_dataset('imdb'); | ||
| SELECT pgml.tune( | ||
| 'IMDB Review Sentiment', | ||
| task => 'text-classification', | ||
| relation_name => 'pgml.imdb', | ||
| y_column_name => 'label', | ||
| model_name => 'distilbert-base-uncased', | ||
| hyperparams => '{ | ||
| "learning_rate": 2e-5, | ||
| "per_device_train_batch_size": 16, | ||
| "per_device_eval_batch_size": 16, | ||
| "num_train_epochs": 1, | ||
| "weight_decay": 0.01 | ||
| }', | ||
| test_size => 0.5, | ||
| test_sampling => 'last' | ||
| ); | ||
| SELECT pgml.predict('IMDB Review Sentiment', 'I love SQL'); | ||
| SELECT pgml.load_dataset('squad_v2'); | ||
| SELECT pgml.tune( | ||
| 'SQuAD Q&A v2', | ||
| 'question-answering', | ||
| 'pgml.squad_v2', | ||
| 'answers', | ||
| 'deepset/roberta-base-squad2', | ||
| hyperparams => '{ | ||
| "evaluation_strategy": "epoch", | ||
| "learning_rate": 2e-5, | ||
| "per_device_train_batch_size": 16, | ||
| "per_device_eval_batch_size": 16, | ||
| "num_train_epochs": 1, | ||
| "weight_decay": 0.01, | ||
| "max_length": 384, | ||
| "stride": 128 | ||
| }', | ||
| test_size => 11873, | ||
| test_sampling => 'last' | ||
| ); | ||
| SELECT pgml.load_dataset('billsum', kwargs => '{"split": "ca_test"}'); | ||
| CREATE OR REPLACE VIEW billsum_training_data | ||
| AS SELECT title || E'\n' || text AS text, summary FROM pgml.billsum; | ||
| SELECT pgml.tune( | ||
| 'Legal Summarization', | ||
| task => 'summarization', | ||
| relation_name => 'billsum_training_data', | ||
| y_column_name => 'summary', | ||
| model_name => 'sshleifer/distilbart-xsum-12-1', | ||
| hyperparams => '{ | ||
| "learning_rate": 2e-5, | ||
| "per_device_train_batch_size": 2, | ||
| "per_device_eval_batch_size": 2, | ||
| "num_train_epochs": 1, | ||
| "weight_decay": 0.01, | ||
| "max_input_length": 1024, | ||
| "max_summary_length": 128 | ||
| }', | ||
| test_size => 0.01, | ||
| test_sampling => 'last' | ||
| ); |
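
Every `pgml.tune` call above passes `test_size` and `test_sampling => 'last'`, sometimes with a fraction (`0.5`, `0.01`) and once with a row count (`11873`). The Python sketch below shows one plausible reading of those arguments. It assumes that a value of at most 1 is a fraction of the rows, that a larger value is an absolute row count, and that `'last'` holds out the trailing rows. `split_last` is a hypothetical helper for illustration only, not part of pgml, and the extension's actual behavior may differ.

```python
def split_last(rows, test_size):
    """Hold out the trailing rows as the test set (hypothetical helper)."""
    n = len(rows)
    # Assumption: values <= 1 are a fraction of the rows, larger values a row count.
    n_test = int(n * test_size) if test_size <= 1 else int(test_size)
    n_test = min(n_test, n)  # never hold out more rows than exist
    return rows[: n - n_test], rows[n - n_test:]


# Fractional test_size, as in the translation and IMDB examples.
train, test = split_last(list(range(10)), 0.5)

# Absolute test_size, as in the SQuAD example.
train_abs, test_abs = split_last(list(range(10)), 3)
```

Under this reading the row order of the relation matters, since the held-out rows are always the final ones rather than a random sample.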
129 changes: 50 additions & 79 deletions pgml-extension/examples/transformers.sql
12 changes: 9 additions & 3 deletions pgml-extension/src/bindings/transformers.py
1 change: 1 addition & 0 deletions pgml-extension/tests/test.sql