Readme update in progress #586
Merged
53 commits
0a33546 Readme update in progress (santiatpml)
c006523 Updated hugs emoji (santiatpml)
ba8d050 Readme added dashboard image (santiatpml)
ae520e6 Getting started in progress (santiatpml)
d822e43 Getting started in progress (santiatpml)
a7e9ce4 Added notebooks image (santiatpml)
7c8b982 Updated dashboard image and some edits (santiatpml)
3ae4024 Added protobuf for finbert support and text-classification readme in … (santiatpml)
0daba37 Using sql instead of json for highlighting (santiatpml)
0e51c29 update dependencies (#588) (montanalow)
3e06339 Updates to text-classification (santiatpml)
755580a First version of text classification (santiatpml)
345eb79 Added grammatical correctness (santiatpml)
b6cfcdd Added zero-shot classification (santiatpml)
d025f12 readme for token classification (santiatpml)
91557e3 Moved results from sql to json (santiatpml)
4ffae4e Images for different tasks (santiatpml)
4f21192 Updated table of contents (santiatpml)
db9523c Update to 0.7.4 (#591)
e02eaff fix for np.float32 serialization (#589) (santiatpml)
8c3ee5e Readme update in progress (santiatpml)
b6476eb Updated hugs emoji (santiatpml)
5a03402 Readme added dashboard image (santiatpml)
970b7be Getting started in progress (santiatpml)
3938ba5 Getting started in progress (santiatpml)
7edfbf4 Added notebooks image (santiatpml)
cb9b2d4 Updated dashboard image and some edits (santiatpml)
2f33c43 Added protobuf for finbert support and text-classification readme in … (santiatpml)
47e0cea Using sql instead of json for highlighting (santiatpml)
ad16887 Updates to text-classification (santiatpml)
8721ce8 First version of text classification (santiatpml)
daf045c Added grammatical correctness (santiatpml)
5749330 Added zero-shot classification (santiatpml)
a2bcd1d readme for token classification (santiatpml)
6c3a98c Moved results from sql to json (santiatpml)
760b520 Images for different tasks (santiatpml)
fca5ef2 Updated table of contents (santiatpml)
c347f9b Documentation for more tasks (santiatpml)
a1ef779 Updated with more tasks (santiatpml)
f94cc3c Expanded text generation section (santiatpml)
f8891c2 Removed Table QA from toc (santiatpml)
8381fe8 Text2text generation (santiatpml)
592fc59 Added fill mask section (santiatpml)
42a6541 Started Vector DB section (santiatpml)
c728d7e First version of vector databases (santiatpml)
3ee5b8c Reset docker compose and docker local to original (santiatpml)
c9596a7 Update README.md (santiatpml)
bd197a6 Update README.md (santiatpml)
629ffe0 Update README.md (santiatpml)
a3f45c9 Update README.md (santiatpml)
d2bd901 Update README.md (santiatpml)
0016d07 Update README.md (santiatpml)
27e1029 Updated tagline (santiatpml)
856 changes: 811 additions & 45 deletions README.md
Large diffs are not rendered by default.
Binary file added pgml-docs/docs/images/dashboard.png
Binary file added pgml-docs/docs/images/fill-mask.png
Binary file added pgml-docs/docs/images/notebooks.png
Binary file added pgml-docs/docs/images/question-answering.png
Binary file added pgml-docs/docs/images/sentence-similarity.png
Binary file added pgml-docs/docs/images/summarization.png
Binary file added pgml-docs/docs/images/table-question-answering.png
Binary file added pgml-docs/docs/images/text-classification.png
Binary file added pgml-docs/docs/images/text-generation.png
Binary file added pgml-docs/docs/images/text-to-text-generation.png
Binary file added pgml-docs/docs/images/token-classification.png
Binary file added pgml-docs/docs/images/translation.png
Binary file added pgml-docs/docs/images/zero-shot-classification.png
571 changes: 317 additions & 254 deletions pgml-extension/Cargo.lock
Large diffs are not rendered by default.
6 changes: 3 additions & 3 deletions pgml-extension/Cargo.toml
2 changes: 1 addition & 1 deletion pgml-extension/Dockerfile
90 changes: 90 additions & 0 deletions pgml-extension/examples/finetune.sql
@@ -0,0 +1,90 @@
-- Exit on error (psql)
\set ON_ERROR_STOP true
\timing on

-- Translation: fine-tune an English-to-Spanish model on a 10-row sample of KDE4
SELECT pgml.load_dataset('kde4', kwargs => '{"lang1": "en", "lang2": "es"}');

CREATE OR REPLACE VIEW kde4_en_to_es AS
SELECT translation->>'en' AS "en", translation->>'es' AS "es"
FROM pgml.kde4
LIMIT 10;

SELECT pgml.tune(
    'Translate English to Spanish',
    task => 'translation',
    relation_name => 'kde4_en_to_es',
    y_column_name => 'es', -- translate into Spanish
    model_name => 'Helsinki-NLP/opus-mt-en-es',
    hyperparams => '{
        "learning_rate": 2e-5,
        "per_device_train_batch_size": 16,
        "per_device_eval_batch_size": 16,
        "num_train_epochs": 1,
        "weight_decay": 0.01,
        "max_length": 128
    }',
    test_size => 0.5,
    test_sampling => 'last'
);

-- Text classification: sentiment of IMDB reviews
SELECT pgml.load_dataset('imdb');

SELECT pgml.tune(
    'IMDB Review Sentiment',
    task => 'text-classification',
    relation_name => 'pgml.imdb',
    y_column_name => 'label',
    model_name => 'distilbert-base-uncased',
    hyperparams => '{
        "learning_rate": 2e-5,
        "per_device_train_batch_size": 16,
        "per_device_eval_batch_size": 16,
        "num_train_epochs": 1,
        "weight_decay": 0.01
    }',
    test_size => 0.5,
    test_sampling => 'last'
);

SELECT pgml.predict('IMDB Review Sentiment', 'I love SQL');

-- Question answering: SQuAD v2 (arguments passed positionally)
SELECT pgml.load_dataset('squad_v2');

SELECT pgml.tune(
    'SQuAD Q&A v2',
    'question-answering',
    'pgml.squad_v2',
    'answers',
    'deepset/roberta-base-squad2',
    hyperparams => '{
        "evaluation_strategy": "epoch",
        "learning_rate": 2e-5,
        "per_device_train_batch_size": 16,
        "per_device_eval_batch_size": 16,
        "num_train_epochs": 1,
        "weight_decay": 0.01,
        "max_length": 384,
        "stride": 128
    }',
    test_size => 11873,
    test_sampling => 'last'
);

-- Summarization: California bills from billsum
SELECT pgml.load_dataset('billsum', kwargs => '{"split": "ca_test"}');

-- E'\n' is an escape string; a plain '\n' would be a literal backslash and "n"
CREATE OR REPLACE VIEW billsum_training_data
AS SELECT title || E'\n' || text AS text, summary FROM pgml.billsum;

SELECT pgml.tune(
    'Legal Summarization',
    task => 'summarization',
    relation_name => 'billsum_training_data',
    y_column_name => 'summary',
    model_name => 'sshleifer/distilbart-xsum-12-1',
    hyperparams => '{
        "learning_rate": 2e-5,
        "per_device_train_batch_size": 2,
        "per_device_eval_batch_size": 2,
        "num_train_epochs": 1,
        "weight_decay": 0.01,
        "max_input_length": 1024,
        "max_summary_length": 128
    }',
    test_size => 0.01,
    test_sampling => 'last'
);
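Reviewer note on `test_size` and `test_sampling => 'last'`: the calls in finetune.sql pass both fractional values (0.5, 0.01) and an integer (11873). The Python sketch below shows one plausible reading of that pair of parameters. It is an illustration only, not the extension's implementation: the helper name `split_last` is hypothetical, and treating values above 1 as a row count is an assumption inferred from the `11873` argument, not something this diff confirms.

```python
from typing import List, Sequence, Tuple, TypeVar

T = TypeVar("T")


def split_last(rows: Sequence[T], test_size: float) -> Tuple[List[T], List[T]]:
    """Hold out the trailing rows as the test set (hypothetical helper).

    Assumption: test_size <= 1 is a fraction of the rows,
    and test_size > 1 is an absolute number of rows.
    """
    if test_size <= 0:
        raise ValueError("test_size must be positive")
    n = len(rows)
    if test_size > 1:
        n_test = int(test_size)        # absolute row count
    else:
        n_test = round(n * test_size)  # fraction of the rows
    n_test = min(n_test, n)            # never hold out more rows than exist
    # 'last' sampling: training rows come first, test rows are the tail
    return list(rows[: n - n_test]), list(rows[n - n_test:])


rows = list(range(10))
train_half, test_half = split_last(rows, 0.5)  # fractional form
train_abs, test_abs = split_last(rows, 3)      # absolute form
```

Under this reading, `test_sampling => 'last'` keeps the split deterministic and order-dependent, so any ordering of the source relation affects which rows end up in the test set.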
129 changes: 50 additions & 79 deletions pgml-extension/examples/transformers.sql
12 changes: 9 additions & 3 deletions pgml-extension/src/bindings/transformers.py
1 change: 1 addition & 0 deletions pgml-extension/tests/test.sql