Now, fine-tune the Language Model for text classification using the created training view. In the following sections, you will see a detailed explanation of the different parameters used during fine-tuning. The fine-tuned model is pushed to your public Hugging Face Hub periodically. A new repository will be created under your username using your project name (`imdb_review_sentiment` in this case). You can also choose to push the model to a private repository by setting `hub_private_repo: true` in the training arguments.
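As a rough sketch, a fine-tuning call with Hub push enabled might look like the following. The view name, base model, and hyperparameter values here are illustrative assumptions; substitute the ones from your own setup.

```sql
-- Sketch of a fine-tuning call with periodic pushes to a private Hub repo.
-- The relation_name, model_name, and hyperparams below are placeholders.
SELECT pgml.tune(
    'imdb_review_sentiment',                      -- project name; also used for the Hub repository
    task          => 'text-classification',
    relation_name => 'pgml.imdb_train_view',      -- the training view created earlier (assumed name)
    model_name    => 'distilbert-base-uncased',   -- base model to fine-tune (assumed)
    test_size     => 0.2,
    test_sampling => 'last',
    hyperparams   => '{
        "training_args": {
            "learning_rate": 2e-5,
            "num_train_epochs": 2,
            "push_to_hub": true,
            "hub_private_repo": true
        }
    }'
);
```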
By following these steps, you can effectively restart training from a previously trained model, allowing for further refinement and adaptation of the model based on new requirements or insights. Adjust parameters as needed for your specific use case and dataset.
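One plausible way to restart training is to point `model_name` at the previously pushed Hub repository instead of the original base model. The repository path and hyperparameters below are hypothetical placeholders, not values from this guide.

```sql
-- Hypothetical restart: resume fine-tuning from the previously pushed
-- Hub repository ("your-username/imdb_review_sentiment" is a placeholder).
SELECT pgml.tune(
    'imdb_review_sentiment',
    task          => 'text-classification',
    relation_name => 'pgml.imdb_train_view',                  -- assumed training view name
    model_name    => 'your-username/imdb_review_sentiment',   -- placeholder Hub repo
    test_size     => 0.2,
    test_sampling => 'last',
    hyperparams   => '{
        "training_args": {
            "learning_rate": 1e-5,
            "num_train_epochs": 1
        }
    }'
);
```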
## 8. Hugging Face Hub vs. PostgresML as Model Repository
We utilize the Hugging Face Hub as the primary repository for fine-tuning Large Language Models (LLMs). Leveraging the HF hub offers several advantages:
* The HF repository serves as the platform for pushing incremental updates to the model during the training process. In the event of any disruptions in the database connection, you have the flexibility to resume training from where it was left off.
* If you prefer to keep the model private, you can push it to a private repository within the Hugging Face Hub. This ensures that the model is not publicly accessible by setting the parameter `hub_private_repo` to `true`.
* The `pgml.transform` function, designed around utilizing models from the Hugging Face Hub, can be reused without any modifications.
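For instance, a `pgml.transform` call can reference the fine-tuned model by its Hub repository name just like any other Hub model. The repository path below is a placeholder; the actual name follows your username and project name.

```sql
-- The same pgml.transform call works whether "model" refers to a public
-- Hub model or your fine-tuned repository (placeholder username below).
SELECT pgml.transform(
    task   => '{
        "task": "text-classification",
        "model": "your-username/imdb_review_sentiment"
    }'::JSONB,
    inputs => ARRAY['I absolutely loved this movie!']
);
```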
However, in certain scenarios, pushing the model to a central repository and pulling it for inference may not be the most suitable approach. To address this situation, we save all the model weights and additional artifacts, such as tokenizer configurations and vocabulary, in the `pgml.files` table at the end of the training process. It's important to note that as of the current writing, hooks to use models directly from `pgml.files` in the `pgml.transform` function have not been implemented. We welcome Pull Requests (PRs) from the community to enhance this functionality.
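You can inspect the stored artifacts with an ordinary query. This is only an illustrative sketch; the column names used here are assumptions about the `pgml.files` schema and may differ in your PostgresML version.

```sql
-- Illustrative look at saved model artifacts; column names are assumed.
SELECT model_id, path, octet_length(data) AS bytes
FROM pgml.files
ORDER BY model_id DESC
LIMIT 10;
```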