Hub documentation
Quickstart

In this guide, you will run a Job to fine-tune an open-source model on Hugging Face infrastructure in only a few minutes. Make sure you are logged in to Hugging Face and have access to your Jobs page, which is available if you have a PRO account or a Team or Enterprise subscription.

Getting started

First install the Hugging Face CLI:

1. Install the CLI

Recommended approach:

>>> curl -LsSf https://hf.co/cli/install.sh | bash

Or using Homebrew:

>>> brew install huggingface-cli

Or using uv:

>>> uv tool install hf

2. Login to your Hugging Face account

Login

>>> hf auth login

3. Create your first jobs using the hf jobs command

Run a UV command or script

>>> hf jobs uv run python -c 'print("Hello from the cloud!")'
Job started with ID: 693aef401a39f67af5a41c0e
View at: https://huggingface.co/jobs/lhoestq/693aef401a39f67af5a41c0e
Hello from the cloud!

>>> echo "print('Hello from uv script!')" > script.py
>>> hf jobs uv run script.py
Job started with ID: 695f6cd8d2f3efac77e8cf7f
View at: https://huggingface.co/jobs/lhoestq/695f6cd8d2f3efac77e8cf7f
Hello from uv script!

Run a Docker command

>>> hf jobs run ubuntu echo 'Hello from the cloud!'
Job started with ID: 693aee76c67c9f186cfe233e
View at: https://huggingface.co/jobs/lhoestq/693aee76c67c9f186cfe233e
Hello from the cloud!

4. Check your first jobs

The job logs appear in your terminal, but you can also see them on your Jobs page. Open the job page to see the job's information, status, and logs.
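The job page isn't the only way to check on jobs: the CLI has inspection subcommands as well. A quick sketch, assuming the `ps`, `inspect`, and `logs` subcommands of `hf jobs` available in recent CLI versions (the job ID below is the one printed at submission):

```shell
# List your currently running jobs
hf jobs ps

# Show the metadata of one job (status, hardware, timestamps)
hf jobs inspect 693aef401a39f67af5a41c0e

# Stream the logs of a job
hf jobs logs 693aef401a39f67af5a41c0e
```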

The training script

Here is a simple training script that fine-tunes a base model into a conversational model using Supervised Fine-Tuning (SFT). It uses the Qwen/Qwen2.5-0.5B model, the trl-lib/Capybara dataset, and the TRL library, and saves the resulting model to your Hugging Face account under the name "Qwen2.5-0.5B-SFT":

from datasets import load_dataset
from trl import SFTTrainer

dataset = load_dataset("trl-lib/Capybara", split="train")

trainer = SFTTrainer(
    model="Qwen/Qwen2.5-0.5B",
    train_dataset=dataset,
)
trainer.train()

trainer.push_to_hub("Qwen2.5-0.5B-SFT")

Save this script as train.py; you can now run it with UV on Hugging Face Jobs.

Run the training job

hf jobs takes several arguments: select the hardware with --flavor, choose a maximum duration with --timeout, and pass environment variables with --env and --secrets. Here we use the A100 Large GPU flavor with --flavor a100-large and pass your Hugging Face token as a secret with --secrets HF_TOKEN in order to be able to push the resulting model to your account.

Moreover, UV accepts the --with argument to define Python dependencies, so we use --with trl to make the trl library available.
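To make the flag plumbing concrete, here is a tiny illustrative helper (hypothetical — `build_jobs_command` is not part of any Hugging Face library) that assembles the full invocation from the pieces discussed above:

```python
def build_jobs_command(script, flavor, timeout, deps=(), secrets=()):
    """Assemble an `hf jobs uv run` invocation as an argument list.

    Purely illustrative: this helper is not part of huggingface_hub.
    """
    cmd = ["hf", "jobs", "uv", "run", "--flavor", flavor, "--timeout", timeout]
    for dep in deps:
        cmd += ["--with", dep]          # one --with per dependency
    for secret in secrets:
        cmd += ["--secrets", secret]    # one --secrets per secret name
    cmd.append(script)                  # the script to run comes last
    return cmd

print(" ".join(build_jobs_command(
    "train.py", flavor="a100-large", timeout="6h",
    deps=["trl"], secrets=["HF_TOKEN"],
)))
# → hf jobs uv run --flavor a100-large --timeout 6h --with trl --secrets HF_TOKEN train.py
```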

You can now run the final command which looks like this:

hf jobs uv run \
    --flavor a100-large \
    --timeout 6h \
    --with trl \
    --secrets HF_TOKEN \
    train.py

The logs appear in your terminal, and you can safely press Ctrl+C to stop streaming them; the job will keep running.

...
Downloaded nvidia-cudnn-cu12
Downloaded torch
Installed 66 packages in 233ms
Generating train split: 100%|██████████| 15806/15806 [00:00<00:00, 76686.50 examples/s]
Generating test split: 100%|██████████| 200/200 [00:00<00:00, 43880.36 examples/s]
Tokenizing train dataset: 100%|██████████| 15806/15806 [00:41<00:00, 384.97 examples/s]
Truncating train dataset: 100%|██████████| 15806/15806 [00:00<00:00, 212272.92 examples/s]
The model is already on multiple devices. Skipping the move to device specified in `args`.
The tokenizer has new PAD/BOS/EOS tokens that differ from the model config and generation config. The model config and generation config were aligned accordingly, being updated with the tokenizer's values. Updated tokens: {'bos_token_id': None, 'pad_token_id': 151643}.
{'loss': 1.7357, 'grad_norm': 4.8733229637146, 'learning_rate': 1.9969635627530365e-05, 'entropy': 1.7238958358764649, 'num_tokens': 59528.0, 'mean_token_accuracy': 0.6124177813529968, 'epoch': 0.01}
{'loss': 1.6239, 'grad_norm': 6.200186729431152, 'learning_rate': 1.9935897435897437e-05, 'entropy': 1.644005584716797, 'num_tokens': 115219.0, 'mean_token_accuracy': 0.6259662985801697, 'epoch': 0.01}
{'loss': 1.4449, 'grad_norm': 6.167325496673584, 'learning_rate': 1.990215924426451e-05, 'entropy': 1.5156117916107177, 'num_tokens': 171787.0, 'mean_token_accuracy': 0.6586395859718323, 'epoch': 0.02}
{'loss': 1.6023, 'grad_norm': 5.133708953857422, 'learning_rate': 1.986842105263158e-05, 'entropy': 1.6885507702827454, 'num_tokens': 226067.0, 'mean_token_accuracy': 0.6271904468536377, 'epoch': 0.02}

Follow the Job's progress on the job page on Hugging Face.

Monitor GPU usage and other metrics in the CLI or in the macOS menu bar. With the CLI you get:

>>> hf jobs stats
JOB ID                   CPU % NUM CPU MEM % MEM USAGE        NET I/O         GPU UTIL % GPU MEM % GPU MEM USAGE
------------------------ ----- ------- ----- ---------------- --------------- ---------- --------- ---------------
695e83c5d2f3efac77e8cf18 8%    12.0    7.18% 10.9GB / 152.5GB 0.0bps / 0.0bps 100%       31.92%    25.9GB / 81.2GB

Once the job is done, find your model on your account.

Congrats! You just ran your first Job to fine-tune an open-source model 🔥

Feel free to try out your model locally and evaluate it with e.g. transformers by clicking on “Use this model”, or deploy it to Inference Endpoints in one click using the “Deploy” button.
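As a sketch of what trying the model locally could look like (assuming the job pushed the model successfully; replace `your-username` with your actual Hugging Face username):

```python
from transformers import pipeline

# Load your fine-tuned model from the Hub.
# "your-username" is a placeholder for your Hugging Face username.
generator = pipeline("text-generation", model="your-username/Qwen2.5-0.5B-SFT")

# Chat-style input, matching the conversational format learned during SFT.
messages = [{"role": "user", "content": "What is supervised fine-tuning?"}]
print(generator(messages, max_new_tokens=128)[0]["generated_text"])
```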
