Finetuning Granite Speech #307

Open

avihu111 wants to merge 5 commits into huggingface:main from avihu111:add_granite_speech

avihu111 commented Jun 29, 2025

What does this PR do?

This PR adds a notebook that shows how to finetune Granite Speech, an open-source model that leads the OpenASR leaderboard.

Who can review?

@merveenoyan @stevhliu can you give that a look? 🙏

finetuning granite speech, initial commit

6a7c815

review-notebook-app bot commented Jun 29, 2025

Check out this pull request on ReviewNB.

See visual diffs & provide feedback on Jupyter Notebooks.

Powered by ReviewNB

avihu111 added 4 commits

June 29, 2025 15:58

update with used packages

66a2963

minor

d11d148

typos

19d89a0

minor

185e05f

Member

stevhliu commented Jun 30, 2025

Hi, thanks for your contribution!

The cookbook recipes are more focused on applied use cases so it'd be awesome if you could tailor it more towards solving a specific problem or use case.

jack-tol commented Jun 30, 2025 (edited)

> Hi, thanks for your contribution!
> The cookbook recipes are more focused on applied use cases so it'd be awesome if you could tailor it more towards solving a specific problem or use case.

Might not really be my place to say, but even though this script perhaps doesn't tackle a specific fine-tuning use case (e.g., domain-specific fine-tuning on medical audio), it is nevertheless very important to give the open-source community a script for fine-tuning a new open-source model on their custom data. Maybe this is in the works already and I'm just jumping the gun, but this contribution surely should exist somewhere within the cookbook or some other resource, at least until a better and more robust implementation is available. Just my thoughts.

Member

stevhliu commented Jun 30, 2025

Absolutely, we're happy to have a link for it in the Granite Speech docs in Transformers if nothing else!

Author

avihu111 commented Jul 1, 2025 (edited)

Hi @stevhliu, thanks for the feedback!
I expected (like @jack-tol) that the most common use case would be finetuning Granite Speech on custom data (e.g., a new language, unseen conditions, etc.).
My goal was to show the best way to run inference/finetune the model, along with useful code snippets and a concrete (yet concise and easy to run) example.

We can also finetune Granite Speech on an unseen task like spoken question answering, but I fear people won't find it as useful (a finetuning script was requested here and here).

I hope it will be suitable for the cookbook - I like the fact that the huggingface webpage presents the notebook nicely. 🙏
If not, I assume the best approach is to add it to the Granite Speech docs.

Member

stevhliu commented Jul 1, 2025

I'm wondering if there is some way we can apply your fine-tuning recipe to a more practical application. For example, you can fine-tune Granite Speech and build a Space that transcribes meeting notes, captions videos, etc. This will help you extend the notebook and demonstrate how you can build an AI application with it.

If you decide to keep it as fine-tuning only, then I think it's best to add it to the Granite Speech docs.

Thanks again and we really appreciate the time and effort you put into creating this notebook! 🤗

Author

avihu111 commented Jul 3, 2025

Thanks, @stevhliu.
Can you advise on the best way to add this to the Granite Speech docs?
Most of the examples I've seen are short code snippets. Do you have a docs page with an example notebook that you can share?
Any help would be greatly appreciated. Thanks!

Member

stevhliu commented Jul 3, 2025

Yeah, you can open a PR on the Transformers repo and create a `## Resources` section on the Granite Speech docs with a link to your notebook.
