- Notifications
You must be signed in to change notification settings - Fork327
Finetuning Granite Speech#307
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to ourterms of service andprivacy statement. We’ll occasionally send you account related emails.
Already on GitHub?Sign in to your account
base:main
Are you sure you want to change the base?
Uh oh!
There was an error while loading.Please reload this page.
Conversation
Check out this pull request on See visual diffs & provide feedback on Jupyter Notebooks. Powered byReviewNB |
Hi, thanks for your contribution! The cookbook recipes are more focused on applied use cases so it'd be awesome if you could tailor it more towards solving a specific problem or use case. |
jack-tol commentedJun 30, 2025 • edited
Loading Uh oh!
There was an error while loading.Please reload this page.
edited
Uh oh!
There was an error while loading.Please reload this page.
Might not really be my place to say, but even though this script perhaps doesn't focus on tackling a specific fine-tuning use-case (i.e. domain specific fine-tuning on medical audio etc.), it is nevertheless very important to provide the open-source community with a script to fine-tune a new open-source model on their custom data. Maybe this is in the works already and I'm just jumping the gun, but this contribution surely should exist somewhere within the cookbook or some other resource until perhaps a better, and more robust implementation is available. Just my thoughts. |
Absolutely, we're happy to have a link for it in the Granite Speechdocs in Transformers if nothing else! |
avihu111 commentedJul 1, 2025 • edited
Loading Uh oh!
There was an error while loading.Please reload this page.
edited
Uh oh!
There was an error while loading.Please reload this page.
Hi@stevhliu, thanks for the feedback! We can also finetune Granite Speech on an unseen task like spoken question answering, but I fear people won't find it as useful (a finetuning script was requestedhere andhere ). I hope it will be suitable for the cookbook - I like the fact that the huggingface webpage presents the notebook nicely. 🙏 |
I'm wondering if there is some way we can apply your fine-tuning recipe to a more practical application. For example, you can fine-tune Granite Speech and build a Space that transcribes meeting notes, captions videos, etc. This will help you extend the notebook and demonstrate how you can build an AI application with it. If you decide to keep it as fine-tuning only, then I think it's best to add it to the Granite Speech docs. Thanks again and we really appreciate the time and effort you put into creating this notebook! 🤗 |
Thanks,@stevhliu. |
Yeah, you can open aPR on the Transformers repo and create a |
What does this PR do?
This PR adds a notebook that shows how to finetuneGranite Speech, an open-source model that leads the OpenASR leaderboard.
Who can review?
@merveenoyan@stevhliu can you give that a look? 🙏