- Notifications
You must be signed in to change notification settings - Fork0
daedalusLAB/daedalus_pipeline
Folders and files
| Name | Name | Last commit message | Last commit date | |
|---|---|---|---|---|
Repository files navigation
Pipeline is a simple and easy tools to create vertical (.vrt) and corpus files from videos.
It rests on this tools to create the pipeline:
- OpenAI Whisper: is a general-purpose speech recognition model. It is trained on a large dataset of diverse audio and is also a multi-task model that can perform multilingual speech recognition as well as speech translation and language identification.
- Spacy: is a free, open-source library for Natural Language Processing in Python. It features NER, POS tagging, dependency parsing, word vectors and more.
- Gentle: is a forced aligner for speech and text. It uses a probabilistic model to align speech to text, and is trained on a large dataset of speech and text. It is also a multi-task model that can perform multilingual speech recognition as well as speech translation and language identification.
Take a videofile as input and generate the corresponding vtt file namedvideofile.vtt in the same directory.
./daedalus_pipeline.py --fileInstall next dependencies to make it work:
Requires the command-line toolffmpeg to be installed on your system, which is available from most package managers:
# on Ubuntu or Debiansudo apt update&& sudo apt install ffmpeg# on MacOS using Homebrew (https://brew.sh/)brew install ffmpeg# on Windows using Chocolatey (https://chocolatey.org/)choco install ffmpeg
Requires spacy models:
python -m spacy download en_core_web_lg pip install -r requirements.txtRequires to install gentle fromhere and be in your path.
About
Pipeline to convert a video file into a vrt file
Resources
Uh oh!
There was an error while loading.Please reload this page.
Stars
Watchers
Forks
Releases
No releases published
Packages0
No packages published
