NotificationsYou must be signed in to change notification settings
Fork0
Star2

Pipeline to convert a video file into a vrt file

You must be signed in to change notification settings

Folders and files

Name		Name	Last commit message	Last commit date
Latest commit History 19 Commits
imgs		imgs
.gitignore		.gitignore
Readme.md		Readme.md
cwb-make_CORPUSNAME.sh		cwb-make_CORPUSNAME.sh
daedalus_pipeline.py		daedalus_pipeline.py
daedalus_pipeline_rosa_es.py		daedalus_pipeline_rosa_es.py
per_word.py		per_word.py
pytorch_align.py		pytorch_align.py
requirements.txt		requirements.txt

Repository files navigation

Pipeline

Pipeline is a simple and easy tools to create vertical (.vrt) and corpus files from videos.

It rests on this tools to create the pipeline:

OpenAI Whisper: is a general-purpose speech recognition model. It is trained on a large dataset of diverse audio and is also a multi-task model that can perform multilingual speech recognition as well as speech translation and language identification.
Spacy: is a free, open-source library for Natural Language Processing in Python. It features NER, POS tagging, dependency parsing, word vectors and more.
Gentle: is a forced aligner for speech and text. It uses a probabilistic model to align speech to text, and is trained on a large dataset of speech and text. It is also a multi-task model that can perform multilingual speech recognition as well as speech translation and language identification.

Usage

Take a videofile as input and generate the corresponding vtt file namedvideofile.vtt in the same directory.

./daedalus_pipeline.py --file

Installation

Install next dependencies to make it work:

Requirements

ffmpeg

Requires the command-line toolffmpeg to be installed on your system, which is available from most package managers:

# on Ubuntu or Debiansudo apt update&& sudo apt install ffmpeg# on MacOS using Homebrew (https://brew.sh/)brew install ffmpeg# on Windows using Chocolatey (https://chocolatey.org/)choco install ffmpeg

Spacy models

Requires spacy models:

python -m spacy download en_core_web_lg

Some python packages

pip install -r requirements.txt

GENTLE

Requires to install gentle fromhere and be in your path.

About

Pipeline to convert a video file into a vrt file

Releases

No releases published

Packages

No packages published

Movatterモバイル変換

Navigation Menu

Search code, repositories, users, issues, pull requests...

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Folders and files

Latest commit

History

Repository files navigation

Pipeline

Usage

Installation

Requirements

ffmpeg

Spacy models

Some python packages

GENTLE

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages

Languages

Movatterモバイル変換

daedalusLAB/daedalus_pipeline

Folders and files

Latest commit

History

Repository files navigation

Pipeline

Usage

Installation

Requirements

ffmpeg

Spacy models

Some python packages

GENTLE

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages0

Languages

Packages