- Notifications
You must be signed in to change notification settings - Fork15
A collection of basic python modules for spoken natural language processing
License
gooofy/py-nltools
Folders and files
Name | Name | Last commit message | Last commit date | |
---|---|---|---|---|
Repository files navigation
A collection of abstraction layers and support functions that form the naturallanguage processing foundation of the Zamia AI project:
phonetics
: translation functions between various phonetic alphabets (IPA, X-SAMPA, X-ARPABET, …)tts
: abstraction layer towards using eSpeak NG, MaryTTS, SVOX Pico TTS or a remote TTS server and sequitur g2pasr
: abstraction layer towards using kaldi-asr and pocketsphinx, models can be found here:http://goofy.zamia.org/voxforge/sequiturclient
: g2p using sequiturpulseplayer
: audio playback through pulseaudiopulserecorder
: audio recording through pulseaudiotokenizer
: english, french and german word tokenizers aimed at spoken language applicationsthreadpool
: simple thread pool implementationvad
: Voice Activity Detection finite state machine based on webrtc VADmacro_engine
: Simple macro engine aimed at generating natural language expansions
I plan to add modules as I need them in the Zamia AI projects. Some modules likephonetics
andtokenizer
have some overlap with larger projects like NLTK or spaCy - my modules tend to be more hands-on and simple mindedthan these and therefore are in no way meant to replace them.
Note: probably incomplete.
Python 2.7
for TTS one or more of:
MaryTTS, py-marytts
espeak-ng, py-espeak-ng
SVOX Pico TTS, py-picotts
for ASR one or more of:
kaldi-asr 5.1, py-kaldi-asr
pocketsphinx
sequitur
pulseaudio
webrtc
My own code is Apache-2.0 licensed unless otherwise noted in the script’s copyrightheaders.
Some scripts and files are based on works of others, in those cases it is myintention to keep the original license intact. Please make sure to check thecopyright headers inside for more information.
Guenter Bartsch <guenter@zamia.org>Paul Guyot <pguyot@kallisys.net>
About
A collection of basic python modules for spoken natural language processing