Movatterモバイル変換


[0]ホーム

URL:


Skip to content

Navigation Menu

Search code, repositories, users, issues, pull requests...

Provide feedback

We read every piece of feedback, and take your input very seriously.

Saved searches

Use saved searches to filter your results more quickly

Sign up

A collection of basic python modules for spoken natural language processing

License

NotificationsYou must be signed in to change notification settings

gooofy/py-nltools

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

A collection of abstraction layers and support functions that form the naturallanguage processing foundation of the Zamia AI project:

  • phonetics: translation functions between various phonetic alphabets (IPA, X-SAMPA, X-ARPABET, …​)

  • tts: abstraction layer towards using eSpeak NG, MaryTTS, SVOX Pico TTS or a remote TTS server and sequitur g2p

  • asr: abstraction layer towards using kaldi-asr and pocketsphinx, models can be found here:http://goofy.zamia.org/voxforge/

  • sequiturclient: g2p using sequitur

  • pulseplayer: audio playback through pulseaudio

  • pulserecorder: audio recording through pulseaudio

  • tokenizer: english, french and german word tokenizers aimed at spoken language applications

  • threadpool: simple thread pool implementation

  • vad: Voice Activity Detection finite state machine based on webrtc VAD

  • macro_engine: Simple macro engine aimed at generating natural language expansions

I plan to add modules as I need them in the Zamia AI projects. Some modules likephonetics andtokenizerhave some overlap with larger projects like NLTK or spaCy - my modules tend to be more hands-on and simple mindedthan these and therefore are in no way meant to replace them.

Highlevel Diagram

Requirements

Note: probably incomplete.

  • Python 2.7

  • for TTS one or more of:

    • MaryTTS, py-marytts

    • espeak-ng, py-espeak-ng

    • SVOX Pico TTS, py-picotts

  • for ASR one or more of:

    • kaldi-asr 5.1, py-kaldi-asr

    • pocketsphinx

  • sequitur

  • pulseaudio

  • webrtc

License

My own code is Apache-2.0 licensed unless otherwise noted in the script’s copyrightheaders.

Some scripts and files are based on works of others, in those cases it is myintention to keep the original license intact. Please make sure to check thecopyright headers inside for more information.

Authors

Guenter Bartsch <guenter@zamia.org>Paul Guyot <pguyot@kallisys.net>

Packages

No packages published

Languages


[8]ページ先頭

©2009-2025 Movatter.jp