Movatterモバイル変換


[0]ホーム

URL:


Skip to content

Navigation Menu

Search code, repositories, users, issues, pull requests...

Provide feedback

We read every piece of feedback, and take your input very seriously.

Saved searches

Use saved searches to filter your results more quickly

Sign up

An end to end ASR Transformer model training repo

License

NotificationsYou must be signed in to change notification settings

MegEngine/End-to-end-ASR-Transformer

Repository files navigation

  • 本项目基于transformer 6*encoder+6*decoder的基本结构构造的端到端的语音识别系统

Model

Instructions

  • 1.数据准备:
    • 自行下载数据,遵循文件结构如下:
├── data│   ├── train│   ├── dev│   ├── test
  • 2.数据预处理:
    • 运行prepare_data.py对数据进行预处理, 获得整个词表,每个样本音频的mel-scale-spectrogram,文本的token-ids
  • 3.模型训练:
    • 运行train_transformer.py --ngpus 8进行transformer网络的训练. 该网络输入mel-scale-spectrogram, 输出token-ids
  • 4.模型推理:
    • 运行evlauate.py在dev/test上测试准确率

Acknowledgements

Reference

Releases

No releases published

Packages

No packages published

Languages


[8]ページ先頭

©2009-2025 Movatter.jp