- Notifications
You must be signed in to change notification settings - Fork127
Open
Description
Thanks for your excellent working!
I want to training my g2p with other language, in my case is vietnamese
phạc ph a_T5 c2num n u_T0 m2rim r i_T0 m2giẫn gi a3_T4 n2toăm t oa2_T0 m2lịu l iu_T5cựi c u2i_T5õng o_T4 ng2I get error:
INFO:phonetisaurus-train:2023-09-06 17:51:21: Checking command configuration...DEBUG:phonetisaurus-train:2023-09-06 17:51:21: Directory does not exist. Trying to create.INFO:phonetisaurus-train:2023-09-06 17:51:21: Checking lexicon for reserved characters: '}', '|', '_'...DEBUG:phonetisaurus-train:2023-09-06 17:51:21: arpa_path: train/model.o8.arpaDEBUG:phonetisaurus-train:2023-09-06 17:51:21: corpus_path: train/model.corpusDEBUG:phonetisaurus-train:2023-09-06 17:51:21: dir_prefix: trainDEBUG:phonetisaurus-train:2023-09-06 17:51:21: grow: FalseDEBUG:phonetisaurus-train:2023-09-06 17:51:21: lexicon_file: /tmp/tmp53qaxdn7.txtDEBUG:phonetisaurus-train:2023-09-06 17:51:21: logger: <Logger phonetisaurus-train (DEBUG)>DEBUG:phonetisaurus-train:2023-09-06 17:51:21: makeJointNgramCommand: <bound method G2PModelTrainer._mitlm of <__main__.G2PModelTrainer object at 0x7fd3aae0ec40>>DEBUG:phonetisaurus-train:2023-09-06 17:51:21: model_path: train/model.fstDEBUG:phonetisaurus-train:2023-09-06 17:51:21: model_prefix: modelDEBUG:phonetisaurus-train:2023-09-06 17:51:21: ngram_order: 8DEBUG:phonetisaurus-train:2023-09-06 17:51:21: seq1_del: FalseDEBUG:phonetisaurus-train:2023-09-06 17:51:21: seq1_max: 2DEBUG:phonetisaurus-train:2023-09-06 17:51:21: seq2_del: TrueDEBUG:phonetisaurus-train:2023-09-06 17:51:21: seq2_max: 2DEBUG:phonetisaurus-train:2023-09-06 17:51:21: verbose: TrueDEBUG:phonetisaurus-train:2023-09-06 17:51:21: phonetisaurus-align --input=/tmp/tmp53qaxdn7.txt --ofile=train/model.corpus --seq1_del=false --seq2_del=true --seq1_max=2 --seq2_max=2 --grow=falseINFO:phonetisaurus-train:2023-09-06 17:51:21: Aligning lexicon...GitRevision: packageLoading input file: /tmp/tmp53qaxdn7.txtPlease provide a valid input file.ERROR:phonetisaurus-train:2023-09-06 17:51:21: Alignment failed. Exiting.Traceback (most recent call last): File "/home/tupk/anaconda3/envs/nlp/bin/phonetisaurus", line 8, in <module> sys.exit(main()) File "/home/tupk/anaconda3/envs/nlp/lib/python3.8/site-packages/phonetisaurus/__main__.py", line 74, in main do_train(args, casing, env) File "/home/tupk/anaconda3/envs/nlp/lib/python3.8/site-packages/phonetisaurus/__main__.py", line 209, in do_train train(lexicon=lexicon, model_path=args.model, corpus_path=args.corpus, env=env) File "/home/tupk/anaconda3/envs/nlp/lib/python3.8/site-packages/phonetisaurus/__init__.py", line 121, in train subprocess.check_call(train_cmd, cwd=temp_dir_str, env=env) File "/home/tupk/anaconda3/envs/nlp/lib/python3.8/subprocess.py", line 364, in check_call raise CalledProcessError(retcode, cmd)subprocess.CalledProcessError: Command '['phonetisaurus-train', '--lexicon', '/tmp/tmp53qaxdn7.txt', '--seq2_del', '--verbose']' returned non-zero exit status 1.But if I train withEnglish lexicon, no problem
How can I fix it?
Thank you
Metadata
Metadata
Assignees
Labels
No labels