CMUSphinx Baum Welch训练第一次迭代失败的原因

时间:2016-08-04 14:12:40

标签: machine-learning speech-recognition cmusphinx

我正在尝试构建一个新的声学模型,我使用语音合成器来训练它,训练文件的总估计小时数为: 0.0389416666666667 ,但是在Baum Welch训练后我一直收到错误,这是我得到的错误:

Sphinxtrain path: /usr/local/lib/sphinxtrain Sphinxtrain binaries path: /usr/local/libexec/sphinxtrain Running the training MODULE: 000 Computing feature from audio files Extracting features from segments starting at (part 1 of 1) Extracting features from segments starting at (part 1 of 1) Feature extraction is done MODULE: 00 verify training files Phase 1: Checking to see if the dict and filler dict agrees with the phonelist file. Found 81 words using 49 phones Phase 2: Checking to make sure there are not duplicate entries in the dictionary Phase 3: Check general format for the fileids file; utterance length (must be positive); files exist Phase 4: Checking number of lines in the transcript file should match lines in fileids file Phase 5: Determine amount of training data, see if n_tied_states seems reasonable. Estimated Total Hours Training: 0.0389416666666667 This is a small amount of data, no comment at this time Phase 6: Checking that all the words in the transcript are in the dictionary Words in dictionary: 78 Words in filler dictionary: 3 Phase 7: Checking that all the phones in the transcript are in the phonelist, and all phones in the phonelist appear at least once MODULE: 0000 train grapheme-to-phoneme model Skipped (set $CFG_G2P_MODEL = 'yes' to enable) MODULE: 01 Train LDA transformation Skipped for multistream setup, see CFG_NUM_STREAMS configuration LDA/MLLT only has sense for single stream features Skipping LDA training MODULE: 02 Train MLLT transformation Skipped for multistream setup, see CFG_NUM_STREAMS configuration LDA/MLLT only has sense for single stream features Skipping MLLT training MODULE: 05 Vector Quantization ERROR: This step had 2 ERROR messages and 0 WARNING messages. Please check the log file for details. MODULE: 10 Training Context Independent models for forced alignment and VTLN Skipped: $ST::CFG_FORCEDALIGN set to 'no' in sphinx_train.cfg Skipped: $ST::CFG_VTLN set to 'no' in sphinx_train.cfg MODULE: 11 Force-aligning transcripts Skipped: $ST::CFG_FORCEDALIGN set to 'no' in sphinx_train.cfg MODULE: 12 Force-aligning data for VTLN Skipped: $ST::CFG_VTLN set to 'no' in sphinx_train.cfg MODULE: 20 Training Context Independent models Phase 1: Cleaning up directories: accumulator...logs...qmanager...models... Phase 2: Flat initialize Phase 3: Forward-Backward Baum welch starting for 256 Gaussian(s), iteration: 1 (1 of 1) 0% 20% 30% 60% 90% 100% ERROR: This step had 86 ERROR messages and 0 WARNING messages. Please check the log file for details. ERROR: Training failed in iteration 1

我还将CFG_CD_TRAIN设置为'no',因为我的训练数据很少。

编辑: 我检查了日志文件,这里是日志的pastebin: http://pastebin.com/YBSqfxYW

0 个答案:

没有答案