我正在尝试构建一个新的声学模型,我使用语音合成器来训练它,训练文件的总估计小时数为: 0.0389416666666667 ,但是在Baum Welch训练后我一直收到错误,这是我得到的错误:
Sphinxtrain path: /usr/local/lib/sphinxtrain
Sphinxtrain binaries path: /usr/local/libexec/sphinxtrain
Running the training
MODULE: 000 Computing feature from audio files
Extracting features from segments starting at (part 1 of 1)
Extracting features from segments starting at (part 1 of 1)
Feature extraction is done
MODULE: 00 verify training files
Phase 1: Checking to see if the dict and filler dict agrees with the
phonelist file.
Found 81 words using 49 phones
Phase 2: Checking to make sure there are not duplicate entries in
the dictionary
Phase 3: Check general format for the fileids file; utterance length
(must be positive); files exist
Phase 4: Checking number of lines in the transcript file should
match lines in fileids file
Phase 5: Determine amount of training data, see if n_tied_states
seems reasonable.
Estimated Total Hours Training: 0.0389416666666667
This is a small amount of data, no comment at this time
Phase 6: Checking that all the words in the transcript are in the
dictionary
Words in dictionary: 78
Words in filler dictionary: 3
Phase 7: Checking that all the phones in the transcript are in the
phonelist, and all phones in the phonelist appear at least once
MODULE: 0000 train grapheme-to-phoneme model
Skipped (set $CFG_G2P_MODEL = 'yes' to enable)
MODULE: 01 Train LDA transformation
Skipped for multistream setup, see CFG_NUM_STREAMS configuration
LDA/MLLT only has sense for single stream features
Skipping LDA training
MODULE: 02 Train MLLT transformation
Skipped for multistream setup, see CFG_NUM_STREAMS configuration
LDA/MLLT only has sense for single stream features
Skipping MLLT training
MODULE: 05 Vector Quantization
ERROR: This step had 2 ERROR messages and 0 WARNING messages. Please
check the log file for details.
MODULE: 10 Training Context Independent models for forced alignment and
VTLN
Skipped: $ST::CFG_FORCEDALIGN set to 'no' in sphinx_train.cfg
Skipped: $ST::CFG_VTLN set to 'no' in sphinx_train.cfg
MODULE: 11 Force-aligning transcripts
Skipped: $ST::CFG_FORCEDALIGN set to 'no' in sphinx_train.cfg
MODULE: 12 Force-aligning data for VTLN
Skipped: $ST::CFG_VTLN set to 'no' in sphinx_train.cfg
MODULE: 20 Training Context Independent models
Phase 1: Cleaning up directories:
accumulator...logs...qmanager...models...
Phase 2: Flat initialize
Phase 3: Forward-Backward
Baum welch starting for 256 Gaussian(s), iteration: 1 (1 of 1)
0% 20% 30% 60% 90% 100%
ERROR: This step had 86 ERROR messages and 0 WARNING messages. Please
check the log file for details.
ERROR: Training failed in iteration 1
我还将CFG_CD_TRAIN设置为'no',因为我的训练数据很少。
编辑: 我检查了日志文件,这里是日志的pastebin: http://pastebin.com/YBSqfxYW