Question

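My task is to build speech recognition for a list of Russian words, which is why I need to use a Russian acoustic model. To get the best recognition results, I am working on adapting the acoustic model. I followed the documentation link, and everything seemed fine, but when I run the script: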

./bw \
-hmmdir model \
-moddeffn model/mdef.txt \
-ts2cbfn .ptm. \
-feat 1s_c_d_dd \
-svspec 0-12/13-25/26-38 \
-cmn current \
-agc none \
-dictfn zero_ru.dic \
-ctlfn lesniki.fields \
-lsnfn lesniki.transcription \
-accumdir .

I get the following error:

INFO: s3mixw_Io.c(117): Read model/mixture_weights [4159x1x63 array]
FATAL: "mod_In.c", line 358: Number of feature streams in 
mixture_weights file 1 differs from the configured value 3, check the 
command line options

Here is my project with the relevant files: project on git

Answer 1

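This error means that the configuration parameters passed to bw do not match the parameters the model was trained with. Judging from the files, you are using mixture_weights from a continuous model, but you are trying to adapt it as if it were a PTM model. For a continuous model you do not need the -svspec option, but you do need the -lda feature_transform option. The value of -ts2cbfn must be .cont. rather than .ptm.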
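To make the mismatch concrete, here is a minimal shell sketch that uses only the values quoted in the question. It assumes that the middle number in the logged [4159x1x63 array] shape is the stream count stored in the file (which agrees with the "1" in the FATAL message) and that each "/"-separated range in -svspec defines one stream. The variable names are made up for illustration.

```shell
# Values copied from the question; variable names are illustrative only.
log_line='INFO: s3mixw_Io.c(117): Read model/mixture_weights [4159x1x63 array]'
svspec='0-12/13-25/26-38'

# Assumption: the middle dimension of the logged shape is the number of
# feature streams stored in the mixture_weights file.
file_streams=$(printf '%s\n' "$log_line" \
  | sed -n 's/.*\[[0-9]*x\([0-9]*\)x[0-9]* array].*/\1/p')

# Each "/"-separated range in -svspec asks for one feature stream.
cfg_streams=$(printf '%s\n' "$svspec" | awk -F/ '{ print NF }')

echo "streams in file: $file_streams, streams requested: $cfg_streams"
```

With the values above the two counts differ (one stream in the file, three requested), which is the condition the FATAL message reports.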

If you are trying to adapt a continuous model, change the parameters accordingly. Otherwise, you have taken the mixture_weights file from the wrong model.
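As a rough sketch of the continuous-model variant, the command from the question could be adjusted as follows: -svspec is dropped, -ts2cbfn becomes .cont., and -lda is added. The file names are the ones from the question. The location model/feature_transform is an assumption (the file is expected to sit in the model directory), and the remaining options should be checked against the feat.params file shipped with the model before relying on them.

```shell
# Sketch only: arguments for adapting a continuous model with bw.
# model/feature_transform is an assumed path; all other names come from
# the question. -svspec is intentionally absent.
set -- \
  -hmmdir model \
  -moddeffn model/mdef.txt \
  -ts2cbfn .cont. \
  -feat 1s_c_d_dd \
  -lda model/feature_transform \
  -cmn current \
  -agc none \
  -dictfn zero_ru.dic \
  -ctlfn lesniki.fields \
  -lsnfn lesniki.transcription \
  -accumdir .

# Run bw only if it is present in the current directory; otherwise just
# show the command that would be executed.
if [ -x ./bw ]; then
  ./bw "$@"
else
  echo "bw not found; would run: ./bw $*"
fi
```

Keeping the arguments in one list makes it easy to compare them against feat.params line by line before running the adaptation.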
