如何创建和编辑PHN文件?

时间:2020-07-01 21:08:16

标签: speech-recognition training-data kaldi phoneme

我有一组数据文件:音频数据(2-5秒)和这样的音素转录:

MillisecondsPerFrame: 1.0
END OF HEADER
0.000000 56.149734 .sil
56.149734 117.647057 k
117.647057 146.402084 u
146.402084 185.160431 a
185.160431 248.509628 l
248.509628 285.319092 e
285.319092 322.250793 s
322.250793 365.473999 l
365.473999 423.770111 a
423.770111 455.178345 d
455.178345 495.744049 i
495.744049 577.214355 f
577.214355 628.564514 e
628.564514 662.164063 r(
662.164063 761.060730 e
761.060730 802.901672 n
802.901672 852.983948 s
852.983948 877.074158 i
877.074158 912.892517 a
912.892517 955.367371 d
955.367371 1001.962952 e
1001.962952 1058.701782 s
1058.701782 1117.659424 t
1117.659424 1163.810303 e
1163.810303 1226.305176 g
1226.305176 1267.187744 o
1267.187744 1322.426270 b
1322.426270 1384.236694 i
1384.236694 1449.757324 e
1449.757324 1497.879395 r(
1497.879395 1567.449097 n
1567.449097 1718.582886 o
1718.582886 1747.437642 .sil

这是训练Kaldi ASR的数据集。

如何创建/生成此类文件?

谢谢!

0 个答案:

没有答案