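I want to build my own TTS (text-to-speech) application for Indonesian using HTS (the HMM-based Speech Synthesis System). I ran into a problem when trying to change the context-dependent labels and the question set to suit my language. How can I do this?
As far as I know, I have to change the context-dependent labels in the file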
data/script/label-full.awk
and the question set in the file
data/script/questions/qst_[version].hed
(I am using the HTS training demo as a sample project.) When I open these files, this is what I see:
(label-full.awk)
### SEGMENT
# boundary
printf "%10.0f %10.0f ", 1e7 * $65, 1e7 * $66
# pp.name
printf "%s", ($63 == "0") ? "x" : $63
# p.name
printf "^%s", ($1 == "0") ? "x" : $1
# c.name
printf "-%s", $2
# n.name
printf "+%s", ($3 == "0") ? "x" : $3
# nn.name
printf "=%s", ($64 == "0") ? "x" : $64
bla..bla..bla..
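If I read this fragment correctly (this is only my guess), each `printf` appends one piece of the label, so the SEGMENT part comes out as two time stamps followed by `pp^p-c+n=nn`, where the five names are the two previous phonemes, the current phoneme, and the two following ones. Here is a rough Python equivalent of what I think it does. The function and argument names are my own, and I am assuming that fields `$65` and `$66` hold start and end times in seconds.

```python
def segment_label(start_s, end_s, pp, p, c, n, nn):
    """Mimic the SEGMENT printf calls: two scaled times, then pp^p-c+n=nn.

    Argument names are hypothetical; they follow the awk comments
    (pp.name, p.name, c.name, n.name, nn.name).
    """
    def or_x(name):
        # The awk script prints "x" when a neighbouring phoneme field is "0".
        return "x" if name == "0" else name

    # Assumption: the inputs are seconds, so 1e7 * t is a count of 100 ns units.
    times = "%10.0f %10.0f " % (1e7 * start_s, 1e7 * end_s)
    # The current phoneme is printed as-is, like `printf "-%s", $2`.
    return times + "%s^%s-%s+%s=%s" % (or_x(pp), or_x(p), c, or_x(n), or_x(nn))


print(segment_label(0.0, 0.5, "0", "sil", "a", "p", "a"))
```

With these made-up values, the phoneme part of the output is `x^sil-a+p=a`, preceded by the two scaled times.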
and the .hed file:
QS "LL-Vowel" {aa^*,ae^*,ah^*,ao^*,aw^*,ax^*,axr^*,ay^*,eh^*,el^*,em^*,en^*,er^*,ey^*,ih^*,ix^*,iy^*,ow^*,oy^*,uh^*,uw^*}
QS "LL-Consonant" {b^*,ch^*,d^*,dh^*,dx^*,f^*,g^*,hh^*,hv^*,jh^*,k^*,l^*,m^*,n^*,nx^*,ng^*,p^*,r^*,s^*,sh^*,t^*,th^*,v^*,w^*,y^*,z^*,zh^*}
QS "LL-Stop" {b^*,d^*,dx^*,g^*,k^*,p^*,t^*}
QS "LL-Nasal" {m^*,n^*,en^*,ng^*}
QS "LL-Fricative" {ch^*,dh^*,f^*,hh^*,hv^*,s^*,sh^*,th^*,v^*,z^*,zh^*}
QS "LL-Liquid" {el^*,hh^*,l^*,r^*,w^*,y^*}
QS "LL-Front" {ae^*,b^*,eh^*,em^*,f^*,ih^*,ix^*,iy^*,m^*,p^*,v^*,w^*}
QS "LL-Central" {ah^*,ao^*,axr^*,d^*,dh^*,dx^*,el^*,en^*,er^*,l^*,n^*,r^*,s^*,t^*,th^*,z^*,zh^*}
QS "LL-Back" {aa^*,ax^*,ch^*,g^*,hh^*,jh^*,k^*,ng^*,ow^*,oy^*,uh^*,uw^*,y^*}
bla..bla..bla...
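My current understanding, which may be incomplete, is that each `QS` line names a yes/no question for decision-tree clustering and that the entries in braces are wildcard patterns matched against the full-context label, with `*` standing for any run of characters. Assuming every entry ends in `*` as in the stock demo file (so `m^*` means "the label starts with `m^`"), the matching can be imitated in Python with `fnmatch`. The helper name and the sample labels below are made up for illustration.

```python
from fnmatch import fnmatchcase


def question_matches(patterns, label):
    """Return True if any wildcard pattern of a question matches the label."""
    return any(fnmatchcase(label, pat) for pat in patterns)


# Patterns of the "LL-Nasal" question, written with the trailing wildcard.
ll_nasal = ["m^*", "n^*", "en^*", "ng^*"]

# Hypothetical labels in the pp^p-c+n=nn form produced by label-full.awk.
print(question_matches(ll_nasal, "n^a-s+i=x"))  # True: label starts with "n^"
print(question_matches(ll_nasal, "a^n-s+i=x"))  # False: "n" is not in the pp slot
```

If this reading is right, a question about a new context would need a pattern that picks out the delimiters surrounding that field in the label, which is why the label format and the question file have to be changed together.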
What do these strings mean? How can I customize these files or add a new context (such as the tone of a syllable)?