如何使用SSML将文本减慢到语音 - 语音声音失真/扭曲/可怕

时间:2012-04-24 12:37:25

标签: ios text-to-speech voice slowdown ssml

我正在使用iPhone的 nuance dragon mobile sdk 来使用文字转语音。
readed文本有点快,我想让它变慢,所以用户可以学习这些单词。我的目标是减慢文本的速度。使用SSML和韵律标签可以很好地工作,请参阅以下代码:

<prosody rate="slow">This is the text which is spoken slow,
but the voice sounds distorted/warped/ghastly</prosody>

声音扭曲,扭曲和可怕 你明白我的意思吗? 我该怎么做才能获得清晰的声音缓慢的语音文字

1 个答案:

答案 0 :(得分:1)

取自此处:http://www.w3.org/TR/speech-synthesis/#S3.2.4

rate: a change in the speaking rate for the contained text. Legal values are: 
a relative change or "x-slow", "slow", "medium", "fast", "x-fast", or "default". 
Labels "x-slow" through "x-fast" represent a sequence of monotonically non-decreasing
speaking rates. When a number is used to specify a relative change it acts as a 
multiplier of the default rate. For example, a value of 1 means no change in speaking 
rate, a value of 2 means a speaking rate twice the default rate, and a value of 0.5 
means a speaking rate of half the default rate. The default rate for a voice depends on 
the language and dialect and on the personality of the voice. The default rate for a 
voice should be such that it is experienced as a normal speaking rate for the voice when 
reading aloud text. Since voices are processor-specific, the default rate will be as 
well.