Question

我已经按照示例应用程序使用GitHub存储库下的文本生成语音。

https://github.com/Azure-Samples/Cognitive-Speech-TTS/tree/master/Samples-Http/CSharp

我的应用程序运行正常只有问题是说话率或每个单词后的中断/暂停。

输入文字：y u 7 f s d 2 3 e

以下是我正在使用的样本SSML：

<speak version="1.0" xmlns="http://www.w3.org/2001/10/synthesis" xmlns:mstts="http://www.w3.org/2001/mstts" xml:lang="en-IN"><voice xml:lang="en-IN" name="Microsoft Server Speech Text to Speech Voice (en-IN, Ravi, Apollo)">y u 7 f s d 2 3 e</voice></speak>

我想在每个字母表后暂停。因为我正在使用此音频以音频模式获取验证码文本。

请建议正确的方法。

P.S：我不想通过复制粘贴重复整个代码。（使用GIT的样本）

我甚至在下面的链接评论中跟随对话，没有运气。

https://docs.microsoft.com/en-us/azure/cognitive-services/speech/home

Answer 1

这 - ＆gt; ＆＃34; Y＆＃34 ;. ＆＃34; U＆＃34 ;. ＆＃34; 7＆＃34 ;. ＆＃34; F＆＃34 ;. ＆＃34; S＆＃34 ;. ＆＃34; d＆＃34 ;. ＆＃34; 2＆＃34 ;. ＆＃34; 3＆＃34 ;. ＆＃34; E＆＃34 ;. ＆lt; - 它适用于bing语音网页测试。对你来说也应该没问题。这是SSML：

<speak version="1.0" xmlns="http://www.w3.org/2001/10/synthesis" xmlns:mstts="http://www.w3.org/2001/mstts" xml:lang="en-US">
    <voice xml:lang="en-US" name="Microsoft Server Speech Text to Speech Voice (en-US, ZiraRUS)">&quot;y&quot;. &quot;u&quot;. &quot;7&quot;. &quot;f&quot;. &quot;s&quot;. &quot;d&quot;. &quot;2&quot;. &quot;3&quot;. &quot;e&quot;.
    </voice>
</speak>

说话率微软Bing语音API - 文本到语音

1 个答案: