学习cognitive
的语音转换(TTS)的Microsoft Azure
服务是我的新手。我可以使用TTS
的{{1}}服务成功地将给定的文本转换为音频文件。当我在Azure
中只有一个voice
元素时,它可以很好地工作SSML
个文档。工作XML
的示例是;
SSML
但是,当我有多个语音标签(基于性别)时,就会导致错误。 <speak version="1.0" xml:lang="en-US">
<voice xml:lang="en-US" xml:gender="Male" name="en-US-Jessa24kRUS">
Hello, this is my sample text to convert into audio?
</voice>
</speak>
是:
SSML
错误是:
响应状态代码不表示成功:400(SSML必须包含最多5个语音元素,实际为6个。)
如果有人解释为什么将我限制为五个<speak version="1.0" xml:lang="en-US">
<voice xml:lang="en-US" xml:gender="Male" name="en-US-Guy24kRUS"> What’s your name? </voice>
<voice xml:lang="en-US" xml:gender="Female" name="en-US-Jessa24kRUS"> My name is Cindy Smith. Do you know John Silver?</voice>
<voice xml:lang="en-US" xml:gender="Male" name="en-US-Guy24kRUS"> John and I are old friends. </voice>
<voice xml:lang="en-US" xml:gender="Female" name="en-US-Jessa24kRUS"> John just joined our company as a salesperson. </voice>
<voice xml:lang="en-US" xml:gender="Male" name="en-US-Guy24kRUS"> That’s good news. John has been a salesperson for chemical products for many years. </voice>
<voice xml:lang="en-US" xml:gender="Female" name="en-US-Jessa24kRUS"> I head he really likes his new job.</voice>
</speak>
,而documentation中没有提到任何限制,这将对我有很大帮助。
答案 0 :(得分:1)
由于延迟,这是已知的设置。我们已经知道并正在努力消除此限制。希望我们能在本周完成修复和部署,如果一切顺利,我们可能会更早完成。