Question

当我查看Developer.ATT.com上的文档时，我看到了我应该使用的格式（代码）示例：

POST https://api.att.com/speech/v3/speechToTextCustom HTTP/1.1
Host: api.att.com
Authorization: Bearer 38C2399A23999
X-SpeechContext: GrammarList
Accept: application/json
Content-length: 7012
Content-Language: en-us
Content-type: multipart/x-srgs-audio; boundary=----------------------------46ef33c653e2

------------------------------46ef33c653e2
Content-Disposition: form-data; name="x-dictionary"; filename="speech_alpha.pls"
Content-Type: application/pls+xml

<?xml version="1.0" encoding="UTF-8"?>
<lexicon version="1.0" alphabet="sampa" xml:lang="en-US">
   <lexeme>
       <grapheme>Qiang</grapheme>
       <phoneme>tS { n</phoneme>
   </lexeme>
</lexicon>

------------------------------46ef33c653e2
Content-Disposition: form-data; name="x-grammar"
Content-Type: application/srgs+xml

<grammar root="top" xml:lang="en-US">
  <rule id="CONTACT">
      <one-of>
        <item>Qiang</item>
        <item>David</item>
      </one-of>
  </rule>
  <rule id="top" scope="public">
      <item>
          <one-of>
            <item>text</item>
            <item>send message to</item>
          </one-of>
      </item>
  <ruleref uri="#CONTACT"/>
  </rule>
</grammar>


------------------------------46ef33c653e2
Content-Disposition: form-data; name="x-voice"; filename="speech_ex.amr"
Content-Type: audio/amr

      {{...audio data...}}

------------------------------45ef33c653e2--

由于我无法使其工作，我想知道是否应该包括帖子的字典和语法部分？为什么呢？

Answer 1

要使用Customspeech引擎（非标准语言），需要使用语法文件，因为您需要在使用Customspeech引擎时提供转录词汇表。您可以使用常规语音API进行转录而不使用语法文件，但它可能无法涵盖您尝试执行的操作。看起来您正试图利用AT＆amp; T Customspeech引擎提供的大量非英语语言。所有这些语言都需要您提供语法文件才能工作。支持的非自定义语言是美式英语和美式西班牙语。

AT＆amp; T Dev Prog：语音到文本自定义帖子

1 个答案: