Question

我有一个项目，我在客户端和主机之间创建了一个聊天程序，我必须在其中嵌入Speech to Text。我有什么办法可以在我的程序中嵌入Google Speech到Text API吗？

Answer 1

PyPI中有一个名为Speech Recognition的软件包看起来会这样做。实时（即通过麦克风）API看起来非常简单。

# NOTE: this requires PyAudio because it uses the Microphone class
import speech_recognition as sr
r = sr.Recognizer()
with sr.Microphone() as source:                # use the default microphone as the audio source
    audio = r.listen(source)                   # listen for the first phrase and extract it into audio data

try:
    print("You said " + r.recognize(audio))    # recognize speech using Google Speech Recognition
except LookupError:                            # speech is unintelligible
    print("Could not understand audio")

它还具有转录WAV文件的功能，作为后台进程运行，为转录提供置信度等。

Answer 2

这可能是你不想在你的情况下使用的，但是对于其他可能需要这个用于一次性项目的人来说，我刚刚在一个简单的python客户端上攻击了一个使用API构建的API进入Chrome进行语音搜索：

https://github.com/korylprince/python-google-transcribe

要使它工作，你必须有16000Hz编码的FLAC，它们必须相当短。

另外，就像提到的评论一样，API是非官方的，所以谁知道它什么时候会停止工作。

Answer 3

你可以试试Nexiwave的免费语音到文本api。这是python示例：http://nexiwave.com/api_samples/nexiwave_py.txt。另请查看API指南：http://nexiwave.com/index.php/119-integrate-in-5-minutes。很简单。

您必须先注册才能使用免费套餐。

如何在Python程序中嵌入Google Speech to Text API？

3 个答案: