Question

我正在尝试使用节点js将音频文件转换为文本，我有一个参考链接https://github.com/GoogleCloudPlatform/nodejs-docs-samples/tree/master/speech，通过使用此链接，我能够转换包含在其中的示例文件（即音频。 raw，gs：//gcs-test-data/vr.flac）从音频到文本

但是当我使用此link的音频文件时，我无法将音频转换为文本，它会显示空响应。回复是{ results: [] }

我也知道使用这些命令

 node recognize.js sync ./resources/audio.raw -e LINEAR16 -r 16000
 node recognize.js async-gcs gs://gcs-test-data/vr.flac -e FLAC -r 16000
 node recognize.js stream ./resources/audio.raw  -e LINEAR16 -r 16000

但是，我不知道如何使用链接中的音频文件将音频转换为文本

Answer 1

使用Google Speech API时，务必确保您的音频格式正确无误。 API不能与mp3等格式一起使用。

为了使用api，您可以使用像sox或ffmpeg这样的工具转换为flac： * Sox：sox input_file.mp3 output.flac * ffmpeg：ffmpeg -i input_file.mp3 ./output.flac

如果使用了错误的格式，Google Speech API将返回一个空答案。

{ results: [] }

无法将音频转换为文本nodejs

1 个答案: