所以我尝试使用speech_recognition.recognize_sphinx()
的语法,但是,我收到以下错误:
RuntimeError: Decoder_set_fsg returned -1
这是我的代码:
Main.py
# Dependencies:
import speech_recognition as sr
# Collect audio sample
r = sr.Recognizer()
print('Please say "perquisition":')
with sr.Microphone() as source:
audio_en = r.listen(source)
print('Processing...')
# Attempt to convert the speech to text
try:
print(r.recognize_sphinx(audio_en, grammar='perquisition.gram'))
except sr.UnknownValueError:
print("Sphinx could not understand audio")
except sr.RequestError as e:
print("Sphinx error; {0}".format(e))
perquisition.gram
#JSGF V1.0;
grammar perquisition;
// Grammar rule names should be [a-zA-Z0-9] only!
public <perquisition> = ( perquisition );
关于发生了什么的任何想法?
答案 0 :(得分:1)
这里有一些事情掩盖了潜在的错误。
speech_recognition.recognize_sphinx()
只是一些CMUsphinx命令的包装器,可以在第746行找到here。对于这个确切的问题,有点混乱,所以我们将重点放在下面的代码片段代替:
# Dependencies
import speech_recognition as sr
import os
import pocketsphinx as ps
# Manually point to the grammar file
grammar = 'search.gram'
try:
# Point to the model files
language_directory = os.path.join(os.path.dirname(os.path.realpath(__file__)), "pocketsphinx-data", "en-US")
acoustic_parameters_directory = os.path.join(language_directory, "acoustic-model")
language_model_file = os.path.join(language_directory, "language-model.lm.bin")
phoneme_dictionary_file = os.path.join(language_directory, "pronounciation-dictionary.dict")
# Create a decoder object with our custom parameters
config = ps.Decoder.default_config()
config.set_string("-hmm",
acoustic_parameters_directory) # set the path of the hidden Markov model (HMM) parameter files
config.set_string("-lm", language_model_file)
config.set_string("-dict", phoneme_dictionary_file)
config.set_string("-logfn", os.devnull) # <--- Prevents you from seeing the actual bug!!!
decoder = ps.Decoder(config)
# Convert grammar
grammar_path = os.path.abspath(os.path.dirname(grammar))
grammar_name = os.path.splitext(os.path.basename(grammar))[0]
fsg_path = "{0}/{1}.fsg".format(grammar_path, grammar_name)
if not os.path.exists(fsg_path): # create FSG grammar if not available
jsgf = ps.Jsgf(grammar)
rule = jsgf.get_rule("{0}.{0}".format(grammar_name))
fsg = jsgf.build_fsg(rule, decoder.get_logmath(), 7.5)
fsg.writefile(fsg_path)
print('Successful JSFG to FSG conversion!!!')
# Pass the fsg file into the decoder
decoder.set_fsg(grammar_name, fsg) # <--- BUG IS HERE!!!
except Exception as e:
print('Ach no! {0}'.format(e))
finally:
os.remove('search.fsg') # Remove again to help prove that the grammar to fsg conversion isn't at fault
运行该代码,我们找到了bug突然出现的那一行,而且还记录了日志信息!随着它,很多文本被转储到终端,这可能是一个麻烦。在这种情况下,重新开始发现它会更有用......
...
ERROR: "fsg_search.c", line 141: The word 'perquisition' is missing in the dictionary
...
现在我们到了某个地方。这给我们留下了两个选择之一。首先,我们可以扫描字典(pocketsphinx.get_model_path()+'/cmudict-en-us.dict'
或类似字体)以确定是否存在单词。然后我们可以决定是否简单地忽略该单词,或者将其添加到字典中。
添加到字典不一定是直接的...根据它与字典中其他单词的相似程度,您可能能够逃脱它。否则,您也必须重新训练模型。可以找到关于如何执行此操作的更好解释here。享受。