将语法与SpeechRecognition包一起使用时出现运行时错误

时间:2018-04-09 12:07:29

标签: python speech-recognition grammar cmusphinx

所以我尝试使用speech_recognition.recognize_sphinx()的语法,但是,我收到以下错误:

RuntimeError: Decoder_set_fsg returned -1

这是我的代码:

Main.py

# Dependencies:
import speech_recognition as sr

# Collect audio sample
r = sr.Recognizer()
print('Please say "perquisition":')
with sr.Microphone() as source:
    audio_en = r.listen(source)
print('Processing...')

# Attempt to convert the speech to text
try:
    print(r.recognize_sphinx(audio_en, grammar='perquisition.gram'))
except sr.UnknownValueError:
    print("Sphinx could not understand audio")
except sr.RequestError as e:
    print("Sphinx error; {0}".format(e))

perquisition.gram

#JSGF V1.0;
grammar perquisition;
// Grammar rule names should be [a-zA-Z0-9] only!
public <perquisition> = ( perquisition );

关于发生了什么的任何想法?

1 个答案:

答案 0 :(得分:1)

这里有一些事情掩盖了潜在的错误。 speech_recognition.recognize_sphinx()只是一些CMUsphinx命令的包装器,可以在第746行找到here。对于这个确切的问题,有点混乱,所以我们将重点放在下面的代码片段代替:

# Dependencies
import speech_recognition as sr
import os
import pocketsphinx as ps

# Manually point to the grammar file
grammar = 'search.gram'
try:
    # Point to the model files
    language_directory = os.path.join(os.path.dirname(os.path.realpath(__file__)), "pocketsphinx-data", "en-US")
    acoustic_parameters_directory = os.path.join(language_directory, "acoustic-model")
    language_model_file = os.path.join(language_directory, "language-model.lm.bin")
    phoneme_dictionary_file = os.path.join(language_directory, "pronounciation-dictionary.dict")

    # Create a decoder object with our custom parameters
    config = ps.Decoder.default_config()
    config.set_string("-hmm",
                      acoustic_parameters_directory)  # set the path of the hidden Markov model (HMM) parameter files
    config.set_string("-lm", language_model_file)
    config.set_string("-dict", phoneme_dictionary_file)
    config.set_string("-logfn", os.devnull)  # <--- Prevents you from seeing the actual bug!!!
    decoder = ps.Decoder(config)

    # Convert grammar
    grammar_path = os.path.abspath(os.path.dirname(grammar))
    grammar_name = os.path.splitext(os.path.basename(grammar))[0]
    fsg_path = "{0}/{1}.fsg".format(grammar_path, grammar_name)
    if not os.path.exists(fsg_path):  # create FSG grammar if not available
        jsgf = ps.Jsgf(grammar)
        rule = jsgf.get_rule("{0}.{0}".format(grammar_name))
        fsg = jsgf.build_fsg(rule, decoder.get_logmath(), 7.5)
        fsg.writefile(fsg_path)
        print('Successful JSFG to FSG conversion!!!')

    # Pass the fsg file into the decoder
    decoder.set_fsg(grammar_name, fsg)  # <--- BUG IS HERE!!!

except Exception as e:
    print('Ach no! {0}'.format(e))
finally:
    os.remove('search.fsg')  # Remove again to help prove that the grammar to fsg conversion isn't at fault

运行该代码,我们找到了bug突然出现的那一行,而且还记录了日志信息!随着它,很多文本被转储到终端,这可能是一个麻烦。在这种情况下,重新开始发现它会更有用......

...
ERROR: "fsg_search.c", line 141: The word 'perquisition' is missing in the dictionary
...

现在我们到了某个地方。这给我们留下了两个选择之一。首先,我们可以扫描字典(pocketsphinx.get_model_path()+'/cmudict-en-us.dict'或类似字体)以确定是否存在单词。然后我们可以决定是否简单地忽略该单词,或者将其添加到字典中。

添加到字典不一定是直接的...根据它与字典中其他单词的相似程度,您可能能够逃脱它。否则,您也必须重新训练模型。可以找到关于如何执行此操作的更好解释here。享受。