应用错误收集

我尝试使用NLTK RegexpParser来提取noungroup。语法（不是英语:)）需要正则表达式反向引用，就像正则表达式一样。有可能吗？

from nltk.chunk import RegexpParser

grammar = '''
    NP: {<NN(.*)><CC><NN\1>} # this gives error
        {<NN(.*)><CC><NN$1>} # this has no effect
    '''
chunker = RegexpParser(grammar)

例如，它将匹配：
“NN + PLURAL CC NN + PLURAL”
但这不会：
“NN + PLURAL CC NN + ACCUSATIVE”

我的问题：nltk.RegexpParser是否有正则表达式反向引用选项？

具有反向引用的NLTK分块语法

0 个答案: