Question

我正在做一个问答任务。我打算使用依赖性分析来找到从段落到查询的候选答案。但是，我不确定如何在查询的依赖树和段落中的句子之间找到相似性。下面是可复制的代码。

import spacy
from spacy import displacy
nlp = spacy.load('en_core_web_sm')

doc1 = nlp('Wall Street Journal just published an interesting piece on crypto currencies')
doc2 = nlp('What did Wall Street Journal published')

displacy.render(doc1, style='dep', jupyter=True, options={'distance': 90})
displacy.render(doc2, style='dep', jupyter=True, options={'distance': 90})

Answer 1

这可能是您要寻找的：

https://github.com/tomkwiat/dependency-graph-similarity-measure

它在Java上，但是应该易于移植到Python。

您可能要研究的另一件事是有关内核图相似性的论文。

如何使用spaCy查找2个依赖树之间的相似性？

1 个答案: