我想知道任何人都有一个信息的参考,谈论句子中2个单词之间的语义距离/相似性基于对这两个单词之间的单词进行计数? 例如:
"The student went to the university to learn about the facts of life"
so if the distance threshold is 3 token then:
"student" and "university" will be connected by an edge (distance 3 words)
" facts and "life" will be connected by an edge (distance 1)
"student" and "Fact" will not be connected (distance 8)
etc.
答案 0 :(得分:0)
要找到两个单词之间的语义相似性,最简单的算法是使用单词lesk算法(https://en.wikipedia.org/wiki/Lesk_algorithm),使用wordnet。我怀疑是否有任何通过单词之间的距离找到语义相似性的方法。