from nltk import ngrams
n = 2
test = training["review_clean"][0].split()
bigram = list(ngrams(test,n))
for gram in bigram:
print(gram)
输出看起来像这样:
[(die,studiengangsgross),(studiengangsgross,学士),(bachelor, ca),(ca,6070),(6070,学生),...]
那么有人知道如何在这里进行近似匹配吗?
预先感谢!