I'm looking for a good stemmer for Hebrew, but searching Google turned up nothing.
The HebMorph site says:
Stem and Lemma originally have different meanings, but for Semitic languages they seem to be used interchangeably.
Does this mean that, for NLP purposes, I can use lemmas instead of stems? Keep in mind: "Stemmers are much simpler, smaller and usually faster than lemmatizers, and for many applications their results are good enough. Using a lemmatizer for that is a waste of resources."
(source)
Thanks.
Answer 0 (score: 0)
In Hebrew, stemmers and lemmatizers are both complex: you can't just trim letters off a word based on its ending, the way the Porter stemmer does for English.
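To make the point about trimming endings concrete, here is a minimal sketch of a Porter-style suffix stripper applied to Hebrew. The function `naive_strip` and its two-entry suffix list are hypothetical, written only for this illustration; they are not taken from HebMorph or any other library.

```python
# Hypothetical toy stemmer: strips common Hebrew plural endings by suffix alone.
# The name and the suffix list are assumptions made for this illustration.
SUFFIXES = ("ים", "ות")  # masculine and feminine plural endings


def naive_strip(word: str) -> str:
    """Remove the first matching suffix, keeping at least one letter."""
    for suffix in SUFFIXES:
        if word.endswith(suffix) and len(word) > len(suffix):
            return word[: -len(suffix)]
    return word


if __name__ == "__main__":
    # "ילדים" (children) -> "ילד" (child): the rule happens to work here.
    print(naive_strip("ילדים"))
    # "מים" (water) is not a plural of anything; it is cut down to one letter.
    print(naive_strip("מים"))
    # "אחות" (sister, singular) is turned into "אח" (brother).
    print(naive_strip("אחות"))
```

The first word comes out right, but the other two end in the same letters without being plurals, so the rule either destroys the word or merges it with a different one. Telling these cases apart takes a lexicon or morphological analysis, which is roughly what a lemmatizer does anyway.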
As for existing lemmatizer implementations, you can try http://hebrew-nlp.co.il. It is currently in beta and free to use.