希伯来语有一个好的词干吗?

时间:2014-01-06 15:39:48

标签: nlp hebrew stemming lemmatization

我正在为希伯来语找一个好的词干 - 我一无所获用谷歌......

HebMorph site上它说:

Stem and Lemma originally have different meanings, but for Semitic languages they seem to be used interchangeably.

这是否意味着对于NLP目的,我可以使用lemmas而不是stems?请记住:Stemmers are much simpler, smaller and usually faster then lemmatizers, and for many applications their results are good enough. Using a lemmatizer for that is a waste of resources.source

谢谢。

1 个答案:

答案 0 :(得分:0)

在希伯来语中,词干提取器和lemmatizer都很复杂-您不能像在porter stemmer中那样仅根据单词的结尾来修剪单词中的字母...

关于lemmatizer的现有实现,您可以尝试http://hebrew-nlp.co.il当前处于beta状态,它是免费的