Question

友

我目前正在使用KStemmer在我使用solr开发的应用程序中执行词干操作。发现的是当我们使用kStemmer时，它没有使用protwords.txt

<analyzer type="query">
    <tokenizer class="solr.StandardTokenizerFactory"/>
    <filter class="solr.SynonymFilterFactory" synonyms="synonyms.txt" ignoreCase="true" expand="true"/>
    <filter class="solr.StopFilterFactory"
            ignoreCase="true"
            words="stopwords.txt"
            enablePositionIncrements="true"
            />
    <filter class="solr.LowerCaseFilterFactory"/>
<filter class="solr.EnglishPossessiveFilterFactory"/>
    <filter class="solr.KStemFilterFactory" protected="protwords.txt"/>

  </analyzer>

它不会将protwords中的单词视为受保护。这是Kstem的工作原理吗？

Answer 1

我检查了源代码，但是我看不到实现。顺便说一句，我检查了Solr 3.6的源代码。

它在SnowballPorterFilterFactory中实现，但未在KStemFilterFactory中实现。

kstemmer和protwords

1 个答案: