I have a problem with the SnowballPorterFilterFactory for German words. I can't get it to work, neither for English nor for German. I am using this AnalyzerDef:
@AnalyzerDef(name = "customAnalyzer", charFilters = {
        @CharFilterDef(factory = HTMLStripCharFilterFactory.class)},
    tokenizer = @TokenizerDef(factory = StandardTokenizerFactory.class),
    filters = {
        @TokenFilterDef(factory = TrimFilterFactory.class),
        @TokenFilterDef(factory = LowerCaseFilterFactory.class),
        @TokenFilterDef(factory = SnowballPorterFilterFactory.class, params = {
            @Parameter(name = "language", value = "German2")
        }),
        @TokenFilterDef(factory = WordDelimiterFilterFactory.class, params = {
            @Parameter(name = "splitOnCaseChange", value = "1"),
            @Parameter(name = "splitOnNumerics", value = "1"),
            @Parameter(name = "generateWordParts", value = "1"),
            @Parameter(name = "preserveOriginal", value = "1")
        }),
        @TokenFilterDef(factory = EdgeNGramFilterFactory.class, params = {
            @Parameter(name = "minGramSize", value = "5"),
            @Parameter(name = "maxGramSize", value = "15"),
            @Parameter(name = "side", value = "front")
        })
    }))
I tried it with words like walking/walks/walk, as in the example here. Could the combination of filters be the problem?
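One way to narrow down whether the filter combination matters is to look at what the last filter in the chain would do with a short stem. The following is a minimal plain-Java sketch, not Lucene code: `EdgeNGramSketch` and `frontGrams` are hypothetical names, and the sketch assumes that a front edge n-gram filter emits no gram at all for a token shorter than `minGramSize`. Whether the Lucene version in use behaves this way should be checked against its documentation.

```java
import java.util.ArrayList;
import java.util.List;

// Hypothetical helper that imitates a front edge n-gram step in plain Java.
// Assumption: tokens shorter than minGram produce no output.
final class EdgeNGramSketch {

    // Returns every prefix of `token` whose length lies in [minGram, maxGram].
    static List<String> frontGrams(String token, int minGram, int maxGram) {
        List<String> grams = new ArrayList<>();
        int longest = Math.min(maxGram, token.length());
        for (int len = minGram; len <= longest; len++) {
            grams.add(token.substring(0, len));
        }
        return grams;
    }

    public static void main(String[] args) {
        // Same sizes as in the AnalyzerDef above: minGramSize 5, maxGramSize 15.
        System.out.println(frontGrams("walking", 5, 15)); // [walki, walkin, walking]
        System.out.println(frontGrams("walk", 5, 15));    // []
    }
}
```

Under that assumption, a stem of four characters such as "walk" would leave the chain with no terms, so stemming could appear to have no effect even if the stemmer itself works. Temporarily removing the EdgeNGramFilterFactory entry, or lowering `minGramSize`, would be one way to test this.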