Solr多个提升查询

时间:2014-03-13 20:57:13

标签: java apache search solr solr-boost

在Solr 3.6.2中提升静态值的最佳方法是什么?

每个文档都有一个(KeywordTokenized)source值(信息来源),我想表达一些比其他更重要的。如果有帮助,可以进行全面的内容控制。

http://localhost:8080/solr/index/select?fl=source,score
  &defType=edismax
  &qf=title^4
  &q=qux
  &bq=source:foo^10
  &bq=source:bar^14

不幸的是,添加提升查询实际上没有任何影响。解析后的查询看起来像这样(qux被限制为qu)

+DisjunctionMaxQuery(((title:qux qu)^4.0))) source:foo^10.0 source:bar^14.0

前两个结果的调试查询不会明显尊重bq=source:foo^10

5.366351 = (MATCH) sum of: 5.366351 = (MATCH) sum of: 2.067319 = (MATCH) weight(title:qux in 4472), product of: 0.5513683 = queryWeight(title:qux), product of: 7.4988675 = idf(docFreq=35, maxDocs=23918) 0.073526874 = queryNorm 3.7494338 = (MATCH) fieldWeight(title:qux in 4472), product of: 1.0 = tf(termFreq(title:qux)=1) 7.4988675 = idf(docFreq=35, maxDocs=23918) 0.5 = fieldNorm(field=title, doc=4472) 3.299032 = (MATCH) weight(title:qu in 4472), product of: 0.69651634 = queryWeight(title:qu), product of: 9.472949 = idf(docFreq=4, maxDocs=23918) 0.073526874 = queryNorm 4.7364745 = (MATCH) fieldWeight(title:qu in 4472), product of: 1.0 = tf(termFreq(title:qu)=1) 9.472949 = idf(docFreq=4, maxDocs=23918) 0.5 = fieldNorm(field=title, doc=4472)

5.281746 = (MATCH) sum of: 4.134638 = (MATCH) sum of: 4.134638 = (MATCH) weight(title:qux in 4402), product of: 0.5513683 = queryWeight(title:qux), product of: 7.4988675 = idf(docFreq=35, maxDocs=23918) 0.073526874 = queryNorm 7.4988675 = (MATCH) fieldWeight(title:qux in 4402), product of: 1.0 = tf(termFreq(title:qux)=1) 7.4988675 = idf(docFreq=35, maxDocs=23918) 1.0 = fieldNorm(field=title, doc=4402) 1.147108 = (MATCH) weight(source:foo^10.0 in 4402), product of: 0.45919293 = queryWeight(source:foo^10.0), product of: 10.0 = boost 2.4980958 = idf(docFreq=5346, maxDocs=23918) 0.018381719 = queryNorm 2.4980958 = (MATCH) fieldWeight(source:foo in 4402), product of: 1.0 = tf(termFreq(source:foo)=1) 2.4980958 = idf(docFreq=5346, maxDocs=23918) 1.0 = fieldNorm(field=source, doc=4402)

有趣的是,尽管queryWeight字段中包含值,但source的第一个结果仍为 source。由于lemmatization

fieldNorm也有所不同

1 个答案:

答案 0 :(得分:2)

问题很可能是由于queryNorm造成的。看到: http://lucene.472066.n3.nabble.com/QueryNorm-and-FieldNorm-td1992964.html

只要您不根据不同查询的分数比较结果,单个搜索中的相关性排序应该没问题。