问题:我们如何使用包含数字和非数字字段的原始lucene查询字符串提供hibernate搜索?
背景:我们最近升级到HibernateSearch 5.0,由于HibernateSearch查询解析器(pre-lucene)发生了更改,我们的许多查询现在都失败了,并出现以下错误:
The specified query contains a string based sub query which targets the numeric encoded field(s)
在大多数情况下,由于我们运行的查询的复杂性,我们使用lucene的文本语法和MultiFieldQueryParser
将查询传递给HibernateSearch。直到HibernateSearch 5.0,这些都运行得很好。在升级中,我们遇到了从HibernateSearch抛出的异常,这些异常阻止我们的应用程序运行以前的查询。我们不理解为什么抛出异常或向前推进的最佳方式。
在尝试追踪问题时,我试图简化哪些有效,哪些无法以最原始的形式进行。 (这是由HibernateSearch的QueryValidationTest构建的。)
示例:
给出以下Entity类:
@Entity
@Indexed
public static class B {
@Id
@GeneratedValue
private long id;
@Field
private long value;
@Field
private String text;
}
测试1(我们如何为hibernate搜索编写查询:FAILURE):
QueryParser parser = new MultiFieldQueryParser(new String[]{"id","value","num"},new StandardAnalyzer());
Query query = parser.parse("+(value:1 text:test)");
FullTextQuery fullTextQuery = fullTextSession.createFullTextQuery( query, B.class );
fullTextQuery.list();
结果:
org.hibernate.search.exception.SearchException: HSEARCH000233: The specified query '+(value:1 text:test)' contains a string based sub query which targets the numeric encoded field(s) 'value'. Check your query or try limiting the targeted entities.
at org.hibernate.search.query.engine.impl.LazyQueryState.validateQuery(LazyQueryState.java:163)
at org.hibernate.search.query.engine.impl.LazyQueryState.search(LazyQueryState.java:102)
at org.hibernate.search.query.engine.impl.QueryHits.updateTopDocs(QueryHits.java:227)
at org.hibernate.search.query.engine.impl.QueryHits.<init>(QueryHits.java:122)
at org.hibernate.search.query.engine.impl.QueryHits.<init>(QueryHits.java:94)
at org.hibernate.search.query.engine.impl.HSQueryImpl.getQueryHits(HSQueryImpl.java:436)
at org.hibernate.search.query.engine.impl.HSQueryImpl.queryEntityInfos(HSQueryImpl.java:257)
at org.hibernate.search.query.hibernate.impl.FullTextQueryImpl.list(FullTextQueryImpl.java:200)
at org.hibernate.search.test.query.validation.QueryValidationTest.testRawLuceneWithNumericValue(QueryValidationTest.java:156)
at sun.reflect.NativeMethodAccessorImpl.invoke0(Native Method)
at sun.reflect.NativeMethodAccessorImpl.invoke(NativeMethodAccessorImpl.java:57)
at sun.reflect.DelegatingMethodAccessorImpl.invoke(DelegatingMethodAccessorImpl.java:43)
at java.lang.reflect.Method.invoke(Method.java:601)
at org.junit.runners.model.FrameworkMethod$1.runReflectiveCall(FrameworkMethod.java:47)
at org.junit.internal.runners.model.ReflectiveCallable.run(ReflectiveCallable.java:12)
at org.junit.runners.model.FrameworkMethod.invokeExplosively(FrameworkMethod.java:44)
at org.hibernate.testing.junit4.ExtendedFrameworkMethod.invokeExplosively(ExtendedFrameworkMethod.java:62)
at org.junit.internal.runners.statements.InvokeMethod.evaluate(InvokeMethod.java:17)
at org.junit.internal.runners.statements.RunBefores.evaluate(RunBefores.java:26)
at org.junit.internal.runners.statements.RunAfters.evaluate(RunAfters.java:27)
at org.hibernate.testing.junit4.FailureExpectedHandler.evaluate(FailureExpectedHandler.java:58)
at org.junit.runners.ParentRunner.runLeaf(ParentRunner.java:271)
at org.junit.runners.BlockJUnit4ClassRunner.runChild(BlockJUnit4ClassRunner.java:70)
at org.junit.runners.BlockJUnit4ClassRunner.runChild(BlockJUnit4ClassRunner.java:50)
at org.junit.runners.ParentRunner$3.run(ParentRunner.java:238)
at org.junit.runners.ParentRunner$1.schedule(ParentRunner.java:63)
at org.junit.runners.ParentRunner.runChildren(ParentRunner.java:236)
at org.junit.runners.ParentRunner.access$000(ParentRunner.java:53)
at org.junit.runners.ParentRunner$2.evaluate(ParentRunner.java:229)
at org.hibernate.testing.junit4.BeforeClassCallbackHandler.evaluate(BeforeClassCallbackHandler.java:43)
at org.hibernate.testing.junit4.AfterClassCallbackHandler.evaluate(AfterClassCallbackHandler.java:42)
at org.junit.runners.ParentRunner.run(ParentRunner.java:309)
at org.apache.maven.surefire.junit4.JUnit4Provider.execute(JUnit4Provider.java:264)
at org.apache.maven.surefire.junit4.JUnit4Provider.executeTestSet(JUnit4Provider.java:153)
at org.apache.maven.surefire.junit4.JUnit4Provider.invoke(JUnit4Provider.java:124)
at org.apache.maven.surefire.booter.ForkedBooter.invokeProviderInSameClassLoader(ForkedBooter.java:200)
at org.apache.maven.surefire.booter.ForkedBooter.runSuitesInProcess(ForkedBooter.java:153)
at org.apache.maven.surefire.booter.ForkedBooter.main(ForkedBooter.java:103)
测试2 :(使用数值范围变化失败的方式相同:FAILURE):
QueryParser parser = new MultiFieldQueryParser(new String[]{"id","value","text"},new StandardAnalyzer());
Query query = parser.parse("+(value:[1 TO 1] text:test)");
FullTextQuery fullTextQuery = fullTextSession.createFullTextQuery( query, B.class );
fullTextQuery.list();
测试3 :(使用lucene术语:SUCCESS)
TermQuery query = new TermQuery( new Term("text", "bar") );
TermQuery nq = new TermQuery( new Term("value", "1") );
BooleanQuery bq = new BooleanQuery();
bq.add(query, Occur.SHOULD);
bq.add(nq, Occur.SHOULD);
FullTextQuery fullTextQuery = fullTextSession.createFullTextQuery( bq, B.class );
注意:完整版的测试用例中包含的测试说明了我们所看到的内容:https://github.com/abrin/hibernate-search/blob/3fdcc8229f0bfa00329b9d977172fd218d82cac2/orm/src/test/java/org/hibernate/search/test/query/validation/QueryValidationTest.java
感谢
答案 0 :(得分:5)
首先,您的问题的原因是,从搜索5开始,数字类型被索引为Lucene数字字段(而不是基于字符串的字段)。除了性能提升之外,它还允许例如对数字字段进行排序而无需填充。搜索5 documentation说明如下:
在搜索5之前,仅选择数字字段编码 通过@NumericField明确请求。截至搜索5此编码 自动为数字类型选择。避免数字编码 您可以通过显式指定非数字字段桥 @ Field.bridge或@FieldBridge。包裹 org.hibernate.search.bridge.builtin包含一组桥梁 例如,将数字编码为字符串 org.hibernate.search.bridge.builtin.IntegerBridge。
因此,如果您想坚持旧的行为,您需要确保您的数值仍然被索引为字符串。在您的示例中,value
需要使用org.hibernate.search.bridge.builtin.LongBridge
编入索引。您可以使用@FieldBridge
注释实现此目的(您可以忽略id情况,因为文档ID无论如何都被索引为字符串):
@Field
@FieldBridge(impl = LongBridge.class)
private long value;
有关您的测试方案的一些评论:
NumericRangeQuery
进行定位/搜索。如果您仍想使用查询解析器,则需要提供自己的子类并自己处理数字字段。另请参阅 - How do I make the QueryParser in Lucene handle numeric ranges? value:[1 TO 1]
,它只是创建一个文本/字符串范围查询。value
术语被忽略。 TermQuery
是基于字符串的,无法在数字编码字段中找到匹配项。另请参阅Lucene 3.0.3 Numeric term query