DSE SOLR中的多值复制字段支持

时间:2014-03-23 22:38:37

标签: solr cassandra datastax-enterprise datastax

我有一个SOLR架构如下:

<field name="category_id1" type="integer" indexed="false" stored="true" />
<field name="category_id2" type="integer" indexed="false" stored="true" />
<field name="category_id3" type="integer" indexed="false" stored="true" />
<field name="category_ids" type="integer" multiValued="true" indexed="true" stored="true"/>

和复制部分:

<copyField source="category_id1" dest="category_ids" />

但每当我尝试将数据注入DSE / Cassandra时,我都会收到此错误

InvalidRequestException(why:(Expected 4 or 0 byte int (14)) [diem][business][category_ids] failed validation) me.prettyprint.hector.api.exceptions.HInvalidRequestException: InvalidRequestException(why:(Expected 4 or 0 byte int (14)) [diem][business][category_ids] failed validation) Exception in thread "main" me.prettyprint.hector.api.exceptions.HInvalidRequestException: InvalidRequestException(why:(Expected 4 or 0 byte int (14)) [diem][business][category_ids] failed validation) at me.prettyprint.cassandra.service.ExceptionsTranslatorImpl.translate(ExceptionsTranslatorImpl.java:45) at me.prettyprint.cassandra.connection.HConnectionManager.operateWithFailover(HConnectionManager.java:264) at me.prettyprint.cassandra.model.ExecutingKeyspace.doExecuteOperation(ExecutingKeyspace.java:97) at me.prettyprint.cassandra.model.MutatorImpl.execute(MutatorImpl.java:243) at com.diem.db.crud.CassandraStorageManager.insertMultiColumns(CassandraStorageManager.java:197) at com.diem.db.dao.impl.AbstractDaoImpl.saveUUIDEntity(AbstractDaoImpl.java:47) at com.diem.db.dao.impl.BusinessDaoImpl.saveBusiness(BusinessDaoImpl.java:81) at com.diem.data.LoadBusinesses.execute(LoadBusinesses.java:187) at com.diem.data.LoadContent.run(LoadContent.java:121) at com.diem.data.LoadBusinesses.main(LoadBusinesses.java:45) Caused by: InvalidRequestException(why:(Expected 4 or 0 byte int (14)) [diem][business][category_ids] failed validation) at org.apache.cassandra.thrift.Cassandra$batch_mutate_result.read(Cassandra.java:20833) at org.apache.thrift.TServiceClient.receiveBase(TServiceClient.java:78) at org.apache.cassandra.thrift.Cassandra$Client.recv_batch_mutate(Cassandra.java:964) at org.apache.cassandra.thrift.Cassandra$Client.batch_mutate(Cassandra.java:950) at me.prettyprint.cassandra.model.MutatorImpl$3.execute(MutatorImpl.java:246) at me.prettyprint.cassandra.model.MutatorImpl$3.execute(MutatorImpl.java:243) at me.prettyprint.cassandra.service.Operation.executeAndSetResult(Operation.java:103) at me.prettyprint.cassandra.connection.HConnectionManager.operateWithFailover(HConnectionManager.java:258) ... 8 more

复制到multiValued solr.IntField(整数)中的副本并不特别,我们可以在使用DSE / SOLR之前完成。但我似乎无法在DSE / SOLR组合中获得这项工作。从逻辑上讲,我看不出为什么会失败的原因,因为DSE不应该干扰category_ids字段上的操作,该字段主要用于索引。有没有人看到这种情况有什么问题?在这种情况下我可以做些什么来防止验证错误( 注意 :我不能使用category_ids的文本/字符串类型?)

谢谢!

2 个答案:

答案 0 :(得分:0)

我可以找到问题,我的CF有default_validation_class=BytesType,因此在DSE / Solr中使用BytesType验证multiValued字段category_ids,这将导致错误。因此,除非我使用LIST<int>的类型将我的CF更改为CQL声明并且不使用Hector(至少对于此CF),否则我将无法使用Solr中除text / string字段之外的multiValued字段。

答案 1 :(得分:0)

如果我理解正确,你使用的是thrift表,所以要么将category_ids 声明为UTF8Type(Solr字段可以是任何类型),要么将category_ids Solr字段声明为stored = false(在这种情况下,复制字段不会被存储,只会被索引)。

如果两者中的任何一个适合您,请告诉我们。