我有一个SOLR架构如下:
<field name="category_id1" type="integer" indexed="false" stored="true" />
<field name="category_id2" type="integer" indexed="false" stored="true" />
<field name="category_id3" type="integer" indexed="false" stored="true" />
<field name="category_ids" type="integer" multiValued="true" indexed="true" stored="true"/>
和复制部分:
<copyField source="category_id1" dest="category_ids" />
但每当我尝试将数据注入DSE / Cassandra时,我都会收到此错误
InvalidRequestException(why:(Expected 4 or 0 byte int (14)) [diem][business][category_ids] failed validation)
me.prettyprint.hector.api.exceptions.HInvalidRequestException: InvalidRequestException(why:(Expected 4 or 0 byte int (14)) [diem][business][category_ids] failed validation)
Exception in thread "main" me.prettyprint.hector.api.exceptions.HInvalidRequestException: InvalidRequestException(why:(Expected 4 or 0 byte int (14)) [diem][business][category_ids] failed validation)
at me.prettyprint.cassandra.service.ExceptionsTranslatorImpl.translate(ExceptionsTranslatorImpl.java:45)
at me.prettyprint.cassandra.connection.HConnectionManager.operateWithFailover(HConnectionManager.java:264)
at me.prettyprint.cassandra.model.ExecutingKeyspace.doExecuteOperation(ExecutingKeyspace.java:97)
at me.prettyprint.cassandra.model.MutatorImpl.execute(MutatorImpl.java:243)
at com.diem.db.crud.CassandraStorageManager.insertMultiColumns(CassandraStorageManager.java:197)
at com.diem.db.dao.impl.AbstractDaoImpl.saveUUIDEntity(AbstractDaoImpl.java:47)
at com.diem.db.dao.impl.BusinessDaoImpl.saveBusiness(BusinessDaoImpl.java:81)
at com.diem.data.LoadBusinesses.execute(LoadBusinesses.java:187)
at com.diem.data.LoadContent.run(LoadContent.java:121)
at com.diem.data.LoadBusinesses.main(LoadBusinesses.java:45)
Caused by: InvalidRequestException(why:(Expected 4 or 0 byte int (14)) [diem][business][category_ids] failed validation)
at org.apache.cassandra.thrift.Cassandra$batch_mutate_result.read(Cassandra.java:20833)
at org.apache.thrift.TServiceClient.receiveBase(TServiceClient.java:78)
at org.apache.cassandra.thrift.Cassandra$Client.recv_batch_mutate(Cassandra.java:964)
at org.apache.cassandra.thrift.Cassandra$Client.batch_mutate(Cassandra.java:950)
at me.prettyprint.cassandra.model.MutatorImpl$3.execute(MutatorImpl.java:246)
at me.prettyprint.cassandra.model.MutatorImpl$3.execute(MutatorImpl.java:243)
at me.prettyprint.cassandra.service.Operation.executeAndSetResult(Operation.java:103)
at me.prettyprint.cassandra.connection.HConnectionManager.operateWithFailover(HConnectionManager.java:258)
... 8 more
复制到multiValued solr.IntField(整数)中的副本并不特别,我们可以在使用DSE / SOLR之前完成。但我似乎无法在DSE / SOLR组合中获得这项工作。从逻辑上讲,我看不出为什么会失败的原因,因为DSE不应该干扰category_ids
字段上的操作,该字段主要用于索引。有没有人看到这种情况有什么问题?在这种情况下我可以做些什么来防止验证错误( 注意 :我不能使用category_ids
的文本/字符串类型?)
谢谢!
答案 0 :(得分:0)
我可以找到问题,我的CF有default_validation_class=BytesType
,因此在DSE / Solr中使用BytesType验证multiValued字段category_ids
,这将导致错误。因此,除非我使用LIST<int>
的类型将我的CF更改为CQL声明并且不使用Hector(至少对于此CF),否则我将无法使用Solr中除text / string字段之外的multiValued字段。
答案 1 :(得分:0)
如果我理解正确,你使用的是thrift表,所以要么将category_ids 列声明为UTF8Type(Solr字段可以是任何类型),要么将category_ids Solr字段声明为stored = false(在这种情况下,复制字段不会被存储,只会被索引)。
如果两者中的任何一个适合您,请告诉我们。