Question

有一个包含5个节点的Cassandra集群。最近，从2.2.7到3.9版本，逐节点地进行了更新。更新是根据Datastax描述的流程完成的：upgrade instruction。一切顺利。整个过程耗时约1小时。但是，几个小时后我发现了以下问题：对于更新周期，一些数据是不一致的，即对于特定的分区密钥和聚类密钥，应该返回一行且仅返回一行。但有时会返回一个，有时两个 - 对于同一个查询。

该表格如下PRIMARY KEY: ((id, year), date time)。所以，查询

SELECT * FROM table_name 
  WHERE id=1 and year=2017 and datetime='2017-01-01T01:01:01:000Z';

有时会返回一行，有时会返回两行。

此外，这些行不同，只有主键的字段相同。

结果示例：

id | year | datetime                | field1 | field2 | field3 | field4
---+------+-------------------------+--------+--------+--------+--------
1  | 2017 | 2017-01-01 01:01:01.000 |   null |   null |      5 |      6
1  | 2017 | 2017-01-01 01:01:01.000 |      3 |      4 |   null |   null

因此，如您所见，第1行包含字段：field3和field4不为null，而第2行有field1，field2不为null。其余损坏的数据也是相同的模式。

此外，一旦我尝试删除这些行，只有第一行消失而第二行保留。我确定字段'datetime'是相同的，这不是毫秒问题，因为 blobAsBigint（timestampAsBlob（datetime））为两行返回相同的值。

做了什么：

nodetool upgradedesstables my_keyspace （在每个节点上逐一）

nodetool修复my_keyspace （逐个修复每个节点）

升级稳定是必需的，因为最初修复返回时“验证失败”错误： https://support.datastax.com/hc/en-us/articles/205256895--Validation-failed-when-running-a-nodetool-repair

CREATE TABLE my_keyspace.my_table (
    id bigint,
    year int,
    datetime timestamp,
    field1 int,
    field2 int,
    field3 set<bigint>,
    field4 boolean,
    field5 map<int, text>,
    field6 timestamp,
    field7 decimal,
    field8 decimal,
    PRIMARY KEY ((id, year), datetime)
) WITH CLUSTERING ORDER BY (datetime ASC)
    AND bloom_filter_fp_chance = 0.01
    AND caching = {'keys': 'ALL', 'rows_per_partition': 'NONE'}
    AND comment = ''
    AND compaction = {'class': 'org.apache.cassandra.db.compaction.SizeTieredCompactionStrategy', 'max_threshold': '32', 'min_threshold': '4'}
    AND compression = {'chunk_length_in_kb': '64', 'class': 'org.apache.cassandra.io.compress.LZ4Compressor'}
    AND crc_check_chance = 1.0
    AND dclocal_read_repair_chance = 0.1
    AND default_time_to_live = 0
    AND gc_grace_seconds = 864000
    AND max_index_interval = 2048
    AND memtable_flush_period_in_ms = 0
    AND min_index_interval = 128
    AND read_repair_chance = 0.0
    AND speculative_retry = '99PERCENTILE';

Answer 1

升级到3.11并执行nodetool scrub。 3.0.11和3.11中修复了迁移中的错误

请参阅：https://issues.apache.org/jira/browse/CASSANDRA-13125了解详情

Cassandra有多行具有相同的parition和clustering键

1 个答案: