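I have been trying, with only intermittent success, to restore my ndb_backups to a new cluster.
We have a cluster of 6 NDB data nodes with 3 API nodes. When I run ndb_restore, the backups from the first 2 or 3 nodes usually load without any problems, but the 4th and 5th nodes fail with the following error: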
Temporary error: 266: Time-out in NDB, probably caused by deadlock
Temporary error: 266: Time-out in NDB, probably caused by deadlock
Retried transaction 10 times.
Last error266: Time-out in NDB, probably caused by deadlock
...Unable to recover from errors. Exiting...
Strangely, sometimes I can simply rerun all 6 backups and they complete without a hitch.
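For context, here is a minimal sketch of the kind of per-node restore sequence described above, run one node at a time. It is not the exact command line used here. The backup ID, the backup path, and the `RESTORE_CMD` hook are placeholders made up for illustration. Restoring metadata (`-m`) only with the first node is the usual pattern, assumed here.

```shell
#!/bin/sh
# Sketch only. BACKUP_ID and BACKUP_DIR are placeholders, not values from this post.
CONNECT="192.168.207.133,192.168.207.45"  # management servers from the config
BACKUP_ID=1                               # assumption: ID of the backup to restore
BACKUP_DIR="/path/to/BACKUP/BACKUP-1"     # assumption: all nodes' backup files copied here
RESTORE_CMD="${RESTORE_CMD:-ndb_restore}" # hypothetical hook so the command can be stubbed

# Restore the backup taken by each listed data node, strictly one after another.
# The first node also restores metadata (-m); every node restores data (-r).
restore_all() {
    first=1
    for node_id in "$@"; do
        if [ "$first" -eq 1 ]; then
            meta="-m"
            first=0
        else
            meta=""
        fi
        # $meta is left unquoted on purpose so an empty value adds no argument.
        "$RESTORE_CMD" -c "$CONNECT" -n "$node_id" -b "$BACKUP_ID" \
            $meta -r --backup_path="$BACKUP_DIR" || {
            echo "restore of node $node_id failed; stopping" >&2
            return 1
        }
    done
}
```

With the node IDs from the config, this would be called as `restore_all 3 4 5 6 7 8`. It stops at the first node whose restore fails rather than continuing.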
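I'm hoping someone knows what adjustments I can make to my configuration to make this process more reliable. Here are my version numbers and configuration: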
mysql-5.6.22 ndb-7.3.8
My mgm config file:
######################
#MGM CONFIG
######################
[ndbd default]
# Options affecting ndbd processes on all data nodes:
NoOfReplicas=2 # Number of replicas
DataMemory=8144M # How much memory to allocate for data storage
IndexMemory=8144M # How much memory to allocate for index storage
# For DataMemory and IndexMemory, we have used the
# default values. Since the "world" database takes up
# only about 500KB, this should be more than enough for
# this example Cluster setup.
[ndb_mgmd]
# Management process options:
hostname=192.168.207.133 # Hostname or IP address of MGM node
NodeId=1
[ndb_mgmd]
# Management process options:
hostname=192.168.207.45 # Hostname or IP address of MGM node
NodeId=2
[ndbd]
# Options for data node "A":
hostname=192.168.207.135 # Hostname or IP address
NodeId=3
[ndbd]
# Options for data node "B":
hostname=192.168.207.171 # Hostname or IP address
NodeId=4
[ndbd]
# Options for data node "C":
hostname=192.168.207.174 # Hostname or IP address
NodeId=5
[ndbd]
# Options for data node "D":
hostname=192.168.207.27 # Hostname or IP address
NodeId=6
[ndbd]
# Options for data node "E":
hostname=192.168.207.169 # Hostname or IP address
NodeId=7
[ndbd]
# Options for data node "F":
hostname=192.168.207.178 # Hostname or IP address
NodeId=8
[mysqld]
hostname=192.168.207.177
NodeId=10
[mysqld]
hostname=192.168.207.35
NodeId=11
[mysqld]
hostname=192.168.207.148
NodeId=12
My mysqld and ndb node configuration:
######################
#API AND NDB CONFIG
######################
[mysqld]
ndbcluster
[mysql_cluster]
ndb-connectstring=192.168.207.133,192.168.207.45 # location of management server
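I really hope someone can help. I have been at this for a month. We use BLOB data very extensively, and I know that can contribute to these timeouts, but what I'm most curious about is why the 6-node restore sometimes succeeds and sometimes doesn't, and how I can make sure the restore succeeds every time.
I'm more than willing to try things and post back. I'm new to MySQL Cluster and have learned heaps over the past few months, but I'm eager to learn more.
Thanks in advance, GT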