Recovering a kops Kubernetes cluster

Date: 2018-10-12 11:36:52

Tags: kubernetes kops

I have a Kubernetes cluster that was created with kops. When I run kops validate, the output is:

    INSTANCE GROUPS
    NAME                ROLE    MACHINETYPE  MIN  MAX  SUBNETS
    master-us-east-1a   Master  m4.xlarge    1    1    us-east-1a
    nodes               Node    c4.2xlarge   1    75   us-east-1a

    NODE STATUS
    NAME                            ROLE    READY
    ip-172-20-59-93.ec2.internal    master  False

    VALIDATION ERRORS
    KIND     NAME                            MESSAGE
    Machine  i-0a44bbdd18c86e846             machine "i-0a44bbdd18c86e846" has not yet joined cluster
    Machine  i-0d3302056f3dfeef0             machine "i-0d3302056f3dfeef0" has not yet joined cluster
    Machine  i-0d6199876b91962f4             machine "i-0d6199876b91962f4" has not yet joined cluster
    Node     ip-172-20-59-93.ec2.internal    master "ip-172-20-59-93.ec2.internal" is not ready

    Validation Failed

How can I recover this cluster? The cluster's S3 files are still available.

The etcd volumes are shown in the "in-use" state.

screenshot

1 answer:

Answer 0 (score: 1)

kops stores the cluster's state in S3.

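  1. Find the bucket where the state is stored.
  2. Set the state store: export KOPS_STATE_STORE=s3://your-k8s-state-store
  3. Run kops update cluster.
  4. If that fails, continue with the remaining steps.
  5. Terminate all instances that belong to the cluster.
  6. Run kops create cluster.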
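Steps 2 and 3 can be sketched as a small script. This is only an illustration under assumptions: the bucket name is the placeholder from the answer, the cluster name `mycluster.example.com` is hypothetical, and the `run` helper with its `DRY_RUN` switch is not part of kops; it is added here so that the script only prints the commands by default. The destructive steps 5 and 6 are deliberately left out.

```shell
#!/usr/bin/env bash
# Sketch of steps 2-3 above; all names are placeholders, adjust before use.
set -euo pipefail

# Step 2: point kops at the S3 bucket that holds the cluster state.
export KOPS_STATE_STORE="${KOPS_STATE_STORE:-s3://your-k8s-state-store}"
CLUSTER_NAME="${CLUSTER_NAME:-mycluster.example.com}"   # hypothetical cluster name
DRY_RUN="${DRY_RUN:-1}"                                  # 1 = only print the commands

# Helper (not part of kops): print the command in dry-run mode, else execute it.
run() {
  if [ "$DRY_RUN" = "1" ]; then
    echo "+ $*"
  else
    "$@"
  fi
}

# Step 3: re-apply the configuration stored in S3.
# Without --yes, kops only previews the changes.
run kops update cluster --name "$CLUSTER_NAME" --yes

# Check whether the master and nodes have rejoined the cluster.
run kops validate cluster --name "$CLUSTER_NAME"
```

Running it with `DRY_RUN=0` executes the commands for real; if `kops update cluster` fails, the script stops there, which corresponds to step 4.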

Note that the cluster's internal state is not in S3 but in etcd. The answer here has more details on this topic, including how to back up and restore etcd: How to restore kubernetes cluster using kops?
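Because the internal state lives on the etcd volumes, it may be worth snapshotting those volumes before terminating any instances. The sketch below is an assumption of this write-up, not something the answer prescribes: it assumes the etcd volumes are EBS volumes whose IDs you pass as arguments, and it uses the same hypothetical `run` dry-run helper. A snapshot of an in-use volume is only crash-consistent, so treat it as a safety net rather than a proper etcd backup.

```shell
#!/usr/bin/env bash
# Sketch: snapshot the EBS volumes that back etcd. Volume IDs come from you.
set -euo pipefail

DRY_RUN="${DRY_RUN:-1}"   # 1 = only print the commands

# Helper (hypothetical): print the command in dry-run mode, else execute it.
run() {
  if [ "$DRY_RUN" = "1" ]; then
    echo "+ $*"
  else
    "$@"
  fi
}

# Create one snapshot per volume ID given as an argument.
snapshot_volumes() {
  local vol
  for vol in "$@"; do
    run aws ec2 create-snapshot --volume-id "$vol" --description "etcd backup of $vol"
  done
}

snapshot_volumes "$@"
```

Saved under a name of your choice and invoked with the etcd volume IDs as arguments, the script prints one `aws ec2 create-snapshot` command per volume; set `DRY_RUN=0` to actually create the snapshots.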