rancher 的 rke 无法在新集群上启动

时间:2021-01-30 02:56:51

标签: kubernetes rancher rke

/opt/kubernetes/bin/rke up --config /home/msh/rancher-cluster.yml

rancher-cluser.yml 文件包含:

nodes:
  - address: 192.168.10.34
    internal_address: 172.17.0.2
    user: bsh
    role: [controlplane,etcd]
  - address: 192.168.10.35
    internal_address: 172.17.0.3
    user: bsh
    role: [worker]
  - address: 192.168.10.36 
    internal_address: 172.17.0.4
    user: bsh
    role: [worker]
add_job_timeout: 120

注意:我没有在任何节点上配置任何接口 internal_address。我的理解是 rancher/k8s 会为我设置这些。 . .什么的。

这是 rke 无法启动的结尾。

INFO[0039] Removing container [rke-bundle-cert] on host [192.168.10.34], try #1 
INFO[0039] Image [rancher/rke-tools:v0.1.69] exists on host [192.168.10.34] 
INFO[0039] Starting container [rke-log-linker] on host [192.168.10.34], try #1 
INFO[0040] [etcd] Successfully started [rke-log-linker] container on host [192.168.10.34] 
INFO[0040] Removing container [rke-log-linker] on host [192.168.10.34], try #1 
INFO[0040] [remove/rke-log-linker] Successfully removed container on host [192.168.10.34] 
INFO[0040] [etcd] Successfully started etcd plane.. Checking etcd cluster health 
WARN[0055] [etcd] host [192.168.10.34] failed to check etcd health: failed to get /health for host [192.168.10.34]: Get https://172.17.0.2:2379/health: Unable to access the service on 172.17.0.2:2379. The service might be still starting up. Error: ssh: rejected: connect failed (Connection refused) 
FATA[0055] [etcd] Failed to bring up Etcd Plane: etcd cluster is unhealthy: hosts [192.168.10.34] failed to report healthy. Check etcd container logs on each host for more information 

使用:
Rancher v2.5.2
rke v1.0.16 版本
docker-ce-19.03.14-3.el8.x86_64

1 个答案:

答案 0 :(得分:1)

根据我的理解,接口配置必须预先存在。 RKE 不会负责接口配置。

因此,要么设置一个内部子网并将您的接口分配给它,要么将外部地址也用于内部通信。