我正在尝试使用etcd配置启动kubernetes api服务器(kubernetes使用go-etcd,它有一个方法从配置文件中读取所有参数):
{
"cluster": {
"machines": [ "https://my-public-hostname:2379" ]
},
"config": {
"certFile": "/etc/ssl/etcd/client.pem",
"keyFile": "/etc/ssl/etcd/client.key.pem",
"caCertFiles": [
"/etc/ssl/etcd/ca.pem"
],
"timeout": 5,
"consistency": "WEAK"
}
}
但是kube-apiserver失败了,因为它无法成功达到etcd。我认为这是因为它试图同步集群......但我不知道。
我使用内部ips为广告和客户端地址创建了一个(etcd)集群,但listen-client-urls除外,它设置为0.0.0.0/0。此外,整个群集位于负载均衡器后面,可通过my-public-hostname
访问。
在容器内部(因为我正在使用hyperkube
),除非我设置'--no-sync'参数,否则etcdctl
将无效。如果我使用没有该参数的etcdctl,它会像kube-apiserver那样可疑地失败。但我无法检查kubernetes中执行群集同步化的代码片段......
有什么想法吗?
提前致谢。
修改
这似乎与kubernetes(https://github.com/coreos/go-etcd)中当前的etcd客户端有关,这不是最新的(https://github.com/coreos/etcd/client)。我根据经验进行测试,“etcd / client”有效,但“go-etcd”没有,你可以在这里查看这个测试: https://github.com/glerchundi/etcd-go-clients-test 。
值得注意的是,正在进行的工作是将go-etcd迁移到kubernetes中的etcd / client:https://github.com/kubernetes/kubernetes/issues/11962。
Kubernetes团队的任何人都可以证实这一点吗?
附录1
我正在尝试在CoreOS中运行kubernetes并且:flannel
正常工作,locksmithd
正常工作,fleet
正常工作(他们使用完全相同的etcd客户端凭据访问etcd)所以很可能与kubernetes如何访问etcd端点相关的东西。
附录2 (这些命令在超文本容器中执行,具体来说就是这个命令:gcr.io/google_containers/hyperkube:v1.0.6
)
没有--no-sync的etcdctl输出失败:
root@98b2524464f1:/# etcdctl --cert-file="/etc/ssl/etcd/client.pem" --key-file="/etc/ssl/etcd/client.key.pem" --ca-file="/etc/ssl/etcd/ca.pem" --peers="http//my-public-hostname:2379" ls /
Error: 501: All the given peers are not reachable (failed to propose on members [https://10.1.0.1:2379 https://10.1.0.0:2379 https://10.1.0.2:2379] twice [last error: Get https://10.1.0.0:2379/v2/keys/?quorum=false&recursive=false&sorted=false: dial tcp 10.1.0.0:2379: i/o timeout]) [0]
与此相关的kube-apiserver:
root@98b2524464f1:/# /hyperkube \
apiserver \
--bind-address=0.0.0.0 \
--etcd_config=/etc/kubernetes/ssl/etcd.json \
--allow-privileged=true \
--service-cluster-ip-range=10.3.0.0/24 \
--secure_port=443 \
--advertise-address=10.0.0.2 \
--admission-control=NamespaceLifecycle,NamespaceExists,LimitRanger,SecurityContextDeny,ServiceAccount,ResourceQuota \
--tls-cert-file=/etc/kubernetes/ssl/apiserver.pem \
--tls-private-key-file=/etc/kubernetes/ssl/apiserver.key.pem \
--client-ca-file=/etc/kubernetes/ssl/ca.pem \
--service-account-key-file=/etc/kubernetes/ssl/apiserver.key.pem
F1002 09:47:29.348527 384 controller.go:80] Unable to perform initial IP allocation check: unable to refresh the service IP block: 501: All the given peers are not reachable (failed to propose on members [https://my-public-hostname:2379] twice [last error: Get https://my-public-hostname:2379/v2/keys/registry/ranges/serviceips?quorum=false&recursive=false&sorted=false: dial tcp: i/o timeout]) [0]
附录3
etcd #0:
etcd2:
name: etcd0
initial-cluster-state: new
initial-cluster: etcd0=http://10.1.0.0:2380,etcd1=http://10.1.0.1:2380,etcd2=http://10.1.0.2:2380
data-dir: /var/lib/etcd2
advertise-client-urls: https://10.1.0.0:2379
initial-advertise-peer-urls: http://10.1.0.0:2380
listen-client-urls: https://0.0.0.0:2379
listen-peer-urls: http://10.1.0.0:2380
client-cert-auth: true
trusted-ca-file: /etc/ssl/etcd/certs/ca-chain.cert.pem
cert-file: /etc/ssl/etcd/certs/etcd-server.cert.pem
key-file: /etc/ssl/etcd/private/etcd-server.key.pem
etcd #1:
etcd2:
name: etcd1
initial-cluster-state: new
initial-cluster: etcd0=http://10.1.0.0:2380,etcd1=http://10.1.0.1:2380,etcd2=http://10.1.0.2:2380
data-dir: /var/lib/etcd2
advertise-client-urls: https://10.1.0.1:2379
initial-advertise-peer-urls: http://10.1.0.1:2380
listen-client-urls: https://0.0.0.0:2379
listen-peer-urls: http://10.1.0.1:2380
client-cert-auth: true
trusted-ca-file: /etc/ssl/etcd/certs/ca-chain.cert.pem
cert-file: /etc/ssl/etcd/certs/etcd-server.cert.pem
key-file: /etc/ssl/etcd/private/etcd-server.key.pem
etcd #2:
etcd2:
name: etcd2
initial-cluster-state: new
initial-cluster: etcd0=http://10.1.0.0:2380,etcd1=http://10.1.0.1:2380,etcd2=http://10.1.0.2:2380
data-dir: /var/lib/etcd2
advertise-client-urls: https://10.1.0.2:2379
initial-advertise-peer-urls: http://10.1.0.2:2380
listen-client-urls: https://0.0.0.0:2379
listen-peer-urls: http://10.1.0.2:2380
client-cert-auth: true
trusted-ca-file: /etc/ssl/etcd/certs/ca-chain.cert.pem
cert-file: /etc/ssl/etcd/certs/etcd-server.cert.pem
key-file: /etc/ssl/etcd/private/etcd-server.key.pem
答案 0 :(得分:2)
最后,我找出导致此问题的原因。超时未正确定义,因为go-etcd
将json超时值解组为time.Duration,其使用纳秒作为基本单位。因此,如果值为1,则应写入1000000000。
按照上面的例子:
{
"cluster": {
"machines": [ "https://my-public-hostname:2379" ]
},
"config": {
"certFile": "/etc/ssl/etcd/client.pem",
"keyFile": "/etc/ssl/etcd/client.key.pem",
"caCertFiles": [
"/etc/ssl/etcd/ca.pem"
],
"timeout": 5000000000,
"consistency": "WEAK"
}
}