Question

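I created an Azure AKS cluster with 3 nodes (Standard DS3 v2: 4 vCPUs, 14 GB memory). I was playing around with the cluster and created a deployment with 1000 replicas. After that, the whole cluster went down.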

azureuser@saa:~$ k get cs
NAME                 STATUS      MESSAGE                                                                                        ERROR
controller-manager   Unhealthy   Get http://127.0.0.1:10252/healthz: dial tcp 127.0.0.1:10252: getsockopt: connection refused   
scheduler            Unhealthy   Get http://127.0.0.1:10251/healthz: dial tcp 127.0.0.1:10251: getsockopt: connection refused   
etcd-0               Healthy     {"health": "true"}

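From debugging, it looks like both the scheduler and the controller-manager have failed. How can I fix this?

What exactly happened when I created a deployment with 1000 replicas? Shouldn't k8s take care of that?

Output of a few debugging commands: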

  kubectl cluster-info
    Kubernetes master is running at https://cg-games-e5252212.hcp.eastus.azmk8s.io:443
    Heapster is running at https://cg-games-e5252212.hcp.eastus.azmk8s.io:443/api/v1/namespaces/kube-system/services/heapster/proxy
    KubeDNS is running at https://cg-games-e5252212.hcp.eastus.azmk8s.io:443/api/v1/namespaces/kube-system/services/kube-dns:dns/proxy
    kubernetes-dashboard is running at https://cg-games-e5252212.hcp.eastus.azmk8s.io:443/api/v1/namespaces/kube-system/services/kubernetes-dashboard/proxy

Logs from kubectl cluster-info dump @ http://termbin.com/e6wb

azureuser@sim:~$ az aks scale -n cg -g cognitive-games -c 4 --verbose
Deployment failed. Correlation ID: 4df797b2-28bf-4c18-a26a-4e341xxxxx. Operation failed with status: 200. Details: Resource state Failed

No nodes are shown:

azureuser@si:~$ k get nodes
No resources found

Answer 1

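The cluster has limited resources (4 vCPUs and 14 GB of memory per node). If you try to deploy a large number of pods on such limited resources, the scheduler will not be able to schedule them, because the scheduler assigns pods to nodes based on the resources available.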
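To make the resource argument concrete, here is a rough sketch in Python. The node size (3 nodes, 4 vCPUs, 14 GB each) and the 1000 replicas come from the question, and the cap of 100 pods per node is the figure this answer cites. The per-pod requests (100 millicores, 128 MiB) are purely hypothetical, since the question does not say what the deployment requested, and the function name `schedulable_pods` is made up for illustration. It also ignores system pods and reserved resources, so the real number would be lower.

```python
def schedulable_pods(nodes, cpu_millicores_per_node, memory_mib_per_node,
                     pod_cpu_millicores, pod_memory_mib, max_pods_per_node):
    """Rough upper bound on how many identical pods fit on the cluster.

    Ignores kube-system pods and resources reserved for the node itself.
    """
    if pod_cpu_millicores <= 0 or pod_memory_mib <= 0:
        raise ValueError("pod requests must be positive")
    by_cpu = cpu_millicores_per_node // pod_cpu_millicores
    by_memory = memory_mib_per_node // pod_memory_mib
    # A node is full as soon as any one of the three limits is reached.
    per_node = min(by_cpu, by_memory, max_pods_per_node)
    return nodes * per_node


requested = 1000  # replicas from the question
capacity = schedulable_pods(
    nodes=3,
    cpu_millicores_per_node=4000,     # 4 vCPUs
    memory_mib_per_node=14 * 1024,    # 14 GB
    pod_cpu_millicores=100,           # hypothetical request
    pod_memory_mib=128,               # hypothetical request
    max_pods_per_node=100,            # per-node cap cited in this answer
)
print(f"capacity: {capacity}, requested: {requested}, "
      f"left pending: {max(0, requested - capacity)}")
```

Even with tiny requests, the per-node pod cap alone limits three nodes to 300 pods, so most of the 1000 replicas cannot be placed.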

Number of pods per node

No more than 100 pods per node

As you can see in the kubectl cluster-info dump:

"reason": "FailedScheduling",

"message": "no nodes available to schedule pods",
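If you want to pull such events out of a dump yourself, a minimal Python sketch follows. It assumes the events are available as a single JSON object with an `items` list whose entries carry `reason` and `message` fields, as in the excerpt above. The sample data and the function name `failed_scheduling_messages` are hypothetical; the real dump at the termbin link may be laid out differently (for example, as several JSON documents one after another).

```python
import json

# Hypothetical sample shaped like the excerpt above; not the real dump.
SAMPLE_DUMP = """
{"items": [
  {"reason": "FailedScheduling", "message": "no nodes available to schedule pods"},
  {"reason": "Scheduled", "message": "hypothetical example event"}
]}
"""


def failed_scheduling_messages(dump_text):
    """Return the messages of all events whose reason is FailedScheduling."""
    events = json.loads(dump_text).get("items", [])
    return [
        event.get("message", "")
        for event in events
        if event.get("reason") == "FailedScheduling"
    ]


messages = failed_scheduling_messages(SAMPLE_DUMP)
for message in messages:
    print(message)
```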

Answer 2

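It sounds silly, but when AKS is created in a resource group (RG), two RGs are surprisingly created: the one containing AKS itself, and another RG with a random hash in its name that holds all the VMs. I deleted the second RG, and the underlying AKS stopped working.

Azure could not find any nodes.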
