重新启动kubernetes(v1.10.0)集群中的从属节点和主节点后,pod陷入了ContainerCreating错误。重新启动机器后, 我跑了:
sudo kubeadm init --apiserver-advertise-address=<master IP here> --token-ttl 100000h
(在主节点上) 和
sudo kubeadm join --token <token> <ip_Address:port> --discovery-token-ca-cert-hash sha256:<token hash>
kubectl get节点在主节点中运行时显示所有节点“就绪”,但吊舱卡在“ ContainerCreating”错误中。
kubectl describe pods/<podname>
显示此日志:
Namespace: madhurap-namespace
Node: knuth/10.24.6.74
Start Time: Wed, 25 Dec 2019 19:21:19 +0530
Labels: <none>
Annotations: <none>
Status: Pending
IP:
Containers:
py:
Container ID:
Image: ubuntu:16.04
Image ID:
Port: <none>
Command:
/bin/bash
/storage/home/madhurap/test/simple.sh
State: Waiting
Reason: ContainerCreating
Ready: False
Restart Count: 0
Limits:
alpha.kubernetes.io/nvidia-gpu: 1
cpu: 2
memory: 12Gi
Requests:
alpha.kubernetes.io/nvidia-gpu: 1
cpu: 2
memory: 12Gi
Environment: <none>
Mounts:
/lib from lib (rw)
/scratch/scratch6 from scratch6 (rw)
/storage/home/madhurap from home (rw)
/tools from tools (rw)
/usr/bin/ from bin (rw)
/usr/lib/ from usrlib (rw)
/usr/local/nvidia/bin from nvbin (rw)
/usr/local/nvidia/lib from nvlib (rw)
/var/run/secrets/kubernetes.io/serviceaccount from default-token-tjpf5 (ro)
Conditions:
Type Status
Initialized True
Ready False
PodScheduled True
Volumes:
home:
Type: PersistentVolumeClaim (a reference to a PersistentVolumeClaim in the same namespace)
ClaimName: home
ReadOnly: false
tools:
Type: PersistentVolumeClaim (a reference to a PersistentVolumeClaim in the same namespace)
ClaimName: tools
ReadOnly: false
scratch6:
Type: PersistentVolumeClaim (a reference to a PersistentVolumeClaim in the same namespace)
ClaimName: scratch6
ReadOnly: false
nvbin:
Type: HostPath (bare host directory volume)
Path: /usr/lib/nvidia-driver/bin
nvlib:
Type: HostPath (bare host directory volume)
Path: /usr/lib/nvidia-driver
usrlib:
Type: HostPath (bare host directory volume)
Path: /usr/lib/
bin:
Type: HostPath (bare host directory volume)
Path: /usr/bin
lib:
Type: HostPath (bare host directory volume)
Path: /lib
default-token-tjpf5:
Type: Secret (a volume populated by a Secret)
SecretName: default-token-tjpf5
Optional: false
QoS Class: Guaranteed
Node-Selectors: kubernetes.io/hostname=knuth
Tolerations: node.alpha.kubernetes.io/notReady:NoExecute for 300s
node.alpha.kubernetes.io/unreachable:NoExecute for 300s
Events: <none>
我只是kubernetes的初学者,任何帮助将不胜感激。