集群重启后,Kubernetes Pods陷入了ContainerCreating错误

时间:2019-12-25 14:28:47

标签: docker kubernetes ubuntu-16.04 nfs

重新启动kubernetes(v1.10.0)集群中的从属节点和主节点后,pod陷入了ContainerCreating错误。重新启动机器后, 我跑了:

sudo kubeadm init --apiserver-advertise-address=<master IP here> --token-ttl 100000h

(在主节点上) 和

sudo kubeadm join --token <token> <ip_Address:port> --discovery-token-ca-cert-hash sha256:<token hash>

kubectl get节点在主节点中运行时显示所有节点“就绪”,但吊舱卡在“ ContainerCreating”错误中。

kubectl describe pods/<podname>

显示此日志:

Namespace:    madhurap-namespace
Node:         knuth/10.24.6.74
Start Time:   Wed, 25 Dec 2019 19:21:19 +0530
Labels:       <none>
Annotations:  <none>
Status:       Pending
IP:           
Containers:
  py:
    Container ID:  
    Image:         ubuntu:16.04
    Image ID:      
    Port:          <none>
    Command:
      /bin/bash
      /storage/home/madhurap/test/simple.sh
    State:          Waiting
      Reason:       ContainerCreating
    Ready:          False
    Restart Count:  0
    Limits:
      alpha.kubernetes.io/nvidia-gpu:  1
      cpu:                             2
      memory:                          12Gi
    Requests:
      alpha.kubernetes.io/nvidia-gpu:  1
      cpu:                             2
      memory:                          12Gi
    Environment:                       <none>
    Mounts:
      /lib from lib (rw)
      /scratch/scratch6 from scratch6 (rw)
      /storage/home/madhurap from home (rw)
      /tools from tools (rw)
      /usr/bin/ from bin (rw)
      /usr/lib/ from usrlib (rw)
      /usr/local/nvidia/bin from nvbin (rw)
      /usr/local/nvidia/lib from nvlib (rw)
      /var/run/secrets/kubernetes.io/serviceaccount from default-token-tjpf5 (ro)
Conditions:
  Type           Status
  Initialized    True 
  Ready          False 
  PodScheduled   True 
Volumes:
  home:
    Type:       PersistentVolumeClaim (a reference to a PersistentVolumeClaim in the same namespace)
    ClaimName:  home
    ReadOnly:   false
  tools:
    Type:       PersistentVolumeClaim (a reference to a PersistentVolumeClaim in the same namespace)
    ClaimName:  tools
    ReadOnly:   false
  scratch6:
    Type:       PersistentVolumeClaim (a reference to a PersistentVolumeClaim in the same namespace)
    ClaimName:  scratch6
    ReadOnly:   false
  nvbin:
    Type:  HostPath (bare host directory volume)
    Path:  /usr/lib/nvidia-driver/bin
  nvlib:
    Type:  HostPath (bare host directory volume)
    Path:  /usr/lib/nvidia-driver
  usrlib:
    Type:  HostPath (bare host directory volume)
    Path:  /usr/lib/
  bin:
    Type:  HostPath (bare host directory volume)
    Path:  /usr/bin
  lib:
    Type:  HostPath (bare host directory volume)
    Path:  /lib
  default-token-tjpf5:
    Type:        Secret (a volume populated by a Secret)
    SecretName:  default-token-tjpf5
    Optional:    false
QoS Class:       Guaranteed
Node-Selectors:  kubernetes.io/hostname=knuth
Tolerations:     node.alpha.kubernetes.io/notReady:NoExecute for 300s
                 node.alpha.kubernetes.io/unreachable:NoExecute for 300s
Events:          <none>

我只是kubernetes的初学者,任何帮助将不胜感激。

0 个答案:

没有答案