Kubernetes教程故障排除

时间:2019-07-08 20:06:02

标签: kubernetes rancher

我正在研究位于https://kubernetes.io/docs/tasks/job/coarse-parallel-processing-work-queue/#before-you-begin的粗略并行处理Kubernetes教程。我使用EC2实例在AWS上通过Rancher设置了集群。当我运行

 kubectl apply -f ./job.yaml
 kubectl describe jobs/job-wq-1

我收到以下输出

Name:           job-wq-1
Namespace:      default
Selector:       controller-uid=5f9e1780-a1b9-11e9-a6b7-026525d9a49a
Labels:         controller-uid=5f9e1780-a1b9-11e9-a6b7-026525d9a49a
                job-name=job-wq-1
Annotations:    kubectl.kubernetes.io/last-applied-configuration:
              {"apiVersion":"batch/v1","kind":"Job","metadata":{"annotations":{},"name":"job-wq-1","namespace":"default"},"spec":{"completions":8,"paral...
Parallelism:    2
Completions:    8
Start Time:     Mon, 08 Jul 2019 15:48:35 -0400
Pods Statuses:  0 Running / 0 Succeeded / 2 Failed
Pod Template:
Labels:  controller-uid=5f9e1780-a1b9-11e9-a6b7-026525d9a49a
       job-name=job-wq-1
Containers:
c:
Image:      mgladden/job-wq-1
Port:       <none>
Host Port:  <none>
Environment:
  BROKER_URL:  amqp://guest:guest@rabbitmq-service:5672
  QUEUE:       job1
Mounts:        <none>
Volumes:         <none>
Events:
Type     Reason                Age    From            Message
----     ------                ----   ----            -------
Normal   SuccessfulCreate      10m    job-controller  Created pod: job-wq-1-z8kn6
Normal   SuccessfulCreate      10m    job-controller  Created pod: job-wq-1-lqcfs
Normal   SuccessfulDelete      9m35s  job-controller  Deleted pod: job-wq-1-z8kn6
Normal   SuccessfulDelete      9m35s  job-controller  Deleted pod: job-wq-1-lqcfs

目前我不确定如何解决。似乎没有成功。可能是由于我的Rancher成立了吗?我确实在本教程中注意到,注释是空白的,并且我的工作是输出的。

1 个答案:

答案 0 :(得分:0)

感谢您的帮助。我检查了错误日志,发现以下错误“登录到AMQP服务器:发生套接字错误”在构建docker映像时,使用旧版14.04版本的ubuntu似乎是一个问题。当我切换到ubuntu的18.04版本时,教程按预期完成。