运行OpenShift OKD 3.10且无所事事和Ansible-连接被拒绝

时间:2018-12-23 08:39:21

标签: ansible vagrant openshift virtualbox okd

我已经尝试了几天,以使Openshift在Vagrant设置的单个虚拟机上运行并使用ansible进行安装。我经历过很多关于openshift-ansible的github问题,但最后还是没有运气。这就是我走了多远。因此,我从Vagrantfile开始,并使用CentosOS / 7作为框。当他们似乎从xfs切换文件系统时,这是我遇到的第一个错误,因为docker不想工作。因此,我查找了VM-Box Centos的变更日志,并将其降级为V1804.02。这就是我现在拥有的Vagrantfile

流浪文件

$lab_openshift = <<SCRIPT
yum -y update
yum install -y epel-release git docker httpd-tools java-1.8.0-openjdk-headless
yum install -y ansible python-passlib
systemctl start docker
systemctl enable docker
git clone -b release-3.10 https://github.com/openshift/openshift-ansible /root/openshift-ansible
ssh-keygen -f /root/.ssh/id_rsa -N ''
cp /root/.ssh/id_rsa.pub /root/.ssh/authorized_keys
ssh-keyscan 172.24.0.11 >> .ssh/known_hosts
cp .ssh/known_hosts /root/.ssh/known_hosts
ssh-copy-id -f -i /root/.ssh/id_rsa root@172.24.0.11
cp /home/vagrant/etc.ansible.hosts /etc/ansible/hosts
cp /home/vagrant/etc.selinux.config /etc/selinux/config

reboot
SCRIPT

Vagrant.configure(2) do |config|
 config.vm.define "openshift" do |conf|
    # conf.vm.box = "peru/my_centos-7-x86_64"
    # conf.vm.box_version = "20181211.01"
    conf.vm.box = "centos/7"
    config.vm.box_version = "1804.02"
    conf.vm.hostname = 'openshift.example.com'
    conf.vm.network "private_network", ip: "172.24.0.11"
    conf.vm.provision "file", source: "./etc.ansible.hosts", destination: "~/etc.ansible.hosts"
    conf.vm.provision "file", source: "./etc.selinux.config", destination: "~/etc.selinux.config"
    conf.vm.provider "virtualbox" do |v|
        v.memory = 6144
        v.cpus = 2
    end
    conf.vm.provision "shell", inline: $lab_openshift
 end
end

在执行或禁用SELinux时会出现错误,这是SELinux的配置

SELinux Conf

SELINUX=permissive

现在我从清单中的ansible hosts.localhost开始。我遇到的第一个错误是在docker_image_availability检查中。在GitHub上,ppl说您应该禁用它。因此,这是第一个更改。但是您仍然会通过安装遇到一些问题(错误消息“控制平面舱未弹出”)。因此,下一个更改是根据某些github问题将osm_etcd_image更改为osm_etcd_image = registry.access.redhat.com / rhel7 / etcd,这使我只能使用当前的ansible主机文件

主机

#bare minimum hostfile

[OSEv3:children]
masters
nodes
etcd

[OSEv3:vars]

osm_etcd_image=registry.access.redhat.com/rhel7/etcd
openshift_deployment_type=origin
openshift_release=v3.10
openshift_portal_net=172.30.0.0/16
openshift_disable_check=disk_availability,memory_availability,docker_image_availability

openshift_node_groups=[{'name': 'node-config-all-in-one', 'labels': ['node-role.kubernetes.io/master=true', 'node-role.kubernetes.io/infra=true', 'node-role.kubernetes.io/compute=true']}]


[masters]
172.24.0.11 ansible_connection=local

[etcd]
172.24.0.11 ansible_connection=local

[nodes]
# openshift_node_group_name should refer to a dictionary with matching key of name in list openshift_node_groups.
172.24.0.11 ansible_connection=local openshift_node_group_name="node-config-all-in-one"

不幸的是,由于它无法重试一次,我仍然无法使集群运行

/ bin / oc获取pod master-etcd-openshift.example.com -o json -n kube-system“

这似乎拒绝了连接并留下了此错误消息

  

与服务器openshift.example.com:8443的连接被拒绝-   您指定了正确的主机或端口吗?\ n“

现在,我刚刚发现了关于该错误的一些未解决的github问题,终于被卡住了。也许有人知道我在做什么错。

[编辑]

哦,我的任何etc / hosts都以 172.24.0.11 openshift.example.com 并ping通172.24.0.11和openshift.example.com成功

还有一件值得一提的事情是docker conttainer list -a还会告诉我,一个容器一直在尝试重新启动,但没有成功

  

ab4651c81600 96f98d080ffd“ / bin / bash -c'#!/ ...”大约一分钟前退出(255)35秒前k8s_api_master-api-openshift.example.com_kube-system_fabe879b27fee405485858817f14f32f_9

因此,这几乎是本期https://github.com/openshift/openshift-ansible/issues/9894中描述的内容,但是我无法弄清楚我的配置出了什么问题

[/ edit]

[edit2]

The log of the k8s_api_master container is also not really helping me out

I1224 11:46:42.874204       1 plugins.go:84] Registered admission plugin "NamespaceLifecycle"
I1224 11:46:42.874390       1 plugins.go:84] Registered admission plugin "Initializers"
I1224 11:46:42.874400       1 plugins.go:84] Registered admission plugin "ValidatingAdmissionWebhook"
I1224 11:46:42.874408       1 plugins.go:84] Registered admission plugin "MutatingAdmissionWebhook"
I1224 11:46:42.874420       1 plugins.go:84] Registered admission plugin "AlwaysAdmit"
I1224 11:46:42.874425       1 plugins.go:84] Registered admission plugin "AlwaysPullImages"
I1224 11:46:42.874432       1 plugins.go:84] Registered admission plugin "LimitPodHardAntiAffinityTopology"
I1224 11:46:42.874440       1 plugins.go:84] Registered admission plugin "DefaultTolerationSeconds"
I1224 11:46:42.874446       1 plugins.go:84] Registered admission plugin "AlwaysDeny"
I1224 11:46:42.874458       1 plugins.go:84] Registered admission plugin "EventRateLimit"
I1224 11:46:42.874465       1 plugins.go:84] Registered admission plugin "DenyEscalatingExec"
I1224 11:46:42.874470       1 plugins.go:84] Registered admission plugin "DenyExecOnPrivileged"
I1224 11:46:42.874477       1 plugins.go:84] Registered admission plugin "ExtendedResourceToleration"
I1224 11:46:42.874483       1 plugins.go:84] Registered admission plugin "OwnerReferencesPermissionEnforcement"
I1224 11:46:42.874495       1 plugins.go:84] Registered admission plugin "ImagePolicyWebhook"
I1224 11:46:42.874503       1 plugins.go:84] Registered admission plugin "InitialResources"
I1224 11:46:42.874509       1 plugins.go:84] Registered admission plugin "LimitRanger"
I1224 11:46:42.874517       1 plugins.go:84] Registered admission plugin "NamespaceAutoProvision"
I1224 11:46:42.874524       1 plugins.go:84] Registered admission plugin "NamespaceExists"
I1224 11:46:42.874530       1 plugins.go:84] Registered admission plugin "NodeRestriction"
I1224 11:46:42.874538       1 plugins.go:84] Registered admission plugin "PersistentVolumeLabel"
I1224 11:46:42.874544       1 plugins.go:84] Registered admission plugin "PodNodeSelector"
I1224 11:46:42.874552       1 plugins.go:84] Registered admission plugin "PodPreset"
I1224 11:46:42.874559       1 plugins.go:84] Registered admission plugin "PodTolerationRestriction"
I1224 11:46:42.874566       1 plugins.go:84] Registered admission plugin "ResourceQuota"
I1224 11:46:42.874573       1 plugins.go:84] Registered admission plugin "PodSecurityPolicy"
I1224 11:46:42.874579       1 plugins.go:84] Registered admission plugin "Priority"
I1224 11:46:42.874590       1 plugins.go:84] Registered admission plugin "SecurityContextDeny"
I1224 11:46:42.874598       1 plugins.go:84] Registered admission plugin "ServiceAccount"
I1224 11:46:42.874604       1 plugins.go:84] Registered admission plugin "DefaultStorageClass"
I1224 11:46:42.874611       1 plugins.go:84] Registered admission plugin "PersistentVolumeClaimResize"
I1224 11:46:42.874619       1 plugins.go:84] Registered admission plugin "StorageObjectInUseProtection"
F1224 11:47:12.886869       1 start_api.go:68] dial tcp 127.0.0.1:2379: connect: connection refused

[/ edit2]

1 个答案:

答案 0 :(得分:0)

好的,花了很多功夫,但我终于发现了问题所在。基本上,您的本地主机必须指向您在vagrantfile中定义的ip。否则它将无法正常工作。在etc / hosts中控制。我也在使用鹰式指标。如果您不想这样做,也不需要安装java-18.0-openjdk-headless

流浪文件

$lab_openshift = <<SCRIPT
yum -y update

yum install -y epel-release
echo "==================Installing PYTHON=================="
yum install -y python-pip python-devel python python-passlib

echo "==================Installing GIT=================="
yum install -y git

echo "==================Installing ANSIBLE=================="
yum install -y ansible

echo "==================Installing java-1.8.0-openjdk-headless================="
yum install -y java-1.8.0-openjdk-headless

cp /home/vagrant/etc.ansible.hosts /etc/ansible/hosts
cp /home/vagrant/etc.selinux.config /etc/selinux/config
cp /home/vagrant/etc.hosts /etc/hosts

git clone -b release-3.10 https://github.com/openshift/openshift-ansible /root/openshift-ansible


reboot
SCRIPT


Vagrant.configure(2) do |config|
 config.vm.define "openshift" do |conf|
    conf.vm.box = "centos/7"
    config.vm.box_version = "1804.02"
    conf.vm.hostname = 'openshift.example.com'
    conf.vm.network "private_network", ip: "172.24.0.11"
    conf.vm.provision "file", source: "./etc.ansible.hosts", destination: "~/etc.ansible.hosts"
    conf.vm.provision "file", source: "./etc.selinux.config", destination: "~/etc.selinux.config"
    conf.vm.provision "file", source: "./etc.hosts", destination: "~/etc.hosts"
    conf.vm.provider "virtualbox" do |v|
        v.memory = 6144
        v.cpus = 2
    end
    conf.vm.provision "shell", inline: $lab_openshift
 end
end

SELinux Config(etc.selinux.config)

SELINUX=permissive

etc / ansible / hosts(etc.ansible.hosts)

[OSEv3:children]
masters
nodes
etcd

[OSEv3:vars]

openshift_ip=172.24.0.11

openshift_deployment_type=origin
openshift_disable_check=memory_availability,disk_availability

ansible_service_broker_install=false
openshift_master_cluster_hostname=172.24.0.11
openshift_master_cluster_public_hostname=openshift.example.com
openshift_hostname=172.24.0.11
openshift_public_hostname=openshift.example.com
openshift_metrics_install_metrics=true
openshift_metrics_image_version=v3.10
openshift_master_default_subdomain=openshift.example.com

openshift_disable_check=disk_availability,memory_availability,docker_image_availability

    openshift_node_groups=[{'name': 'node-config-all-in-one', 'labels': ['node-role.kubernetes.io/master=true', 'node-role.kubernetes.io/infra=true', 'node-role.kubernetes.io/compute=true']}]


[masters]
172.24.0.11 ansible_connection=local

[etcd]
172.24.0.11 ansible_connection=local

[nodes]
172.24.0.11 ansible_connection=local openshift_node_group_name="node-config-all-in-one"

/ etc / hosts(等主机)

172.24.0.11    localhost   openshift.example.com   openshift
::1         localhost localhost.localdomain localhost6 localhost6.localdomain6