我有一个带有连接到外部数据库的应用程序服务器的容器。为了实现冗余,我想运行多个Pod,因此我使用rolingupdate策略(maxSurge = 1和maxUnavailable = 1)将部署扩展到3。
有时(大多数情况下),由于我使用的是liquibase,导致pod首次创建失败,并且所有的pod都尝试同时锁定数据库。
对我来说,最简单的解决方案似乎是依次启动Pod。因此,启动Pod 1,等待60秒,然后启动Pod 2,等等。
这是有效的解决方案吗?如何在k8s(v1.14)中实现?
这是kubectl describe deploy
的输出:
Name: jx-apollon
Namespace: jx-staging
CreationTimestamp: Sun, 27 Oct 2019 21:28:07 +0100
Labels: chart=apollon-1.0.348
draft=draft-app
jenkins.io/chart-release=jx
jenkins.io/namespace=jx-staging
jenkins.io/version=4
Annotations: deployment.kubernetes.io/revision: 3
jenkins.io/chart: env
kubectl.kubernetes.io/last-applied-configuration:
{"apiVersion":"extensions/v1beta1","kind":"Deployment","metadata":{"annotations":{"jenkins.io/chart":"env"},"labels":{"chart":"apollon-1.0...
Selector: app=jx-apollon,draft=draft-app
Replicas: 0 desired | 0 updated | 0 total | 0 available | 0 unavailable
StrategyType: RollingUpdate
MinReadySeconds: 0
RollingUpdateStrategy: 1 max unavailable, 1 max surge
Pod Template:
Labels: app=jx-apollon
draft=draft-app
Init Containers:
postgres-listener-short:
Image: alpine
Port: <none>
Host Port: <none>
Command:
sh
-c
echo 'Waiting in init container for DB to become available.'; echo $DB; for i in $(seq 1 5); do echo 'nc ping' $i && nc -z -w3 $DB 5432 && echo 'DB is available, continuing now to application initialization.' && exit 0 || sleep 3; done; echo 'DB is not yet available.'; exit 1
Environment:
DB: jx-apollon-postgresql-db-alias
Mounts: <none>
postgres-listener-longer:
Image: alpine
Port: <none>
Host Port: <none>
Command:
sh
-c
echo 'Waiting in init container for DB to become available.'; echo $DB; for i in $(seq 1 100); do echo 'nc ping' $i && nc -z -w3 $DB 5432 && echo 'DB is available, continuing now to application initialization.' && exit 0 || sleep 3; done; echo 'DB is not yet available.'; exit 1
Environment:
DB: jx-apollon-postgresql-db-alias
Mounts: <none>
Containers:
apollon:
Image: <redacted>
Ports: 8080/TCP, 8443/TCP
Host Ports: 0/TCP, 0/TCP
Limits:
cpu: 2
memory: 6Gi
Requests:
cpu: 100m
memory: 3584Mi
Liveness: http-get http://:8080/ delay=60s timeout=1s period=10s #success=1 #failure=3
Readiness: http-get http://:8080/ delay=0s timeout=1s period=10s #success=1 #failure=3
Environment:
DB: jx-apollon-postgresql-db-alias
POSTGRES_PASSWORD: <redacted>
RULES_CLIENT: demo
_JAVA_OPTIONS: -XX:+UseContainerSupport -XX:MaxRAMPercentage=90.0 -XX:+UseG1GC
Mounts: <none>
Volumes: <none>
Conditions:
Type Status Reason
---- ------ ------
Available True MinimumReplicasAvailable
OldReplicaSets: <none>
NewReplicaSet: jx-apollon-95b4c77cb (0/0 replicas created)
Events: <none>
答案 0 :(得分:1)
我将initContainer部分更改为此代码,该代码一直等到它从liquibase创建的databasechangeloglock表中读取错误的值。
initContainers:
- name: postgres-listener
image: postgres
env:
- name: DB
value: jx-apollon-postgresql-db-alias
command: ['sh', '-c', '\
until psql -qtAX -h $DB -d postgres -c \
"select count(locked) from databasechangeloglock where locked = false group by locked";
do echo waiting for dbchangeloglock of postgres db to be false; sleep 2; done;
']