在连接到3节点Apache kafka集群中的zookeeper集群时,zk无法连接到其他zk集群节点。 请提出如何调试此问题? 这个apache kafka群集运行良好,直到上个月,现在已经观察到此错误。
另外,在启动docker-compose时,已经观察到与python2有关的密码学错误,因此使用
安装了cryptography == 2.4.2。pip install cryptography==2.4.2
在zookeeper docker日志中观察到以下错误:
[2020-09-04 17:46:14,711] INFO Notification time out: 60000 (org.apache.zookeeper.server.quorum.FastLeaderElection)
[2020-09-04 17:47:10,370] INFO Accepted socket connection from /10.10.6.5:38930 (org.apache.zookeeper.server.NIOServerCnxnFactory)
[2020-09-04 17:47:12,511] INFO Processing stat command from /10.10.6.5:38930 (org.apache.zookeeper.server.NIOServerCnxn)
[2020-09-04 17:47:12,512] INFO Closed socket connection for client /10.10.6.5:38930 (no session established for client) (org.apache.zookeeper.server.NIOServerCnxn)
[2020-09-04 17:47:19,712] WARN Cannot open channel to 2 at election address /10.10.7.5:3888 (org.apache.zookeeper.server.quorum.QuorumCnxManager)
java.net.SocketTimeoutException: connect timed out
at java.net.PlainSocketImpl.socketConnect(Native Method)
at java.net.AbstractPlainSocketImpl.doConnect(AbstractPlainSocketImpl.java:350)
at java.net.AbstractPlainSocketImpl.connectToAddress(AbstractPlainSocketImpl.java:206)
at java.net.AbstractPlainSocketImpl.connect(AbstractPlainSocketImpl.java:188)
at java.net.SocksSocketImpl.connect(SocksSocketImpl.java:392)
at java.net.Socket.connect(Socket.java:589)
at org.apache.zookeeper.server.quorum.QuorumCnxManager.connectOne(QuorumCnxManager.java:381)
at org.apache.zookeeper.server.quorum.QuorumCnxManager.connectAll(QuorumCnxManager.java:426)
at org.apache.zookeeper.server.quorum.FastLeaderElection.lookForLeader(FastLeaderElection.java:843)
at org.apache.zookeeper.server.quorum.QuorumPeer.run(QuorumPeer.java:822)
如果删除与群集节点的zookeeper相关的配置,并且在单节点中运行,它将正常工作。仅当我尝试在3节点群集中运行时才会看到问题。
在AWS实例中打开了端口2888和3888。