在3节点Vertica群集中创建/启动数据库时出错

时间:2016-08-02 15:27:02

标签: amazon-web-services vertica

我正在使用Vertica社区版7.2.3并在AWS VPC上设置3节点集群。 我的install_vertica脚本运行正常。创建数据库时,我通过adminTools发出命令的节点出现,但两个节点的其余部分都停止了。

Node Status: v_mpp_test_node0001: (DOWN) v_mpp_test_node0002: (UP) v_mpp_test_node0003: (DOWN)

根据&my.vertica.com/docs/Ecosystem/Amazon/HP_Vertica_7.1.x_Vertica_AWS.pdf'所有必需的端口;是开放的。 在3个服务器之间存在密码少ssh。

以下是netstat输出:

[root@ip-10-0-3-xxx ec2-user]# netstat -an | egrep 'tcp|udp'
tcp        0      0 10.0.2.185:4803             0.0.0.0:*                   LISTEN
tcp        0      0 0.0.0.0:5444                0.0.0.0:*                   LISTEN
tcp        0      0 0.0.0.0:36582               0.0.0.0:*                   LISTEN
tcp        0      0 0.0.0.0:111                 0.0.0.0:*                   LISTEN
tcp        0      0 0.0.0.0:22                  0.0.0.0:*                   LISTEN
tcp        0      0 127.0.0.1:631               0.0.0.0:*                   LISTEN
tcp        0      0 0.0.0.0:5433                0.0.0.0:*                   LISTEN
tcp        0      0 127.0.0.1:25                0.0.0.0:*                   LISTEN
tcp        0      0 0.0.0.0:5434                0.0.0.0:*                   LISTEN
tcp        0      0 10.0.2.185:22               10.0.200.5:53124            ESTABLISHED
tcp        0      0 10.0.2.185:5434             10.0.2.185:36504            ESTABLISHED
tcp        0      0 10.0.2.185:36504            10.0.2.185:5434             ESTABLISHED
tcp        0      0 :::111                      :::*                        LISTEN
tcp        0      0 :::22                       :::*                        LISTEN
tcp        0      0 ::1:631                     :::*                        LISTEN
tcp        0      0 :::5433                     :::*                        LISTEN
tcp        0      0 ::1:25                      :::*                        LISTEN
udp        0      0 0.0.0.0:111                 0.0.0.0:*
udp        0      0 0.0.0.0:631                 0.0.0.0:*
udp        0      0 10.0.2.185:123              0.0.0.0:*
udp        0      0 127.0.0.1:123               0.0.0.0:*
udp        0      0 0.0.0.0:123                 0.0.0.0:*
udp        0      0 0.0.0.0:33666               0.0.0.0:*
udp        0      0 0.0.0.0:935                 0.0.0.0:*
udp        0      0 10.0.2.185:5433             0.0.0.0:*
udp        0      0 10.0.2.185:4803             0.0.0.0:*
udp        0      0 10.0.2.185:4804             0.0.0.0:*
udp        0      0 0.0.0.0:68                  0.0.0.0:*
udp        0      0 0.0.0.0:42327               0.0.0.0:*
udp        0      0 :::111                      :::*
udp        0      0 fe80::8bf:7fff:fe9f:bd1b:123 :::*
udp        0      0 ::1:123                     :::*
udp        0      0 :::123                      :::*
udp        0      0 :::935                      :::*

[root@ip-10-0-2-185 ec2-user]# nc -vz -u 10.0.2.186 4803
Connection to 10.0.2.186 4803 port [udp/notateit-disc] succeeded!
[root@ip-10-0-2-185 ec2-user]# nc -vz -u 10.0.2.184 4803
Connection to 10.0.2.184 4803 port [udp/notateit-disc] succeeded!

但是我在运行vnetpref时遇到以下错误

[dbadmin@ip-10-0-2-184 ~]$/opt/vertica/bin/vnetperf
2016-08-02_13:10:28,619 ERROR: [Connector Thread 10.0.2.186 0x7fe59578e700] Couldn't connect to 10.0.2.186 (family 2, attempt 0): Connection timed out; errno=110 (Connection timed out)
2016-08-02_13:10:28,619 ERROR: [Connector Thread 10.0.2.185 0x7fe59618f700] Couldn't connect to 10.0.2.185 (family 2, attempt 0): Connection timed out; errno=110 (Connection timed out)
2016-08-02_13:11:32,619 ERROR: [Connector Thread 10.0.2.186 0x7fe59578e700] Couldn't connect to 10.0.2.186 (family 2, attempt 1): Connection timed out; errno=110 (Connection timed out)
2016-08-02_13:11:32,619 ERROR: [Connector Thread 10.0.2.185 0x7fe59618f700] Couldn't connect to 10.0.2.185 (family 2, attempt 1): Connection timed out; errno=110 (Connection timed out)
2016-08-02_13:12:36,619 ERROR: [Connector Thread 10.0.2.186 0x7fe59578e700] Couldn't connect to 10.0.2.186 (family 2, attempt 2): Connection timed out; errno=110 (Connection timed out)
2016-08-02_13:12:36,619 ERROR: [Connector Thread 10.0.2.185 0x7fe59618f700] Couldn't connect to 10.0.2.185 (family 2, attempt 2): Connection timed out; errno=110 (Connection timed out)
2016-08-02_13:12:37,619 ERROR: [Connector Thread 10.0.2.186 0x7fe59578e700] Could not find anything to connect to for 10.0.2.186; errno=110 (Connection timed out)
2016-08-02_13:12:37,619 ERROR: [Connector Thread 10.0.2.185 0x7fe59618f700] Could not find anything to connect to for 10.0.2.185; errno=110 (Connection timed out)
2016-08-02_13:12:37,619 ERROR: [main 0x7fe596b92720] Caught error: Unable to connect to host 10.0.2.185:14159
Unable to connect to host 10.0.2.186:14159; errno=0 (Success)

我还必须专门打开端口14159吗?可能是什么问题? 如果需要任何特定日志,请告诉我。

1 个答案:

答案 0 :(得分:0)

是的,服务需要将端口14159打开到调用实例。你试过了吗?