Hazelcast terminates client connections after members are bounced

Time: 2017-05-10 10:53:10

Tags: cluster-computing hazelcast hazelcast-imap

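I'm using 3.8.1 and I'm noticing some issues with the client reconnect behavior after members are bounced. Despite setting the reconnect attempts, the clients often disconnect after logging repeated warnings about being unable to get a cluster connection, even though there is always at least one surviving member.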

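So why disconnect at all? I believe that as long as at least one member is alive, no client should disconnect. And why do the clients repeatedly log warnings after a member is bounced?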

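Test: I have 2 members that join. I have 3 clients that connect via TCP to both members. The clients are configured to reconnect 1000 times. Client 1 loops and simply puts a random value under the same key. Clients 2 and 3 are listeners on that map and log the updates.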

Steps: Start member 1 and member 2. Start clients 2 and 3 (listeners). Start client 1 (writer). Everything is fine. No warnings in the logs.

Bounce member 1 and wait for it to start. Bounce member 2. Bounce member 1.

Not always, but often, the clients report this:

Put 29: d16b0acd-d0d6-4722-b511-7fd975774f8c
May 10, 2017 11:41:10 AM com.hazelcast.client.spi.impl.ClusterListenerSupport
WARNING: hz.client_0 [dev] [3.8.1] Unable to get alive cluster connection, try in 0 ms later, attempt 4 of 1000.
May 10, 2017 11:41:10 AM com.hazelcast.client.spi.impl.ClusterListenerSupport
INFO: hz.client_0 [dev] [3.8.1] Trying to connect to [127.0.0.1]:5701 as owner member
Put 30: f91ec949-19bd-4039-95d3-28c7abd0f241
May 10, 2017 11:41:15 AM com.hazelcast.client.spi.impl.ClusterListenerSupport
INFO: hz.client_0 [dev] [3.8.1] Trying to connect to [127.0.0.1]:5702 as owner member
Put 31: 2af9fb36-d501-4b0b-9fc8-6b36d467a929
May 10, 2017 11:41:20 AM com.hazelcast.client.spi.impl.ClusterListenerSupport
INFO: hz.client_0 [dev] [3.8.1] Trying to connect to [127.0.0.1]:5701 as owner member
Put 32: f08bd6ca-a79f-44f4-9064-42e47953c37a

The clients can still operate and listen to events, but after some time they often disconnect:

May 10, 2017 11:25:04 AM com.hazelcast.core.LifecycleService
INFO: hz.client_0 [dev] [3.8.1] HazelcastClient 3.8.1 (20170411 - f1e9264) is SHUTTING_DOWN
May 10, 2017 11:25:04 AM com.hazelcast.client.connection.ClientConnectionManager
INFO: hz.client_0 [dev] [3.8.1] Removed connection to endpoint: [localhost]:5701, connection: ClientConnection{alive=false, connectionId=5, socketChannel=DefaultSocketChannelWrapper{socketChannel=java.nio.channels.SocketChannel[closed]}, remoteEndpoint=[localhost]:5701, lastReadTime=2017-05-10 11:25:04.021, lastWriteTime=2017-05-10 11:25:04.084, closedTime=2017-05-10 11:25:04.084, lastHeartbeatRequested=2017-05-10 11:25:03.834, lastHeartbeatReceived=2017-05-10 11:25:03.834, connected server version=3.8.1}
May 10, 2017 11:25:04 AM com.hazelcast.client.connection.ClientConnectionManager
INFO: hz.client_0 [dev] [3.8.1] Removed connection to endpoint: [localhost]:5702, connection: ClientConnection{alive=false, connectionId=6, socketChannel=DefaultSocketChannelWrapper{socketChannel=java.nio.channels.SocketChannel[closed]}, remoteEndpoint=[localhost]:5702, lastReadTime=2017-05-10 11:24:59.513, lastWriteTime=2017-05-10 11:25:04.084, closedTime=2017-05-10 11:25:04.084, lastHeartbeatRequested=2017-05-10 11:17:43.840, lastHeartbeatReceived=2017-05-10 11:17:43.840, connected server version=3.8.1}
May 10, 2017 11:25:04 AM com.hazelcast.core.LifecycleService
INFO: hz.client_0 [dev] [3.8.1] HazelcastClient 3.8.1 (20170411 - f1e9264) is SHUTDOWN

Client disconnected:

INFO: hz.client_0 [dev] [3.8.1] Removed connection to endpoint: [localhost]:5701, connection: ClientConnection{alive=false, connectionId=8, socketChannel=DefaultSocketChannelWrapper{socketChannel=java.nio.channels.SocketChannel[closed]}, remoteEndpoint=[localhost]:5701, lastReadTime=2017-05-10 11:45:31.791, lastWriteTime=2017-05-10 11:45:31.791, closedTime=2017-05-10 11:45:31.791, lastHeartbeatRequested=2017-05-10 11:45:23.604, lastHeartbeatReceived=2017-05-10 11:45:23.605, connected server version=3.8.1}
May 10, 2017 11:45:31 AM com.hazelcast.core.LifecycleService
INFO: hz.client_0 [dev] [3.8.1] HazelcastClient 3.8.1 (20170411 - f1e9264) is CLIENT_DISCONNECTED

The member reports:

May 10, 2017 11:47:32 AM com.hazelcast.client.impl.protocol.task.AuthenticationMessageTask
WARNING: [localhost]:5701 [dev] [3.8.1] Member having uuid 8847c2e3-2fcb-428f-a827-d0e24f5624a1 is not part of the cluster. Client Authentication rejected.
May 10, 2017 11:47:32 AM com.hazelcast.client.impl.protocol.task.AuthenticationMessageTask
WARNING: [localhost]:5701 [dev] [3.8.1] Received auth from Connection[id=222, /127.0.0.1:5701->/127.0.0.1:53883, endpoint=null, alive=true, type=NONE] with principal ClientPrincipal{uuid='f1e5d928-23b2-4fdf-bd3f-9db1778a5a8c', ownerUuid='8847c2e3-2fcb-428f-a827-d0e24f5624a1'} , authentication failed

Primary member:

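import com.hazelcast.config.Config;
import com.hazelcast.config.JoinConfig;
import com.hazelcast.core.Hazelcast;
import com.hazelcast.core.HazelcastInstance;
import org.junit.After;
import org.junit.Before;
import org.junit.Test;

public class HzNodeTest {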
    private HazelcastInstance service;

    @Before
    public void setUp() throws Exception {
        Config config = new Config();
        JoinConfig join = config.getNetworkConfig().setPort(5701).getJoin();
        join.getMulticastConfig().setEnabled(false);
        join.getAwsConfig().setEnabled(false);
        join.getTcpIpConfig().addMember("localhost:5701").addMember("localhost:5702").setEnabled(true);
        service = Hazelcast.newHazelcastInstance(config);
    }

    @After
    public void tearDown() throws Exception {
        service.shutdown();
    }

    @Test
    public void testStart() throws InterruptedException {
        Thread.sleep(1000000000);
    }
}

Secondary member:
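
import com.hazelcast.config.Config;
import com.hazelcast.config.JoinConfig;
import com.hazelcast.core.Hazelcast;
import com.hazelcast.core.HazelcastInstance;
import org.junit.After;
import org.junit.Before;
import org.junit.Test;

public class HzNodeSecondaryTest {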
    private HazelcastInstance service;

    @Before
    public void setUp() throws Exception {
        Config config = new Config();
        JoinConfig join = config.getNetworkConfig().setPort(5702).getJoin();
        join.getMulticastConfig().setEnabled(false);
        join.getAwsConfig().setEnabled(false);
        join.getTcpIpConfig().addMember("localhost:5701").addMember("localhost:5702").setEnabled(true);
        service = Hazelcast.newHazelcastInstance(config);
    }

    @After
    public void tearDown() throws Exception {
        service.shutdown();
    }

    @Test
    public void testStart() throws InterruptedException {
        Thread.sleep(1000000000);
    }
}

Listener:

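import com.hazelcast.client.HazelcastClient;
import com.hazelcast.client.config.ClientConfig;
import com.hazelcast.core.EntryEvent;
import com.hazelcast.core.HazelcastInstance;
import com.hazelcast.map.listener.EntryAddedListener;
import com.hazelcast.map.listener.EntryEvictedListener;
import com.hazelcast.map.listener.EntryRemovedListener;
import com.hazelcast.map.listener.EntryUpdatedListener;
import java.util.concurrent.atomic.AtomicLong;
import org.junit.After;
import org.junit.Before;
import org.junit.Test;

public class HzListenerTest {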
    private HazelcastInstance service;
    private AtomicLong counter = new AtomicLong();

    @Before
    public void setUp() throws Exception {
        ClientConfig clientConfig = new ClientConfig();
        clientConfig.getNetworkConfig().addAddress("localhost:5701").addAddress("localhost:5702").setConnectionAttemptLimit(1000);
        service = HazelcastClient.newHazelcastClient(clientConfig);
    }

    @After
    public void tearDown() throws Exception {
        service.shutdown();
    }

    @Test
    public void testListen() throws InterruptedException {
        service.getMap("TEST").addEntryListener(new Listener(), true);

        Thread.sleep(1000000);
    }

    private class Listener implements EntryAddedListener, EntryUpdatedListener, EntryRemovedListener,
            EntryEvictedListener {

        @Override
        public void entryAdded(EntryEvent event) {
            System.out.println("onAdded " + counter.getAndIncrement() + ": " + event);
        }

        @Override
        public void entryEvicted(EntryEvent event) {
            System.out.println("onEvicted " + counter.getAndIncrement() + ": " + event);
        }

        @Override
        public void entryRemoved(EntryEvent event) {
            System.out.println("onRemoved " + counter.getAndIncrement() + ": " + event);
        }

        @Override
        public void entryUpdated(EntryEvent event) {
            System.out.println("onUpdated " + counter.getAndIncrement() + ": "+ event);
        }
    }
}

Updater:

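import com.hazelcast.client.HazelcastClient;
import com.hazelcast.client.config.ClientConfig;
import com.hazelcast.core.HazelcastInstance;
import java.util.UUID;
import org.junit.After;
import org.junit.Before;
import org.junit.Test;

public class HzUpdaterTest {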
    private HazelcastInstance service;

    @Before
    public void setUp() throws Exception {
        ClientConfig clientConfig = new ClientConfig();
        clientConfig.getNetworkConfig().addAddress("localhost:5701").addAddress("localhost:5702").setConnectionAttemptLimit(1000);
        service = HazelcastClient.newHazelcastClient(clientConfig);
        service.getMap("TEST").put("1", UUID.randomUUID().toString());
    }

    @After
    public void tearDown() throws Exception {
        service.shutdown();
    }

    @Test
    public void testSpin() {
        for (int i = 0; i < 10000; i++) {
            try {
                String value = UUID.randomUUID().toString();
                service.getMap("TEST").put("1", value);
                System.out.println("Put " + i + ": " + value);
                Thread.sleep(5000);
            } catch (Exception ex) {
                System.out.println(ex);
            }
        }
    }
}

2 answers:

Answer 0 (score: 0)

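Whenever a Hazelcast client connects to a cluster, the first member the client can establish a connection to (this is when CLIENT_CONNECTED is fired) becomes that client's owner member. The client gets the cluster information and the addresses of the other members from the owner member. After that, the client knows all members and connects to them directly as needed.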
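
To watch these lifecycle events as members come and go, a listener can be registered on the client. The following is a minimal sketch, assuming the Hazelcast 3.x client library is on the classpath; the class name `LifecycleLoggingClient` and the sleep duration are illustrative, and the addresses and attempt limit are copied from the question.

```java
import com.hazelcast.client.HazelcastClient;
import com.hazelcast.client.config.ClientConfig;
import com.hazelcast.config.ListenerConfig;
import com.hazelcast.core.HazelcastInstance;
import com.hazelcast.core.LifecycleEvent;
import com.hazelcast.core.LifecycleListener;

// Hypothetical helper class: logs every client lifecycle state change.
public class LifecycleLoggingClient {
    public static void main(String[] args) throws InterruptedException {
        ClientConfig clientConfig = new ClientConfig();
        clientConfig.getNetworkConfig()
                .addAddress("localhost:5701")
                .addAddress("localhost:5702")
                .setConnectionAttemptLimit(1000);

        // Register the listener in the config, before the client starts,
        // so that the first CLIENT_CONNECTED event is not missed.
        clientConfig.addListenerConfig(new ListenerConfig(new LifecycleListener() {
            @Override
            public void stateChanged(LifecycleEvent event) {
                System.out.println("Lifecycle state: " + event.getState());
            }
        }));

        HazelcastInstance client = HazelcastClient.newHazelcastClient(clientConfig);
        Thread.sleep(600000); // keep the client alive while members are bounced
        client.shutdown();
    }
}
```

Running this next to the test clients should make it easier to correlate each member bounce with the state changes the client reports.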

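When the member you shut down happens to be the client's owner member, the CLIENT_DISCONNECTED event is fired. But within a short time (within the same second or so), the client should re-establish a connection to one of the remaining members.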
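
The owner-member behavior described in this answer can be modeled in a few lines of plain Java. This is a toy sketch, not Hazelcast's implementation: the class `OwnerFailoverSketch` and all of its methods are hypothetical names, and it assumes the configured addresses are tried in order on every attempt, without any delay between attempts.

```java
import java.util.ArrayList;
import java.util.Arrays;
import java.util.LinkedHashSet;
import java.util.List;
import java.util.Set;

// Toy model of owner-member selection and failover (not Hazelcast code).
public class OwnerFailoverSketch {
    private final List<String> addresses;                     // configured member addresses
    private final Set<String> alive = new LinkedHashSet<>();  // members currently up
    private final List<String> events = new ArrayList<>();    // lifecycle events, in order
    private String owner;                                     // current owner member, or null

    public OwnerFailoverSketch(List<String> addresses) {
        this.addresses = new ArrayList<>(addresses);
    }

    public void memberUp(String address) {
        alive.add(address);
    }

    // Marks a member as down; losing the owner fires CLIENT_DISCONNECTED.
    public void memberDown(String address) {
        alive.remove(address);
        if (address.equals(owner)) {
            owner = null;
            events.add("CLIENT_DISCONNECTED");
        }
    }

    // Tries the configured addresses in order, up to attemptLimit rounds.
    // The first reachable member becomes the owner and fires CLIENT_CONNECTED.
    public boolean connect(int attemptLimit) {
        for (int attempt = 1; attempt <= attemptLimit && owner == null; attempt++) {
            for (String address : addresses) {
                if (alive.contains(address)) {
                    owner = address;
                    events.add("CLIENT_CONNECTED");
                    break;
                }
            }
        }
        return owner != null;
    }

    public String owner() {
        return owner;
    }

    public List<String> events() {
        return events;
    }

    public static void main(String[] args) {
        OwnerFailoverSketch client =
                new OwnerFailoverSketch(Arrays.asList("localhost:5701", "localhost:5702"));
        client.memberUp("localhost:5701");
        client.memberUp("localhost:5702");
        client.connect(1000);
        System.out.println("owner=" + client.owner());

        client.memberDown("localhost:5701"); // bounce the owner member
        client.connect(1000);
        System.out.println("owner=" + client.owner() + " events=" + client.events());
    }
}
```

In this model a client only ends up without an owner when no configured member is reachable for the whole attempt budget, which is the expectation the question states.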

Answer 1 (score: 0)

In your logs I see

Unable to get alive cluster connection, try in 0 ms later, attempt 4 of 1000.

You could try this property

<property name="hazelcast.invalidation.reconciliation.interval.seconds">5</property>