没有x的物理地址,在jgroups集群中重新启动节点上的消息

时间:2016-04-28 05:47:53

标签: jgroups

我的节点日志充满警告消息"将单播消息丢弃到错误的目的地"当重新启动集群中的一个节点时。 我们正在使用Jgroups,TCP,jgroups-3.4.1.Final的版本。 我的服务器没有出现,这些警告信息被传染性地抛出

以下是警告信息  [0; 33mWARN [TransferQueueBundler,h-broadcast,h-13] [TCP] JGRP000032:h-13:没有物理地址8281f201-7fb1-f6ac-faf3-d6837bc39087,丢弃消息

[0; 33mWARN [INT-1,h-broadcast,h-13] [TCP] JGRP000031:h-13:将单播消息丢弃到错误的目的地d205fcba-151c-ad58-8323-fe4f49117f88

请让我知道如何解决此问题

谢谢, Nivedita

<TCP loopback="true" 
    recv_buf_size="${tcp.recv_buf_size:20M}" 
    send_buf_size="${tcp.send_buf_size:640K}"
    discard_incompatible_packets="true" 
    max_bundle_size="64K" 
    max_bundle_timeout="5" 
    enable_bundling="true" 
    use_send_queues="true"
    sock_conn_timeout="300" 
    timer_type="new" 
    timer.min_threads="4" 
    timer.max_threads="10" 
    timer.keep_alive_time="3000"
    timer.queue_max_size="500" 
    thread_pool.enabled="true" 
    thread_pool.min_threads="4" 
    thread_pool.max_threads="10"
    thread_pool.keep_alive_time="5000" 
    thread_pool.queue_enabled="true" 
    thread_pool.queue_max_size="100000"
    thread_pool.rejection_policy="discard" 
    oob_thread_pool.enabled="true" 
    oob_thread_pool.min_threads="1"
    oob_thread_pool.max_threads="8" 
    oob_thread_pool.keep_alive_time="5000" 
    oob_thread_pool.queue_enabled="false"
    oob_thread_pool.queue_max_size="100" 
    oob_thread_pool.rejection_policy="discard" 
    bind_addr="${hybris.jgroups.bind_addr}" 
    bind_port="${hybris.jgroups.bind_port}" />
<TCPPING timeout="3000" 
    initial_hosts="xxx.xx.xx.4[7800],xxx.xx.xx.5[7800],xxx.xx.xx.6[7800], xxx.xx.xx.7[7800], xxx.xx.xx.8[7800], xxx.xx.xx.9[7800], xxx.xx.xx.10[7800], xxx.xx.xx.11[7800], xxx.xx.xx.12[7800], xxx.xx.xx.13[7800], xxx.xx.xx.68[7800], xxx.xx.xx.69[7800], xxx.xx.xx.70[7800], xxx.xx.xx.4[7800], xxx.xx.xx.5[7800], xxx.xx.xx.6[7800]" 
    num_initial_members="16"/>

<MERGE2 min_interval="10000" max_interval="30000" />
    <FD_SOCK />
    <FD timeout="3000" max_tries="3" />
    <VERIFY_SUSPECT timeout="1500" />
    <BARRIER />
    <pbcast.NAKACK use_mcast_xmit="false" exponential_backoff="500" discard_delivered_msgs="true" />
    <UNICAST2 />
    <pbcast.STABLE stability_delay="1000" desired_avg_gossip="50000" max_bytes="4M" />
    <pbcast.GMS print_local_addr="true" join_timeout="3000" view_bundling="true" />
    <UFC max_credits="20M" min_threshold="0.4" />
    <MFC max_credits="20M" min_threshold="0.4" />
    <FRAG2 frag_size="60K" />
    <pbcast.STATE_TRANSFER />

3 个答案:

答案 0 :(得分:3)

非常感谢您的建议。 当其中一个有问题的节点被删除时,群集节点自我修复(与其他能够远程登录的节点相比,它无法进行远程登录)

答案 1 :(得分:2)

我假设您正在使用TCP:TCPPING?您是否在TCPPING.initial_hosts列出所有成员?这是上述警告的最可能原因。

在每个成员中都有一个缓存映射UUID(JGroups&#39;集群成员的内部表示)到物理地址。

您可以通过JMX或probe.sh uuids查看内容。 h13 8281f201-7fb1-f6ac-faf3-d6837bc39087应该有h13的映射,但它已丢失。同样,很可能因为TCPPING中未列出MPING

您可以尝试其他发现协议(例如FILE_PING,如果支持IP多播,TCPGOSSIP需要共享文件系统,var productListArr = new Array('Salad Server Set', 'Party Serviette Holder', 'Tea Set', 'Mixing Bowl Set', 'Knife Block Set', 'Coffee Capsule Holder', 'Plastic Sensor Soap Pump', 'Storage Bucket', 'Oven Glove', 'Apron', 'Biscuit Barrel', 'Chopping Board', 'Carioca Cups', 'Soup Bowls', 'Elevate Wood Turner', 'Pasta Machine', 'Teapot', 'Cake Pop Scoop', 'Cookbook Stand', 'Chocolate Station', 'Coffee Maker', 'Pepper Mill', 'Salt Mill', 'Glass Storage Jar', 'Measuring jug', 'Kitchen Scale', 'Tenderiser', 'Pizza Docker', 'Knife Sharpener', 'Steel Cork Opener', 'Steel Garlic Press', 'Steel Can Opener', 'Stainless Steel Crank Flour Sifter', 'Mineral Stone Mortar and Pestle', 'Citrus Cather', 'Cherry & Olive Pitter', 'Multi Grater-Detachable', 'Stainless Steel Colander', 'Steel Pizza Pan', 'Pop Container'); var priceListArr = new Array(18.70, 11.95, 39.95, 49.95, 99.95, 29.95, 79.95, 24.95, 9.95, 29.95, 39.95, 12.95, 54.95, 43.00, 19.95, 144.95, 29.95, 9.95, 29.95, 34.95, 29.00, 84.94, 84.95, 4.95, 19.95, 39.95, 34.95, 19.95, 79.95, 36.95, 34.95, 36.95, 33.95, 74.95, 19.95, 27.95, 26.95, 44.95, 12.95, 22.95); var orderedProductCodeArr = new Array (); // Not used var quantityArr = new Array (); // Not used var repeat = 4; // Repeat a set of columns, this many times, side by side var itemIndex = 0; document.write('<table width="80%" height="150px" cellpadding="2px" cellspacing="3px" border="1">'); // Set the table header. We will repeat this based on how many 'sets' of tables we show document.write('<tr>'); for (i = 0; i < repeat; i++) { document.write('<th>Code</th><th>Product</th><th>Price</th>'); } document.write('</tr>'); for (var j = 0; j < productListArr.length; j += repeat) { document.write('<tr>'); for (var m = 0; m < repeat; m++) { itemIndex = j + m; if ((productListArr[itemIndex] !== undefined) && (priceListArr[itemIndex] !== undefined)) { document.write('<td>' + itemIndex + '</td><td>' + productListArr[itemIndex] + '</td><td>' + priceListArr[itemIndex] + '</td>'); } } document.write('</tr>'); } document.write('</table>'); 需要外部查找服务等。有关详细信息,请查阅手册。

答案 2 :(得分:0)

我这周遇到了这个问题,正在调整防火墙以使 JGroups 工作几天,直到今天,我将 JGroup Stack 从“UDP”切换到“TCP”,突然所有问题都消失了。