Windows 2016上的Mesos Slave。不与Master连接

时间:2017-12-08 19:52:41

标签: mesos

我目前的设置如下:

  1. Mesos Master - 10.20.200.300:14081 - RHEL 7
  2. Zookeeper - 10.20.200.300:14080 - RHEL 7
  3. Mesos代理 - 10.21.210.310:5051 - Windows 2016
  4. 主人起来&能够连接到zookeeper。但是,在启动代理时,即使代理连接到zookeeper,它也不会连接到主服务器。

    Master作为systemd进程启动,在/ etc / mesos-master -

    下使用以下参数
    hostname - mymaster.mesos.com    
    quorum - 1    
    work_dir - /var/lib/mesos   
    advertise_ip - 10.20.200.300
    advertise_port - 14081
    

    以下是来自master,slave和amp;的日志。动物园管理员。

    主日志(在10.20.200.300:14081上运行):

    E1208 12:22:21.269227  4302 process.cpp:2455] Failed to shutdown socket with fd 26, address 10.20.200.300:14081: Transport endpoint is not connected
    

    Zookeeper日志(在10.20.200.300:14080上运行):

    2017-12-08 12:22:21,185 [myid:] - INFO  [NIOServerCxn.Factory:0.0.0.0/0.0.0.0:14080:ZooKeeperServer@942] - Client attempting to establish new session at /10.21.210.310:63039     
    2017-12-08 12:22:21,196 [myid:] - INFO  [SyncThread:0:ZooKeeperServer@687] - Established session 0x160372c2b770010 with negotiated timeout 10000 for client /10.21.210.310:63039
    

    奴隶日志(在10.21.210.310:5051上运行):

    I1208 12:22:21.179652  4224 slave.cpp:1007] New master detected at master@10.20.200.300:14081    
    I1208 12:22:21.195278  4224 slave.cpp:1031] No credentials provided. Attempting to register without authentication     
    I1208 12:22:21.195278  4224 slave.cpp:1042] Detecting new master     
    I1208 12:22:21.210924  6156 slave.cpp:5135] Got exited event for master@10.20.200.300:14081     
    W1208 12:22:21.210924  6156 slave.cpp:5140] Master disconnected! Waiting for a new master to be elected     
    I1208 12:22:21.226510  2700 slave.cpp:5135] Got exited event for master@10.20.200.300:14081     
    W1208 12:22:21.226510  2700 slave.cpp:5140] Master disconnected! Waiting for a new master to be elected     
    

    有谁知道这些的原因? 我测试了slave之间的连接 - >主人和主人 - >奴隶&它很成功。

    Test-NetConnection -ComputerName 10.20.200.300 -Port 14081
    ComputerName     : 10.20.200.300     
    RemoteAddress    : 10.20.200.300     
    RemotePort       : 14081     
    InterfaceAlias   : Ethernet     
    SourceAddress    : 10.21.210.310     
    TcpTestSucceeded : True    
    
    [root@mesos-master]# telnet 10.21.210.310 5051
    Trying 10.21.210.310...
    Connected to 10.21.210.310.
    Escape character is '^]'. 
    

    我用以下参数启动了代理 -

    C:\Mesos\mesos\build\src>C:\Mesos\mesos\build\src\mesos-agent.exe \
             --master=zk://10.20.200.300:14080/mesos \
             --work_dir=C:\Mesos\Logs \
             --launcher_dir=C:\Mesos\mesos\build\src \
             --ip=10.21.210.310 \
             --advertise_ip=10.21.210.310 \
             --advertise_port=5051
    

    主/州日志 -

    {
        "version": "1.3.1",
        "git_sha": "1beaede8c13f0832d4921121da34f924deec8950",
        "git_tag": "1.3.1",
        "build_date": "2017-09-05 18:02:12",
        "build_time": 1504634532,
        "build_user": "centos",
        "start_time": 1513010072.51033,
        "elected_time": 1513010072.67995,
        "id": "90f5702f-f867-41ac-8087-5d20c87ea96f",
        "pid": "master@10.20.200.300:14081",
        "hostname": "MYhost.COM",
        "activated_slaves": 0,
        "deactivated_slaves": 0,
        "unreachable_slaves": 0,
        "leader": "master@10.20.200.300:14081",
        "leader_info": {
            "id": "90f5702f-f867-41ac-8087-5d20c87ea96f",
            "pid": "master@10.20.200.300:14081",
            "port": 14081,
            "hostname": "MYhost.COM"
        },
        "log_dir": "/var/log/mesos",
        "flags": {
            "advertise_ip": "10.20.200.300",
            "advertise_port": "14081",
            "agent_ping_timeout": "15secs",
            "agent_reregister_timeout": "10mins",
            "allocation_interval": "1secs",
            "allocator": "HierarchicalDRF",
            "authenticate_agents": "false",
            "authenticate_frameworks": "false",
            "authenticate_http_frameworks": "false",
            "authenticate_http_readonly": "false",
            "authenticate_http_readwrite": "false",
            "authenticators": "crammd5",
            "authorizers": "local",
            "framework_sorter": "drf",
            "help": "false",
            "hostname": "MYhost.COM",
            "hostname_lookup": "true",
            "http_authenticators": "basic",
            "initialize_driver_logging": "true",
            "log_auto_initialize": "true",
            "log_dir": "/var/log/mesos",
            "logbufsecs": "0",
            "logging_level": "INFO",
            "max_agent_ping_timeouts": "5",
            "max_completed_frameworks": "50",
            "max_completed_tasks_per_framework": "1000",
            "max_unreachable_tasks_per_framework": "1000",
            "port": "14081",
            "quiet": "false",
            "quorum": "1",
            "recovery_agent_removal_limit": "100%",
            "registry": "replicated_log",
            "registry_fetch_timeout": "1mins",
            "registry_gc_interval": "15mins",
            "registry_max_agent_age": "2weeks",
            "registry_max_agent_count": "102400",
            "registry_store_timeout": "20secs",
            "registry_strict": "false",
            "root_submissions": "true",
            "user_sorter": "drf",
            "version": "false",
            "webui_dir": "/usr/share/mesos/webui",
            "work_dir": "/var/lib/mesos",
            "zk": "zk://localhost:14080/mesos",
            "zk_session_timeout": "10secs"
        },
        "slaves": [],
        "recovered_slaves": [],
        "frameworks": [],
        "completed_frameworks": [],
        "orphan_tasks": [],
        "unregistered_frameworks": []
    }
    

    我们是否需要测试任何其他连接或此错误是出于其他原因?

1 个答案:

答案 0 :(得分:0)

我会尝试这个

  1. 在奴隶上设置主机名(您可以说主机名= 10.21.210.310)
  2. 检查Windows计算机上的防火墙。允许传入的连接到5051端口