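I have installed DataStax Community Cassandra across two data centers (3 nodes each). I believe the initial Cassandra configuration is correct, but when I look at OpsCenter (monitoring an existing cluster, set up through the configuration file), I get all performance metrics for HETZNER1 and none for OVH1.
I have checked the datastax-agent logs, but nothing in them looks like an error to me.
Here are a few screenshots of OpsCenter v5:
As you can see, it shows only 3 of the 6 nodes (the storage capacity should be much larger than 1 TB). However, all agents are connected.
HETZNER1 nodes, with data :)
OVH1 nodes, with no data :(
The datastax-agent log (startup.log) on OVH1 NODE1 shows: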
INFO [main] 2014-07-25 10:36:09,475 Loading conf files: /var/lib/datastax-agent/conf/address.yaml
INFO [main] 2014-07-25 10:36:09,512 Java vendor/version: Java HotSpot(TM) 64-Bit Server VM/1.7.0_60
INFO [main] 2014-07-25 10:36:09,512 DataStax Agent version: 5.0.0
INFO [main] 2014-07-25 10:36:09,552 Default config values: {:rollups300_ttl 2419200, :settings_cf "settings", :restore_req_update_period 60, :my_channel_prefix "/agent", :poll_period 60, :kerberos_hostname nil, :storage_dc nil, :thrift_conn_timeout 10000, :thrift_max_frame_size 15728640, :rollups60_ttl 604800, :stomp_port 61620, :shorttime_interval 10, :longtime_interval 300, :private-conf-props ["initial_token" "listen_address" "broadcast_address" "rpc_address"], :thrift_port 9160, :async_retry_timeout 5, :agent-conf-group "global-cluster-agent-group", :jmx_host "127.0.0.1", :ec2_metadata_api_host "OMITTED", :metrics_enabled 1, :async_queue_size 5000, :disk_usage_update_period 60, :autodiscovery_interval 120, :rollups7200_ttl 31536000, :autodiscovery_enabled true, :thrift_ssl_truststore nil, :rollup_snapshot_period 300, :is_package true, :monitor_command "/usr/share/datastax-agent/bin/datastax_agent_monitor", :thrift_socket_timeout 5000, :cassandra_log_location "/var/log/cassandra/system.log", :config_md5 nil, :jmx_port 7199, :jmx_metrics_threadpool_size 4, :use_ssl 0, :rollups86400_ttl -1, :nodedetails_threadpool_size 3, :api_port 61621, :kerberos_service nil, :kerberos_client_principal nil, :jmx_thread_pool_size 5, :production 1, :runs_sudo 1, :stomp_interface "OMITTED", :storage_keyspace "OpsCenter", :rollup_snapshot_threshold 300, :thrift_ssl_truststore_type "JKS", :realtime_interval 5}
INFO [main] 2014-07-25 10:36:09,553 Waiting for the config from OpsCenter
INFO [main] 2014-07-25 10:36:09,554 Attempting to determine Cassandra's broadcast address through JMX
INFO [main] 2014-07-25 10:36:09,554 Starting Stomp
INFO [Initialization] 2014-07-25 10:36:09,556 New JMX connection (127.0.0.1:7199)
INFO [main] 2014-07-25 10:36:09,557 SSL communication is disabled
INFO [main] 2014-07-25 10:36:09,557 Creating stomp connection to OMITTED:61620
INFO [StompConnection receiver] 2014-07-25 10:36:09,562 Reconnecting in 0s.
INFO [StompConnection receiver] 2014-07-25 10:36:09,566 Connected to OMITTED:61620
INFO [main] 2014-07-25 10:36:09,672 Starting Jetty server: {:port 61621, :host nil, :ssl? false, :join? false}
INFO [StompConnection receiver] 2014-07-25 10:36:09,687 Got new config from OpsCenter: {:kerberos_use_keytab true, :rollups300_ttl 2419200, :kerberos_use_ticket_cache true, :rollups60_ttl 604800, :thrift_port 9160, :ec2_metadata_api_host "OMITTED", :metrics_enabled 1, :rollups7200_ttl 31536000, :thrift_ssl_truststore nil, :metrics_ignored_column_families "", :cassandra_log_location "/var/log/cassandra/system.log", :thrift_rpc_interface "OMITTED", :config_md5 "abfe7ce1d2750e030dada2ffb4551777", :jmx_port 7199, :provisioning 0, :use_ssl 0, :kerberos_debug false, :rollups86400_ttl -1, :api_port "61621", :storage_keyspace "OpsCenter", :kerberos_renew_tgt true, :metrics_ignored_solr_cores "", :thrift_ssl_truststore_type "JKS", :metrics_ignored_keyspaces "system, system_traces, system_auth, dse_auth, OpsCenter", :rollup_subscriptions [], :cassandra_install_location ""}
INFO [StompConnection receiver] 2014-07-25 10:36:09,688 New JMX connection (127.0.0.1:7199)
INFO [Jetty] 2014-07-25 10:36:09,698 Jetty server started
agent.log:
df: `/var/named/chroot/etc/named.rfc1912.zones': Permission denied
df: `/var/named/chroot/etc/rndc.key': Permission denied
df: `/var/named/chroot/usr/lib64/bind': Permission denied
df: `/var/named/chroot/etc/named.iscdlv.key': Permission denied
df: `/var/named/chroot/etc/named.root.key': Permission denied
Filesystem Type 1G-blocks Used Available Use% Mounted on
rootfs rootfs 884 2 838 1% /
/dev/root ext3 884 2 838 1% /
devtmpfs devtmpfs 63 1 63 1% /dev
tmpfs tmpfs 63 0 63 0% /dev/shm
I don't think permission on the chroot partitions is really necessary...?
installer.log:
2014-07-24 19:57:38 +0200
2014-07-24 19:57:38 +0200 Installed:
2014-07-24 19:57:38 +0200 datastax-agent.noarch 0:5.0.0-1
2014-07-24 19:57:38 +0200
2014-07-24 19:57:38 +0200 Complete!
2014-07-24 19:57:38 +0200 Installing certificates from opscenterd...
2014-07-24 19:57:38 +0200 cp: cannot stat `ssl/agentKeyStore': No such file or directory
2014-07-24 19:57:38 +0200 Setting up agent node state...
2014-07-24 19:57:38 +0200 Starting new agent...
2014-07-24 19:57:38 +0200 Starting DataStax Agent datastax-agent
2014-07-24 19:57:38 +0200 Starting datastax-agent [ OK ]
2014-07-24 19:57:38 +0200 log4j:WARN No appenders could be found for logger (org.eclipse.jetty.util.log).
2014-07-24 19:57:38 +0200 log4j:WARN Please initialize the log4j system properly.
2014-07-24 19:57:38 +0200 log4j:WARN See http://logging.apache.org/log4j/1.2/faq.html#noconfig for more info.
2014-07-24 19:57:42 +0200 Agent installation complete.
I really can't figure this out. Can anyone help?
Thanks
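Answer 0 (score: 4)
OK, so I found the culprit: datastax-agent was running as a non-sudo user. Because of this the df command was failing, and no data was being sent to OpsCenter.
Changing the user in /etc/init.d/datastax-agent to a privileged user solved the problem!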
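A minimal sketch for confirming this diagnosis on a node before editing the init script. The helper names count_df_errors and agent_users are hypothetical, and the log file path is passed as an argument because the thread does not say where agent.log lives. The first function counts the "df: ... Permission denied" lines seen in agent.log above; the second lists which user owns any running process whose command line mentions datastax-agent.

```shell
#!/bin/sh
# Hypothetical helpers, not part of the DataStax agent or OpsCenter.

# Print the number of "df: ... Permission denied" lines in the given log file.
count_df_errors() {
    grep -c '^df: .*Permission denied' "$1"
}

# Print the owner(s) of running processes whose command line mentions
# datastax-agent (assumes the agent shows up under that name in ps).
agent_users() {
    ps -A -o user= -o args= | awk '/datastax-agent/ && !/awk/ {print $1}' | sort -u
}
```

For example, `count_df_errors /path/to/agent.log` followed by `agent_users` would show whether df is still hitting permission errors and which account the agent runs under. A non-zero count together with an unprivileged user is consistent with the cause described above.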
Answer 1 (score: 0)
I solved it by fixing stomp_interface on each node:
---------- configuration pointing to OpsCenter ------
stomp_interface: OPSCENTER-INTERNAL-IP
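A rough sketch for checking what each node currently has configured, assuming the setting lives in the address.yaml file that the startup log above reports loading (/var/lib/datastax-agent/conf/address.yaml). The helper name get_stomp_interface is hypothetical.

```shell
#!/bin/sh
# Hypothetical helper: print the stomp_interface value from an agent
# address.yaml file (expects a plain "stomp_interface: VALUE" line).
get_stomp_interface() {
    sed -n 's/^stomp_interface:[[:space:]]*//p' "$1"
}
```

Running `get_stomp_interface /var/lib/datastax-agent/conf/address.yaml` on every node and comparing the output with the OpsCenter machine's internal IP should reveal any node pointing at the wrong address. The agent would presumably need a restart after the file is changed.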