我使用的是HDP 2.4.2,之前我安装过zeppelin服务器。它工作正常但今天当我重新启动集群(重新启动AWS节点)时,Ambari显示Zeppelin服务器未运行且无法启动服务器,并出现以下错误:
Traceback (most recent call last):
File "/var/lib/ambari-agent/cache/stacks/HDP/2.4/services/ZEPPELIN/package/scripts/master.py", line 235, in <module>
Master().execute()
File "/usr/lib/python2.6/site-packages/resource_management/libraries/script/script.py", line 219, in execute
method(env)
File "/var/lib/ambari-agent/cache/stacks/HDP/2.4/services/ZEPPELIN/package/scripts/master.py", line 169, in start
+ params.zeppelin_log_file, user=params.zeppelin_user)
File "/usr/lib/python2.6/site-packages/resource_management/core/base.py", line 154, in __init__
self.env.run()
File "/usr/lib/python2.6/site-packages/resource_management/core/environment.py", line 158, in run
self.run_action(resource, action)
File "/usr/lib/python2.6/site-packages/resource_management/core/environment.py", line 121, in run_action
provider_action()
File "/usr/lib/python2.6/site-packages/resource_management/core/providers/system.py", line 238, in action_run
tries=self.resource.tries, try_sleep=self.resource.try_sleep)
File "/usr/lib/python2.6/site-packages/resource_management/core/shell.py", line 70, in inner
result = function(command, **kwargs)
File "/usr/lib/python2.6/site-packages/resource_management/core/shell.py", line 92, in checked_call
tries=tries, try_sleep=try_sleep)
File "/usr/lib/python2.6/site-packages/resource_management/core/shell.py", line 140, in _call_wrapper
result = _call(command, **kwargs_copy)
File "/usr/lib/python2.6/site-packages/resource_management/core/shell.py", line 291, in _call
raise Fail(err_msg)
resource_management.core.exceptions.Fail: Execution of '/usr/hdp/current/zeppelin-server/lib/bin/zeppelin-daemon.sh start >> /var/log/zeppelin/zeppelin-setup.log' returned 1. /usr/hdp/current/zeppelin-server/lib/bin/zeppelin-daemon.sh: line 187: /var/run/zeppelin-notebook/zeppelin-zeppelin-ip-10-0-0-11.eu-west-1.compute.internal.pid: Permission denied
cat: /var/run/zeppelin-notebook/zeppelin-zeppelin-ip-10-0-0-11.eu-west-1.compute.internal.pid: No such file or directory
在zeppelin日志中:
ERROR [2016-06-06 03:20:36,714]({main} VFSNotebookRepo.java [list]:140) - 无法读取备注文件:/// usr / hdp / current / zeppelin-server / lib / notebook / screenshots java.io.IOException:file:///usr/hdp/current/zeppelin-server/lib/notebook/screenshots/note.json not found
ERROR [2016-06-06 03:34:12,795]({main} Notebook.java [loadNoteFromRepo]:330) - 无法加载2BHU1G67J java.io.IOException:file:/// usr / hdp / current / zeppelin-server / lib / notebook / 2BHU1G67J不是目录
但由于某种原因,zeppelin端口正在侦听,尽管存在这些错误,但zeppelin服务器运行正常并执行所有查询。请告知如何纠正Ambari中的问题并在没有错误的情况下启动服务。
答案 0 :(得分:1)
问题在于zeppelin服务的PID文件。它由错误的用户拥有或具有错误的权限。手动停止zeppelin服务,然后删除位于/var/run/zeppelin-notebook/zeppelin-zeppelin-ip-10-0-0-11.eu-west-1.compute.internal.pid
的pid文件。同时仔细检查/var/run/zeppelin-notebook
文件夹上的所有者/权限。然后,您应该能够在Ambari UI中重新启动该服务。