StarCluster on EC2 stopped working

Time: 2013-09-06 18:16:56

Tags: r amazon-ec2 cluster-computing starcluster

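StarCluster on an EC2 Ubuntu instance has stopped working. I am not aware of any deliberate configuration change or anything else that could explain it, and I have not been able to find much help. If anyone has suggestions, could you let me know? Starting, terminating, and even listing clusters all fail. Here is the error I get when I try to list the clusters: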

ubuntu@ip-10-29-249-203:~$ starcluster listclusters
StarCluster - (http://star.mit.edu/cluster) (v. 0.94)
Software Tools for Academics and Researchers (STAR)
Please submit bug reports to starcluster@mit.edu

---------------------------------
revcls (security group: @sc-revcls)
---------------------------------
!!! ERROR - Unhandled exception occured
Traceback (most recent call last):
  File "/usr/local/lib/python2.7/dist-packages/StarCluster-0.94-py2.7.egg/starcluster/cli.py", line 274, in main
    sc.execute(args)
  File "/usr/local/lib/python2.7/dist-packages/StarCluster-0.94-py2.7.egg/starcluster/commands/listclusters.py", line 36, in execute
    show_ssh_status=self.opts.show_ssh_status)
  File "/usr/local/lib/python2.7/dist-packages/StarCluster-0.94-py2.7.egg/starcluster/cluster.py", line 290, in list_clusters
    nodes = cl.nodes
  File "/usr/local/lib/python2.7/dist-packages/StarCluster-0.94-py2.7.egg/starcluster/cluster.py", line 720, in nodes
    if n.is_master():
  File "/usr/local/lib/python2.7/dist-packages/StarCluster-0.94-py2.7.egg/starcluster/node.py", line 883, in is_master
    return self.alias == "master"
  File "/usr/local/lib/python2.7/dist-packages/StarCluster-0.94-py2.7.egg/starcluster/node.py", line 139, in alias
    aliases = aliasestxt.splitlines()[2:]
AttributeError: 'NoneType' object has no attribute 'splitlines'

!!! ERROR - Oops! Looks like you've found a bug in StarCluster
!!! ERROR - Crash report written to: /home/ubuntu/.starcluster/logs/crash-report-1183.txt
!!! ERROR - Please remove any sensitive data from the crash report
!!! ERROR - and submit it to starcluster@mit.edu

1 answer:

Answer 0 (score: 0)

Which version of StarCluster are you using? Is it the latest release or the latest development version?

If I look up the error in the latest development version, it points to this line:

https://github.com/jtriley/StarCluster/blob/5442e91e159acca48b645f5ddf629d65c59cd7e0/starcluster/node.py#L139

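My guess is that there is a problem with your cluster's alias tags. In any case, it is very strange that even terminating the cluster and starting a new one does not fix the problem.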
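
To make the guess concrete: the last frame of the traceback evaluates `aliasestxt.splitlines()[2:]`, which raises exactly this `AttributeError` when `aliasestxt` is `None`, that is, when no alias text was found for a node. The following is only a minimal sketch of that failure mode. `parse_aliases` and `parse_aliases_safe` are hypothetical names made up for illustration, not StarCluster functions, and the guard is an assumption about how one might tolerate missing alias data, not the project's actual fix.

```python
def parse_aliases(aliasestxt):
    """Same expression as in the traceback: drop the first two lines."""
    return aliasestxt.splitlines()[2:]


def parse_aliases_safe(aliasestxt):
    """Hypothetical guard: treat missing alias text as 'no aliases'."""
    if aliasestxt is None:
        return []
    return aliasestxt.splitlines()[2:]


# Reproduce the error from the crash report with a missing value.
try:
    parse_aliases(None)
except AttributeError as exc:
    error_message = str(exc)
    print(error_message)

# With the guard, a node without alias text yields an empty list instead.
print(parse_aliases_safe(None))
```

If this is what is happening, the interesting question is why the alias data is missing for one of the nodes in the `@sc-revcls` security group, not the `splitlines` call itself.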

If you find out more, can reproduce the problem, or need more help, you can also use the StarCluster mailing list: http://mailman.mit.edu/mailman/listinfo/starcluster