I am processing a large file, and I use ssh to log in to each node to do the work. I log in to one of the nodes and kick off the jobs from there:
nohup ssh root@nodes717 "cat /nsf/share/input_file_aa | python /nsf/share/process.py" > output_aa &
nohup ssh root@nodes716 "cat /nsf/share/input_file_ab | python /nsf/share/process.py" > output_ab &
nohup ssh root@nodes715 "cat /nsf/share/input_file_ac | python /nsf/share/process.py" > output_ac &
nohup ssh root@nodes714 "cat /nsf/share/input_file_ad | python /nsf/share/process.py" > output_ad &
nohup ssh root@nodes713 "cat /nsf/share/input_file_ae | python /nsf/share/process.py" > output_ae &
nohup ssh root@nodes712 "cat /nsf/share/input_file_af | python /nsf/share/process.py" > output_af &
nohup ssh root@nodes711 "cat /nsf/share/input_file_ag | python /nsf/share/process.py" > output_ag &
nohup ssh root@nodes710 "cat /nsf/share/input_file_ah | python /nsf/share/process.py" > output_ah &
nohup ssh root@nodes709 "cat /nsf/share/input_file_ai | python /nsf/share/process.py" > output_ai &
nohup ssh root@nodes708 "cat /nsf/share/input_file_aj | python /nsf/share/process.py" > output_aj &
nohup ssh root@nodes707 "cat /nsf/share/input_file_ak | python /nsf/share/process.py" > output_ak &
nohup ssh root@nodes706 "cat /nsf/share/input_file_al | python /nsf/share/process.py" > output_al &
nohup ssh root@nodes705 "cat /nsf/share/input_file_am | python /nsf/share/process.py" > output_am &
nohup ssh root@nodes704 "cat /nsf/share/input_file_an | python /nsf/share/process.py" > output_an &
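Most of the jobs finish, but every time, after a few minutes, two or three of the nohup jobs fail. What could be causing this? Is it too many ssh channels, or is it simply a weakness of this "ssh-map-reduce" approach?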
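One way to narrow this down might be to keep each job's stderr and exit status, so a failed job leaves a trace of why it stopped. The following is only a sketch under assumptions: the `stderr_*.log` and `status_*.txt` file names, the `RUN` switch, and the helper names `plan`, `run_job`, and `launch_all` are made up for illustration; the hosts, paths, and suffixes are copied from the commands above. It uses `ssh -n` so that ssh does not read from standard input.

```shell
#!/bin/sh
# Sketch: one ssh job per node, recording stderr and exit status per job.
# File names for logs and status are assumptions, not part of the original setup.

suffixes="aa ab ac ad ae af ag ah ai aj ak al am an"
first_node=717   # nodes717 gets input_file_aa, nodes716 gets input_file_ab, ...

# Print one "host suffix" pair per line, counting the node number down.
plan() {
    i=0
    for s in $suffixes; do
        echo "nodes$(( first_node - i )) $s"
        i=$(( i + 1 ))
    done
}

# Run one job: $1 is the host, $2 is the input suffix.
# stdout goes to output_<suffix>, stderr and the exit status to separate files.
run_job() {
    ssh -n "root@$1" "cat /nsf/share/input_file_$2 | python /nsf/share/process.py" \
        > "output_$2" 2> "stderr_$2.log"
    echo "$?" > "status_$2.txt"
}

# Start every job in the background, then wait for all of them.
launch_all() {
    plan | {
        while read -r host suffix; do
            run_job "$host" "$suffix" &
        done
        wait
    }
}

# With RUN=1 the jobs are started; otherwise only the plan is printed.
if [ "${RUN:-0}" = "1" ]; then
    launch_all
else
    plan
fi
```

If this were saved as, say, `launch.sh` (a hypothetical name), it could be started with `RUN=1 nohup sh launch.sh &` to survive logout. Afterwards, a status of 255 in a `status_*.txt` file would indicate that ssh itself reported an error, while any other non-zero value would be the exit status of the remote command, and the matching `stderr_*.log` should say more.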