Question

我试图向SGE提交工作。它过去一直以同样的方式为我工作。相反，所有工作都停留在qw状态。

＆＃34; qstat -g c＆＃34;输出：

> CLUSTER QUEUE   CQLOAD   USED  AVAIL  TOTAL
> all.q           0.38      0    160   1920   
> gpu6.q          -NA-      0      0      4    
> par6.q          0.38    750    135   1800      
> seq6.q          0.41    103    170    416   
> smp3.q          1.01      0      0     96

＆＃34; qstat命令＆＃34;输出看起来总是如此。

谷歌搜索只给了我一些根本访问权限的提示，这是我没有的。建议任何人？

感谢。

编辑：工作是通过＆＃34; qsub -q seq6.q scriptname＆＃34;提交的。或者smp3.q或par6.q。

＆＃34; qstat -j jobid＆＃34;在我看来，没有什么特别之处：

job_number:                 2821318
exec_file:                  job_scripts/2821318
submission_time:            Wed Mar  4 12:07:15 2015
owner:                      username
uid:                        31519
group:                      dch
gid:                        1150
sge_o_home:                 /home/hudson/pg/username
sge_o_log_name:             username
sge_o_path:                 /gpfs/hamilton6/apps/intel_comp_2014/composer_xe_2013_sp1.2.144/bin/intel64:/usr/local/bin:/bin:/usr/bin:/usr/lpp/mmfs/bin:/usr/local/Cluster-Apps/sge/6.1u6/bin/lx24-amd64:/panfs/panasas1.hpc.dur.ac.uk/apps/nag/fll6a21dpl/scripts
sge_o_shell:                /bin/tcsh
sge_o_workdir:              /panfs/panasas1.hpc.dur.ac.uk/username/path
sge_o_host:                 hamilton1
account:                    sge
mail_list:                  username@hamilton1
notify:                     FALSE
job_name:                   scriptname
jobshare:                   0
hard_queue_list:            seq6.q
env_list:                   
script_file:                scriptname
scheduling info:            (Collecting of scheduler job information is turned off)

Answer 1

我今天遇到了同样的问题。我们正在为客户运行Univa Grid Engine。我配置了一些复合体用于运行作业，这些作业在masterhost上请求大量内存（h_stack = 64M，memory_free = 4G，virtual_free = 4G）。此配置作业将挂起在等待队列中。这种配置在我们所有的执行主机上与3G相匹配多年。我将在下一天测试这个新配置（4G）。所有服务器都有足够的内存 INGO

SGE：乔布斯陷入qw状态

1 个答案: