I'm new to HTCondor. After I submitted a job via PuTTY, the job was put on hold, and the log file shows the following error:
007 (274036.000.000) 06/28 23:12:08 Shadow exception!
Error from slot1@hana-******: Failed to execute '/var/lib/condor/execute/dir_107122/condor_exec.exe': (errno=8: 'Exec format error')
0 - Run Bytes Sent By Job
16520 - Run Bytes Received By Job
...
012 (274036.000.000) 06/28 23:12:08 Job was held.
Error from slot1@hana-2.*******: Failed to execute '/var/lib/condor/execute/dir_107122/condor_exec.exe': (errno=8: 'Exec format error')
Code 6 Subcode 8
...
Here is my submit description file:
# defining the program
cmd = /***/my_prog.ipynb
# defining the parameters
#args = /***/model_1/data_1.csv
#args = /***/model_2/data_2.csv
#args = /***/model_3/data_3.csv
#args = /***/model_4/data_4.csv
# defining memory requirements
request_memory = 100 GB
# defining to use 1 CPU
request_cpus = 100
# defining the max. run time of the job
+REQUESTED_WALLTIME = 72
# defining the mail address for notification
notify_user = ***@**.com
notification = Always
should_transfer_files = if_needed
when_to_transfer_output = on_exit
output = $(Cluster)-$(Process)_job.out
error = $(Cluster)-$(Process)_job.error
log = $(Cluster)-$(Process)_job.log
queue 1
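In this case, my_prog.ipynb itself contains code that reads four local files as input, and it also uses "print" statements to write the results to a local file.
Given that, I'm confused about whether I still need to set args in the submit description file.
Please give me any hint; I've been stuck on this problem for 3 days...
Any suggestion would be greatly appreciated!
Thanks!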
Answer 0 (score: 0)
Check the execute permission on the executable file. I think the shadow log indicates that the executable bit is not set.
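One way to check this before submitting is sketched below. It is only an illustration: `describe_executable` is a hypothetical helper name, not part of HTCondor. It reports whether any execute bit is set and also looks at the first bytes of the file, since errno 8 (ENOEXEC) usually means the kernel did not recognize the file format, while a missing execute bit more commonly shows up as errno 13 (EACCES).

```python
import os
import stat


def describe_executable(path):
    """Return (has_exec_bit, header_kind) for the file at `path`.

    Hypothetical helper for illustration; not an HTCondor API.
    """
    mode = os.stat(path).st_mode
    # True if the execute bit is set for user, group, or others.
    has_exec_bit = bool(mode & (stat.S_IXUSR | stat.S_IXGRP | stat.S_IXOTH))

    # Read the first bytes to see what kind of file this is.
    with open(path, "rb") as fh:
        head = fh.read(4)

    if head.startswith(b"#!"):
        kind = "script with shebang"
    elif head == b"\x7fELF":
        kind = "ELF binary"
    else:
        kind = "unknown format"
    return has_exec_bit, kind
```

A file reported as "unknown format" (for example a notebook, which is JSON text) would probably still fail to start even after `chmod +x`.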
Answer 1 (score: 0)
In my case, I got this error because the job was a Python script that did not have a shebang as its first line.
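A minimal sketch of what such a script could look like is shown below. The names `count_lines` and `main` are hypothetical and stand in for the real program logic. The shebang assumes that `python3` is on the PATH of the execute node. Note also that a `.ipynb` file is a JSON notebook rather than a script, so it would probably need to be exported first (for example with `jupyter nbconvert --to script`) before a shebang can help.

```python
#!/usr/bin/env python3
# The line above must be the very first line of the file so the kernel
# knows which interpreter to start (assumes python3 is on PATH).
import sys


def count_lines(path):
    """Hypothetical stand-in for the real per-file processing."""
    with open(path) as fh:
        return sum(1 for _ in fh)


def main(paths):
    # Each path arrives as a command-line argument, e.g. from the
    # `args` line of the submit description file.
    for path in paths:
        print(f"{path}: {count_lines(path)} lines")
    return 0


if __name__ == "__main__":
    main(sys.argv[1:])
```

With this layout the input files are passed through `args` in the submit description file instead of being hard-coded, and anything printed goes to the file named by `output`.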