climateprediction.net home page
Task 11101567

Task 11101567

Name hadsm3dhet2_jvml_006603983_0
Workunit 6807356
Created 15 Mar 2010, 12:12:29 UTC
Sent 3 Jun 2010, 10:36:51 UTC
Report deadline 16 May 2011, 15:56:51 UTC
Received 11 Jun 2010, 12:36:06 UTC
Server state Over
Outcome Computation error
Client state Compute error
Exit status 22 (0x00000016) Unknown error code
Computer ID 1072839
Run time 7 days 21 hours 55 min 7 sec
CPU time 7 days 4 hours 33 min 4 sec
Validate state Invalid
Credit 5,259.90
Device peak FLOPS 2.46 GFLOPS
Application version UK Met Office HadSM3 Slab Model v6.07
windows_intelx86
Stderr
<core_client_version>6.10.18</core_client_version>
<![CDATA[
<message>
The device does not recognize the command. (0x16) - exit code 22 (0x16)
</message>
<stderr_txt>
No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
No heartbeat from core client for 30 sec - exiting
No heartbeat from core client for 30 sec - exiting
No heartbeat from core client for 30 sec - exiting
No heartbeat from core client for 30 sec - exiting
No heartbeat from core client for 30 sec - exiting
No heartbeat from core client for 30 sec - exiting
No heartbeat from core client for 30 sec - exiting
No heartbeat from core client for 30 sec - exiting
No heartbeat from core client for 30 sec - exiting
No heartbeat from core client for 30 sec - exiting
No heartbeat from core client for 30 sec - exiting
No heartbeat from core client for 30 sec - exiting
CPDN process is not running, exiting, bRetVal = 1, checkPID=7388, selfPID=7388, iMonCtr=1
MainError:	10:40:34 PM	No files match the supplied pattern.
MainError:	10:40:34 PM	No files match the supplied pattern.
MainError:	10:40:34 PM	No files match the supplied pattern.
MainError:	10:40:34 PM	No files match the supplied pattern.
MainError:	10:40:34 PM	No files match the supplied pattern.
MainError:	10:40:34 PM	No files match the supplied pattern.
MainError:	10:40:34 PM	No files match the supplied pattern.
MainError:	10:40:34 PM	No files match the supplied pattern.
MainError:	10:40:34 PM	No files match the supplied pattern.
MainError:	10:40:34 PM	No files match the supplied pattern.
MainError:	10:40:34 PM	No files match the supplied pattern.
MainError:	10:40:34 PM	No files match the supplied pattern.
MainError:	10:40:34 PM	No files match the supplied pattern.
MainError:	10:40:34 PM	No files match the supplied pattern.
MainError:	10:40:34 PM	No files match the supplied pattern.
MainError:	10:40:34 PM	No files match the supplied pattern.
MainError:	10:40:34 PM	No files match the supplied pattern.
MainError:	10:40:34 PM	No files match the supplied pattern.
MainError:	10:40:34 PM	No files match the supplied pattern.
MainError:	10:40:34 PM	No files match the supplied pattern.
MainError:	10:40:34 PM	No files match the supplied pattern.
MainError:	10:40:34 PM	No files match the supplied pattern.
MainError:	10:40:34 PM	No files match the supplied pattern.
MainError:	10:40:34 PM	No files match the supplied pattern.
MainError:	10:40:34 PM	No files match the supplied pattern.
MainError:	10:40:34 PM	No files match the supplied pattern.
MainError:	10:40:34 PM	No files match the supplied pattern.
MainError:	10:40:34 PM	No files match the supplied pattern.
MainError:	10:40:34 PM	No files match the supplied pattern.
MainError:	10:40:34 PM	No files match the supplied pattern.
MainError:	10:40:34 PM	No files match the supplied pattern.
MainError:	10:40:34 PM	No files match the supplied pattern.
MainError:	10:40:34 PM	No files match the supplied pattern.
MainError:	10:40:34 PM	No files match the supplied pattern.
MainError:	10:40:34 PM	No files match the supplied pattern.
MainError:	10:40:34 PM	No files match the supplied pattern.
MainError:	10:40:34 PM	No files match the supplied pattern.
MainError:	10:40:34 PM	No files match the supplied pattern.
MainError:	10:40:34 PM	No files match the supplied pattern.
MainError:	10:40:34 PM	No files match the supplied pattern.
MainError:	10:40:34 PM	No files match the supplied pattern.
MainError:	10:40:34 PM	No files match the supplied pattern.
MainError:	10:40:34 PM	No files match the supplied pattern.
MainError:	10:40:34 PM	No files match the supplied pattern.
MainError:	10:40:34 PM	No files match the supplied pattern.
MainError:	10:40:34 PM	No files match the supplied pattern.
MainError:	10:40:34 PM	No files match the supplied pattern.
MainError:	10:40:34 PM	No files match the supplied pattern.
MainError:	10:40:34 PM	No files match the supplied pattern.
MainError:	10:40:34 PM	No files match the supplied pattern.
MainError:	10:40:34 PM	No files match the supplied pattern.
MainError:	10:40:34 PM	No files match the supplied pattern.
MainError:	10:40:34 PM	No files match the supplied pattern.
MainError:	10:40:34 PM	No files match the supplied pattern.
MainError:	10:40:34 PM	No files match the supplied pattern.
MainError:	10:40:34 PM	No files match the supplied pattern.
MainError:	11:03:15 AM	No files match the supplied pattern.
MainError:	11:03:15 AM	No files match the supplied pattern.
MainError:	11:03:15 AM	No files match the supplied pattern.
MainError:	11:03:15 AM	No files match the supplied pattern.
MainError:	11:03:15 AM	No files match the supplied pattern.
MainError:	11:03:15 AM	No files match the supplied pattern.
MainError:	11:03:15 AM	No files match the supplied pattern.
MainError:	11:03:15 AM	No files match the supplied pattern.
MainError:	11:03:15 AM	No files match the supplied pattern.
MainError:	11:03:15 AM	No files match the supplied pattern.
MainError:	11:03:15 AM	No files match the supplied pattern.
MainError:	11:03:15 AM	No files match the supplied pattern.
MainError:	11:03:15 AM	No files match the supplied pattern.
MainError:	11:03:15 AM	No files match the supplied pattern.
MainError:	11:03:15 AM	No files match the supplied pattern.
MainError:	11:03:15 AM	No files match the supplied pattern.
MainError:	11:03:15 AM	No files match the supplied pattern.
MainError:	11:03:15 AM	No files match the supplied pattern.
MainError:	11:03:15 AM	No files match the supplied pattern.
MainError:	11:03:15 AM	No files match the supplied pattern.
MainError:	11:03:15 AM	No files match the supplied pattern.
MainError:	11:03:15 AM	No files match the supplied pattern.
MainError:	11:03:15 AM	No files match the supplied pattern.
MainError:	11:03:15 AM	No files match the supplied pattern.
MainError:	11:03:15 AM	No files match the supplied pattern.
MainError:	11:03:15 AM	No files match the supplied pattern.
MainError:	11:03:15 AM	No files match the supplied pattern.
MainError:	11:03:15 AM	No files match the supplied pattern.
MainError:	11:03:15 AM	No files match the supplied pattern.
MainError:	11:03:15 AM	No files match the supplied pattern.
MainError:	11:03:15 AM	No files match the supplied pattern.
MainError:	11:03:15 AM	No files match the supplied pattern.
MainError:	11:03:15 AM	No files match the supplied pattern.
MainError:	11:03:15 AM	No files match the supplied pattern.
MainError:	11:03:15 AM	No files match the supplied pattern.
MainError:	11:03:15 AM	No files match the supplied pattern.
MainError:	11:03:15 AM	No files match the supplied pattern.
MainError:	11:03:15 AM	No files match the supplied pattern.
MainError:	11:03:15 AM	No files match the supplied pattern.
MainError:	11:03:15 AM	No files match the supplied pattern.
MainError:	11:03:15 AM	No files match the supplied pattern.
MainError:	11:03:15 AM	No files match the supplied pattern.
MainError:	11:03:15 AM	No files match the supplied pattern.
MainError:	11:03:15 AM	No files match the supplied pattern.
MainError:	11:03:15 AM	No files match the supplied pattern.
MainError:	11:03:15 AM	No files match the supplied pattern.
MainError:	11:03:15 AM	No files match the supplied pattern.
MainError:	11:03:15 AM	No files match the supplied pattern.
MainError:	11:03:15 AM	No files match the supplied pattern.
MainError:	11:03:15 AM	No files match the supplied pattern.
MainError:	11:03:15 AM	No files match the supplied pattern.
MainError:	11:03:15 AM	No files match the supplied pattern.
MainError:	11:03:15 AM	No files match the supplied pattern.
MainError:	11:03:15 AM	No files match the supplied pattern.
MainError:	11:03:15 AM	No files match the supplied pattern.
MainError:	11:03:15 AM	No files match the supplied pattern.
CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=11464, iMonCtr=1
Model crash detected, will try to restart...
CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=11464, iMonCtr=1
Model crash detected, will try to restart...
CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=11464, iMonCtr=1
Model crash detected, will try to restart...
CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=11464, iMonCtr=1
Model crash detected, will try to restart...
CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=11464, iMonCtr=1
Model crash detected, will try to restart...
CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=11464, iMonCtr=1
Model crash detected, will try to restart...
Sorry, too many model crashes! :-(
CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4560, iMonCtr=1
Model crash detected, will try to restart...
CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4560, iMonCtr=1
Model crash detected, will try to restart...
CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4560, iMonCtr=1
Model crash detected, will try to restart...
CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4560, iMonCtr=1
Model crash detected, will try to restart...
CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4560, iMonCtr=1
Model crash detected, will try to restart...
CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4560, iMonCtr=1
Model crash detected, will try to restart...
Sorry, too many model crashes! :-(
cpdnmonitor: cannot open input file C:\ProgramData\BOINC/projects/climateprediction.net/hadsm3dhet2_jvml_006603983/dataout/restart.day
CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=2928, iMonCtr=1
Model crash detected, will try to restart...
cpdnmonitor: cannot open input file C:\ProgramData\BOINC/projects/climateprediction.net/hadsm3dhet2_jvml_006603983/dataout/restart.day
CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=2928, iMonCtr=1
Model crash detected, will try to restart...
cpdnmonitor: cannot open input file C:\ProgramData\BOINC/projects/climateprediction.net/hadsm3dhet2_jvml_006603983/dataout/restart.day
CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=2928, iMonCtr=1
Model crash detected, will try to restart...
cpdnmonitor: cannot open input file C:\ProgramData\BOINC/projects/climateprediction.net/hadsm3dhet2_jvml_006603983/dataout/restart.day
CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=2928, iMonCtr=1
Model crash detected, will try to restart...
cpdnmonitor: cannot open input file C:\ProgramData\BOINC/projects/climateprediction.net/hadsm3dhet2_jvml_006603983/dataout/restart.day
CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=2928, iMonCtr=1
Model crash detected, will try to restart...
cpdnmonitor: cannot open input file C:\ProgramData\BOINC/projects/climateprediction.net/hadsm3dhet2_jvml_006603983/dataout/restart.day
CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=2928, iMonCtr=1
Model crash detected, will try to restart...
Sorry, too many model crashes! :-(
called boinc_finish

</stderr_txt>
]]>
Latest Trickles Received
Time Sent (UTC) Host ID Result ID Result Name Timestep CPU Time (sec) Average (sec/TS)
11 Jun 2010 05:55:20 1072839 11101567 hadsm3dhet2_jvml_006603983_0 54,010 610,905 1.0671
11 Jun 2010 01:51:04 1072839 11101567 hadsm3dhet2_jvml_006603983_0 43,208 599,370 1.0671
10 Jun 2010 22:08:40 1072839 11101567 hadsm3dhet2_jvml_006603983_0 32,406 587,880 1.0671
10 Jun 2010 18:30:05 1072839 11101567 hadsm3dhet2_jvml_006603983_0 21,604 576,541 1.0675
10 Jun 2010 14:53:56 1072839 11101567 hadsm3dhet2_jvml_006603983_0 10,802 564,784 1.0670
10 Jun 2010 11:02:56 1072839 11101567 hadsm3dhet2_jvml_006603983_0 259,248 552,310 1.0652
10 Jun 2010 07:09:41 1072839 11101567 hadsm3dhet2_jvml_006603983_0 248,446 539,952 1.0635
10 Jun 2010 03:03:57 1072839 11101567 hadsm3dhet2_jvml_006603983_0 237,644 527,205 1.0610
09 Jun 2010 23:31:00 1072839 11101567 hadsm3dhet2_jvml_006603983_0 226,842 515,201 1.0599
09 Jun 2010 19:33:29 1072839 11101567 hadsm3dhet2_jvml_006603983_0 216,040 503,656 1.0597
09 Jun 2010 17:38:17 1072839 11101567 hadsm3dhet2_jvml_006603983_0 205,238 492,330 1.0599
09 Jun 2010 11:08:05 1072839 11101567 hadsm3dhet2_jvml_006603983_0 194,436 480,778 1.0597
09 Jun 2010 07:36:34 1072839 11101567 hadsm3dhet2_jvml_006603983_0 183,634 469,372 1.0598
09 Jun 2010 04:52:24 1072839 11101567 hadsm3dhet2_jvml_006603983_0 172,832 457,831 1.0596
09 Jun 2010 00:45:30 1072839 11101567 hadsm3dhet2_jvml_006603983_0 162,030 446,273 1.0593
08 Jun 2010 21:03:20 1072839 11101567 hadsm3dhet2_jvml_006603983_0 151,228 435,231 1.0603
08 Jun 2010 17:37:05 1072839 11101567 hadsm3dhet2_jvml_006603983_0 140,426 423,922 1.0607
08 Jun 2010 14:20:46 1072839 11101567 hadsm3dhet2_jvml_006603983_0 129,624 412,927 1.0619
08 Jun 2010 11:06:34 1072839 11101567 hadsm3dhet2_jvml_006603983_0 118,822 401,776 1.0627
08 Jun 2010 07:49:45 1072839 11101567 hadsm3dhet2_jvml_006603983_0 108,020 390,770 1.0640


©2024 cpdn.org