climateprediction.net home page
Task 14926497

Task 14926497

Name hadcm3n_zlg2_1880_40_008026540_3
Workunit 8181654
Created 17 Jul 2012, 6:33:48 UTC
Sent 17 Jul 2012, 6:34:08 UTC
Report deadline 16 Oct 2012, 14:01:19 UTC
Received 7 Aug 2012, 14:15:18 UTC
Server state Over
Outcome Computation error
Client state Compute error
Exit status 22 (0x00000016) Unknown error code
Computer ID 1211524
Run time 5 days 16 hours 15 min 38 sec
CPU time 4 days 3 hours 51 min 52 sec
Validate state Invalid
Credit 3,421.44
Device peak FLOPS 1.73 GFLOPS
Application version UK Met Office Coupled Model Full Resolution Ocean v6.07
windows_intelx86
Stderr
<core_client_version>7.0.28</core_client_version>
<![CDATA[
<message>
The device does not recognize the command. (0x16) - exit code 22 (0x16)
</message>
<stderr_txt>
12:45:26 (9264): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
CPDN Monitor - Quit request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
04:26:12 (6420): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
04:26:13 (6420): No heartbeat from core client for 30 sec - exiting
04:26:14 (6420): No heartbeat from core client for 30 sec - exiting
04:26:15 (6420): No heartbeat from core client for 30 sec - exiting
04:26:17 (6420): No heartbeat from core client for 30 sec - exiting
04:26:18 (6420): No heartbeat from core client for 30 sec - exiting
10:06:45 (17688): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=700, iMonCtr=1
Model crash detected, will try to restart...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=700, iMonCtr=1
Model crash detected, will try to restart...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=700, iMonCtr=1
Model crash detected, will try to restart...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=700, iMonCtr=1
Model crash detected, will try to restart...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=700, iMonCtr=1
Model crash detected, will try to restart...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=6068, iMonCtr=1
Model crash detected, will try to restart...
Sorry, too many model crashes! :-(
Called boinc_finish

</stderr_txt>
]]>
Latest Trickles Received
Time Sent (UTC) Host ID Result ID Result Name Timestep CPU Time (sec) Average (sec/TS)
07 Aug 2012 14:20:07 1211524 14926497 hadcm3n_zlg2_1880_40_008026540_3 285,120 331,054 1.1611
07 Aug 2012 14:20:07 1211524 14926497 hadcm3n_zlg2_1880_40_008026540_3 259,200 299,692 1.1562
07 Aug 2012 14:20:07 1211524 14926497 hadcm3n_zlg2_1880_40_008026540_3 233,280 268,844 1.1525
07 Aug 2012 14:20:07 1211524 14926497 hadcm3n_zlg2_1880_40_008026540_3 207,360 238,329 1.1493
07 Aug 2012 14:20:07 1211524 14926497 hadcm3n_zlg2_1880_40_008026540_3 181,440 207,692 1.1447
03 Aug 2012 08:51:27 1211524 14926497 hadcm3n_zlg2_1880_40_008026540_3 155,520 177,235 1.1396
02 Aug 2012 20:38:52 1211524 14926497 hadcm3n_zlg2_1880_40_008026540_3 129,600 147,002 1.1343
02 Aug 2012 09:42:01 1211524 14926497 hadcm3n_zlg2_1880_40_008026540_3 103,680 116,781 1.1264
01 Aug 2012 21:34:48 1211524 14926497 hadcm3n_zlg2_1880_40_008026540_3 77,760 86,825 1.1166
01 Aug 2012 09:35:33 1211524 14926497 hadcm3n_zlg2_1880_40_008026540_3 51,840 56,700 1.0938
31 Jul 2012 22:02:55 1211524 14926497 hadcm3n_zlg2_1880_40_008026540_3 25,920 26,594 1.0260


©2024 cpdn.org