climateprediction.net home page
Task 15626323

Task 15626323

Name hadcm3n_zao8_1880_40_008253001_3
Workunit 8408125
Created 23 Feb 2013, 0:53:35 UTC
Sent 23 Feb 2013, 0:53:40 UTC
Report deadline 25 May 2013, 8:20:51 UTC
Received 28 Feb 2013, 13:05:26 UTC
Server state Over
Outcome Computation error
Client state Compute error
Exit status 22 (0x00000016) Unknown error code
Computer ID 1204010
Run time 4 days 18 hours 9 min 22 sec
CPU time 4 days 5 hours 9 min 2 sec
Validate state Invalid
Credit 2,799.36
Device peak FLOPS 3.03 GFLOPS
Application version UK Met Office Coupled Model Full Resolution Ocean v6.07
windows_intelx86
Stderr
<core_client_version>7.0.28</core_client_version>
<![CDATA[
<message>
The device does not recognize the command. (0x16) - exit code 22 (0x16)
</message>
<stderr_txt>
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
05:25:52 (3968): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Quit request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
19:16:03 (18852): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
19:16:05 (18852): No heartbeat from core client for 30 sec - exiting
19:16:06 (18852): No heartbeat from core client for 30 sec - exiting
19:16:07 (18852): No heartbeat from core client for 30 sec - exiting
19:16:08 (18852): No heartbeat from core client for 30 sec - exiting
19:16:09 (18852): No heartbeat from core client for 30 sec - exiting
19:16:10 (18852): No heartbeat from core client for 30 sec - exiting
19:16:11 (18852): No heartbeat from core client for 30 sec - exiting
19:18:10 (16660): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
20:14:24 (15136): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
20:14:26 (15136): No heartbeat from core client for 30 sec - exiting
20:14:27 (15136): No heartbeat from core client for 30 sec - exiting
20:15:48 (7680): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
21:52:03 (1340): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
21:52:05 (1340): No heartbeat from core client for 30 sec - exiting
21:52:06 (1340): No heartbeat from core client for 30 sec - exiting
21:52:07 (1340): No heartbeat from core client for 30 sec - exiting
21:52:08 (1340): No heartbeat from core client for 30 sec - exiting
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
02:03:25 (6472): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
02:03:27 (6472): No heartbeat from core client for 30 sec - exiting
02:04:32 (17456): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
02:55:16 (16344): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
02:56:24 (5552): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5072, iMonCtr=1
Model crash detected, will try to restart...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5072, iMonCtr=1
Model crash detected, will try to restart...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5072, iMonCtr=1
Model crash detected, will try to restart...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5072, iMonCtr=1
Model crash detected, will try to restart...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5072, iMonCtr=1
Model crash detected, will try to restart...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5072, iMonCtr=1
Model crash detected, will try to restart...
CPDN Monitor - Quit request from BOINC...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=888, iMonCtr=1
Model crash detected, will try to restart...
Sorry, too many model crashes! :-(
Called boinc_finish

</stderr_txt>
]]>
Latest Trickles Received
Time Sent (UTC) Host ID Result ID Result Name Timestep CPU Time (sec) Average (sec/TS)
28 Feb 2013 05:21:51 1204010 15626323 hadcm3n_zao8_1880_40_008253001_3 233,280 343,496 1.4725
27 Feb 2013 12:50:00 1204010 15626323 hadcm3n_zao8_1880_40_008253001_3 207,360 302,191 1.4573
26 Feb 2013 20:27:13 1204010 15626323 hadcm3n_zao8_1880_40_008253001_3 181,440 262,727 1.4480
26 Feb 2013 08:10:07 1204010 15626323 hadcm3n_zao8_1880_40_008253001_3 155,520 226,065 1.4536
25 Feb 2013 14:31:39 1204010 15626323 hadcm3n_zao8_1880_40_008253001_3 129,600 187,149 1.4441
24 Feb 2013 22:14:58 1204010 15626323 hadcm3n_zao8_1880_40_008253001_3 103,680 145,492 1.4033
24 Feb 2013 11:21:26 1204010 15626323 hadcm3n_zao8_1880_40_008253001_3 77,760 109,349 1.4062
23 Feb 2013 23:51:42 1204010 15626323 hadcm3n_zao8_1880_40_008253001_3 51,840 72,956 1.4073
23 Feb 2013 12:27:54 1204010 15626323 hadcm3n_zao8_1880_40_008253001_3 25,920 36,419 1.4051


©2024 climateprediction.net