climateprediction.net home page
Task 13420685

Task 13420685

Name hadcm3n_t688_1940_40_007448433_3
Workunit 7645936
Created 25 Sep 2011, 14:06:18 UTC
Sent 25 Sep 2011, 14:13:20 UTC
Report deadline 25 Dec 2011, 21:40:31 UTC
Received 28 Sep 2011, 0:43:34 UTC
Server state Over
Outcome Computation error
Client state Compute error
Exit status 22 (0x00000016) Unknown error code
Computer ID 1165372
Run time 22 hours 27 min 51 sec
CPU time 21 hours 0 min 3 sec
Validate state Invalid
Credit 622.08
Device peak FLOPS 2.80 GFLOPS
Application version UK Met Office Coupled Model Full Resolution Ocean v6.07
windows_intelx86
Stderr
<core_client_version>6.12.34</core_client_version>
<![CDATA[
<message>
The device does not recognize the command. (0x16) - exit code 22 (0x16)
</message>
<stderr_txt>
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
CPDN Monitor - Quit request from BOINC...
Atmos Hold Restart file rename failed on atmos_restart.hold
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
19:03:19 (5776): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
19:03:24 (5776): No heartbeat from core client for 30 sec - exiting
19:03:25 (5776): No heartbeat from core client for 30 sec - exiting
19:03:26 (5776): No heartbeat from core client for 30 sec - exiting
Suspended CPDN Monitor - Suspend request from BOINC...
Atmos Hold Restart file rename failed on atmos_restart.hold
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
20:02:48 (12272): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
20:02:49 (12272): No heartbeat from core client for 30 sec - exiting
20:02:50 (12272): No heartbeat from core client for 30 sec - exiting
20:02:51 (12272): No heartbeat from core client for 30 sec - exiting
Suspended CPDN Monitor - Suspend request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
CPDN Monitor - Quit request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
CPDN Monitor - Quit request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
12:53:23 (3888): No heartbeat from core client for 30 sec - exiting
12:53:24 (3888): No heartbeat from core client for 30 sec - exiting
12:53:25 (3888): No heartbeat from core client for 30 sec - exiting
12:53:26 (3888): No heartbeat from core client for 30 sec - exiting
12:53:27 (3888): No heartbeat from core client for 30 sec - exiting
12:53:28 (3888): No heartbeat from core client for 30 sec - exiting
12:53:29 (3888): No heartbeat from core client for 30 sec - exiting
12:53:31 (3888): No heartbeat from core client for 30 sec - exiting
12:53:32 (3888): No heartbeat from core client for 30 sec - exiting
12:53:33 (3888): No heartbeat from core client for 30 sec - exiting
12:53:34 (3888): No heartbeat from core client for 30 sec - exiting
12:53:35 (3888): No heartbeat from core client for 30 sec - exiting
12:53:36 (3888): No heartbeat from core client for 30 sec - exiting
12:53:37 (3888): No heartbeat from core client for 30 sec - exiting
12:53:38 (3888): No heartbeat from core client for 30 sec - exiting
12:53:39 (3888): No heartbeat from core client for 30 sec - exiting
12:53:40 (3888): No heartbeat from core client for 30 sec - exiting
12:53:41 (3888): No heartbeat from core client for 30 sec - exiting
12:53:42 (3888): No heartbeat from core client for 30 sec - exiting
12:53:44 (3888): No heartbeat from core client for 30 sec - exiting
12:53:45 (3888): No heartbeat from core client for 30 sec - exiting
12:53:46 (3888): No heartbeat from core client for 30 sec - exiting
12:53:47 (3888): No heartbeat from core client for 30 sec - exiting
12:53:48 (3888): No heartbeat from core client for 30 sec - exiting
12:53:49 (3888): No heartbeat from core client for 30 sec - exiting
12:53:50 (3888): No heartbeat from core client for 30 sec - exiting
12:53:51 (3888): No heartbeat from core client for 30 sec - exiting
12:53:52 (3888): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
12:53:53 (3888): No heartbeat from core client for 30 sec - exiting
12:53:54 (3888): No heartbeat from core client for 30 sec - exiting
12:53:56 (3888): No heartbeat from core client for 30 sec - exiting
12:53:57 (3888): No heartbeat from core client for 30 sec - exiting
12:53:58 (3888): No heartbeat from core client for 30 sec - exiting
12:53:59 (3888): No heartbeat from core client for 30 sec - exiting
12:54:00 (3888): No heartbeat from core client for 30 sec - exiting
12:54:01 (3888): No heartbeat from core client for 30 sec - exiting
12:54:02 (3888): No heartbeat from core client for 30 sec - exiting
12:54:03 (3888): No heartbeat from core client for 30 sec - exiting
Suspended CPDN Monitor - Suspend request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
17:06:41 (3220): No heartbeat from core client for 30 sec - exiting
Suspended CPDN Monitor - Suspend request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
CPDN Monitor - Quit request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
CPDN Monitor - Quit request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5680, iMonCtr=1
Model crash detected, will try to restart...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5680, iMonCtr=1
Model crash detected, will try to restart...
Suspended CPDN Monitor - Suspend request from BOINC...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5680, iMonCtr=1
Model crash detected, will try to restart...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5680, iMonCtr=1
Model crash detected, will try to restart...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5680, iMonCtr=1
Model crash detected, will try to restart...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5680, iMonCtr=1
Model crash detected, will try to restart...
Sorry, too many model crashes! :-(
Called boinc_finish

</stderr_txt>
]]>
Latest Trickles Received
Time Sent (UTC) Host ID Result ID Result Name Timestep CPU Time (sec) Average (sec/TS)
28 Sep 2011 00:43:25 1165372 13420685 hadcm3n_t688_1940_40_007448433_3 51,840 73,279 1.4136
27 Sep 2011 11:24:53 1165372 13420685 hadcm3n_t688_1940_40_007448433_3 25,920 36,965 1.4261


©2024 cpdn.org