climateprediction.net home page
Task 14291897

Task 14291897

Name hadcm3n_yh11_1940_40_007834263_0
Workunit 7989375
Created 19 Mar 2012, 15:31:52 UTC
Sent 19 Mar 2012, 22:08:36 UTC
Report deadline 19 Jun 2012, 5:35:47 UTC
Received 4 Apr 2012, 18:43:24 UTC
Server state Over
Outcome Computation error
Client state Compute error
Exit status 22 (0x00000016) Unknown error code
Computer ID 1167875
Run time 18 hours 7 min 15 sec
CPU time 17 hours 28 min 35 sec
Validate state Invalid
Credit 622.08
Device peak FLOPS 4.09 GFLOPS
Application version UK Met Office Coupled Model Full Resolution Ocean v6.07
windows_intelx86
Stderr
<core_client_version>6.12.34</core_client_version>
<![CDATA[
<message>
The device does not recognize the command. (0x16) - exit code 22 (0x16)
</message>
<stderr_txt>
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
18:32:42 (4024): No heartbeat from core client for 30 sec - exiting
18:32:43 (4024): No heartbeat from core client for 30 sec - exiting
18:32:44 (4024): No heartbeat from core client for 30 sec - exiting
18:32:45 (4024): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
CPDN Monitor - Quit request from BOINC...
22:24:42 (1616): No heartbeat from core client for 30 sec - exiting
22:24:43 (1616): No heartbeat from core client for 30 sec - exiting
22:24:44 (1616): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
CPDN Monitor - Quit request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
CPDN Monitor - Quit request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
CPDN Monitor - Quit request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=1748, iMonCtr=1
Model crash detected, will try to restart...
19:00:04 (4220): No heartbeat from core client for 30 sec - exiting
19:00:06 (4220): No heartbeat from core client for 30 sec - exiting
19:00:07 (4220): No heartbeat from core client for 30 sec - exiting
19:00:08 (4220): No heartbeat from core client for 30 sec - exiting
19:00:09 (4220): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
22:12:46 (4528): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
22:15:23 (6112): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
CPDN Monitor - Quit request from BOINC...
20:18:32 (2992): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
CPDN Monitor - Quit request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
19:06:27 (2200): No heartbeat from core client for 30 sec - exiting
19:06:28 (2200): No heartbeat from core client for 30 sec - exiting
19:06:29 (2200): No heartbeat from core client for 30 sec - exiting
19:06:30 (2200): No heartbeat from core client for 30 sec - exiting
19:06:31 (2200): No heartbeat from core client for 30 sec - exiting
19:06:32 (2200): No heartbeat from core client for 30 sec - exiting
19:06:33 (2200): No heartbeat from core client for 30 sec - exiting
19:06:34 (2200): No heartbeat from core client for 30 sec - exiting
19:06:35 (2200): No heartbeat from core client for 30 sec - exiting
19:06:36 (2200): No heartbeat from core client for 30 sec - exiting
19:06:37 (2200): No heartbeat from core client for 30 sec - exiting
19:06:38 (2200): No heartbeat from core client for 30 sec - exiting
19:06:39 (2200): No heartbeat from core client for 30 sec - exiting
19:06:40 (2200): No heartbeat from core client for 30 sec - exiting
19:06:41 (2200): No heartbeat from core client for 30 sec - exiting
19:06:42 (2200): No heartbeat from core client for 30 sec - exiting
19:06:43 (2200): No heartbeat from core client for 30 sec - exiting
19:06:44 (2200): No heartbeat from core client for 30 sec - exiting
19:06:45 (2200): No heartbeat from core client for 30 sec - exiting
19:06:46 (2200): No heartbeat from core client for 30 sec - exiting
19:06:47 (2200): No heartbeat from core client for 30 sec - exiting
19:06:48 (2200): No heartbeat from core client for 30 sec - exiting
19:06:49 (2200): No heartbeat from core client for 30 sec - exiting
19:06:50 (2200): No heartbeat from core client for 30 sec - exiting
19:06:51 (2200): No heartbeat from core client for 30 sec - exiting
19:06:52 (2200): No heartbeat from core client for 30 sec - exiting
19:06:53 (2200): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
CPDN Monitor - Quit request from BOINC...
19:42:32 (5596): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
CPDN Monitor - Quit request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
18:38:21 (4480): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
19:45:15 (5688): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
19:55:53 (1760): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
CPDN Monitor - Quit request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
07:22:36 (4884): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
07:39:53 (4548): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4932, iMonCtr=1
Model crash detected, will try to restart...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4932, iMonCtr=1
Model crash detected, will try to restart...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4932, iMonCtr=1
Model crash detected, will try to restart...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4932, iMonCtr=1
Model crash detected, will try to restart...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4932, iMonCtr=1
Model crash detected, will try to restart...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4932, iMonCtr=1
Model crash detected, will try to restart...
Sorry, too many model crashes! :-(
Called boinc_finish

</stderr_txt>
]]>
Latest Trickles Received
Time Sent (UTC) Host ID Result ID Result Name Timestep CPU Time (sec) Average (sec/TS)
29 Mar 2012 20:05:09 1167875 14291897 hadcm3n_yh11_1940_40_007834263_0 51,840 45,791 0.8833
22 Mar 2012 22:39:21 1167875 14291897 hadcm3n_yh11_1940_40_007834263_0 25,920 23,011 0.8878


©2024 climateprediction.net