climateprediction.net home page
Task 13347411

Task 13347411

Name hadcm3n_t0zw_1940_40_007442820_1
Workunit 7640323
Created 8 Sep 2011, 22:19:00 UTC
Sent 8 Sep 2011, 22:24:21 UTC
Report deadline 9 Dec 2011, 5:51:32 UTC
Received 25 Sep 2011, 0:54:27 UTC
Server state Over
Outcome Computation error
Client state Compute error
Exit status 22 (0x00000016) Unknown error code
Computer ID 1168455
Run time 5 days 22 hours 14 min 25 sec
CPU time 5 days 20 hours 45 min 20 sec
Validate state Invalid
Credit 4,976.64
Device peak FLOPS 3.09 GFLOPS
Application version UK Met Office Coupled Model Full Resolution Ocean v6.07
windows_intelx86
Stderr
<core_client_version>6.12.34</core_client_version>
<![CDATA[
<message>
The device does not recognize the command. (0x16) - exit code 22 (0x16)
</message>
<stderr_txt>
CPDN Monitor - Quit request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
20:57:42 (6756): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
18:29:55 (3728): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Quit request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
09:53:23 (2196): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
14:27:44 (4748): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4624, iMonCtr=1
Model crash detected, will try to restart...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
14:58:51 (5908): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=8092, iMonCtr=1
Model crash detected, will try to restart...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=8092, iMonCtr=1
Model crash detected, will try to restart...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=8092, iMonCtr=1
Model crash detected, will try to restart...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=8092, iMonCtr=1
Model crash detected, will try to restart...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=8092, iMonCtr=1
Model crash detected, will try to restart...
17:50:53 (5932): No heartbeat from core client for 30 sec - exiting
17:50:54 (5932): No heartbeat from core client for 30 sec - exiting
17:50:55 (5932): No heartbeat from core client for 30 sec - exiting
17:50:56 (5932): No heartbeat from core client for 30 sec - exiting
17:50:57 (5932): No heartbeat from core client for 30 sec - exiting
17:50:58 (5932): No heartbeat from core client for 30 sec - exiting
17:50:59 (5932): No heartbeat from core client for 30 sec - exiting
17:51:00 (5932): No heartbeat from core client for 30 sec - exiting
17:51:01 (5932): No heartbeat from core client for 30 sec - exiting
17:51:02 (5932): No heartbeat from core client for 30 sec - exiting
17:51:03 (5932): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5732, iMonCtr=1
Model crash detected, will try to restart...
Sorry, too many model crashes! :-(
Called boinc_finish

</stderr_txt>
]]>
Latest Trickles Received
Time Sent (UTC) Host ID Result ID Result Name Timestep CPU Time (sec) Average (sec/TS)
23 Sep 2011 17:22:48 1168455 13347411 hadcm3n_t0zw_1940_40_007442820_1 414,720 488,619 1.1782
22 Sep 2011 22:57:37 1168455 13347411 hadcm3n_t0zw_1940_40_007442820_1 388,800 457,151 1.1758
22 Sep 2011 04:25:06 1168455 13347411 hadcm3n_t0zw_1940_40_007442820_1 362,880 426,552 1.1755
21 Sep 2011 19:17:44 1168455 13347411 hadcm3n_t0zw_1940_40_007442820_1 336,960 396,671 1.1772
20 Sep 2011 22:44:53 1168455 13347411 hadcm3n_t0zw_1940_40_007442820_1 311,040 366,042 1.1768
20 Sep 2011 05:46:02 1168455 13347411 hadcm3n_t0zw_1940_40_007442820_1 285,120 335,764 1.1776
19 Sep 2011 20:33:33 1168455 13347411 hadcm3n_t0zw_1940_40_007442820_1 259,200 304,552 1.1750
19 Sep 2011 03:09:20 1168455 13347411 hadcm3n_t0zw_1940_40_007442820_1 233,280 274,217 1.1755
18 Sep 2011 18:41:17 1168455 13347411 hadcm3n_t0zw_1940_40_007442820_1 207,360 244,016 1.1768
18 Sep 2011 09:58:05 1168455 13347411 hadcm3n_t0zw_1940_40_007442820_1 181,440 214,398 1.1816
17 Sep 2011 22:23:14 1168455 13347411 hadcm3n_t0zw_1940_40_007442820_1 155,520 184,526 1.1865
17 Sep 2011 13:12:35 1168455 13347411 hadcm3n_t0zw_1940_40_007442820_1 129,600 151,988 1.1727
17 Sep 2011 04:38:22 1168455 13347411 hadcm3n_t0zw_1940_40_007442820_1 103,680 121,418 1.1711
16 Sep 2011 07:29:57 1168455 13347411 hadcm3n_t0zw_1940_40_007442820_1 77,760 91,531 1.1771
10 Sep 2011 04:39:01 1168455 13347411 hadcm3n_t0zw_1940_40_007442820_1 51,840 60,490 1.1669
09 Sep 2011 19:13:15 1168455 13347411 hadcm3n_t0zw_1940_40_007442820_1 25,920 30,896 1.1920


©2024 cpdn.org