climateprediction.net home page
Task 15806594

Task 15806594

Name hadcm3n_4bco_1940_40_008307644_3
Workunit 8458779
Created 30 May 2013, 11:03:38 UTC
Sent 20 Jun 2013, 6:57:01 UTC
Report deadline 19 Sep 2013, 14:24:12 UTC
Received 30 Jun 2013, 17:29:51 UTC
Server state Over
Outcome Computation error
Client state Compute error
Exit status 22 (0x00000016) Unknown error code
Computer ID 1194844
Run time 5 days 16 hours 7 min 49 sec
CPU time 5 days 11 hours 13 min 41 sec
Validate state Invalid
Credit 5,287.68
Device peak FLOPS 3.62 GFLOPS
Application version UK Met Office Coupled Model Full Resolution Ocean v6.07
windows_intelx86
Stderr
<core_client_version>7.0.64</core_client_version>
<![CDATA[
<message>
The device does not recognize the command.
 (0x16) - exit code 22 (0x16)
</message>
<stderr_txt>
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
09:30:11 (151364): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
20:29:41 (206320): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
18:40:03 (4284): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
18:44:33 (16248): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
03:37:27 (16284): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
14:14:20 (39844): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
14:18:20 (65640): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
17:42:49 (64096): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
20:06:52 (68116): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
CPDN Monitor - Quit request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
07:57:12 (96380): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
07:58:53 (115652): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
08:00:12 (116696): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
08:01:31 (116312): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
CPDN Monitor - Quit request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
10:04:12 (172528): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
10:08:22 (185032): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
10:08:23 (185032): No heartbeat from core client for 30 sec - exiting
10:08:24 (185032): No heartbeat from core client for 30 sec - exiting
10:08:25 (185032): No heartbeat from core client for 30 sec - exiting
10:08:26 (185032): No heartbeat from core client for 30 sec - exiting
10:08:27 (185032): No heartbeat from core client for 30 sec - exiting
10:08:28 (185032): No heartbeat from core client for 30 sec - exiting
10:08:29 (185032): No heartbeat from core client for 30 sec - exiting
10:08:30 (185032): No heartbeat from core client for 30 sec - exiting
10:08:31 (185032): No heartbeat from core client for 30 sec - exiting
10:08:32 (185032): No heartbeat from core client for 30 sec - exiting
Atmos Hold Restart file rename failed on atmos_restart.hold
14:19:38 (184588): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
14:33:08 (194872): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
14:43:42 (196404): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
CPDN Monitor - Quit request from BOINC...
12:51:16 (236720): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
CPDN Monitor - Quit request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
CPDN Monitor - Quit request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=319484, iMonCtr=1
Model crash detected, will try to restart...
19:18:51 (6080): No heartbeat from core client for 30 sec - exiting
19:18:52 (6080): No heartbeat from core client for 30 sec - exiting
19:18:53 (6080): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
CPDN Monitor - Quit request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
CPDN Monitor - Quit request from BOINC...
18:24:56 (13976): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
18:26:29 (15044): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
18:28:32 (11180): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
18:28:40 (11180): No heartbeat from core client for 30 sec - exiting
18:28:41 (11180): No heartbeat from core client for 30 sec - exiting
18:31:47 (9424): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
18:32:01 (9424): No heartbeat from core client for 30 sec - exiting
18:32:02 (9424): No heartbeat from core client for 30 sec - exiting
18:32:03 (9424): No heartbeat from core client for 30 sec - exiting
18:32:04 (9424): No heartbeat from core client for 30 sec - exiting
18:33:02 (14296): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
18:33:39 (14296): No heartbeat from core client for 30 sec - exiting
18:33:40 (14296): No heartbeat from core client for 30 sec - exiting
18:33:41 (14296): No heartbeat from core client for 30 sec - exiting
18:34:17 (15648): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
Atmos Hold Restart file rename failed on atmos_restart.hold
18:35:23 (16080): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
18:35:56 (15248): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
Atmos Hold Restart file rename failed on atmos_restart.hold
18:37:30 (16272): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
Atmos Hold Restart file rename failed on atmos_restart.hold
18:38:41 (14220): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
18:39:17 (15468): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
A18:40:36 (16200): No heartbeat from core client for 30 sec - exiting
tmos Hold Restart file rename failed on atmos_restart.hold
CPDN Monitor - No 'heartbeat' from BOINC...
18:40:52 (16200): No heartbeat from core client for 30 sec - exiting
18:40:53 (16200): No heartbeat from core client for 30 sec - exiting
18:40:54 (16200): No heartbeat from core client for 30 sec - exiting
18:41:55 (17088): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
18:43:06 (16808): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
18:44:53 (16868): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
18:45:42 (16644): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
18:47:31 (16804): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=16940, iMonCtr=1
Model crash detected, will try to restart...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=16940, iMonCtr=1
Model crash detected, will try to restart...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=16940, iMonCtr=1
Model crash detected, will try to restart...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=16940, iMonCtr=1
Model crash detected, will try to restart...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=16940, iMonCtr=1
Model crash detected, will try to restart...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=16940, iMonCtr=1
Model crash detected, will try to restart...
Sorry, too many model crashes! :-(
Called boinc_finish

</stderr_txt>
]]>
Latest Trickles Received
Time Sent (UTC) Host ID Result ID Result Name Timestep CPU Time (sec) Average (sec/TS)
02 Jul 2013 10:44:42 1194844 15806594 hadcm3n_4bco_1940_40_008307644_3 440,640 470,998 1.0689
02 Jul 2013 10:32:28 1194844 15806594 hadcm3n_4bco_1940_40_008307644_3 414,720 443,376 1.0691
02 Jul 2013 10:02:44 1194844 15806594 hadcm3n_4bco_1940_40_008307644_3 388,800 415,713 1.0692
28 Jun 2013 05:22:04 1194844 15806594 hadcm3n_4bco_1940_40_008307644_3 362,880 389,865 1.0744
27 Jun 2013 06:33:01 1194844 15806594 hadcm3n_4bco_1940_40_008307644_3 336,960 362,351 1.0754
26 Jun 2013 08:19:18 1194844 15806594 hadcm3n_4bco_1940_40_008307644_3 311,040 334,401 1.0751
25 Jun 2013 23:53:50 1194844 15806594 hadcm3n_4bco_1940_40_008307644_3 285,120 306,570 1.0752
25 Jun 2013 11:37:28 1194844 15806594 hadcm3n_4bco_1940_40_008307644_3 259,200 278,996 1.0764
25 Jun 2013 03:33:52 1194844 15806594 hadcm3n_4bco_1940_40_008307644_3 233,280 251,205 1.0768
24 Jun 2013 19:16:56 1194844 15806594 hadcm3n_4bco_1940_40_008307644_3 207,360 223,518 1.0779
23 Jun 2013 16:20:32 1194844 15806594 hadcm3n_4bco_1940_40_008307644_3 181,440 195,693 1.0786
22 Jun 2013 08:11:12 1194844 15806594 hadcm3n_4bco_1940_40_008307644_3 155,520 167,627 1.0778
22 Jun 2013 00:15:02 1194844 15806594 hadcm3n_4bco_1940_40_008307644_3 129,600 140,060 1.0807
21 Jun 2013 16:24:42 1194844 15806594 hadcm3n_4bco_1940_40_008307644_3 103,680 112,107 1.0813
21 Jun 2013 08:18:11 1194844 15806594 hadcm3n_4bco_1940_40_008307644_3 77,760 84,104 1.0816
21 Jun 2013 00:03:40 1194844 15806594 hadcm3n_4bco_1940_40_008307644_3 51,840 56,103 1.0822
20 Jun 2013 16:01:43 1194844 15806594 hadcm3n_4bco_1940_40_008307644_3 25,920 28,114 1.0846


©2024 climateprediction.net