Task 15637678

Name hadcm3n_3lzk_1940_40_008267996_2
Workunit 8423120
Created 24 Feb 2013, 23:03:53 UTC
Sent 25 Feb 2013, 6:43:55 UTC
Report deadline 27 May 2013, 14:11:06 UTC
Received 9 Apr 2013, 5:08:44 UTC
Server state Over
Outcome Computation error
Client state Compute error
Exit status 22 (0x00000016) Unknown error code
Computer ID 1237173
Run time 2 days 5 hours 26 min 52 sec
CPU time 1 day 19 hours 8 min 21 sec
Validate state Invalid
Credit 1,555.20
Device peak FLOPS 3.76 GFLOPS
Application version UK Met Office Coupled Model Full Resolution Ocean v6.07 (windows_intelx86)
Stderr
<core_client_version>7.0.28</core_client_version>
<![CDATA[
<message>
The device does not recognize the command. (0x16) - exit code 22 (0x16)
</message>
<stderr_txt>
17:37:45 (37976): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
17:37:51 (37976): No heartbeat from core client for 30 sec - exiting
17:37:52 (37976): No heartbeat from core client for 30 sec - exiting
17:37:53 (37976): No heartbeat from core client for 30 sec - exiting
19:50:22 (39576): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
19:50:25 (39576): No heartbeat from core client for 30 sec - exiting
19:50:26 (39576): No heartbeat from core client for 30 sec - exiting
19:50:27 (39576): No heartbeat from core client for 30 sec - exiting
19:50:28 (39576): No heartbeat from core client for 30 sec - exiting
19:50:29 (39576): No heartbeat from core client for 30 sec - exiting
19:50:30 (39576): No heartbeat from core client for 30 sec - exiting
19:50:31 (39576): No heartbeat from core client for 30 sec - exiting
19:50:32 (39576): No heartbeat from core client for 30 sec - exiting
19:50:33 (39576): No heartbeat from core client for 30 sec - exiting
19:50:34 (39576): No heartbeat from core client for 30 sec - exiting
Atmos Hold Restart file rename failed on atmos_restart.hold
21:07:04 (40124): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
21:07:05 (40124): No heartbeat from core client for 30 sec - exiting
21:07:06 (40124): No heartbeat from core client for 30 sec - exiting
21:07:07 (40124): No heartbeat from core client for 30 sec - exiting
21:07:08 (40124): No heartbeat from core client for 30 sec - exiting
21:07:09 (40124): No heartbeat from core client for 30 sec - exiting
21:07:10 (40124): No heartbeat from core client for 30 sec - exiting
21:07:11 (40124): No heartbeat from core client for 30 sec - exiting
21:07:12 (40124): No heartbeat from core client for 30 sec - exiting
21:07:13 (40124): No heartbeat from core client for 30 sec - exiting
21:07:14 (40124): No heartbeat from core client for 30 sec - exiting
23:25:30 (40948): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
23:54:41 (41120): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
00:18:22 (41880): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
02:18:16 (39448): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
02:18:21 (39448): No heartbeat from core client for 30 sec - exiting
02:18:22 (39448): No heartbeat from core client for 30 sec - exiting
02:18:23 (39448): No heartbeat from core client for 30 sec - exiting
02:18:24 (39448): No heartbeat from core client for 30 sec - exiting
02:18:25 (39448): No heartbeat from core client for 30 sec - exiting
02:18:26 (39448): No heartbeat from core client for 30 sec - exiting
02:18:27 (39448): No heartbeat from core client for 30 sec - exiting
02:18:28 (39448): No heartbeat from core client for 30 sec - exiting
02:18:29 (39448): No heartbeat from core client for 30 sec - exiting
02:18:30 (39448): No heartbeat from core client for 30 sec - exiting
02:22:10 (42736): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
02:22:15 (42736): No heartbeat from core client for 30 sec - exiting
02:22:16 (42736): No heartbeat from core client for 30 sec - exiting
02:22:17 (42736): No heartbeat from core client for 30 sec - exiting
02:22:18 (42736): No heartbeat from core client for 30 sec - exiting
03:02:11 (40220): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
03:02:20 (40220): No heartbeat from core client for 30 sec - exiting
03:02:21 (40220): No heartbeat from core client for 30 sec - exiting
03:02:22 (40220): No heartbeat from core client for 30 sec - exiting
03:02:23 (40220): No heartbeat from core client for 30 sec - exiting
03:02:24 (40220): No heartbeat from core client for 30 sec - exiting
03:02:25 (40220): No heartbeat from core client for 30 sec - exiting
03:02:26 (40220): No heartbeat from core client for 30 sec - exiting
03:02:27 (40220): No heartbeat from core client for 30 sec - exiting
Atmos Hold Restart file rename failed on atmos_restart.hold
03:20:15 (42664): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
03:20:16 (42664): No heartbeat from core client for 30 sec - exiting
03:20:17 (42664): No heartbeat from core client for 30 sec - exiting
03:20:18 (42664): No heartbeat from core client for 30 sec - exiting
03:20:19 (42664): No heartbeat from core client for 30 sec - exiting
03:20:20 (42664): No heartbeat from core client for 30 sec - exiting
03:20:21 (42664): No heartbeat from core client for 30 sec - exiting
03:20:22 (42664): No heartbeat from core client for 30 sec - exiting
03:20:23 (42664): No heartbeat from core client for 30 sec - exiting
03:20:24 (42664): No heartbeat from core client for 30 sec - exiting
03:20:25 (42664): No heartbeat from core client for 30 sec - exiting
04:01:08 (42424): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
04:01:11 (42424): No heartbeat from core client for 30 sec - exiting
04:06:37 (43932): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
05:20:52 (43924): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
07:17:45 (43852): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
11:48:24 (7200): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
12:09:10 (8864): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
Atmos Hold Restart file rename failed on atmos_restart.hold
12:34:21 (9336): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
12:37:19 (6428): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
13:57:32 (10116): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
Atmos Hold Restart file rename failed on atmos_restart.hold
17:58:39 (11124): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
17:58:40 (11124): No heartbeat from core client for 30 sec - exiting
17:58:41 (11124): No heartbeat from core client for 30 sec - exiting
17:58:42 (11124): No heartbeat from core client for 30 sec - exiting
17:58:43 (11124): No heartbeat from core client for 30 sec - exiting
17:58:44 (11124): No heartbeat from core client for 30 sec - exiting
17:58:45 (11124): No heartbeat from core client for 30 sec - exiting
18:04:41 (15688): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
18:10:42 (10848): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
02:48:45 (17368): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
02:48:52 (17368): No heartbeat from core client for 30 sec - exiting
02:48:53 (17368): No heartbeat from core client for 30 sec - exiting
02:48:54 (17368): No heartbeat from core client for 30 sec - exiting
02:48:55 (17368): No heartbeat from core client for 30 sec - exiting
02:48:56 (17368): No heartbeat from core client for 30 sec - exiting
02:48:57 (17368): No heartbeat from core client for 30 sec - exiting
02:48:58 (17368): No heartbeat from core client for 30 sec - exiting
02:53:54 (20028): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
02:57:01 (24592): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
03:08:05 (7360): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
03:15:23 (16932): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
03:26:42 (26580): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
03:39:46 (27244): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
03:49:51 (25644): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
03:53:04 (22580): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
04:06:09 (27156): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
04:21:54 (24436): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
05:28:25 (24656): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
05:38:03 (21216): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
06:26:44 (28124): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=25880, iMonCtr=1
Model crash detected, will try to restart...
CPDN Monitor - Quit request from BOINC...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=34548, iMonCtr=1
Model crash detected, will try to restart...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=34548, iMonCtr=1
Model crash detected, will try to restart...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=34548, iMonCtr=1
Model crash detected, will try to restart...
CPDN Monitor - Quit request from BOINC...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=40296, iMonCtr=1
Model crash detected, will try to restart...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5680, iMonCtr=1
Model crash detected, will try to restart...
Sorry, too many model crashes! :-(
Called boinc_finish

</stderr_txt>
]]>
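The stderr output above is dominated by repeated "No heartbeat" lines, each tagged with a time and a process ID. The following is a minimal sketch, not part of the original page, of how such lines could be tallied per process ID. The names `HEARTBEAT_RE` and `count_heartbeat_timeouts` are hypothetical, and the sample text is copied from the first lines of the log.

```python
import re
from collections import Counter

# Pattern for lines such as:
# "17:37:45 (37976): No heartbeat from core client for 30 sec - exiting"
# (assumed format, inferred only from the log shown on this page)
HEARTBEAT_RE = re.compile(
    r"^(\d{2}:\d{2}:\d{2}) \((\d+)\): No heartbeat from core client"
)

def count_heartbeat_timeouts(stderr_text):
    """Return a Counter mapping process ID -> number of heartbeat timeout lines."""
    counts = Counter()
    for line in stderr_text.splitlines():
        match = HEARTBEAT_RE.match(line.strip())
        if match:
            counts[int(match.group(2))] += 1
    return counts

# Sample copied from the start of the stderr section.
sample = """\
17:37:45 (37976): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
17:37:51 (37976): No heartbeat from core client for 30 sec - exiting
17:37:52 (37976): No heartbeat from core client for 30 sec - exiting
17:37:53 (37976): No heartbeat from core client for 30 sec - exiting
19:50:22 (39576): No heartbeat from core client for 30 sec - exiting
"""

counts = count_heartbeat_timeouts(sample)
for pid, n in counts.items():
    print(f"PID {pid}: {n} heartbeat timeout line(s)")
```

Lines that do not match the pattern, such as the "CPDN Monitor" messages, are skipped rather than counted.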
Latest Trickles Received
Time Sent (UTC)       Host ID  Result ID  Result Name                       Timestep  CPU Time (sec)  Average (sec/TS)
06 Apr 2013 01:40:40  1237173  15637678   hadcm3n_3lzk_1940_40_008267996_2  129,600   137,764         1.0630
05 Apr 2013 16:52:25  1237173  15637678   hadcm3n_3lzk_1940_40_008267996_2  103,680   110,071         1.0616
05 Apr 2013 07:39:03  1237173  15637678   hadcm3n_3lzk_1940_40_008267996_2   77,760    82,426         1.0600
04 Apr 2013 21:20:00  1237173  15637678   hadcm3n_3lzk_1940_40_008267996_2   51,840    54,954         1.0601
30 Mar 2013 08:24:15  1237173  15637678   hadcm3n_3lzk_1940_40_008267996_2   25,920    27,837         1.0740
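
The "Average (sec/TS)" column appears to be the cumulative CPU time divided by the timestep count. The sketch below checks that reading against the five trickle rows. The function name `average_sec_per_timestep` is hypothetical, and the formula is an assumption inferred from the figures on this page, not something the page states.

```python
# Trickle rows copied from the table: (timestep, cpu_time_sec, reported_average)
trickles = [
    (129_600, 137_764, 1.0630),
    (103_680, 110_071, 1.0616),
    (77_760, 82_426, 1.0600),
    (51_840, 54_954, 1.0601),
    (25_920, 27_837, 1.0740),
]

def average_sec_per_timestep(cpu_time_sec, timestep):
    """Assumed definition: cumulative CPU seconds per model timestep."""
    return cpu_time_sec / timestep

# Round to four decimals to match the precision shown in the table.
computed = [round(average_sec_per_timestep(cpu, ts), 4) for ts, cpu, _ in trickles]

for (ts, cpu, reported), value in zip(trickles, computed):
    print(f"{ts:>7} timesteps: computed {value:.4f}, reported {reported:.4f}")
```

Under that assumption, each computed value agrees with the reported average to four decimal places.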

