climateprediction.net home page
Task 13603368

Task 13603368

Name hadcm3n_yfyw_1980_40_007537178_1
Workunit 7734410
Created 5 Nov 2011, 15:17:36 UTC
Sent 7 Nov 2011, 21:51:13 UTC
Report deadline 7 Feb 2012, 5:18:24 UTC
Received 10 Dec 2011, 10:08:25 UTC
Server state Over
Outcome Computation error
Client state Compute error
Exit status -226 (0xFFFFFF1E) ERR_TOO_MANY_EXITS
Computer ID 1044932
Run time 12 days 17 hours 23 min 21 sec
CPU time 10 days 10 hours 25 min 34 sec
Validate state Invalid
Credit 2,488.32
Device peak FLOPS 0.94 GFLOPS
Application version UK Met Office Coupled Model Full Resolution Ocean v6.07
windows_intelx86
Stderr
<core_client_version>6.10.18</core_client_version>
<![CDATA[
<message>
too many exit(0)s
</message>
<stderr_txt>
23:06:46 (960): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
23:06:47 (960): No heartbeat from core client for 30 sec - exiting
23:06:48 (960): No heartbeat from core client for 30 sec - exiting
01:45:06 (7388): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
01:45:08 (7388): No heartbeat from core client for 30 sec - exiting
01:45:09 (7388): No heartbeat from core client for 30 sec - exiting
01:45:10 (7388): No heartbeat from core client for 30 sec - exiting
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=2280, iMonCtr=1
Model crash detected, will try to restart...
17:41:34 (6136): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
17:41:35 (6136): No heartbeat from core client for 30 sec - exiting
17:47:11 (4444): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
18:23:47 (3452): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
20:01:32 (3588): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
20:01:34 (3588): No heartbeat from core client for 30 sec - exiting
22:21:36 (5240): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5680, iMonCtr=1
Model crash detected, will try to restart...
05:47:28 (5324): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
06:26:28 (3736): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
06:26:30 (3736): No heartbeat from core client for 30 sec - exiting
06:26:31 (3736): No heartbeat from core client for 30 sec - exiting
06:26:32 (3736): No heartbeat from core client for 30 sec - exiting
06:26:33 (3736): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - Quit request from BOINC...
15:42:53 (4884): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
15:42:55 (4884): No heartbeat from core client for 30 sec - exiting
15:46:12 (4396): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
15:51:05 (548): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5960, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5056, iMonCtr=1
Model crash detected, will try to restart...
20:37:34 (4492): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
03:29:36 (8044): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
03:29:38 (8044): No heartbeat from core client for 30 sec - exiting
03:45:42 (3512): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
03:45:50 (3512): No heartbeat from core client for 30 sec - exiting
03:45:51 (3512): No heartbeat from core client for 30 sec - exiting
03:45:52 (3512): No heartbeat from core client for 30 sec - exiting
03:45:53 (3512): No heartbeat from core client for 30 sec - exiting
03:45:55 (3512): No heartbeat from core client for 30 sec - exiting
03:45:56 (3512): No heartbeat from core client for 30 sec - exiting
03:45:57 (3512): No heartbeat from core client for 30 sec - exiting
03:45:58 (3512): No heartbeat from core client for 30 sec - exiting
11:36:17 (5936): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
11:36:18 (5936): No heartbeat from core client for 30 sec - exiting
11:38:28 (3696): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
11:42:49 (1952): No heartbeat from core client for 30 sec - exiting
11:42:50 (1952): No heartbeat from core client for 30 sec - exiting
11:42:51 (1952): No heartbeat from core client for 30 sec - exiting
11:42:52 (1952): No heartbeat from core client for 30 sec - exiting
11:42:53 (1952): No heartbeat from core client for 30 sec - exiting
11:42:54 (1952): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
11:55:59 (4624): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
11:56:00 (4624): No heartbeat from core client for 30 sec - exiting
11:56:01 (4624): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - Quit request from BOINC...
22:27:14 (560): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
22:27:19 (560): No heartbeat from core client for 30 sec - exiting
02:51:28 (4708): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
02:51:29 (4708): No heartbeat from core client for 30 sec - exiting
02:51:30 (4708): No heartbeat from core client for 30 sec - exiting
02:51:31 (4708): No heartbeat from core client for 30 sec - exiting
02:51:32 (4708): No heartbeat from core client for 30 sec - exiting
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=6248, iMonCtr=1
Model crash detected, will try to restart...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=2384, iMonCtr=1
Model crash detected, will try to restart...
22:16:30 (5428): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
22:16:31 (5428): No heartbeat from core client for 30 sec - exiting
23:46:12 (4840): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
23:49:09 (4520): No heartbeat from core client for 30 sec - exiting
23:49:10 (4520): No heartbeat from core client for 30 sec - exiting
23:49:11 (4520): No heartbeat from core client for 30 sec - exiting
23:49:12 (4520): No heartbeat from core client for 30 sec - exiting
23:49:13 (4520): No heartbeat from core client for 30 sec - exiting
23:49:14 (4520): No heartbeat from core client for 30 sec - exiting
23:49:15 (4520): No heartbeat from core client for 30 sec - exiting
23:49:16 (4520): No heartbeat from core client for 30 sec - exiting
23:49:17 (4520): No heartbeat from core client for 30 sec - exiting
23:49:18 (4520): No heartbeat from core client for 30 sec - exiting
23:49:19 (4520): No heartbeat from core client for 30 sec - exiting
23:49:20 (4520): No heartbeat from core client for 30 sec - exiting
23:49:21 (4520): No heartbeat from core client for 30 sec - exiting
23:49:22 (4520): No heartbeat from core client for 30 sec - exiting
23:49:23 (4520): No heartbeat from core client for 30 sec - exiting
23:49:24 (4520): No heartbeat from core client for 30 sec - exiting
23:49:25 (4520): No heartbeat from core client for 30 sec - exiting
23:49:26 (4520): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=6752, iMonCtr=1
Model crash detected, will try to restart...
16:57:15 (4384): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
17:06:48 (5896): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
19:31:47 (2828): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
19:31:49 (2828): No heartbeat from core client for 30 sec - exiting
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3932, iMonCtr=1
Model crash detected, will try to restart...
10:57:30 (6004): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
10:57:31 (6004): No heartbeat from core client for 30 sec - exiting
11:20:24 (5896): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
11:20:25 (5896): No heartbeat from core client for 30 sec - exiting
19:05:31 (1028): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
19:05:33 (1028): No heartbeat from core client for 30 sec - exiting
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4536, iMonCtr=1
Model crash detected, will try to restart...
02:38:27 (5752): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=6796, iMonCtr=1
Model crash detected, will try to restart...

</stderr_txt>
]]>
Latest Trickles Received
Time Sent (UTC) Host ID Result ID Result Name Timestep CPU Time (sec) Average (sec/TS)
06 Dec 2011 23:27:32 1044932 13603368 hadcm3n_yfyw_1980_40_007537178_1 207,360 833,893 4.0215
30 Nov 2011 16:01:23 1044932 13603368 hadcm3n_yfyw_1980_40_007537178_1 181,440 729,318 4.0196
29 Nov 2011 08:01:05 1044932 13603368 hadcm3n_yfyw_1980_40_007537178_1 155,520 624,702 4.0169
27 Nov 2011 09:25:53 1044932 13603368 hadcm3n_yfyw_1980_40_007537178_1 129,600 518,917 4.0040
23 Nov 2011 21:48:17 1044932 13603368 hadcm3n_yfyw_1980_40_007537178_1 103,680 415,051 4.0032
22 Nov 2011 15:15:31 1044932 13603368 hadcm3n_yfyw_1980_40_007537178_1 77,760 311,432 4.0050
17 Nov 2011 10:13:45 1044932 13603368 hadcm3n_yfyw_1980_40_007537178_1 51,840 207,573 4.0041
16 Nov 2011 02:32:08 1044932 13603368 hadcm3n_yfyw_1980_40_007537178_1 25,920 104,513 4.0321


©2024 cpdn.org