climateprediction.net home page
Task 10951878

Task 10951878

Name hadsm3dhet2_jk2t_006589015_2
Workunit 6792388
Created 15 Mar 2010, 11:51:17 UTC
Sent 25 Oct 2010, 7:23:25 UTC
Report deadline 7 Oct 2011, 12:43:25 UTC
Received 10 Feb 2011, 4:11:16 UTC
Server state Over
Outcome Computation error
Client state Compute error
Exit status -226 (0xFFFFFF1E) ERR_TOO_MANY_EXITS
Computer ID 1109371
Run time 12 days 21 hours 10 min 31 sec
CPU time 11 days 0 hours 46 min 3 sec
Validate state Invalid
Credit 6,153.09
Device peak FLOPS 2.14 GFLOPS
Application version UK Met Office HadSM3 Slab Model v6.07
windows_intelx86
Stderr
<core_client_version>6.12.4</core_client_version>
<![CDATA[
<message>
too many exit(0)s
</message>
<stderr_txt>
No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
MainError:	11:11:00 AM	No files match the supplied pattern.
MainError:	11:11:00 AM	No files match the supplied pattern.
MainError:	11:11:00 AM	No files match the supplied pattern.
MainError:	11:11:00 AM	No files match the supplied pattern.
MainError:	11:11:00 AM	No files match the supplied pattern.
MainError:	11:11:00 AM	No files match the supplied pattern.
MainError:	11:11:00 AM	No files match the supplied pattern.
MainError:	11:11:00 AM	No files match the supplied pattern.
MainError:	11:11:00 AM	No files match the supplied pattern.
MainError:	11:11:00 AM	No files match the supplied pattern.
MainError:	11:11:00 AM	No files match the supplied pattern.
MainError:	11:11:00 AM	No files match the supplied pattern.
MainError:	11:11:00 AM	No files match the supplied pattern.
MainError:	11:11:00 AM	No files match the supplied pattern.
MainError:	11:11:00 AM	No files match the supplied pattern.
MainError:	11:11:00 AM	No files match the supplied pattern.
MainError:	11:11:00 AM	No files match the supplied pattern.
MainError:	11:11:00 AM	No files match the supplied pattern.
MainError:	11:11:00 AM	No files match the supplied pattern.
MainError:	11:11:00 AM	No files match the supplied pattern.
MainError:	11:11:00 AM	No files match the supplied pattern.
MainError:	11:11:00 AM	No files match the supplied pattern.
MainError:	11:11:00 AM	No files match the supplied pattern.
MainError:	11:11:00 AM	No files match the supplied pattern.
MainError:	11:11:00 AM	No files match the supplied pattern.
MainError:	11:11:00 AM	No files match the supplied pattern.
MainError:	11:11:00 AM	No files match the supplied pattern.
MainError:	11:11:00 AM	No files match the supplied pattern.
MainError:	11:11:00 AM	No files match the supplied pattern.
MainError:	11:11:00 AM	No files match the supplied pattern.
MainError:	11:11:00 AM	No files match the supplied pattern.
MainError:	11:11:00 AM	No files match the supplied pattern.
MainError:	11:11:00 AM	No files match the supplied pattern.
MainError:	11:11:00 AM	No files match the supplied pattern.
MainError:	11:11:00 AM	No files match the supplied pattern.
MainError:	11:11:00 AM	No files match the supplied pattern.
MainError:	11:11:00 AM	No files match the supplied pattern.
MainError:	11:11:00 AM	No files match the supplied pattern.
MainError:	11:11:00 AM	No files match the supplied pattern.
MainError:	11:11:00 AM	No files match the supplied pattern.
MainError:	11:11:00 AM	No files match the supplied pattern.
MainError:	11:11:00 AM	No files match the supplied pattern.
MainError:	11:11:00 AM	No files match the supplied pattern.
MainError:	11:11:00 AM	No files match the supplied pattern.
MainError:	11:11:00 AM	No files match the supplied pattern.
MainError:	11:11:00 AM	No files match the supplied pattern.
MainError:	11:11:00 AM	No files match the supplied pattern.
MainError:	11:11:00 AM	No files match the supplied pattern.
MainError:	11:11:00 AM	No files match the supplied pattern.
MainError:	11:11:00 AM	No files match the supplied pattern.
MainError:	11:11:00 AM	No files match the supplied pattern.
MainError:	11:11:00 AM	No files match the supplied pattern.
MainError:	11:11:00 AM	No files match the supplied pattern.
MainError:	11:11:00 AM	No files match the supplied pattern.
MainError:	11:11:00 AM	No files match the supplied pattern.
MainError:	11:11:00 AM	No files match the supplied pattern.
No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
No heartbeat from core client for 30 sec - exiting
Suspended CPDN Monitor - No 'heartbeat' from BOINC...
No heartbeat from core client for 30 sec - exiting
No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
No heartbeat from core client for 30 sec - exiting
No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
MainError:	11:01:58 AM	No files match the supplied pattern.
MainError:	11:01:58 AM	No files match the supplied pattern.
MainError:	11:01:58 AM	No files match the supplied pattern.
MainError:	11:01:58 AM	No files match the supplied pattern.
MainError:	11:01:58 AM	No files match the supplied pattern.
MainError:	11:01:58 AM	No files match the supplied pattern.
MainError:	11:01:58 AM	No files match the supplied pattern.
MainError:	11:01:58 AM	No files match the supplied pattern.
MainError:	11:01:58 AM	No files match the supplied pattern.
MainError:	11:01:58 AM	No files match the supplied pattern.
MainError:	11:01:58 AM	No files match the supplied pattern.
MainError:	11:01:58 AM	No files match the supplied pattern.
MainError:	11:01:58 AM	No files match the supplied pattern.
MainError:	11:01:58 AM	No files match the supplied pattern.
MainError:	11:01:58 AM	No files match the supplied pattern.
MainError:	11:01:58 AM	No files match the supplied pattern.
MainError:	11:01:58 AM	No files match the supplied pattern.
MainError:	11:01:58 AM	No files match the supplied pattern.
MainError:	11:01:58 AM	No files match the supplied pattern.
MainError:	11:01:58 AM	No files match the supplied pattern.
MainError:	11:01:58 AM	No files match the supplied pattern.
MainError:	11:01:58 AM	No files match the supplied pattern.
MainError:	11:01:58 AM	No files match the supplied pattern.
MainError:	11:01:58 AM	No files match the supplied pattern.
MainError:	11:01:58 AM	No files match the supplied pattern.
MainError:	11:01:58 AM	No files match the supplied pattern.
MainError:	11:01:58 AM	No files match the supplied pattern.
MainError:	11:01:58 AM	No files match the supplied pattern.
MainError:	11:01:58 AM	No files match the supplied pattern.
MainError:	11:01:58 AM	No files match the supplied pattern.
MainError:	11:01:58 AM	No files match the supplied pattern.
MainError:	11:01:58 AM	No files match the supplied pattern.
MainError:	11:01:58 AM	No files match the supplied pattern.
MainError:	11:01:58 AM	No files match the supplied pattern.
MainError:	11:01:58 AM	No files match the supplied pattern.
MainError:	11:01:58 AM	No files match the supplied pattern.
MainError:	11:01:58 AM	No files match the supplied pattern.
MainError:	11:01:58 AM	No files match the supplied pattern.
MainError:	11:01:58 AM	No files match the supplied pattern.
MainError:	11:01:58 AM	No files match the supplied pattern.
MainError:	11:01:58 AM	No files match the supplied pattern.
MainError:	11:01:58 AM	No files match the supplied pattern.
MainError:	11:01:58 AM	No files match the supplied pattern.
MainError:	11:01:58 AM	No files match the supplied pattern.
MainError:	11:01:58 AM	No files match the supplied pattern.
MainError:	11:01:58 AM	No files match the supplied pattern.
MainError:	11:01:58 AM	No files match the supplied pattern.
MainError:	11:01:58 AM	No files match the supplied pattern.
MainError:	11:01:58 AM	No files match the supplied pattern.
MainError:	11:01:58 AM	No files match the supplied pattern.
MainError:	11:01:58 AM	No files match the supplied pattern.
MainError:	11:01:58 AM	No files match the supplied pattern.
MainError:	11:01:58 AM	No files match the supplied pattern.
MainError:	11:01:58 AM	No files match the supplied pattern.
MainError:	11:01:58 AM	No files match the supplied pattern.
MainError:	11:01:58 AM	No files match the supplied pattern.
CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4472, iMonCtr=1
Model crash detected, will try to restart...
CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4472, iMonCtr=1
Model crash detected, will try to restart...
CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4484, iMonCtr=1
Model crash detected, will try to restart...
CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4484, iMonCtr=1
Model crash detected, will try to restart...
CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4484, iMonCtr=1
Model crash detected, will try to restart...
CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4484, iMonCtr=1
Model crash detected, will try to restart...
CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4484, iMonCtr=1
Model crash detected, will try to restart...
No heartbeat from core client for 30 sec - exiting
Suspended CPDN Monitor - No 'heartbeat' from BOINC...
No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3712, iMonCtr=1
Model crash detected, will try to restart...
No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=2508, iMonCtr=1
Model crash detected, will try to restart...
No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4116, iMonCtr=1
Model crash detected, will try to restart...
No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5028, iMonCtr=1
Model crash detected, will try to restart...
No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4752, iMonCtr=1
Model crash detected, will try to restart...
No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...

</stderr_txt>
]]>
Latest Trickles Received
Time Sent (UTC) Host ID Result ID Result Name Timestep CPU Time (sec) Average (sec/TS)
08 Feb 2011 15:15:58 1109371 10951878 hadsm3dhet2_jk2t_006589015_2 151,228 940,474 1.4043
08 Feb 2011 09:42:15 1109371 10951878 hadsm3dhet2_jk2t_006589015_2 140,426 925,613 1.4047
08 Feb 2011 04:59:24 1109371 10951878 hadsm3dhet2_jk2t_006589015_2 129,624 910,749 1.4052
08 Feb 2011 00:37:34 1109371 10951878 hadsm3dhet2_jk2t_006589015_2 118,822 895,958 1.4058
07 Feb 2011 19:55:06 1109371 10951878 hadsm3dhet2_jk2t_006589015_2 108,020 880,545 1.4055
07 Feb 2011 15:13:27 1109371 10951878 hadsm3dhet2_jk2t_006589015_2 97,218 865,180 1.4052
07 Feb 2011 12:38:33 1109371 10951878 hadsm3dhet2_jk2t_006589015_2 86,416 849,947 1.4051
04 Feb 2011 10:00:07 1109371 10951878 hadsm3dhet2_jk2t_006589015_2 75,614 835,125 1.4057
01 Feb 2011 18:29:38 1109371 10951878 hadsm3dhet2_jk2t_006589015_2 64,812 820,560 1.4067
01 Feb 2011 13:32:47 1109371 10951878 hadsm3dhet2_jk2t_006589015_2 54,010 805,629 1.4072
30 Jan 2011 16:10:55 1109371 10951878 hadsm3dhet2_jk2t_006589015_2 43,208 790,686 1.4077
30 Jan 2011 04:53:00 1109371 10951878 hadsm3dhet2_jk2t_006589015_2 32,406 775,354 1.4074
29 Jan 2011 18:28:09 1109371 10951878 hadsm3dhet2_jk2t_006589015_2 21,604 760,433 1.4079
27 Jan 2011 10:46:50 1109371 10951878 hadsm3dhet2_jk2t_006589015_2 10,802 745,145 1.4078
26 Jan 2011 11:04:46 1109371 10951878 hadsm3dhet2_jk2t_006589015_2 259,248 729,766 1.4075
26 Jan 2011 05:01:54 1109371 10951878 hadsm3dhet2_jk2t_006589015_2 248,446 714,711 1.4078
25 Jan 2011 11:24:37 1109371 10951878 hadsm3dhet2_jk2t_006589015_2 237,644 700,129 1.4090
24 Jan 2011 20:14:42 1109371 10951878 hadsm3dhet2_jk2t_006589015_2 226,842 684,770 1.4087
24 Jan 2011 15:11:30 1109371 10951878 hadsm3dhet2_jk2t_006589015_2 216,040 669,076 1.4077
24 Jan 2011 10:37:05 1109371 10951878 hadsm3dhet2_jk2t_006589015_2 205,238 653,769 1.4075


©2024 climateprediction.net