climateprediction.net home page
Task 11055368

Task 11055368

Name hadsm3dhet2_js2a_006599364_0
Workunit 6802737
Created 15 Mar 2010, 12:06:23 UTC
Sent 23 Jun 2010, 2:53:57 UTC
Report deadline 5 Jun 2011, 8:13:57 UTC
Received 24 May 2011, 3:43:16 UTC
Server state Over
Outcome Computation error
Client state Compute error
Exit status -177 (0xFFFFFF4F) ERR_RSC_LIMIT_EXCEEDED
Computer ID 1056144
Run time 94 days 22 hours 16 min 50 sec
CPU time 86 days 18 hours 24 min 13 sec
Validate state Invalid
Credit 2,580.33
Device peak FLOPS 3.00 GFLOPS
Application version UK Met Office HadSM3 Slab Model v6.07
windows_intelx86
Stderr
<core_client_version>6.10.58</core_client_version>
<![CDATA[
<message>
Maximum elapsed time exceeded
</message>
<stderr_txt>
No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5760, iMonCtr=1
Model crash detected, will try to restart...
No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
No heartbeat from core client for 30 sec - exiting
Suspended CPDN Monitor - No 'heartbeat' from BOINC...
No heartbeat from core client for 30 sec - exiting
No heartbeat from core client for 30 sec - exiting
No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
No heartbeat from core client for 30 sec - exiting
No heartbeat from core client for 30 sec - exiting
No heartbeat from core client for 30 sec - exiting
No heartbeat from core client for 30 sec - exiting
Suspended CPDN Monitor - No 'heartbeat' from BOINC...
No heartbeat from core client for 30 sec - exiting
Suspended CPDN Monitor - No 'heartbeat' from BOINC...
No heartbeat from core client for 30 sec - exiting
Suspended CPDN Monitor - No 'heartbeat' from BOINC...
No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
No heartbeat from core client for 30 sec - exiting
Suspended CPDN Monitor - No 'heartbeat' from BOINC...
No heartbeat from core client for 30 sec - exiting
No heartbeat from core client for 30 sec - exiting
No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=972, iMonCtr=1
Model crash detected, will try to rNo heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
No heartbeat from core client for 30 sec - exiting
No heartbeat from core client for 30 sec - exiting
No heartbeat from core client for 30 sec - exiting
No heartbeat from core client for 30 sec - exiting
No heartbeat from core client for 30 sec - exiting
No heartbeat from core client for 30 sec - exiting
No heartbeat from core client for 30 sec - exiting
No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
No heartbeat from core client for 30 sec - exiting
Suspended CPDN Monitor - No 'heartbeat' from BOINC...
No heartbeat from core client for 30 sec - exiting
No heartbeat from core client for 30 sec - exiting
No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
No heartbeat from core client for 30 sec - exiting
Suspended CPDN Monitor - No 'heartbeat' from BOINC...
No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
No heartbeat from core client for 30 sec - exiting
No heartbeat from core client for 30 sec - exiting
No heartbeat from core client for 30 sec - exiting
No heartbeat from core client for 30 sec - exiting
No heartbeat from core client for 30 sec - exiting
No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4592, iMonCtr=1
Model crash detected, will try to restart...
No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
CPDN Monitor - Quit request from BOINC...
No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
MainError:	02:43:00 AM	No files match the supplied pattern.
MainError:	02:43:00 AM	No files match the supplied pattern.
MainError:	02:43:00 AM	No files match the supplied pattern.
MainError:	02:43:00 AM	No files match the supplied pattern.
MainError:	02:43:00 AM	No files match the supplied pattern.
MainError:	02:43:00 AM	No files match the supplied pattern.
MainError:	02:43:00 AM	No files match the supplied pattern.
MainError:	02:43:00 AM	No files match the supplied pattern.
MainError:	02:43:00 AM	No files match the supplied pattern.
MainError:	02:43:00 AM	No files match the supplied pattern.
MainError:	02:43:00 AM	No files match the supplied pattern.
MainError:	02:43:00 AM	No files match the supplied pattern.
MainError:	02:43:00 AM	No files match the supplied pattern.
MainError:	02:43:00 AM	No files match the supplied pattern.
MainError:	02:43:00 AM	No files match the supplied pattern.
MainError:	02:43:00 AM	No files match the supplied pattern.
MainError:	02:43:00 AM	No files match the supplied pattern.
MainError:	02:43:00 AM	No files match the supplied pattern.
MainError:	02:43:00 AM	No files match the supplied pattern.
MainError:	02:43:00 AM	No files match the supplied pattern.
MainError:	02:43:00 AM	No files match the supplied pattern.
MainError:	02:43:00 AM	No files match the supplied pattern.
MainError:	02:43:00 AM	No files match the supplied pattern.
MainError:	02:43:00 AM	No files match the supplied pattern.
MainError:	02:43:00 AM	No files match the supplied pattern.
MainError:	02:43:00 AM	No files match the supplied pattern.
MainError:	02:43:00 AM	No files match the supplied pattern.
MainError:	02:43:00 AM	No files match the supplied pattern.
MainError:	02:43:00 AM	No files match the supplied pattern.
MainError:	02:43:00 AM	No files match the supplied pattern.
MainError:	02:43:00 AM	No files match the supplied pattern.
MainError:	02:43:00 AM	No files match the supplied pattern.
MainError:	02:43:00 AM	No files match the supplied pattern.
MainError:	02:43:00 AM	No files match the supplied pattern.
MainError:	02:43:00 AM	No files match the supplied pattern.
MainError:	02:43:00 AM	No files match the supplied pattern.
MainError:	02:43:00 AM	No files match the supplied pattern.
MainError:	02:43:00 AM	No files match the supplied pattern.
MainError:	02:43:00 AM	No files match the supplied pattern.
MainError:	02:43:00 AM	No files match the supplied pattern.
MainError:	02:43:00 AM	No files match the supplied pattern.
MainError:	02:43:00 AM	No files match the supplied pattern.
MainError:	02:43:00 AM	No files match the supplied pattern.
MainError:	02:43:00 AM	No files match the supplied pattern.
MainError:	02:43:00 AM	No files match the supplied pattern.
MainError:	02:43:00 AM	No files match the supplied pattern.
MainError:	02:43:00 AM	No files match the supplied pattern.
MainError:	02:43:00 AM	No files match the supplied pattern.
MainError:	02:43:00 AM	No files match the supplied pattern.
MainError:	02:43:00 AM	No files match the supplied pattern.
MainError:	02:43:00 AM	No files match the supplied pattern.
MainError:	02:43:00 AM	No files match the supplied pattern.
MainError:	02:43:00 AM	No files match the supplied pattern.
MainError:	02:43:00 AM	No files match the supplied pattern.
MainError:	02:43:00 AM	No files match the supplied pattern.
MainError:	02:43:00 AM	No files match the supplied pattern.
CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4248, iMonCtr=1
Model crash detected, will try to restart...
No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
CPDN Monitor - Abort request from BOINC...
called boinc_finish

</stderr_txt>
]]>
Latest Trickles Received
Time Sent (UTC) Host ID Result ID Result Name Timestep CPU Time (sec) Average (sec/TS)
22 May 2011 14:47:38 1056144 11055368 hadsm3dhet2_js2a_006599364_0 21,604 7,454,591 26.5428
14 May 2011 07:48:38 1056144 11055368 hadsm3dhet2_js2a_006599364_0 10,802 6,853,004 25.3768
06 May 2011 02:45:00 1056144 11055368 hadsm3dhet2_js2a_006599364_0 259,248 6,255,273 24.1285
28 Apr 2011 15:28:27 1056144 11055368 hadsm3dhet2_js2a_006599364_0 248,446 5,668,874 22.8173
20 Apr 2011 19:52:49 1056144 11055368 hadsm3dhet2_js2a_006599364_0 237,644 5,082,327 21.3863
09 Apr 2011 04:34:47 1056144 11055368 hadsm3dhet2_js2a_006599364_0 226,842 4,488,850 19.7884
30 Mar 2011 08:07:20 1056144 11055368 hadsm3dhet2_js2a_006599364_0 216,040 3,898,167 18.0437
08 Mar 2011 12:07:19 1056144 11055368 hadsm3dhet2_js2a_006599364_0 205,238 3,305,845 16.1074
15 Feb 2011 19:55:09 1056144 11055368 hadsm3dhet2_js2a_006599364_0 194,436 2,713,614 13.9563
24 Jan 2011 04:02:11 1056144 11055368 hadsm3dhet2_js2a_006599364_0 183,634 2,119,620 11.5426
19 Dec 2010 13:50:55 1056144 11055368 hadsm3dhet2_js2a_006599364_0 172,832 1,523,544 8.8152
17 Nov 2010 00:33:13 1056144 11055368 hadsm3dhet2_js2a_006599364_0 162,030 930,195 5.7409
07 Sep 2010 14:12:53 1056144 11055368 hadsm3dhet2_js2a_006599364_0 151,228 339,273 2.2435
26 Jul 2010 15:26:52 1056144 11055368 hadsm3dhet2_js2a_006599364_0 140,426 147,925 1.0534
26 Jul 2010 03:14:18 1056144 11055368 hadsm3dhet2_js2a_006599364_0 129,624 136,558 1.0535
18 Jul 2010 13:45:29 1056144 11055368 hadsm3dhet2_js2a_006599364_0 118,822 125,333 1.0548
16 Jul 2010 19:13:32 1056144 11055368 hadsm3dhet2_js2a_006599364_0 108,020 114,673 1.0616
14 Jul 2010 21:00:56 1056144 11055368 hadsm3dhet2_js2a_006599364_0 97,218 103,358 1.0632
12 Jul 2010 02:13:37 1056144 11055368 hadsm3dhet2_js2a_006599364_0 86,416 91,526 1.0591
09 Jul 2010 19:50:31 1056144 11055368 hadsm3dhet2_js2a_006599364_0 75,614 80,231 1.0611


©2024 cpdn.org