Name | hadsm3dhet2_js2a_006599364_0 |
Workunit | 6802737 |
Created | 15 Mar 2010, 12:06:23 UTC |
Sent | 23 Jun 2010, 2:53:57 UTC |
Report deadline | 5 Jun 2011, 8:13:57 UTC |
Received | 24 May 2011, 3:43:16 UTC |
Server state | Over |
Outcome | Computation error |
Client state | Compute error |
Exit status | -177 (0xFFFFFF4F) ERR_RSC_LIMIT_EXCEEDED |
Computer ID | 1056144 |
Run time | 94 days 22 hours 16 min 50 sec |
CPU time | 86 days 18 hours 24 min 13 sec |
Validate state | Invalid |
Credit | 2,580.33 |
Device peak FLOPS | 3.00 GFLOPS |
Application version | UK Met Office HadSM3 Slab Model v6.07 windows_intelx86 |
Stderr | <core_client_version>6.10.58</core_client_version> <![CDATA[ <message> Maximum elapsed time exceeded </message> <stderr_txt> No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5760, iMonCtr=1 Model crash detected, will try to restart... No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... No heartbeat from core client for 30 sec - exiting Suspended CPDN Monitor - No 'heartbeat' from BOINC... No heartbeat from core client for 30 sec - exiting No heartbeat from core client for 30 sec - exiting No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... No heartbeat from core client for 30 sec - exiting No heartbeat from core client for 30 sec - exiting No heartbeat from core client for 30 sec - exiting No heartbeat from core client for 30 sec - exiting Suspended CPDN Monitor - No 'heartbeat' from BOINC... No heartbeat from core client for 30 sec - exiting Suspended CPDN Monitor - No 'heartbeat' from BOINC... No heartbeat from core client for 30 sec - exiting Suspended CPDN Monitor - No 'heartbeat' from BOINC... No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... No heartbeat from core client for 30 sec - exiting Suspended CPDN Monitor - No 'heartbeat' from BOINC... No heartbeat from core client for 30 sec - exiting No heartbeat from core client for 30 sec - exiting No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 
No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=972, iMonCtr=1 Model crash detected, will try to rNo heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... No heartbeat from core client for 30 sec - exiting No heartbeat from core client for 30 sec - exiting No heartbeat from core client for 30 sec - exiting No heartbeat from core client for 30 sec - exiting No heartbeat from core client for 30 sec - exiting No heartbeat from core client for 30 sec - exiting No heartbeat from core client for 30 sec - exiting No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 
No heartbeat from core client for 30 sec - exiting Suspended CPDN Monitor - No 'heartbeat' from BOINC... No heartbeat from core client for 30 sec - exiting No heartbeat from core client for 30 sec - exiting No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... No heartbeat from core client for 30 sec - exiting Suspended CPDN Monitor - No 'heartbeat' from BOINC... No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... No heartbeat from core client for 30 sec - exiting No heartbeat from core client for 30 sec - exiting No heartbeat from core client for 30 sec - exiting No heartbeat from core client for 30 sec - exiting No heartbeat from core client for 30 sec - exiting No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4592, iMonCtr=1 Model crash detected, will try to restart... No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... CPDN Monitor - Quit request from BOINC... No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 
No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... MainError: 02:43:00 AM No files match the supplied pattern. MainError: 02:43:00 AM No files match the supplied pattern. MainError: 02:43:00 AM No files match the supplied pattern. MainError: 02:43:00 AM No files match the supplied pattern. MainError: 02:43:00 AM No files match the supplied pattern. MainError: 02:43:00 AM No files match the supplied pattern. MainError: 02:43:00 AM No files match the supplied pattern. MainError: 02:43:00 AM No files match the supplied pattern. MainError: 02:43:00 AM No files match the supplied pattern. MainError: 02:43:00 AM No files match the supplied pattern. MainError: 02:43:00 AM No files match the supplied pattern. MainError: 02:43:00 AM No files match the supplied pattern. MainError: 02:43:00 AM No files match the supplied pattern. MainError: 02:43:00 AM No files match the supplied pattern. MainError: 02:43:00 AM No files match the supplied pattern. MainError: 02:43:00 AM No files match the supplied pattern. MainError: 02:43:00 AM No files match the supplied pattern. MainError: 02:43:00 AM No files match the supplied pattern. MainError: 02:43:00 AM No files match the supplied pattern. MainError: 02:43:00 AM No files match the supplied pattern. MainError: 02:43:00 AM No files match the supplied pattern. MainError: 02:43:00 AM No files match the supplied pattern. MainError: 02:43:00 AM No files match the supplied pattern. MainError: 02:43:00 AM No files match the supplied pattern. MainError: 02:43:00 AM No files match the supplied pattern. MainError: 02:43:00 AM No files match the supplied pattern. MainError: 02:43:00 AM No files match the supplied pattern. MainError: 02:43:00 AM No files match the supplied pattern. 
MainError: 02:43:00 AM No files match the supplied pattern. MainError: 02:43:00 AM No files match the supplied pattern. MainError: 02:43:00 AM No files match the supplied pattern. MainError: 02:43:00 AM No files match the supplied pattern. MainError: 02:43:00 AM No files match the supplied pattern. MainError: 02:43:00 AM No files match the supplied pattern. MainError: 02:43:00 AM No files match the supplied pattern. MainError: 02:43:00 AM No files match the supplied pattern. MainError: 02:43:00 AM No files match the supplied pattern. MainError: 02:43:00 AM No files match the supplied pattern. MainError: 02:43:00 AM No files match the supplied pattern. MainError: 02:43:00 AM No files match the supplied pattern. MainError: 02:43:00 AM No files match the supplied pattern. MainError: 02:43:00 AM No files match the supplied pattern. MainError: 02:43:00 AM No files match the supplied pattern. MainError: 02:43:00 AM No files match the supplied pattern. MainError: 02:43:00 AM No files match the supplied pattern. MainError: 02:43:00 AM No files match the supplied pattern. MainError: 02:43:00 AM No files match the supplied pattern. MainError: 02:43:00 AM No files match the supplied pattern. MainError: 02:43:00 AM No files match the supplied pattern. MainError: 02:43:00 AM No files match the supplied pattern. MainError: 02:43:00 AM No files match the supplied pattern. MainError: 02:43:00 AM No files match the supplied pattern. MainError: 02:43:00 AM No files match the supplied pattern. MainError: 02:43:00 AM No files match the supplied pattern. MainError: 02:43:00 AM No files match the supplied pattern. MainError: 02:43:00 AM No files match the supplied pattern. CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4248, iMonCtr=1 Model crash detected, will try to restart... No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... CPDN Monitor - Abort request from BOINC... called boinc_finish </stderr_txt> ]]> |
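
The exit status above is shown both as a signed decimal (-177) and as hex (0xFFFFFF4F). The two forms agree if the value is read as a 32-bit two's-complement integer. The sketch below checks that relationship. The helper names are hypothetical and are not part of BOINC or the CPDN application.

```python
# Minimal sketch: relate a signed exit status to its 32-bit hex form.
# Helper names are illustrative only; they do not come from BOINC.

def to_unsigned32_hex(status: int) -> str:
    """Format a signed 32-bit status as unsigned hex (two's complement)."""
    return f"0x{status & 0xFFFFFFFF:08X}"


def from_unsigned32_hex(text: str) -> int:
    """Interpret a 32-bit hex string as a signed integer."""
    value = int(text, 16)
    # Values with the top bit set represent negative numbers.
    return value - (1 << 32) if value >= (1 << 31) else value


print(to_unsigned32_hex(-177))
print(from_unsigned32_hex("0xFFFFFF4F"))
```

Masking with `0xFFFFFFFF` keeps only the low 32 bits, which is why the negative status appears as a large positive hex value on the result page.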
Latest Trickles Received

Time Sent (UTC) | Host ID | Result ID | Result Name | Timestep | CPU Time (sec) | Average (sec/TS) |
---|---|---|---|---|---|---|
22 May 2011 14:47:38 | 1056144 | 11055368 | hadsm3dhet2_js2a_006599364_0 | 21,604 | 7,454,591 | 26.5428 |
14 May 2011 07:48:38 | 1056144 | 11055368 | hadsm3dhet2_js2a_006599364_0 | 10,802 | 6,853,004 | 25.3768 |
06 May 2011 02:45:00 | 1056144 | 11055368 | hadsm3dhet2_js2a_006599364_0 | 259,248 | 6,255,273 | 24.1285 |
28 Apr 2011 15:28:27 | 1056144 | 11055368 | hadsm3dhet2_js2a_006599364_0 | 248,446 | 5,668,874 | 22.8173 |
20 Apr 2011 19:52:49 | 1056144 | 11055368 | hadsm3dhet2_js2a_006599364_0 | 237,644 | 5,082,327 | 21.3863 |
09 Apr 2011 04:34:47 | 1056144 | 11055368 | hadsm3dhet2_js2a_006599364_0 | 226,842 | 4,488,850 | 19.7884 |
30 Mar 2011 08:07:20 | 1056144 | 11055368 | hadsm3dhet2_js2a_006599364_0 | 216,040 | 3,898,167 | 18.0437 |
08 Mar 2011 12:07:19 | 1056144 | 11055368 | hadsm3dhet2_js2a_006599364_0 | 205,238 | 3,305,845 | 16.1074 |
15 Feb 2011 19:55:09 | 1056144 | 11055368 | hadsm3dhet2_js2a_006599364_0 | 194,436 | 2,713,614 | 13.9563 |
24 Jan 2011 04:02:11 | 1056144 | 11055368 | hadsm3dhet2_js2a_006599364_0 | 183,634 | 2,119,620 | 11.5426 |
19 Dec 2010 13:50:55 | 1056144 | 11055368 | hadsm3dhet2_js2a_006599364_0 | 172,832 | 1,523,544 | 8.8152 |
17 Nov 2010 00:33:13 | 1056144 | 11055368 | hadsm3dhet2_js2a_006599364_0 | 162,030 | 930,195 | 5.7409 |
07 Sep 2010 14:12:53 | 1056144 | 11055368 | hadsm3dhet2_js2a_006599364_0 | 151,228 | 339,273 | 2.2435 |
26 Jul 2010 15:26:52 | 1056144 | 11055368 | hadsm3dhet2_js2a_006599364_0 | 140,426 | 147,925 | 1.0534 |
26 Jul 2010 03:14:18 | 1056144 | 11055368 | hadsm3dhet2_js2a_006599364_0 | 129,624 | 136,558 | 1.0535 |
18 Jul 2010 13:45:29 | 1056144 | 11055368 | hadsm3dhet2_js2a_006599364_0 | 118,822 | 125,333 | 1.0548 |
16 Jul 2010 19:13:32 | 1056144 | 11055368 | hadsm3dhet2_js2a_006599364_0 | 108,020 | 114,673 | 1.0616 |
14 Jul 2010 21:00:56 | 1056144 | 11055368 | hadsm3dhet2_js2a_006599364_0 | 97,218 | 103,358 | 1.0632 |
12 Jul 2010 02:13:37 | 1056144 | 11055368 | hadsm3dhet2_js2a_006599364_0 | 86,416 | 91,526 | 1.0591 |
09 Jul 2010 19:50:31 | 1056144 | 11055368 | hadsm3dhet2_js2a_006599364_0 | 75,614 | 80,231 | 1.0611 |
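
The Average (sec/TS) column can be reproduced from the other two numeric columns. The sketch below assumes that the average is cumulative CPU time divided by cumulative timesteps, and that the timestep counter restarts after 259,248 (the largest value in the table), so the two most recent trickles count 259,248 earlier timesteps in addition to their own. Both points are inferred from the figures in the table, not stated by the source. The constant and function names are hypothetical.

```python
# Minimal sketch: recompute "Average (sec/TS)" from trickle rows.
# Assumption (inferred from the table, not documented by the source):
# the timestep counter restarts after 259,248 and the average is taken
# over all timesteps completed so far.

PHASE_LENGTH = 259_248  # largest timestep value seen in the table


def average_sec_per_ts(cpu_seconds: int, timestep: int,
                       completed_phases: int = 0) -> float:
    """Return CPU seconds per timestep over all timesteps so far."""
    total_timesteps = completed_phases * PHASE_LENGTH + timestep
    return cpu_seconds / total_timesteps


# (timestep, CPU time in sec, completed phases, reported average)
rows = [
    (21_604, 7_454_591, 1, 26.5428),
    (10_802, 6_853_004, 1, 25.3768),
    (259_248, 6_255_273, 0, 24.1285),
    (75_614, 80_231, 0, 1.0611),
]

for timestep, cpu, phases, reported in rows:
    computed = average_sec_per_ts(cpu, timestep, phases)
    print(f"{timestep:>7} {computed:.4f} (reported {reported})")
```

Under these assumptions the recomputed values match the reported column to four decimal places for the rows checked. The rise from about 1.06 to over 26 sec/TS therefore reflects much more CPU time being spent per timestep from September 2010 onward.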
©2024 cpdn.org