Name | hadsm3dhet2_u9ir_006726914_8 |
Workunit | 6930257 |
Created | 17 Sep 2010, 8:10:10 UTC |
Sent | 17 Sep 2010, 13:48:34 UTC |
Report deadline | 30 Aug 2011, 19:08:34 UTC |
Received | 21 Sep 2011, 14:26:08 UTC |
Server state | Over |
Outcome | Computation error |
Client state | Aborted by user |
Exit status | -197 (0xFFFFFF3B) ERR_ABORTED_VIA_GUI |
Computer ID | 1083596 |
Run time | 11 days 7 hours 34 min 55 sec |
CPU time | 8 days 21 hours 8 min 32 sec |
Validate state | Invalid |
Credit | 2,778.81 |
Device peak FLOPS | 1.67 GFLOPS |
Application version | UK Met Office HadSM3 Slab Model v6.07 windows_intelx86 |
Stderr | <core_client_version>6.10.18</core_client_version> <![CDATA[ <message> aborted by user </message> <stderr_txt> CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=2376, iMonCtr=1 Model crash detected, will try to restart... CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=2432, iMonCtr=1 Model crash detected, will try to restart... CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5540, iMonCtr=1 Model crash detected, will try to restart... CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3432, iMonCtr=1 Model crash detected, will try to restart... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4932, iMonCtr=1 Model crash detected, will try to restart... CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4028, iMonCtr=1 Model crash detected, will try to restart... CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4580, iMonCtr=1 Model crash detected, will try to restart... No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=2328, iMonCtr=1 Model crash detected, will try to restart... CCPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4216, iMonCtr=1 Model crash detected, will try to restart... CPDN Monitor - Quit request from BOINC... CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=2128, iMonCtr=1 Model crash detected, will try to restart... 
CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=2484, iMonCtr=1 Model crash detected, will try to restart... CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=996, iMonCtr=1 Model crash detected, will try to restart... CPDN Monitor - Quit request from BOINC... CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4148, iMonCtr=1 Model crash detected, will try to restart... CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4504, iMonCtr=1 Model crash detected, will try to restart... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... No heartbeat from core client for 30 sec - exiting CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=2332, iMonCtr=1 Model crash detected, will try to restart... CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4736, iMonCtr=1 Model crash detected, will try to restart... CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3900, iMonCtr=1 Model crash detected, will try to restart... CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=2584, iMonCtr=1 Model crash detected, will try to restart... No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... No heartbeat from core client for 30 sec - exiting No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... No heartbeat from core client for 30 sec - exiting No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 
No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=1108, iMonCtr=1 Model crash detected, will try to restart... No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4212, iMonCtr=1 Model crash detected, will try to restart... CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5340, iMonCtr=1 Model crash detected, will try to restart... CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5304, iMonCtr=1 Model crash detected, will try to restart... CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4504, iMonCtr=1 Model crash detected, will try to restart... No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4112, iMonCtr=1 Model crash detected, will try to restart... CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4984, iMonCtr=1 Model crash detected, will try to restart... MainError: 12:02:37 AM No files match the supplied pattern. MainError: 12:02:37 AM No files match the supplied pattern. MainError: 12:02:37 AM No files match the supplied pattern. MainError: 12:02:37 AM No files match the supplied pattern. MainError: 12:02:37 AM No files match the supplied pattern. MainError: 12:02:37 AM No files match the supplied pattern. MainError: 12:02:37 AM No files match the supplied pattern. MainError: 12:02:37 AM No files match the supplied pattern. MainError: 12:02:37 AM No files match the supplied pattern. MainError: 12:02:37 AM No files match the supplied pattern. MainError: 12:02:37 AM No files match the supplied pattern. MainError: 12:02:37 AM No files match the supplied pattern. MainError: 12:02:37 AM No files match the supplied pattern. 
MainError: 12:02:37 AM No files match the supplied pattern. MainError: 12:02:37 AM No files match the supplied pattern. MainError: 12:02:37 AM No files match the supplied pattern. MainError: 12:02:37 AM No files match the supplied pattern. MainError: 12:02:37 AM No files match the supplied pattern. MainError: 12:02:37 AM No files match the supplied pattern. MainError: 12:02:37 AM No files match the supplied pattern. MainError: 12:02:37 AM No files match the supplied pattern. MainError: 12:02:37 AM No files match the supplied pattern. MainError: 12:02:37 AM No files match the supplied pattern. MainError: 12:02:37 AM No files match the supplied pattern. MainError: 12:02:37 AM No files match the supplied pattern. MainError: 12:02:37 AM No files match the supplied pattern. MainError: 12:02:37 AM No files match the supplied pattern. MainError: 12:02:37 AM No files match the supplied pattern. MainError: 12:02:37 AM No files match the supplied pattern. MainError: 12:02:37 AM No files match the supplied pattern. MainError: 12:02:37 AM No files match the supplied pattern. MainError: 12:02:37 AM No files match the supplied pattern. MainError: 12:02:37 AM No files match the supplied pattern. MainError: 12:02:37 AM No files match the supplied pattern. MainError: 12:02:37 AM No files match the supplied pattern. MainError: 12:02:37 AM No files match the supplied pattern. MainError: 12:02:37 AM No files match the supplied pattern. MainError: 12:02:37 AM No files match the supplied pattern. MainError: 12:02:37 AM No files match the supplied pattern. MainError: 12:02:37 AM No files match the supplied pattern. MainError: 12:02:37 AM No files match the supplied pattern. MainError: 12:02:37 AM No files match the supplied pattern. MainError: 12:02:37 AM No files match the supplied pattern. MainError: 12:02:37 AM No files match the supplied pattern. MainError: 12:02:37 AM No files match the supplied pattern. MainError: 12:02:37 AM No files match the supplied pattern. 
MainError: 12:02:37 AM No files match the supplied pattern. MainError: 12:02:37 AM No files match the supplied pattern. MainError: 12:02:37 AM No files match the supplied pattern. MainError: 12:02:37 AM No files match the supplied pattern. MainError: 12:02:37 AM No files match the supplied pattern. MainError: 12:02:37 AM No files match the supplied pattern. MainError: 12:02:37 AM No files match the supplied pattern. MainError: 12:02:37 AM No files match the supplied pattern. MainError: 12:02:37 AM No files match the supplied pattern. MainError: 12:02:37 AM No files match the supplied pattern. CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3060, iMonCtr=1 Model crash detected, will try to restart... CCPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=264, iMonCtr=1 Model crash detected, will try to restart... CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5020, iMonCtr=1 Model crash detected, will try to restart... CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=2440, iMonCtr=1 Model crash detected, will try to restart... CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5348, iMonCtr=1 Model crash detected, will try to restart... CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3904, iMonCtr=1 Model crash detected, will try to restart... CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4164, iMonCtr=1 Model crash detected, will try to restart... CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5204, iMonCtr=1 Model crash detected, will try to restart... CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3088, iMonCtr=1 Model crash detected, will try to restart... CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4544, iMonCtr=1 Model crash detected, will try to restart... 
CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5484, iMonCtr=1 Model crash detected, will try to restart... CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4136, iMonCtr=1 Model crash detected, will try to restart... CPDN Monitor - Abort request from BOINC... called boinc_finish </stderr_txt> ]]> |
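The Exit status row above shows the same code twice: -197 as a signed integer and 0xFFFFFF3B as hexadecimal. The two agree if the hex form is read as a 32-bit two's-complement value. The sketch below illustrates that conversion; the helper names `to_unsigned32` and `to_signed32` are hypothetical and are not part of BOINC.

```python
def to_unsigned32(value: int) -> int:
    """Reinterpret a signed integer as an unsigned 32-bit value."""
    return value & 0xFFFFFFFF


def to_signed32(value: int) -> int:
    """Reinterpret an unsigned 32-bit value as a signed integer."""
    # If the top bit is set, the value is negative in two's complement.
    return value - 0x1_0000_0000 if value & 0x8000_0000 else value


print(hex(to_unsigned32(-197)))   # 0xffffff3b
print(to_signed32(0xFFFFFF3B))    # -197
```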
Latest Trickles Received

Time Sent (UTC) | Host ID | Result ID | Result Name | Timestep | CPU Time (sec) | Average (sec/TS) |
---|---|---|---|---|---|---|
02 Sep 2011 13:16:51 | 1083596 | 11909564 | hadsm3dhet2_u9ir_006726914_8 | 43,208 | 756,762 | 2.5021 |
24 Aug 2011 20:28:16 | 1083596 | 11909564 | hadsm3dhet2_u9ir_006726914_8 | 32,406 | 731,786 | 2.5091 |
18 Aug 2011 19:00:39 | 1083596 | 11909564 | hadsm3dhet2_u9ir_006726914_8 | 21,604 | 707,574 | 2.5194 |
11 Aug 2011 10:28:45 | 1083596 | 11909564 | hadsm3dhet2_u9ir_006726914_8 | 10,802 | 682,833 | 2.5285 |
28 Jul 2011 12:02:52 | 1083596 | 11909564 | hadsm3dhet2_u9ir_006726914_8 | 259,248 | 657,487 | 2.5361 |
24 Jun 2011 15:49:45 | 1083596 | 11909564 | hadsm3dhet2_u9ir_006726914_8 | 248,446 | 632,130 | 2.5443 |
06 Jun 2011 14:14:14 | 1083596 | 11909564 | hadsm3dhet2_u9ir_006726914_8 | 237,644 | 606,439 | 2.5519 |
12 May 2011 10:44:13 | 1083596 | 11909564 | hadsm3dhet2_u9ir_006726914_8 | 226,842 | 579,426 | 2.5543 |
13 Apr 2011 07:36:04 | 1083596 | 11909564 | hadsm3dhet2_u9ir_006726914_8 | 216,040 | 552,531 | 2.5575 |
28 Mar 2011 09:12:27 | 1083596 | 11909564 | hadsm3dhet2_u9ir_006726914_8 | 205,238 | 524,625 | 2.5562 |
25 Feb 2011 14:49:58 | 1083596 | 11909564 | hadsm3dhet2_u9ir_006726914_8 | 194,436 | 497,981 | 2.5612 |
16 Feb 2011 09:39:43 | 1083596 | 11909564 | hadsm3dhet2_u9ir_006726914_8 | 183,634 | 471,300 | 2.5665 |
08 Feb 2011 07:20:56 | 1083596 | 11909564 | hadsm3dhet2_u9ir_006726914_8 | 172,832 | 443,544 | 2.5663 |
02 Feb 2011 08:34:05 | 1083596 | 11909564 | hadsm3dhet2_u9ir_006726914_8 | 162,030 | 417,014 | 2.5737 |
21 Jan 2011 11:50:28 | 1083596 | 11909564 | hadsm3dhet2_u9ir_006726914_8 | 151,228 | 388,071 | 2.5661 |
31 Dec 2010 09:43:51 | 1083596 | 11909564 | hadsm3dhet2_u9ir_006726914_8 | 140,426 | 361,113 | 2.5716 |
22 Dec 2010 11:04:57 | 1083596 | 11909564 | hadsm3dhet2_u9ir_006726914_8 | 129,624 | 332,772 | 2.5672 |
16 Dec 2010 18:45:58 | 1083596 | 11909564 | hadsm3dhet2_u9ir_006726914_8 | 118,822 | 303,866 | 2.5573 |
10 Dec 2010 14:05:28 | 1083596 | 11909564 | hadsm3dhet2_u9ir_006726914_8 | 108,020 | 273,675 | 2.5336 |
05 Dec 2010 11:04:52 | 1083596 | 11909564 | hadsm3dhet2_u9ir_006726914_8 | 97,218 | 244,169 | 2.5116 |
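
The Average (sec/TS) column can be reproduced from the other two numeric columns, with one caveat: the Timestep counter drops from 259,248 back to 10,802 between the 28 Jul 2011 and 11 Aug 2011 trickles. The figures match only if the average divides CPU time by the cumulative timestep count, so the counter appears to restart after 259,248. This is an inference from the numbers in the table, not documented behaviour, and the names `PHASE_LENGTH` and `average_sec_per_timestep` below are hypothetical.

```python
# Assumption inferred from the table: the Timestep counter restarts
# after reaching this value.
PHASE_LENGTH = 259_248


def average_sec_per_timestep(cpu_time_sec: int, timestep: int,
                             completed_phases: int = 0) -> float:
    """CPU seconds divided by the cumulative number of timesteps."""
    total_timesteps = completed_phases * PHASE_LENGTH + timestep
    return cpu_time_sec / total_timesteps


# 05 Dec 2010 row: counter has not restarted yet.
print(round(average_sec_per_timestep(244_169, 97_218), 4))      # 2.5116
# 28 Jul 2011 row: last trickle before the counter restarts.
print(round(average_sec_per_timestep(657_487, 259_248), 4))     # 2.5361
# 02 Sep 2011 row: one full count of 259,248 already completed.
print(round(average_sec_per_timestep(756_762, 43_208, 1), 4))   # 2.5021
```

Dividing the 02 Sep 2011 CPU time by 43,208 alone would give roughly 17.5 sec/TS, which is why the restart has to be accounted for.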
©2025 cpdn.org