Name | hadsm3dhet2_u9hr_006726894_10 |
Workunit | 6930237 |
Created | 17 Sep 2010, 15:44:26 UTC |
Sent | 17 Sep 2010, 16:01:37 UTC |
Report deadline | 30 Aug 2011, 21:21:37 UTC |
Received | 3 Mar 2012, 13:24:18 UTC |
Server state | Over |
Outcome | Computation error |
Client state | Compute error |
Exit status | -226 (0xFFFFFF1E) ERR_TOO_MANY_EXITS |
Computer ID | 983041 |
Run time | 14 days 2 hours 5 min 1 sec |
CPU time | 13 days 2 hours 40 min 43 sec |
Validate state | Invalid |
Credit | 6,649.31 |
Device peak FLOPS | 1.97 GFLOPS |
Application version | UK Met Office HadSM3 Slab Model v6.07 windows_intelx86 |
Stderr | <core_client_version>6.6.36</core_client_version> <![CDATA[ <message> too many exit(0)s </message> <stderr_txt> CCPDN Monitor - Quit request from BOINC... No Process Handle CPDN process is not running, exiting, bRetVal = 1, checkPID=4076, selfPID=4076, iMonCtr=1 CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3800, iMonCtr=1 Model crash detected, will try to restart... CPDN Monitor - Quit request from BOINC... CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3352, iMonCtr=1 Model crash detected, will try to restart... CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=2952, iMonCtr=1 Model crash detected, will try to restart... CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=1568, iMonCtr=1 Model crash detected, will try to restart... CPDN Monitor - Quit request from BOINC... CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=2136, iMonCtr=1 Model crash detected, will try to restart... MainError: 12:29:01 AM No files match the supplied pattern. MainError: 12:29:01 AM No files match the supplied pattern. MainError: 12:29:01 AM No files match the supplied pattern. MainError: 12:29:01 AM No files match the supplied pattern. MainError: 12:29:01 AM No files match the supplied pattern. MainError: 12:29:01 AM No files match the supplied pattern. MainError: 12:29:01 AM No files match the supplied pattern. MainError: 12:29:01 AM No files match the supplied pattern. MainError: 12:29:01 AM No files match the supplied pattern. MainError: 12:29:01 AM No files match the supplied pattern. MainError: 12:29:01 AM No files match the supplied pattern. MainError: 12:29:01 AM No files match the supplied pattern. MainError: 12:29:01 AM No files match the supplied pattern. MainError: 12:29:01 AM No files match the supplied pattern. MainError: 12:29:01 AM No files match the supplied pattern. MainError: 12:29:01 AM No files match the supplied pattern. MainError: 12:29:01 AM No files match the supplied pattern. MainError: 12:29:01 AM No files match the supplied pattern. MainError: 12:29:01 AM No files match the supplied pattern. MainError: 12:29:01 AM No files match the supplied pattern. MainError: 12:29:01 AM No files match the supplied pattern. MainError: 12:29:01 AM No files match the supplied pattern. MainError: 12:29:01 AM No files match the supplied pattern. MainError: 12:29:01 AM No files match the supplied pattern. MainError: 12:29:01 AM No files match the supplied pattern. MainError: 12:29:01 AM No files match the supplied pattern. MainError: 12:29:01 AM No files match the supplied pattern. MainError: 12:29:01 AM No files match the supplied pattern. MainError: 12:29:01 AM No files match the supplied pattern. MainError: 12:29:01 AM No files match the supplied pattern. MainError: 12:29:01 AM No files match the supplied pattern. MainError: 12:29:01 AM No files match the supplied pattern. MainError: 12:29:01 AM No files match the supplied pattern. MainError: 12:29:01 AM No files match the supplied pattern. MainError: 12:29:01 AM No files match the supplied pattern. MainError: 12:29:01 AM No files match the supplied pattern. MainError: 12:29:01 AM No files match the supplied pattern. MainError: 12:29:01 AM No files match the supplied pattern. MainError: 12:29:01 AM No files match the supplied pattern. MainError: 12:29:01 AM No files match the supplied pattern. MainError: 12:29:01 AM No files match the supplied pattern. MainError: 12:29:01 AM No files match the supplied pattern. MainError: 12:29:01 AM No files match the supplied pattern. MainError: 12:29:01 AM No files match the supplied pattern. MainError: 12:29:01 AM No files match the supplied pattern. MainError: 12:29:01 AM No files match the supplied pattern. MainError: 12:29:01 AM No files match the supplied pattern. MainError: 12:29:01 AM No files match the supplied pattern. MainError: 12:29:01 AM No files match the supplied pattern. MainError: 12:29:01 AM No files match the supplied pattern. MainError: 12:29:01 AM No files match the supplied pattern. MainError: 12:29:01 AM No files match the supplied pattern. MainError: 12:29:01 AM No files match the supplied pattern. MainError: 12:29:01 AM No files match the supplied pattern. MainError: 12:29:01 AM No files match the supplied pattern. MainError: 12:29:01 AM No files match the supplied pattern. CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=2128, iMonCtr=1 Model crash detected, will try to restart... CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=1904, iMonCtr=1 Model crash detected, will try to restart... CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3908, iMonCtr=1 Model crash detected, will try to restart... CCPDN Monitor - Quit request from BOINC... CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=2368, iMonCtr=1 Model crash detected, will try to restart... MainError: 07:46:18 PM No files match the supplied pattern. MainError: 07:46:18 PM No files match the supplied pattern. MainError: 07:46:18 PM No files match the supplied pattern. MainError: 07:46:18 PM No files match the supplied pattern. MainError: 07:46:18 PM No files match the supplied pattern. MainError: 07:46:18 PM No files match the supplied pattern. MainError: 07:46:18 PM No files match the supplied pattern. MainError: 07:46:18 PM No files match the supplied pattern. MainError: 07:46:18 PM No files match the supplied pattern. MainError: 07:46:18 PM No files match the supplied pattern. MainError: 07:46:18 PM No files match the supplied pattern. MainError: 07:46:18 PM No files match the supplied pattern. MainError: 07:46:18 PM No files match the supplied pattern. MainError: 07:46:18 PM No files match the supplied pattern. MainError: 07:46:18 PM No files match the supplied pattern. MainError: 07:46:18 PM No files match the supplied pattern. MainError: 07:46:18 PM No files match the supplied pattern. MainError: 07:46:18 PM No files match the supplied pattern. MainError: 07:46:18 PM No files match the supplied pattern. MainError: 07:46:18 PM No files match the supplied pattern. MainError: 07:46:18 PM No files match the supplied pattern. MainError: 07:46:18 PM No files match the supplied pattern. MainError: 07:46:18 PM No files match the supplied pattern. MainError: 07:46:18 PM No files match the supplied pattern. MainError: 07:46:18 PM No files match the supplied pattern. MainError: 07:46:18 PM No files match the supplied pattern. MainError: 07:46:18 PM No files match the supplied pattern. MainError: 07:46:18 PM No files match the supplied pattern. MainError: 07:46:18 PM No files match the supplied pattern. MainError: 07:46:18 PM No files match the supplied pattern. MainError: 07:46:18 PM No files match the supplied pattern. MainError: 07:46:18 PM No files match the supplied pattern. MainError: 07:46:18 PM No files match the supplied pattern. MainError: 07:46:18 PM No files match the supplied pattern. MainError: 07:46:18 PM No files match the supplied pattern. MainError: 07:46:18 PM No files match the supplied pattern. MainError: 07:46:18 PM No files match the supplied pattern. MainError: 07:46:18 PM No files match the supplied pattern. MainError: 07:46:18 PM No files match the supplied pattern. MainError: 07:46:18 PM No files match the supplied pattern. MainError: 07:46:18 PM No files match the supplied pattern. MainError: 07:46:18 PM No files match the supplied pattern. MainError: 07:46:18 PM No files match the supplied pattern. MainError: 07:46:18 PM No files match the supplied pattern. MainError: 07:46:18 PM No files match the supplied pattern. MainError: 07:46:18 PM No files match the supplied pattern. MainError: 07:46:18 PM No files match the supplied pattern. MainError: 07:46:18 PM No files match the supplied pattern. MainError: 07:46:18 PM No files match the supplied pattern. MainError: 07:46:18 PM No files match the supplied pattern. MainError: 07:46:18 PM No files match the supplied pattern. MainError: 07:46:18 PM No files match the supplied pattern. MainError: 07:46:18 PM No files match the supplied pattern. MainError: 07:46:18 PM No files match the supplied pattern. MainError: 07:46:18 PM No files match the supplied pattern. MainError: 07:46:18 PM No files match the supplied pattern. CCCPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3888, iMonCtr=1 Model crash detected, will try to restart... No Process Handle CPDN process is not running, exiting, bRetVal = 1, checkPID=4064, selfPID=4064, iMonCtr=1 CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3172, iMonCtr=1 Model crash detected, will try to restart... CPDN Monitor - Quit request from BOINC... forrtl: Access is denied. CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=1064, iMonCtr=1 Model crash detected, will try to restart... forrtl: Access is denied. CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=1064, iMonCtr=1 Model crash detected, will try to restart... forrtl: Access is denied. CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=1064, iMonCtr=1 Model crash detected, will try to restart... forrtl: Access is denied. CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=1064, iMonCtr=1 Model crash detected, will try to restart... forrtl: Access is denied. CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=1064, iMonCtr=1 Model crash detected, will try to restart... forrtl: Access is denied. CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=1064, iMonCtr=1 Model crash detected, will try to restart... forrtl: Access is denied. CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=1064, iMonCtr=1 Model crash detected, will try to restart... forrtl: Access is denied. CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=1064, iMonCtr=1 Model crash detected, will try to restart... forrtl: Access is denied. CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=1064, iMonCtr=1 Model crash detected, will try to restart... forrtl: Access is denied. CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=1064, iMonCtr=1 Model crash detected, will try to restart... forrtl: Access is denied. CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=1064, iMonCtr=1 Model crash detected, will try to restart... forrtl: Access is denied. CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=1064, iMonCtr=1 Model crash detected, will try to restart... forrtl: Access is denied. CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=1064, iMonCtr=1 Model crash detected, will try to restart... forrtl: Access is denied. CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=1064, iMonCtr=1 Model crash detected, will try to restart... forrtl: Access is denied. CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=1064, iMonCtr=1 Model crash detected, will try to restart... forrtl: Access is denied. CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=1064, iMonCtr=1 Model crash detected, will try to restart... No heartbeat from core client for 30 sec - exiting Suspended CPDN Monitor - No 'heartbeat' from BOINC... CPDN Monitor - Quit request from BOINC... CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3236, iMonCtr=1 Model crash detected, will try to restart... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN process is not running, exiting, bRetVal = 0, checkPID=0, selfPID=0, iMonCtr=0 </stderr_txt> ]]> |
Latest Trickles Received | ||||||
---|---|---|---|---|---|---|
Time Sent (UTC) | Host ID | Result ID | Result Name | Timestep | CPU Time (sec) | Average (sec/TS) |
15 Feb 2012 11:29:30 | 983041 | 11910169 | hadsm3dhet2_u9hr_006726894_10 | 205,238 | 1,125,585 | 1.5552 |
15 Feb 2012 04:36:22 | 983041 | 11910169 | hadsm3dhet2_u9hr_006726894_10 | 194,436 | 1,108,637 | 1.5550 |
08 Feb 2012 23:10:22 | 983041 | 11910169 | hadsm3dhet2_u9hr_006726894_10 | 183,634 | 1,091,772 | 1.5549 |
26 Jan 2012 05:23:40 | 983041 | 11910169 | hadsm3dhet2_u9hr_006726894_10 | 172,832 | 1,075,042 | 1.5550 |
25 Jan 2012 23:12:26 | 983041 | 11910169 | hadsm3dhet2_u9hr_006726894_10 | 162,030 | 1,058,276 | 1.5551 |
11 Jan 2012 20:14:06 | 983041 | 11910169 | hadsm3dhet2_u9hr_006726894_10 | 151,228 | 1,041,534 | 1.5552 |
08 Jan 2012 13:05:53 | 983041 | 11910169 | hadsm3dhet2_u9hr_006726894_10 | 140,426 | 1,024,706 | 1.5551 |
31 Dec 2011 19:12:10 | 983041 | 11910169 | hadsm3dhet2_u9hr_006726894_10 | 129,624 | 1,007,809 | 1.5550 |
06 Dec 2011 16:57:34 | 983041 | 11910169 | hadsm3dhet2_u9hr_006726894_10 | 118,822 | 991,082 | 1.5551 |
05 Dec 2011 17:59:14 | 983041 | 11910169 | hadsm3dhet2_u9hr_006726894_10 | 108,020 | 974,183 | 1.5549 |
25 Nov 2011 15:52:45 | 983041 | 11910169 | hadsm3dhet2_u9hr_006726894_10 | 97,218 | 957,467 | 1.5551 |
21 Nov 2011 05:52:43 | 983041 | 11910169 | hadsm3dhet2_u9hr_006726894_10 | 86,416 | 940,720 | 1.5551 |
19 Nov 2011 23:05:34 | 983041 | 11910169 | hadsm3dhet2_u9hr_006726894_10 | 75,614 | 923,930 | 1.5551 |
16 Nov 2011 01:26:39 | 983041 | 11910169 | hadsm3dhet2_u9hr_006726894_10 | 64,812 | 907,195 | 1.5553 |
31 Oct 2011 18:18:15 | 983041 | 11910169 | hadsm3dhet2_u9hr_006726894_10 | 54,010 | 890,544 | 1.5555 |
11 Oct 2011 09:02:26 | 983041 | 11910169 | hadsm3dhet2_u9hr_006726894_10 | 43,208 | 873,804 | 1.5556 |
02 Oct 2011 01:30:04 | 983041 | 11910169 | hadsm3dhet2_u9hr_006726894_10 | 32,406 | 857,096 | 1.5558 |
09 Sep 2011 03:17:41 | 983041 | 11910169 | hadsm3dhet2_u9hr_006726894_10 | 21,604 | 840,295 | 1.5558 |
09 Aug 2011 18:10:46 | 983041 | 11910169 | hadsm3dhet2_u9hr_006726894_10 | 10,802 | 823,293 | 1.5554 |
29 Jul 2011 19:46:59 | 983041 | 11910169 | hadsm3dhet2_u9hr_006726894_10 | 259,248 | 806,164 | 1.5548 |
©2024 cpdn.org