Name | hadsm3dhet2_juu0_006602954_3 |
Workunit | 6806327 |
Created | 15 Mar 2010, 12:11:05 UTC |
Sent | 7 Jun 2010, 8:16:57 UTC |
Report deadline | 20 May 2011, 13:36:57 UTC |
Received | 16 Sep 2010, 2:23:30 UTC |
Server state | Over |
Outcome | Computation error |
Client state | Compute error |
Exit status | 22 (0x00000016) Unknown error code |
Computer ID | 1074285 |
Run time | 61 days 16 hours 17 min 26 sec |
CPU time | 52 days 4 hours 5 min 14 sec |
Validate state | Invalid |
Credit | 5,160.66 |
Device peak FLOPS | 1.42 GFLOPS |
Application version | UK Met Office HadSM3 Slab Model v6.07 windows_intelx86 |
Stderr | <core_client_version>6.10.18</core_client_version> <![CDATA[ <message> The device does not recognize the command. (0x16) - exit code 22 (0x16) </message> <stderr_txt> No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... MainError: 02:39:56 AM No files match the supplied pattern. MainError: 02:39:56 AM No files match the supplied pattern. MainError: 02:39:56 AM No files match the supplied pattern. MainError: 02:39:56 AM No files match the supplied pattern. MainError: 02:39:56 AM No files match the supplied pattern. MainError: 02:39:56 AM No files match the supplied pattern. MainError: 02:39:56 AM No files match the supplied pattern. MainError: 02:39:56 AM No files match the supplied pattern. MainError: 02:39:56 AM No files match the supplied pattern. MainError: 02:39:56 AM No files match the supplied pattern. MainError: 02:39:56 AM No files match the supplied pattern. MainError: 02:39:56 AM No files match the supplied pattern. MainError: 02:39:56 AM No files match the supplied pattern. MainError: 02:39:56 AM No files match the supplied pattern. MainError: 02:39:56 AM No files match the supplied pattern. MainError: 02:39:56 AM No files match the supplied pattern. MainError: 02:39:56 AM No files match the supplied pattern. MainError: 02:39:56 AM No files match the supplied pattern. MainError: 02:39:56 AM No files match the supplied pattern. MainError: 02:39:56 AM No files match the supplied pattern. MainError: 02:39:56 AM No files match the supplied pattern. MainError: 02:39:56 AM No files match the supplied pattern. MainError: 02:39:56 AM No files match the supplied pattern. MainError: 02:39:56 AM No files match the supplied pattern. MainError: 02:39:56 AM No files match the supplied pattern. MainError: 02:39:56 AM No files match the supplied pattern. MainError: 02:39:56 AM No files match the supplied pattern. MainError: 02:39:56 AM No files match the supplied pattern. MainError: 02:39:56 AM No files match the supplied pattern. MainError: 02:39:56 AM No files match the supplied pattern. MainError: 02:39:56 AM No files match the supplied pattern. MainError: 02:39:56 AM No files match the supplied pattern. MainError: 02:39:56 AM No files match the supplied pattern. MainError: 02:39:56 AM No files match the supplied pattern. MainError: 02:39:56 AM No files match the supplied pattern. MainError: 02:39:56 AM No files match the supplied pattern. MainError: 02:39:56 AM No files match the supplied pattern. MainError: 02:39:56 AM No files match the supplied pattern. MainError: 02:39:56 AM No files match the supplied pattern. MainError: 02:39:56 AM No files match the supplied pattern. MainError: 02:39:56 AM No files match the supplied pattern. MainError: 02:39:56 AM No files match the supplied pattern. MainError: 02:39:56 AM No files match the supplied pattern. MainError: 02:39:56 AM No files match the supplied pattern. MainError: 02:39:56 AM No files match the supplied pattern. MainError: 02:39:56 AM No files match the supplied pattern. MainError: 02:39:56 AM No files match the supplied pattern. MainError: 02:39:56 AM No files match the supplied pattern. MainError: 02:39:56 AM No files match the supplied pattern. MainError: 02:39:56 AM No files match the supplied pattern. MainError: 02:39:56 AM No files match the supplied pattern. MainError: 02:39:56 AM No files match the supplied pattern. MainError: 02:39:56 AM No files match the supplied pattern. MainError: 02:39:56 AM No files match the supplied pattern. MainError: 02:39:56 AM No files match the supplied pattern. MainError: 02:39:56 AM No files match the supplied pattern. MainError: 01:38:07 AM No files match the supplied pattern. MainError: 01:38:07 AM No files match the supplied pattern. MainError: 01:38:07 AM No files match the supplied pattern. MainError: 01:38:07 AM No files match the supplied pattern. MainError: 01:38:07 AM No files match the supplied pattern. MainError: 01:38:07 AM No files match the supplied pattern. MainError: 01:38:07 AM No files match the supplied pattern. MainError: 01:38:07 AM No files match the supplied pattern. MainError: 01:38:07 AM No files match the supplied pattern. MainError: 01:38:07 AM No files match the supplied pattern. MainError: 01:38:07 AM No files match the supplied pattern. MainError: 01:38:07 AM No files match the supplied pattern. MainError: 01:38:07 AM No files match the supplied pattern. MainError: 01:38:07 AM No files match the supplied pattern. MainError: 01:38:07 AM No files match the supplied pattern. MainError: 01:38:07 AM No files match the supplied pattern. MainError: 01:38:07 AM No files match the supplied pattern. MainError: 01:38:07 AM No files match the supplied pattern. MainError: 01:38:07 AM No files match the supplied pattern. MainError: 01:38:07 AM No files match the supplied pattern. MainError: 01:38:07 AM No files match the supplied pattern. MainError: 01:38:07 AM No files match the supplied pattern. MainError: 01:38:07 AM No files match the supplied pattern. MainError: 01:38:07 AM No files match the supplied pattern. MainError: 01:38:07 AM No files match the supplied pattern. MainError: 01:38:07 AM No files match the supplied pattern. MainError: 01:38:07 AM No files match the supplied pattern. MainError: 01:38:07 AM No files match the supplied pattern. MainError: 01:38:07 AM No files match the supplied pattern. MainError: 01:38:07 AM No files match the supplied pattern. MainError: 01:38:07 AM No files match the supplied pattern. MainError: 01:38:07 AM No files match the supplied pattern. MainError: 01:38:07 AM No files match the supplied pattern. MainError: 01:38:07 AM No files match the supplied pattern. MainError: 01:38:07 AM No files match the supplied pattern. MainError: 01:38:07 AM No files match the supplied pattern. MainError: 01:38:07 AM No files match the supplied pattern. MainError: 01:38:07 AM No files match the supplied pattern. MainError: 01:38:07 AM No files match the supplied pattern. MainError: 01:38:07 AM No files match the supplied pattern. MainError: 01:38:07 AM No files match the supplied pattern. MainError: 01:38:07 AM No files match the supplied pattern. MainError: 01:38:07 AM No files match the supplied pattern. MainError: 01:38:07 AM No files match the supplied pattern. MainError: 01:38:07 AM No files match the supplied pattern. MainError: 01:38:07 AM No files match the supplied pattern. MainError: 01:38:07 AM No files match the supplied pattern. MainError: 01:38:07 AM No files match the supplied pattern. MainError: 01:38:07 AM No files match the supplied pattern. MainError: 01:38:07 AM No files match the supplied pattern. MainError: 01:38:07 AM No files match the supplied pattern. MainError: 01:38:07 AM No files match the supplied pattern. MainError: 01:38:07 AM No files match the supplied pattern. MainError: 01:38:07 AM No files match the supplied pattern. MainError: 01:38:07 AM No files match the supplied pattern. MainError: 01:38:07 AM No files match the supplied pattern. forrtl: Access is denied. CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=6132, iMonCtr=1 Model crash detected, will try to restart... CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=6132, iMonCtr=1 Model crash detected, will try to restart... CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=6132, iMonCtr=1 Model crash detected, will try to restart... CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5716, iMonCtr=1 Model crash detected, will try to restart... Suspended CPDN Monitor - Quit request from BOINC... CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=6844, iMonCtr=1 Model crash detected, will try to restart... CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=6156, iMonCtr=1 Model crash detected, will try to restart... CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=6156, iMonCtr=1 Model crash detected, will try to restart... CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=6156, iMonCtr=1 Model crash detected, will try to restart... CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=6156, iMonCtr=1 Model crash detected, will try to restart... CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=6156, iMonCtr=1 Model crash detected, will try to restart... CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=6156, iMonCtr=1 Model crash detected, will try to restart... Sorry, too many model crashes! :-( called boinc_finish </stderr_txt> ]]> |
Latest Trickles Received | ||||||
---|---|---|---|---|---|---|
Time Sent (UTC) | Host ID | Result ID | Result Name | Timestep | CPU Time (sec) | Average (sec/TS) |
14 Jul 2010 23:00:38 | 1074285 | 11091272 | hadsm3dhet2_juu0_006602954_3 | 43,208 | 1,286,768 | 2.2908 |
14 Jul 2010 12:01:16 | 1074285 | 11091272 | hadsm3dhet2_juu0_006602954_3 | 32,406 | 1,262,191 | 2.2911 |
14 Jul 2010 01:44:10 | 1074285 | 11091272 | hadsm3dhet2_juu0_006602954_3 | 21,604 | 1,237,467 | 2.2912 |
13 Jul 2010 15:23:09 | 1074285 | 11091272 | hadsm3dhet2_juu0_006602954_3 | 10,802 | 1,212,785 | 2.2913 |
12 Jul 2010 01:39:37 | 1074285 | 11091272 | hadsm3dhet2_juu0_006602954_3 | 259,248 | 1,187,984 | 2.2912 |
11 Jul 2010 15:13:51 | 1074285 | 11091272 | hadsm3dhet2_juu0_006602954_3 | 248,446 | 1,162,892 | 2.2905 |
11 Jul 2010 06:04:20 | 1074285 | 11091272 | hadsm3dhet2_juu0_006602954_3 | 237,644 | 1,138,124 | 2.2905 |
10 Jul 2010 19:04:47 | 1074285 | 11091272 | hadsm3dhet2_juu0_006602954_3 | 226,842 | 1,113,322 | 2.2904 |
10 Jul 2010 08:50:51 | 1074285 | 11091272 | hadsm3dhet2_juu0_006602954_3 | 216,040 | 1,088,598 | 2.2904 |
09 Jul 2010 23:52:00 | 1074285 | 11091272 | hadsm3dhet2_juu0_006602954_3 | 205,238 | 1,063,933 | 2.2906 |
09 Jul 2010 12:17:59 | 1074285 | 11091272 | hadsm3dhet2_juu0_006602954_3 | 194,436 | 1,039,455 | 2.2911 |
09 Jul 2010 02:52:13 | 1074285 | 11091272 | hadsm3dhet2_juu0_006602954_3 | 183,634 | 1,014,885 | 2.2915 |
08 Jul 2010 17:28:29 | 1074285 | 11091272 | hadsm3dhet2_juu0_006602954_3 | 172,832 | 990,354 | 2.2921 |
08 Jul 2010 08:19:00 | 1074285 | 11091272 | hadsm3dhet2_juu0_006602954_3 | 162,030 | 965,769 | 2.2925 |
07 Jul 2010 19:33:43 | 1074285 | 11091272 | hadsm3dhet2_juu0_006602954_3 | 151,228 | 941,184 | 2.2929 |
07 Jul 2010 07:44:30 | 1074285 | 11091272 | hadsm3dhet2_juu0_006602954_3 | 140,426 | 916,577 | 2.2933 |
06 Jul 2010 20:29:35 | 1074285 | 11091272 | hadsm3dhet2_juu0_006602954_3 | 129,624 | 891,893 | 2.2935 |
06 Jul 2010 00:28:36 | 1074285 | 11091272 | hadsm3dhet2_juu0_006602954_3 | 118,822 | 867,074 | 2.2934 |
05 Jul 2010 14:39:11 | 1074285 | 11091272 | hadsm3dhet2_juu0_006602954_3 | 108,020 | 842,218 | 2.2932 |
05 Jul 2010 04:52:40 | 1074285 | 11091272 | hadsm3dhet2_juu0_006602954_3 | 97,218 | 817,324 | 2.2929 |
©2024 cpdn.org