Name | hadsm3dhet2_jjpo_006588542_0 |
Workunit | 6791915 |
Created | 15 Mar 2010, 11:50:27 UTC |
Sent | 26 Oct 2010, 14:37:49 UTC |
Report deadline | 8 Oct 2011, 19:57:49 UTC |
Received | 14 Dec 2010, 15:31:56 UTC |
Server state | Over |
Outcome | Computation error |
Client state | Compute error |
Exit status | 22 (0x00000016) Unknown error code |
Computer ID | 1032008 |
Run time | 23 days 3 hours 23 min 3 sec |
CPU time | 20 days 19 hours 44 min 47 sec |
Validate state | Invalid |
Credit | 4,862.93 |
Device peak FLOPS | 0.96 GFLOPS |
Application version | UK Met Office HadSM3 Slab Model v6.07 windows_intelx86 |
Stderr | <core_client_version>6.10.18</core_client_version> <![CDATA[ <message> The device does not recognize the command. (0x16) - exit code 22 (0x16) </message> <stderr_txt> CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4620, iMonCtr=1 Model crash detected, will try to restart... CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4620, iMonCtr=1 Model crash detected, will try to restart... CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4620, iMonCtr=1 Model crash detected, will try to restart... No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4464, iMonCtr=1 Model crash detected, will try to restart... CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4524, iMonCtr=1 Model crash detected, will try to restart... CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5964, iMonCtr=1 Model crash detected, will try to restart... CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4616, iMonCtr=1 Model crash detected, will try to restart... No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=276, iMonCtr=1 Model crash detected, will try to restart... CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4208, iMonCtr=1 Model crash detected, will try to restart... MainError: 07:02:13 PM No files match the supplied pattern. MainError: 07:02:13 PM No files match the supplied pattern. MainError: 07:02:13 PM No files match the supplied pattern. MainError: 07:02:13 PM No files match the supplied pattern. MainError: 07:02:13 PM No files match the supplied pattern. 
MainError: 07:02:13 PM No files match the supplied pattern. MainError: 07:02:13 PM No files match the supplied pattern. MainError: 07:02:13 PM No files match the supplied pattern. MainError: 07:02:13 PM No files match the supplied pattern. MainError: 07:02:13 PM No files match the supplied pattern. MainError: 07:02:13 PM No files match the supplied pattern. MainError: 07:02:13 PM No files match the supplied pattern. MainError: 07:02:13 PM No files match the supplied pattern. MainError: 07:02:13 PM No files match the supplied pattern. MainError: 07:02:13 PM No files match the supplied pattern. MainError: 07:02:13 PM No files match the supplied pattern. MainError: 07:02:13 PM No files match the supplied pattern. MainError: 07:02:13 PM No files match the supplied pattern. MainError: 07:02:13 PM No files match the supplied pattern. MainError: 07:02:13 PM No files match the supplied pattern. MainError: 07:02:13 PM No files match the supplied pattern. MainError: 07:02:13 PM No files match the supplied pattern. MainError: 07:02:13 PM No files match the supplied pattern. MainError: 07:02:13 PM No files match the supplied pattern. MainError: 07:02:13 PM No files match the supplied pattern. MainError: 07:02:13 PM No files match the supplied pattern. MainError: 07:02:13 PM No files match the supplied pattern. MainError: 07:02:13 PM No files match the supplied pattern. MainError: 07:02:13 PM No files match the supplied pattern. MainError: 07:02:13 PM No files match the supplied pattern. MainError: 07:02:13 PM No files match the supplied pattern. MainError: 07:02:13 PM No files match the supplied pattern. MainError: 07:02:13 PM No files match the supplied pattern. MainError: 07:02:13 PM No files match the supplied pattern. MainError: 07:02:13 PM No files match the supplied pattern. MainError: 07:02:13 PM No files match the supplied pattern. MainError: 07:02:13 PM No files match the supplied pattern. MainError: 07:02:13 PM No files match the supplied pattern. 
MainError: 07:02:13 PM No files match the supplied pattern. MainError: 07:02:13 PM No files match the supplied pattern. MainError: 07:02:13 PM No files match the supplied pattern. MainError: 07:02:13 PM No files match the supplied pattern. MainError: 07:02:13 PM No files match the supplied pattern. MainError: 07:02:13 PM No files match the supplied pattern. MainError: 07:02:13 PM No files match the supplied pattern. MainError: 07:02:13 PM No files match the supplied pattern. MainError: 07:02:13 PM No files match the supplied pattern. MainError: 07:02:13 PM No files match the supplied pattern. MainError: 07:02:13 PM No files match the supplied pattern. MainError: 07:02:13 PM No files match the supplied pattern. MainError: 07:02:13 PM No files match the supplied pattern. MainError: 07:02:13 PM No files match the supplied pattern. MainError: 07:02:13 PM No files match the supplied pattern. MainError: 07:02:13 PM No files match the supplied pattern. MainError: 07:02:13 PM No files match the supplied pattern. MainError: 07:02:13 PM No files match the supplied pattern. CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5816, iMonCtr=1 Model crash detected, will try to restart... CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5816, iMonCtr=1 Model crash detected, will try to restart... No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4236, iMonCtr=1 Model crash detected, will try to restart... CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5368, iMonCtr=1 Model crash detected, will try to restart... CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5964, iMonCtr=1 Model crash detected, will try to restart... CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5476, iMonCtr=1 Model crash detected, will try to restart... 
CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5076, iMonCtr=1 Model crash detected, will try to restart... CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5328, iMonCtr=1 Model crash detected, will try to restart... CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5168, iMonCtr=1 Model crash detected, will try to restart... CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5064, iMonCtr=1 Model crash detected, will try to restart... CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5064, iMonCtr=1 Model crash detected, will try to restart... CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5236, iMonCtr=1 Model crash detected, will try to restart... CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4428, iMonCtr=1 Model crash detected, will try to restart... CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4892, iMonCtr=1 Model crash detected, will try to restart... CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5064, iMonCtr=1 Model crash detected, will try to restart... CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5440, iMonCtr=1 Model crash detected, will try to restart... CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4816, iMonCtr=1 Model crash detected, will try to restart... No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5572, iMonCtr=1 Model crash detected, will try to restart... CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4324, iMonCtr=1 Model crash detected, will try to restart... MainError: 04:31:54 AM No files match the supplied pattern. MainError: 04:31:54 AM No files match the supplied pattern. MainError: 04:31:54 AM No files match the supplied pattern. 
MainError: 04:31:54 AM No files match the supplied pattern. MainError: 04:31:54 AM No files match the supplied pattern. MainError: 04:31:54 AM No files match the supplied pattern. MainError: 04:31:54 AM No files match the supplied pattern. MainError: 04:31:54 AM No files match the supplied pattern. MainError: 04:31:54 AM No files match the supplied pattern. MainError: 04:31:54 AM No files match the supplied pattern. MainError: 04:31:54 AM No files match the supplied pattern. MainError: 04:31:54 AM No files match the supplied pattern. MainError: 04:31:54 AM No files match the supplied pattern. MainError: 04:31:54 AM No files match the supplied pattern. MainError: 04:31:54 AM No files match the supplied pattern. MainError: 04:31:54 AM No files match the supplied pattern. MainError: 04:31:54 AM No files match the supplied pattern. MainError: 04:31:54 AM No files match the supplied pattern. MainError: 04:31:54 AM No files match the supplied pattern. MainError: 04:31:54 AM No files match the supplied pattern. MainError: 04:31:54 AM No files match the supplied pattern. MainError: 04:31:54 AM No files match the supplied pattern. MainError: 04:31:54 AM No files match the supplied pattern. MainError: 04:31:54 AM No files match the supplied pattern. MainError: 04:31:54 AM No files match the supplied pattern. MainError: 04:31:54 AM No files match the supplied pattern. MainError: 04:31:54 AM No files match the supplied pattern. MainError: 04:31:54 AM No files match the supplied pattern. MainError: 04:31:54 AM No files match the supplied pattern. MainError: 04:31:54 AM No files match the supplied pattern. MainError: 04:31:54 AM No files match the supplied pattern. MainError: 04:31:54 AM No files match the supplied pattern. MainError: 04:31:54 AM No files match the supplied pattern. MainError: 04:31:54 AM No files match the supplied pattern. MainError: 04:31:54 AM No files match the supplied pattern. MainError: 04:31:54 AM No files match the supplied pattern. 
MainError: 04:31:54 AM No files match the supplied pattern. MainError: 04:31:54 AM No files match the supplied pattern. MainError: 04:31:54 AM No files match the supplied pattern. MainError: 04:31:54 AM No files match the supplied pattern. MainError: 04:31:54 AM No files match the supplied pattern. MainError: 04:31:54 AM No files match the supplied pattern. MainError: 04:31:54 AM No files match the supplied pattern. MainError: 04:31:54 AM No files match the supplied pattern. MainError: 04:31:54 AM No files match the supplied pattern. MainError: 04:31:54 AM No files match the supplied pattern. MainError: 04:31:54 AM No files match the supplied pattern. MainError: 04:31:54 AM No files match the supplied pattern. MainError: 04:31:54 AM No files match the supplied pattern. MainError: 04:31:54 AM No files match the supplied pattern. MainError: 04:31:54 AM No files match the supplied pattern. MainError: 04:31:54 AM No files match the supplied pattern. MainError: 04:31:54 AM No files match the supplied pattern. MainError: 04:31:54 AM No files match the supplied pattern. MainError: 04:31:54 AM No files match the supplied pattern. MainError: 04:31:54 AM No files match the supplied pattern. CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5248, iMonCtr=1 Model crash detected, will try to restart... CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5248, iMonCtr=1 Model crash detected, will try to restart... CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5248, iMonCtr=1 Model crash detected, will try to restart... CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5248, iMonCtr=1 Model crash detected, will try to restart... CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5248, iMonCtr=1 Model crash detected, will try to restart... CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5248, iMonCtr=1 Model crash detected, will try to restart... 
Sorry, too many model crashes! :-( called boinc_finish </stderr_txt> ]]> |
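The stderr text above repeats a small set of messages many times. A minimal sketch for tallying them, assuming the stderr text is available as a plain string: the function name `tally_stderr` and the labels in `PATTERNS` are hypothetical and are not part of BOINC or the CPDN application. The message fragments themselves are copied from the log above.

```python
import re
from collections import Counter

# Hypothetical labels mapped to message fragments copied from the stderr text.
PATTERNS = {
    "model_crash": r"Model crash detected, will try to restart\.\.\.",
    "no_heartbeat": r"No heartbeat from core client for 30 sec - exiting",
    "no_files_match": r"MainError: \d{2}:\d{2}:\d{2} [AP]M No files match the supplied pattern\.",
}


def tally_stderr(text: str) -> Counter:
    """Count non-overlapping occurrences of each known message in the text."""
    return Counter({name: len(re.findall(rx, text)) for name, rx in PATTERNS.items()})


# A short excerpt assembled from messages that appear in the log above.
sample = (
    "CPDN process is not running, exiting, bRetVal = 1, checkPID=0, "
    "selfPID=4620, iMonCtr=1 Model crash detected, will try to restart... "
    "No heartbeat from core client for 30 sec - exiting "
    "CPDN Monitor - No 'heartbeat' from BOINC... "
    "MainError: 07:02:13 PM No files match the supplied pattern. "
    "MainError: 07:02:13 PM No files match the supplied pattern. "
)

counts = tally_stderr(sample)
for name, n in counts.items():
    print(f"{name}: {n}")
```

Running the same function on the full stderr text would give the totals for this task; the excerpt here only demonstrates the matching.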
Latest Trickles Received

Time Sent (UTC) | Host ID | Result ID | Result Name | Timestep | CPU Time (sec) | Average (sec/TS) |
---|---|---|---|---|---|---|
13 Dec 2010 15:02:58 | 1032008 | 10947145 | hadsm3dhet2_jjpo_006588542_0 | 10,802 | 1,765,628 | 3.3358 |
13 Dec 2010 09:31:09 | 1032008 | 10947145 | hadsm3dhet2_jjpo_006588542_0 | 259,248 | 1,730,117 | 3.3368 |
12 Dec 2010 17:42:50 | 1032008 | 10947145 | hadsm3dhet2_jjpo_006588542_0 | 248,446 | 1,694,313 | 3.3373 |
11 Dec 2010 17:40:05 | 1032008 | 10947145 | hadsm3dhet2_jjpo_006588542_0 | 237,644 | 1,657,911 | 3.3366 |
10 Dec 2010 21:14:48 | 1032008 | 10947145 | hadsm3dhet2_jjpo_006588542_0 | 226,842 | 1,621,798 | 3.3364 |
09 Dec 2010 22:01:42 | 1032008 | 10947145 | hadsm3dhet2_jjpo_006588542_0 | 216,040 | 1,584,330 | 3.3334 |
08 Dec 2010 22:07:17 | 1032008 | 10947145 | hadsm3dhet2_jjpo_006588542_0 | 205,238 | 1,544,716 | 3.3256 |
07 Dec 2010 21:33:31 | 1032008 | 10947145 | hadsm3dhet2_jjpo_006588542_0 | 194,436 | 1,506,542 | 3.3207 |
06 Dec 2010 19:56:04 | 1032008 | 10947145 | hadsm3dhet2_jjpo_006588542_0 | 183,634 | 1,470,767 | 3.3209 |
05 Dec 2010 20:25:19 | 1032008 | 10947145 | hadsm3dhet2_jjpo_006588542_0 | 172,832 | 1,435,233 | 3.3217 |
05 Dec 2010 09:50:07 | 1032008 | 10947145 | hadsm3dhet2_jjpo_006588542_0 | 162,030 | 1,399,679 | 3.3225 |
04 Dec 2010 23:11:42 | 1032008 | 10947145 | hadsm3dhet2_jjpo_006588542_0 | 151,228 | 1,364,326 | 3.3238 |
04 Dec 2010 11:34:34 | 1032008 | 10947145 | hadsm3dhet2_jjpo_006588542_0 | 140,426 | 1,328,171 | 3.3231 |
03 Dec 2010 13:33:39 | 1032008 | 10947145 | hadsm3dhet2_jjpo_006588542_0 | 129,624 | 1,292,300 | 3.3232 |
03 Dec 2010 09:24:41 | 1032008 | 10947145 | hadsm3dhet2_jjpo_006588542_0 | 118,822 | 1,256,825 | 3.3243 |
02 Dec 2010 14:41:20 | 1032008 | 10947145 | hadsm3dhet2_jjpo_006588542_0 | 108,020 | 1,220,706 | 3.3237 |
01 Dec 2010 07:40:52 | 1032008 | 10947145 | hadsm3dhet2_jjpo_006588542_0 | 97,218 | 1,184,810 | 3.3238 |
29 Nov 2010 21:30:20 | 1032008 | 10947145 | hadsm3dhet2_jjpo_006588542_0 | 86,416 | 1,149,243 | 3.3247 |
28 Nov 2010 15:23:06 | 1032008 | 10947145 | hadsm3dhet2_jjpo_006588542_0 | 75,614 | 1,113,276 | 3.3246 |
27 Nov 2010 19:56:53 | 1032008 | 10947145 | hadsm3dhet2_jjpo_006588542_0 | 64,812 | 1,077,613 | 3.3254 |
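The Average (sec/TS) column changes very little from row to row, so it can help to look at the rate between consecutive trickles instead. A minimal sketch, assuming each trickle is reduced to a (timestep, CPU seconds) pair: the function name `interval_rates` is hypothetical, and the three input rows are taken from the 11 to 13 Dec 2010 entries above, listed oldest first. The newest row in the table has a lower timestep than the one before it, so the sketch skips any pair where the timestep does not increase rather than guessing at the reason.

```python
def interval_rates(rows):
    """Return CPU seconds per timestep for each consecutive pair of trickles.

    rows: list of (timestep, cpu_seconds) tuples in chronological order.
    Pairs where the timestep does not increase are skipped.
    """
    rates = []
    for (ts0, cpu0), (ts1, cpu1) in zip(rows, rows[1:]):
        dts = ts1 - ts0
        if dts <= 0:
            continue  # timestep counter did not advance between these trickles
        rates.append((cpu1 - cpu0) / dts)
    return rates


# (timestep, CPU time in seconds) copied from the table, oldest first.
rows = [
    (237_644, 1_657_911),  # 11 Dec 2010 17:40:05
    (248_446, 1_694_313),  # 12 Dec 2010 17:42:50
    (259_248, 1_730_117),  # 13 Dec 2010 09:31:09
]

rates = interval_rates(rows)
print([round(r, 2) for r in rates])
```

Each trickle in these rows is 10,802 timesteps after the previous one, so the interval rate is simply the CPU-time difference divided by 10,802.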
©2024 climateprediction.net