climateprediction.net home page
Task 11025986

Task 11025986

Name hadsm3dhet2_jpsn_006596425_8
Workunit 6799798
Created 15 Mar 2010, 12:02:42 UTC
Sent 30 Sep 2010, 16:46:18 UTC
Report deadline 12 Sep 2011, 22:06:18 UTC
Received 18 Nov 2010, 16:58:16 UTC
Server state Over
Outcome Computation error
Client state Compute error
Exit status 22 (0x00000016) Unknown error code
Computer ID 1032711
Run time
CPU time 4 days 6 hours 10 min 46 sec
Validate state Invalid
Credit 2,679.57
Device peak FLOPS 2.06 GFLOPS
Application version UK Met Office HadSM3 Slab Model v6.07
windows_intelx86
Stderr
<core_client_version>6.2.28</core_client_version>
<![CDATA[
<message>
The device does not recognize the command. (0x16) - exit code 22 (0x16)
</message>
<stderr_txt>
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
CPDN Monitor - Quit request from BOINC...
Suspended CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
No heartbeat from core client for 30 sec - exiting
Suspended CPDN Monitor - No 'heartbeat' from BOINC...
No heartbeat from core client for 30 sec - exiting
No heartbeat from core client for 30 sec - exiting
No heartbeat from core client for 30 sec - exiting
No heartbeat from core client for 30 sec - exiting
No heartbeat from core client for 30 sec - exiting
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN process is not running, exiting, bRetVal = 1, checkPID=304, selfPID=304, iMonCtr=1
No heartbeat from core client for 30 sec - exiting
Suspended CPDN Monitor - No 'heartbeat' from BOINC...
No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN process is not running, exiting, bRetVal = 1, checkPID=2680, selfPID=2680, iMonCtr=1
Suspended CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
Suspended CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN process is not running, exiting, bRetVal = 1, checkPID=852, selfPID=852, iMonCtr=1
Suspended CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
Suspended CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
Suspended CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN process is not running, exiting, bRetVal = 1, checkPID=4892, selfPID=4892, iMonCtr=1
CPDN Monitor - Quit request from BOINC...
CPDN process is not running, exiting, bRetVal = 1, checkPID=4788, selfPID=4788, iMonCtr=1
CPDN Monitor - Quit request from BOINC...
Suspended CPDN Monitor - Quit request from BOINC...
CPDN process is not running, exiting, bRetVal = 1, checkPID=3564, selfPID=3564, iMonCtr=1
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
CPDN Monitor - Quit request from BOINC...
Suspended CPDN Monitor - Quit request from BOINC...
No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
CPDN process is not running, exiting, bRetVal = 1, checkPID=308, selfPID=308, iMonCtr=1
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN process is not running, exiting, bRetVal = 1, checkPID=5052, selfPID=5052, iMonCtr=1
CPDN Monitor - Quit request from BOINC...
MainError:	05:01:16 PM	No files match the supplied pattern.
MainError:	05:01:16 PM	No files match the supplied pattern.
MainError:	05:01:16 PM	No files match the supplied pattern.
MainError:	05:01:16 PM	No files match the supplied pattern.
MainError:	05:01:16 PM	No files match the supplied pattern.
MainError:	05:01:16 PM	No files match the supplied pattern.
MainError:	05:01:16 PM	No files match the supplied pattern.
MainError:	05:01:16 PM	No files match the supplied pattern.
MainError:	05:01:16 PM	No files match the supplied pattern.
MainError:	05:01:16 PM	No files match the supplied pattern.
MainError:	05:01:16 PM	No files match the supplied pattern.
MainError:	05:01:16 PM	No files match the supplied pattern.
MainError:	05:01:16 PM	No files match the supplied pattern.
MainError:	05:01:16 PM	No files match the supplied pattern.
MainError:	05:01:16 PM	No files match the supplied pattern.
MainError:	05:01:16 PM	No files match the supplied pattern.
MainError:	05:01:16 PM	No files match the supplied pattern.
MainError:	05:01:16 PM	No files match the supplied pattern.
MainError:	05:01:16 PM	No files match the supplied pattern.
MainError:	05:01:16 PM	No files match the supplied pattern.
MainError:	05:01:16 PM	No files match the supplied pattern.
MainError:	05:01:16 PM	No files match the supplied pattern.
MainError:	05:01:16 PM	No files match the supplied pattern.
MainError:	05:01:16 PM	No files match the supplied pattern.
MainError:	05:01:16 PM	No files match the supplied pattern.
MainError:	05:01:16 PM	No files match the supplied pattern.
MainError:	05:01:16 PM	No files match the supplied pattern.
MainError:	05:01:16 PM	No files match the supplied pattern.
MainError:	05:01:16 PM	No files match the supplied pattern.
MainError:	05:01:16 PM	No files match the supplied pattern.
MainError:	05:01:16 PM	No files match the supplied pattern.
MainError:	05:01:16 PM	No files match the supplied pattern.
MainError:	05:01:16 PM	No files match the supplied pattern.
MainError:	05:01:16 PM	No files match the supplied pattern.
MainError:	05:01:16 PM	No files match the supplied pattern.
MainError:	05:01:16 PM	No files match the supplied pattern.
MainError:	05:01:16 PM	No files match the supplied pattern.
MainError:	05:01:16 PM	No files match the supplied pattern.
MainError:	05:01:16 PM	No files match the supplied pattern.
MainError:	05:01:16 PM	No files match the supplied pattern.
MainError:	05:01:16 PM	No files match the supplied pattern.
MainError:	05:01:16 PM	No files match the supplied pattern.
MainError:	05:01:16 PM	No files match the supplied pattern.
MainError:	05:01:16 PM	No files match the supplied pattern.
MainError:	05:01:16 PM	No files match the supplied pattern.
MainError:	05:01:16 PM	No files match the supplied pattern.
MainError:	05:01:16 PM	No files match the supplied pattern.
MainError:	05:01:16 PM	No files match the supplied pattern.
MainError:	05:01:16 PM	No files match the supplied pattern.
MainError:	05:01:16 PM	No files match the supplied pattern.
MainError:	05:01:16 PM	No files match the supplied pattern.
MainError:	05:01:16 PM	No files match the supplied pattern.
MainError:	05:01:16 PM	No files match the supplied pattern.
MainError:	05:01:16 PM	No files match the supplied pattern.
MainError:	05:01:16 PM	No files match the supplied pattern.
MainError:	05:01:16 PM	No files match the supplied pattern.
Suspended CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
forrtl: Access is denied.
CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=1036, iMonCtr=1
Model crash detected, will try to restart...
forrtl: Access is denied.
CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=1036, iMonCtr=1
Model crash detected, will try to restart...
forrtl: Access is denied.
CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=1036, iMonCtr=1
Model crash detected, will try to restart...
forrtl: Access is denied.
CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=1036, iMonCtr=1
Model crash detected, will try to restart...
forrtl: Access is denied.
CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=1036, iMonCtr=1
Model crash detected, will try to restart...
forrtl: Access is denied.
Suspended CPDN Monitor - Quit request from BOINC...
forrtl: Access is denied.
CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4152, iMonCtr=1
Model crash detected, will try to restart...
forrtl: Access is denied.
CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4152, iMonCtr=1
Model crash detected, will try to restart...
forrtl: Access is denied.
CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4152, iMonCtr=1
Model crash detected, will try to restart...
forrtl: Access is denied.
CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4152, iMonCtr=1
Model crash detected, will try to restart...
forrtl: Access is denied.
CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4152, iMonCtr=1
Model crash detected, will try to restart...
forrtl: Access is denied.
CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4152, iMonCtr=1
Model crash detected, will try to restart...
Sorry, too many model crashes! :-(
called boinc_finish

</stderr_txt>
]]>
Latest Trickles Received
Time Sent (UTC) Host ID Result ID Result Name Timestep CPU Time (sec) Average (sec/TS)
18 Nov 2010 16:59:14 1032711 11025986 hadsm3dhet2_jpsn_006596425_8 32,406 363,643 1.2468
18 Nov 2010 16:59:14 1032711 11025986 hadsm3dhet2_jpsn_006596425_8 21,604 350,473 1.2479
18 Nov 2010 16:59:14 1032711 11025986 hadsm3dhet2_jpsn_006596425_8 10,802 337,229 1.2488
18 Nov 2010 16:59:14 1032711 11025986 hadsm3dhet2_jpsn_006596425_8 259,248 324,313 1.2510
18 Nov 2010 16:59:14 1032711 11025986 hadsm3dhet2_jpsn_006596425_8 248,446 311,035 1.2519
29 Oct 2010 16:21:00 1032711 11025986 hadsm3dhet2_jpsn_006596425_8 237,644 296,794 1.2489
29 Oct 2010 10:58:05 1032711 11025986 hadsm3dhet2_jpsn_006596425_8 226,842 283,538 1.2499
29 Oct 2010 10:58:05 1032711 11025986 hadsm3dhet2_jpsn_006596425_8 216,040 269,986 1.2497
27 Oct 2010 14:51:58 1032711 11025986 hadsm3dhet2_jpsn_006596425_8 205,238 256,824 1.2513
25 Oct 2010 11:43:50 1032711 11025986 hadsm3dhet2_jpsn_006596425_8 194,436 242,825 1.2489
22 Oct 2010 11:33:34 1032711 11025986 hadsm3dhet2_jpsn_006596425_8 183,634 229,340 1.2489
20 Oct 2010 15:14:46 1032711 11025986 hadsm3dhet2_jpsn_006596425_8 172,832 216,403 1.2521
18 Oct 2010 19:36:44 1032711 11025986 hadsm3dhet2_jpsn_006596425_8 162,030 203,049 1.2532
15 Oct 2010 16:56:44 1032711 11025986 hadsm3dhet2_jpsn_006596425_8 151,228 188,882 1.2490
13 Oct 2010 18:24:03 1032711 11025986 hadsm3dhet2_jpsn_006596425_8 140,426 174,420 1.2421
11 Oct 2010 16:41:42 1032711 11025986 hadsm3dhet2_jpsn_006596425_8 129,624 161,001 1.2421
11 Oct 2010 14:16:07 1032711 11025986 hadsm3dhet2_jpsn_006596425_8 118,822 147,634 1.2425
11 Oct 2010 14:16:07 1032711 11025986 hadsm3dhet2_jpsn_006596425_8 108,020 134,622 1.2463
11 Oct 2010 14:16:07 1032711 11025986 hadsm3dhet2_jpsn_006596425_8 97,218 121,683 1.2517
11 Oct 2010 14:16:07 1032711 11025986 hadsm3dhet2_jpsn_006596425_8 86,416 108,816 1.2592


©2024 cpdn.org