climateprediction.net home page
Task 10989770

Task 10989770

Name hadsm3dhet2_jn02_006592804_3
Workunit 6796177
Created 15 Mar 2010, 11:57:58 UTC
Sent 14 Oct 2010, 14:38:38 UTC
Report deadline 26 Sep 2011, 19:58:38 UTC
Received 21 Nov 2010, 18:03:54 UTC
Server state Over
Outcome Computation error
Client state Compute error
Exit status 22 (0x00000016) Unknown error code
Computer ID 266516
Run time 7 days 21 hours 10 min 23 sec
CPU time 5 days 23 hours 17 min 4 sec
Validate state Invalid
Credit 1,687.14
Device peak FLOPS 1.31 GFLOPS
Application version UK Met Office HadSM3 Slab Model v6.07
windows_intelx86
Stderr
<core_client_version>6.6.20</core_client_version>
<![CDATA[
<message>
The device does not recognize the command. (0x16) - exit code 22 (0x16)
</message>
<stderr_txt>
CPDN Monitor - Quit request from BOINC...
Suspended CPDN Monitor - Quit request from BOINC...
Suspended CPDN Monitor - Quit request from BOINC...
Suspended CPDN Monitor - Quit request from BOINC...
No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
Suspended CPDN Monitor - Quit request from BOINC...
Suspended CPDN Monitor - Quit request from BOINC...
Suspended CPDN Monitor - Quit request from BOINC...
No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
Suspended CPDN Monitor - Quit request from BOINC...
No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
No heartbeat from core client for 30 sec - exiting
No heartbeat from core client for 30 sec - exiting
No heartbeat from core client for 30 sec - exiting
CPDN Monitor - Quit request from BOINC...
No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
CPDN Monitor - Quit request from BOINC...
Suspended CPDN Monitor - Quit request from BOINC...
Suspended CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
Suspended CPDN Monitor - Quit request from BOINC...
Suspended CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
Suspended CPDN Monitor - Quit request from BOINC...
Suspended CPDN Monitor - Quit request from BOINC...
Suspended CPDN Monitor - Quit request from BOINC...
Suspended CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
Suspended CPDN Monitor - Quit request from BOINC...
Suspended CPDN Monitor - Quit request from BOINC...
Suspended CPDN Monitor - Quit request from BOINC...
Suspended CPDN Monitor - Quit request from BOINC...
Suspended CPDN Monitor - Quit request from BOINC...
Suspended CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
Suspended CPDN Monitor - Quit request from BOINC...
No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
Suspended CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
Suspended CPDN Monitor - Quit request from BOINC...
Suspended CPDN Monitor - Quit request from BOINC...
Suspended CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
Suspended CPDN Monitor - Quit request from BOINC...
Suspended CPDN Monitor - Quit request from BOINC...
Suspended CPDN Monitor - Quit request from BOINC...
Suspended CPDN Monitor - Quit request from BOINC...
Suspended CPDN Monitor - Quit request from BOINC...
forrtl: Access is denied.
CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3860, iMonCtr=1
Model crash detected, will try to restart...
forrtl: Access is denied.
CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3860, iMonCtr=1
Model crash detected, will try to restart...
forrtl: Access is denied.
CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3860, iMonCtr=1
Model crash detected, will try to restart...
forrtl: Access is denied.
CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3860, iMonCtr=1
Model crash detected, will try to restart...
forrtl: Access is denied.
CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3860, iMonCtr=1
Model crash detected, will try to restart...
forrtl: Access is denied.
CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3860, iMonCtr=1
Model crash detected, will try to restart...
Sorry, too many model crashes! :-(
called boinc_finish

</stderr_txt>
]]>
Latest Trickles Received
Time Sent (UTC) Host ID Result ID Result Name Timestep CPU Time (sec) Average (sec/TS)
19 Nov 2010 12:59:25 266516 10989770 hadsm3dhet2_jn02_006592804_3 183,634 492,366 2.6812
16 Nov 2010 20:25:56 266516 10989770 hadsm3dhet2_jn02_006592804_3 172,832 463,519 2.6819
15 Nov 2010 08:49:18 266516 10989770 hadsm3dhet2_jn02_006592804_3 162,030 434,527 2.6818
12 Nov 2010 10:01:54 266516 10989770 hadsm3dhet2_jn02_006592804_3 151,228 403,235 2.6664
09 Nov 2010 16:14:24 266516 10989770 hadsm3dhet2_jn02_006592804_3 140,426 372,627 2.6535
07 Nov 2010 09:05:14 266516 10989770 hadsm3dhet2_jn02_006592804_3 129,624 342,664 2.6435
03 Nov 2010 21:41:16 266516 10989770 hadsm3dhet2_jn02_006592804_3 118,822 313,403 2.6376
01 Nov 2010 11:41:08 266516 10989770 hadsm3dhet2_jn02_006592804_3 108,020 283,676 2.6261
30 Oct 2010 10:37:51 266516 10989770 hadsm3dhet2_jn02_006592804_3 97,218 254,373 2.6165
28 Oct 2010 13:20:17 266516 10989770 hadsm3dhet2_jn02_006592804_3 86,416 224,914 2.6027
26 Oct 2010 17:03:17 266516 10989770 hadsm3dhet2_jn02_006592804_3 75,614 196,210 2.5949
24 Oct 2010 18:13:25 266516 10989770 hadsm3dhet2_jn02_006592804_3 64,812 167,994 2.5920
22 Oct 2010 11:53:53 266516 10989770 hadsm3dhet2_jn02_006592804_3 54,010 139,539 2.5836
20 Oct 2010 16:25:58 266516 10989770 hadsm3dhet2_jn02_006592804_3 43,208 111,386 2.5779
19 Oct 2010 10:26:42 266516 10989770 hadsm3dhet2_jn02_006592804_3 32,406 83,195 2.5673
17 Oct 2010 16:10:34 266516 10989770 hadsm3dhet2_jn02_006592804_3 21,604 55,381 2.5635
16 Oct 2010 10:39:07 266516 10989770 hadsm3dhet2_jn02_006592804_3 10,802 28,004 2.5925


©2024 cpdn.org