climateprediction.net home page
Task 11091648

Task 11091648

Name hadsm3dhet2_juv1_006602991_9
Workunit 6806364
Created 15 Mar 2010, 12:11:07 UTC
Sent 7 Jun 2010, 4:47:09 UTC
Report deadline 20 May 2011, 10:07:09 UTC
Received 29 Jul 2010, 21:10:51 UTC
Server state Over
Outcome Computation error
Client state Compute error
Exit status 22 (0x00000016) Unknown error code
Computer ID 1063040
Run time 3 days 9 hours 30 min 33 sec
CPU time 3 days 9 hours 42 min 6 sec
Validate state Invalid
Credit 1,488.65
Device peak FLOPS 2.54 GFLOPS
Application version UK Met Office HadSM3 Slab Model v6.07
windows_intelx86
Stderr
<core_client_version>6.10.56</core_client_version>
<![CDATA[
<message>
The device does not recognize the command. (0x16) - exit code 22 (0x16)
</message>
<stderr_txt>
CPDN Monitor - Quit request from BOINC...
No Process Handle
CPDN process is not running, exiting, bRetVal = 1, checkPID=8580, selfPID=8580, iMonCtr=1
CPDN Monitor - Quit request from BOINC...
No Process Handle
CPDN process is not running, exiting, bRetVal = 1, checkPID=9128, selfPID=9128, iMonCtr=1
CPDN Monitor - Quit request from BOINC...
No Process Handle
CPDN process is not running, exiting, bRetVal = 1, checkPID=6132, selfPID=6132, iMonCtr=1
CPDN Monitor - Quit request from BOINC...
CPDN process is not running, exiting, bRetVal = 1, checkPID=4164, selfPID=4164, iMonCtr=1
CPDN Monitor - Quit request from BOINC...
No Process Handle
CPDN process is not running, exiting, bRetVal = 1, checkPID=6288, selfPID=6288, iMonCtr=1
CPDN Monitor - Quit request from BOINC...
No Process Handle
CPDN process is not running, exiting, bRetVal = 1, checkPID=6572, selfPID=6572, iMonCtr=1
CPDN Monitor - Quit request from BOINC...
No Process Handle
CPDN process is not running, exiting, bRetVal = 1, checkPID=6612, selfPID=6612, iMonCtr=1
CPDN Monitor - Quit request from BOINC...
No Process Handle
CPDN process is not running, exiting, bRetVal = 1, checkPID=7808, selfPID=7808, iMonCtr=1
CPDN Monitor - Quit request from BOINC...
No Process Handle
CPDN process is not running, exiting, bRetVal = 1, checkPID=6516, selfPID=6516, iMonCtr=1
CPDN Monitor - Quit request from BOINC...
No Process Handle
CPDN process is not running, exiting, bRetVal = 1, checkPID=4328, selfPID=4328, iMonCtr=1
CPDN Monitor - Quit request from BOINC...
No Process Handle
CPDN process is not running, exiting, bRetVal = 1, checkPID=948, selfPID=948, iMonCtr=1
CPDN Monitor - Quit request from BOINC...
CPDN process is not running, exiting, bRetVal = 1, checkPID=8688, selfPID=8688, iMonCtr=1
CPDN Monitor - Quit request from BOINC...
CPDN process is not running, exiting, bRetVal = 1, checkPID=7800, selfPID=7800, iMonCtr=1
CPDN Monitor - Quit request from BOINC...
No Process Handle
CPDN process is not running, exiting, bRetVal = 1, checkPID=9392, selfPID=9392, iMonCtr=1
CPDN Monitor - Quit request from BOINC...
No Process Handle
CPDN process is not running, exiting, bRetVal = 1, checkPID=8392, selfPID=8392, iMonCtr=1
CPDN Monitor - Quit request from BOINC...
No Process Handle
CPDN process is not running, exiting, bRetVal = 1, checkPID=9776, selfPID=9776, iMonCtr=1
CPDN Monitor - Quit request from BOINC...
No Process Handle
CPDN process is not running, exiting, bRetVal = 1, checkPID=7376, selfPID=7376, iMonCtr=1
CPDN Monitor - Quit request from BOINC...
No Process Handle
CPDN process is not running, exiting, bRetVal = 1, checkPID=8088, selfPID=8088, iMonCtr=1
CPDN Monitor - Quit request from BOINC...
No Process Handle
CPDN process is not running, exiting, bRetVal = 1, checkPID=9648, selfPID=9648, iMonCtr=1
CPDN Monitor - Quit request from BOINC...
No Process Handle
CPDN process is not running, exiting, bRetVal = 1, checkPID=6592, selfPID=6592, iMonCtr=1
CPDN Monitor - Quit request from BOINC...
No Process Handle
CPDN process is not running, exiting, bRetVal = 1, checkPID=5824, selfPID=5824, iMonCtr=1
CPDN Monitor - Quit request from BOINC...
No Process Handle
CPDN process is not running, exiting, bRetVal = 1, checkPID=6020, selfPID=6020, iMonCtr=1
CPDN Monitor - Quit request from BOINC...
No Process Handle
CPDN process is not running, exiting, bRetVal = 1, checkPID=7080, selfPID=7080, iMonCtr=1
CPDN Monitor - Quit request from BOINC...
No Process Handle
CPDN process is not running, exiting, bRetVal = 1, checkPID=1900, selfPID=1900, iMonCtr=1
CPDN Monitor - Quit request from BOINC...
No Process Handle
CPDN process is not running, exiting, bRetVal = 1, checkPID=10220, selfPID=10220, iMonCtr=1
CPDN Monitor - Quit request from BOINC...
No Process Handle
CPDN process is not running, exiting, bRetVal = 1, checkPID=9172, selfPID=9172, iMonCtr=1
CPDN Monitor - Quit request from BOINC...
CPDN process is not running, exiting, bRetVal = 1, checkPID=8096, selfPID=8096, iMonCtr=1
CPDN Monitor - Quit request from BOINC...
No Process Handle
CPDN process is not running, exiting, bRetVal = 1, checkPID=8480, selfPID=8480, iMonCtr=1
CPDN Monitor - Quit request from BOINC...
CPDN process is not running, exiting, bRetVal = 1, checkPID=8500, selfPID=8500, iMonCtr=1
CPDN Monitor - Quit request from BOINC...
No Process Handle
CPDN process is not running, exiting, bRetVal = 1, checkPID=6120, selfPID=6120, iMonCtr=1
CPDN Monitor - Quit request from BOINC...
No Process Handle
CPDN process is not running, exiting, bRetVal = 1, checkPID=8008, selfPID=8008, iMonCtr=1
CPDN Monitor - Quit request from BOINC...
No Process Handle
CPDN process is not running, exiting, bRetVal = 1, checkPID=2696, selfPID=2696, iMonCtr=1
CPDN Monitor - Quit request from BOINC...
No Process Handle
CPDN process is not running, exiting, bRetVal = 1, checkPID=3104, selfPID=3104, iMonCtr=1
CPDN Monitor - Quit request from BOINC...
No Process Handle
CPDN process is not running, exiting, bRetVal = 1, checkPID=8108, selfPID=8108, iMonCtr=1
No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
No heartbeat from core client for 30 sec - exiting
No heartbeat from core client for 30 sec - exiting
No heartbeat from core client for 30 sec - exiting
CPDN process is not running, exiting, bRetVal = 1, checkPID=1996, selfPID=1996, iMonCtr=1
No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
No heartbeat from core client for 30 sec - exiting
CPDN process is not running, exiting, bRetVal = 0, checkPID=0, selfPID=0, iMonCtr=0
No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
No heartbeat from core client for 30 sec - exiting
No heartbeat from core client for 30 sec - exiting
No heartbeat from core client for 30 sec - exiting
No heartbeat from core client for 30 sec - exiting
No heartbeat from core client for 30 sec - exiting
No heartbeat from core client for 30 sec - exiting
No heartbeat from core client for 30 sec - exiting
No heartbeat from core client for 30 sec - exiting
No heartbeat from core client for 30 sec - exiting
No heartbeat from core client for 30 sec - exiting
CPDN process is not running, exiting, bRetVal = 1, checkPID=1284, selfPID=1284, iMonCtr=1
CPDN Monitor - Quit request from BOINC...
CPDN process is not running, exiting, bRetVal = 1, checkPID=6588, selfPID=6588, iMonCtr=1
CPDN Monitor - Quit request from BOINC...
No Process Handle
CPDN process is not running, exiting, bRetVal = 1, checkPID=6352, selfPID=6352, iMonCtr=1
CPDN Monitor - Quit request from BOINC...
No Process Handle
CPDN process is not running, exiting, bRetVal = 1, checkPID=8476, selfPID=8476, iMonCtr=1
CPDN Monitor - Quit request from BOINC...
No Process Handle
CPDN process is not running, exiting, bRetVal = 1, checkPID=6404, selfPID=6404, iMonCtr=1
No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
No heartbeat from core client for 30 sec - exiting
No heartbeat from core client for 30 sec - exiting
No heartbeat from core client for 30 sec - exiting
No heartbeat from core client for 30 sec - exiting
No heartbeat from core client for 30 sec - exiting
No heartbeat from core client for 30 sec - exiting
No heartbeat from core client for 30 sec - exiting
No heartbeat from core client for 30 sec - exiting
No heartbeat from core client for 30 sec - exiting
No heartbeat from core client for 30 sec - exiting
CPDN process is not running, exiting, bRetVal = 1, checkPID=7116, selfPID=7116, iMonCtr=1
No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
No heartbeat from core client for 30 sec - exiting
forrtl: Access is denied.

CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=7156, iMonCtr=1
Model crash detected, will try to restart...
forrtl: Access is denied.

CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=7156, iMonCtr=1
Model crash detected, will try to restart...
forrtl: Access is denied.

CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=7156, iMonCtr=1
Model crash detected, will try to restart...
forrtl: Access is denied.

CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=7156, iMonCtr=1
Model crash detected, will try to restart...
forrtl: Access is denied.

CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=7156, iMonCtr=1
Model crash detected, will try to restart...
forrtl: Access is denied.

CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=7156, iMonCtr=1
Model crash detected, will try to restart...
Sorry, too many model crashes! :-(
called boinc_finish
CPDN process is not running, exiting, bRetVal = 1, checkPID=6956, selfPID=6956, iMonCtr=1

</stderr_txt>
]]>
Latest Trickles Received
Time Sent (UTC) Host ID Result ID Result Name Timestep CPU Time (sec) Average (sec/TS)
29 Jul 2010 00:46:37 1063040 11091648 hadsm3dhet2_juv1_006602991_9 162,030 188,898 1.1658
29 Jul 2010 00:46:37 1063040 11091648 hadsm3dhet2_juv1_006602991_9 151,228 176,597 1.1678
29 Jul 2010 00:46:37 1063040 11091648 hadsm3dhet2_juv1_006602991_9 140,426 164,326 1.1702
28 Jul 2010 01:50:32 1063040 11091648 hadsm3dhet2_juv1_006602991_9 129,624 151,898 1.1718
27 Jul 2010 23:42:49 1063040 11091648 hadsm3dhet2_juv1_006602991_9 118,822 139,122 1.1708
27 Jul 2010 11:40:53 1063040 11091648 hadsm3dhet2_juv1_006602991_9 108,020 126,531 1.1714
26 Jul 2010 17:02:10 1063040 11091648 hadsm3dhet2_juv1_006602991_9 97,218 113,794 1.1705
25 Jul 2010 13:20:20 1063040 11091648 hadsm3dhet2_juv1_006602991_9 86,416 101,117 1.1701
21 Jul 2010 18:52:12 1063040 11091648 hadsm3dhet2_juv1_006602991_9 75,614 88,465 1.1700
20 Jul 2010 02:49:02 1063040 11091648 hadsm3dhet2_juv1_006602991_9 64,812 75,651 1.1672
19 Jul 2010 22:27:16 1063040 11091648 hadsm3dhet2_juv1_006602991_9 54,010 62,691 1.1607
19 Jul 2010 08:35:40 1063040 11091648 hadsm3dhet2_juv1_006602991_9 43,208 49,826 1.1532
18 Jul 2010 12:11:25 1063040 11091648 hadsm3dhet2_juv1_006602991_9 32,406 37,020 1.1424
02 Jul 2010 13:15:58 1063040 11091648 hadsm3dhet2_juv1_006602991_9 21,604 24,458 1.1321
02 Jul 2010 08:59:06 1063040 11091648 hadsm3dhet2_juv1_006602991_9 10,802 12,198 1.1292


©2024 cpdn.org