Task 10991037

Name hadsm3dhet2_jn3l_006592931_0
Workunit 6796304
Created 15 Mar 2010, 11:58:03 UTC
Sent 14 Oct 2010, 10:30:38 UTC
Report deadline 26 Sep 2011, 15:50:38 UTC
Received 28 Oct 2010, 20:23:52 UTC
Server state Over
Outcome Computation error
Client state Compute error
Exit status 22 (0x00000016) Unknown error code
Computer ID 277996
Run time 6 days 4 hours 9 min 28 sec
CPU time 5 days 20 hours 26 min 48 sec
Validate state Invalid
Credit 1,984.87
Device peak FLOPS 1.51 GFLOPS
Application version UK Met Office HadSM3 Slab Model v6.07 windows_intelx86
Stderr
<core_client_version>6.10.36</core_client_version>
<![CDATA[
<message>
The device does not recognize the command. (0x16) - exit code 22 (0x16)
</message>
<stderr_txt>
Suspended CPDN Monitor - Quit request from BOINC...
Suspended CPDN Monitor - Quit request from BOINC...
No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
No heartbeat from core client for 30 sec - exiting
No heartbeat from core client for 30 sec - exiting
CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4188, iMonCtr=1
Model crash detected, will try to restart...
CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4188, iMonCtr=1
Model crash detected, will try to restart...
CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4188, iMonCtr=1
Model crash detected, will try to restart...
CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4188, iMonCtr=1
Model crash detected, will try to restart...
CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4188, iMonCtr=1
Model crash detected, will try to restart...
CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4188, iMonCtr=1
Model crash detected, will try to restart...
Sorry, too many model crashes! :-(
called boinc_finish

</stderr_txt>
]]>
Latest Trickles Received
Time Sent (UTC)       Host ID  Result ID  Result Name                   Timestep  CPU Time (sec)  Average (sec/TS)
28 Oct 2010 16:01:02  277996   10991037   hadsm3dhet2_jn3l_006592931_0  216,040   499,876         2.3138
28 Oct 2010 08:12:50  277996   10991037   hadsm3dhet2_jn3l_006592931_0  205,238   472,988         2.3046
28 Oct 2010 03:46:16  277996   10991037   hadsm3dhet2_jn3l_006592931_0  194,436   446,602         2.2969
27 Oct 2010 15:33:43  277996   10991037   hadsm3dhet2_jn3l_006592931_0  183,634   418,428         2.2786
27 Oct 2010 08:53:05  277996   10991037   hadsm3dhet2_jn3l_006592931_0  172,832   391,221         2.2636
27 Oct 2010 03:48:57  277996   10991037   hadsm3dhet2_jn3l_006592931_0  162,030   366,185         2.2600
26 Oct 2010 17:44:04  277996   10991037   hadsm3dhet2_jn3l_006592931_0  151,228   343,450         2.2711
25 Oct 2010 21:28:57  277996   10991037   hadsm3dhet2_jn3l_006592931_0  140,426   317,888         2.2637
25 Oct 2010 12:40:14  277996   10991037   hadsm3dhet2_jn3l_006592931_0  129,624   291,697         2.2503
25 Oct 2010 06:17:47  277996   10991037   hadsm3dhet2_jn3l_006592931_0  118,822   268,610         2.2606
24 Oct 2010 23:33:53  277996   10991037   hadsm3dhet2_jn3l_006592931_0  108,020   244,883         2.2670
24 Oct 2010 16:33:05  277996   10991037   hadsm3dhet2_jn3l_006592931_0  97,218    219,938         2.2623
24 Oct 2010 09:51:02  277996   10991037   hadsm3dhet2_jn3l_006592931_0  86,416    196,155         2.2699
24 Oct 2010 03:17:44  277996   10991037   hadsm3dhet2_jn3l_006592931_0  75,614    172,834         2.2857
23 Oct 2010 20:10:36  277996   10991037   hadsm3dhet2_jn3l_006592931_0  64,812    147,529         2.2763
23 Oct 2010 12:30:13  277996   10991037   hadsm3dhet2_jn3l_006592931_0  54,010    120,670         2.2342
23 Oct 2010 04:18:12  277996   10991037   hadsm3dhet2_jn3l_006592931_0  43,208    91,639          2.1209
21 Oct 2010 14:11:51  277996   10991037   hadsm3dhet2_jn3l_006592931_0  32,406    67,659          2.0879
21 Oct 2010 07:47:48  277996   10991037   hadsm3dhet2_jn3l_006592931_0  21,604    44,815          2.0744
21 Oct 2010 03:24:18  277996   10991037   hadsm3dhet2_jn3l_006592931_0  10,802    22,287          2.0632
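
The Average (sec/TS) figures in the trickle table appear to be the cumulative CPU time divided by the cumulative timestep count. This is an inference from the numbers, not something the page states. A minimal Python sketch that checks this for the newest and oldest trickles; the function name average_sec_per_timestep is hypothetical and used only for illustration:

```python
def average_sec_per_timestep(cpu_time_sec: int, timestep: int) -> float:
    """Return cumulative CPU seconds per model timestep.

    Assumption: the page's "Average (sec/TS)" column is simply
    CPU Time (sec) divided by Timestep.
    """
    if timestep <= 0:
        raise ValueError("timestep must be positive")
    return cpu_time_sec / timestep


# (timestep, CPU time in seconds) copied from the newest and oldest trickles
trickles = [
    (216040, 499876),  # 28 Oct 2010 16:01:02
    (10802, 22287),    # 21 Oct 2010 03:24:18
]

for timestep, cpu_time_sec in trickles:
    avg = average_sec_per_timestep(cpu_time_sec, timestep)
    print(f"{timestep:>7} timesteps: {avg:.4f} sec/TS")
```

For these two rows the computed values round to the table's 2.3138 and 2.0632.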

