climateprediction.net home page
Task 11282322

Task 11282322

Name hadsm3dhet2_k9kn_006622057_3
Workunit 6825430
Created 15 Mar 2010, 12:35:59 UTC
Sent 11 Apr 2010, 21:17:44 UTC
Report deadline 25 Mar 2011, 2:37:44 UTC
Received 9 Jun 2010, 17:41:22 UTC
Server state Over
Outcome Computation error
Client state Compute error
Exit status 22 (0x00000016) Unknown error code
Computer ID 1068446
Run time 2 days 14 hours 31 min 58 sec
CPU time 2 days 14 hours 35 min 37 sec
Validate state Invalid
Credit 1,687.14
Device peak FLOPS 0.00 GFLOPS
Application version ---
Stderr
<core_client_version>6.10.56</core_client_version>
<![CDATA[
<message>
The device does not recognize the command. (0x16) - exit code 22 (0x16)
</message>
<stderr_txt>
CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=2012, iMonCtr=1
Model crash detected, will try to restart...
CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=2012, iMonCtr=1
Model crash detected, will try to restart...
CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=2012, iMonCtr=1
Model crash detected, will try to restart...
CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=2012, iMonCtr=1
Model crash detected, will try to restart...
CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=2012, iMonCtr=1
Model crash detected, will try to restart...
CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=2012, iMonCtr=1
Model crash detected, will try to restart...
CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=2012, iMonCtr=1
Model crash detected, will try to restart...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=1656, iMonCtr=1
Model crash detected, will try to restart...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3140, iMonCtr=1
Model crash detected, will try to restart...
CPDN Monitor - Quit request from BOINC...
CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=2808, iMonCtr=1
Model crash detected, will try to restart...
CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=2808, iMonCtr=1
Model crash detected, will try to restart...
CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=2808, iMonCtr=1
Model crash detected, will try to restart...
CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=2808, iMonCtr=1
Model crash detected, will try to restart...
CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=2808, iMonCtr=1
Model crash detected, will try to restart...
CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=2808, iMonCtr=1
Model crash detected, will try to restart...
CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=2808, iMonCtr=1
Model crash detected, will try to restart...
CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=2808, iMonCtr=1
Model crash detected, will try to restart...
CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=2808, iMonCtr=1
Model crash detected, will try to restart...
Sorry, too many model crashes! :-(
called boinc_finish

</stderr_txt>
]]>
Latest Trickles Received
Time Sent (UTC) Host ID Result ID Result Name Timestep CPU Time (sec) Average (sec/TS)
08 Jun 2010 23:53:54 1068446 11282322 hadsm3dhet2_k9kn_006622057_3 183,634 217,202 1.1828
08 Jun 2010 23:53:54 1068446 11282322 hadsm3dhet2_k9kn_006622057_3 172,832 204,510 1.1833
08 Jun 2010 23:53:54 1068446 11282322 hadsm3dhet2_k9kn_006622057_3 162,030 191,908 1.1844
08 Jun 2010 23:53:54 1068446 11282322 hadsm3dhet2_k9kn_006622057_3 151,228 179,257 1.1853
08 Jun 2010 23:53:54 1068446 11282322 hadsm3dhet2_k9kn_006622057_3 140,426 166,534 1.1859
08 Jun 2010 23:53:54 1068446 11282322 hadsm3dhet2_k9kn_006622057_3 129,624 153,796 1.1865
08 Jun 2010 23:53:54 1068446 11282322 hadsm3dhet2_k9kn_006622057_3 118,822 141,115 1.1876
08 Jun 2010 23:53:54 1068446 11282322 hadsm3dhet2_k9kn_006622057_3 108,020 128,404 1.1887
08 Jun 2010 23:53:54 1068446 11282322 hadsm3dhet2_k9kn_006622057_3 97,218 115,768 1.1908
08 Jun 2010 23:53:54 1068446 11282322 hadsm3dhet2_k9kn_006622057_3 86,416 103,185 1.1940
08 Jun 2010 23:53:54 1068446 11282322 hadsm3dhet2_k9kn_006622057_3 75,614 90,587 1.1980
08 Jun 2010 23:53:54 1068446 11282322 hadsm3dhet2_k9kn_006622057_3 64,812 78,009 1.2036
08 Jun 2010 23:53:54 1068446 11282322 hadsm3dhet2_k9kn_006622057_3 54,010 65,406 1.2110
08 Jun 2010 23:53:54 1068446 11282322 hadsm3dhet2_k9kn_006622057_3 43,208 52,823 1.2225
08 Jun 2010 23:53:54 1068446 11282322 hadsm3dhet2_k9kn_006622057_3 32,406 39,947 1.2327
08 Jun 2010 23:53:54 1068446 11282322 hadsm3dhet2_k9kn_006622057_3 21,604 27,161 1.2572
09 May 2010 20:11:36 1068446 11282322 hadsm3dhet2_k9kn_006622057_3 10,802 13,775 1.2752


©2024 cpdn.org