climateprediction.net home page
Task 13091543

Task 13091543

Name hadcm3n_y8h1_1900_40_007343839_0
Workunit 7541269
Created 6 Jul 2011, 13:20:43 UTC
Sent 22 Jul 2011, 22:28:14 UTC
Report deadline 22 Oct 2011, 5:55:25 UTC
Received 4 Aug 2011, 11:39:39 UTC
Server state Over
Outcome Computation error
Client state Compute error
Exit status 22 (0x00000016) Unknown error code
Computer ID 1043044
Run time 3 days 23 hours 27 min
CPU time 2 days 22 hours 32 min 6 sec
Validate state Invalid
Credit 1,555.20
Device peak FLOPS 2.43 GFLOPS
Application version UK Met Office Coupled Model Full Resolution Ocean v6.07
windows_intelx86
Stderr
<core_client_version>6.10.60</core_client_version>
<![CDATA[
<message>
The device does not recognize the command. (0x16) - exit code 22 (0x16)
</message>
<stderr_txt>
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
17:04:45 (4300): No heartbeat from core client for 30 sec - exiting
17:04:46 (4300): No heartbeat from core client for 30 sec - exiting
17:04:47 (4300): No heartbeat from core client for 30 sec - exiting
17:04:48 (4300): No heartbeat from core client for 30 sec - exiting
17:04:49 (4300): No heartbeat from core client for 30 sec - exiting
17:04:50 (4300): No heartbeat from core client for 30 sec - exiting
17:04:51 (4300): No heartbeat from core client for 30 sec - exiting
17:04:52 (4300): No heartbeat from core client for 30 sec - exiting
17:04:53 (4300): No heartbeat from core client for 30 sec - exiting
17:04:54 (4300): No heartbeat from core client for 30 sec - exiting
17:04:55 (4300): No heartbeat from core client for 30 sec - exiting
17:04:56 (4300): No heartbeat from core client for 30 sec - exiting
17:04:57 (4300): No heartbeat from core client for 30 sec - exiting
17:04:58 (4300): No heartbeat from core client for 30 sec - exiting
17:04:59 (4300): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
17:34:30 (180): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
17:34:31 (180): No heartbeat from core client for 30 sec - exiting
18:24:31 (5384): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
02:46:34 (1908): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
CPDN Monitor - Quit request from BOINC...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=1884, iMonCtr=1
Model crash detected, will try to restart...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=1884, iMonCtr=1
Model crash detected, will try to restart...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=1884, iMonCtr=1
Model crash detected, will try to restart...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=1884, iMonCtr=1
Model crash detected, will try to restart...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=1884, iMonCtr=1
Model crash detected, will try to restart...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=1884, iMonCtr=1
Model crash detected, will try to restart...
Sorry, too many model crashes! :-(
Called boinc_finish

</stderr_txt>
]]>
Latest Trickles Received
Time Sent (UTC) Host ID Result ID Result Name Timestep CPU Time (sec) Average (sec/TS)
31 Jul 2011 17:50:03 1043044 13091543 hadcm3n_y8h1_1900_40_007343839_0 129,600 238,045 1.8368
31 Jul 2011 00:41:23 1043044 13091543 hadcm3n_y8h1_1900_40_007343839_0 103,680 190,617 1.8385
28 Jul 2011 14:32:45 1043044 13091543 hadcm3n_y8h1_1900_40_007343839_0 77,760 142,467 1.8321
27 Jul 2011 23:07:01 1043044 13091543 hadcm3n_y8h1_1900_40_007343839_0 51,840 95,391 1.8401
27 Jul 2011 07:29:58 1043044 13091543 hadcm3n_y8h1_1900_40_007343839_0 25,920 47,900 1.8480


©2024 cpdn.org