climateprediction.net home page
Task 13466493

Task 13466493

Name hadcm3n_u5qe_1980_40_007458857_2
Workunit 7656360
Created 7 Oct 2011, 19:46:43 UTC
Sent 7 Oct 2011, 19:46:51 UTC
Report deadline 7 Jan 2012, 3:14:02 UTC
Received 15 Nov 2011, 18:27:01 UTC
Server state Over
Outcome Computation error
Client state Compute error
Exit status 22 (0x00000016) Unknown error code
Computer ID 1047669
Run time 16 days 19 hours 41 min 27 sec
CPU time 16 days 9 hours 3 min 52 sec
Validate state Invalid
Credit 9,331.20
Device peak FLOPS 2.15 GFLOPS
Application version UK Met Office Coupled Model Full Resolution Ocean v6.07
windows_intelx86
Stderr
<core_client_version>6.12.26</core_client_version>
<![CDATA[
<message>
The device does not recognize the command. (0x16) - exit code 22 (0x16)
</message>
<stderr_txt>
14:52:45 (4052): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
20:06:20 (584): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
15:50:06 (2608): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
09:52:20 (3276): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
02:34:26 (3448): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
10:51:14 (2176): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
11:26:26 (2780): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
14:11:59 (2744): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
Atmos Hold Restart file rename failed on atmos_restart.hold
13:17:25 (3644): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=328, iMonCtr=1
Model crash detected, will try to restart...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=328, iMonCtr=1
Model crash detected, will try to restart...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=328, iMonCtr=1
Model crash detected, will try to restart...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=328, iMonCtr=1
Model crash detected, will try to restart...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=328, iMonCtr=1
Model crash detected, will try to restart...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=328, iMonCtr=1
Model crash detected, will try to restart...
Sorry, too many model crashes! :-(
Called boinc_finish

</stderr_txt>
]]>
Latest Trickles Received
Time Sent (UTC) Host ID Result ID Result Name Timestep CPU Time (sec) Average (sec/TS)
31 Oct 2011 16:43:26 1047669 13466493 hadcm3n_u5qe_1980_40_007458857_2 777,600 1,381,183 1.7762
31 Oct 2011 15:41:58 1047669 13466493 hadcm3n_u5qe_1980_40_007458857_2 751,680 1,334,880 1.7759
31 Oct 2011 15:01:53 1047669 13466493 hadcm3n_u5qe_1980_40_007458857_2 725,760 1,288,713 1.7757
31 Oct 2011 13:30:15 1047669 13466493 hadcm3n_u5qe_1980_40_007458857_2 699,840 1,242,445 1.7753
31 Oct 2011 13:30:15 1047669 13466493 hadcm3n_u5qe_1980_40_007458857_2 673,920 1,196,147 1.7749
31 Oct 2011 13:30:15 1047669 13466493 hadcm3n_u5qe_1980_40_007458857_2 648,000 1,149,703 1.7742
31 Oct 2011 13:30:15 1047669 13466493 hadcm3n_u5qe_1980_40_007458857_2 622,080 1,103,371 1.7737
31 Oct 2011 13:30:15 1047669 13466493 hadcm3n_u5qe_1980_40_007458857_2 596,160 1,056,961 1.7729
31 Oct 2011 13:30:15 1047669 13466493 hadcm3n_u5qe_1980_40_007458857_2 570,240 1,010,621 1.7723
31 Oct 2011 13:30:15 1047669 13466493 hadcm3n_u5qe_1980_40_007458857_2 544,320 964,264 1.7715
31 Oct 2011 13:30:15 1047669 13466493 hadcm3n_u5qe_1980_40_007458857_2 518,400 917,942 1.7707
31 Oct 2011 13:30:15 1047669 13466493 hadcm3n_u5qe_1980_40_007458857_2 492,480 871,469 1.7696
31 Oct 2011 13:30:15 1047669 13466493 hadcm3n_u5qe_1980_40_007458857_2 466,560 825,177 1.7686
31 Oct 2011 13:30:15 1047669 13466493 hadcm3n_u5qe_1980_40_007458857_2 440,640 778,960 1.7678
19 Oct 2011 02:48:52 1047669 13466493 hadcm3n_u5qe_1980_40_007458857_2 414,720 732,558 1.7664
18 Oct 2011 13:34:02 1047669 13466493 hadcm3n_u5qe_1980_40_007458857_2 388,800 686,423 1.7655
18 Oct 2011 00:37:11 1047669 13466493 hadcm3n_u5qe_1980_40_007458857_2 362,880 640,335 1.7646
17 Oct 2011 11:08:44 1047669 13466493 hadcm3n_u5qe_1980_40_007458857_2 336,960 594,320 1.7638
16 Oct 2011 21:10:05 1047669 13466493 hadcm3n_u5qe_1980_40_007458857_2 311,040 548,315 1.7628
16 Oct 2011 07:25:49 1047669 13466493 hadcm3n_u5qe_1980_40_007458857_2 285,120 502,307 1.7617


©2024 cpdn.org