Task 12790476

Name hadam3p_pnw_ycqp_1961_1_007213314_2
Workunit 7411594
Created 10 Apr 2011, 0:38:04 UTC
Sent 10 Apr 2011, 1:32:25 UTC
Report deadline 22 Mar 2012, 6:52:25 UTC
Received 29 Jun 2011, 23:04:20 UTC
Server state Over
Outcome Computation error
Client state Compute error
Exit status 0 (0x00000000)
Computer ID 1143523
Run time 4 days 11 hours 6 min 37 sec
CPU time 4 days 4 hours 3 min 8 sec
Validate state Invalid
Credit 2,254.93
Device peak FLOPS 2.56 GFLOPS
Application version UK Met Office HadAM3P-HadRM3P Pacific North West v6.09 windows_intelx86
Stderr
<core_client_version>6.10.60</core_client_version>
<![CDATA[
<stderr_txt>
18:31:31 (3460): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
CPDN Monitor - Quit request from BOINC...
18:21:36 (4472): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
23:53:08 (4344): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
20:14:55 (1216): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
11:22:53 (3276): No heartbeat from core client for 30 sec - exiting
11:22:54 (3276): No heartbeat from core client for 30 sec - exiting
11:22:55 (3276): No heartbeat from core client for 30 sec - exiting
11:22:56 (3276): No heartbeat from core client for 30 sec - exiting
11:22:57 (3276): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=5700, selfPID=5700, iMonCtr=2
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=1796, selfPID=1796, iMonCtr=2
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
20:22:50 (2768): No heartbeat from core client for 30 sec - exiting
20:22:51 (2768): No heartbeat from core client for 30 sec - exiting
20:22:52 (2768): No heartbeat from core client for 30 sec - exiting
20:22:53 (2768): No heartbeat from core client for 30 sec - exiting
20:22:54 (2768): No heartbeat from core client for 30 sec - exiting
20:22:55 (2768): No heartbeat from core client for 30 sec - exiting
20:22:56 (2768): No heartbeat from core client for 30 sec - exiting
20:22:57 (2768): No heartbeat from core client for 30 sec - exiting
20:22:58 (2768): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
20:26:47 (2680): No heartbeat from core client for 30 sec - exiting
20:26:48 (2680): No heartbeat from core client for 30 sec - exiting
20:26:49 (2680): No heartbeat from core client for 30 sec - exiting
20:26:50 (2680): No heartbeat from core client for 30 sec - exiting
20:26:51 (2680): No heartbeat from core client for 30 sec - exiting
20:26:52 (2680): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
20:26:53 (2680): No heartbeat from core client for 30 sec - exiting
20:24:13 (2936): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
20:24:15 (2936): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - Quit request from BOINC...
13:47:24 (2856): No heartbeat from core client for 30 sec - exiting
13:47:25 (2856): No heartbeat from core client for 30 sec - exiting
13:47:26 (2856): No heartbeat from core client for 30 sec - exiting
13:47:27 (2856): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
CPDN Monitor - Quit request from BOINC...
Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=3488, selfPID=3488, iMonCtr=2
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
CPDN Monitor - Quit request from BOINC...
Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=4776, selfPID=4776, iMonCtr=2
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4000, iMonCtr=2
Model crash detected, will try to restart...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=3040, selfPID=3916, iMonCtr=1
Model crash detected, will try to restart...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=4632, selfPID=4632, iMonCtr=2
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=2360, selfPID=780, iMonCtr=1
Model crash detected, will try to restart...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
20:42:55 (1432): No heartbeat from core client for 30 sec - exiting
20:42:56 (1432): No heartbeat from core client for 30 sec - exiting
20:42:57 (1432): No heartbeat from core client for 30 sec - exiting
20:42:58 (1432): No heartbeat from core client for 30 sec - exiting
20:42:59 (1432): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
20:43:01 (1432): No heartbeat from core client for 30 sec - exiting
20:59:49 (3068): No heartbeat from core client for 30 sec - exiting
20:59:50 (3068): No heartbeat from core client for 30 sec - exiting
20:59:51 (3068): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
CPDN Monitor - Quit request from BOINC...
21:35:00 (4360): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=4412, selfPID=4412, iMonCtr=2
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...

Model crashed: READHIST: End of file in READ from history file for namelist NLIHISTO                                                                                                                                                                                           tmp/xaakm.pipe_dummy                                                            2048    
Leaving CPDN_Main::Monitor...
Regional yearly means requires 12 input files got 9
19:39:40 (2932): No heartbeat from core client for 30 sec - exiting
19:39:41 (2932): No heartbeat from core client for 30 sec - exiting
19:39:42 (2932): No heartbeat from core client for 30 sec - exiting
19:39:43 (2932): No heartbeat from core client for 30 sec - exiting
Called boinc_finish

</stderr_txt>
<message>
<file_xfer_error>
  <file_name>hadam3p_pnw_ycqp_1961_1_007213314_2_10.zip</file_name>
  <error_code>-161</error_code>
</file_xfer_error>
<file_xfer_error>
  <file_name>hadam3p_pnw_ycqp_1961_1_007213314_2_11.zip</file_name>
  <error_code>-161</error_code>
</file_xfer_error>
<file_xfer_error>
  <file_name>hadam3p_pnw_ycqp_1961_1_007213314_2_12.zip</file_name>
  <error_code>-161</error_code>
</file_xfer_error>

</message>
]]>
Latest Trickles Received
Time Sent (UTC)       Host ID  Result ID  Result Name                          Timestep  CPU Time (sec)  Average (sec/TS)
26 Jun 2011 19:49:16  1143523  12790476   hadam3p_pnw_ycqp_1961_1_007213314_2  103,776   328,808         3.1684
22 Jun 2011 02:03:01  1143523  12790476   hadam3p_pnw_ycqp_1961_1_007213314_2  92,256    292,205         3.1673
19 Jun 2011 21:59:57  1143523  12790476   hadam3p_pnw_ycqp_1961_1_007213314_2  80,736    262,913         3.2565
10 Jun 2011 00:23:28  1143523  12790476   hadam3p_pnw_ycqp_1961_1_007213314_2  69,216    228,684         3.3039
04 Jun 2011 02:28:44  1143523  12790476   hadam3p_pnw_ycqp_1961_1_007213314_2  57,696    191,229         3.3144
26 May 2011 00:20:26  1143523  12790476   hadam3p_pnw_ycqp_1961_1_007213314_2  46,176    154,553         3.3470
22 May 2011 20:16:54  1143523  12790476   hadam3p_pnw_ycqp_1961_1_007213314_2  34,656    116,326         3.3566
18 May 2011 21:48:54  1143523  12790476   hadam3p_pnw_ycqp_1961_1_007213314_2  23,136    78,398          3.3886
28 Apr 2011 18:22:27  1143523  12790476   hadam3p_pnw_ycqp_1961_1_007213314_2  11,616    39,505          3.4009
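
The Average (sec/TS) column in the trickle table appears to be the cumulative CPU time divided by the timestep count. The sketch below checks that reading against three rows copied from the table. The function name and the tuple layout are illustrative assumptions, not part of any BOINC or CPDN tool.

```python
# Rows copied from the trickle table above:
# (timestep, cumulative CPU time in seconds, reported average sec/TS)
trickles = [
    (103_776, 328_808, 3.1684),
    (92_256, 292_205, 3.1673),
    (11_616, 39_505, 3.4009),
]


def average_sec_per_timestep(cpu_time_sec: int, timestep: int) -> float:
    """Return cumulative CPU seconds per model timestep (hypothetical helper)."""
    return cpu_time_sec / timestep


for timestep, cpu_time_sec, reported in trickles:
    computed = average_sec_per_timestep(cpu_time_sec, timestep)
    # Print the computed value next to the value shown on the page.
    print(f"timestep {timestep:>7}: computed {computed:.4f}, reported {reported:.4f}")
```

For these rows the computed values match the reported ones to four decimal places, which is consistent with the column being a running average rather than a per-trickle rate.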
