climateprediction.net home page
Task 18351266

Task 18351266

Name hadam3p_anz_d4vc_2012_1_009785565_0
Workunit 9841529
Created 24 Apr 2015, 18:56:13 UTC
Sent 26 Apr 2015, 5:11:41 UTC
Report deadline 7 Apr 2016, 10:31:41 UTC
Received 7 Jun 2015, 20:18:36 UTC
Server state Over
Outcome Computation error
Client state Compute error
Exit status 0 (0x00000000)
Computer ID 1278732
Run time 6 days 1 hours 7 min 11 sec
CPU time 5 days 21 hours 40 min 21 sec
Validate state Invalid
Credit 3,490.64
Device peak FLOPS 2.64 GFLOPS
Application version UK Met Office HadAM3P-HadRM3P Australia New Zealand v6.10
windows_intelx86
Stderr
<core_client_version>7.2.34</core_client_version>
<![CDATA[
<stderr_txt>
CPDN Monitor - Quit request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Quit request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Quit request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=3984, selfPID=3984, iMonCtr=2
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=2364, selfPID=4316, iMonCtr=1
Model crash detected, will try to restart...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
CPDN Monitor - Quit request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=996, selfPID=996, iMonCtr=2
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=2492, selfPID=5580, iMonCtr=1
Model crash detected, will try to restart...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=2072, iMonCtr=2
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=1140, iMonCtr=2
Model crash detected, will try to restart...
Leaving CPDN_Main::Monitor...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
CPDN Monitor - Quit request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
22:24:52 (5892): No heartbeat from core client for 30 sec - exiting
22:24:53 (5892): No heartbeat from core client for 30 sec - exiting
22:24:54 (5892): No heartbeat from core client for 30 sec - exiting
22:24:55 (5892): No heartbeat from core client for 30 sec - exiting
22:24:56 (5892): No heartbeat from core client for 30 sec - exiting
22:24:58 (5892): No heartbeat from core client for 30 sec - exiting
22:24:59 (5892): No heartbeat from core client for 30 sec - exiting
22:25:00 (5892): No heartbeat from core client for 30 sec - exiting
22:25:01 (5892): No heartbeat from core client for 30 sec - exiting
22:25:02 (5892): No heartbeat from core client for 30 sec - exiting
22:25:03 (5892): No heartbeat from core client for 30 sec - exiting
22:25:04 (5892): No heartbeat from core client for 30 sec - exiting
22:25:05 (5892): No heartbeat from core client for 30 sec - exiting
22:25:06 (5892): No heartbeat from core client for 30 sec - exiting
22:25:07 (5892): No heartbeat from core client for 30 sec - exiting
22:25:09 (5892): No heartbeat from core client for 30 sec - exiting
22:25:10 (5892): No heartbeat from core client for 30 sec - exiting
22:25:11 (5892): No heartbeat from core client for 30 sec - exiting
22:25:12 (5892): No heartbeat from core client for 30 sec - exiting
22:25:13 (5892): No heartbeat from core client for 30 sec - exiting
22:25:14 (5892): No heartbeat from core client for 30 sec - exiting
22:25:15 (5892): No heartbeat from core client for 30 sec - exiting
22:25:16 (5892): No heartbeat from core client for 30 sec - exiting
22:25:17 (5892): No heartbeat from core client for 30 sec - exiting
22:25:18 (5892): No heartbeat from core client for 30 sec - exiting
22:25:19 (5892): No heartbeat from core client for 30 sec - exiting
22:25:21 (5892): No heartbeat from core client for 30 sec - exiting
22:25:22 (5892): No heartbeat from core client for 30 sec - exiting
22:25:23 (5892): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=5332, selfPID=5332, iMonCtr=2
CPDN Monitor - Quit request from BOINC...
07:01:12 (2772): No heartbeat from core client for 30 sec - exiting
07:01:13 (2772): No heartbeat from core client for 30 sec - exiting
07:01:14 (2772): No heartbeat from core client for 30 sec - exiting
07:01:15 (2772): No heartbeat from core client for 30 sec - exiting
07:01:16 (2772): No heartbeat from core client for 30 sec - exiting
07:01:17 (2772): No heartbeat from core client for 30 sec - exiting
07:01:18 (2772): No heartbeat from core client for 30 sec - exiting
07:01:19 (2772): No heartbeat from core client for 30 sec - exiting
07:01:21 (2772): No heartbeat from core client for 30 sec - exiting
07:01:22 (2772): No heartbeat from core client for 30 sec - exiting
07:01:23 (2772): No heartbeat from core client for 30 sec - exiting
07:01:24 (2772): No heartbeat from core client for 30 sec - exiting
07:01:25 (2772): No heartbeat from core client for 30 sec - exiting
07:01:26 (2772): No heartbeat from core client for 30 sec - exiting
07:01:27 (2772): No heartbeat from core client for 30 sec - exiting
07:01:28 (2772): No heartbeat from core client for 30 sec - exiting
07:01:29 (2772): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
08:43:17 (1096): No heartbeat from core client for 30 sec - exiting
08:43:18 (1096): No heartbeat from core client for 30 sec - exiting
08:43:19 (1096): No heartbeat from core client for 30 sec - exiting
08:43:20 (1096): No heartbeat from core client for 30 sec - exiting
08:43:21 (1096): No heartbeat from core client for 30 sec - exiting
08:43:23 (1096): No heartbeat from core client for 30 sec - exiting
08:43:24 (1096): No heartbeat from core client for 30 sec - exiting
08:43:25 (1096): No heartbeat from core client for 30 sec - exiting
08:43:26 (1096): No heartbeat from core client for 30 sec - exiting
08:43:27 (1096): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
CPDN Monitor - Quit request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Quit request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=1608, selfPID=584, iMonCtr=1
Model crash detected, will try to restart...
CPDN Monitor - Quit request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...

Model crashed: READHIST: End of file in READ from history file for namelist NLIHISTO                                                                                                                                                                                           tmp/xaakm.pipe_dummy                                                            2048    
Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=7032, selfPID=7032, iMonCtr=2
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=7032, selfPID=5632, iMonCtr=1
Model crash detected, will try to restart...
Leaving CPDN_Main::Monitor...
Called boinc_finish

</stderr_txt>
<message>
upload failure: <file_xfer_error>
  <file_name>hadam3p_anz_d4vc_2012_1_009785565_0_8.zip</file_name>
  <error_code>-161</error_code>
</file_xfer_error>
<file_xfer_error>
  <file_name>hadam3p_anz_d4vc_2012_1_009785565_0_9.zip</file_name>
  <error_code>-161</error_code>
</file_xfer_error>
<file_xfer_error>
  <file_name>hadam3p_anz_d4vc_2012_1_009785565_0_10.zip</file_name>
  <error_code>-161</error_code>
</file_xfer_error>
<file_xfer_error>
  <file_name>hadam3p_anz_d4vc_2012_1_009785565_0_11.zip</file_name>
  <error_code>-161</error_code>
</file_xfer_error>
<file_xfer_error>
  <file_name>hadam3p_anz_d4vc_2012_1_009785565_0_12.zip</file_name>
  <error_code>-161</error_code>
</file_xfer_error>

</message>
]]>
Latest Trickles Received
Time Sent (UTC) Host ID Result ID Result Name Timestep CPU Time (sec) Average (sec/TS)
07 Jun 2015 20:21:18 1278732 18351266 hadam3p_anz_d4vc_2012_1_009785565_0 80,939 466,362 5.7619
25 May 2015 20:31:17 1278732 18351266 hadam3p_anz_d4vc_2012_1_009785565_0 69,419 415,535 5.9859
24 May 2015 08:14:32 1278732 18351266 hadam3p_anz_d4vc_2012_1_009785565_0 57,899 352,370 6.0859
24 May 2015 08:14:32 1278732 18351266 hadam3p_anz_d4vc_2012_1_009785565_0 46,379 290,050 6.2539
24 May 2015 08:14:32 1278732 18351266 hadam3p_anz_d4vc_2012_1_009785565_0 34,859 226,297 6.4918
24 May 2015 08:14:32 1278732 18351266 hadam3p_anz_d4vc_2012_1_009785565_0 23,339 162,226 6.9509
09 May 2015 17:03:11 1278732 18351266 hadam3p_anz_d4vc_2012_1_009785565_0 11,819 91,014 7.7007


©2024 cpdn.org