climateprediction.net home page
Task 16976163

Task 16976163

Name hadam3p_anz_rq39_2012_1_008960392_2
Workunit 9104567
Created 5 Sep 2014, 20:18:30 UTC
Sent 5 Sep 2014, 20:36:58 UTC
Report deadline 19 Aug 2015, 1:56:58 UTC
Received 10 Sep 2014, 13:27:19 UTC
Server state Over
Outcome Computation error
Client state Compute error
Exit status 0 (0x00000000)
Computer ID 1316407
Run time 1 days 20 hours 1 min 38 sec
CPU time 1 days 16 hours 26 min 18 sec
Validate state Invalid
Credit 1,006.54
Device peak FLOPS 2.85 GFLOPS
Application version UK Met Office HadAM3P-HadRM3P Australia New Zealand v6.10
windows_intelx86
Stderr
<core_client_version>7.2.42</core_client_version>
<![CDATA[
<stderr_txt>
CPDN Monitor - Quit request from BOINC...
Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=9148, selfPID=9148, iMonCtr=2
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=6256, selfPID=6256, iMonCtr=2
Suspended CPDN Monitor - Suspend request from BOINC...
07:08:58 (10032): No heartbeat from core client for 30 sec - exiting
07:08:59 (10032): No heartbeat from core client for 30 sec - exiting
07:09:00 (10032): No heartbeat from core client for 30 sec - exiting
07:09:01 (10032): No heartbeat from core client for 30 sec - exiting
07:09:02 (10032): No heartbeat from core client for 30 sec - exiting
07:09:03 (10032): No heartbeat from core client for 30 sec - exiting
07:09:04 (10032): No heartbeat from core client for 30 sec - exiting
07:09:05 (10032): No heartbeat from core client for 30 sec - exiting
07:09:06 (10032): No heartbeat from core client for 30 sec - exiting
07:09:07 (10032): No heartbeat from core client for 30 sec - exiting
07:09:08 (10032): No heartbeat from core client for 30 sec - exiting
07:09:09 (10032): No heartbeat from core client for 30 sec - exiting
07:09:10 (10032): No heartbeat from core client for 30 sec - exiting
07:09:11 (10032): No heartbeat from core client for 30 sec - exiting
07:09:12 (10032): No heartbeat from core client for 30 sec - exiting
07:09:13 (10032): No heartbeat from core client for 30 sec - exiting
07:09:14 (10032): No heartbeat from core client for 30 sec - exiting
07:09:15 (10032): No heartbeat from core client for 30 sec - exiting
07:09:16 (10032): No heartbeat from core client for 30 sec - exiting
07:09:17 (10032): No heartbeat from core client for 30 sec - exiting
07:09:18 (10032): No heartbeat from core client for 30 sec - exiting
07:09:19 (10032): No heartbeat from core client for 30 sec - exiting
07:09:20 (10032): No heartbeat from core client for 30 sec - exiting
07:09:21 (10032): No heartbeat from core client for 30 sec - exiting
07:09:22 (10032): No heartbeat from core client for 30 sec - exiting
07:09:23 (10032): No heartbeat from core client for 30 sec - exiting
07:09:24 (10032): No heartbeat from core client for 30 sec - exiting
07:09:25 (10032): No heartbeat from core client for 30 sec - exiting
07:09:26 (10032): No heartbeat from core client for 30 sec - exiting
07:09:27 (10032): No heartbeat from core client for 30 sec - exiting
07:09:28 (10032): No heartbeat from core client for 30 sec - exiting
07:09:29 (10032): No heartbeat from core client for 30 sec - exiting
07:09:30 (10032): No heartbeat from core client for 30 sec - exiting
07:09:31 (10032): No heartbeat from core client for 30 sec - exiting
07:09:32 (10032): No heartbeat from core client for 30 sec - exiting
07:09:33 (10032): No heartbeat from core client for 30 sec - exiting
07:09:34 (10032): No heartbeat from core client for 30 sec - exiting
07:09:35 (10032): No heartbeat from core client for 30 sec - exiting
07:09:36 (10032): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
19:34:11 (5136): No heartbeat from core client for 30 sec - exiting
19:34:12 (5136): No heartbeat from core client for 30 sec - exiting
19:34:13 (5136): No heartbeat from core client for 30 sec - exiting
19:34:14 (5136): No heartbeat from core client for 30 sec - exiting
19:34:15 (5136): No heartbeat from core client for 30 sec - exiting
19:34:16 (5136): No heartbeat from core client for 30 sec - exiting
19:34:18 (5136): No heartbeat from core client for 30 sec - exiting
19:34:19 (5136): No heartbeat from core client for 30 sec - exiting
19:34:20 (5136): No heartbeat from core client for 30 sec - exiting
19:34:21 (5136): No heartbeat from core client for 30 sec - exiting
19:34:22 (5136): No heartbeat from core client for 30 sec - exiting
19:34:23 (5136): No heartbeat from core client for 30 sec - exiting
19:34:24 (5136): No heartbeat from core client for 30 sec - exiting
19:34:25 (5136): No heartbeat from core client for 30 sec - exiting
19:34:26 (5136): No heartbeat from core client for 30 sec - exiting
19:34:27 (5136): No heartbeat from core client for 30 sec - exiting
19:34:28 (5136): No heartbeat from core client for 30 sec - exiting
19:34:29 (5136): No heartbeat from core client for 30 sec - exiting
19:34:30 (5136): No heartbeat from core client for 30 sec - exiting
19:34:31 (5136): No heartbeat from core client for 30 sec - exiting
19:34:32 (5136): No heartbeat from core client for 30 sec - exiting
19:34:33 (5136): No heartbeat from core client for 30 sec - exiting
19:34:34 (5136): No heartbeat from core client for 30 sec - exiting
19:34:35 (5136): No heartbeat from core client for 30 sec - exiting
19:34:36 (5136): No heartbeat from core client for 30 sec - exiting
19:34:37 (5136): No heartbeat from core client for 30 sec - exiting
19:34:38 (5136): No heartbeat from core client for 30 sec - exiting
19:34:39 (5136): No heartbeat from core client for 30 sec - exiting
19:34:40 (5136): No heartbeat from core client for 30 sec - exiting
19:34:41 (5136): No heartbeat from core client for 30 sec - exiting
19:34:42 (5136): No heartbeat from core client for 30 sec - exiting
19:34:44 (5136): No heartbeat from core client for 30 sec - exiting
19:34:46 (5136): No heartbeat from core client for 30 sec - exiting
19:34:47 (5136): No heartbeat from core client for 30 sec - exiting
19:34:48 (5136): No heartbeat from core client for 30 sec - exiting
19:34:49 (5136): No heartbeat from core client for 30 sec - exiting
19:34:50 (5136): No heartbeat from core client for 30 sec - exiting
19:34:51 (5136): No heartbeat from core client for 30 sec - exiting
19:34:52 (5136): No heartbeat from core client for 30 sec - exiting
19:34:53 (5136): No heartbeat from core client for 30 sec - exiting
19:34:55 (5136): No heartbeat from core client for 30 sec - exiting
19:34:56 (5136): No heartbeat from core client for 30 sec - exiting
19:34:57 (5136): No heartbeat from core client for 30 sec - exiting
19:34:58 (5136): No heartbeat from core client for 30 sec - exiting
19:34:59 (5136): No heartbeat from core client for 30 sec - exiting
19:35:00 (5136): No heartbeat from core client for 30 sec - exiting
19:35:01 (5136): No heartbeat from core client for 30 sec - exiting
19:35:02 (5136): No heartbeat from core client for 30 sec - exiting

Model crashed: READHIST: End of file in READ from history file for namelist NLIHISTO                                                                                                                                                                                           tmp/xaakm.pipe_dummy                                                            2048    
Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=0, iMonCtr=2
CPDN Monitor - No 'heartbeat' from BOINC...
19:35:29 (9576): Can't acquire lockfile (32) - waiting 35s

Model crashed: READHIST: End of file in READ from history file for namelist NLIHISTO                                                                                                                                                                                           tmp/xaakm.pipe_dummy                                                            2048    
Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=0, iMonCtr=2
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=4308, selfPID=9576, iMonCtr=1
Model crash detected, will try to restart...
Leaving CPDN_Main::Monitor...
Called boinc_finish

</stderr_txt>
<message>
upload failure: <file_xfer_error>
  <file_name>hadam3p_anz_rq39_2012_1_008960392_2_3.zip</file_name>
  <error_code>-161 (not found)</error_code>
</file_xfer_error>
<file_xfer_error>
  <file_name>hadam3p_anz_rq39_2012_1_008960392_2_4.zip</file_name>
  <error_code>-161 (not found)</error_code>
</file_xfer_error>
<file_xfer_error>
  <file_name>hadam3p_anz_rq39_2012_1_008960392_2_5.zip</file_name>
  <error_code>-161 (not found)</error_code>
</file_xfer_error>
<file_xfer_error>
  <file_name>hadam3p_anz_rq39_2012_1_008960392_2_6.zip</file_name>
  <error_code>-161 (not found)</error_code>
</file_xfer_error>
<file_xfer_error>
  <file_name>hadam3p_anz_rq39_2012_1_008960392_2_7.zip</file_name>
  <error_code>-161 (not found)</error_code>
</file_xfer_error>
<file_xfer_error>
  <file_name>hadam3p_anz_rq39_2012_1_008960392_2_8.zip</file_name>
  <error_code>-161 (not found)</error_code>
</file_xfer_error>
<file_xfer_error>
  <file_name>hadam3p_anz_rq39_2012_1_008960392_2_9.zip</file_name>
  <error_code>-161 (not found)</error_code>
</file_xfer_error>
<file_xfer_error>
  <file_name>hadam3p_anz_rq39_2012_1_008960392_2_10.zip</file_name>
  <error_code>-161 (not found)</error_code>
</file_xfer_error>
<file_xfer_error>
  <file_name>hadam3p_anz_rq39_2012_1_008960392_2_11.zip</file_name>
  <error_code>-161 (not found)</error_code>
</file_xfer_error>
<file_xfer_error>
  <file_name>hadam3p_anz_rq39_2012_1_008960392_2_12.zip</file_name>
  <error_code>-161 (not found)</error_code>
</file_xfer_error>

</message>
]]>
Latest Trickles Received
Time Sent (UTC) Host ID Result ID Result Name Timestep CPU Time (sec) Average (sec/TS)
08 Sep 2014 07:14:53 1316407 16976163 hadam3p_anz_rq39_2012_1_008960392_2 23,339 107,507 4.6063
07 Sep 2014 09:33:39 1316407 16976163 hadam3p_anz_rq39_2012_1_008960392_2 11,819 53,811 4.5529


©2024 cpdn.org