Task 17529520


Name hadam3p_anz_f1a9_2012_1_009265096_0
Workunit 9358012
Created 1 Dec 2014, 15:49:23 UTC
Sent 3 Dec 2014, 13:49:27 UTC
Report deadline 15 Nov 2015, 19:09:27 UTC
Received 10 Dec 2014, 21:39:58 UTC
Server state Over
Outcome Computation error
Client state Compute error
Exit status 0 (0x00000000)
Computer ID 1326297
Run time 5 days 23 hours 3 min 26 sec
CPU time 5 days 18 hours 40 min 49 sec
Validate state Invalid
Credit 4,484.28
Device peak FLOPS 3.00 GFLOPS
Application version UK Met Office HadAM3P-HadRM3P Australia New Zealand v6.10
windows_intelx86
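The run time and CPU time above are given as day/hour/minute/second strings, which makes them awkward to compare directly. The following is a minimal sketch of converting them to seconds and taking their ratio. The helper name `duration_to_seconds` and the unit table are illustrative assumptions, not part of BOINC or CPDN, and the pattern only handles the plural unit words that appear on this page.

```python
import re

# Seconds per unit, keyed by the unit words used on this task page.
# Assumption: singular forms such as "1 day" are not handled here.
UNIT_SECONDS = {"days": 86400, "hours": 3600, "min": 60, "sec": 1}


def duration_to_seconds(text: str) -> int:
    """Convert a string such as '5 days 23 hours 3 min 26 sec' to seconds."""
    total = 0
    for value, unit in re.findall(r"(\d+)\s+(days|hours|min|sec)", text):
        total += int(value) * UNIT_SECONDS[unit]
    return total


# Values copied from the "Run time" and "CPU time" fields above.
run_time = duration_to_seconds("5 days 23 hours 3 min 26 sec")
cpu_time = duration_to_seconds("5 days 18 hours 40 min 49 sec")

# Fraction of wall-clock run time that was spent on the CPU.
cpu_fraction = cpu_time / run_time
print(run_time, cpu_time, round(cpu_fraction, 3))
```

A ratio close to 1 suggests the task was not heavily starved of CPU while it was running; it says nothing about the cause of the computation error.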
Stderr
<core_client_version>7.4.27</core_client_version>
<![CDATA[
<stderr_txt>
10:11:33 (2732): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
10:25:55 (5448): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
10:26:47 (2800): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
10:27:29 (5380): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
10:29:00 (6844): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
10:30:20 (1524): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=4968, selfPID=4968, iMonCtr=2
10:32:20 (4184): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
10:33:39 (2116): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
10:35:12 (4156): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
10:35:57 (2924): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
03:19:55 (2600): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
03:20:53 (1416): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=7136, selfPID=7136, iMonCtr=2
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=7356, selfPID=7356, iMonCtr=2
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=7152, selfPID=5376, iMonCtr=1
Model crash detected, will try to restart...
13:23:51 (6776): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
13:25:08 (8148): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=7812, selfPID=7812, iMonCtr=2
13:25:09 (8148): No heartbeat from core client for 30 sec - exiting
13:25:10 (8148): No heartbeat from core client for 30 sec - exiting
13:25:11 (8148): No heartbeat from core client for 30 sec - exiting
13:25:12 (8148): No heartbeat from core client for 30 sec - exiting
13:25:13 (8148): No heartbeat from core client for 30 sec - exiting
13:25:14 (8148): No heartbeat from core client for 30 sec - exiting
13:25:15 (8148): No heartbeat from core client for 30 sec - exiting
13:25:16 (8148): No heartbeat from core client for 30 sec - exiting
13:25:17 (8148): No heartbeat from core client for 30 sec - exiting
13:25:18 (8148): No heartbeat from core client for 30 sec - exiting
13:25:57 (7112): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
14:06:10 (3736): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
18:13:24 (988): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
19:28:52 (6956): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
12:11:53 (7732): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
CPDN Monitor - Quit request from BOINC...
03:58:27 (3012): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=6636, selfPID=6636, iMonCtr=2
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=7732, selfPID=7732, iMonCtr=2
CPDN Monitor - Quit request from BOINC...
16:05:10 (3568): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
14:57:15 (2356): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
15:06:00 (2596): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
15:07:35 (8692): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
15:09:50 (7264): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
15:11:34 (7200): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
15:15:06 (5676): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
15:16:12 (6764): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
15:16:54 (7668): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=2024, selfPID=6608, iMonCtr=1
Model crash detected, will try to restart...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=6012, selfPID=6536, iMonCtr=1
Model crash detected, will try to restart...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=3068, selfPID=3424, iMonCtr=1
Model crash detected, will try to restart...
02:44:38 (5564): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
02:46:02 (1452): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
02:56:33 (5048): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
03:01:57 (5280): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
03:01:58 (5280): No heartbeat from core client for 30 sec - exiting
03:01:59 (5280): No heartbeat from core client for 30 sec - exiting
03:02:00 (5280): No heartbeat from core client for 30 sec - exiting
03:02:01 (5280): No heartbeat from core client for 30 sec - exiting
03:02:02 (5280): No heartbeat from core client for 30 sec - exiting
03:02:03 (5280): No heartbeat from core client for 30 sec - exiting
03:03:09 (2144): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
03:05:00 (6524): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
03:07:03 (260): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
03:08:37 (3304): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
03:12:58 (4372): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
CPDN Monitor - Quit request from BOINC...
06:56:12 (1776): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
06:56:17 (1776): No heartbeat from core client for 30 sec - exiting
06:56:18 (1776): No heartbeat from core client for 30 sec - exiting
06:56:19 (1776): No heartbeat from core client for 30 sec - exiting
06:56:20 (1776): No heartbeat from core client for 30 sec - exiting
06:56:24 (1776): No heartbeat from core client for 30 sec - exiting
06:56:25 (1776): No heartbeat from core client for 30 sec - exiting
06:58:11 (6896): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
CPDN Monitor - Quit request from BOINC...
07:54:19 (1936): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
07:55:20 (6848): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
08:39:11 (2176): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
10:14:22 (1512): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
CPDN Monitor - Quit request from BOINC...
RCM : BUFFIN: C I/O Error feof - Unit 61 - Return code = 16
RCM : BUFFIN: C I/O Error feof - Unit 61 - Return code = 16

Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=5596, selfPID=5248, iMonCtr=1
Model crash detected, will try to restart...
Leaving CPDN_Main::Monitor...
RCM : BUFFIN: C I/O Error feof - Unit 61 - Return code = 16
RCM : BUFFIN: C I/O Error feof - Unit 61 - Return code = 16

GCM : BUFFIN: C I/O Error feof - Unit 61 - Return code = 16
GCM : BUFFIN: C I/O Error feof - Unit 61 - Return code = 16


Model crashed: STWORK  : I/O error - PP fixed length header                                                                                                                                                                                                                    tmp/xaakm.pipe_dummy                                                            2048    
Leaving CPDN_Main::Monitor...
Called boinc_finish

</stderr_txt>
<message>
upload failure: <file_xfer_error>
  <file_name>hadam3p_anz_f1a9_2012_1_009265096_0_10.zip</file_name>
  <error_code>-161 (not found)</error_code>
</file_xfer_error>
<file_xfer_error>
  <file_name>hadam3p_anz_f1a9_2012_1_009265096_0_11.zip</file_name>
  <error_code>-161 (not found)</error_code>
</file_xfer_error>
<file_xfer_error>
  <file_name>hadam3p_anz_f1a9_2012_1_009265096_0_12.zip</file_name>
  <error_code>-161 (not found)</error_code>
</file_xfer_error>

</message>
]]>
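The stderr output above repeats a small number of message types many times, so a tally can be easier to read than the raw log. The following is a rough sketch of such a tally over the text between the `<stderr_txt>` tags. The function name `tally_events` and the labels are hypothetical names chosen for illustration; the substrings are copied from the log lines above. The "CPDN Monitor - No 'heartbeat' from BOINC..." line is deliberately left unmatched so that each heartbeat loss is not counted twice.

```python
from collections import Counter

# Substring to look for -> label to count it under.
# The labels are illustrative names, not CPDN or BOINC terminology.
PATTERNS = {
    "No heartbeat from core client": "heartbeat_lost",
    "Quit request from BOINC": "quit_request",
    "Model crash detected": "model_restart",
    "BUFFIN: C I/O Error": "io_error",
}


def tally_events(stderr_text: str) -> Counter:
    """Count how many lines match each known message type."""
    counts = Counter()
    for line in stderr_text.splitlines():
        for needle, label in PATTERNS.items():
            if needle in line:
                counts[label] += 1
                break  # one label per line
    return counts


# A short sample built from lines that appear in the log above.
sample = """\
10:11:33 (2732): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
CPDN Monitor - Quit request from BOINC...
Model crash detected, will try to restart...
RCM : BUFFIN: C I/O Error feof - Unit 61 - Return code = 16
"""

counts = tally_events(sample)
for label, n in sorted(counts.items()):
    print(label, n)
```

To tally the whole task, pass the full stderr text to `tally_events` instead of the five-line sample.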
Latest Trickles Received
Time Sent (UTC)       Host ID  Result ID  Result Name                          Timestep  CPU Time (sec)  Average (sec/TS)
10 Dec 2014 05:20:40  1326297  17529520   hadam3p_anz_f1a9_2012_1_009265096_0   103,979         465,042            4.4725
09 Dec 2014 07:18:08  1326297  17529520   hadam3p_anz_f1a9_2012_1_009265096_0    92,459         414,047            4.4782
08 Dec 2014 09:08:26  1326297  17529520   hadam3p_anz_f1a9_2012_1_009265096_0    80,939         362,483            4.4785
07 Dec 2014 18:33:53  1326297  17529520   hadam3p_anz_f1a9_2012_1_009265096_0    69,419         311,070            4.4810
07 Dec 2014 02:58:45  1326297  17529520   hadam3p_anz_f1a9_2012_1_009265096_0    57,899         258,610            4.4666
06 Dec 2014 08:52:18  1326297  17529520   hadam3p_anz_f1a9_2012_1_009265096_0    46,379         207,680            4.4779
05 Dec 2014 15:14:20  1326297  17529520   hadam3p_anz_f1a9_2012_1_009265096_0    34,859         156,106            4.4782
04 Dec 2014 21:53:49  1326297  17529520   hadam3p_anz_f1a9_2012_1_009265096_0    23,339         104,237            4.4662
04 Dec 2014 05:41:39  1326297  17529520   hadam3p_anz_f1a9_2012_1_009265096_0    11,819          52,858            4.4723
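The "Average (sec/TS)" column in the trickle table appears to be the cumulative CPU time divided by the timestep reached. The sketch below recomputes it from the other two columns and also checks the spacing between trickles. The tuple layout and variable names are assumptions made for illustration; the numbers are copied from the table, newest first.

```python
# (timestep, cumulative CPU seconds, reported average sec/TS), newest first.
trickles = [
    (103979, 465042, 4.4725),
    (92459, 414047, 4.4782),
    (80939, 362483, 4.4785),
    (69419, 311070, 4.4810),
    (57899, 258610, 4.4666),
    (46379, 207680, 4.4779),
    (34859, 156106, 4.4782),
    (23339, 104237, 4.4662),
    (11819, 52858, 4.4723),
]

# Recompute the average as CPU seconds per timestep and compare it
# with the reported value, allowing for rounding to four decimals.
mismatches = []
for timestep, cpu_sec, reported in trickles:
    computed = cpu_sec / timestep
    if abs(computed - reported) >= 1e-4:
        mismatches.append((timestep, computed, reported))

# Timestep gap between each trickle and the one before it.
gaps = {newer[0] - older[0] for newer, older in zip(trickles, trickles[1:])}

print("mismatches:", mismatches)
print("timestep gaps:", gaps)
```

On these rows the recomputed averages agree with the reported column, and every trickle is 11,520 timesteps after the previous one.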

