climateprediction.net home page
Task 13701865

Task 13701865

Name hadam3p_saf_7mud_2003_1_007586542_0
Workunit 7764672
Created 2 Dec 2011, 17:45:13 UTC
Sent 3 Dec 2011, 0:36:41 UTC
Report deadline 14 Nov 2012, 5:56:41 UTC
Received 9 Dec 2011, 15:07:55 UTC
Server state Over
Outcome Computation error
Client state Compute error
Exit status 0 (0x00000000)
Computer ID 1107482
Run time 14 hours 8 min 22 sec
CPU time 13 hours 40 min 37 sec
Validate state Invalid
Credit 375.31
Device peak FLOPS 1.43 GFLOPS
Application version UK Met Office HadAM3P-HadRM3P Southern Africa v6.09
windows_intelx86
Stderr
<core_client_version>6.12.34</core_client_version>
<![CDATA[
<stderr_txt>
Suspended CPDN Monitor - Suspend request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
10:46:18 (3164): No heartbeat from core client for 30 sec - exiting
10:46:19 (3164): No heartbeat from core client for 30 sec - exiting
10:46:20 (3164): No heartbeat from core client for 30 sec - exiting
10:46:21 (3164): No heartbeat from core client for 30 sec - exiting
10:46:22 (3164): No heartbeat from core client for 30 sec - exiting
10:46:23 (3164): No heartbeat from core client for 30 sec - exiting
10:46:24 (3164): No heartbeat from core client for 30 sec - exiting
10:46:25 (3164): No heartbeat from core client for 30 sec - exiting
10:46:26 (3164): No heartbeat from core client for 30 sec - exiting
10:46:27 (3164): No heartbeat from core client for 30 sec - exiting
10:46:28 (3164): No heartbeat from core client for 30 sec - exiting
10:46:29 (3164): No heartbeat from core client for 30 sec - exiting
10:46:30 (3164): No heartbeat from core client for 30 sec - exiting
10:46:31 (3164): No heartbeat from core client for 30 sec - exiting
10:46:32 (3164): No heartbeat from core client for 30 sec - exiting
10:46:33 (3164): No heartbeat from core client for 30 sec - exiting
10:46:34 (3164): No heartbeat from core client for 30 sec - exiting
10:46:35 (3164): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=5756, selfPID=5756, iMonCtr=2
CPDN Monitor - Quit request from BOINC...
23:49:13 (1352): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
23:57:28 (2116): No heartbeat from core client for 30 sec - exiting
23:57:29 (2116): No heartbeat from core client for 30 sec - exiting
23:57:30 (2116): No heartbeat from core client for 30 sec - exiting
23:57:31 (2116): No heartbeat from core client for 30 sec - exiting
23:57:32 (2116): No heartbeat from core client for 30 sec - exiting
23:57:33 (2116): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=2652, selfPID=3036, iMonCtr=1
Model crash detected, will try to restart...
Leaving CPDN_Main::Monitor...
Called boinc_finish
CPDN Monitor - Quit request from BOINC...

GCM: BUFFIN : Read Failed: No error
GCM : BUFFIN: C I/O Error feof - Unit 21 - Return code = 16
GCM : BUFFIN: C I/O Error feof - Unit 21 - Return code = 16


Model crashed: READDUMP: BAD BUFFIN OF DATA                                                                                                                                                                                                                                    tmp/xaakm.pipe_dummy                                                            2048    
Leaving CPDN_Main::Monitor...
Called boinc_finish

</stderr_txt>
<message>
upload failure: <file_xfer_error>
  <file_name>hadam3p_saf_7mud_2003_1_007586542_0_3.zip</file_name>
  <error_code>-161</error_code>
</file_xfer_error>
<file_xfer_error>
  <file_name>hadam3p_saf_7mud_2003_1_007586542_0_4.zip</file_name>
  <error_code>-161</error_code>
</file_xfer_error>
<file_xfer_error>
  <file_name>hadam3p_saf_7mud_2003_1_007586542_0_5.zip</file_name>
  <error_code>-161</error_code>
</file_xfer_error>
<file_xfer_error>
  <file_name>hadam3p_saf_7mud_2003_1_007586542_0_6.zip</file_name>
  <error_code>-161</error_code>
</file_xfer_error>
<file_xfer_error>
  <file_name>hadam3p_saf_7mud_2003_1_007586542_0_7.zip</file_name>
  <error_code>-161</error_code>
</file_xfer_error>
<file_xfer_error>
  <file_name>hadam3p_saf_7mud_2003_1_007586542_0_8.zip</file_name>
  <error_code>-161</error_code>
</file_xfer_error>
<file_xfer_error>
  <file_name>hadam3p_saf_7mud_2003_1_007586542_0_9.zip</file_name>
  <error_code>-161</error_code>
</file_xfer_error>
<file_xfer_error>
  <file_name>hadam3p_saf_7mud_2003_1_007586542_0_10.zip</file_name>
  <error_code>-161</error_code>
</file_xfer_error>
<file_xfer_error>
  <file_name>hadam3p_saf_7mud_2003_1_007586542_0_11.zip</file_name>
  <error_code>-161</error_code>
</file_xfer_error>
<file_xfer_error>
  <file_name>hadam3p_saf_7mud_2003_1_007586542_0_12.zip</file_name>
  <error_code>-161</error_code>
</file_xfer_error>

</message>
]]>
Latest Trickles Received
Time Sent (UTC) Host ID Result ID Result Name Timestep CPU Time (sec) Average (sec/TS)
09 Dec 2011 12:23:33 1107482 13701865 hadam3p_saf_7mud_2003_1_007586542_0 23,136 48,777 2.1083
06 Dec 2011 12:51:03 1107482 13701865 hadam3p_saf_7mud_2003_1_007586542_0 11,616 25,157 2.1657


©2024 cpdn.org