Task 12551480

Name hadam3p_saf_255y_1981_1_007038782_1
Workunit 7242098
Created 1 Feb 2011, 17:00:53 UTC
Sent 16 Feb 2011, 8:43:29 UTC
Report deadline 29 Jan 2012, 14:03:29 UTC
Received 8 Mar 2011, 9:58:08 UTC
Server state Over
Outcome No reply
Client state Compute error
Exit status 0 (0x00000000)
Computer ID 1119858
Run time 23 hours 42 min 53 sec
CPU time 17 hours 19 min 27 sec
Validate state Invalid
Credit 375.31
Device peak FLOPS 2.92 GFLOPS
Application version UK Met Office HadAM3P-HadRM3P Southern Africa v6.08 (windows_intelx86)
Stderr
<core_client_version>6.10.58</core_client_version>
<![CDATA[
<stderr_txt>
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=4040, selfPID=4416, iMonCtr=1
Model crash detected, will try to restart...
Leaving CPDN_Main::Monitor...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=1476, selfPID=1476, iMonCtr=2
09:55:08 (3892): No heartbeat from core client for 30 sec - exiting
09:55:09 (3892): No heartbeat from core client for 30 sec - exiting
09:55:10 (3892): No heartbeat from core client for 30 sec - exiting
09:55:11 (3892): No heartbeat from core client for 30 sec - exiting
09:55:12 (3892): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=3928, selfPID=3928, iMonCtr=2
CPDN Monitor - Quit request from BOINC...
09:35:05 (3992): No heartbeat from core client for 30 sec - exiting
09:35:06 (3992): No heartbeat from core client for 30 sec - exiting
09:35:08 (3992): No heartbeat from core client for 30 sec - exiting
09:35:09 (3992): No heartbeat from core client for 30 sec - exiting
09:35:10 (3992): No heartbeat from core client for 30 sec - exiting
09:35:11 (3992): No heartbeat from core client for 30 sec - exiting
09:35:12 (3992): No heartbeat from core client for 30 sec - exiting
09:35:13 (3992): No heartbeat from core client for 30 sec - exiting
09:35:14 (3992): No heartbeat from core client for 30 sec - exiting
09:35:15 (3992): No heartbeat from core client for 30 sec - exiting
09:35:16 (3992): No heartbeat from core client for 30 sec - exiting
09:35:17 (3992): No heartbeat from core client for 30 sec - exiting
09:35:18 (3992): No heartbeat from core client for 30 sec - exiting
09:35:20 (3992): No heartbeat from core client for 30 sec - exiting
09:35:21 (3992): No heartbeat from core client for 30 sec - exiting
09:35:22 (3992): No heartbeat from core client for 30 sec - exiting
09:35:23 (3992): No heartbeat from core client for 30 sec - exiting
09:35:24 (3992): No heartbeat from core client for 30 sec - exiting
09:35:25 (3992): No heartbeat from core client for 30 sec - exiting
09:35:26 (3992): No heartbeat from core client for 30 sec - exiting
09:35:27 (3992): No heartbeat from core client for 30 sec - exiting
09:35:28 (3992): No heartbeat from core client for 30 sec - exiting
09:35:29 (3992): No heartbeat from core client for 30 sec - exiting
09:35:30 (3992): No heartbeat from core client for 30 sec - exiting
09:35:32 (3992): No heartbeat from core client for 30 sec - exiting
09:35:33 (3992): No heartbeat from core client for 30 sec - exiting
09:35:34 (3992): No heartbeat from core client for 30 sec - exiting
09:35:35 (3992): No heartbeat from core client for 30 sec - exiting
09:35:36 (3992): No heartbeat from core client for 30 sec - exiting
09:35:37 (3992): No heartbeat from core client for 30 sec - exiting
09:35:38 (3992): No heartbeat from core client for 30 sec - exiting
09:35:39 (3992): No heartbeat from core client for 30 sec - exiting
09:35:40 (3992): No heartbeat from core client for 30 sec - exiting
09:35:41 (3992): No heartbeat from core client for 30 sec - exiting
09:35:42 (3992): No heartbeat from core client for 30 sec - exiting
09:35:44 (3992): No heartbeat from core client for 30 sec - exiting
09:35:45 (3992): No heartbeat from core client for 30 sec - exiting
09:35:46 (3992): No heartbeat from core client for 30 sec - exiting
09:35:48 (3992): No heartbeat from core client for 30 sec - exiting
09:35:49 (3992): No heartbeat from core client for 30 sec - exiting
09:35:50 (3992): No heartbeat from core client for 30 sec - exiting
09:35:51 (3992): No heartbeat from core client for 30 sec - exiting
09:35:52 (3992): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 0, checkPID=0, selfPID=0, iMonCtr=0
Model crash detected, will try to restart...
10:11:19 (1344): No heartbeat from core client for 30 sec - exiting
10:11:26 (1344): No heartbeat from core client for 30 sec - exiting
10:11:27 (1344): No heartbeat from core client for 30 sec - exiting
10:11:28 (1344): No heartbeat from core client for 30 sec - exiting
10:11:30 (1344): No heartbeat from core client for 30 sec - exiting
10:11:31 (1344): No heartbeat from core client for 30 sec - exiting
10:11:32 (1344): No heartbeat from core client for 30 sec - exiting
10:11:33 (1344): No heartbeat from core client for 30 sec - exiting
10:11:34 (1344): No heartbeat from core client for 30 sec - exiting
10:11:35 (1344): No heartbeat from core client for 30 sec - exiting
10:11:36 (1344): No heartbeat from core client for 30 sec - exiting
10:11:37 (1344): No heartbeat from core client for 30 sec - exiting
10:11:39 (1344): No heartbeat from core client for 30 sec - exiting
10:11:40 (1344): No heartbeat from core client for 30 sec - exiting
10:11:41 (1344): No heartbeat from core client for 30 sec - exiting
10:11:42 (1344): No heartbeat from core client for 30 sec - exiting
10:11:43 (1344): No heartbeat from core client for 30 sec - exiting
10:11:45 (1344): No heartbeat from core client for 30 sec - exiting
10:11:46 (1344): No heartbeat from core client for 30 sec - exiting
10:11:47 (1344): No heartbeat from core client for 30 sec - exiting
10:11:48 (1344): No heartbeat from core client for 30 sec - exiting
10:11:49 (1344): No heartbeat from core client for 30 sec - exiting
10:11:50 (1344): No heartbeat from core client for 30 sec - exiting
10:11:51 (1344): No heartbeat from core client for 30 sec - exiting
10:11:52 (1344): No heartbeat from core client for 30 sec - exiting
10:11:53 (1344): No heartbeat from core client for 30 sec - exiting
10:11:54 (1344): No heartbeat from core client for 30 sec - exiting
10:11:55 (1344): No heartbeat from core client for 30 sec - exiting
10:11:57 (1344): No heartbeat from core client for 30 sec - exiting
10:11:58 (1344): No heartbeat from core client for 30 sec - exiting
10:11:59 (1344): No heartbeat from core client for 30 sec - exiting
10:12:00 (1344): No heartbeat from core client for 30 sec - exiting
10:12:01 (1344): No heartbeat from core client for 30 sec - exiting
10:12:02 (1344): No heartbeat from core client for 30 sec - exiting
10:12:03 (1344): No heartbeat from core client for 30 sec - exiting
10:12:04 (1344): No heartbeat from core client for 30 sec - exiting
10:12:05 (1344): No heartbeat from core client for 30 sec - exiting
10:12:06 (1344): No heartbeat from core client for 30 sec - exiting
10:12:07 (1344): No heartbeat from core client for 30 sec - exiting
10:12:09 (1344): No heartbeat from core client for 30 sec - exiting
10:12:10 (1344): No heartbeat from core client for 30 sec - exiting
10:12:11 (1344): No heartbeat from core client for 30 sec - exiting
10:12:12 (1344): No heartbeat from core client for 30 sec - exiting
10:12:13 (1344): No heartbeat from core client for 30 sec - exiting
10:12:14 (1344): No heartbeat from core client for 30 sec - exiting
10:12:15 (1344): No heartbeat from core client for 30 sec - exiting
10:12:16 (1344): No heartbeat from core client for 30 sec - exiting
10:12:17 (1344): No heartbeat from core client for 30 sec - exiting
10:12:18 (1344): No heartbeat from core client for 30 sec - exiting
10:12:19 (1344): No heartbeat from core client for 30 sec - exiting
10:12:21 (1344): No heartbeat from core client for 30 sec - exiting
10:12:22 (1344): No heartbeat from core client for 30 sec - exiting
10:12:23 (1344): No heartbeat from core client for 30 sec - exiting
10:12:24 (1344): No heartbeat from core client for 30 sec - exiting
10:12:25 (1344): No heartbeat from core client for 30 sec - exiting
10:12:26 (1344): No heartbeat from core client for 30 sec - exiting
10:12:27 (1344): No heartbeat from core client for 30 sec - exiting
10:12:28 (1344): No heartbeat from core client for 30 sec - exiting

Model crashed: READHIST: End of file in READ from history file for namelist NLIHISTO                                                                                                                                                                                           tmp/xaakg.pipe_dummy                                                            2048    
CPDN Monitor - No 'heartbeat' from BOINC...

Model crashed: READHIST: End of file in READ from history file for namelist NLIHISTO                                                                                                                                                                                           tmp/xaakg.pipe_dummy                                                            2048    
Leaving CPDN_Main::Monitor...
10:13:25 (2204): called boinc_finish

</stderr_txt>
<message>
<file_xfer_error>
  <file_name>hadam3p_saf_255y_1981_1_007038782_1_3.zip</file_name>
  <error_code>-161</error_code>
</file_xfer_error>
<file_xfer_error>
  <file_name>hadam3p_saf_255y_1981_1_007038782_1_4.zip</file_name>
  <error_code>-161</error_code>
</file_xfer_error>
<file_xfer_error>
  <file_name>hadam3p_saf_255y_1981_1_007038782_1_5.zip</file_name>
  <error_code>-161</error_code>
</file_xfer_error>
<file_xfer_error>
  <file_name>hadam3p_saf_255y_1981_1_007038782_1_6.zip</file_name>
  <error_code>-161</error_code>
</file_xfer_error>
<file_xfer_error>
  <file_name>hadam3p_saf_255y_1981_1_007038782_1_7.zip</file_name>
  <error_code>-161</error_code>
</file_xfer_error>
<file_xfer_error>
  <file_name>hadam3p_saf_255y_1981_1_007038782_1_8.zip</file_name>
  <error_code>-161</error_code>
</file_xfer_error>
<file_xfer_error>
  <file_name>hadam3p_saf_255y_1981_1_007038782_1_9.zip</file_name>
  <error_code>-161</error_code>
</file_xfer_error>
<file_xfer_error>
  <file_name>hadam3p_saf_255y_1981_1_007038782_1_10.zip</file_name>
  <error_code>-161</error_code>
</file_xfer_error>
<file_xfer_error>
  <file_name>hadam3p_saf_255y_1981_1_007038782_1_11.zip</file_name>
  <error_code>-161</error_code>
</file_xfer_error>
<file_xfer_error>
  <file_name>hadam3p_saf_255y_1981_1_007038782_1_12.zip</file_name>
  <error_code>-161</error_code>
</file_xfer_error>

</message>
]]>
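The `<message>` block above lists one `<file_xfer_error>` entry per result file that failed to transfer. As a minimal sketch, the entries can be pulled out with a regular expression; the helper name `parse_xfer_errors` and the two-entry excerpt are illustrative assumptions, and the page itself does not state what error code -161 means.

```python
import re

# Excerpt of the <message> block above (two of the ten entries, copied verbatim).
message = """
<file_xfer_error>
  <file_name>hadam3p_saf_255y_1981_1_007038782_1_3.zip</file_name>
  <error_code>-161</error_code>
</file_xfer_error>
<file_xfer_error>
  <file_name>hadam3p_saf_255y_1981_1_007038782_1_4.zip</file_name>
  <error_code>-161</error_code>
</file_xfer_error>
"""

# Matches one complete <file_xfer_error> entry and captures the name and code.
PATTERN = re.compile(
    r"<file_xfer_error>\s*"
    r"<file_name>(.*?)</file_name>\s*"
    r"<error_code>(-?\d+)</error_code>\s*"
    r"</file_xfer_error>"
)

def parse_xfer_errors(text):
    """Return a list of (file_name, error_code) tuples (hypothetical helper)."""
    return [(name, int(code)) for name, code in PATTERN.findall(text)]

errors = parse_xfer_errors(message)
for name, code in errors:
    print(f"{name}: error {code}")
```

Run against the full block, the same pattern would yield ten entries, for the files numbered `_3.zip` through `_12.zip`.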
Latest Trickles Received
Time Sent (UTC)       Host ID  Result ID  Result Name                          Timestep  CPU Time (sec)  Average (sec/TS)
08 Mar 2011 10:05:33  1119858  12551480   hadam3p_saf_255y_1981_1_007038782_1  23,136    38,482          1.6633
25 Feb 2011 14:43:19  1119858  12551480   hadam3p_saf_255y_1981_1_007038782_1  11,616    13,851          1.1924
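
The Average (sec/TS) column appears to be the CPU Time divided by the Timestep count. The sketch below checks that reading against the two trickle rows above; the function name `avg_sec_per_timestep` is an illustrative assumption, not part of the site.

```python
# Rows copied from the trickle table above:
# (timestep, cpu_time_sec, average reported on the page)
trickles = [
    (23136, 38482, 1.6633),
    (11616, 13851, 1.1924),
]

def avg_sec_per_timestep(timestep, cpu_time_sec):
    """CPU seconds spent per model timestep (hypothetical helper)."""
    return cpu_time_sec / timestep

for timestep, cpu_time_sec, reported in trickles:
    computed = avg_sec_per_timestep(timestep, cpu_time_sec)
    print(f"{timestep} timesteps: {computed:.4f} sec/TS (page reports {reported})")
```

Both rows agree with the reported values to four decimal places, which supports the assumed definition of the column.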

