climateprediction.net home page
Task 11966571

Task 11966571

Name hadam3p_pnw_v5zc_1984_1_006743409_0
Workunit 6946753
Created 4 Nov 2010, 2:16:32 UTC
Sent 4 Nov 2010, 18:24:16 UTC
Report deadline 17 Oct 2011, 23:44:16 UTC
Received 17 Dec 2010, 14:46:46 UTC
Server state Over
Outcome Computation error
Client state Compute error
Exit status 0 (0x00000000)
Computer ID 962831
Run time 1 days 6 hours 12 min 32 sec
CPU time 1 days 4 hours 3 min 26 sec
Validate state Invalid
Credit 753.03
Device peak FLOPS 2.76 GFLOPS
Application version UK Met Office HadAM3P-HadRM3P Pacific North West v6.05
windows_intelx86
Stderr
<core_client_version>6.12.6</core_client_version>
<![CDATA[
<stderr_txt>
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
14:42:05 (3080): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=5880, selfPID=5880, iMonCtr=2
15:16:20 (1988): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
Global Worker:: CPDN process is not running, exiting, bRetVal = 0, checkPID=0, selfPID=34576, iMonCtr=1
15:17:28 (15828): No heartbeat from core client for 30 sec - exiting
15:17:29 (15828): No heartbeat from core client for 30 sec - exiting
15:17:30 (15828): No heartbeat from core client for 30 sec - exiting
15:17:31 (15828): No heartbeat from core client for 30 sec - exiting
15:17:32 (15828): No heartbeat from core client for 30 sec - exiting
15:17:33 (15828): No heartbeat from core client for 30 sec - exiting
15:17:34 (15828): No heartbeat from core client for 30 sec - exiting
15:17:35 (15828): No heartbeat from core client for 30 sec - exiting
15:17:36 (15828): No heartbeat from core client for 30 sec - exiting
15:17:37 (15828): No heartbeat from core client for 30 sec - exiting
15:17:38 (15828): No heartbeat from core client for 30 sec - exiting
15:17:39 (15828): No heartbeat from core client for 30 sec - exiting
15:17:40 (15828): No heartbeat from core client for 30 sec - exiting
Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=0, iMonCtr=2
CPDN Monitor - No 'heartbeat' from BOINC...
Global Worker:: CPDN process is not running, exiting, bRetVal = 0, checkPID=0, selfPID=26152, iMonCtr=1
15:19:36 (35084): No heartbeat from core client for 30 sec - exiting
15:19:37 (35084): No heartbeat from core client for 30 sec - exiting
15:19:38 (35084): No heartbeat from core client for 30 sec - exiting
15:19:39 (35084): No heartbeat from core client for 30 sec - exiting
15:19:40 (35084): No heartbeat from core client for 30 sec - exiting
15:19:41 (35084): No heartbeat from core client for 30 sec - exiting
15:19:42 (35084): No heartbeat from core client for 30 sec - exiting
15:19:43 (35084): No heartbeat from core client for 30 sec - exiting
15:19:44 (35084): No heartbeat from core client for 30 sec - exiting
15:19:45 (35084): No heartbeat from core client for 30 sec - exiting
15:19:46 (35084): No heartbeat from core client for 30 sec - exiting
Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=0, iMonCtr=2
CPDN Monitor - No 'heartbeat' from BOINC...
15:21:52 (25956): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
Regional Worke15:45:36 (36868): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
Global Worker:: CPDN process is not running, exiting, bRetVal = 0, checkPID=0, selfPID=26680, iMonCtr=1
Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=28868, selfPID=28868, iMonCtr=2
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=28868, selfPID=32320, iMonCtr=1
Model crash detected, will try to restart...
Leaving CPDN_Main::Monitor...
Regional yearly means requires 12 input files got 3
15:46:51 (32320): No heartbeat from core client for 30 sec - exiting
15:46:52 (32320): No heartbeat from core client for 30 sec - exiting
15:46:53 (32320): No heartbeat from core client for 30 sec - exiting
15:46:54 (32320): No heartbeat from core client for 30 sec - exiting
15:46:55 (32320): No heartbeat from core client for 30 sec - exiting
15:46:55 (32320): called boinc_finish
15:46:56 (32320): No heartbeat from core client for 30 sec - exiting
15:46:57 (32320): No heartbeat from core client for 30 sec - exiting

</stderr_txt>
<message>
<file_xfer_error>
  <file_name>hadam3p_pnw_v5zc_1984_1_006743409_0_4.zip</file_name>
  <error_code>-161</error_code>
</file_xfer_error>
<file_xfer_error>
  <file_name>hadam3p_pnw_v5zc_1984_1_006743409_0_5.zip</file_name>
  <error_code>-161</error_code>
</file_xfer_error>
<file_xfer_error>
  <file_name>hadam3p_pnw_v5zc_1984_1_006743409_0_6.zip</file_name>
  <error_code>-161</error_code>
</file_xfer_error>
<file_xfer_error>
  <file_name>hadam3p_pnw_v5zc_1984_1_006743409_0_7.zip</file_name>
  <error_code>-161</error_code>
</file_xfer_error>
<file_xfer_error>
  <file_name>hadam3p_pnw_v5zc_1984_1_006743409_0_8.zip</file_name>
  <error_code>-161</error_code>
</file_xfer_error>
<file_xfer_error>
  <file_name>hadam3p_pnw_v5zc_1984_1_006743409_0_9.zip</file_name>
  <error_code>-161</error_code>
</file_xfer_error>
<file_xfer_error>
  <file_name>hadam3p_pnw_v5zc_1984_1_006743409_0_10.zip</file_name>
  <error_code>-161</error_code>
</file_xfer_error>
<file_xfer_error>
  <file_name>hadam3p_pnw_v5zc_1984_1_006743409_0_11.zip</file_name>
  <error_code>-161</error_code>
</file_xfer_error>
<file_xfer_error>
  <file_name>hadam3p_pnw_v5zc_1984_1_006743409_0_12.zip</file_name>
  <error_code>-161</error_code>
</file_xfer_error>

</message>
]]>
Latest Trickles Received
Time Sent (UTC) Host ID Result ID Result Name Timestep CPU Time (sec) Average (sec/TS)
17 Dec 2010 13:00:11 962831 11966571 hadam3p_pnw_v5zc_1984_1_006743409_0 34,656 88,739 2.5606
16 Dec 2010 09:46:09 962831 11966571 hadam3p_pnw_v5zc_1984_1_006743409_0 23,136 59,386 2.5668
15 Dec 2010 10:00:23 962831 11966571 hadam3p_pnw_v5zc_1984_1_006743409_0 11,616 30,163 2.5967


©2024 cpdn.org