climateprediction.net home page
Task 14576091

Task 14576091

Name hadam3p_pnw_ys8i_1977_1_006882058_1
Workunit 7085374
Created 23 Apr 2012, 11:05:36 UTC
Sent 24 Apr 2012, 21:03:17 UTC
Report deadline 7 Apr 2013, 2:23:17 UTC
Received 5 Nov 2012, 8:17:40 UTC
Server state Over
Outcome Computation error
Client state Compute error
Exit status 0 (0x00000000)
Computer ID 1214410
Run time 2 days 16 hours 30 min 44 sec
CPU time 2 days 9 hours 13 min 55 sec
Validate state Invalid
Credit 1,253.67
Device peak FLOPS 3.00 GFLOPS
Application version UK Met Office HadAM3P-HadRM3P Pacific North West v6.09
windows_intelx86
Stderr
<core_client_version>7.0.25</core_client_version>
<![CDATA[
<stderr_txt>
Suspended CPDN Monitor - Suspend request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=4904, selfPID=4904, iMonCtr=2
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=5480, selfPID=4460, iMonCtr=1
Model crash detected, will try to restart...
CPDN Monitor - Quit request from BOINC...
Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=5884, selfPID=5884, iMonCtr=2
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3852, iMonCtr=2
Model crash detected, will try to restart...
20:47:00 (408): No heartbeat from core client for 30 sec - exiting
20:47:01 (408): No heartbeat from core client for 30 sec - exiting
20:47:02 (408): No heartbeat from core client for 30 sec - exiting
20:47:03 (408): No heartbeat from core client for 30 sec - exiting
20:47:04 (408): No heartbeat from core client for 30 sec - exiting
20:47:06 (408): No heartbeat from core client for 30 sec - exiting
20:47:07 (408): No heartbeat from core client for 30 sec - exiting
20:47:08 (408): No heartbeat from core client for 30 sec - exiting
20:47:09 (408): No heartbeat from core client for 30 sec - exiting
20:47:10 (408): No heartbeat from core client for 30 sec - exiting
20:47:11 (408): No heartbeat from core client for 30 sec - exiting
20:47:12 (408): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
CPDN Monitor - Quit request from BOINC...
CCPDN Monitor - Quit request from BOINC...
18:20:20 (3756): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
16:06:11 (3568): No heartbeat from core client for 30 sec - exiting
16:06:12 (3568): No heartbeat from core client for 30 sec - exiting
16:06:13 (3568): No heartbeat from core client for 30 sec - exiting
16:06:14 (3568): No heartbeat from core client for 30 sec - exiting
16:06:15 (3568): No heartbeat from core client for 30 sec - exiting
16:06:16 (3568): No heartbeat from core client for 30 sec - exiting
16:06:17 (3568): No heartbeat from core client for 30 sec - exiting
16:06:18 (3568): No heartbeat from core client for 30 sec - exiting
16:06:19 (3568): No heartbeat from core client for 30 sec - exiting
16:06:20 (3568): No heartbeat from core client for 30 sec - exiting
16:06:21 (3568): No heartbeat from core client for 30 sec - exiting
16:06:22 (3568): No heartbeat from core client for 30 sec - exiting
16:06:23 (3568): No heartbeat from core client for 30 sec - exiting
16:06:24 (3568): No heartbeat from core client for 30 sec - exiting
16:06:25 (3568): No heartbeat from core client for 30 sec - exiting
16:06:26 (3568): No heartbeat from core client for 30 sec - exiting
16:06:27 (3568): No heartbeat from core client for 30 sec - exiting
16:06:29 (3568): No heartbeat from core client for 30 sec - exiting
16:06:30 (3568): No heartbeat from core client for 30 sec - exiting
16:06:31 (3568): No heartbeat from core client for 30 sec - exiting
16:06:32 (3568): No heartbeat from core client for 30 sec - exiting
16:06:33 (3568): No heartbeat from core client for 30 sec - exiting
16:06:34 (3568): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - Quit request from BOINC...
Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=1420, selfPID=1420, iMonCtr=2
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=2212, selfPID=2212, iMonCtr=2
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=3104, selfPID=3104, iMonCtr=2
CPDN Monitor - Quit request from BOINC...
Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=2920, selfPID=2920, iMonCtr=2
Suspended CPDN Monitor - Suspend request from BOINC...
CPDN Monitor - Quit request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=1220, iMonCtr=2
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=2220, selfPID=4856, iMonCtr=1
Model crash detected, will try to restart...
20:22:11 (1392): No heartbeat from core client for 30 sec - exiting
20:22:13 (1392): No heartbeat from core client for 30 sec - exiting
20:22:14 (1392): No heartbeat from core client for 30 sec - exiting
20:22:15 (1392): No heartbeat from core client for 30 sec - exiting
20:22:16 (1392): No heartbeat from core client for 30 sec - exiting
20:22:17 (1392): No heartbeat from core client for 30 sec - exiting
20:22:18 (1392): No heartbeat from core client for 30 sec - exiting
20:22:19 (1392): No heartbeat from core client for 30 sec - exiting
20:22:20 (1392): No heartbeat from core client for 30 sec - exiting
20:22:21 (1392): No heartbeat from core client for 30 sec - exiting
20:22:22 (1392): No heartbeat from core client for 30 sec - exiting
20:22:23 (1392): No heartbeat from core client for 30 sec - exiting
20:22:24 (1392): No heartbeat from core client for 30 sec - exiting
20:22:26 (1392): No heartbeat from core client for 30 sec - exiting
20:22:27 (1392): No heartbeat from core client for 30 sec - exiting
20:22:28 (1392): No heartbeat from core client for 30 sec - exiting
20:22:29 (1392): No heartbeat from core client for 30 sec - exiting
20:22:30 (1392): No heartbeat from core client for 30 sec - exiting
20:22:31 (1392): No heartbeat from core client for 30 sec - exiting
20:22:32 (1392): No heartbeat from core client for 30 sec - exiting
20:22:33 (1392): No heartbeat from core client for 30 sec - exiting
20:22:34 (1392): No heartbeat from core client for 30 sec - exiting
20:22:35 (1392): No heartbeat from core client for 30 sec - exiting
20:22:36 (1392): No heartbeat from core client for 30 sec - exiting
20:22:38 (1392): No heartbeat from core client for 30 sec - exiting
20:22:39 (1392): No heartbeat from core client for 30 sec - exiting
20:22:40 (1392): No heartbeat from core client for 30 sec - exiting
20:22:41 (1392): No heartbeat from core client for 30 sec - exiting
20:22:42 (1392): No heartbeat from core client for 30 sec - exiting
20:22:43 (1392): No heartbeat from core client for 30 sec - exiting
20:22:44 (1392): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
20:22:45 (1392): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=4772, selfPID=4772, iMonCtr=2
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=816, selfPID=816, iMonCtr=2
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=2764, selfPID=2764, iMonCtr=2
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=2764, selfPID=1044, iMonCtr=1
Model crash detected, will try to restart...
Leaving CPDN_Main::Monitor...
Regional yearly means requires 12 input files got 5
Called boinc_finish

</stderr_txt>
<message>
upload failure: <file_xfer_error>
  <file_name>hadam3p_pnw_ys8i_1977_1_006882058_1_6.zip</file_name>
  <error_code>-161</error_code>
</file_xfer_error>
<file_xfer_error>
  <file_name>hadam3p_pnw_ys8i_1977_1_006882058_1_7.zip</file_name>
  <error_code>-161</error_code>
</file_xfer_error>
<file_xfer_error>
  <file_name>hadam3p_pnw_ys8i_1977_1_006882058_1_8.zip</file_name>
  <error_code>-161</error_code>
</file_xfer_error>
<file_xfer_error>
  <file_name>hadam3p_pnw_ys8i_1977_1_006882058_1_9.zip</file_name>
  <error_code>-161</error_code>
</file_xfer_error>
<file_xfer_error>
  <file_name>hadam3p_pnw_ys8i_1977_1_006882058_1_10.zip</file_name>
  <error_code>-161</error_code>
</file_xfer_error>
<file_xfer_error>
  <file_name>hadam3p_pnw_ys8i_1977_1_006882058_1_11.zip</file_name>
  <error_code>-161</error_code>
</file_xfer_error>
<file_xfer_error>
  <file_name>hadam3p_pnw_ys8i_1977_1_006882058_1_12.zip</file_name>
  <error_code>-161</error_code>
</file_xfer_error>

</message>
]]>
Latest Trickles Received
Time Sent (UTC) Host ID Result ID Result Name Timestep CPU Time (sec) Average (sec/TS)
07 Sep 2012 09:09:58 1214410 14576091 hadam3p_pnw_ys8i_1977_1_006882058_1 57,696 188,017 3.2588
19 Jun 2012 23:03:50 1214410 14576091 hadam3p_pnw_ys8i_1977_1_006882058_1 46,177 151,640 3.2839
15 Jun 2012 18:17:26 1214410 14576091 hadam3p_pnw_ys8i_1977_1_006882058_1 46,176 151,247 3.2754
05 Jun 2012 20:20:19 1214410 14576091 hadam3p_pnw_ys8i_1977_1_006882058_1 34,656 113,725 3.2815
24 May 2012 20:56:11 1214410 14576091 hadam3p_pnw_ys8i_1977_1_006882058_1 23,136 73,448 3.1746
26 Apr 2012 17:59:40 1214410 14576091 hadam3p_pnw_ys8i_1977_1_006882058_1 11,616 35,546 3.0601


©2024 cpdn.org