Task 14575833

Name hadam3p_pnw_ys1i_1965_1_006881806_1
Workunit 7085122
Created 23 Apr 2012, 11:02:53 UTC
Sent 24 Apr 2012, 23:04:45 UTC
Report deadline 7 Apr 2013, 4:24:45 UTC
Received 5 Nov 2012, 8:17:40 UTC
Server state Over
Outcome Computation error
Client state Compute error
Exit status 0 (0x00000000)
Computer ID 1214410
Run time 2 days 8 hours 55 min 10 sec
CPU time 2 days 2 hours 7 min 14 sec
Validate state Invalid
Credit 1,003.35
Device peak FLOPS 3.00 GFLOPS
Application version UK Met Office HadAM3P-HadRM3P Pacific North West v6.09 (windows_intelx86)
Stderr
<core_client_version>7.0.25</core_client_version>
<![CDATA[
<stderr_txt>
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=4528, selfPID=4528, iMonCtr=2
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=4480, selfPID=4480, iMonCtr=2
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=5616, selfPID=5616, iMonCtr=2
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=4584, selfPID=4584, iMonCtr=2
CPDN Monitor - Quit request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=2536, selfPID=3868, iMonCtr=1
Model crash detected, will try to restart...
20:47:00 (3676): No heartbeat from core client for 30 sec - exiting
20:47:01 (3676): No heartbeat from core client for 30 sec - exiting
20:47:02 (3676): No heartbeat from core client for 30 sec - exiting
20:47:03 (3676): No heartbeat from core client for 30 sec - exiting
20:47:04 (3676): No heartbeat from core client for 30 sec - exiting
20:47:06 (3676): No heartbeat from core client for 30 sec - exiting
20:47:07 (3676): No heartbeat from core client for 30 sec - exiting
20:47:08 (3676): No heartbeat from core client for 30 sec - exiting
20:47:09 (3676): No heartbeat from core client for 30 sec - exiting
20:47:10 (3676): No heartbeat from core client for 30 sec - exiting
20:47:11 (3676): No heartbeat from core client for 30 sec - exiting
20:47:12 (3676): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=1516, selfPID=1516, iMonCtr=2
18:20:20 (3808): No heartbeat from core client for 30 sec - exiting
18:20:21 (3808): No heartbeat from core client for 30 sec - exiting
18:20:22 (3808): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
CPDN Monitor - Quit request from BOINC...
Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=3612, selfPID=3612, iMonCtr=2
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=2456, selfPID=2456, iMonCtr=2
CPDN Monitor - Quit request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4828, iMonCtr=2
Model crash detected, will try to restart...
Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4664, iMonCtr=2
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
16:06:11 (1372): No heartbeat from core client for 30 sec - exiting
16:06:12 (1372): No heartbeat from core client for 30 sec - exiting
16:06:13 (1372): No heartbeat from core client for 30 sec - exiting
16:06:14 (1372): No heartbeat from core client for 30 sec - exiting
16:06:15 (1372): No heartbeat from core client for 30 sec - exiting
16:06:16 (1372): No heartbeat from core client for 30 sec - exiting
16:06:17 (1372): No heartbeat from core client for 30 sec - exiting
16:06:18 (1372): No heartbeat from core client for 30 sec - exiting
16:06:19 (1372): No heartbeat from core client for 30 sec - exiting
16:06:20 (1372): No heartbeat from core client for 30 sec - exiting
16:06:21 (1372): No heartbeat from core client for 30 sec - exiting
16:06:22 (1372): No heartbeat from core client for 30 sec - exiting
16:06:23 (1372): No heartbeat from core client for 30 sec - exiting
16:06:24 (1372): No heartbeat from core client for 30 sec - exiting
16:06:25 (1372): No heartbeat from core client for 30 sec - exiting
16:06:26 (1372): No heartbeat from core client for 30 sec - exiting
16:06:27 (1372): No heartbeat from core client for 30 sec - exiting
16:06:29 (1372): No heartbeat from core client for 30 sec - exiting
16:06:30 (1372): No heartbeat from core client for 30 sec - exiting
16:06:31 (1372): No heartbeat from core client for 30 sec - exiting
16:06:32 (1372): No heartbeat from core client for 30 sec - exiting
16:06:33 (1372): No heartbeat from core client for 30 sec - exiting
16:06:34 (1372): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=3692, selfPID=3692, iMonCtr=2
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
CPDN Monitor - Quit request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
20:22:11 (3360): No heartbeat from core client for 30 sec - exiting
20:22:13 (3360): No heartbeat from core client for 30 sec - exiting
20:22:14 (3360): No heartbeat from core client for 30 sec - exiting
20:22:15 (3360): No heartbeat from core client for 30 sec - exiting
20:22:16 (3360): No heartbeat from core client for 30 sec - exiting
20:22:17 (3360): No heartbeat from core client for 30 sec - exiting
20:22:18 (3360): No heartbeat from core client for 30 sec - exiting
20:22:19 (3360): No heartbeat from core client for 30 sec - exiting
20:22:20 (3360): No heartbeat from core client for 30 sec - exiting
20:22:21 (3360): No heartbeat from core client for 30 sec - exiting
20:22:22 (3360): No heartbeat from core client for 30 sec - exiting
20:22:23 (3360): No heartbeat from core client for 30 sec - exiting
20:22:24 (3360): No heartbeat from core client for 30 sec - exiting
20:22:26 (3360): No heartbeat from core client for 30 sec - exiting
20:22:27 (3360): No heartbeat from core client for 30 sec - exiting
20:22:28 (3360): No heartbeat from core client for 30 sec - exiting
20:22:29 (3360): No heartbeat from core client for 30 sec - exiting
20:22:30 (3360): No heartbeat from core client for 30 sec - exiting
20:22:31 (3360): No heartbeat from core client for 30 sec - exiting
20:22:32 (3360): No heartbeat from core client for 30 sec - exiting
20:22:33 (3360): No heartbeat from core client for 30 sec - exiting
20:22:34 (3360): No heartbeat from core client for 30 sec - exiting
20:22:35 (3360): No heartbeat from core client for 30 sec - exiting
20:22:36 (3360): No heartbeat from core client for 30 sec - exiting
20:22:38 (3360): No heartbeat from core client for 30 sec - exiting
20:22:39 (3360): No heartbeat from core client for 30 sec - exiting
20:22:40 (3360): No heartbeat from core client for 30 sec - exiting
20:22:41 (3360): No heartbeat from core client for 30 sec - exiting
20:22:42 (3360): No heartbeat from core client for 30 sec - exiting
20:22:43 (3360): No heartbeat from core client for 30 sec - exiting
20:22:44 (3360): No heartbeat from core client for 30 sec - exiting
20:22:45 (3360): No heartbeat from core client for 30 sec - exiting
20:22:46 (3360): No heartbeat from core client for 30 sec - exiting
20:22:47 (3360): No heartbeat from core client for 30 sec - exiting
20:22:48 (3360): No heartbeat from core client for 30 sec - exiting
20:22:50 (3360): No heartbeat from core client for 30 sec - exiting
20:22:51 (3360): No heartbeat from core client for 30 sec - exiting
20:22:52 (3360): No heartbeat from core client for 30 sec - exiting
20:22:53 (3360): No heartbeat from core client for 30 sec - exiting
20:22:54 (3360): No heartbeat from core client for 30 sec - exiting
20:22:55 (3360): No heartbeat from core client for 30 sec - exiting
20:22:56 (3360): No heartbeat from core client for 30 sec - exiting
20:22:57 (3360): No heartbeat from core client for 30 sec - exiting
20:22:58 (3360): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
GCPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=2408, selfPID=2288, iMonCtr=1
Model crash detected, will try to restart...
Leaving CPDN_Main::Monitor...
Regional yearly means requires 12 input files got 4
Called boinc_finish

</stderr_txt>
<message>
upload failure: <file_xfer_error>
  <file_name>hadam3p_pnw_ys1i_1965_1_006881806_1_5.zip</file_name>
  <error_code>-161</error_code>
</file_xfer_error>
<file_xfer_error>
  <file_name>hadam3p_pnw_ys1i_1965_1_006881806_1_6.zip</file_name>
  <error_code>-161</error_code>
</file_xfer_error>
<file_xfer_error>
  <file_name>hadam3p_pnw_ys1i_1965_1_006881806_1_7.zip</file_name>
  <error_code>-161</error_code>
</file_xfer_error>
<file_xfer_error>
  <file_name>hadam3p_pnw_ys1i_1965_1_006881806_1_8.zip</file_name>
  <error_code>-161</error_code>
</file_xfer_error>
<file_xfer_error>
  <file_name>hadam3p_pnw_ys1i_1965_1_006881806_1_9.zip</file_name>
  <error_code>-161</error_code>
</file_xfer_error>
<file_xfer_error>
  <file_name>hadam3p_pnw_ys1i_1965_1_006881806_1_10.zip</file_name>
  <error_code>-161</error_code>
</file_xfer_error>
<file_xfer_error>
  <file_name>hadam3p_pnw_ys1i_1965_1_006881806_1_11.zip</file_name>
  <error_code>-161</error_code>
</file_xfer_error>
<file_xfer_error>
  <file_name>hadam3p_pnw_ys1i_1965_1_006881806_1_12.zip</file_name>
  <error_code>-161</error_code>
</file_xfer_error>

</message>
]]>
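A stderr log this repetitive is easier to read as a tally of message types. The following is a minimal Python sketch, assuming the log text is available as a plain string; the category labels and the `tally` helper are illustrative names, not part of BOINC or the CPDN client, and the sample holds only a few lines copied from the log above.

```python
import re
from collections import Counter

# A few lines copied from the stderr output above (not the full log).
SAMPLE = """\
CPDN Monitor - Quit request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=4528, selfPID=4528, iMonCtr=2
20:47:00 (3676): No heartbeat from core client for 30 sec - exiting
20:47:01 (3676): No heartbeat from core client for 30 sec - exiting
Model crash detected, will try to restart...
"""

# Hypothetical category labels mapped to substrings seen in this log.
PATTERNS = {
    "quit_request": re.compile(r"Quit request from BOINC"),
    "suspend_request": re.compile(r"Suspend request from BOINC"),
    "no_heartbeat": re.compile(r"No heartbeat from core client"),
    "process_not_running": re.compile(r"CPDN process is not running"),
    "model_crash": re.compile(r"Model crash detected"),
}


def tally(text: str) -> Counter:
    """Count log lines per category; each line counts for at most one category."""
    counts = Counter()
    for line in text.splitlines():
        for label, pattern in PATTERNS.items():
            if pattern.search(line):
                counts[label] += 1
                break
    return counts


counts = tally(SAMPLE)
for label, n in sorted(counts.items()):
    print(f"{label}: {n}")
```

Running the same function over the full stderr text would show how often the task was stopped, suspended, or lost contact with the core client before the final failure.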
Latest Trickles Received
Time Sent (UTC)       Host ID  Result ID  Result Name                          Timestep  CPU Time (sec)  Average (sec/TS)
29 Aug 2012 05:19:33  1214410  14575833   hadam3p_pnw_ys1i_1965_1_006881806_1  46,176    154,248         3.3404
15 Jun 2012 09:10:10  1214410  14575833   hadam3p_pnw_ys1i_1965_1_006881806_1  34,657    117,923         3.4026
06 Jun 2012 02:23:16  1214410  14575833   hadam3p_pnw_ys1i_1965_1_006881806_1  34,656    117,576         3.3927
25 May 2012 01:12:32  1214410  14575833   hadam3p_pnw_ys1i_1965_1_006881806_1  23,136    77,253          3.3391
30 Apr 2012 19:13:22  1214410  14575833   hadam3p_pnw_ys1i_1965_1_006881806_1  11,616    36,708          3.1601
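
The Average (sec/TS) column appears to be the cumulative CPU time divided by the timestep count, rounded to four decimal places. The page does not state this definition, so it is an assumption, but it matches every row above. A short Python sketch of the check follows; `avg_sec_per_timestep` is an illustrative name, not a CPDN or BOINC function.

```python
# (time sent, timestep, CPU time in seconds, reported average) from the trickle table.
TRICKLES = [
    ("29 Aug 2012 05:19:33", 46176, 154248, 3.3404),
    ("15 Jun 2012 09:10:10", 34657, 117923, 3.4026),
    ("06 Jun 2012 02:23:16", 34656, 117576, 3.3927),
    ("25 May 2012 01:12:32", 23136, 77253, 3.3391),
    ("30 Apr 2012 19:13:22", 11616, 36708, 3.1601),
]


def avg_sec_per_timestep(cpu_seconds: int, timesteps: int) -> float:
    """Assumed definition: cumulative CPU seconds per model timestep, 4 decimals."""
    return round(cpu_seconds / timesteps, 4)


computed = [avg_sec_per_timestep(cpu, ts) for _, ts, cpu, _ in TRICKLES]
reported = [avg for _, _, _, avg in TRICKLES]

for (sent, _, _, avg), value in zip(TRICKLES, computed):
    print(f"{sent}: computed {value:.4f}, reported {avg:.4f}")
```

Under that assumption the computed values agree with the reported column for all five trickles.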