climateprediction.net
Task 14346166

Name hadam3p_pnw_z20a_1969_1_006913922_2
Workunit 7117238
Created 2 Apr 2012, 13:09:33 UTC
Sent 2 Apr 2012, 13:09:38 UTC
Report deadline 15 Mar 2013, 18:29:38 UTC
Received 28 Apr 2012, 18:48:57 UTC
Server state Over
Outcome Computation error
Client state Compute error
Exit status 0 (0x00000000)
Computer ID 1057729
Run time 3 days 13 hours 40 min 21 sec
CPU time 3 days 10 hours 33 min 28 sec
Validate state Invalid
Credit 2,004.61
Device peak FLOPS 2.53 GFLOPS
Application version UK Met Office HadAM3P-HadRM3P Pacific North West v6.09 windows_intelx86
Stderr
<core_client_version>6.12.34</core_client_version>
<![CDATA[
<stderr_txt>
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=6124, selfPID=6124, iMonCtr=2
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=3508, selfPID=3508, iMonCtr=2
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=6212, selfPID=6212, iMonCtr=2
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
07:09:18 (5512): No heartbeat from core client for 30 sec - exiting
07:09:19 (5512): No heartbeat from core client for 30 sec - exiting
07:09:20 (5512): No heartbeat from core client for 30 sec - exiting
07:09:21 (5512): No heartbeat from core client for 30 sec - exiting
07:09:23 (5512): No heartbeat from core client for 30 sec - exiting
07:09:24 (5512): No heartbeat from core client for 30 sec - exiting
07:09:25 (5512): No heartbeat from core client for 30 sec - exiting
07:09:26 (5512): No heartbeat from core client for 30 sec - exiting
07:09:27 (5512): No heartbeat from core client for 30 sec - exiting
07:09:28 (5512): No heartbeat from core client for 30 sec - exiting
07:09:29 (5512): No heartbeat from core client for 30 sec - exiting
07:09:30 (5512): No heartbeat from core client for 30 sec - exiting
07:09:31 (5512): No heartbeat from core client for 30 sec - exiting
07:09:32 (5512): No heartbeat from core client for 30 sec - exiting
07:09:33 (5512): No heartbeat from core client for 30 sec - exiting
07:09:35 (5512): No heartbeat from core client for 30 sec - exiting
07:09:36 (5512): No heartbeat from core client for 30 sec - exiting
07:09:37 (5512): No heartbeat from core client for 30 sec - exiting
07:09:38 (5512): No heartbeat from core client for 30 sec - exiting
07:09:39 (5512): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
07:10:50 (6836): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
CPDN Monitor - Quit request from BOINC...
Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=6128, selfPID=6128, iMonCtr=2
CPDN Monitor - Quit request from BOINC...
Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=6044, selfPID=6044, iMonCtr=2
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=3176, selfPID=3176, iMonCtr=2
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=4684, selfPID=4684, iMonCtr=2
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=9060, selfPID=9060, iMonCtr=2
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=8088, selfPID=8088, iMonCtr=2
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
21:37:04 (4476): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
21:39:53 (7104): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=7040, selfPID=7040, iMonCtr=2
19:50:22 (2344): No heartbeat from core client for 30 sec - exiting
19:50:24 (2344): No heartbeat from core client for 30 sec - exiting
19:50:25 (2344): No heartbeat from core client for 30 sec - exiting
19:50:26 (2344): No heartbeat from core client for 30 sec - exiting
19:50:27 (2344): No heartbeat from core client for 30 sec - exiting
19:50:28 (2344): No heartbeat from core client for 30 sec - exiting
19:50:29 (2344): No heartbeat from core client for 30 sec - exiting
19:50:30 (2344): No heartbeat from core client for 30 sec - exiting
19:50:31 (2344): No heartbeat from core client for 30 sec - exiting
19:50:32 (2344): No heartbeat from core client for 30 sec - exiting
19:50:33 (2344): No heartbeat from core client for 30 sec - exiting
19:50:34 (2344): No heartbeat from core client for 30 sec - exiting
19:50:36 (2344): No heartbeat from core client for 30 sec - exiting
19:50:37 (2344): No heartbeat from core client for 30 sec - exiting
19:50:38 (2344): No heartbeat from core client for 30 sec - exiting
19:50:39 (2344): No heartbeat from core client for 30 sec - exiting
19:50:40 (2344): No heartbeat from core client for 30 sec - exiting
19:50:41 (2344): No heartbeat from core client for 30 sec - exiting
19:50:42 (2344): No heartbeat from core client for 30 sec - exiting
19:50:43 (2344): No heartbeat from core client for 30 sec - exiting
19:50:44 (2344): No heartbeat from core client for 30 sec - exiting
19:50:45 (2344): No heartbeat from core client for 30 sec - exiting
19:50:46 (2344): No heartbeat from core client for 30 sec - exiting
19:50:48 (2344): No heartbeat from core client for 30 sec - exiting
19:50:49 (2344): No heartbeat from core client for 30 sec - exiting
19:50:50 (2344): No heartbeat from core client for 30 sec - exiting
19:50:51 (2344): No heartbeat from core client for 30 sec - exiting
19:50:52 (2344): No heartbeat from core client for 30 sec - exiting
19:50:53 (2344): No heartbeat from core client for 30 sec - exiting
19:50:54 (2344): No heartbeat from core client for 30 sec - exiting
19:50:55 (2344): No heartbeat from core client for 30 sec - exiting
19:50:56 (2344): No heartbeat from core client for 30 sec - exiting
19:50:57 (2344): No heartbeat from core client for 30 sec - exiting
19:50:58 (2344): No heartbeat from core client for 30 sec - exiting
19:50:59 (2344): No heartbeat from core client for 30 sec - exiting
19:51:01 (2344): No heartbeat from core client for 30 sec - exiting
19:51:02 (2344): No heartbeat from core client for 30 sec - exiting
19:51:03 (2344): No heartbeat from core client for 30 sec - exiting
19:51:04 (2344): No heartbeat from core client for 30 sec - exiting
19:51:05 (2344): No heartbeat from core client for 30 sec - exiting
19:51:06 (2344): No heartbeat from core client for 30 sec - exiting
19:51:07 (2344): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=6320, selfPID=3520, iMonCtr=1
Model crash detected, will try to restart...
CPDN Monitor - Quit request from BOINC...
Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=7032, selfPID=7032, iMonCtr=2
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=4628, selfPID=4628, iMonCtr=2
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=5644, selfPID=5644, iMonCtr=2
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=652, selfPID=652, iMonCtr=2
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
22:52:16 (2636): No heartbeat from core client for 30 sec - exiting
22:52:17 (2636): No heartbeat from core client for 30 sec - exiting
22:52:19 (2636): No heartbeat from core client for 30 sec - exiting
22:52:20 (2636): No heartbeat from core client for 30 sec - exiting
22:52:21 (2636): No heartbeat from core client for 30 sec - exiting
22:52:22 (2636): No heartbeat from core client for 30 sec - exiting
22:52:23 (2636): No heartbeat from core client for 30 sec - exiting
22:52:24 (2636): No heartbeat from core client for 30 sec - exiting
22:52:25 (2636): No heartbeat from core client for 30 sec - exiting
22:52:26 (2636): No heartbeat from core client for 30 sec - exiting
22:52:27 (2636): No heartbeat from core client for 30 sec - exiting
22:52:28 (2636): No heartbeat from core client for 30 sec - exiting
22:52:29 (2636): No heartbeat from core client for 30 sec - exiting
22:52:31 (2636): No heartbeat from core client for 30 sec - exiting
22:52:32 (2636): No heartbeat from core client for 30 sec - exiting
22:52:33 (2636): No heartbeat from core client for 30 sec - exiting
22:52:34 (2636): No heartbeat from core client for 30 sec - exiting
22:52:35 (2636): No heartbeat from core client for 30 sec - exiting
22:52:36 (2636): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=7076, iMonCtr=2
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=6632, selfPID=6308, iMonCtr=1
Model crash detected, will try to restart...
Leaving CPDN_Main::Monitor...
Regional yearly means requires 12 input files got 8
Called boinc_finish

</stderr_txt>
<message>
upload failure: <file_xfer_error>
  <file_name>hadam3p_pnw_z20a_1969_1_006913922_2_9.zip</file_name>
  <error_code>-161</error_code>
</file_xfer_error>
<file_xfer_error>
  <file_name>hadam3p_pnw_z20a_1969_1_006913922_2_10.zip</file_name>
  <error_code>-161</error_code>
</file_xfer_error>
<file_xfer_error>
  <file_name>hadam3p_pnw_z20a_1969_1_006913922_2_11.zip</file_name>
  <error_code>-161</error_code>
</file_xfer_error>
<file_xfer_error>
  <file_name>hadam3p_pnw_z20a_1969_1_006913922_2_12.zip</file_name>
  <error_code>-161</error_code>
</file_xfer_error>

</message>
]]>
Latest Trickles Received
Time Sent (UTC)       Host ID  Result ID  Result Name                          Timestep  CPU Time (sec)  Average (sec/TS)
25 Apr 2012 14:40:06  1057729  14346166   hadam3p_pnw_z20a_1969_1_006913922_2  92,256    277,718         3.0103
22 Apr 2012 16:16:01  1057729  14346166   hadam3p_pnw_z20a_1969_1_006913922_2  80,736    243,577         3.0170
20 Apr 2012 19:28:02  1057729  14346166   hadam3p_pnw_z20a_1969_1_006913922_2  69,216    209,306         3.0240
14 Apr 2012 20:13:42  1057729  14346166   hadam3p_pnw_z20a_1969_1_006913922_2  57,696    174,851         3.0306
13 Apr 2012 18:31:28  1057729  14346166   hadam3p_pnw_z20a_1969_1_006913922_2  46,176    139,515         3.0214
10 Apr 2012 16:23:34  1057729  14346166   hadam3p_pnw_z20a_1969_1_006913922_2  34,656    104,373         3.0117
09 Apr 2012 12:48:54  1057729  14346166   hadam3p_pnw_z20a_1969_1_006913922_2  23,136    69,099          2.9866
05 Apr 2012 15:38:43  1057729  14346166   hadam3p_pnw_z20a_1969_1_006913922_2  11,616    34,941          3.0080
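
The Average (sec/TS) column in the trickle table appears to be the cumulative CPU time divided by the timestep count. The page does not state this; it is an assumption inferred from the figures. The sketch below checks that assumption against the eight rows listed. The names `TRICKLES`, `sec_per_timestep`, and `check_rows` are hypothetical and exist only for this illustration.

```python
# Rows copied from the "Latest Trickles Received" table:
# (timestep, cumulative CPU time in seconds, reported average in sec/TS)
TRICKLES = [
    (92256, 277718, 3.0103),
    (80736, 243577, 3.0170),
    (69216, 209306, 3.0240),
    (57696, 174851, 3.0306),
    (46176, 139515, 3.0214),
    (34656, 104373, 3.0117),
    (23136, 69099, 2.9866),
    (11616, 34941, 3.0080),
]


def sec_per_timestep(timestep: int, cpu_time_sec: int) -> float:
    """Assumed definition: cumulative CPU seconds / timesteps completed."""
    return cpu_time_sec / timestep


def check_rows(rows, tolerance: float = 1e-4) -> bool:
    """Return True if every reported average agrees with the assumed formula."""
    return all(
        abs(sec_per_timestep(ts, cpu) - avg) < tolerance
        for ts, cpu, avg in rows
    )


if __name__ == "__main__":
    for ts, cpu, avg in TRICKLES:
        computed = sec_per_timestep(ts, cpu)
        print(f"{ts:>6} timesteps: computed {computed:.4f}, reported {avg:.4f}")
    print("all rows consistent:", check_rows(TRICKLES))
```

If the assumption holds, each computed value agrees with the reported one to four decimal places, which suggests the host ran at a steady rate of roughly 3 CPU seconds per timestep until the last trickle.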

