Task 16312361

Name hadam3p_eu_i0od_2013_1_008530365_0
Workunit 8677877
Created 3 Mar 2014, 14:54:16 UTC
Sent 4 Mar 2014, 18:34:00 UTC
Report deadline 14 Feb 2015, 23:54:00 UTC
Received 8 Mar 2014, 22:52:51 UTC
Server state Over
Outcome Computation error
Client state Compute error
Exit status 0 (0x00000000)
Computer ID 1291948
Run time 20 hours 11 min 3 sec
CPU time 12 hours 11 min 25 sec
Validate state Invalid
Credit 597.84
Device peak FLOPS 3.42 GFLOPS
Application version UK Met Office HadAM3P-HadRM3P Europe v6.09 windows_intelx86
Stderr
<core_client_version>7.2.39</core_client_version>
<![CDATA[
<stderr_txt>
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
CPDN Monitor - Quit request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
CPDN Monitor - Quit request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
CPDN Monitor - Quit request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=9160, selfPID=7968, iMonCtr=1
Model crash detected, will try to restart...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=6104, selfPID=3728, iMonCtr=1
Model crash detected, will try to restart...
Leaving CPDN_Main::Monitor...
Called boinc_finish
19:41:21 (4000): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=4188, selfPID=4188, iMonCtr=2
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
CPDN Monitor - Quit request from BOINC...
Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=5408, selfPID=5408, iMonCtr=2
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
CPDN Monitor - Quit request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
11:02:39 (3212): No heartbeat from core client for 30 sec - exiting
11:02:40 (3212): No heartbeat from core client for 30 sec - exiting
11:02:41 (3212): No heartbeat from core client for 30 sec - exiting
11:02:42 (3212): No heartbeat from core client for 30 sec - exiting
11:02:44 (3212): No heartbeat from core client for 30 sec - exiting
11:02:45 (3212): No heartbeat from core client for 30 sec - exiting
11:02:46 (3212): No heartbeat from core client for 30 sec - exiting
11:02:47 (3212): No heartbeat from core client for 30 sec - exiting
11:02:48 (3212): No heartbeat from core client for 30 sec - exiting
11:02:49 (3212): No heartbeat from core client for 30 sec - exiting
11:02:50 (3212): No heartbeat from core client for 30 sec - exiting
11:02:51 (3212): No heartbeat from core client for 30 sec - exiting
11:02:52 (3212): No heartbeat from core client for 30 sec - exiting
11:02:53 (3212): No heartbeat from core client for 30 sec - exiting
11:02:55 (3212): No heartbeat from core client for 30 sec - exiting
11:02:56 (3212): No heartbeat from core client for 30 sec - exiting
11:02:57 (3212): No heartbeat from core client for 30 sec - exiting
11:02:58 (3212): No heartbeat from core client for 30 sec - exiting
11:02:59 (3212): No heartbeat from core client for 30 sec - exiting
11:03:00 (3212): No heartbeat from core client for 30 sec - exiting
11:03:01 (3212): No heartbeat from core client for 30 sec - exiting
11:03:02 (3212): No heartbeat from core client for 30 sec - exiting
11:03:03 (3212): No heartbeat from core client for 30 sec - exiting
11:03:04 (3212): No heartbeat from core client for 30 sec - exiting
11:03:05 (3212): No heartbeat from core client for 30 sec - exiting
11:03:07 (3212): No heartbeat from core client for 30 sec - exiting
11:03:08 (3212): No heartbeat from core client for 30 sec - exiting
11:03:09 (3212): No heartbeat from core client for 30 sec - exiting
11:03:10 (3212): No heartbeat from core client for 30 sec - exiting
11:03:11 (3212): No heartbeat from core client for 30 sec - exiting
11:03:12 (3212): No heartbeat from core client for 30 sec - exiting
11:03:13 (3212): No heartbeat from core client for 30 sec - exiting
11:03:14 (3212): No heartbeat from core client for 30 sec - exiting
11:03:15 (3212): No heartbeat from core client for 30 sec - exiting
11:03:16 (3212): No heartbeat from core client for 30 sec - exiting
11:03:17 (3212): No heartbeat from core client for 30 sec - exiting
11:03:19 (3212): No heartbeat from core client for 30 sec - exiting
11:03:20 (3212): No heartbeat from core client for 30 sec - exiting
11:03:21 (3212): No heartbeat from core client for 30 sec - exiting
11:03:22 (3212): No heartbeat from core client for 30 sec - exiting
11:03:23 (3212): No heartbeat from core client for 30 sec - exiting
11:03:24 (3212): No heartbeat from core client for 30 sec - exiting
11:03:25 (3212): No heartbeat from core client for 30 sec - exiting
11:03:26 (3212): No heartbeat from core client for 30 sec - exiting
11:03:27 (3212): No heartbeat from core client for 30 sec - exiting
11:03:29 (3212): No heartbeat from core client for 30 sec - exiting
11:03:30 (3212): No heartbeat from core client for 30 sec - exiting
11:03:31 (3212): No heartbeat from core client for 30 sec - exiting
11:03:32 (3212): No heartbeat from core client for 30 sec - exiting
11:03:33 (3212): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=3204, selfPID=2860, iMonCtr=1
Model crash detected, will try to restart...
Leaving CPDN_Main::Monitor...
Called boinc_finish

</stderr_txt><message>
upload failure: <file_xfer_error>
  <file_name>hadam3p_eu_i0od_2013_1_008530365_0_4.zip</file_name>
  <error_code>-161 (not found)</error_code>
</file_xfer_error>
<file_xfer_error>
  <file_name>hadam3p_eu_i0od_2013_1_008530365_0_5.zip</file_name>
  <error_code>-161 (not found)</error_code>
</file_xfer_error>
<file_xfer_error>
  <file_name>hadam3p_eu_i0od_2013_1_008530365_0_6.zip</file_name>
  <error_code>-161 (not found)</error_code>
</file_xfer_error>
<file_xfer_error>
  <file_name>hadam3p_eu_i0od_2013_1_008530365_0_7.zip</file_name>
  <error_code>-161 (not found)</error_code>
</file_xfer_error>
<file_xfer_error>
  <file_name>hadam3p_eu_i0od_2013_1_008530365_0_8.zip</file_name>
  <error_code>-161 (not found)</error_code>
</file_xfer_error>
<file_xfer_error>
  <file_name>hadam3p_eu_i0od_2013_1_008530365_0_9.zip</file_name>
  <error_code>-161 (not found)</error_code>
</file_xfer_error>
<file_xfer_error>
  <file_name>hadam3p_eu_i0od_2013_1_008530365_0_10.zip</file_name>
  <error_code>-161 (not found)</error_code>
</file_xfer_error>
<file_xfer_error>
  <file_name>hadam3p_eu_i0od_2013_1_008530365_0_11.zip</file_name>
  <error_code>-161 (not found)</error_code>
</file_xfer_error>
<file_xfer_error>
  <file_name>hadam3p_eu_i0od_2013_1_008530365_0_12.zip</file_name>
  <error_code>-161 (not found)</error_code>
</file_xfer_error>

</message>
]]>
Latest Trickles Received
Time Sent (UTC)       Host ID  Result ID  Result Name                         Timestep  CPU Time (sec)  Average (sec/TS)
08 Mar 2014 20:06:48  1291948  16312361   hadam3p_eu_i0od_2013_1_008530365_0  34,656    43,546          1.2565
08 Mar 2014 15:51:15  1291948  16312361   hadam3p_eu_i0od_2013_1_008530365_0  23,136    29,061          1.2561
05 Mar 2014 21:59:05  1291948  16312361   hadam3p_eu_i0od_2013_1_008530365_0  11,616    14,998          1.2912
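
The Average (sec/TS) column appears to be the cumulative CPU time divided by the timestep count. That definition is an assumption inferred from the three rows above, not something the page states. A minimal Python sketch that recomputes it from the listed values (the function name and the tuple layout are illustrative only):

```python
# Trickle rows copied from the table above:
# (timestep, cpu_time_sec, reported_average_sec_per_ts)
trickles = [
    (34656, 43546, 1.2565),
    (23136, 29061, 1.2561),
    (11616, 14998, 1.2912),
]


def average_sec_per_timestep(cpu_time_sec, timestep):
    """Assumed definition: cumulative CPU seconds / timesteps completed."""
    return cpu_time_sec / timestep


for timestep, cpu_time_sec, reported in trickles:
    computed = average_sec_per_timestep(cpu_time_sec, timestep)
    # Show the recomputed value next to the one reported by the server.
    print(f"TS={timestep:>6}  CPU={cpu_time_sec:>6}s  "
          f"computed={computed:.4f}  reported={reported:.4f}")
```

Under that assumption, the recomputed values agree with the reported ones to within rounding at four decimal places.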

