climateprediction.net home page
Task 16622093

Task 16622093

Name hadam3p_anz_r6w7_2012_1_008738549_0
Workunit 8884527
Created 8 May 2014, 19:12:27 UTC
Sent 15 May 2014, 17:21:43 UTC
Report deadline 27 Apr 2015, 22:41:43 UTC
Received 10 Jan 2015, 18:57:26 UTC
Server state Over
Outcome Computation error
Client state Compute error
Exit status 0 (0x00000000)
Computer ID 1309837
Run time 3 days 0 hours 38 min 53 sec
CPU time 2 days 13 hours 49 min 27 sec
Validate state Invalid
Credit 1,006.54
Device peak FLOPS 0.42 GFLOPS
Application version UK Met Office HadAM3P-HadRM3P Australia New Zealand v6.10
windows_intelx86
Stderr
<core_client_version>7.2.42</core_client_version>
<![CDATA[
<stderr_txt>
Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPIDCPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
19:46:33 (5032): No heartbeat from core client for 30 sec - exiting
19:46:34 (5032): No heartbeat from core client for 30 sec - exiting
19:46:36 (5032): No heartbeat from core client for 30 sec - exiting
19:46:37 (5032): No heartbeat from core client for 30 sec - exiting
19:46:38 (5032): No heartbeat from core client for 30 sec - exiting
19:46:39 (5032): No heartbeat from core client for 30 sec - exiting
19:46:40 (5032): No heartbeat from core client for 30 sec - exiting
19:46:41 (5032): No heartbeat from core client for 30 sec - exiting
19:46:42 (5032): No heartbeat from core client for 30 sec - exiting
19:46:43 (5032): No heartbeat from core client for 30 sec - exiting
19:46:44 (5032): No heartbeat from core client for 30 sec - exiting
19:46:45 (5032): No heartbeat from core client for 30 sec - exiting
19:46:46 (5032): No heartbeat from core client for 30 sec - exiting
19:46:48 (5032): No heartbeat from core client for 30 sec - exiting
19:46:49 (5032): No heartbeat from core client for 30 sec - exiting
19:46:50 (5032): No heartbeat from core client for 30 sec - exiting
19:46:51 (5032): No heartbeat from core client for 30 sec - exiting
19:46:52 (5032): No heartbeat from core client for 30 sec - exiting
19:46:53 (5032): No heartbeat from core client for 30 sec - exiting
19:46:54 (5032): No heartbeat from core client for 30 sec - exiting
19:46:55 (5032): No heartbeat from core client for 30 sec - exiting
19:46:56 (5032): No heartbeat from core client for 30 sec - exiting
19:46:57 (5032): No heartbeat from core client for 30 sec - exiting
19:46:58 (5032): No heartbeat from core client for 30 sec - exiting
19:47:00 (5032): No heartbeat from core client for 30 sec - exiting
19:47:01 (5032): No heartbeat from core client for 30 sec - exiting
19:47:02 (5032): No heartbeat from core client for 30 sec - exiting
19:47:03 (5032): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4200, iMonCtr=2
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=2360, iMonCtr=2
Model crash detected, will try to restart...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
GCPDN Monitor - Quit request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=3960, selfPID=4172, iMonCtr=1
Model crash detected, will try to restart...
CPDN Monitor - Quit request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5676, iMonCtr=2
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=4468, selfPID=4216, iMonCtr=1
Model crash detected, will try to restart...
CCPDN Monitor - Quit request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=3528, selfPID=1476, iMonCtr=1
Model crash detected, will try to restart...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
GCPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=4476, selfPID=6608, iMonCtr=1
Model crash detected, will try to restart...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
Global Worker:: CPDN process is not running, exiting, bRetVal = 0, checkPID=0, selfPID=960, iMonCtr=1
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=5300, selfPID=7056, iMonCtr=1
Model crash detected, will try to restart...
Leaving CPDN_Main::Monitor...
Called boinc_finish

</stderr_txt>
<message>
upload failure: <file_xfer_error>
  <file_name>hadam3p_anz_r6w7_2012_1_008738549_0_3.zip</file_name>
  <error_code>-161 (not found)</error_code>
</file_xfer_error>
<file_xfer_error>
  <file_name>hadam3p_anz_r6w7_2012_1_008738549_0_4.zip</file_name>
  <error_code>-161 (not found)</error_code>
</file_xfer_error>
<file_xfer_error>
  <file_name>hadam3p_anz_r6w7_2012_1_008738549_0_5.zip</file_name>
  <error_code>-161 (not found)</error_code>
</file_xfer_error>
<file_xfer_error>
  <file_name>hadam3p_anz_r6w7_2012_1_008738549_0_6.zip</file_name>
  <error_code>-161 (not found)</error_code>
</file_xfer_error>
<file_xfer_error>
  <file_name>hadam3p_anz_r6w7_2012_1_008738549_0_7.zip</file_name>
  <error_code>-161 (not found)</error_code>
</file_xfer_error>
<file_xfer_error>
  <file_name>hadam3p_anz_r6w7_2012_1_008738549_0_8.zip</file_name>
  <error_code>-161 (not found)</error_code>
</file_xfer_error>
<file_xfer_error>
  <file_name>hadam3p_anz_r6w7_2012_1_008738549_0_9.zip</file_name>
  <error_code>-161 (not found)</error_code>
</file_xfer_error>
<file_xfer_error>
  <file_name>hadam3p_anz_r6w7_2012_1_008738549_0_10.zip</file_name>
  <error_code>-161 (not found)</error_code>
</file_xfer_error>
<file_xfer_error>
  <file_name>hadam3p_anz_r6w7_2012_1_008738549_0_11.zip</file_name>
  <error_code>-161 (not found)</error_code>
</file_xfer_error>
<file_xfer_error>
  <file_name>hadam3p_anz_r6w7_2012_1_008738549_0_12.zip</file_name>
  <error_code>-161 (not found)</error_code>
</file_xfer_error>

</message>
]]>
Latest Trickles Received
Time Sent (UTC) Host ID Result ID Result Name Timestep CPU Time (sec) Average (sec/TS)
19 Jun 2014 02:15:13 1309837 16622093 hadam3p_anz_r6w7_2012_1_008738549_0 23,339 206,120 8.8316
26 May 2014 16:43:24 1309837 16622093 hadam3p_anz_r6w7_2012_1_008738549_0 11,819 105,269 8.9068


©2024 cpdn.org