Task 17529054

Name hadam3p_anz_d99w_2012_1_009264638_0
Workunit 9357554
Created 1 Dec 2014, 15:45:32 UTC
Sent 3 Dec 2014, 17:41:48 UTC
Report deadline 15 Nov 2015, 23:01:48 UTC
Received 30 Dec 2014, 11:31:37 UTC
Server state Over
Outcome Computation error
Client state Compute error
Exit status 0 (0x00000000)
Computer ID 1323943
Run time 2 days 15 hours 30 min 30 sec
CPU time 2 days 8 hours 33 min 21 sec
Validate state Invalid
Credit 1,503.36
Device peak FLOPS 2.98 GFLOPS
Application version UK Met Office HadAM3P-HadRM3P Australia New Zealand v6.10 (windows_intelx86)
Stderr
<core_client_version>7.2.42</core_client_version>
<![CDATA[
<stderr_txt>
CPDN Monitor - Quit request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=1068, iMonCtr=2
Model crash detected, will try to restart...
Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5160, iMonCtr=2
09:36:01 (4592): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
09:38:44 (3792): No heartbeat from core client for 30 sec - exiting
09:38:45 (3792): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
CPDN Monitor - Quit request from BOINC...
Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=3416, selfPID=3416, iMonCtr=2
CPDN Monitor - Quit request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=7040, selfPID=1948, iMonCtr=1
Model crash detected, will try to restart...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=8108, iMonCtr=2
Model crash detected, will try to restart...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=1276, iMonCtr=2
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=5836, selfPID=4704, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=6848, selfPID=4104, iMonCtr=1
Model crash detected, will try to restart...
Leaving CPDN_Main::Monitor...
08:12:29 (4124): No heartbeat from core client for 30 sec - exiting
08:12:30 (4124): No heartbeat from core client for 30 sec - exiting
08:12:31 (4124): No heartbeat from core client for 30 sec - exiting
08:12:32 (4124): No heartbeat from core client for 30 sec - exiting
08:12:33 (4124): No heartbeat from core client for 30 sec - exiting
08:12:34 (4124): No heartbeat from core client for 30 sec - exiting
08:12:35 (4124): No heartbeat from core client for 30 sec - exiting
08:12:36 (4124): No heartbeat from core client for 30 sec - exiting
08:12:37 (4124): No heartbeat from core client for 30 sec - exiting
08:12:39 (4124): No heartbeat from core client for 30 sec - exiting
08:12:40 (4124): No heartbeat from core client for 30 sec - exiting
08:12:41 (4124): No heartbeat from core client for 30 sec - exiting
08:12:42 (4124): No heartbeat from core client for 30 sec - exiting
08:12:43 (4124): No heartbeat from core client for 30 sec - exiting
08:12:44 (4124): No heartbeat from core client for 30 sec - exiting
08:12:45 (4124): No heartbeat from core client for 30 sec - exiting
08:12:46 (4124): No heartbeat from core client for 30 sec - exiting
08:12:47 (4124): No heartbeat from core client for 30 sec - exiting
08:12:48 (4124): No heartbeat from core client for 30 sec - exiting
08:12:50 (4124): No heartbeat from core client for 30 sec - exiting
08:12:51 (4124): No heartbeat from core client for 30 sec - exiting
08:12:52 (4124): No heartbeat from core client for 30 sec - exiting
08:12:53 (4124): No heartbeat from core client for 30 sec - exiting
08:12:54 (4124): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
08:14:16 (7524): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=7216, iMonCtr=2
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=7380, selfPID=2900, iMonCtr=1
Model crash detected, will try to restart...
Leaving CPDN_Main::Monitor...
Called boinc_finish

</stderr_txt>
<message>
upload failure: <file_xfer_error>
  <file_name>hadam3p_anz_d99w_2012_1_009264638_0_4.zip</file_name>
  <error_code>-161 (not found)</error_code>
</file_xfer_error>
<file_xfer_error>
  <file_name>hadam3p_anz_d99w_2012_1_009264638_0_5.zip</file_name>
  <error_code>-161 (not found)</error_code>
</file_xfer_error>
<file_xfer_error>
  <file_name>hadam3p_anz_d99w_2012_1_009264638_0_6.zip</file_name>
  <error_code>-161 (not found)</error_code>
</file_xfer_error>
<file_xfer_error>
  <file_name>hadam3p_anz_d99w_2012_1_009264638_0_7.zip</file_name>
  <error_code>-161 (not found)</error_code>
</file_xfer_error>
<file_xfer_error>
  <file_name>hadam3p_anz_d99w_2012_1_009264638_0_8.zip</file_name>
  <error_code>-161 (not found)</error_code>
</file_xfer_error>
<file_xfer_error>
  <file_name>hadam3p_anz_d99w_2012_1_009264638_0_9.zip</file_name>
  <error_code>-161 (not found)</error_code>
</file_xfer_error>
<file_xfer_error>
  <file_name>hadam3p_anz_d99w_2012_1_009264638_0_10.zip</file_name>
  <error_code>-161 (not found)</error_code>
</file_xfer_error>
<file_xfer_error>
  <file_name>hadam3p_anz_d99w_2012_1_009264638_0_11.zip</file_name>
  <error_code>-161 (not found)</error_code>
</file_xfer_error>
<file_xfer_error>
  <file_name>hadam3p_anz_d99w_2012_1_009264638_0_12.zip</file_name>
  <error_code>-161 (not found)</error_code>
</file_xfer_error>

</message>
]]>
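The nine upload failures in the message block above all carry the same error code, so they are easy to summarise mechanically. The following is a minimal sketch, not part of BOINC or the CPDN tooling: the regular expression and the variable names (`message`, `pattern`, `failures`) are illustrative assumptions, and the sample text is an excerpt of two of the nine entries from the log.

```python
import re

# Excerpt of the <message> block above (two of the nine entries), copied verbatim.
message = """
upload failure: <file_xfer_error>
  <file_name>hadam3p_anz_d99w_2012_1_009264638_0_4.zip</file_name>
  <error_code>-161 (not found)</error_code>
</file_xfer_error>
<file_xfer_error>
  <file_name>hadam3p_anz_d99w_2012_1_009264638_0_5.zip</file_name>
  <error_code>-161 (not found)</error_code>
</file_xfer_error>
"""

# Hypothetical pattern: a file name followed by a numeric code and its text in parentheses.
pattern = re.compile(
    r"<file_name>(?P<name>[^<]+)</file_name>\s*"
    r"<error_code>(?P<code>-?\d+)\s*\((?P<text>[^)]*)\)</error_code>"
)

# One (file name, numeric code, description) tuple per failed upload.
failures = [(m["name"], int(m["code"]), m["text"]) for m in pattern.finditer(message)]

for name, code, text in failures:
    print(f"{name}: {code} ({text})")
```

Run against the full message block, the same loop would list every result file from `_4.zip` to `_12.zip` with code -161 (not found).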
Latest Trickles Received
Time Sent (UTC)       Host ID  Result ID  Result Name                          Timestep  CPU Time (sec)  Average (sec/TS)
29 Dec 2014 13:19:49  1323943  17529054   hadam3p_anz_d99w_2012_1_009264638_0  34,859    180,982         5.1918
22 Dec 2014 15:12:59  1323943  17529054   hadam3p_anz_d99w_2012_1_009264638_0  23,339    121,153         5.1910
17 Dec 2014 19:38:45  1323943  17529054   hadam3p_anz_d99w_2012_1_009264638_0  11,819    61,302          5.1867
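
The Average (sec/TS) column appears to be the cumulative CPU time divided by the cumulative timestep count. That is an inference from the numbers, not something the page states. A short sketch that checks it against the three trickles above; the function name `sec_per_timestep` is hypothetical:

```python
# Trickle rows copied from the table above:
# (timestep, CPU time in seconds, reported average in sec/TS).
trickles = [
    (34859, 180982, 5.1918),
    (23339, 121153, 5.1910),
    (11819, 61302, 5.1867),
]

def sec_per_timestep(timestep, cpu_seconds):
    """Cumulative CPU seconds divided by cumulative timesteps (assumed definition)."""
    return cpu_seconds / timestep

# Round to four decimals, the precision shown in the table.
computed = [round(sec_per_timestep(ts, cpu), 4) for ts, cpu, _ in trickles]

for (ts, cpu, reported), value in zip(trickles, computed):
    print(f"timestep {ts}: computed {value:.4f}, reported {reported:.4f}")
```

Under that assumption the computed values agree with the reported column to four decimal places for all three rows.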

