climateprediction.net home page
Task 18520117

Task 18520117

Name hadam3p_anz_r40c_2012_1_008734810_2
Workunit 8880788
Created 1 Jun 2015, 2:50:17 UTC
Sent 3 Jun 2015, 13:08:51 UTC
Report deadline 15 May 2016, 18:28:51 UTC
Received 25 Jun 2015, 12:36:03 UTC
Server state Over
Outcome Computation error
Client state Compute error
Exit status 0 (0x00000000)
Computer ID 1353395
Run time 5 days 6 hours 18 min 35 sec
CPU time 5 days 0 hours 34 min 15 sec
Validate state Invalid
Credit 4,484.28
Device peak FLOPS 4.15 GFLOPS
Application version UK Met Office HadAM3P-HadRM3P Australia New Zealand v6.10
windows_intelx86
Stderr
<core_client_version>7.4.42</core_client_version>
<![CDATA[
<stderr_txt>
17:19:12 (5360): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=5864, selfPID=5864, iMonCtr=2
17:25:00 (5820): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5732, iMonCtr=2
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5224, iMonCtr=2
Model crash detected, will try to restart...
CPDN Monitor - Quit request from BOINC...
Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5456, iMonCtr=2
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=5612, selfPID=4644, iMonCtr=1
Model crash detected, will try to restart...
Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5260, iMonCtr=2
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=5296, selfPID=5080, iMonCtr=1
Model crash detected, will try to restart...
Leaving CPDN_Main::Monitor...
Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=840, iMonCtr=2
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5080, iMonCtr=2
Model crash detected, will try to restart...
Leaving CPDN_Main::Monitor...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=604, selfPID=3184, iMonCtr=1
Model crash detected, will try to restart...
Leaving CPDN_Main::Monitor...
Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=1872, iMonCtr=2
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=2104, selfPID=4804, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4924, iMonCtr=2
Model crash detected, will try to restart...
Leaving CPDN_Main::Monitor...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4948, iMonCtr=2
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5016, iMonCtr=2
Model crash detected, will try to restart...
Leaving CPDN_Main::Monitor...
Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=1548, iMonCtr=2
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=1200, selfPID=5072, iMonCtr=1
Model crash detected, will try to restart...
Leaving CPDN_Main::Monitor...
GController:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4868, iMonCtr=2
Model crash detected, will try to restart...
Leaving CPDN_Main::Monitor...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=5748, selfPID=3440, iMonCtr=1
Model crash detected, will try to restart...
CPDN Monitor - Quit request from BOINC...
Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=2220, selfPID=2220, iMonCtr=2
Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5344, iMonCtr=2
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=5368, selfPID=4000, iMonCtr=1
Model crash detected, will try to restart...
Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=2356, iMonCtr=2
Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5660, iMonCtr=2
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=5912, selfPID=4860, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=2300, selfPID=4596, iMonCtr=1
Model crash detected, will try to restart...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=2600, iMonCtr=2
Model crash detected, will try to restart...

Model crashed: READHIST: End of file in READ from history file for namelist NLIHISTO                                                                                                                                                                                           tmp/xaakm.pipe_dummy                                                            2048    
Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=0, iMonCtr=2
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=1860, selfPID=4508, iMonCtr=1
Model crash detected, will try to restart...
Leaving CPDN_Main::Monitor...
Called boinc_finish

</stderr_txt>
<message>
upload failure: <file_xfer_error>
  <file_name>hadam3p_anz_r40c_2012_1_008734810_2_10.zip</file_name>
  <error_code>-161 (not found)</error_code>
</file_xfer_error>
<file_xfer_error>
  <file_name>hadam3p_anz_r40c_2012_1_008734810_2_11.zip</file_name>
  <error_code>-161 (not found)</error_code>
</file_xfer_error>
<file_xfer_error>
  <file_name>hadam3p_anz_r40c_2012_1_008734810_2_12.zip</file_name>
  <error_code>-161 (not found)</error_code>
</file_xfer_error>

</message>
]]>
Latest Trickles Received
Time Sent (UTC) Host ID Result ID Result Name Timestep CPU Time (sec) Average (sec/TS)
24 Jun 2015 17:34:44 1353395 18520117 hadam3p_anz_r40c_2012_1_008734810_2 103,979 423,574 4.0736
23 Jun 2015 13:32:44 1353395 18520117 hadam3p_anz_r40c_2012_1_008734810_2 92,459 378,556 4.0943
18 Jun 2015 20:57:57 1353395 18520117 hadam3p_anz_r40c_2012_1_008734810_2 80,939 314,778 3.8891
17 Jun 2015 16:55:59 1353395 18520117 hadam3p_anz_r40c_2012_1_008734810_2 69,419 268,919 3.8739
16 Jun 2015 12:01:26 1353395 18520117 hadam3p_anz_r40c_2012_1_008734810_2 57,899 220,812 3.8137
11 Jun 2015 20:31:57 1353395 18520117 hadam3p_anz_r40c_2012_1_008734810_2 46,379 165,713 3.5730
09 Jun 2015 20:36:43 1353395 18520117 hadam3p_anz_r40c_2012_1_008734810_2 34,859 124,007 3.5574
08 Jun 2015 18:37:33 1353395 18520117 hadam3p_anz_r40c_2012_1_008734810_2 23,339 83,735 3.5878
04 Jun 2015 16:32:34 1353395 18520117 hadam3p_anz_r40c_2012_1_008734810_2 11,819 41,296 3.4940


©2024 climateprediction.net