Task 19212889

Name hadam3p_anz_k3mh_201212_12_306_010267250_0
Workunit 10267250
Created 25 Jan 2016, 13:23:25 UTC
Sent 25 Jan 2016, 13:25:00 UTC
Report deadline 6 Jan 2017, 18:45:00 UTC
Received 23 Feb 2016, 10:49:56 UTC
Server state Over
Outcome Computation error
Client state Compute error
Exit status 0 (0x00000000)
Computer ID 1137733
Run time 4 days 1 hours 15 min 29 sec
CPU time 3 days 7 hours 34 min 35 sec
Validate state Invalid
Credit 3,490.64
Device peak FLOPS 3.24 GFLOPS
Application version UK Met Office HadAM3P-HadRM3P Australia New Zealand v6.10 windows_intelx86
Stderr
<core_client_version>7.6.22</core_client_version>
<![CDATA[
<stderr_txt>
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=2180, selfPID=1460, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting13:17:21 (5204): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=2500, selfPID=2500, iMonCtr=2
13:17:22 (5204): No heartbeat from core client for 30 sec - exiting
13:17:23 (5204): No heartbeat from core client for 30 sec - exiting
13:17:24 (5204): No heartbeat from core client for 30 sec - exiting
13:19:22 (6776): start_timer_thread(): CreateThread() failed, errno 0
13:19:24 (3616): start_timer_thread(): CreateThread() failed, errno 0
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=6776, iMonCtr=2
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=3616, selfPID=2168, iMonCtr=1
Model crash detected, will try to restart...
Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=1212, iMonCtr=2
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5596, iMonCtr=2
Model crash detected, will try to restart...
Leaving CPDN_Main::Monitor...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=4904, selfPID=1828, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=6112, iMonCtr=2
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=3680, selfPID=5640, iMonCtr=1
Model crash detected, will try to restart...
Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=2716, iMonCtr=2
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3656, iMonCtr=2
Model crash detected, will try to restart...
Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5240, iMonCtr=2
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3616, iMonCtr=2
Model crash detected, will try to restart...
Leaving CPDN_Main::Monitor...
Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4000, iMonCtr=2
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=5656, selfPID=5868, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=3832, selfPID=5232, iMonCtr=1
Model crash detected, will try to restart...
11:03:53 (1524): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=5468, selfPID=4680, iMonCtr=1
Model crash detected, will try to restart...
Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5736, iMonCtr=2
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3704, iMonCtr=2
Model crash detected, will try to restart...
Leaving CPDN_Main::Monitor...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=5696, selfPID=5512, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=4192, selfPID=5632, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=5356, selfPID=5448, iMonCtr=1
Model crash detected, will try to restart...
Leaving CPDN_Main::Monitor...
10:22:50 (6160): start_timer_thread(): CreateThread() failed, errno 0
10:22:51 (6184): start_timer_thread(): CreateThread() failed, errno 0
Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=6160, iMonCtr=2
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=6184, selfPID=3904, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4068, iMonCtr=2
Model crash detected, will try to restart...
Leaving CPDN_Main::Monitor...
GController:: CPDN process is not running, exiting, bRetVal = 1, checkPID=1800, selfPID=5548, iMonCtr=1
Model crash detected, will try to restart...
Leaving CPDN_Main::Monitor...
Colobal Worker:: nPDN ller:: CPDN process is npr ounning, excting, bRetVal = 1, chexiting, bRetlfPID=4780, iMonCtr=2
elodel crash iMonCter=2
will try to restart...
Leaving CPDN_Main::Monitor...
Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5928, iMonCtr=2
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=4556, selfPID=5860, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=3756, selfPID=5728, iMonCtr=1
Model crash detected, will try to restart...
Leaving CPDN_Main::Monitor...
Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=1160, iMonCtr=2
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=2324, selfPID=4944, iMonCtr=1
Model crash detected, will try to restart...
Leaving CPDN_Main::Monitor...
Called boinc_finish

</stderr_txt>
<message>
upload failure: <file_xfer_error>
  <file_name>hadam3p_anz_k3mh_201212_12_306_010267250_0_8.zip</file_name>
  <error_code>-161 (not found)</error_code>
</file_xfer_error>
<file_xfer_error>
  <file_name>hadam3p_anz_k3mh_201212_12_306_010267250_0_9.zip</file_name>
  <error_code>-161 (not found)</error_code>
</file_xfer_error>
<file_xfer_error>
  <file_name>hadam3p_anz_k3mh_201212_12_306_010267250_0_10.zip</file_name>
  <error_code>-161 (not found)</error_code>
</file_xfer_error>
<file_xfer_error>
  <file_name>hadam3p_anz_k3mh_201212_12_306_010267250_0_11.zip</file_name>
  <error_code>-161 (not found)</error_code>
</file_xfer_error>
<file_xfer_error>
  <file_name>hadam3p_anz_k3mh_201212_12_306_010267250_0_12.zip</file_name>
  <error_code>-161 (not found)</error_code>
</file_xfer_error>

</message>
]]>
Latest Trickles Received
Time Sent (UTC) | Host ID | Result ID | Result Name | Timestep | CPU Time (sec) | Average (sec/TS)
19 Feb 2016 10:25:17 | 1137733 | 19212889 | hadam3p_anz_k3mh_201212_12_306_010267250_0 | 80,939 | 262,763 | 3.2464
15 Feb 2016 17:00:24 | 1137733 | 19212889 | hadam3p_anz_k3mh_201212_12_306_010267250_0 | 69,419 | 225,043 | 3.2418
11 Feb 2016 13:08:34 | 1137733 | 19212889 | hadam3p_anz_k3mh_201212_12_306_010267250_0 | 57,899 | 188,121 | 3.2491
04 Feb 2016 12:10:17 | 1137733 | 19212889 | hadam3p_anz_k3mh_201212_12_306_010267250_0 | 46,379 | 151,246 | 3.2611
29 Jan 2016 13:16:05 | 1137733 | 19212889 | hadam3p_anz_k3mh_201212_12_306_010267250_0 | 34,859 | 114,109 | 3.2734
29 Jan 2016 01:58:39 | 1137733 | 19212889 | hadam3p_anz_k3mh_201212_12_306_010267250_0 | 23,339 | 75,967 | 3.2549
28 Jan 2016 15:06:07 | 1137733 | 19212889 | hadam3p_anz_k3mh_201212_12_306_010267250_0 | 11,819 | 37,730 | 3.1923
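
The Average (sec/TS) column appears to be the cumulative CPU time divided by the timestep count. The page does not state this, so treat it as an assumption. A minimal Python sketch that recomputes the column from the rows listed here (the function name `average_sec_per_timestep` is made up for illustration and is not part of BOINC or CPDN):

```python
# Rows copied from the trickle table:
# (timestep, cumulative CPU time in seconds, reported average in sec/TS).
trickles = [
    (80939, 262763, 3.2464),
    (69419, 225043, 3.2418),
    (57899, 188121, 3.2491),
    (46379, 151246, 3.2611),
    (34859, 114109, 3.2734),
    (23339, 75967, 3.2549),
    (11819, 37730, 3.1923),
]


def average_sec_per_timestep(timestep, cpu_seconds):
    """Assumed definition: cumulative CPU seconds divided by timesteps completed."""
    return cpu_seconds / timestep


# Largest gap between the recomputed and the reported averages.
max_diff = max(
    abs(average_sec_per_timestep(ts, cpu) - reported)
    for ts, cpu, reported in trickles
)

for ts, cpu, reported in trickles:
    computed = average_sec_per_timestep(ts, cpu)
    print(f"timestep {ts}: computed {computed:.4f}, reported {reported:.4f}")
```

Under that assumption the recomputed values agree with the reported column to within rounding, which suggests the host ran at a steady rate of roughly 3.2 to 3.3 CPU seconds per timestep up to the last trickle.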

