Task 15156408

Name hadam3p_eu_96p6_1965_1_008157992_0
Workunit 8313116
Created 20 Aug 2012, 11:18:48 UTC
Sent 21 Aug 2012, 19:51:25 UTC
Report deadline 4 Aug 2013, 1:11:25 UTC
Received 2 Sep 2012, 0:53:35 UTC
Server state Over
Outcome Computation error
Client state Compute error
Exit status 0 (0x00000000)
Computer ID 1233103
Run time 2 days 2 hours 43 min 17 sec
CPU time 1 days 19 hours 43 min 44 sec
Validate state Invalid
Credit 1,591.64
Device peak FLOPS 3.57 GFLOPS
Application version UK Met Office HadAM3P-HadRM3P Europe v6.09 (windows_intelx86)
Stderr
<core_client_version>6.12.34</core_client_version>
<![CDATA[
<stderr_txt>
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
22:25:57 (1748): No heartbeat from core client for 30 sec - exiting
22:25:59 (1748): No heartbeat from core client for 30 sec - exiting
22:26:00 (1748): No heartbeat from core client for 30 sec - exiting
22:26:01 (1748): No heartbeat from core client for 30 sec - exiting
22:26:02 (1748): No heartbeat from core client for 30 sec - exiting
22:26:03 (1748): No heartbeat from core client for 30 sec - exiting
22:26:04 (1748): No heartbeat from core client for 30 sec - exiting
22:26:05 (1748): No heartbeat from core client for 30 sec - exiting
22:26:06 (1748): No heartbeat from core client for 30 sec - exiting
22:26:07 (1748): No heartbeat from core client for 30 sec - exiting
22:26:08 (1748): No heartbeat from core client for 30 sec - exiting
22:26:09 (1748): No heartbeat from core client for 30 sec - exiting
22:26:11 (1748): No heartbeat from core client for 30 sec - exiting
22:26:12 (1748): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
22:26:13 (1748): No heartbeat from core client for 30 sec - exiting
22:26:14 (1748): No heartbeat from core client for 30 sec - exiting
09:37:55 (5028): No heartbeat from core client for 30 sec - exiting
09:37:56 (5028): No heartbeat from core client for 30 sec - exiting
09:37:57 (5028): No heartbeat from core client for 30 sec - exiting
09:37:58 (5028): No heartbeat from core client for 30 sec - exiting
09:37:59 (5028): No heartbeat from core client for 30 sec - exiting
09:38:00 (5028): No heartbeat from core client for 30 sec - exiting
09:38:01 (5028): No heartbeat from core client for 30 sec - exiting
09:38:02 (5028): No heartbeat from core client for 30 sec - exiting
09:38:03 (5028): No heartbeat from core client for 30 sec - exiting
09:38:04 (5028): No heartbeat from core client for 30 sec - exiting
09:38:05 (5028): No heartbeat from core client for 30 sec - exiting
09:38:06 (5028): No heartbeat from core client for 30 sec - exiting
09:38:07 (5028): No heartbeat from core client for 30 sec - exiting
09:38:09 (5028): No heartbeat from core client for 30 sec - exiting
09:38:10 (5028): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=3572, selfPID=3752, iMonCtr=1
Model crash detected, will try to restart...
Leaving CPDN_Main::Monitor...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=1568, selfPID=2572, iMonCtr=1
Model crash detected, will try to restart...
Leaving CPDN_Main::Monitor...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
17:41:50 (4292): No heartbeat from core client for 30 sec - exiting
17:41:51 (4292): No heartbeat from core client for 30 sec - exiting
17:41:52 (4292): No heartbeat from core client for 30 sec - exiting
17:41:53 (4292): No heartbeat from core client for 30 sec - exiting
17:41:54 (4292): No heartbeat from core client for 30 sec - exiting
17:41:55 (4292): No heartbeat from core client for 30 sec - exiting
17:41:56 (4292): No heartbeat from core client for 30 sec - exiting
17:41:57 (4292): No heartbeat from core client for 30 sec - exiting
17:41:58 (4292): No heartbeat from core client for 30 sec - exiting
17:41:59 (4292): No heartbeat from core client for 30 sec - exiting
17:42:00 (4292): No heartbeat from core client for 30 sec - exiting
17:42:02 (4292): No heartbeat from core client for 30 sec - exiting
17:42:03 (4292): No heartbeat from core client for 30 sec - exiting
17:42:04 (4292): No heartbeat from core client for 30 sec - exiting
17:42:05 (4292): No heartbeat from core client for 30 sec - exiting
17:42:06 (4292): No heartbeat from core client for 30 sec - exiting
17:42:07 (4292): No heartbeat from core client for 30 sec - exiting
17:42:08 (4292): No heartbeat from core client for 30 sec - exiting
17:42:09 (4292): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4800, iMonCtr=2
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=5092, selfPID=1544, iMonCtr=1
Model crash detected, will try to restart...
Leaving CPDN_Main::Monitor...
21:38:48 (3852): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
21:38:49 (3852): No heartbeat from core client for 30 sec - exiting

Model crashed: 
Leaving CPDN_Main::Monitor...
Called boinc_finish

</stderr_txt>
<message>
upload failure: <file_xfer_error>
  <file_name>hadam3p_eu_96p6_1965_1_008157992_0_9.zip</file_name>
  <error_code>-161</error_code>
</file_xfer_error>
<file_xfer_error>
  <file_name>hadam3p_eu_96p6_1965_1_008157992_0_10.zip</file_name>
  <error_code>-161</error_code>
</file_xfer_error>
<file_xfer_error>
  <file_name>hadam3p_eu_96p6_1965_1_008157992_0_11.zip</file_name>
  <error_code>-161</error_code>
</file_xfer_error>
<file_xfer_error>
  <file_name>hadam3p_eu_96p6_1965_1_008157992_0_12.zip</file_name>
  <error_code>-161</error_code>
</file_xfer_error>

</message>
]]>
Latest Trickles Received
Time Sent (UTC)      | Host ID | Result ID | Result Name                        | Timestep | CPU Time (sec) | Average (sec/TS)
31 Aug 2012 17:26:11 | 1233103 | 15156408  | hadam3p_eu_96p6_1965_1_008157992_0 | 92,265   | 140,811        | 1.5262
31 Aug 2012 17:26:11 | 1233103 | 15156408  | hadam3p_eu_96p6_1965_1_008157992_0 | 92,256   | 140,590        | 1.5239
30 Aug 2012 21:42:29 | 1233103 | 15156408  | hadam3p_eu_96p6_1965_1_008157992_0 | 80,736   | 123,268        | 1.5268
27 Aug 2012 03:10:08 | 1233103 | 15156408  | hadam3p_eu_96p6_1965_1_008157992_0 | 69,216   | 105,691        | 1.5270
27 Aug 2012 03:10:08 | 1233103 | 15156408  | hadam3p_eu_96p6_1965_1_008157992_0 | 57,696   | 87,995         | 1.5251
25 Aug 2012 17:55:40 | 1233103 | 15156408  | hadam3p_eu_96p6_1965_1_008157992_0 | 46,176   | 70,404         | 1.5247
25 Aug 2012 17:55:40 | 1233103 | 15156408  | hadam3p_eu_96p6_1965_1_008157992_0 | 34,656   | 52,882         | 1.5259
23 Aug 2012 13:42:30 | 1233103 | 15156408  | hadam3p_eu_96p6_1965_1_008157992_0 | 23,136   | 35,170         | 1.5201
23 Aug 2012 13:42:30 | 1233103 | 15156408  | hadam3p_eu_96p6_1965_1_008157992_0 | 11,616   | 17,828         | 1.5348
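
The Average (sec/TS) column appears to be the CPU time divided by the timestep count, rounded to four decimals. The page does not state this; it is inferred from the numbers. A minimal Python sketch that rechecks each row under that assumption follows. The names TRICKLES, seconds_per_timestep and check_trickles are hypothetical helpers introduced for illustration and are not part of BOINC or CPDN.

```python
# Rows copied from the trickle table above, thousands separators removed:
# (timestep, cpu_time_sec, reported_average_sec_per_ts)
TRICKLES = [
    (92265, 140811, 1.5262),
    (92256, 140590, 1.5239),
    (80736, 123268, 1.5268),
    (69216, 105691, 1.5270),
    (57696, 87995, 1.5251),
    (46176, 70404, 1.5247),
    (34656, 52882, 1.5259),
    (23136, 35170, 1.5201),
    (11616, 17828, 1.5348),
]


def seconds_per_timestep(timestep, cpu_time_sec):
    """Assumed formula: cumulative CPU seconds per model timestep, 4 decimals."""
    return round(cpu_time_sec / timestep, 4)


def check_trickles(rows):
    """Return (timestep, computed_average, reported_average) for each row."""
    return [
        (timestep, seconds_per_timestep(timestep, cpu), reported)
        for timestep, cpu, reported in rows
    ]


if __name__ == "__main__":
    for timestep, computed, reported in check_trickles(TRICKLES):
        status = "ok" if abs(computed - reported) < 1e-9 else "MISMATCH"
        print(f"{timestep:>6}  computed={computed:.4f}  reported={reported:.4f}  {status}")
```

Under this assumption every row matches its reported value, so the model ran at a steady rate of roughly 1.52 to 1.53 CPU seconds per timestep up to the last trickle.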

