climateprediction.net home page
Task 14911571


Name hadam3p_pnw_b8hk_1960_1_008045271_1
Workunit 8200385
Created 13 Jul 2012, 9:58:53 UTC
Sent 13 Jul 2012, 10:13:01 UTC
Report deadline 25 Jun 2013, 15:33:01 UTC
Received 28 Jul 2012, 20:58:42 UTC
Server state Over
Outcome Computation error
Client state Compute error
Exit status 0 (0x00000000)
Computer ID 1177318
Run time 5 days 1 hours 24 min 53 sec
CPU time 4 days 16 hours 34 min 17 sec
Validate state Invalid
Credit 2,254.93
Device peak FLOPS 2.43 GFLOPS
Application version UK Met Office HadAM3P-HadRM3P Pacific North West v6.09
windows_intelx86
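The Run time and CPU time fields above can be compared directly once both are converted to seconds. The following is a minimal sketch, not part of the BOINC or CPDN software: the helper name `duration_to_seconds` is hypothetical, and it assumes durations are always written with the unit words `days`, `hours`, `min` and `sec` as they appear on this page.

```python
import re

# Seconds per unit word, matching the spelling used on this page (assumption).
UNIT_SECONDS = {"days": 86400, "hours": 3600, "min": 60, "sec": 1}


def duration_to_seconds(text):
    """Convert a string like '5 days 1 hours 24 min 53 sec' to seconds."""
    total = 0
    for value, unit in re.findall(r"(\d+)\s+(days|hours|min|sec)", text):
        total += int(value) * UNIT_SECONDS[unit]
    return total


# Values copied from the Run time and CPU time fields of this task.
run_time = duration_to_seconds("5 days 1 hours 24 min 53 sec")
cpu_time = duration_to_seconds("4 days 16 hours 34 min 17 sec")

# Fraction of wall-clock run time that was spent on the CPU.
cpu_fraction = cpu_time / run_time

print(run_time, cpu_time, round(cpu_fraction, 3))
```

For this task the run time works out to 437,093 seconds and the CPU time to 405,257 seconds, so roughly 93% of the elapsed run time was CPU time.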
Stderr
<core_client_version>7.0.25</core_client_version>
<![CDATA[
<stderr_txt>
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
CPDN Monitor - Quit request from BOINC...
Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=7728, iMonCtr=2
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=4292, selfPID=9060, iMonCtr=1
Model crash detected, will try to restart...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=7252, selfPID=2804, iMonCtr=1
Model crash detected, will try to restart...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=1752, selfPID=3908, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=2852, selfPID=4760, iMonCtr=1
Model crash detected, will try to restart...
CPDN Monitor - Quit request from BOINC...
15:00:03 (5152): No heartbeat from core client for 30 sec - exiting
15:00:04 (5152): No heartbeat from core client for 30 sec - exiting
15:00:05 (5152): No heartbeat from core client for 30 sec - exiting
15:00:06 (5152): No heartbeat from core client for 30 sec - exiting
15:00:07 (5152): No heartbeat from core client for 30 sec - exiting
15:00:08 (5152): No heartbeat from core client for 30 sec - exiting
15:00:09 (5152): No heartbeat from core client for 30 sec - exiting
15:00:10 (5152): No heartbeat from core client for 30 sec - exiting
15:00:11 (5152): No heartbeat from core client for 30 sec - exiting
15:00:12 (5152): No heartbeat from core client for 30 sec - exiting
15:00:13 (5152): No heartbeat from core client for 30 sec - exiting
15:00:14 (5152): No heartbeat from core client for 30 sec - exiting
15:00:15 (5152): No heartbeat from core client for 30 sec - exiting
15:00:16 (5152): No heartbeat from core client for 30 sec - exiting
15:00:17 (5152): No heartbeat from core client for 30 sec - exiting
15:00:18 (5152): No heartbeat from core client for 30 sec - exiting
15:00:19 (5152): No heartbeat from core client for 30 sec - exiting
15:00:20 (5152): No heartbeat from core client for 30 sec - exiting
15:00:21 (5152): No heartbeat from core client for 30 sec - exiting
15:00:22 (5152): No heartbeat from core client for 30 sec - exiting
15:00:23 (5152): No heartbeat from core client for 30 sec - exiting
15:00:24 (5152): No heartbeat from core client for 30 sec - exiting
15:00:25 (5152): No heartbeat from core client for 30 sec - exiting
15:00:26 (5152): No heartbeat from core client for 30 sec - exiting
15:00:27 (5152): No heartbeat from core client for 30 sec - exiting
15:00:28 (5152): No heartbeat from core client for 30 sec - exiting
15:00:29 (5152): No heartbeat from core client for 30 sec - exiting
15:00:30 (5152): No heartbeat from core client for 30 sec - exiting
15:00:31 (5152): No heartbeat from core client for 30 sec - exiting
15:00:32 (5152): No heartbeat from core client for 30 sec - exiting
15:00:33 (5152): No heartbeat from core client for 30 sec - exiting
15:00:34 (5152): No heartbeat from core client for 30 sec - exiting
15:00:35 (5152): No heartbeat from core client for 30 sec - exiting
15:00:36 (5152): No heartbeat from core client for 30 sec - exiting
15:00:37 (5152): No heartbeat from core client for 30 sec - exiting
15:00:38 (5152): No heartbeat from core client for 30 sec - exiting
15:00:39 (5152): No heartbeat from core client for 30 sec - exiting
15:00:40 (5152): No heartbeat from core client for 30 sec - exiting
15:00:41 (5152): No heartbeat from core client for 30 sec - exiting
15:00:42 (5152): No heartbeat from core client for 30 sec - exiting
15:00:43 (5152): No heartbeat from core client for 30 sec - exiting
15:00:44 (5152): No heartbeat from core client for 30 sec - exiting
15:00:45 (5152): No heartbeat from core client for 30 sec - exiting
15:00:46 (5152): No heartbeat from core client for 30 sec - exiting
15:00:47 (5152): No heartbeat from core client for 30 sec - exiting
15:00:48 (5152): No heartbeat from core client for 30 sec - exiting
15:00:49 (5152): No heartbeat from core client for 30 sec - exiting
15:00:50 (5152): No heartbeat from core client for 30 sec - exiting
15:00:51 (5152): No heartbeat from core client for 30 sec - exiting
15:00:52 (5152): No heartbeat from core client for 30 sec - exiting
15:00:53 (5152): No heartbeat from core client for 30 sec - exiting
15:00:54 (5152): No heartbeat from core client for 30 sec - exiting
15:00:55 (5152): No heartbeat from core client for 30 sec - exiting
15:00:56 (5152): No heartbeat from core client for 30 sec - exiting
15:00:57 (5152): No heartbeat from core client for 30 sec - exiting
15:00:58 (5152): No heartbeat from core client for 30 sec - exiting
15:00:59 (5152): No heartbeat from core client for 30 sec - exiting
15:01:00 (5152): No heartbeat from core client for 30 sec - exiting
15:01:01 (5152): No heartbeat from core client for 30 sec - exiting
15:01:02 (5152): No heartbeat from core client for 30 sec - exiting
15:01:03 (5152): No heartbeat from core client for 30 sec - exiting
15:01:04 (5152): No heartbeat from core client for 30 sec - exiting
15:01:05 (5152): No heartbeat from core client for 30 sec - exiting
15:01:06 (5152): No heartbeat from core client for 30 sec - exiting
15:01:07 (5152): No heartbeat from core client for 30 sec - exiting
15:01:08 (5152): No heartbeat from core client for 30 sec - exiting
15:01:09 (5152): No heartbeat from core client for 30 sec - exiting
15:01:10 (5152): No heartbeat from core client for 30 sec - exiting
15:01:11 (5152): No heartbeat from core client for 30 sec - exiting
15:01:12 (5152): No heartbeat from core client for 30 sec - exiting
15:01:13 (5152): No heartbeat from core client for 30 sec - exiting
15:01:14 (5152): No heartbeat from core client for 30 sec - exiting
15:01:15 (5152): No heartbeat from core client for 30 sec - exiting
15:01:16 (5152): No heartbeat from core client for 30 sec - exiting
15:01:17 (5152): No heartbeat from core client for 30 sec - exiting
15:01:18 (5152): No heartbeat from core client for 30 sec - exiting
15:01:19 (5152): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=8092, selfPID=6488, iMonCtr=1
Model crash detected, will try to restart...
CSuspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=7676, selfPID=3636, iMonCtr=1
Model crash detected, will try to restart...
CPDN Monitor - Quit request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=4936, selfPID=5856, iMonCtr=1
Model crash detected, will try to restart...
Leaving CPDN_Main::Monitor...
Regional yearly means requires 12 input files got 6
08:50:19 (2976): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
CPDN Monitor - Quit request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=7232, selfPID=3608, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=7996, selfPID=7156, iMonCtr=1
Model crash detected, will try to restart...
Suspended CPDN Monitor - Suspend request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=6516, selfPID=5348, iMonCtr=1
Model crash detected, will try to restart...
Leaving CPDN_Main::Monitor...
Regional yearly means requires 12 input files got 9
cpdnmonitor: cannot open input file G:\ProgramData\BOINC/projects/climateprediction.net/hadam3p_pnw_b8hk_1960_1_008045271/dataout/atmos_restart.day after 11 attempts
cpdnmonitor: cannot open input file G:\ProgramData\BOINC/projects/climateprediction.net/hadam3p_pnw_b8hk_1960_1_008045271/dataout/region_restart.day after 11 attempts


odel crcrased: REAEADHIST: End of file in READ  from historytfioerfo file forst NLIHIt NLIHI   O                                                                                                                                                                                           tmp/xaakgaakm.pipe_dum                                                             048   8 
   
Leaving CPDN_Main::Monitor...
Regional yearly means requires 12 input files got 0
Called boinc_finish

</stderr_txt>
<message>
upload failure: <file_xfer_error>
  <file_name>hadam3p_pnw_b8hk_1960_1_008045271_1_10.zip</file_name>
  <error_code>-161</error_code>
</file_xfer_error>
<file_xfer_error>
  <file_name>hadam3p_pnw_b8hk_1960_1_008045271_1_11.zip</file_name>
  <error_code>-161</error_code>
</file_xfer_error>
<file_xfer_error>
  <file_name>hadam3p_pnw_b8hk_1960_1_008045271_1_12.zip</file_name>
  <error_code>-161</error_code>
</file_xfer_error>

</message>
]]>
Latest Trickles Received
Time Sent (UTC)       Host ID  Result ID  Result Name                          Timestep  CPU Time (sec)  Average (sec/TS)
28 Jul 2012 15:50:26  1177318  14911571   hadam3p_pnw_b8hk_1960_1_008045271_1   103,776         387,350            3.7326
27 Jul 2012 18:52:54  1177318  14911571   hadam3p_pnw_b8hk_1960_1_008045271_1    92,256         344,609            3.7354
26 Jul 2012 17:48:27  1177318  14911571   hadam3p_pnw_b8hk_1960_1_008045271_1    80,736         303,293            3.7566
23 Jul 2012 20:40:54  1177318  14911571   hadam3p_pnw_b8hk_1960_1_008045271_1    69,216         262,139            3.7873
19 Jul 2012 16:38:18  1177318  14911571   hadam3p_pnw_b8hk_1960_1_008045271_1    57,696         218,615            3.7891
19 Jul 2012 01:34:45  1177318  14911571   hadam3p_pnw_b8hk_1960_1_008045271_1    46,176         174,635            3.7819
18 Jul 2012 12:04:47  1177318  14911571   hadam3p_pnw_b8hk_1960_1_008045271_1    34,656         130,302            3.7599
16 Jul 2012 18:34:53  1177318  14911571   hadam3p_pnw_b8hk_1960_1_008045271_1    23,136          87,216            3.7697
16 Jul 2012 05:35:51  1177318  14911571   hadam3p_pnw_b8hk_1960_1_008045271_1    11,620          45,396            3.9067
15 Jul 2012 20:48:13  1177318  14911571   hadam3p_pnw_b8hk_1960_1_008045271_1    11,616          44,909            3.8661
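The Average (sec/TS) column in the trickle table appears to be the cumulative CPU time divided by the timestep count, rounded to four decimals. The sketch below checks that reading against the rows listed above. The function name `sec_per_timestep` is hypothetical, and the formula is inferred from the numbers on this page rather than taken from CPDN documentation.

```python
# Rows copied from the trickle table: (timestep, cpu_time_sec, reported_average).
trickles = [
    (103_776, 387_350, 3.7326),
    (92_256, 344_609, 3.7354),
    (80_736, 303_293, 3.7566),
    (69_216, 262_139, 3.7873),
    (57_696, 218_615, 3.7891),
    (46_176, 174_635, 3.7819),
    (34_656, 130_302, 3.7599),
    (23_136, 87_216, 3.7697),
    (11_620, 45_396, 3.9067),
    (11_616, 44_909, 3.8661),
]


def sec_per_timestep(timestep, cpu_time_sec):
    """Average CPU seconds per model timestep (assumed formula)."""
    return cpu_time_sec / timestep


# Compare the computed average with the value reported in the table.
mismatches = []
for timestep, cpu_time_sec, reported in trickles:
    computed = sec_per_timestep(timestep, cpu_time_sec)
    if abs(computed - reported) >= 1e-4:
        mismatches.append((timestep, computed, reported))
    print(f"{timestep:>7}  {computed:.4f}  {reported:.4f}")

print("rows that disagree with the table:", len(mismatches))
```

Every row agrees to within rounding, which supports reading the column as cumulative CPU seconds per timestep rather than a per-interval rate.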

