climateprediction.net home page
Task 14986113

Name hadam3p_eu_a3y8_1991_1_008060008_1
Workunit 8215122
Created 24 Jul 2012, 0:48:15 UTC
Sent 24 Jul 2012, 0:50:06 UTC
Report deadline 6 Jul 2013, 6:10:06 UTC
Received 29 Jul 2012, 23:12:43 UTC
Server state Over
Outcome Computation error
Client state Compute error
Exit status 0 (0x00000000)
Computer ID 1187159
Run time 5 days 16 hours 55 min 42 sec
CPU time 14 hours 53 min 52 sec
Validate state Invalid
Credit 1,591.48
Device peak FLOPS 1.61 GFLOPS
Application version UK Met Office HadAM3P-HadRM3P Europe v6.09 (windows_intelx86)
Stderr
<core_client_version>7.0.28</core_client_version>
<![CDATA[
<stderr_txt>
Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=6360, iMonCtr=2
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4400, iMonCtr=2
Model crash detected, will try to restart...
Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3960, iMonCtr=2
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=5152, selfPID=2796, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=1504, selfPID=5632, iMonCtr=1
Model crash detected, will try to restart...
Leaving CPDN_Main::Monitor...
Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=7948, iMonCtr=2
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=1380, iMonCtr=2
Model crash detected, will try to restart...
Leaving CPDN_Main::Monitor...
16:24:36 (2800): No heartbeat from core client for 30 sec - exiting
16:24:37 (2800): No heartbeat from core client for 30 sec - exiting
16:24:38 (2800): No heartbeat from core client for 30 sec - exiting
16:24:39 (2800): No heartbeat from core client for 30 sec - exiting
16:24:40 (2800): No heartbeat from core client for 30 sec - exiting
16:24:41 (2800): No heartbeat from core client for 30 sec - exiting
16:24:42 (2800): No heartbeat from core client for 30 sec - exiting
16:24:43 (2800): No heartbeat from core client for 30 sec - exiting
16:24:44 (2800): No heartbeat from core client for 30 sec - exiting
16:24:45 (2800): No heartbeat from core client for 30 sec - exiting
16:24:46 (2800): No heartbeat from core client for 30 sec - exiting
16:24:47 (2800): No heartbeat from core client for 30 sec - exiting
16:24:48 (2800): No heartbeat from core client for 30 sec - exiting
16:24:49 (2800): No heartbeat from core client for 30 sec - exiting
16:24:51 (2800): No heartbeat from core client for 30 sec - exiting
16:24:52 (2800): No heartbeat from core client for 30 sec - exiting
16:24:53 (2800): No heartbeat from core client for 30 sec - exiting
16:24:54 (2800): No heartbeat from core client for 30 sec - exiting
16:24:55 (2800): No heartbeat from core client for 30 sec - exiting
16:24:56 (2800): No heartbeat from core client for 30 sec - exiting
16:24:57 (2800): No heartbeat from core client for 30 sec - exiting
16:24:58 (2800): No heartbeat from core client for 30 sec - exiting
16:24:59 (2800): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4428, iMonCtr=2
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5248, iMonCtr=2
Model crash detected, will try to restart...
Leaving CPDN_Main::Monitor...
Called boinc_finish
17:11:18 (5328): No heartbeat from core client for 30 sec - exiting
17:11:19 (5328): No heartbeat from core client for 30 sec - exiting
17:11:20 (5328): No heartbeat from core client for 30 sec - exiting
17:11:21 (5328): No heartbeat from core client for 30 sec - exiting
17:11:22 (5328): No heartbeat from core client for 30 sec - exiting
17:11:23 (5328): No heartbeat from core client for 30 sec - exiting
17:11:24 (5328): No heartbeat from core client for 30 sec - exiting
17:11:25 (5328): No heartbeat from core client for 30 sec - exiting
17:11:26 (5328): No heartbeat from core client for 30 sec - exiting
17:11:27 (5328): No heartbeat from core client for 30 sec - exiting
17:11:28 (5328): No heartbeat from core client for 30 sec - exiting
17:11:29 (5328): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...

</stderr_txt>
<message>
upload failure: <file_xfer_error>
  <file_name>hadam3p_eu_a3y8_1991_1_008060008_1_9.zip</file_name>
  <error_code>-161</error_code>
</file_xfer_error>
<file_xfer_error>
  <file_name>hadam3p_eu_a3y8_1991_1_008060008_1_10.zip</file_name>
  <error_code>-161</error_code>
</file_xfer_error>
<file_xfer_error>
  <file_name>hadam3p_eu_a3y8_1991_1_008060008_1_11.zip</file_name>
  <error_code>-161</error_code>
</file_xfer_error>
<file_xfer_error>
  <file_name>hadam3p_eu_a3y8_1991_1_008060008_1_12.zip</file_name>
  <error_code>-161</error_code>
</file_xfer_error>

</message>
]]>
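The upload-failure message above is not a single well-formed XML document: it starts with free text and contains several sibling `<file_xfer_error>` elements. As a minimal sketch, assuming the message text has been copied into a string, a regular expression is enough to pull out each file name and error code. The names `MESSAGE`, `XFER_ERROR` and `parse_upload_failures` are hypothetical helpers for illustration, not part of BOINC or CPDN.

```python
import re

# Message text copied from the task's <message> section above.
MESSAGE = """upload failure: <file_xfer_error>
  <file_name>hadam3p_eu_a3y8_1991_1_008060008_1_9.zip</file_name>
  <error_code>-161</error_code>
</file_xfer_error>
<file_xfer_error>
  <file_name>hadam3p_eu_a3y8_1991_1_008060008_1_10.zip</file_name>
  <error_code>-161</error_code>
</file_xfer_error>
<file_xfer_error>
  <file_name>hadam3p_eu_a3y8_1991_1_008060008_1_11.zip</file_name>
  <error_code>-161</error_code>
</file_xfer_error>
<file_xfer_error>
  <file_name>hadam3p_eu_a3y8_1991_1_008060008_1_12.zip</file_name>
  <error_code>-161</error_code>
</file_xfer_error>
"""

# One match per <file_xfer_error> element: (file name, error code).
XFER_ERROR = re.compile(
    r"<file_xfer_error>\s*"
    r"<file_name>(.*?)</file_name>\s*"
    r"<error_code>(-?\d+)</error_code>\s*"
    r"</file_xfer_error>"
)


def parse_upload_failures(text):
    """Return a list of (file_name, error_code) tuples found in text."""
    return [(name, int(code)) for name, code in XFER_ERROR.findall(text)]


failures = parse_upload_failures(MESSAGE)
for file_name, error_code in failures:
    print(f"{file_name}: error {error_code}")
```

For this task the sketch lists four failed uploads, the result files numbered 9 through 12, all with the same error code.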
Latest Trickles Received
Time Sent (UTC)       Host ID  Result ID  Result Name                         Timestep  CPU Time (sec)  Average (sec/TS)
29 Jul 2012 00:02:45  1187159  14986113   hadam3p_eu_a3y8_1991_1_008060008_1  92,256    365,233         3.9589
28 Jul 2012 07:53:58  1187159  14986113   hadam3p_eu_a3y8_1991_1_008060008_1  80,736    319,545         3.9579
27 Jul 2012 15:32:32  1187159  14986113   hadam3p_eu_a3y8_1991_1_008060008_1  69,216    274,088         3.9599
27 Jul 2012 01:18:45  1187159  14986113   hadam3p_eu_a3y8_1991_1_008060008_1  57,696    228,417         3.9590
26 Jul 2012 11:26:00  1187159  14986113   hadam3p_eu_a3y8_1991_1_008060008_1  46,176    182,857         3.9600
25 Jul 2012 18:50:34  1187159  14986113   hadam3p_eu_a3y8_1991_1_008060008_1  34,656    136,536         3.9398
25 Jul 2012 05:36:36  1187159  14986113   hadam3p_eu_a3y8_1991_1_008060008_1  23,136    91,099          3.9375
24 Jul 2012 16:21:09  1187159  14986113   hadam3p_eu_a3y8_1991_1_008060008_1  11,616    45,963          3.9569
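
The Average (sec/TS) column in the trickle rows above appears to be the cumulative CPU time divided by the timestep count. That is an inference from the numbers, not something the page states. A minimal sketch that recomputes it from the values listed (the names `TRICKLES` and `sec_per_timestep` are hypothetical):

```python
# (timestep, cumulative CPU seconds, reported average) copied from the trickle rows above.
TRICKLES = [
    (92256, 365233, 3.9589),
    (80736, 319545, 3.9579),
    (69216, 274088, 3.9599),
    (57696, 228417, 3.9590),
    (46176, 182857, 3.9600),
    (34656, 136536, 3.9398),
    (23136, 91099, 3.9375),
    (11616, 45963, 3.9569),
]


def sec_per_timestep(cpu_seconds, timestep):
    """Average CPU seconds spent per model timestep (assumed definition)."""
    return cpu_seconds / timestep


for timestep, cpu_seconds, reported in TRICKLES:
    computed = sec_per_timestep(cpu_seconds, timestep)
    print(f"timestep {timestep:>6}: computed {computed:.4f}, reported {reported:.4f}")
```

Under that assumption, every recomputed value agrees with the reported column to within rounding.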