climateprediction.net home page
Task 11821486

Task 11821486

Name hadam3p_pnw_v30n_1982_1_006677541_1
Workunit 6880794
Created 26 Aug 2010, 17:27:41 UTC
Sent 30 Aug 2010, 21:26:50 UTC
Report deadline 13 Aug 2011, 2:46:50 UTC
Received 18 Oct 2010, 22:50:55 UTC
Server state Over
Outcome Computation error
Client state Compute error
Exit status 0 (0x00000000)
Computer ID 893103
Run time
CPU time 4 days 8 hours 53 min 26 sec
Validate state Invalid
Credit 1,754.30
Device peak FLOPS 1.17 GFLOPS
Application version UK Met Office HadAM3P-HadRM3P Pacific North West v6.05
windows_intelx86
Stderr
<core_client_version>6.2.18</core_client_version>
<![CDATA[
<stderr_txt>
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=3140, selfPID=4956, iMonCtr=1
Model crash detected, will try to restart...
CPDN Monitor - Quit request from BOINC...
GGlobal Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5836, iMonCtr=2
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=5852, selfPID=5588, iMonCtr=1
Model crash detected, will try to restart...
Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5728, iMonCtr=2
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5520, iMonCtr=2
Model crash detected, will try to restart...
CPDN Monitor - Quit request from BOINC...
GController:: CPDN process is not running, exiting, bRetVal = 1, checkPID=5944, selfPID=5716, iMonCtr=1
Model crash detected, will try to restart...
Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4184, iMonCtr=2
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5208, iMonCtr=2
Model crash detected, will try to restart...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5228, iMonCtr=2
Model crash detected, will try to restart...
GController:: CPDN process is not running, exiting, bRetVal = 1, checkPID=5424, selfPID=5196, iMonCtr=1
Model crash detected, will try to restart...
CPDN Monitor - Quit request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=2180, selfPID=2352, iMonCtr=1
Model crash detected, will try to restart...
CPDN Monitor - Quit request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=1512, selfPID=4960, iMonCtr=1
Model crash detected, will try to restart...
Leaving CPDN_Main::Monitor...
GCPDN Monitor - Quit request from BOINC...
22:42:07 (5740): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
22:42:08 (5740): No heartbeat from core client for 30 sec - exiting

Model crashed: READHIST: End of file in READ from history file for namelist NLIHISTO                                                                                                                                                                                           tmp/xaakm.pipe_dummy                                                            2048    
Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=0, iMonCtr=2
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=4944, selfPID=6132, iMonCtr=1
Model crash detected, will try to restart...
Leaving CPDN_Main::Monitor...
Regional yearly means requires 12 input files got 7
23:49:34 (6132): called boinc_finish

</stderr_txt>
<message>
<file_xfer_error>
  <file_name>hadam3p_pnw_v30n_1982_1_006677541_1_8.zip</file_name>
  <error_code>-161</error_code>
</file_xfer_error>
<file_xfer_error>
  <file_name>hadam3p_pnw_v30n_1982_1_006677541_1_9.zip</file_name>
  <error_code>-161</error_code>
</file_xfer_error>
<file_xfer_error>
  <file_name>hadam3p_pnw_v30n_1982_1_006677541_1_10.zip</file_name>
  <error_code>-161</error_code>
</file_xfer_error>
<file_xfer_error>
  <file_name>hadam3p_pnw_v30n_1982_1_006677541_1_11.zip</file_name>
  <error_code>-161</error_code>
</file_xfer_error>
<file_xfer_error>
  <file_name>hadam3p_pnw_v30n_1982_1_006677541_1_12.zip</file_name>
  <error_code>-161</error_code>
</file_xfer_error>

</message>
]]>
Latest Trickles Received
Time Sent (UTC) Host ID Result ID Result Name Timestep CPU Time (sec) Average (sec/TS)
17 Oct 2010 14:58:46 893103 11821486 hadam3p_pnw_v30n_1982_1_006677541_1 80,736 364,288 4.5121
14 Oct 2010 19:09:42 893103 11821486 hadam3p_pnw_v30n_1982_1_006677541_1 69,216 311,763 4.5042
10 Oct 2010 10:05:31 893103 11821486 hadam3p_pnw_v30n_1982_1_006677541_1 57,696 259,929 4.5051
05 Oct 2010 20:34:52 893103 11821486 hadam3p_pnw_v30n_1982_1_006677541_1 46,176 209,088 4.5281
02 Oct 2010 20:16:57 893103 11821486 hadam3p_pnw_v30n_1982_1_006677541_1 34,656 157,052 4.5317
29 Sep 2010 18:30:15 893103 11821486 hadam3p_pnw_v30n_1982_1_006677541_1 23,136 105,930 4.5786
26 Sep 2010 08:59:37 893103 11821486 hadam3p_pnw_v30n_1982_1_006677541_1 11,616 52,821 4.5473


©2024 cpdn.org