climateprediction.net home page
Task 13955085

Task 13955085

Name hadam3p_eu_xklx_1981_1_007156628_1
Workunit 7341408
Created 23 Jan 2012, 15:29:51 UTC
Sent 23 Jan 2012, 15:30:37 UTC
Report deadline 4 Jan 2013, 20:50:37 UTC
Received 20 Feb 2012, 15:13:29 UTC
Server state Over
Outcome Computation error
Client state Compute error
Exit status 0 (0x00000000)
Computer ID 1149100
Run time 3 days 4 hours 13 min 19 sec
CPU time 2 days 20 hours 53 min 59 sec
Validate state Invalid
Credit 1,194.02
Device peak FLOPS 2.40 GFLOPS
Application version UK Met Office HadAM3P-HadRM3P Europe v6.09
windows_intelx86
Stderr
<core_client_version>6.10.18</core_client_version>
<![CDATA[
<stderr_txt>
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
15:32:18 (5292): No heartbeat from core client for 30 sec - exiting
15:32:19 (5292): No heartbeat from core client for 30 sec - exiting
15:32:20 (5292): No heartbeat from core client for 30 sec - exiting
15:32:21 (5292): No heartbeat from core client for 30 sec - exiting
15:32:22 (5292): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=5388, selfPID=5388, iMonCtr=2
CPDN Monitor - Quit request from BOINC...
Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=4124, selfPID=4124, iMonCtr=2
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=1056, selfPID=5296, iMonCtr=1
Model crash detected, will try to restart...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...

zip error: Could not create output file (was replacing the original zip file)
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=5292, selfPID=5816, iMonCtr=1
Model crash detected, will try to restart...
Leaving CPDN_Main::Monitor...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=1184, selfPID=6108, iMonCtr=1
Model crash detected, will try to restart...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...

zip error: Could not create output file (was replacing the original zip file)
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
GController:: CPDN process is not running, exiting, bRetVal = 1, checkPID=6672, selfPID=3196, iMonCtr=1
Model crash detected, will try to restart...
Leaving CPDN_Main::Monitor...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=1920, selfPID=5368, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=6724, selfPID=5432, iMonCtr=1
Model crash detected, will try to restart...
Leaving CPDN_Main::Monitor...
Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=6464, iMonCtr=2
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=3248, selfPID=1592, iMonCtr=1
Model crash detected, will try to restart...
CPDN Monitor - Quit request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=5596, selfPID=5500, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=5420, selfPID=4144, iMonCtr=1
Model crash detected, will try to restart...
CPDN Monitor - Quit request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4428, iMonCtr=2
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=6100, selfPID=4960, iMonCtr=1
Model crash detected, will try to restart...
Leaving CPDN_Main::Monitor...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=5764, selfPID=1144, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=1260, selfPID=4540, iMonCtr=1
Model crash detected, will try to restart...
Leaving CPDN_Main::Monitor...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=4312, selfPID=4668, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=5116, selfPID=4732, iMonCtr=1
Model crash detected, will try to restart...
Leaving CPDN_Main::Monitor...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3424, iMonCtr=2
Model crash detected, will try to restart...
CPDN Monitor - Quit request from BOINC...
Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3352, iMonCtr=2
CPDN Monitor - Quit request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=1576, selfPID=5072, iMonCtr=1
Model crash detected, will try to restart...
Leaving CPDN_Main::Monitor...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=5264, selfPID=4420, iMonCtr=1
Model crash detected, will try to restart...
Leaving CPDN_Main::Monitor...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=5276, selfPID=4732, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=4876, selfPID=1064, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=2924, selfPID=4936, iMonCtr=1
Model crash detected, will try to restart...
CPDN Monitor - Quit request from BOINC...
Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=2132, iMonCtr=2
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=6168, selfPID=5828, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=5588, selfPID=5788, iMonCtr=1
Model crash detected, will try to restart...
Leaving CPDN_Main::Monitor...
cpdnmonitor: cannot open input file C:\ProgramData\BOINC/projects/climateprediction.net/hadam3p_eu_xklx_1981_1_007156628/dataout/atmos_restart.day after 11 attempts
cpdnmonitor: cannot open input file C:\ProgramData\BOINC/projects/climateprediction.net/hadam3p_eu_xklx_1981_1_007156628/dataout/region_restart.day after 11 attempts

Model crashed: READHIST: End of file in READ from hist
ry file fashed: READHIST: End o    ile in   AD from history file for namelist NLIHISTO                                                                                                                                              tmp                                          tmp/xaakg.pipe_dummy                                                            2048    
/xaakm.pipe_dummy                                                            2048    
Leaving CPDN_Main::Monitor...
Called boinc_finish

</stderr_txt>
<message>
<file_xfer_error>
  <file_name>hadam3p_eu_xklx_1981_1_007156628_1_7.zip</file_name>
  <error_code>-161</error_code>
</file_xfer_error>
<file_xfer_error>
  <file_name>hadam3p_eu_xklx_1981_1_007156628_1_8.zip</file_name>
  <error_code>-161</error_code>
</file_xfer_error>
<file_xfer_error>
  <file_name>hadam3p_eu_xklx_1981_1_007156628_1_9.zip</file_name>
  <error_code>-161</error_code>
</file_xfer_error>
<file_xfer_error>
  <file_name>hadam3p_eu_xklx_1981_1_007156628_1_10.zip</file_name>
  <error_code>-161</error_code>
</file_xfer_error>
<file_xfer_error>
  <file_name>hadam3p_eu_xklx_1981_1_007156628_1_11.zip</file_name>
  <error_code>-161</error_code>
</file_xfer_error>
<file_xfer_error>
  <file_name>hadam3p_eu_xklx_1981_1_007156628_1_12.zip</file_name>
  <error_code>-161</error_code>
</file_xfer_error>

</message>
]]>
Latest Trickles Received
Time Sent (UTC) Host ID Result ID Result Name Timestep CPU Time (sec) Average (sec/TS)
15 Feb 2012 22:26:54 1149100 13955085 hadam3p_eu_xklx_1981_1_007156628_1 69,216 213,666 3.0869
13 Feb 2012 16:43:43 1149100 13955085 hadam3p_eu_xklx_1981_1_007156628_1 57,701 177,527 3.0767
12 Feb 2012 20:48:58 1149100 13955085 hadam3p_eu_xklx_1981_1_007156628_1 57,696 177,098 3.0695
10 Feb 2012 21:25:08 1149100 13955085 hadam3p_eu_xklx_1981_1_007156628_1 46,176 140,965 3.0528
05 Feb 2012 19:12:53 1149100 13955085 hadam3p_eu_xklx_1981_1_007156628_1 34,657 105,453 3.0428
05 Feb 2012 18:15:15 1149100 13955085 hadam3p_eu_xklx_1981_1_007156628_1 34,656 105,049 3.0312
01 Feb 2012 21:15:04 1149100 13955085 hadam3p_eu_xklx_1981_1_007156628_1 23,137 69,123 2.9876
01 Feb 2012 20:18:49 1149100 13955085 hadam3p_eu_xklx_1981_1_007156628_1 23,136 68,716 2.9701
30 Jan 2012 16:03:09 1149100 13955085 hadam3p_eu_xklx_1981_1_007156628_1 11,616 35,885 3.0893


©2024 cpdn.org