climateprediction.net home page
Task 12203522

Task 12203522

Name hadam3p_eu_x3ow_1991_1_006923592_0
Workunit 7126908
Created 22 Nov 2010, 10:39:09 UTC
Sent 8 Feb 2011, 22:20:15 UTC
Report deadline 22 Jan 2012, 3:40:15 UTC
Received 20 Feb 2011, 10:45:52 UTC
Server state Over
Outcome No reply
Client state Compute error
Exit status 0 (0x00000000)
Computer ID 1045266
Run time 4 days 4 hours 31 min 26 sec
CPU time 3 days 23 hours 53 min 2 sec
Validate state Invalid
Credit 1,988.99
Device peak FLOPS 2.67 GFLOPS
Application version UK Met Office HadAM3P-HadRM3P Europe v6.08
windows_intelx86
Stderr
<core_client_version>6.10.18</core_client_version>
<![CDATA[
<stderr_txt>
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=5964, selfPID=4580, iMonCtr=1
Model crash detected, will try to restart...
Leaving CPDN_Main::Monitor...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=6080, iMonCtr=2
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4128, iMonCtr=2
Model crash detected, will try to restart...
Leaving CPDN_Main::Monitor...
CPDN Monitor - Quit request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=7924, selfPID=3588, iMonCtr=1
Model crash detected, will try to restart...
Leaving CPDN_Main::Monitor...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=4840, selfPID=4708, iMonCtr=1
Model crash detected, will try to restart...
Leaving CPDN_Main::Monitor...
CPDN Monitor - Quit request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=5400, selfPID=4828, iMonCtr=1
Model crash detected, will try to restart...
Leaving CPDN_Main::Monitor...
07:08:06 (3484): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=2824, selfPID=2824, iMonCtr=2
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=5652, selfPID=5652, iMonCtr=2
Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3532, iMonCtr=2
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=5264, selfPID=3008, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=748, iMonCtr=2
Model crash detected, will try to restart...
Leaving CPDN_Main::Monitor...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3284, iMonCtr=2
Model crash detected, will try to restart...
Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5736, iMonCtr=2
Leaving CPDN_Main::Monitor...
cpdnmonitor: cannot open input file C:\ProgramData\BOINC/projects/climateprediction.net/hadam3p_eu_x3ow_1991_1_006923592/dataout/atmos_restart.day after 11 attempts
cpdnmonitor: cannot open input file C:\ProgramData\BOINC/projects/climateprediction.net/hadam3p_eu_x3ow_1991_1_006923592/dataout/region_restart.day after 11 attempts

Model crashed: READHIST: End of file in READ from history file for namelist NLIHISTO                                                                                                                                                                                           tmp/xaakm.pipe_dummy                                                            2048    

Model crashed: READHIST: End of file in READ from history file for namelist NLIHISTO                                                                                                                                                                                           tmp/xaakg.pipe_dummy                                                            2048    
Leaving CPDN_Main::Monitor...
20:01:37 (5872): called boinc_finish

</stderr_txt>
<message>
<file_xfer_error>
  <file_name>hadam3p_eu_x3ow_1991_1_006923592_0_11.zip</file_name>
  <error_code>-161</error_code>
</file_xfer_error>
<file_xfer_error>
  <file_name>hadam3p_eu_x3ow_1991_1_006923592_0_12.zip</file_name>
  <error_code>-161</error_code>
</file_xfer_error>

</message>
]]>
Latest Trickles Received
Time Sent (UTC) Host ID Result ID Result Name Timestep CPU Time (sec) Average (sec/TS)
18 Feb 2011 07:23:42 1045266 12203522 hadam3p_eu_x3ow_1991_1_006923592_0 115,299 343,379 2.9782
18 Feb 2011 00:13:08 1045266 12203522 hadam3p_eu_x3ow_1991_1_006923592_0 115,296 342,984 2.9748
16 Feb 2011 19:48:32 1045266 12203522 hadam3p_eu_x3ow_1991_1_006923592_0 103,776 308,471 2.9725
16 Feb 2011 07:56:44 1045266 12203522 hadam3p_eu_x3ow_1991_1_006923592_0 92,256 273,684 2.9666
14 Feb 2011 09:18:36 1045266 12203522 hadam3p_eu_x3ow_1991_1_006923592_0 80,736 239,465 2.9660
13 Feb 2011 15:15:41 1045266 12203522 hadam3p_eu_x3ow_1991_1_006923592_0 69,216 205,757 2.9727
12 Feb 2011 16:47:42 1045266 12203522 hadam3p_eu_x3ow_1991_1_006923592_0 57,696 171,771 2.9772
11 Feb 2011 22:54:42 1045266 12203522 hadam3p_eu_x3ow_1991_1_006923592_0 46,176 136,982 2.9665
11 Feb 2011 12:28:29 1045266 12203522 hadam3p_eu_x3ow_1991_1_006923592_0 34,656 102,271 2.9510
10 Feb 2011 15:17:35 1045266 12203522 hadam3p_eu_x3ow_1991_1_006923592_0 23,136 67,888 2.9343
09 Feb 2011 16:56:00 1045266 12203522 hadam3p_eu_x3ow_1991_1_006923592_0 11,616 33,394 2.8748


©2024 cpdn.org