Task 16769296

Name hadam3p_eu_e6j8_2013_1_008850255_0
Workunit 8996184
Created 8 Jul 2014, 19:20:46 UTC
Sent 19 Jul 2014, 17:25:48 UTC
Report deadline 1 Jul 2015, 22:45:48 UTC
Received 27 Jul 2014, 11:51:17 UTC
Server state Over
Outcome Computation error
Client state Compute error
Exit status 0 (0x00000000)
Computer ID 415805
Run time 2 days 1 hours 0 min 25 sec
CPU time 1 days 20 hours 42 min 7 sec
Validate state Invalid
Credit 1,392.75
Device peak FLOPS 3.24 GFLOPS
Application version UK Met Office HadAM3P-HadRM3P Europe v6.09 windows_intelx86
Stderr
<core_client_version>7.2.42</core_client_version>
<![CDATA[
<stderr_txt>
CPDN Monitor - Quit request from BOINC...
Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=1188, selfPID=1188, iMonCtr=2
Suspended CPDN Monitor - Suspend request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=2488, selfPID=5856, iMonCtr=1
Model crash detected, will try to restart...
Leaving CPDN_Main::Monitor...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=5204, selfPID=5372, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=3900, selfPID=4596, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=4084, selfPID=5936, iMonCtr=1
Model crash detected, will try to restart...
Leaving CPDN_Main::Monitor...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=3900, selfPID=5488, iMonCtr=1
Model crash detected, will try to restart...
Leaving CPDN_Main::Monitor...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=3048, selfPID=4948, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=5872, selfPID=5556, iMonCtr=1
Model crash detected, will try to restart...
Leaving CPDN_Main::Monitor...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=3884, selfPID=6112, iMonCtr=1
Model crash detected, will try to restart...
Leaving CPDN_Main::Monitor...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=1324, selfPID=5668, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=3712, selfPID=5200, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=4996, selfPID=5376, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=6104, selfPID=5916, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=5808, selfPID=5228, iMonCtr=1
Model crash detected, will try to restart...
Leaving CPDN_Main::Monitor...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=5112, selfPID=6124, iMonCtr=1
Model crash detected, will try to restart...
14:14:16 (5256): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3272, iMonCtr=2
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=3140, selfPID=6048, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=1876, selfPID=4144, iMonCtr=1
Model crash detected, will try to restart...
Leaving CPDN_Main::Monitor...
Suspended CPDN Monitor - Suspend request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=1984, selfPID=4944, iMonCtr=1
Model crash detected, will try to restart...
Leaving CPDN_Main::Monitor...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=3736, selfPID=5828, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=700, selfPID=4184, iMonCtr=1
Model crash detected, will try to restart...
Leaving CPDN_Main::Monitor...
CPDN Monitor - Quit request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4424, iMonCtr=2
Model crash detected, will try to restart...
Leaving CPDN_Main::Monitor...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=3296, selfPID=6128, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=5272, selfPID=5784, iMonCtr=1
Model crash detected, will try to restart...
Leaving CPDN_Main::Monitor...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=6032, selfPID=1248, iMonCtr=1
Model crash detected, will try to restart...
Leaving CPDN_Main::Monitor...
cpdnmonitor: cannot open input file C:\ProgramData\BOINC/projects/climateprediction.net/hadam3p_eu_e6j8_2013_1_008850255/dataout/atmos_restart.day after 11 attempts
forrtl: severe (24): end-of-file during read, unit 9, file C:\ProgramData\BOINC\projects\climateprediction.net\hadam3p_eu_e6j8_2013_1_008850255\tmp\xaakm.namelists

Image              PC        Routine            Line        Source             
hadam3p_eu_um_6.0  0131A39A  Unknown               Unknown  Unknown
hadam3p_eu_um_6.0  012C2CD0  Unknown               Unknown  Unknown
hadam3p_eu_um_6.0  012C1E9A  Unknown               Unknown  Unknown
hadam3p_eu_um_6.0  012A2819  Unknown               Unknown  Unknown
hadam3p_eu_um_6.0  011A2287  Unknown               Unknown  Unknown
hadam3p_eu_um_6.0  0123E7B2  Unknown               Unknown  Unknown
hadam3p_eu_um_6.0  0123F2DA  Unknown               Unknown  Unknown
hadam3p_eu_um_6.0  00FB9BD2  Unknown               Unknown  Unknown
hadam3p_eu_um_6.0  012FE638  Unknown               Unknown  Unknown
kernel32.dll       7742EE1C  Unknown               Unknown  Unknown
ntdll.dll          772F37EB  Unknown               Unknown  Unknown
ntdll.dll          772F37BE  Unknown               Unknown  Unknown

Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=4840, selfPID=4988, iMonCtr=1
Model crash detected, will try to restart...
Leaving CPDN_Main::Monitor...
Called boinc_finish

</stderr_txt>
<message>
upload failure: <file_xfer_error>
  <file_name>hadam3p_eu_e6j8_2013_1_008850255_0_8.zip</file_name>
  <error_code>-161 (not found)</error_code>
</file_xfer_error>
<file_xfer_error>
  <file_name>hadam3p_eu_e6j8_2013_1_008850255_0_9.zip</file_name>
  <error_code>-161 (not found)</error_code>
</file_xfer_error>
<file_xfer_error>
  <file_name>hadam3p_eu_e6j8_2013_1_008850255_0_10.zip</file_name>
  <error_code>-161 (not found)</error_code>
</file_xfer_error>
<file_xfer_error>
  <file_name>hadam3p_eu_e6j8_2013_1_008850255_0_11.zip</file_name>
  <error_code>-161 (not found)</error_code>
</file_xfer_error>
<file_xfer_error>
  <file_name>hadam3p_eu_e6j8_2013_1_008850255_0_12.zip</file_name>
  <error_code>-161 (not found)</error_code>
</file_xfer_error>

</message>
]]>
Latest Trickles Received
Time Sent (UTC)       Host ID  Result ID  Result Name                         Timestep  CPU Time (sec)  Average (sec/TS)
26 Jul 2014 13:25:44  415805   16769296   hadam3p_eu_e6j8_2013_1_008850255_0  80,736    141,357         1.7509
25 Jul 2014 18:05:18  415805   16769296   hadam3p_eu_e6j8_2013_1_008850255_0  69,216    121,667         1.7578
25 Jul 2014 06:38:59  415805   16769296   hadam3p_eu_e6j8_2013_1_008850255_0  57,696    101,504         1.7593
23 Jul 2014 18:27:15  415805   16769296   hadam3p_eu_e6j8_2013_1_008850255_0  46,176    82,023          1.7763
23 Jul 2014 06:30:32  415805   16769296   hadam3p_eu_e6j8_2013_1_008850255_0  34,656    61,422          1.7723
22 Jul 2014 12:42:13  415805   16769296   hadam3p_eu_e6j8_2013_1_008850255_0  23,136    41,475          1.7927
21 Jul 2014 09:54:01  415805   16769296   hadam3p_eu_e6j8_2013_1_008850255_0  11,616    20,702          1.7822
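
The Average (sec/TS) column appears to be the cumulative CPU time divided by the timestep count reported in the same trickle. The sketch below checks that reading against the rows above. The function name `seconds_per_timestep` and the tuple layout are illustrative assumptions, not part of the CPDN or BOINC software.

```python
# Rows copied from the trickle table: (timestep, cpu_time_sec, reported_average).
# Assumption: Average (sec/TS) = CPU Time (sec) / Timestep, rounded to 4 decimals.
trickles = [
    (80736, 141357, 1.7509),
    (69216, 121667, 1.7578),
    (57696, 101504, 1.7593),
    (46176, 82023, 1.7763),
    (34656, 61422, 1.7723),
    (23136, 41475, 1.7927),
    (11616, 20702, 1.7822),
]


def seconds_per_timestep(cpu_time_sec: int, timestep: int) -> float:
    """Hypothetical helper: average CPU seconds spent per model timestep."""
    return cpu_time_sec / timestep


for timestep, cpu_time_sec, reported in trickles:
    computed = round(seconds_per_timestep(cpu_time_sec, timestep), 4)
    print(f"{timestep:>6}  {cpu_time_sec:>7}  computed={computed:.4f}  reported={reported:.4f}")
```

Under that assumption every computed value agrees with the reported column to four decimal places.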


©2024 cpdn.org