climateprediction.net home page
Task 19200789

Name hadam3p_anz_h110_200012_12_289_010254855_1
Workunit 10254855
Created 14 Jan 2016, 20:01:21 UTC
Sent 14 Jan 2016, 20:03:18 UTC
Report deadline 27 Dec 2016, 1:23:18 UTC
Received 5 Feb 2016, 11:20:40 UTC
Server state Over
Outcome Computation error
Client state Compute error
Exit status 0 (0x00000000)
Computer ID 1379047
Run time 7 days 14 hours 41 min 26 sec
CPU time 6 days 2 hours 43 min 40 sec
Validate state Invalid
Credit 3,987.46
Device peak FLOPS 2.54 GFLOPS
Application version UK Met Office HadAM3P-HadRM3P Australia New Zealand v6.10 windows_intelx86
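The run time and CPU time listed above can be compared by converting both to seconds. The following is a minimal sketch: the helper name `to_seconds` is hypothetical, and reading the ratio as "fraction of wall-clock time spent on the CPU" is an assumption, not something the task page itself reports.

```python
import re

# Seconds per unit, keyed by the unit stems that appear on the task page.
UNIT_SECONDS = {"day": 86400, "hour": 3600, "min": 60, "sec": 1}

def to_seconds(duration: str) -> int:
    """Convert a string such as '7 days 14 hours 41 min 26 sec' to seconds."""
    total = 0
    for value, unit in re.findall(r"(\d+)\s*(day|hour|min|sec)", duration):
        total += int(value) * UNIT_SECONDS[unit]
    return total

# Values copied from the "Run time" and "CPU time" fields of this task.
run_time = to_seconds("7 days 14 hours 41 min 26 sec")  # 657686
cpu_time = to_seconds("6 days 2 hours 43 min 40 sec")   # 528220

# Assumed interpretation: share of elapsed time that was CPU time.
cpu_fraction = cpu_time / run_time
print(run_time, cpu_time, f"{cpu_fraction:.1%}")
```

On these figures the CPU time is roughly four fifths of the elapsed run time.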
Stderr
<core_client_version>7.6.22</core_client_version>
<![CDATA[
<stderr_txt>
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4552, iMonCtr=2
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=4936, selfPID=4836, iMonCtr=1
Model crash detected, will try to restart...
Leaving CPDN_Main::Monitor...
CPDN Monitor - Quit request from BOINC...
Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=3924, selfPID=3924, iMonCtr=2
Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=6176, iMonCtr=2
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=4176, selfPID=3204, iMonCtr=1
Model crash detected, will try to restart...
CPDN Monitor - Quit request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=5392, selfPID=5708, iMonCtr=1
Model crash detected, will try to restart...
12:15:07 (3104): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
CPDN Monitor - Quit request from BOINC...
Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=4200, selfPID=4200, iMonCtr=2
11:02:33 (4052): No heartbeat from core client for 30 sec - exiting
11:02:34 (4052): No heartbeat from core client for 30 sec - exiting
11:02:35 (4052): No heartbeat from core client for 30 sec - exiting
11:02:36 (4052): No heartbeat from core client for 30 sec - exiting
11:02:37 (4052): No heartbeat from core client for 30 sec - exiting
11:02:38 (4052): No heartbeat from core client for 30 sec - exiting
11:02:39 (4052): No heartbeat from core client for 30 sec - exiting
11:02:40 (4052): No heartbeat from core client for 30 sec - exiting
11:02:41 (4052): No heartbeat from core client for 30 sec - exiting
11:02:43 (4052): No heartbeat from core client for 30 sec - exiting
11:02:44 (4052): No heartbeat from core client for 30 sec - exiting
11:02:45 (4052): No heartbeat from core client for 30 sec - exiting
11:02:46 (4052): No heartbeat from core client for 30 sec - exiting
11:02:47 (4052): No heartbeat from core client for 30 sec - exiting
11:02:48 (4052): No heartbeat from core client for 30 sec - exiting
11:02:49 (4052): No heartbeat from core client for 30 sec - exiting
11:02:50 (4052): No heartbeat from core client for 30 sec - exiting
11:02:51 (4052): No heartbeat from core client for 30 sec - exiting
11:02:52 (4052): No heartbeat from core client for 30 sec - exiting
11:02:54 (4052): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
CPDN Monitor - Quit request from BOINC...
Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=5340, selfPID=5340, iMonCtr=2
CPDN Monitor - Quit request from BOINC...
Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=5700, selfPID=5700, iMonCtr=2
CPDN Monitor - Quit request from BOINC...
GController:: CPDN process is not running, exiting, bRetVal = 1, checkPID=3920, selfPID=4112, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4732, iMonCtr=2
Model crash detected, will try to restart...
Leaving CPDN_Main::Monitor...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=3612, selfPID=4848, iMonCtr=1
Model crash detected, will try to restart...
Leaving CPDN_Main::Monitor...
cpdnmonitor: cannot open input file C:\ProgramData\BOINC/projects/climateprediction.net/hadam3p_anz_h110_200012_12_289_010254855/dataout/atmos_restart.day after 11 attempts
cpdnmonitor: cannot open input file C:\ProgramData\BOINC/projects/climateprediction.net/hadam3p_anz_h110_200012_12_289_010254855/dataout/region_restart.day after 11 attempts

Model crashed: READHIST: End of file in READ from history file for namelist NLIHISTO                                                                                                                                                                                           tmp/xaakm.pipe_dummy                                                            2048    

Model crashed: READHIST: End of file in READ from history file for namelist NLIHISTO                                                                                                                                                                                           tmp/xaakg.pipe_dummy                                                            2048    
Leaving CPDN_Main::Monitor...
Called boinc_finish

</stderr_txt>
<message>
upload failure: <file_xfer_error>
  <file_name>hadam3p_anz_h110_200012_12_289_010254855_1_9.zip</file_name>
  <error_code>-161 (not found)</error_code>
</file_xfer_error>
<file_xfer_error>
  <file_name>hadam3p_anz_h110_200012_12_289_010254855_1_10.zip</file_name>
  <error_code>-161 (not found)</error_code>
</file_xfer_error>
<file_xfer_error>
  <file_name>hadam3p_anz_h110_200012_12_289_010254855_1_11.zip</file_name>
  <error_code>-161 (not found)</error_code>
</file_xfer_error>
<file_xfer_error>
  <file_name>hadam3p_anz_h110_200012_12_289_010254855_1_12.zip</file_name>
  <error_code>-161 (not found)</error_code>
</file_xfer_error>

</message>
]]>
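The upload failures in the message block above can be extracted with a regular expression, since the block starts with free text and holds several sibling elements rather than one well-formed XML document. This is only a sketch: `parse_upload_errors` is a hypothetical helper, and the sample string is copied from the first two entries of the message.

```python
import re

# Excerpt copied from the <message> block of this task (first two entries).
SAMPLE = """upload failure: <file_xfer_error>
  <file_name>hadam3p_anz_h110_200012_12_289_010254855_1_9.zip</file_name>
  <error_code>-161 (not found)</error_code>
</file_xfer_error>
<file_xfer_error>
  <file_name>hadam3p_anz_h110_200012_12_289_010254855_1_10.zip</file_name>
  <error_code>-161 (not found)</error_code>
</file_xfer_error>
"""

# Match each file name together with the numeric code and its text reason.
PATTERN = re.compile(
    r"<file_name>(?P<name>.*?)</file_name>\s*"
    r"<error_code>(?P<code>-?\d+) \((?P<reason>.*?)\)</error_code>"
)

def parse_upload_errors(text):
    """Return (file name, error code, reason) tuples found in the text."""
    return [(m["name"], int(m["code"]), m["reason"]) for m in PATTERN.finditer(text)]

errors = parse_upload_errors(SAMPLE)
for name, code, reason in errors:
    print(f"{name}: {code} ({reason})")
```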
Latest Trickles Received
Time Sent (UTC)       Host ID  Result ID  Result Name                                 Timestep  CPU Time (sec)  Average (sec/TS)
04 Feb 2016 10:38:04  1379047  19200789   hadam3p_anz_h110_200012_12_289_010254855_1  92,459    491,823         5.3194
03 Feb 2016 12:07:39  1379047  19200789   hadam3p_anz_h110_200012_12_289_010254855_1  80,939    430,048         5.3132
01 Feb 2016 07:35:08  1379047  19200789   hadam3p_anz_h110_200012_12_289_010254855_1  69,419    369,205         5.3185
28 Jan 2016 20:27:10  1379047  19200789   hadam3p_anz_h110_200012_12_289_010254855_1  57,899    310,411         5.3612
27 Jan 2016 01:14:24  1379047  19200789   hadam3p_anz_h110_200012_12_289_010254855_1  46,379    247,855         5.3441
23 Jan 2016 17:13:13  1379047  19200789   hadam3p_anz_h110_200012_12_289_010254855_1  34,859    184,667         5.2975
21 Jan 2016 00:58:25  1379047  19200789   hadam3p_anz_h110_200012_12_289_010254855_1  23,339    126,317         5.4123
19 Jan 2016 07:36:11  1379047  19200789   hadam3p_anz_h110_200012_12_289_010254855_1  11,819    64,637          5.4689
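The Average (sec/TS) column in the trickle rows above appears to be the cumulative CPU time divided by the timestep reached. That reading is an assumption inferred from the numbers, not a documented definition. The sketch below recomputes the average for each row and also checks the spacing between consecutive trickles; the function name is hypothetical.

```python
# Rows copied from the trickle rows above: (timestep, cpu_time_sec, reported_average)
TRICKLES = [
    (92459, 491823, 5.3194),
    (80939, 430048, 5.3132),
    (69419, 369205, 5.3185),
    (57899, 310411, 5.3612),
    (46379, 247855, 5.3441),
    (34859, 184667, 5.2975),
    (23339, 126317, 5.4123),
    (11819, 64637, 5.4689),
]

def average_sec_per_timestep(timestep, cpu_time_sec):
    """Assumed definition: cumulative CPU seconds per model timestep."""
    return cpu_time_sec / timestep

for timestep, cpu_time_sec, reported in TRICKLES:
    computed = average_sec_per_timestep(timestep, cpu_time_sec)
    print(f"{timestep:>6}  computed={computed:.4f}  reported={reported:.4f}")

# Timestep gap between each trickle and the one before it (rows are newest first).
gaps = {newer[0] - older[0] for newer, older in zip(TRICKLES, TRICKLES[1:])}
print(gaps)
```

In this data every consecutive pair of trickles is 11,520 timesteps apart, and the recomputed averages agree with the reported column to within rounding.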

