Task 16529693

Name hadam3p_eu_f2rk_2013_1_008685890_0
Workunit 8820364
Created 17 Apr 2014, 15:20:48 UTC
Sent 21 Apr 2014, 4:44:03 UTC
Report deadline 3 Apr 2015, 10:04:03 UTC
Received 30 May 2014, 21:25:17 UTC
Server state Over
Outcome Computation error
Client state Compute error
Exit status 0 (0x00000000)
Computer ID 1281494
Run time 2 days 23 hours 45 min 43 sec
CPU time 2 days 17 hours 47 min 33 sec
Validate state Invalid
Credit 1,392.75
Device peak FLOPS 3.14 GFLOPS
Application version UK Met Office HadAM3P-HadRM3P Europe v6.09 (windows_intelx86)
Stderr
<core_client_version>7.2.33</core_client_version>
<![CDATA[
<stderr_txt>
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=7740, selfPID=6964, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=6672, selfPID=5116, iMonCtr=1
Model crash detected, will try to restart...
Leaving CPDN_Main::Monitor...
17:48:31 (5532): No heartbeat from core client for 30 sec - exiting
17:48:33 (5532): No heartbeat from core client for 30 sec - exiting
17:48:34 (5532): No heartbeat from core client for 30 sec - exiting
17:48:35 (5532): No heartbeat from core client for 30 sec - exiting
17:48:36 (5532): No heartbeat from core client for 30 sec - exiting
17:48:37 (5532): No heartbeat from core client for 30 sec - exiting
17:48:38 (5532): No heartbeat from core client for 30 sec - exiting
17:48:39 (5532): No heartbeat from core client for 30 sec - exiting
17:48:40 (5532): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
17:50:49 (5616): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
13:41:53 (5444): No heartbeat from core client for 30 sec - exiting
13:41:54 (5444): No heartbeat from core client for 30 sec - exiting
13:41:55 (5444): No heartbeat from core client for 30 sec - exiting
13:41:56 (5444): No heartbeat from core client for 30 sec - exiting
13:41:57 (5444): No heartbeat from core client for 30 sec - exiting
13:41:58 (5444): No heartbeat from core client for 30 sec - exiting
13:41:59 (5444): No heartbeat from core client for 30 sec - exiting
13:42:00 (5444): No heartbeat from core client for 30 sec - exiting
13:42:01 (5444): No heartbeat from core client for 30 sec - exiting
13:42:02 (5444): No heartbeat from core client for 30 sec - exiting
13:42:03 (5444): No heartbeat from core client for 30 sec - exiting
13:42:04 (5444): No heartbeat from core client for 30 sec - exiting
13:42:05 (5444): No heartbeat from core client for 30 sec - exiting
13:42:06 (5444): No heartbeat from core client for 30 sec - exiting
13:42:07 (5444): No heartbeat from core client for 30 sec - exiting
13:42:08 (5444): No heartbeat from core client for 30 sec - exiting
13:42:09 (5444): No heartbeat from core client for 30 sec - exiting
13:42:10 (5444): No heartbeat from core client for 30 sec - exiting
13:42:11 (5444): No heartbeat from core client for 30 sec - exiting
13:42:12 (5444): No heartbeat from core client for 30 sec - exiting
13:42:13 (5444): No heartbeat from core client for 30 sec - exiting
13:42:14 (5444): No heartbeat from core client for 30 sec - exiting
13:42:15 (5444): No heartbeat from core client for 30 sec - exiting
13:42:16 (5444): No heartbeat from core client for 30 sec - exiting
13:42:17 (5444): No heartbeat from core client for 30 sec - exiting
13:42:18 (5444): No heartbeat from core client for 30 sec - exiting
13:42:19 (5444): No heartbeat from core client for 30 sec - exiting
13:42:20 (5444): No heartbeat from core client for 30 sec - exiting
13:42:21 (5444): No heartbeat from core client for 30 sec - exiting
13:42:22 (5444): No heartbeat from core client for 30 sec - exiting
13:42:23 (5444): No heartbeat from core client for 30 sec - exiting
13:42:24 (5444): No heartbeat from core client for 30 sec - exiting
13:42:25 (5444): No heartbeat from core client for 30 sec - exiting
13:42:26 (5444): No heartbeat from core client for 30 sec - exiting
13:42:27 (5444): No heartbeat from core client for 30 sec - exiting
13:42:28 (5444): No heartbeat from core client for 30 sec - exiting
13:42:29 (5444): No heartbeat from core client for 30 sec - exiting
13:42:30 (5444): No heartbeat from core client for 30 sec - exiting
13:42:31 (5444): No heartbeat from core client for 30 sec - exiting
13:42:32 (5444): No heartbeat from core client for 30 sec - exiting
13:42:33 (5444): No heartbeat from core client for 30 sec - exiting
13:42:34 (5444): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
CPDN Monitor - Quit request from BOINC...
Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=8360, iMonCtr=2
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=8408, selfPID=5184, iMonCtr=1
Model crash detected, will try to restart...
CPDN Monitor - Quit request from BOINC...
Regional Worker:: CPDN process is not running, exiting, bR19:30:32 (7140): No heartbeat from core client for 30 sec - exiting
19:30:33 (7140): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
20:52:07 (9092): No heartbeat from core client for 30 sec - exiting
20:52:08 (9092): No heartbeat from core client for 30 sec - exiting
20:52:09 (9092): No heartbeat from core client for 30 sec - exiting
20:52:10 (9092): No heartbeat from core client for 30 sec - exiting
20:52:11 (9092): No heartbeat from core client for 30 sec - exiting
20:52:12 (9092): No heartbeat from core client for 30 sec - exiting
20:52:13 (9092): No heartbeat from core client for 30 sec - exiting
20:52:14 (9092): No heartbeat from core client for 30 sec - exiting
20:52:15 (9092): No heartbeat from core client for 30 sec - exiting
20:52:16 (9092): No heartbeat from core client for 30 sec - exiting
20:52:17 (9092): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
CPDN Monitor - Quit request from BOINC...
20:05:48 (8692): No heartbeat from core client for 30 sec - exiting
20:05:49 (8692): No heartbeat from core client for 30 sec - exiting
20:05:51 (8692): No heartbeat from core client for 30 sec - exiting
20:05:52 (8692): No heartbeat from core client for 30 sec - exiting
20:05:53 (8692): No heartbeat from core client for 30 sec - exiting
20:05:54 (8692): No heartbeat from core client for 30 sec - exiting
20:05:55 (8692): No heartbeat from core client for 30 sec - exiting
20:05:56 (8692): No heartbeat from core client for 30 sec - exiting
20:05:57 (8692): No heartbeat from core client for 30 sec - exiting
20:05:58 (8692): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
CPDN Monitor - Quit request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=5044, selfPID=3064, iMonCtr=1
Model crash detected, will try to restart...
17:19:41 (6004): No heartbeat from core client for 30 sec - exiting
17:19:43 (6004): No heartbeat from core client for 30 sec - exiting
17:19:44 (6004): No heartbeat from core client for 30 sec - exiting
17:19:45 (6004): No heartbeat from core client for 30 sec - exiting
17:19:46 (6004): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
CPDN Monitor - Quit request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=2444, selfPID=6928, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=1308, selfPID=2532, iMonCtr=1
Model crash detected, will try to restart...
Leaving CPDN_Main::Monitor...
17:22:53 (6140): No heartbeat from core client for 30 sec - exiting
17:22:54 (6140): No heartbeat from core client for 30 sec - exiting
17:22:55 (6140): No heartbeat from core client for 30 sec - exiting
17:22:56 (6140): No heartbeat from core client for 30 sec - exiting
17:22:57 (6140): No heartbeat from core client for 30 sec - exiting
17:22:58 (6140): No heartbeat from core client for 30 sec - exiting
17:22:59 (6140): No heartbeat from core client for 30 sec - exiting
17:23:00 (6140): No heartbeat from core client for 30 sec - exiting
17:23:01 (6140): No heartbeat from core client for 30 sec - exiting
17:23:02 (6140): No heartbeat from core client for 30 sec - exiting
17:23:03 (6140): No heartbeat from core client for 30 sec - exiting
17:23:04 (6140): No heartbeat from core client for 30 sec - exiting
17:23:05 (6140): No heartbeat from core client for 30 sec - exiting
17:23:06 (6140): No heartbeat from core client for 30 sec - exiting
17:23:07 (6140): No heartbeat from core client for 30 sec - exiting
17:23:08 (6140): No heartbeat from core client for 30 sec - exiting
17:23:09 (6140): No heartbeat from core client for 30 sec - exiting
17:23:10 (6140): No heartbeat from core client for 30 sec - exiting
17:23:11 (6140): No heartbeat from core client for 30 sec - exiting
17:23:12 (6140): No heartbeat from core client for 30 sec - exiting
17:23:13 (6140): No heartbeat from core client for 30 sec - exiting
17:23:14 (6140): No heartbeat from core client for 30 sec - exiting
17:23:15 (6140): No heartbeat from core client for 30 sec - exiting
17:23:16 (6140): No heartbeat from core client for 30 sec - exiting
17:23:17 (6140): No heartbeat from core client for 30 sec - exiting
17:23:18 (6140): No heartbeat from core client for 30 sec - exiting
17:23:20 (6140): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
CPDN Monitor - Quit request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=3652, selfPID=5916, iMonCtr=1
Model crash detected, will try to restart...
Leaving CPDN_Main::Monitor...
CPDN Monitor - Quit request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=2352, selfPID=2340, iMonCtr=1
Model crash detected, will try to restart...
Leaving CPDN_Main::Monitor...
17:22:30 (5480): No heartbeat from core client for 30 sec - exiting
cpdnmonitor: cannot open input file C:\ProgramData\BOINC/projects/climateprediction.net/hadam3p_eu_f2rk_2013_1_008685890/dataout/atmos_restart.day after 11 attempts
17:22:31 (5480): No heartbeat from core client for 30 sec - exiting
cpdnmonitor: cannot open input file C:\ProgramData\BOINC/projects/climateprediction.net/hadam3p_eu_f2rk_2013_1_008685890/dataout/region_restart.day after 11 attempts
17:22:32 (5480): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...

Model crashed: READHIST: End of file in READ from history file for namelist NLIHISTO                                                                                                                                                                                           tmp/xaakg.pipe_dummy                                                            2048    
cpdnmonitor: cannot open input file C:\ProgramData\BOINC/projects/climateprediction.net/hadam3p_eu_f2rk_2013_1_008685890/dataout/atmos_restart.day after 11 attempts
cpdnmonitor: cannot open input file C:\ProgramData\BOINC/projects/climateprediction.net/hadam3p_eu_f2rk_2013_1_008685890/dataout/region_restart.day after 11 attempts

Model crashed: READHIST: End of file in READ from history file for namelist NLIHISTO                                                                        
  odel c           d: RE  ADHIS       of file    in READ fro m history file fo  r  namelist NLIHISTO              tmp/xaakg.pipe                                                                     48    
                                                                                      tmp/xaakm.pipe_dummy                                                            2048    
Leaving CPDN_Main::Monitor...
Called boinc_finish

</stderr_txt>
<message>
upload failure: <file_xfer_error>
  <file_name>hadam3p_eu_f2rk_2013_1_008685890_0_8.zip</file_name>
  <error_code>-161</error_code>
</file_xfer_error>
<file_xfer_error>
  <file_name>hadam3p_eu_f2rk_2013_1_008685890_0_9.zip</file_name>
  <error_code>-161</error_code>
</file_xfer_error>
<file_xfer_error>
  <file_name>hadam3p_eu_f2rk_2013_1_008685890_0_10.zip</file_name>
  <error_code>-161</error_code>
</file_xfer_error>
<file_xfer_error>
  <file_name>hadam3p_eu_f2rk_2013_1_008685890_0_11.zip</file_name>
  <error_code>-161</error_code>
</file_xfer_error>
<file_xfer_error>
  <file_name>hadam3p_eu_f2rk_2013_1_008685890_0_12.zip</file_name>
  <error_code>-161</error_code>
</file_xfer_error>

</message>
]]>
Latest Trickles Received
Time Sent (UTC) | Host ID | Result ID | Result Name | Timestep | CPU Time (sec) | Average (sec/TS)
24 May 2014 04:25:39 | 1281494 | 16529693 | hadam3p_eu_f2rk_2013_1_008685890_0 | 80,736 | 208,408 | 2.5814
19 May 2014 23:54:45 | 1281494 | 16529693 | hadam3p_eu_f2rk_2013_1_008685890_0 | 69,216 | 178,633 | 2.5808
11 May 2014 01:39:48 | 1281494 | 16529693 | hadam3p_eu_f2rk_2013_1_008685890_0 | 57,696 | 148,584 | 2.5753
05 May 2014 02:02:29 | 1281494 | 16529693 | hadam3p_eu_f2rk_2013_1_008685890_0 | 46,176 | 118,592 | 2.5683
03 May 2014 02:51:16 | 1281494 | 16529693 | hadam3p_eu_f2rk_2013_1_008685890_0 | 34,656 | 88,701 | 2.5595
28 Apr 2014 23:03:54 | 1281494 | 16529693 | hadam3p_eu_f2rk_2013_1_008685890_0 | 23,136 | 59,089 | 2.5540
27 Apr 2014 00:32:02 | 1281494 | 16529693 | hadam3p_eu_f2rk_2013_1_008685890_0 | 11,616 | 29,931 | 2.5767
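
The Average (sec/TS) column in the trickle table above appears to be the cumulative CPU time divided by the timestep reached. The sketch below checks that reading against the seven rows. The function name `average_sec_per_timestep` and the `trickles` list are illustrative and not part of the project's software; the numbers are copied from the table.

```python
# Minimal sketch: recompute "Average (sec/TS)" from the trickle rows.
# Assumption: the average is cumulative CPU seconds / timestep; this is
# inferred from the reported figures, not taken from project documentation.

# (timestep, cumulative CPU time in seconds, reported average sec/TS)
trickles = [
    (80736, 208408, 2.5814),
    (69216, 178633, 2.5808),
    (57696, 148584, 2.5753),
    (46176, 118592, 2.5683),
    (34656, 88701, 2.5595),
    (23136, 59089, 2.5540),
    (11616, 29931, 2.5767),
]


def average_sec_per_timestep(cpu_seconds: float, timestep: int) -> float:
    """Return the mean CPU seconds spent per model timestep."""
    return cpu_seconds / timestep


for timestep, cpu_seconds, reported in trickles:
    computed = average_sec_per_timestep(cpu_seconds, timestep)
    print(f"{timestep:>6}  {cpu_seconds:>7}  "
          f"computed={computed:.4f}  reported={reported:.4f}")
```

Under that assumption, each computed value agrees with the reported one to four decimal places.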

