climateprediction.net home page
Task 16531546

Task 16531546

Name hadam3p_eu_j2nc_2013_1_008687708_0
Workunit 8822182
Created 17 Apr 2014, 15:25:44 UTC
Sent 19 Apr 2014, 22:33:42 UTC
Report deadline 2 Apr 2015, 3:53:42 UTC
Received 5 May 2014, 5:52:25 UTC
Server state Over
Outcome Computation error
Client state Compute error
Exit status 0 (0x00000000)
Computer ID 1314952
Run time 1 days 13 hours 37 min 19 sec
CPU time 1 days 11 hours 56 min 52 sec
Validate state Invalid
Credit 1,591.55
Device peak FLOPS 3.24 GFLOPS
Application version UK Met Office HadAM3P-HadRM3P Europe v6.09
windows_intelx86
Stderr
<core_client_version>7.2.42</core_client_version>
<![CDATA[
<stderr_txt>
22:25:37 (6848): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
22:25:38 (6848): No heartbeat from core client for 30 sec - exiting
22:25:39 (6848): No heartbeat from core client for 30 sec - exiting
22:25:40 (6848): No heartbeat from core client for 30 sec - exiting
22:25:41 (6848): No heartbeat from core client for 30 sec - exiting
22:25:42 (6848): No heartbeat from core client for 30 sec - exiting
22:25:43 (6848): No heartbeat from core client for 30 sec - exiting
22:25:44 (6848): No heartbeat from core client for 30 sec - exiting
22:25:45 (6848): No heartbeat from core client for 30 sec - exiting
22:25:46 (6848): No heartbeat from core client for 30 sec - exiting
22:25:47 (6848): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=4984, selfPID=13636, iMonCtr=1
Model crash detected, will try to restart...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
21:14:52 (4108): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
21:16:22 (7828): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
21:17:20 (12108): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
21:19:12 (8380): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=7576, selfPID=7576, iMonCtr=2
21:23:45 (14068): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
21:34:26 (1900): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
21:48:01 (10608): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
21:48:06 (10608): No heartbeat from core client for 30 sec - exiting
21:48:07 (10608): No heartbeat from core client for 30 sec - exiting
21:48:08 (10608): No heartbeat from core client for 30 sec - exiting
21:48:09 (10608): No heartbeat from core client for 30 sec - exiting
21:48:10 (10608): No heartbeat from core client for 30 sec - exiting
21:48:11 (10608): No heartbeat from core client for 30 sec - exiting
21:48:12 (10608): No heartbeat from core client for 30 sec - exiting
21:48:13 (10608): No heartbeat from core client for 30 sec - exiting
21:48:14 (10608): No heartbeat from core client for 30 sec - exiting
22:06:18 (13556): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
22:18:20 (11692): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=9612, selfPID=9612, iMonCtr=2
22:28:59 (10556): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
22:39:52 (8420): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
22:47:38 (4796): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
23:00:33 (7536): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
23:05:00 (12976): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
23:11:02 (14864): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
23:12:02 (11288): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
23:23:10 (11908): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=15056, selfPID=15056, iMonCtr=2
23:30:04 (10216): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=14828, selfPID=14828, iMonCtr=2
23:47:45 (14404): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=12760, selfPID=12760, iMonCtr=2
23:56:45 (14752): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
00:06:35 (1236): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
00:21:50 (15564): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
CPDN Monitor - Quit request from BOINC...
Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=17188, selfPID=17188, iMonCtr=2

Model crashed: READHIST: End of file in READ from history file for namelist NLIHISTO                                                                                                                                                                                           tmp/xaakm.pipe_dummy                                                            2048    
Leaving CPDN_Main::Monitor...
Called boinc_finish

</stderr_txt>
<message>
upload failure: <file_xfer_error>
  <file_name>hadam3p_eu_j2nc_2013_1_008687708_0_9.zip</file_name>
  <error_code>-161 (not found)</error_code>
</file_xfer_error>
<file_xfer_error>
  <file_name>hadam3p_eu_j2nc_2013_1_008687708_0_10.zip</file_name>
  <error_code>-161 (not found)</error_code>
</file_xfer_error>
<file_xfer_error>
  <file_name>hadam3p_eu_j2nc_2013_1_008687708_0_11.zip</file_name>
  <error_code>-161 (not found)</error_code>
</file_xfer_error>
<file_xfer_error>
  <file_name>hadam3p_eu_j2nc_2013_1_008687708_0_12.zip</file_name>
  <error_code>-161 (not found)</error_code>
</file_xfer_error>

</message>
]]>
Latest Trickles Received
Time Sent (UTC) Host ID Result ID Result Name Timestep CPU Time (sec) Average (sec/TS)
05 May 2014 04:53:01 1314952 16531546 hadam3p_eu_j2nc_2013_1_008687708_0 92,260 126,552 1.3717
05 May 2014 03:27:46 1314952 16531546 hadam3p_eu_j2nc_2013_1_008687708_0 92,256 126,356 1.3696
05 May 2014 01:22:21 1314952 16531546 hadam3p_eu_j2nc_2013_1_008687708_0 80,736 111,476 1.3807
05 May 2014 01:22:21 1314952 16531546 hadam3p_eu_j2nc_2013_1_008687708_0 69,216 96,706 1.3972
05 May 2014 01:22:21 1314952 16531546 hadam3p_eu_j2nc_2013_1_008687708_0 57,696 81,975 1.4208
05 May 2014 01:22:21 1314952 16531546 hadam3p_eu_j2nc_2013_1_008687708_0 46,176 67,202 1.4553
05 May 2014 01:22:21 1314952 16531546 hadam3p_eu_j2nc_2013_1_008687708_0 34,656 52,242 1.5074
05 May 2014 01:22:21 1314952 16531546 hadam3p_eu_j2nc_2013_1_008687708_0 23,136 37,125 1.6046
05 May 2014 01:22:21 1314952 16531546 hadam3p_eu_j2nc_2013_1_008687708_0 11,616 20,016 1.7231


©2024 climateprediction.net