Task 17550516

Name hadam3p_anz_d681_2012_1_009262875_1
Workunit 9355791
Created 4 Dec 2014, 16:01:20 UTC
Sent 4 Dec 2014, 16:46:51 UTC
Report deadline 16 Nov 2015, 22:06:51 UTC
Received 12 Mar 2015, 10:02:44 UTC
Server state Over
Outcome Computation error
Client state Compute error
Exit status 0 (0x00000000)
Computer ID 1241522
Run time 5 days 11 hours 37 min 23 sec
CPU time 5 days 5 hours 50 min 45 sec
Validate state Invalid
Credit 3,490.64
Device peak FLOPS 3.32 GFLOPS
Application version UK Met Office HadAM3P-HadRM3P Australia New Zealand v6.10
windows_intelx86
Stderr
<core_client_version>7.4.36</core_client_version>
<![CDATA[
<stderr_txt>
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=4916, selfPID=4836, iMonCtr=1
Model crash detected, will try to restart...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=6080, selfPID=5064, iMonCtr=1
Model crash detected, will try to restart...
Suspended CPDN Monitor - Suspend request from BOINC...
17:49:38 (4924): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
18:25:21 (1096): No heartbeat from core client for 30 sec - exiting
18:25:22 (1096): No heartbeat from core client for 30 sec - exiting
18:25:23 (1096): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
09:01:20 (4984): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
CPDN Monitor - Quit request from BOINC...
14:15:39 (3892): No heartbeat from core client for 30 sec - exiting
14:15:40 (3892): No heartbeat from core client for 30 sec - exiting
14:15:41 (3892): No heartbeat from core client for 30 sec - exiting
14:15:42 (3892): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
CPDN Monitor - Quit request from BOINC...
Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=2916, iMonCtr=2
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=5256, selfPID=4276, iMonCtr=1
Model crash detected, will try to restart...
Suspended CPDN Monitor - Suspend request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=5200, selfPID=3944, iMonCtr=1
Model crash detected, will try to restart...
GController:: CPDN process is not running, exiting, bRetVal = 1, checkPID=5016, selfPID=5576, iMonCtr=1
Model crash detected, will try to restart...
Suspended CPDN Monitor - Suspend request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=1476, iMonCtr=2
Model crash detected, will try to restart...
CPDN Monitor - Quit request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
09:44:19 (3816): No heartbeat from core client for 30 sec - exiting
09:44:20 (3816): No heartbeat from core client for 30 sec - exiting
09:44:21 (3816): No heartbeat from core client for 30 sec - exiting
09:44:23 (3816): No heartbeat from core client for 30 sec - exiting
09:44:24 (3816): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
09:23:55 (4932): No heartbeat from core client for 30 sec - exiting
09:23:56 (4932): No heartbeat from core client for 30 sec - exiting
09:23:57 (4932): No heartbeat from core client for 30 sec - exiting
09:23:58 (4932): No heartbeat from core client for 30 sec - exiting
09:23:59 (4932): No heartbeat from core client for 30 sec - exiting
09:24:00 (4932): No heartbeat from core client for 30 sec - exiting
09:24:01 (4932): No heartbeat from core client for 30 sec - exiting
09:24:03 (4932): No heartbeat from core client for 30 sec - exiting
09:24:04 (4932): No heartbeat from core client for 30 sec - exiting
09:24:05 (4932): No heartbeat from core client for 30 sec - exiting
09:24:06 (4932): No heartbeat from core client for 30 sec - exiting
09:24:07 (4932): No heartbeat from core client for 30 sec - exiting
09:24:08 (4932): No heartbeat from core client for 30 sec - exiting
09:24:09 (4932): No heartbeat from core client for 30 sec - exiting
09:24:10 (4932): No heartbeat from core client for 30 sec - exiting
09:24:11 (4932): No heartbeat from core client for 30 sec - exiting
09:24:12 (4932): No heartbeat from core client for 30 sec - exiting
09:24:13 (4932): No heartbeat from core client for 30 sec - exiting
09:24:15 (4932): No heartbeat from core client for 30 sec - exiting
09:24:16 (4932): No heartbeat from core client for 30 sec - exiting
09:24:17 (4932): No heartbeat from core client for 30 sec - exiting
09:24:18 (4932): No heartbeat from core client for 30 sec - exiting
09:24:19 (4932): No heartbeat from core client for 30 sec - exiting
09:24:20 (4932): No heartbeat from core client for 30 sec - exiting
09:24:21 (4932): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
Global Worker:: CPDN process is not running, exiting, bRetVal = 0, checkPID=0, selfPID=0, iMonCtr=1
Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=0, iMonCtr=2
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=2436, selfPID=4220, iMonCtr=1
Model crash detected, will try to restart...
Leaving CPDN_Main::Monitor...
Called boinc_finish

</stderr_txt>
<message>
upload failure: <file_xfer_error>
  <file_name>hadam3p_anz_d681_2012_1_009262875_1_8.zip</file_name>
  <error_code>-161 (not found)</error_code>
</file_xfer_error>
<file_xfer_error>
  <file_name>hadam3p_anz_d681_2012_1_009262875_1_9.zip</file_name>
  <error_code>-161 (not found)</error_code>
</file_xfer_error>
<file_xfer_error>
  <file_name>hadam3p_anz_d681_2012_1_009262875_1_10.zip</file_name>
  <error_code>-161 (not found)</error_code>
</file_xfer_error>
<file_xfer_error>
  <file_name>hadam3p_anz_d681_2012_1_009262875_1_11.zip</file_name>
  <error_code>-161 (not found)</error_code>
</file_xfer_error>
<file_xfer_error>
  <file_name>hadam3p_anz_d681_2012_1_009262875_1_12.zip</file_name>
  <error_code>-161 (not found)</error_code>
</file_xfer_error>

</message>
]]>
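The stderr output above is dominated by a handful of recurring message types. A minimal sketch of how such a dump could be tallied is shown here; the category labels and the `tally` helper are illustrative names, not part of BOINC or the CPDN client, and the sample is only a short excerpt of the log.

```python
import re
from collections import Counter

# Hypothetical category labels mapped to substrings seen in the stderr above.
PATTERNS = {
    "suspend": re.compile(r"Suspend request from BOINC"),
    "no_heartbeat": re.compile(r"No heartbeat from core client"),
    "quit": re.compile(r"Quit request from BOINC"),
    "model_crash": re.compile(r"Model crash detected"),
}

def tally(lines):
    """Count how many lines fall into each category (first match wins)."""
    counts = Counter()
    for line in lines:
        for label, pattern in PATTERNS.items():
            if pattern.search(line):
                counts[label] += 1
                break
    return counts

# Short excerpt copied from the log above.
sample = """\
Suspended CPDN Monitor - Suspend request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=4916, selfPID=4836, iMonCtr=1
Model crash detected, will try to restart...
17:49:38 (4924): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
CPDN Monitor - Quit request from BOINC...
""".splitlines()

counts = tally(sample)
for label, n in sorted(counts.items()):
    print(label, n)
```

Lines that match none of the patterns (for example the `Controller::` lines) are simply not counted in this sketch.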
Latest Trickles Received
Time Sent (UTC)       Host ID  Result ID  Result Name                          Timestep  CPU Time (sec)  Average (sec/TS)
03 Mar 2015 14:06:38  1241522  17550516   hadam3p_anz_d681_2012_1_009262875_1  80,939    411,380         5.0826
24 Feb 2015 10:04:36  1241522  17550516   hadam3p_anz_d681_2012_1_009262875_1  69,419    353,659         5.0946
16 Feb 2015 16:47:32  1241522  17550516   hadam3p_anz_d681_2012_1_009262875_1  57,899    297,182         5.1328
12 Feb 2015 11:27:44  1241522  17550516   hadam3p_anz_d681_2012_1_009262875_1  46,379    237,782         5.1269
03 Feb 2015 09:00:21  1241522  17550516   hadam3p_anz_d681_2012_1_009262875_1  34,859    176,159         5.0535
13 Jan 2015 16:53:41  1241522  17550516   hadam3p_anz_d681_2012_1_009262875_1  23,339    117,002         5.0132
12 Dec 2014 16:10:52  1241522  17550516   hadam3p_anz_d681_2012_1_009262875_1  11,819    60,013          5.0777
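
The Average (sec/TS) column appears to be the cumulative CPU time divided by the timestep count. The sketch below checks that reading against the rows above; the function name `sec_per_timestep` is an illustrative choice, and the interpretation of the column is an assumption inferred from the numbers, not taken from CPDN documentation.

```python
# (timestep, cpu_time_sec, reported_average) copied from the trickle table above.
trickles = [
    (80939, 411380, 5.0826),
    (69419, 353659, 5.0946),
    (57899, 297182, 5.1328),
    (46379, 237782, 5.1269),
    (34859, 176159, 5.0535),
    (23339, 117002, 5.0132),
    (11819, 60013, 5.0777),
]

def sec_per_timestep(cpu_time_sec, timestep):
    """Assumed definition: cumulative CPU seconds per model timestep."""
    return cpu_time_sec / timestep

# Compare the computed average with the value reported in the table.
rows = []
for timestep, cpu_time_sec, reported in trickles:
    computed = sec_per_timestep(cpu_time_sec, timestep)
    rows.append((timestep, computed, reported))
    print(f"{timestep:>6}  computed={computed:.4f}  reported={reported:.4f}")

# Spacing between consecutive trickles, newest first.
gaps = [a[0] - b[0] for a, b in zip(trickles, trickles[1:])]
```

In this data the gap between consecutive trickles is the same for every pair of rows, 11,520 timesteps.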