climateprediction.net home page
Task 12290739

Task 12290739

Name hadam3p_saf_1uij_2002_1_007005779_0
Workunit 7209095
Created 24 Nov 2010, 12:28:33 UTC
Sent 24 Jan 2011, 6:59:47 UTC
Report deadline 6 Jan 2012, 12:19:47 UTC
Received 29 Jan 2011, 21:22:46 UTC
Server state Over
Outcome No reply
Client state Compute error
Exit status 0 (0x00000000)
Computer ID 1129983
Run time 1 days 19 hours 1 min 34 sec
CPU time 1 days 18 hours 5 min 38 sec
Validate state Invalid
Credit 1,122.82
Device peak FLOPS 3.65 GFLOPS
Application version UK Met Office HadAM3P-HadRM3P Southern Africa v6.08
windows_intelx86
Stderr
<core_client_version>6.10.58</core_client_version>
<![CDATA[
<stderr_txt>
06:12:07 (5368): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
15:31:05 (3272): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
16:30:02 (984): No heartbeat from core client for 30 sec - exiting
16:30:03 (984): No heartbeat from core client for 30 sec - exiting
16:30:27 (984): No heartbeat from core client for 30 sec - exiting
16:30:28 (984): No heartbeat from core client for 30 sec - exiting
16:30:29 (984): No heartbeat from core client for 30 sec - exiting
16:30:30 (984): No heartbeat from core client for 30 sec - exiting
16:30:31 (984): No heartbeat from core client for 30 sec - exiting
16:30:32 (984): No heartbeat from core client for 30 sec - exiting
16:30:33 (984): No heartbeat from core client for 30 sec - exiting
16:30:34 (984): No heartbeat from core client for 30 sec - exiting
16:30:35 (984): No heartbeat from core client for 30 sec - exiting
16:30:36 (984): No heartbeat from core client for 30 sec - exiting
16:30:37 (984): No heartbeat from core client for 30 sec - exiting
16:30:38 (984): No heartbeat from core client for 30 sec - exiting
16:30:39 (984): No heartbeat from core client for 30 sec - exiting
16:30:40 (984): No heartbeat from core client for 30 sec - exiting
16:30:41 (984): No heartbeat from core client for 30 sec - exiting
16:30:42 (984): No heartbeat from core client for 30 sec - exiting
16:30:43 (984): No heartbeat from core client for 30 sec - exiting
16:30:44 (984): No heartbeat from core client for 30 sec - exiting
16:30:45 (984): No heartbeat from core client for 30 sec - exiting
16:30:46 (984): No heartbeat from core client for 30 sec - exiting
16:30:47 (984): No heartbeat from core client for 30 sec - exiting
16:30:48 (984): No heartbeat from core client for 30 sec - exiting
16:30:49 (984): No heartbeat from core client for 30 sec - exiting
16:30:50 (984): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
CPDN Monitor - Quit request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
15:34:13 (4504): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=3848, selfPID=3848, iMonCtr=2
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=3556, selfPID=3556, iMonCtr=2
CPDN Monitor - Quit request from BOINC...
Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=4492, selfPID=4492, iMonCtr=2
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=5528, selfPID=5528, iMonCtr=2
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=1568, selfPID=1568, iMonCtr=2
CPDN Monitor - Quit request from BOINC...
Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=4300, selfPID=4300, iMonCtr=2
CPDN Monitor - Quit request from BOINC...
10:00:03 (4644): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
Global Worker:: CPDN process is not running, exiting, bRetVal = 0, checkPID=0, selfPID=1904, iMonCtr=1
Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=0, iMonCtr=2
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=3200, selfPID=5816, iMonCtr=1
Model crash detected, will try to restart...
Leaving CPDN_Main::Monitor...
13:47:48 (5816): called boinc_finish

</stderr_txt>
<message>
<file_xfer_error>
  <file_name>hadam3p_saf_1uij_2002_1_007005779_0_7.zip</file_name>
  <error_code>-161</error_code>
</file_xfer_error>
<file_xfer_error>
  <file_name>hadam3p_saf_1uij_2002_1_007005779_0_8.zip</file_name>
  <error_code>-161</error_code>
</file_xfer_error>
<file_xfer_error>
  <file_name>hadam3p_saf_1uij_2002_1_007005779_0_9.zip</file_name>
  <error_code>-161</error_code>
</file_xfer_error>
<file_xfer_error>
  <file_name>hadam3p_saf_1uij_2002_1_007005779_0_10.zip</file_name>
  <error_code>-161</error_code>
</file_xfer_error>
<file_xfer_error>
  <file_name>hadam3p_saf_1uij_2002_1_007005779_0_11.zip</file_name>
  <error_code>-161</error_code>
</file_xfer_error>
<file_xfer_error>
  <file_name>hadam3p_saf_1uij_2002_1_007005779_0_12.zip</file_name>
  <error_code>-161</error_code>
</file_xfer_error>

</message>
]]>
Latest Trickles Received
Time Sent (UTC) Host ID Result ID Result Name Timestep CPU Time (sec) Average (sec/TS)
29 Jan 2011 13:45:54 1129983 12290739 hadam3p_saf_1uij_2002_1_007005779_0 69,216 136,686 1.9748
27 Jan 2011 18:11:53 1129983 12290739 hadam3p_saf_1uij_2002_1_007005779_0 57,699 112,137 1.9435
27 Jan 2011 18:09:24 1129983 12290739 hadam3p_saf_1uij_2002_1_007005779_0 57,696 111,810 1.9379
26 Jan 2011 13:50:21 1129983 12290739 hadam3p_saf_1uij_2002_1_007005779_0 46,176 87,802 1.9015
25 Jan 2011 14:49:29 1129983 12290739 hadam3p_saf_1uij_2002_1_007005779_0 34,656 62,381 1.8000
25 Jan 2011 11:14:57 1129983 12290739 hadam3p_saf_1uij_2002_1_007005779_0 23,136 43,355 1.8739
25 Jan 2011 11:14:57 1129983 12290739 hadam3p_saf_1uij_2002_1_007005779_0 11,616 22,172 1.9087


©2024 climateprediction.net