Task 13993900

Name hadam3p_eu_919t_1969_1_007721705_0
Workunit 7876813
Created 26 Jan 2012, 13:27:54 UTC
Sent 13 Feb 2012, 22:30:40 UTC
Report deadline 26 Jan 2013, 3:50:40 UTC
Received 22 Feb 2012, 21:21:40 UTC
Server state Over
Outcome Computation error
Client state Compute error
Exit status 0 (0x00000000)
Computer ID 1177318
Run time 4 days 2 hours 22 min 35 sec
CPU time 3 days 20 hours 37 min 21 sec
Validate state Invalid
Credit 1,790.21
Device peak FLOPS 2.38 GFLOPS
Application version UK Met Office HadAM3P-HadRM3P Europe v6.09 windows_intelx86
Stderr
<core_client_version>6.12.34</core_client_version>
<![CDATA[
<stderr_txt>
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=6496, selfPID=7592, iMonCtr=1
Model crash detected, will try to restart...
Suspended CPDN Monitor - Suspend request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=3268, selfPID=5580, iMonCtr=1
Model crash detected, will try to restart...
Suspended CPDN Monitor - Suspend request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=5256, selfPID=4952, iMonCtr=1
Model crash detected, will try to restart...
Leaving CPDN_Main::Monitor...
10:42:20 (4396): No heartbeat from core client for 30 sec - exiting
10:42:21 (4396): No heartbeat from core client for 30 sec - exiting
10:42:23 (4396): No heartbeat from core client for 30 sec - exiting
10:42:24 (4396): No heartbeat from core client for 30 sec - exiting
10:42:25 (4396): No heartbeat from core client for 30 sec - exiting
10:42:26 (4396): No heartbeat from core client for 30 sec - exiting
10:42:27 (4396): No heartbeat from core client for 30 sec - exiting
10:42:28 (4396): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
CPDN Monitor - Quit request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
CPDN Monitor - Quit request from BOINC...
Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=7232, selfPID=7232, iMonCtr=2
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=6784, selfPID=6784, iMonCtr=2
Suspended CPDN Monitor - Suspend request from BOINC...
Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=2416, iMonCtr=2
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=6536, selfPID=4040, iMonCtr=1
Model crash detected, will try to restart...
Leaving CPDN_Main::Monitor...
Suspended CPDN Monitor - Suspend request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=6024, selfPID=4684, iMonCtr=1
Model crash detected, will try to restart...
CPDN Monitor - Quit request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=1788, selfPID=1788, iMonCtr=2
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
CPDN Monitor - Quit request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=6796, selfPID=3616, iMonCtr=1
Model crash detected, will try to restart...
Leaving CPDN_Main::Monitor...
Called boinc_finish
16:21:47 (3616): No heartbeat from core client for 30 sec - exiting
16:21:48 (3616): No heartbeat from core client for 30 sec - exiting

</stderr_txt>
<message>
upload failure: <file_xfer_error>
  <file_name>hadam3p_eu_919t_1969_1_007721705_0_10.zip</file_name>
  <error_code>-161</error_code>
</file_xfer_error>
<file_xfer_error>
  <file_name>hadam3p_eu_919t_1969_1_007721705_0_11.zip</file_name>
  <error_code>-161</error_code>
</file_xfer_error>
<file_xfer_error>
  <file_name>hadam3p_eu_919t_1969_1_007721705_0_12.zip</file_name>
  <error_code>-161</error_code>
</file_xfer_error>

</message>
]]>
Latest Trickles Received
Time Sent (UTC) | Host ID | Result ID | Result Name | Timestep | CPU Time (sec) | Average (sec/TS)
22 Feb 2012 11:11:00 | 1177318 | 13993900 | hadam3p_eu_919t_1969_1_007721705_0 | 103,776 | 320,128 | 3.0848
20 Feb 2012 21:06:11 | 1177318 | 13993900 | hadam3p_eu_919t_1969_1_007721705_0 | 92,256 | 285,651 | 3.0963
20 Feb 2012 08:37:28 | 1177318 | 13993900 | hadam3p_eu_919t_1969_1_007721705_0 | 80,736 | 247,217 | 3.0620
19 Feb 2012 15:58:21 | 1177318 | 13993900 | hadam3p_eu_919t_1969_1_007721705_0 | 69,216 | 211,903 | 3.0615
18 Feb 2012 22:24:52 | 1177318 | 13993900 | hadam3p_eu_919t_1969_1_007721705_0 | 57,696 | 176,773 | 3.0639
17 Feb 2012 16:59:53 | 1177318 | 13993900 | hadam3p_eu_919t_1969_1_007721705_0 | 46,176 | 141,570 | 3.0659
16 Feb 2012 19:09:30 | 1177318 | 13993900 | hadam3p_eu_919t_1969_1_007721705_0 | 34,656 | 104,854 | 3.0256
15 Feb 2012 22:31:59 | 1177318 | 13993900 | hadam3p_eu_919t_1969_1_007721705_0 | 23,136 | 71,928 | 3.1089
14 Feb 2012 20:23:10 | 1177318 | 13993900 | hadam3p_eu_919t_1969_1_007721705_0 | 11,616 | 35,532 | 3.0589
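
The Average (sec/TS) column in the trickle table above appears to be the cumulative CPU time divided by the timestep count. The following is a minimal Python sketch of that arithmetic, assuming the site rounds to four decimal places. The names TRICKLES and avg_sec_per_timestep are hypothetical and are not part of any CPDN or BOINC tool.

```python
# Rows copied from the trickle table above:
# (timestep, cumulative CPU time in seconds, reported average in sec/TS)
TRICKLES = [
    (103_776, 320_128, 3.0848),
    (92_256, 285_651, 3.0963),
    (80_736, 247_217, 3.0620),
    (69_216, 211_903, 3.0615),
    (57_696, 176_773, 3.0639),
    (46_176, 141_570, 3.0659),
    (34_656, 104_854, 3.0256),
    (23_136, 71_928, 3.1089),
    (11_616, 35_532, 3.0589),
]


def avg_sec_per_timestep(timestep: int, cpu_seconds: int) -> float:
    """Cumulative CPU seconds per model timestep (assumed rounding: 4 places)."""
    return round(cpu_seconds / timestep, 4)


if __name__ == "__main__":
    for timestep, cpu_seconds, reported in TRICKLES:
        computed = avg_sec_per_timestep(timestep, cpu_seconds)
        print(f"{timestep:>7} TS  {cpu_seconds:>7} s  "
              f"computed={computed:.4f}  reported={reported:.4f}")
```

Under that assumption, each computed value should agree with the reported column to within rounding.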

