Task 12584889

Name hadam3p_eu_xo6h_2000_1_006993345_1
Workunit 7196661
Created 16 Feb 2011, 8:47:05 UTC
Sent 16 Feb 2011, 8:53:37 UTC
Report deadline 29 Jan 2012, 14:13:37 UTC
Received 26 Mar 2011, 2:35:31 UTC
Server state Over
Outcome No reply
Client state Compute error
Exit status 0 (0x00000000)
Computer ID 1132662
Run time 3 days 19 hours 58 min 48 sec
CPU time 3 days 15 hours 27 min 24 sec
Validate state Invalid
Credit 1,790.21
Device peak FLOPS 2.62 GFLOPS
Application version UK Met Office HadAM3P-HadRM3P Europe v6.08 (windows_intelx86)
Stderr
<core_client_version>6.10.58</core_client_version>
<![CDATA[
<stderr_txt>
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=4472, selfPID=4472, iMonCtr=2
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
Regional Worker:: CPDN process isCPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=1020, selfPID=1020, iMonCtr=2
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=2020, selfPID=2020, iMonCtr=2
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=4140, selfPID=3480, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=5176, selfPID=4668, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=5048, selfPID=3656, iMonCtr=1
Model crash detected, will try to restart...
CPDN Monitor - Quit request from BOINC...
Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=4852, selfPID=4852, iMonCtr=2
GController:: CPDN process is not running, exiting, bRetVal = 1, checkPID=1176, selfPID=4448, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=2060, selfPID=5020, iMonCtr=1
Model crash detected, will try to restart...
Leaving CPDN_Main::Monitor...
GController:: CPDN process is not running, exiting, bRetVal = 1, checkPID=4792, selfPID=4924, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=6528, selfPID=3200, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=5544, selfPID=4920, iMonCtr=1
Model crash detected, will try to restart...
CPDN Monitor - Quit request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=6088, selfPID=5956, iMonCtr=1
Model crash detected, will try to restart...
23:04:02 (3740): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
10:41:17 (4540): No heartbeat from core client for 30 sec - exiting
10:41:18 (4540): No heartbeat from core client for 30 sec - exiting
10:41:19 (4540): No heartbeat from core client for 30 sec - exiting
10:41:20 (4540): No heartbeat from core client for 30 sec - exiting
10:41:21 (4540): No heartbeat from core client for 30 sec - exiting
10:41:22 (4540): No heartbeat from core client for 30 sec - exiting
10:41:23 (4540): No heartbeat from core client for 30 sec - exiting
10:41:24 (4540): No heartbeat from core client for 30 sec - exiting
10:41:25 (4540): No heartbeat from core client for 30 sec - exiting
10:41:26 (4540): No heartbeat from core client for 30 sec - exiting
10:41:27 (4540): No heartbeat from core client for 30 sec - exiting
10:41:28 (4540): No heartbeat from core client for 30 sec - exiting
10:41:29 (4540): No heartbeat from core client for 30 sec - exiting
10:41:30 (4540): No heartbeat from core client for 30 sec - exiting
10:41:31 (4540): No heartbeat from core client for 30 sec - exiting
10:41:32 (4540): No heartbeat from core client for 30 sec - exiting
10:41:33 (4540): No heartbeat from core client for 30 sec - exiting
10:41:34 (4540): No heartbeat from core client for 30 sec - exiting
10:41:35 (4540): No heartbeat from core client for 30 sec - exiting
10:41:36 (4540): No heartbeat from core client for 30 sec - exiting
10:41:37 (4540): No heartbeat from core client for 30 sec - exiting
10:41:38 (4540): No heartbeat from core client for 30 sec - exiting
10:41:39 (4540): No heartbeat from core client for 30 sec - exiting
10:41:40 (4540): No heartbeat from core client for 30 sec - exiting
10:41:41 (4540): No heartbeat from core client for 30 sec - exiting
10:41:42 (4540): No heartbeat from core client for 30 sec - exiting
10:41:43 (4540): No heartbeat from core client for 30 sec - exiting
10:41:44 (4540): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4468, iMonCtr=2
Model crash detected, will try to restart...
Leaving CPDN_Main::Monitor...
13:34:04 (4468): called boinc_finish

</stderr_txt>
<message>
<file_xfer_error>
  <file_name>hadam3p_eu_xo6h_2000_1_006993345_1_10.zip</file_name>
  <error_code>-161</error_code>
</file_xfer_error>
<file_xfer_error>
  <file_name>hadam3p_eu_xo6h_2000_1_006993345_1_11.zip</file_name>
  <error_code>-161</error_code>
</file_xfer_error>
<file_xfer_error>
  <file_name>hadam3p_eu_xo6h_2000_1_006993345_1_12.zip</file_name>
  <error_code>-161</error_code>
</file_xfer_error>

</message>
]]>
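A log of this shape can be summarised by counting how often each kind of message appears. The following is a minimal sketch, not a CPDN or BOINC tool: the label names in `PATTERNS` and the `tally` helper are hypothetical, and the sample text is a handful of lines copied from the stderr output above rather than the whole log.

```python
from collections import Counter

# A few lines copied verbatim from the stderr output above (not the full log).
sample = """\
CPDN Monitor - Quit request from BOINC...
Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=4472, selfPID=4472, iMonCtr=2
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=4140, selfPID=3480, iMonCtr=1
Model crash detected, will try to restart...
23:04:02 (3740): No heartbeat from core client for 30 sec - exiting
13:34:04 (4468): called boinc_finish
"""

# Hypothetical labels mapped to substrings that occur in the log.
PATTERNS = {
    "quit_request": "Quit request from BOINC",
    "process_not_running": "CPDN process is not running",
    "model_crash": "Model crash detected",
    "no_heartbeat": "No heartbeat from core client",
    "finish": "called boinc_finish",
}

def tally(text):
    """Count log lines containing each known substring."""
    counts = Counter()
    for line in text.splitlines():
        for label, needle in PATTERNS.items():
            if needle in line:
                counts[label] += 1
    return counts

counts = tally(sample)
for label in PATTERNS:
    print(label, counts[label])
```

Substring matching is used instead of anchored patterns because some lines in the log are interleaved with output from another process, so a message does not always start at the beginning of a line.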
Latest Trickles Received
Time Sent (UTC)       Host ID  Result ID  Result Name                         Timestep  CPU Time (sec)  Average (sec/TS)
22 Mar 2011 11:27:52  1132662  12584889   hadam3p_eu_xo6h_2000_1_006993345_1  103,776   297,435         2.8661
18 Mar 2011 03:22:40  1132662  12584889   hadam3p_eu_xo6h_2000_1_006993345_1  92,256    264,008         2.8617
14 Mar 2011 09:54:53  1132662  12584889   hadam3p_eu_xo6h_2000_1_006993345_1  80,736    231,056         2.8619
08 Mar 2011 20:59:12  1132662  12584889   hadam3p_eu_xo6h_2000_1_006993345_1  69,216    198,019         2.8609
08 Mar 2011 20:59:12  1132662  12584889   hadam3p_eu_xo6h_2000_1_006993345_1  57,696    166,908         2.8929
08 Mar 2011 20:59:12  1132662  12584889   hadam3p_eu_xo6h_2000_1_006993345_1  46,176    135,411         2.9325
08 Mar 2011 20:59:12  1132662  12584889   hadam3p_eu_xo6h_2000_1_006993345_1  34,656    102,038         2.9443
25 Feb 2011 14:15:53  1132662  12584889   hadam3p_eu_xo6h_2000_1_006993345_1  23,139    68,094          2.9428
24 Feb 2011 13:07:25  1132662  12584889   hadam3p_eu_xo6h_2000_1_006993345_1  23,136    67,661          2.9245
20 Feb 2011 12:25:14  1132662  12584889   hadam3p_eu_xo6h_2000_1_006993345_1  11,616    33,916          2.9198
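The Average (sec/TS) column appears consistent with dividing the cumulative CPU time by the timestep count. The sketch below checks that reading against the three most recent trickle rows. The function name `seconds_per_timestep` is hypothetical, and the page itself does not state how the column is computed, so treat the formula as an assumption.

```python
# (timestep, cpu_time_sec, reported_average) copied from the three most
# recent rows of the trickle table above.
trickles = [
    (103776, 297435, 2.8661),
    (92256, 264008, 2.8617),
    (80736, 231056, 2.8619),
]

def seconds_per_timestep(cpu_time_sec, timestep):
    """Assumed formula: cumulative CPU seconds divided by timesteps completed."""
    return cpu_time_sec / timestep

for timestep, cpu_time_sec, reported in trickles:
    computed = seconds_per_timestep(cpu_time_sec, timestep)
    print(f"{timestep:>7}  computed {computed:.4f}  reported {reported:.4f}")
```

For these rows the computed values agree with the reported ones to four decimal places, which supports the assumed formula without confirming it.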