climateprediction.net home page
Task 12511268

Task 12511268

Name hadam3p_eu_xmep_1960_1_007010249_1
Workunit 7213565
Created 21 Jan 2011, 19:04:12 UTC
Sent 21 Jan 2011, 19:50:04 UTC
Report deadline 4 Jan 2012, 1:10:04 UTC
Received 10 Apr 2011, 22:29:38 UTC
Server state Over
Outcome No reply
Client state Compute error
Exit status 0 (0x00000000)
Computer ID 1051057
Run time 5 days 22 hours 7 min 16 sec
CPU time 2 days 6 hours 4 min 4 sec
Validate state Invalid
Credit 1,591.48
Device peak FLOPS 2.63 GFLOPS
Application version UK Met Office HadAM3P-HadRM3P Europe v6.08
windows_intelx86
Stderr
<core_client_version>6.10.60</core_client_version>
<![CDATA[
<stderr_txt>
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4400, iMonCtr=2
Suspended CPDN Monitor - Suspend request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=3760, selfPID=2684, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=3788, selfPID=728, iMonCtr=1
Model crash detected, will try to restart...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=920, iMonCtr=2
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=1584, iMonCtr=2
Model crash detected, will try to restart...
Leaving CPDN_Main::Monitor...
GController:: CPDN process is not running, exiting, bRetVal = 1, checkPID=3904, selfPID=3092, iMonCtr=1
Model crash detected, will try to restart...
Suspended CPDN Monitor - Suspend request from BOINC...
Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5000, iMonCtr=2
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4636, iMonCtr=2
Model crash detected, will try to restart...
09:29:22 (3356): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4292, iMonCtr=2
Model crash detected, will try to restart...
Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4040, iMonCtr=2
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=788, iMonCtr=2
Model crash detected, will try to restart...
Leaving CPDN_Main::Monitor...
Suspended CPDN Monitor - Suspend request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3500, iMonCtr=2
Model crash detected, will try to restart...
Leaving CPDN_Main::Monitor...
04:07:08 (3500): called boinc_finish
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
15:13:40 (3768): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
15:35:14 (4644): No heartbeat from core client for 30 sec - exiting
15:35:15 (4644): No heartbeat from core client for 30 sec - exiting
15:35:16 (4644): No heartbeat from core client for 30 sec - exiting
15:35:17 (4644): No heartbeat from core client for 30 sec - exiting
15:35:18 (4644): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
12:04:39 (5808): No heartbeat from core client for 30 sec - exiting
Suspended CPDN Monitor - No 'heartbeat' from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
15:17:19 (4592): No heartbeat from core client for 30 sec - exiting
Suspended CPDN Monitor - No 'heartbeat' from BOINC...
15:17:20 (4592): No heartbeat from core client for 30 sec - exiting
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
08:39:46 (1396): No heartbeat from core client for 30 sec - exiting
Suspended CPDN Monitor - No 'heartbeat' from BOINC...
08:39:47 (1396): No heartbeat from core client for 30 sec - exiting
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
16:35:33 (5096): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=4648, selfPID=4648, iMonCtr=2
16:35:34 (5096): No heartbeat from core client for 30 sec - exiting
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
08:31:43 (1412): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
09:12:43 (3972): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
09:12:44 (3972): No heartbeat from core client for 30 sec - exiting
09:12:45 (3972): No heartbeat from core client for 30 sec - exiting
09:12:46 (3972): No heartbeat from core client for 30 sec - exiting
09:12:47 (3972): No heartbeat from core client for 30 sec - exiting
09:12:48 (3972): No heartbeat from core client for 30 sec - exiting
09:18:51 (2288): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
09:18:54 (4400): Can't acquire lockfile (32) - waiting 35s
10:49:06 (4400): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
07:55:05 (4168): No heartbeat from core client for 30 sec - exiting
Suspended CPDN Monitor - No 'heartbeat' from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
14:38:42 (4940): No heartbeat from core client for 30 sec - exiting
Suspended CPDN Monitor - No 'heartbeat' from BOINC...
14:38:43 (4940): No heartbeat from core client for 30 sec - exiting
Suspended CPDN Monitor - Suspend request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=5624, selfPID=5216, iMonCtr=1
Model crash detected, will try to restart...
Suspended CPDN Monitor - Suspend request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=5020, selfPID=4972, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=4936, selfPID=5060, iMonCtr=1
Model crash detected, will try to restart...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5828, iMonCtr=2
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=5664, selfPID=4380, iMonCtr=1
Model crash detected, will try to restart...
Leaving CPDN_Main::Monitor...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4276, iMonCtr=2
Model crash detected, will try to restart...
Leaving CPDN_Main::Monitor...
23:37:30 (4276): called boinc_finish

</stderr_txt>
<message>
<file_xfer_error>
  <file_name>hadam3p_eu_xmep_1960_1_007010249_1_9.zip</file_name>
  <error_code>-161</error_code>
</file_xfer_error>
<file_xfer_error>
  <file_name>hadam3p_eu_xmep_1960_1_007010249_1_10.zip</file_name>
  <error_code>-161</error_code>
</file_xfer_error>
<file_xfer_error>
  <file_name>hadam3p_eu_xmep_1960_1_007010249_1_11.zip</file_name>
  <error_code>-161</error_code>
</file_xfer_error>
<file_xfer_error>
  <file_name>hadam3p_eu_xmep_1960_1_007010249_1_12.zip</file_name>
  <error_code>-161</error_code>
</file_xfer_error>

</message>
]]>
Latest Trickles Received
Time Sent (UTC) Host ID Result ID Result Name Timestep CPU Time (sec) Average (sec/TS)
29 Jan 2011 16:35:17 1051057 12511268 hadam3p_eu_xmep_1960_1_007010249_1 92,256 187,595 2.0334
29 Jan 2011 12:11:47 1051057 12511268 hadam3p_eu_xmep_1960_1_007010249_1 80,736 164,458 2.0370
27 Jan 2011 15:02:09 1051057 12511268 hadam3p_eu_xmep_1960_1_007010249_1 69,216 140,595 2.0313
26 Jan 2011 13:45:08 1051057 12511268 hadam3p_eu_xmep_1960_1_007010249_1 57,696 117,272 2.0326
26 Jan 2011 06:42:32 1051057 12511268 hadam3p_eu_xmep_1960_1_007010249_1 46,180 93,806 2.0313
25 Jan 2011 12:06:30 1051057 12511268 hadam3p_eu_xmep_1960_1_007010249_1 46,176 93,494 2.0247
24 Jan 2011 11:39:17 1051057 12511268 hadam3p_eu_xmep_1960_1_007010249_1 34,656 70,001 2.0199
23 Jan 2011 19:53:54 1051057 12511268 hadam3p_eu_xmep_1960_1_007010249_1 23,136 46,868 2.0258
22 Jan 2011 11:36:09 1051057 12511268 hadam3p_eu_xmep_1960_1_007010249_1 11,616 23,553 2.0276


©2024 climateprediction.net