climateprediction.net home page
Task 12566008

Task 12566008

Name hadam3p_eu_whfc_1984_1_007153809_0
Workunit 7338589
Created 9 Feb 2011, 16:07:01 UTC
Sent 9 Feb 2011, 22:04:36 UTC
Report deadline 23 Jan 2012, 3:24:36 UTC
Received 5 Sep 2011, 10:49:09 UTC
Server state Over
Outcome Computation error
Client state Compute error
Exit status 0 (0x00000000)
Computer ID 975380
Run time 6 days 19 hours 52 min 29 sec
CPU time 4 days 15 hours 12 min 10 sec
Validate state Invalid
Credit 1,988.94
Device peak FLOPS 1.91 GFLOPS
Application version UK Met Office HadAM3P-HadRM3P Europe v6.08
windows_intelx86
Stderr
<core_client_version>6.10.18</core_client_version>
<![CDATA[
<stderr_txt>
12:41:55 (3580): No heartbeat from core client for 30 sec - exiting
12:41:56 (3580): No heartbeat from core client for 30 sec - exiting
12:41:57 (3580): No heartbeat from core client for 30 sec - exiting
12:41:58 (3580): No heartbeat from core client for 30 sec - exiting
12:41:59 (3580): No heartbeat from core client for 30 sec - exiting
12:42:00 (3580): No heartbeat from core client for 30 sec - exiting
12:42:01 (3580): No heartbeat from core client for 30 sec - exiting
12:42:02 (3580): No heartbeat from core client for 30 sec - exiting
12:42:03 (3580): No heartbeat from core client for 30 sec - exiting
12:42:04 (3580): No heartbeat from core client for 30 sec - exiting
12:42:05 (3580): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
13:35:19 (3416): No heartbeat from core client for 30 sec - exiting
13:35:20 (3416): No heartbeat from core client for 30 sec - exiting
13:35:21 (3416): No heartbeat from core client for 30 sec - exiting
13:35:22 (3416): No heartbeat from core client for 30 sec - exiting
13:35:23 (3416): No heartbeat from core client for 30 sec - exiting
13:35:24 (3416): No heartbeat from core client for 30 sec - exiting
13:35:25 (3416): No heartbeat from core client for 30 sec - exiting
13:35:26 (3416): No heartbeat from core client for 30 sec - exiting
13:35:27 (3416): No heartbeat from core client for 30 sec - exiting
13:35:28 (3416): No heartbeat from core client for 30 sec - exiting
13:35:29 (3416): No heartbeat from core client for 30 sec - exiting
13:35:30 (3416): No heartbeat from core client for 30 sec - exiting
13:35:31 (3416): No heartbeat from core client for 30 sec - exiting
13:35:32 (3416): No heartbeat from core client for 30 sec - exiting
13:35:33 (3416): No heartbeat from core client for 30 sec - exiting
13:35:34 (3416): No heartbeat from core client for 30 sec - exiting
13:35:35 (3416): No heartbeat from core client for 30 sec - exiting
13:35:36 (3416): No heartbeat from core client for 30 sec - exiting
13:35:37 (3416): No heartbeat from core client for 30 sec - exiting
13:35:38 (3416): No heartbeat from core client for 30 sec - exiting
13:35:39 (3416): No heartbeat from core client for 30 sec - exiting
13:35:40 (3416): No heartbeat from core client for 30 sec - exiting
13:35:41 (3416): No heartbeat from core client for 30 sec - exiting
13:35:42 (3416): No heartbeat from core client for 30 sec - exiting
13:35:43 (3416): No heartbeat from core client for 30 sec - exiting
13:35:44 (3416): No heartbeat from core client for 30 sec - exiting
13:35:45 (3416): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=5280, selfPID=3980, iMonCtr=1
Model crash detected, will try to restart...
21:07:34 (4592): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
21:07:35 (4592): No heartbeat from core client for 30 sec - exiting
21:07:36 (4592): No heartbeat from core client for 30 sec - exiting
21:07:37 (4592): No heartbeat from core client for 30 sec - exiting
21:07:38 (4592): No heartbeat from core client for 30 sec - exiting
21:07:39 (4592): No heartbeat from core client for 30 sec - exiting
21:07:40 (4592): No heartbeat from core client for 30 sec - exiting
21:07:42 (4592): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - Quit request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=1232, selfPID=992, iMonCtr=1
Model crash detected, will try to restart...
CPDN Monitor - Quit request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=1332, iMonCtr=2
Model crash detected, will try to restart...
Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3024, iMonCtr=2
Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=1680, iMonCtr=2
CPDN Monitor - Quit request from BOINC...
10:53:17 (2472): No heartbeat from core client for 30 sec - exiting
10:53:18 (2472): No heartbeat from core client for 30 sec - exiting
10:53:20 (2472): No heartbeat from core client for 30 sec - exiting
10:53:21 (2472): No heartbeat from core client for 30 sec - exiting
10:53:22 (2472): No heartbeat from core client for 30 sec - exiting
10:53:23 (2472): No heartbeat from core client for 30 sec - exiting
10:53:24 (2472): No heartbeat from core client for 30 sec - exiting
10:53:25 (2472): No heartbeat from core client for 30 sec - exiting
10:53:26 (2472): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
CPDN Monitor - Quit request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checCPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=4468, selfPID=4468, iMonCtr=2
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=4396, selfPID=1444, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=4696, selfPID=4508, iMonCtr=1
Model crash detected, will try to restart...
20:51:27 (1652): No heartbeat from core client for 30 sec - exiting
20:51:28 (1652): No heartbeat from core client for 30 sec - exiting
20:51:29 (1652): No heartbeat from core client for 30 sec - exiting
20:51:30 (1652): No heartbeat from core client for 30 sec - exiting
20:51:31 (1652): No heartbeat from core client for 30 sec - exiting
20:51:32 (1652): No heartbeat from core client for 30 sec - exiting
20:51:33 (1652): No heartbeat from core client for 30 sec - exiting
20:51:34 (1652): No heartbeat from core client for 30 sec - exiting
20:51:35 (1652): No heartbeat from core client for 30 sec - exiting
20:51:36 (1652): No heartbeat from core client for 30 sec - exiting
20:51:37 (1652): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=4692, selfPID=2964, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3928, iMonCtr=2
Model crash detected, will try to restart...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4864, iMonCtr=2
Model crash detected, will try to restart...
CCPDN Monitor - Quit request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3512, iMonCtr=2
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=4120, selfPID=2792, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4052, iMonCtr=2
Model crash detected, will try to restart...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=5C13:07:05 (4628): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=156, selfPID=5972, iMonCtr=1
Model crash detected, will try to restart...
CPDN Monitor - Quit request from BOINC...
CCCPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...

</stderr_txt>
<message>
<file_xfer_error>
  <file_name>hadam3p_eu_whfc_1984_1_007153809_0_11.zip</file_name>
  <error_code>-161</error_code>
</file_xfer_error>
<file_xfer_error>
  <file_name>hadam3p_eu_whfc_1984_1_007153809_0_12.zip</file_name>
  <error_code>-161</error_code>
</file_xfer_error>

</message>
]]>
Latest Trickles Received
Time Sent (UTC) Host ID Result ID Result Name Timestep CPU Time (sec) Average (sec/TS)
29 Aug 2011 19:19:01 975380 12566008 hadam3p_eu_whfc_1984_1_007153809_0 115,296 398,252 3.4542
25 Jul 2011 18:58:29 975380 12566008 hadam3p_eu_whfc_1984_1_007153809_0 103,813 359,313 3.4612
25 Jul 2011 18:57:23 975380 12566008 hadam3p_eu_whfc_1984_1_007153809_0 103,809 358,691 3.4553
25 Jul 2011 18:53:52 975380 12566008 hadam3p_eu_whfc_1984_1_007153809_0 103,801 358,088 3.4498
25 Jul 2011 15:06:17 975380 12566008 hadam3p_eu_whfc_1984_1_007153809_0 103,786 357,493 3.4445
25 Jul 2011 15:06:17 975380 12566008 hadam3p_eu_whfc_1984_1_007153809_0 103,780 356,922 3.4392
28 May 2011 12:50:48 975380 12566008 hadam3p_eu_whfc_1984_1_007153809_0 103,776 356,387 3.4342
23 Apr 2011 19:52:37 975380 12566008 hadam3p_eu_whfc_1984_1_007153809_0 92,256 317,118 3.4374
27 Mar 2011 23:03:40 975380 12566008 hadam3p_eu_whfc_1984_1_007153809_0 80,736 278,481 3.4493
20 Mar 2011 20:34:41 975380 12566008 hadam3p_eu_whfc_1984_1_007153809_0 69,216 238,166 3.4409
15 Mar 2011 07:52:19 975380 12566008 hadam3p_eu_whfc_1984_1_007153809_0 57,696 198,727 3.4444
13 Mar 2011 20:40:35 975380 12566008 hadam3p_eu_whfc_1984_1_007153809_0 46,176 159,210 3.4479
10 Mar 2011 20:19:04 975380 12566008 hadam3p_eu_whfc_1984_1_007153809_0 34,656 118,751 3.4266
08 Mar 2011 18:01:07 975380 12566008 hadam3p_eu_whfc_1984_1_007153809_0 23,136 79,500 3.4362
27 Feb 2011 15:06:48 975380 12566008 hadam3p_eu_whfc_1984_1_007153809_0 11,616 39,718 3.4192


©2024 climateprediction.net