climateprediction.net home page
Task 16109592

Task 16109592

Name hadam3p_eu_isvj_2005_1_008482612_0
Workunit 8633425
Created 3 Dec 2013, 19:55:57 UTC
Sent 3 Dec 2013, 19:57:07 UTC
Report deadline 16 Nov 2014, 1:17:07 UTC
Received 22 Dec 2013, 17:07:49 UTC
Server state Over
Outcome Computation error
Client state Compute error
Exit status 194 (0x000000C2) EXIT_ABORTED_BY_CLIENT
Computer ID 1300898
Run time 6 days 23 hours 15 min 11 sec
CPU time 5 days 18 hours 52 min 52 sec
Validate state Invalid
Credit 2,386.39
Device peak FLOPS 2.30 GFLOPS
Application version UK Met Office HadAM3P-HadRM3P Europe v6.09
windows_intelx86
Stderr
<core_client_version>7.2.33</core_client_version>
<![CDATA[
<message>
finish file present too long
</message>
<stderr_txt>
08:49:14 (2100): No heartbeat from core client for 30 sec - exiting
08:49:15 (2100): No heartbeat from core client for 30 sec - exiting
08:49:16 (2100): No heartbeat from core client for 30 sec - exiting
08:49:17 (2100): No heartbeat from core client for 30 sec - exiting
08:49:18 (2100): No heartbeat from core client for 30 sec - exiting
08:49:19 (2100): No heartbeat from core client for 30 sec - exiting
08:49:20 (2100): No heartbeat from core client for 30 sec - exiting
08:49:21 (2100): No heartbeat from core client for 30 sec - exiting
08:49:22 (2100): No heartbeat from core client for 30 sec - exiting
08:49:23 (2100): No heartbeat from core client for 30 sec - exiting
08:49:24 (2100): No heartbeat from core client for 30 sec - exiting
08:49:25 (2100): No heartbeat from core client for 30 sec - exiting
08:49:26 (2100): No heartbeat from core client for 30 sec - exiting
08:49:27 (2100): No heartbeat from core client for 30 sec - exiting
08:49:28 (2100): No heartbeat from core client for 30 sec - exiting
08:49:29 (2100): No heartbeat from core client for 30 sec - exiting
08:49:30 (2100): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
18:11:02 (348): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
Colobal Worker:: CPDtr proler:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=2816, iMonCtr=2
Model crash detected, will try to restart...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=1684, selfPID=3276, iMonCtr=1
Model crash detected, will try to restart...
Leaving CPDN_Main::Monitor...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=3788, selfPID=3788, iMonCtr=2
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
17:24:12 (3908): No heartbeat from core client for 30 sec - exiting
17:24:13 (3908): No heartbeat from core client for 30 sec - exiting
17:24:14 (3908): No heartbeat from core client for 30 sec - exiting
17:24:15 (3908): No heartbeat from core client for 30 sec - exiting
17:24:16 (3908): No heartbeat from core client for 30 sec - exiting
17:24:17 (3908): No heartbeat from core client for 30 sec - exiting
17:24:18 (3908): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=4468, selfPID=4468, iMonCtr=2
CPDN Monitor - Quit request from BOINC...
Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=5012, selfPID=5012, iMonCtr=2
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
Colobal Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4492, iMonCtr=2
ntroller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3244, iMonCtr=2
Model crash detected, will try to restart...
Leaving CPDN_Main::Monitor...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
09:39:42 (3528): No heartbeat from core client for 30 sec - exiting
09:39:43 (3528): No heartbeat from core client for 30 sec - exiting
09:39:44 (3528): No heartbeat from core client for 30 sec - exiting
09:39:45 (3528): No heartbeat from core client for 30 sec - exiting
09:39:46 (3528): No heartbeat from core client for 30 sec - exiting
09:39:47 (3528): No heartbeat from core client for 30 sec - exiting
09:39:48 (3528): No heartbeat from core client for 30 sec - exiting
09:39:49 (3528): No heartbeat from core client for 30 sec - exiting
09:39:50 (3528): No heartbeat from core client for 30 sec - exiting
09:39:51 (3528): No heartbeat from core client for 30 sec - exiting
09:39:52 (3528): No heartbeat from core client for 30 sec - exiting
09:39:53 (3528): No heartbeat from core client for 30 sec - exiting
09:39:54 (3528): No heartbeat from core client for 30 sec - exiting
09:39:55 (3528): No heartbeat from core client for 30 sec - exiting
09:39:56 (3528): No heartbeat from core client for 30 sec - exiting
09:39:57 (3528): No heartbeat from core client for 30 sec - exiting
09:39:58 (3528): No heartbeat from core client for 30 sec - exiting
09:39:59 (3528): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=1988, selfPID=1988, iMonCtr=2
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=4048, selfPID=3900, iMonCtr=1
Model crash detected, will try to restart...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=1688, selfPID=1688, iMonCtr=2
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=4452, selfPID=4452, iMonCtr=2
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
11:02:14 (3352): No heartbeat from core client for 30 sec - exiting
11:02:15 (3352): No heartbeat from core client for 30 sec - exiting
11:02:16 (3352): No heartbeat from core client for 30 sec - exiting
11:02:17 (3352): No heartbeat from core client for 30 sec - exiting
11:02:18 (3352): No heartbeat from core client for 30 sec - exiting
11:02:19 (3352): No heartbeat from core client for 30 sec - exiting
11:02:20 (3352): No heartbeat from core client for 30 sec - exiting
11:02:21 (3352): No heartbeat from core client for 30 sec - exiting
11:02:22 (3352): No heartbeat from core client for 30 sec - exiting
11:02:23 (3352): No heartbeat from core client for 30 sec - exiting
11:02:24 (3352): No heartbeat from core client for 30 sec - exiting
11:02:25 (3352): No heartbeat from core client for 30 sec - exiting
11:02:26 (3352): No heartbeat from core client for 30 sec - exiting
11:02:27 (3352): No heartbeat from core client for 30 sec - exiting
11:02:28 (3352): No heartbeat from core client for 30 sec - exiting
11:02:29 (3352): No heartbeat from core client for 30 sec - exiting
11:02:30 (3352): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Leaving CPDN_Main::Monitor...
Called boinc_finish
CPDN Monitor - Abort request from BOINC...
Called boinc_finish

</stderr_txt>
]]>
Latest Trickles Received
Time Sent (UTC) Host ID Result ID Result Name Timestep CPU Time (sec) Average (sec/TS)
21 Dec 2013 16:14:28 1300898 16109592 hadam3p_eu_isvj_2005_1_008482612_0 138,336 500,574 3.6185
19 Dec 2013 18:08:54 1300898 16109592 hadam3p_eu_isvj_2005_1_008482612_0 126,816 458,910 3.6187
18 Dec 2013 12:28:11 1300898 16109592 hadam3p_eu_isvj_2005_1_008482612_0 115,296 417,642 3.6223
17 Dec 2013 10:52:08 1300898 16109592 hadam3p_eu_isvj_2005_1_008482612_0 103,776 375,897 3.6222
16 Dec 2013 10:44:28 1300898 16109592 hadam3p_eu_isvj_2005_1_008482612_0 92,256 334,983 3.6310
13 Dec 2013 15:10:35 1300898 16109592 hadam3p_eu_isvj_2005_1_008482612_0 80,736 293,835 3.6395
12 Dec 2013 10:51:25 1300898 16109592 hadam3p_eu_isvj_2005_1_008482612_0 69,216 253,031 3.6557
10 Dec 2013 20:03:47 1300898 16109592 hadam3p_eu_isvj_2005_1_008482612_0 57,696 211,656 3.6685
09 Dec 2013 09:04:29 1300898 16109592 hadam3p_eu_isvj_2005_1_008482612_0 46,176 169,245 3.6652
06 Dec 2013 22:01:41 1300898 16109592 hadam3p_eu_isvj_2005_1_008482612_0 34,656 126,701 3.6560
05 Dec 2013 21:58:05 1300898 16109592 hadam3p_eu_isvj_2005_1_008482612_0 23,136 85,168 3.6812
04 Dec 2013 22:39:22 1300898 16109592 hadam3p_eu_isvj_2005_1_008482612_0 11,616 42,407 3.6507


©2024 cpdn.org