climateprediction.net home page
Task 11874023

Task 11874023

Name famous_v3q4_1999_200_006690594_5
Workunit 6893847
Created 7 Sep 2010, 8:13:41 UTC
Sent 11 Sep 2010, 19:53:45 UTC
Report deadline 12 Dec 2010, 3:20:56 UTC
Received 4 Nov 2010, 11:22:30 UTC
Server state Over
Outcome Computation error
Client state Compute error
Exit status -226 (0xFFFFFF1E) ERR_TOO_MANY_EXITS
Computer ID 1100336
Run time
CPU time 11 days 22 hours 16 min 28 sec
Validate state Invalid
Credit 3,119.13
Device peak FLOPS 1.39 GFLOPS
Application version UK Met Office FAMOUS v6.11
windows_intelx86
Stderr
<core_client_version>6.2.28</core_client_version>
<![CDATA[
<message>
too many exit(0)s
</message>
<stderr_txt>
00:43:48 (5160): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
00:43:49 (5160): No heartbeat from core client for 30 sec - exiting
00:43:50 (5160): No heartbeat from core client for 30 sec - exiting
00:43:51 (5160): No heartbeat from core client for 30 sec - exiting
00:43:53 (5160): No heartbeat from core client for 30 sec - exiting
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4928, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5600, iMonCtr=1
Model crash detected, will try to restart...
01:50:32 (2040): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
01:50:58 (2040): No heartbeat from core client for 30 sec - exiting
01:50:59 (2040): No heartbeat from core client for 30 sec - exiting
01:51:00 (2040): No heartbeat from core client for 30 sec - exiting
01:51:01 (2040): No heartbeat from core client for 30 sec - exiting
C05:16:38 (4316): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
05:16:39 (4316): No heartbeat from core client for 30 sec - exiting
05:16:40 (4316): No heartbeat from core client for 30 sec - exiting
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5864, iMonCtr=1
Model crash detected, will try to restart...
07:32:43 (5284): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
01:00:13 (5340): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
01:00:14 (5340): No heartbeat from core client for 30 sec - exiting
17:34:02 (11120): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
CPDN Monitor - Quit request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=19548, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5116, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4972, iMonCtr=1
Model crash detected, will try to restart...
23:57:39 (5984): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
06:53:48 (2068): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=11780, iMonCtr=1
Model crash detected, will try to restart...
CPDN Monitor - Quit request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=24096, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5400, iMonCtr=1
Model crash detected, will try to restart...
01:34:44 (4588): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
01:34:45 (4588): No heartbeat from core client for 30 sec - exiting
01:34:46 (4588): No heartbeat from core client for 30 sec - exiting
01:34:48 (4588): No heartbeat from core client for 30 sec - exiting
01:34:49 (4588): No heartbeat from core client for 30 sec - exiting
01:34:50 (4588): No heartbeat from core client for 30 sec - exiting
01:34:52 (4588): No heartbeat from core client for 30 sec - exiting
18:56:23 (5460): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
18:56:49 (5460): No heartbeat from core client for 30 sec - exiting
18:56:50 (5460): No heartbeat from core client for 30 sec - exiting
18:56:51 (5460): No heartbeat from core client for 30 sec - exiting
15:21:52 (3948): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
15:21:57 (3948): No heartbeat from core client for 30 sec - exiting
15:21:58 (3948): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - Quit request from BOINC...

BUFFIN: Read Failed: No such file or directory
BUFFIN: C I/O Error feof - Unit 60 - Return code = 16

BUFFIN: Read Failed: No such file or directory
BUFFIN: C I/O Error feof - Unit 61 - Return code = 16

BUFFIN: Read Failed: No such file or directory
BUFFIN: C I/O Error feof - Unit 68 - Return code = 16

BUFFIN: Read Failed: No such file or directory
BUFFIN: C I/O Error feof - Unit 69 - Return code = 16
07:10:36 (5648): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
07:10:37 (5648): No heartbeat from core client for 30 sec - exiting
21:45:19 (5520): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
CPDN Monitor - Quit request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=31844, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4932, iMonCtr=1
Model crash detected, will try to restart...
00:02:46 (4656): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
00:02:47 (4656): No heartbeat from core client for 30 sec - exiting
07:42:36 (9884): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
07:42:50 (9884): No heartbeat from core client for 30 sec - exiting
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4860, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4936, iMonCtr=1
Model crash detected, will try to restart...

BUFFIN: Read Failed: No such file or directory
BUFFIN: C I/O Error feof - Unit 60 - Return code = 16

BUFFIN: Read Failed: No such file or directory
BUFFIN: C I/O Error feof - Unit 61 - Return code = 16

BUFFIN: Read Failed: No such file or directory
BUFFIN: C I/O Error feof - Unit 68 - Return code = 16

BUFFIN: Read Failed: No such file or directory
BUFFIN: C I/O Error feof - Unit 69 - Return code = 16
23:56:16 (2088): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=1052, iMonCtr=1
Model crash detected, will try to restart...
07:34:46 (4212): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
07:34:47 (4212): No heartbeat from core client for 30 sec - exiting
07:34:48 (4212): No heartbeat from core client for 30 sec - exiting
07:34:49 (4212): No heartbeat from core client for 30 sec - exiting
07:34:51 (4212): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - Quit request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=41292, iMonCtr=1
Model crash detected, will try to restart...
00:05:36 (4200): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
14:54:45 (1104): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
14:54:46 (1104): No heartbeat from core client for 30 sec - exiting
14:54:47 (1104): No heartbeat from core client for 30 sec - exiting
14:54:49 (1104): No heartbeat from core client for 30 sec - exiting
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=7192, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=840, iMonCtr=1
Model crash detected, will try to restart...
CPDN Monitor - Quit request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3892, iMonCtr=1
Model crash detected, will try to restart...
02:03:45 (4584): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
16:02:27 (12388): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
23:24:39 (35016): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
22:02:34 (4400): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
22:02:35 (4400): No heartbeat from core client for 30 sec - exiting
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=16536, iMonCtr=1
Model crash detected, will try to restart...
CPDN Monitor - Quit request from BOINC...
07:44:53 (3556): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
07:44:54 (3556): No heartbeat from core client for 30 sec - exiting

BUFFIN: Read Failed: No such file or directory
BUFFIN: C I/O Error feof - Unit 60 - Return code = 16

BUFFIN: Read Failed: No such file or directory
BUFFIN: C I/O Error feof - Unit 61 - Return code = 16

BUFFIN: Read Failed: No such file or directory
BUFFIN: C I/O Error feof - Unit 68 - Return code = 16

BUFFIN: Read Failed: No such file or directory
BUFFIN: C I/O Error feof - Unit 69 - Return code = 16
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4256, iMonCtr=1
Model crash detected, will try to restart...

</stderr_txt>
]]>
Latest Trickles Received
Time Sent (UTC) Host ID Result ID Result Name Timestep CPU Time (sec) Average (sec/TS)
04 Nov 2010 04:49:40 1100336 11874023 famous_v3q4_1999_200_006690594_5 945,386 1,022,989 1.0821
03 Nov 2010 21:46:21 1100336 11874023 famous_v3q4_1999_200_006690594_5 936,026 1,012,795 1.0820
03 Nov 2010 04:48:57 1100336 11874023 famous_v3q4_1999_200_006690594_5 926,666 1,002,406 1.0817
02 Nov 2010 22:36:02 1100336 11874023 famous_v3q4_1999_200_006690594_5 917,306 992,115 1.0816
02 Nov 2010 17:52:33 1100336 11874023 famous_v3q4_1999_200_006690594_5 907,946 981,760 1.0813
02 Nov 2010 12:16:26 1100336 11874023 famous_v3q4_1999_200_006690594_5 898,586 971,288 1.0809
02 Nov 2010 06:06:44 1100336 11874023 famous_v3q4_1999_200_006690594_5 889,226 960,762 1.0804
02 Nov 2010 04:31:57 1100336 11874023 famous_v3q4_1999_200_006690594_5 879,866 950,169 1.0799
01 Nov 2010 04:58:31 1100336 11874023 famous_v3q4_1999_200_006690594_5 870,506 940,105 1.0800
31 Oct 2010 21:22:30 1100336 11874023 famous_v3q4_1999_200_006690594_5 861,146 930,105 1.0801
31 Oct 2010 06:33:55 1100336 11874023 famous_v3q4_1999_200_006690594_5 851,786 919,832 1.0799
31 Oct 2010 03:29:37 1100336 11874023 famous_v3q4_1999_200_006690594_5 842,426 909,626 1.0798
30 Oct 2010 23:15:46 1100336 11874023 famous_v3q4_1999_200_006690594_5 833,066 899,559 1.0798
30 Oct 2010 03:48:34 1100336 11874023 famous_v3q4_1999_200_006690594_5 823,706 889,304 1.0796
29 Oct 2010 12:04:32 1100336 11874023 famous_v3q4_1999_200_006690594_5 814,346 878,847 1.0792
29 Oct 2010 08:34:06 1100336 11874023 famous_v3q4_1999_200_006690594_5 804,986 868,625 1.0791
28 Oct 2010 22:05:21 1100336 11874023 famous_v3q4_1999_200_006690594_5 795,626 858,603 1.0792
28 Oct 2010 20:12:10 1100336 11874023 famous_v3q4_1999_200_006690594_5 786,266 848,715 1.0794
27 Oct 2010 11:20:59 1100336 11874023 famous_v3q4_1999_200_006690594_5 776,906 838,649 1.0795
27 Oct 2010 08:32:31 1100336 11874023 famous_v3q4_1999_200_006690594_5 767,546 828,578 1.0795


©2024 cpdn.org