Task 11874024

Name famous_v3q5_599_200_006690595_6
Workunit 6893848
Created 7 Sep 2010, 8:13:42 UTC
Sent 11 Sep 2010, 19:53:24 UTC
Report deadline 12 Dec 2010, 3:20:35 UTC
Received 12 Oct 2010, 18:19:46 UTC
Server state Over
Outcome Computation error
Client state Compute error
Exit status 25 (0x00000019) Unknown error code
Computer ID 1100336
Run time
CPU time 5 days 23 hours 18 min 20 sec
Validate state Invalid
Credit 1,575.05
Device peak FLOPS 1.39 GFLOPS
Application version UK Met Office FAMOUS v6.11 windows_intelx86
Stderr
<core_client_version>6.2.28</core_client_version>
<![CDATA[
<message>
The drive cannot locate a specific area or track on the disk. (0x19) - exit code 25 (0x19)
</message>
<stderr_txt>
00:43:51 (5224): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
00:43:52 (5224): No heartbeat from core client for 30 sec - exiting
00:43:54 (5224): No heartbeat from core client for 30 sec - exiting
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5592, iMonCtr=1
Model crash detected, will try to restart...
01:50:40 (2504): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...

BUFFIN: Read Failed: No such file or directory
BUFFIN: C I/O Error feof - Unit 60 - Return code = 16

BUFFIN: Read Failed: No such file or directory
BUFFIN: C I/O Error feof - Unit 61 - Return code = 16

BUFFIN: Read Failed: No such file or directory
BUFFIN: C I/O Error feof - Unit 68 - Return code = 16

BUFFIN: Read Failed: No such file or directory
BUFFIN: C I/O Error feof - Unit 69 - Return code = 16
21:39:41 (4776): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
21:39:42 (4776): No heartbeat from core client for 30 sec - exiting
05:16:32 (4308): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
05:16:33 (4308): No heartbeat from core client for 30 sec - exiting
05:16:34 (4308): No heartbeat from core client for 30 sec - exiting
05:16:35 (4308): No heartbeat from core client for 30 sec - exiting
05:16:36 (4308): No heartbeat from core client for 30 sec - exiting
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=8940, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5684, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5856, iMonCtr=1
Model crash detected, will try to restart...
07:32:38 (5276): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4512, iMonCtr=1
Model crash detected, will try to restart...
01:00:04 (5332): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
01:00:15 (5332): No heartbeat from core client for 30 sec - exiting
17:34:03 (5488): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...

BUFFIN: Read Failed: No such file or directory
BUFFIN: C I/O Error feof - Unit 60 - Return code = 16

BUFFIN: Read Failed: No such file or directory
BUFFIN: C I/O Error feof - Unit 61 - Return code = 16

BUFFIN: Read Failed: No such file or directory
BUFFIN: C I/O Error feof - Unit 68 - Return code = 16

BUFFIN: Read Failed: No such file or directory
BUFFIN: C I/O Error feof - Unit 69 - Return code = 16
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=20580, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5564, iMonCtr=1
Model crash detected, will try to restart...

BUFFIN: Read Failed: No such file or directory
BUFFIN: C I/O Error feof - Unit 60 - Return code = 16

BUFFIN: Read Failed: No such file or directory
BUFFIN: C I/O Error feof - Unit 61 - Return code = 16

BUFFIN: Read Failed: No such file or directory
BUFFIN: C I/O Error feof - Unit 68 - Return code = 16

BUFFIN: Read Failed: No such file or directory
BUFFIN: C I/O Error feof - Unit 69 - Return code = 16
CPDN Monitor - Quit request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=19000, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4964, iMonCtr=1
Model crash detected, will try to restart...
23:57:28 (5976): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
23:57:30 (5976): No heartbeat from core client for 30 sec - exiting
23:57:31 (5976): No heartbeat from core client for 30 sec - exiting
23:57:32 (5976): No heartbeat from core client for 30 sec - exiting
23:57:33 (5976): No heartbeat from core client for 30 sec - exiting
23:57:34 (5976): No heartbeat from core client for 30 sec - exiting
06:54:07 (1480): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
06:54:17 (1480): No heartbeat from core client for 30 sec - exiting
06:54:18 (1480): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - Quit request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=24560, iMonCtr=1
Model crash detected, will try to restart...

BUFFIN: Read Failed: No such file or directory
BUFFIN: C I/O Error feof - Unit 60 - Return code = 16

BUFFIN: Read Failed: No such file or directory
BUFFIN: C I/O Error feof - Unit 61 - Return code = 16

BUFFIN: Read Failed: No such file or directory
BUFFIN: C I/O Error feof - Unit 68 - Return code = 16

BUFFIN: Read Failed: No such file or directory
BUFFIN: C I/O Error feof - Unit 69 - Return code = 16
01:34:50 (4580): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
01:34:53 (4580): No heartbeat from core client for 30 sec - exiting
18:56:17 (5452): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
15:21:55 (168): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
15:21:57 (168): No heartbeat from core client for 30 sec - exiting
15:21:58 (168): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - Quit request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=31776, iMonCtr=1
Model crash detected, will try to restart...

BUFFIN: Read Failed: No such file or directory
BUFFIN: C I/O Error feof - Unit 60 - Return code = 16

BUFFIN: Read Failed: No such file or directory
BUFFIN: C I/O Error feof - Unit 61 - Return code = 16

BUFFIN: Read Failed: No such file or directory
BUFFIN: C I/O Error feof - Unit 68 - Return code = 16

BUFFIN: Read Failed: No such file or directory
BUFFIN: C I/O Error feof - Unit 69 - Return code = 16
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5768, iMonCtr=1
Model crash detected, will try to restart...
07:10:39 (5640): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
07:10:41 (5640): No heartbeat from core client for 30 sec - exiting
07:10:42 (5640): No heartbeat from core client for 30 sec - exiting
07:10:43 (5640): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - Quit request from BOINC...

BUFFIN: Read Failed: No such file or directory
BUFFIN: C I/O Error feof - Unit 60 - Return code = 16

BUFFIN: Read Failed: No such file or directory
BUFFIN: C I/O Error feof - Unit 61 - Return code = 16

BUFFIN: Read Failed: No such file or directory
BUFFIN: C I/O Error feof - Unit 68 - Return code = 16

BUFFIN: Read Failed: No such file or directory
BUFFIN: C I/O Error feof - Unit 69 - Return code = 16
14:18:22 (3976): called boinc_finish

</stderr_txt>
]]>
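The stderr output above repeats a handful of message types many times. As a rough aid for reading it, the following sketch tallies those message types by simple text matching. The category labels (`no_heartbeat`, `model_restart`, and so on) and the `summarize` helper are hypothetical names chosen for this illustration; they are not part of BOINC or the CPDN application, and the sample text is only a short excerpt of the log.

```python
import re
from collections import Counter

# Hypothetical category labels mapped to text patterns seen in the stderr output.
PATTERNS = {
    "no_heartbeat": re.compile(r"No heartbeat from core client"),
    "model_restart": re.compile(r"Model crash detected"),
    "buffin_read_failed": re.compile(r"BUFFIN: Read Failed"),
    "quit_request": re.compile(r"Quit request from BOINC"),
}

def summarize(stderr_text):
    """Count how many lines match each pattern (first match wins per line)."""
    counts = Counter()
    for line in stderr_text.splitlines():
        for label, pattern in PATTERNS.items():
            if pattern.search(line):
                counts[label] += 1
                break
    return counts

# A short excerpt of the log above, used only as sample input.
sample = """\
00:43:51 (5224): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
Model crash detected, will try to restart...
BUFFIN: Read Failed: No such file or directory
BUFFIN: C I/O Error feof - Unit 60 - Return code = 16
CPDN Monitor - Quit request from BOINC...
"""

for label, count in summarize(sample).items():
    print(label, count)
```

Pasting the full text between `<stderr_txt>` and `</stderr_txt>` into `sample` would give totals for the whole task. Lines that match none of the patterns, such as the `CPDN Monitor - No 'heartbeat'` lines, are not counted.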
Latest Trickles Received
Time Sent (UTC)       Host ID  Result ID  Result Name                      Timestep  CPU Time (sec)  Average (sec/TS)
12 Oct 2010 03:48:05  1100336  11874024   famous_v3q5_599_200_006690595_6  477,386   505,897         1.0597
12 Oct 2010 03:48:05  1100336  11874024   famous_v3q5_599_200_006690595_6  468,026   495,965         1.0597
10 Oct 2010 19:28:46  1100336  11874024   famous_v3q5_599_200_006690595_6  458,666   485,992         1.0596
10 Oct 2010 17:21:09  1100336  11874024   famous_v3q5_599_200_006690595_6  449,306   476,050         1.0595
10 Oct 2010 17:21:09  1100336  11874024   famous_v3q5_599_200_006690595_6  439,946   466,142         1.0595
10 Oct 2010 17:21:09  1100336  11874024   famous_v3q5_599_200_006690595_6  430,586   456,209         1.0595
10 Oct 2010 17:21:09  1100336  11874024   famous_v3q5_599_200_006690595_6  421,226   446,246         1.0594
06 Oct 2010 06:38:40  1100336  11874024   famous_v3q5_599_200_006690595_6  411,866   436,364         1.0595
06 Oct 2010 03:51:29  1100336  11874024   famous_v3q5_599_200_006690595_6  402,506   426,497         1.0596
06 Oct 2010 03:32:55  1100336  11874024   famous_v3q5_599_200_006690595_6  393,146   416,620         1.0597
05 Oct 2010 20:34:58  1100336  11874024   famous_v3q5_599_200_006690595_6  383,786   406,771         1.0599
05 Oct 2010 05:38:12  1100336  11874024   famous_v3q5_599_200_006690595_6  374,426   396,855         1.0599
04 Oct 2010 06:36:10  1100336  11874024   famous_v3q5_599_200_006690595_6  365,066   386,940         1.0599
04 Oct 2010 03:17:52  1100336  11874024   famous_v3q5_599_200_006690595_6  355,706   376,654         1.0589
03 Oct 2010 21:40:10  1100336  11874024   famous_v3q5_599_200_006690595_6  346,346   366,500         1.0582
02 Oct 2010 04:12:31  1100336  11874024   famous_v3q5_599_200_006690595_6  336,986   356,556         1.0581
01 Oct 2010 06:10:29  1100336  11874024   famous_v3q5_599_200_006690595_6  327,626   346,595         1.0579
01 Oct 2010 03:23:53  1100336  11874024   famous_v3q5_599_200_006690595_6  318,266   336,687         1.0579
30 Sep 2010 22:57:24  1100336  11874024   famous_v3q5_599_200_006690595_6  308,906   326,727         1.0577
30 Sep 2010 10:57:13  1100336  11874024   famous_v3q5_599_200_006690595_6  299,546   316,770         1.0575
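
The Average (sec/TS) column in the trickle table appears to be the cumulative CPU time divided by the timestep count. The sketch below checks that reading against three rows copied from the table. The function name `seconds_per_timestep` is a hypothetical helper for this illustration, and the formula is inferred from the numbers rather than taken from CPDN documentation.

```python
# Rows copied from the trickle table: (timestep, CPU time in seconds, reported average).
trickles = [
    (477_386, 505_897, 1.0597),
    (468_026, 495_965, 1.0597),
    (299_546, 316_770, 1.0575),
]

def seconds_per_timestep(timestep, cpu_seconds):
    """Assumed definition of the Average column: cumulative CPU seconds per timestep."""
    return cpu_seconds / timestep

for timestep, cpu_seconds, reported in trickles:
    computed = round(seconds_per_timestep(timestep, cpu_seconds), 4)
    # Print the computed value next to the value reported in the table.
    print(timestep, cpu_seconds, computed, reported)
```

For these three rows the computed values agree with the reported ones to four decimal places, which supports the assumed definition.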