climateprediction.net home page
Task 12489811

Task 12489811

Name famous_wuh6_1599_200_007114001_0
Workunit 7313724
Created 16 Jan 2011, 14:57:03 UTC
Sent 19 Jan 2011, 17:29:24 UTC
Report deadline 21 Apr 2011, 0:56:35 UTC
Received 23 Feb 2011, 0:21:00 UTC
Server state Over
Outcome Computation error
Client state Compute error
Exit status 22 (0x00000016) Unknown error code
Computer ID 981127
Run time 10 days 8 hours 39 min 19 sec
CPU time 9 days 15 hours 11 min 47 sec
Validate state Invalid
Credit 4,694.09
Device peak FLOPS 2.61 GFLOPS
Application version UK Met Office FAMOUS v6.11
windows_intelx86
Stderr
<core_client_version>6.6.38</core_client_version>
<![CDATA[
<message>
The device does not recognize the command. (0x16) - exit code 22 (0x16)
</message>
<stderr_txt>
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=2796, iMonCtr=1
Model crash detected, will try to restart...
CPDN Monitor - Quit request from BOINC...

BUFFIN: Read Failed: No such file or directory
BUFFIN: C I/O Error feof - Unit 60 - Return code = 16

BUFFIN: Read Failed: No such file or directory
BUFFIN: C I/O Error feof - Unit 61 - Return code = 16

BUFFIN: Read Failed: No such file or directory
BUFFIN: C I/O Error feof - Unit 68 - Return code = 16

BUFFIN: Read Failed: No such file or directory
BUFFIN: C I/O Error feof - Unit 69 - Return code = 16

BUFFIN: Read Failed: No such file or directory
BUFFIN: C I/O Error feof - Unit 60 - Return code = 16

BUFFIN: Read Failed: No such file or directory
BUFFIN: C I/O Error feof - Unit 61 - Return code = 16

BUFFIN: Read Failed: No such file or directory
BUFFIN: C I/O Error feof - Unit 68 - Return code = 16

BUFFIN: Read Failed: No such file or directory
BUFFIN: C I/O Error feof - Unit 69 - Return code = 16
CPDN Monitor - Quit request from BOINC...

BUFFIN: Read Failed: No such file or directory
BUFFIN: C I/O Error feof - Unit 60 - Return code = 16

BUFFIN: Read Failed: No such file or directory
BUFFIN: C I/O Error feof - Unit 61 - Return code = 16

BUFFIN: Read Failed: No such file or directory
BUFFIN: C I/O Error feof - Unit 68 - Return code = 16

BUFFIN: Read Failed: No such file or directory
BUFFIN: C I/O Error feof - Unit 69 - Return code = 16
CPDN Monitor - Quit request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5276, iMonCtr=1
Model crash detected, will try to restart...
17:12:54 (5420): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3576, iMonCtr=1
Model crash detected, will try to restart...
08:39:02 (5476): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
CPDN Monitor - Quit request from BOINC...

BUFFIN: Read Failed: No such file or directory
BUFFIN: C I/O Error feof - Unit 60 - Return code = 16

BUFFIN: Read Failed: No such file or directory
BUFFIN: C I/O Error feof - Unit 61 - Return code = 16

BUFFIN: Read Failed: No such file or directory
BUFFIN: C I/O Error feof - Unit 68 - Return code = 16

BUFFIN: Read Failed: No such file or directory
BUFFIN: C I/O Error feof - Unit 69 - Return code = 16
18:23:02 (5768): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
05:37:37 (4744): No heartbeat from core client for 30 sec - exiting
05:37:39 (4744): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
05:40:13 (5444): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
08:48:04 (4944): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
12:51:02 (3088): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
CPDN Monitor - Quit request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4944, iMonCtr=1
Model crash detected, will try to restart...
12:50:21 (4796): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
CPDN Monitor - Quit request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5584, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5228, iMonCtr=1
Model crash detected, will try to restart...
CPDN Monitor - Quit request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5132, iMonCtr=1
Model crash detected, will try to restart...
CPDN Monitor - Quit request from BOINC...

Model crashed: READHIST: End of file in READ from history file for namelist NLCFILES                                                                                                                                                                                           tmp/pipe_dummy                                                                  

Model crashed: READHIST: End of file in READ from history file for namelist NLCFILES                                                                                                                                                                                           tmp/pipe_dummy                                                                  

Model crashed: READHIST: End of file in READ from history file for namelist NLCFILES                                                                                                                                                                                           tmp/pipe_dummy                                                                  

Model crashed: READHIST: End of file in READ from history file for namelist NLCFILES                                                                                                                                                                                           tmp/pipe_dummy                                                                  

Model crashed: READHIST: End of file in READ from history file for namelist NLCFILES                                                                                                                                                                                           tmp/pipe_dummy                                                                  

Model crashed: READHIST: End of file in READ from history file for namelist NLCFILES                                                                                                                                                                                           tmp/pipe_dummy                                                                  
Sorry, too many model crashes! :-(
10:11:10 (5044): called boinc_finish

</stderr_txt>
]]>
Latest Trickles Received
Time Sent (UTC) Host ID Result ID Result Name Timestep CPU Time (sec) Average (sec/TS)
21 Feb 2011 18:52:41 981127 12489811 famous_wuh6_1599_200_007114001_0 1,422,746 828,200 0.5821
21 Feb 2011 18:52:32 981127 12489811 famous_wuh6_1599_200_007114001_0 1,413,386 822,415 0.5819
21 Feb 2011 18:52:32 981127 12489811 famous_wuh6_1599_200_007114001_0 1,404,026 817,014 0.5819
21 Feb 2011 18:52:32 981127 12489811 famous_wuh6_1599_200_007114001_0 1,394,666 811,668 0.5820
21 Feb 2011 18:52:32 981127 12489811 famous_wuh6_1599_200_007114001_0 1,385,306 806,262 0.5820
20 Feb 2011 23:55:16 981127 12489811 famous_wuh6_1599_200_007114001_0 1,375,946 800,907 0.5821
20 Feb 2011 22:22:32 981127 12489811 famous_wuh6_1599_200_007114001_0 1,366,586 795,546 0.5821
20 Feb 2011 20:46:16 981127 12489811 famous_wuh6_1599_200_007114001_0 1,357,226 790,149 0.5822
20 Feb 2011 19:31:25 981127 12489811 famous_wuh6_1599_200_007114001_0 1,347,866 784,805 0.5823
20 Feb 2011 19:27:58 981127 12489811 famous_wuh6_1599_200_007114001_0 1,338,506 779,363 0.5823
20 Feb 2011 04:06:58 981127 12489811 famous_wuh6_1599_200_007114001_0 1,329,146 773,805 0.5822
20 Feb 2011 02:34:04 981127 12489811 famous_wuh6_1599_200_007114001_0 1,319,786 768,386 0.5822
20 Feb 2011 01:01:20 981127 12489811 famous_wuh6_1599_200_007114001_0 1,310,426 763,065 0.5823
19 Feb 2011 23:32:22 981127 12489811 famous_wuh6_1599_200_007114001_0 1,301,066 757,758 0.5824
19 Feb 2011 22:04:57 981127 12489811 famous_wuh6_1599_200_007114001_0 1,291,706 752,445 0.5825
19 Feb 2011 20:30:58 981127 12489811 famous_wuh6_1599_200_007114001_0 1,282,346 747,023 0.5825
19 Feb 2011 20:13:23 981127 12489811 famous_wuh6_1599_200_007114001_0 1,272,986 741,584 0.5826
19 Feb 2011 20:10:01 981127 12489811 famous_wuh6_1599_200_007114001_0 1,263,626 735,994 0.5824
19 Feb 2011 20:10:01 981127 12489811 famous_wuh6_1599_200_007114001_0 1,254,266 730,625 0.5825
19 Feb 2011 20:10:01 981127 12489811 famous_wuh6_1599_200_007114001_0 1,244,906 725,277 0.5826


©2024 cpdn.org