Name | famous_v2wp_1599_200_006689535_5 |
Workunit | 6892788 |
Created | 4 Sep 2010, 23:31:54 UTC |
Sent | 12 Sep 2010, 5:20:13 UTC |
Report deadline | 12 Dec 2010, 12:47:24 UTC |
Received | 17 Nov 2010, 0:59:10 UTC |
Server state | Over |
Outcome | Computation error |
Client state | Compute error |
Exit status | -226 (0xFFFFFF1E) ERR_TOO_MANY_EXITS |
Computer ID | 1045655 |
Run time | 5 days 4 hours 23 min 58 sec |
CPU time | 4 days 11 hours 52 min 5 sec |
Validate state | Invalid |
Credit | 2,995.60 |
Device peak FLOPS | 2.41 GFLOPS |
Application version | UK Met Office FAMOUS v6.11 windows_intelx86 |
Stderr | <core_client_version>6.10.18</core_client_version> <![CDATA[ <message> too many exit(0)s </message> <stderr_txt> CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... BUFFIN: Read Failed: No such file or directory BUFFIN: C I/O Error feof - Unit 60 - Return code = 16 BUFFIN: Read Failed: No such file or directory BUFFIN: C I/O Error feof - Unit 61 - Return code = 16 BUFFIN: Read Failed: No such file or directory BUFFIN: C I/O Error feof - Unit 68 - Return code = 16 BUFFIN: Read Failed: No such file or directory BUFFIN: C I/O Error feof - Unit 69 - Return code = 16 CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... BUFFIN: Read Failed: No such file or directory BUFFIN: C I/O Error feof - Unit 60 - Return code = 16 BUFFIN: Read Failed: No such file or directory BUFFIN: C I/O Error feof - Unit 61 - Return code = 16 BUFFIN: Read Failed: No such file or directory BUFFIN: C I/O Error feof - Unit 68 - Return code = 16 BUFFIN: Read Failed: No such file or directory BUFFIN: C I/O Error feof - Unit 69 - Return code = 16 CPDN Monitor - Quit request from BOINC... BUFFIN: Read Failed: No such file or directory BUFFIN: C I/O Error feof - Unit 60 - Return code = 16 BUFFIN: Read Failed: No such file or directory BUFFIN: C I/O Error feof - Unit 61 - Return code = 16 BUFFIN: Read Failed: No such file or directory BUFFIN: C I/O Error feof - Unit 68 - Return code = 16 BUFFIN: Read Failed: No such file or directory BUFFIN: C I/O Error feof - Unit 69 - Return code = 16 CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... BUFFIN: Read Failed: No such file or directory BUFFIN: C I/O Error feof - Unit 60 - Return code = 16 BUFFIN: Read Failed: No such file or directory BUFFIN: C I/O Error feof - Unit 61 - Return code = 16 BUFFIN: Read Failed: No such file or directory BUFFIN: C I/O Error feof - Unit 68 - Return code = 16 BUFFIN: Read Failed: No such file or directory BUFFIN: C I/O Error feof - Unit 69 - Return code = 16 16:50:24 (3808): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 16:50:25 (3808): No heartbeat from core client for 30 sec - exiting 16:50:26 (3808): No heartbeat from core client for 30 sec - exiting 16:50:27 (3808): No heartbeat from core client for 30 sec - exiting 16:50:28 (3808): No heartbeat from core client for 30 sec - exiting 16:50:29 (3808): No heartbeat from core client for 30 sec - exiting 16:50:30 (3808): No heartbeat from core client for 30 sec - exiting 16:50:31 (3808): No heartbeat from core client for 30 sec - exiting 16:50:32 (3808): No heartbeat from core client for 30 sec - exiting 16:50:33 (3808): No heartbeat from core client for 30 sec - exiting 16:50:34 (3808): No heartbeat from core client for 30 sec - exiting 16:50:35 (3808): No heartbeat from core client for 30 sec - exiting 16:50:36 (3808): No heartbeat from core client for 30 sec - exiting 16:50:37 (3808): No heartbeat from core client for 30 sec - exiting 16:50:38 (3808): No heartbeat from core client for 30 sec - exiting 16:50:39 (3808): No heartbeat from core client for 30 sec - exiting 16:50:40 (3808): No heartbeat from core client for 30 sec - exiting 16:50:41 (3808): No heartbeat from core client for 30 sec - exiting 16:50:42 (3808): No heartbeat from core client for 30 sec - exiting 16:50:43 (3808): No heartbeat from core client for 30 sec - exiting 16:50:44 (3808): No heartbeat from core client for 30 sec - exiting 16:50:45 (3808): No heartbeat from core client for 30 sec - exiting 16:50:46 (3808): No heartbeat from core client for 30 sec - exiting 16:50:47 (3808): No heartbeat from core client for 30 sec - exiting 16:50:48 (3808): No heartbeat from core client for 30 sec - exiting 16:50:49 (3808): No heartbeat from core client for 30 sec - exiting 16:50:50 (3808): No heartbeat from core client for 30 sec - exiting 16:50:51 (3808): No heartbeat from core client for 30 sec - exiting 16:50:52 (3808): No heartbeat from core client for 30 sec - exiting BUFFIN: Read Failed: Result too large BUFFIN: C I/O Error feof - Unit 60 - Return code = 16 BUFFIN: Read Failed: Result too large BUFFIN: C I/O Error feof - Unit 61 - Return code = 16 BUFFIN: Read Failed: Result too large BUFFIN: C I/O Error feof - Unit 68 - Return code = 16 BUFFIN: Read Failed: Result too large BUFFIN: C I/O Error feof - Unit 69 - Return code = 16 CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... 22:01:45 (6016): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5056, iMonCtr=1 Model crash detected, will try to restart... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5056, iMonCtr=1 Model crash detected, will try to restart... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5056, iMonCtr=1 Model crash detected, will try to restart... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... 13:53:47 (3276): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 13:53:48 (3276): No heartbeat from core client for 30 sec - exiting 13:53:49 (3276): No heartbeat from core client for 30 sec - exiting 13:53:50 (3276): No heartbeat from core client for 30 sec - exiting 13:53:51 (3276): No heartbeat from core client for 30 sec - exiting 13:53:52 (3276): No heartbeat from core client for 30 sec - exiting 13:53:53 (3276): No heartbeat from core client for 30 sec - exiting 13:53:55 (3276): No heartbeat from core client for 30 sec - exiting 13:53:56 (3276): No heartbeat from core client for 30 sec - exiting 13:53:57 (3276): No heartbeat from core client for 30 sec - exiting CPDN Monitor - Quit request from BOINC... </stderr_txt> ]]> |
Latest Trickles Received | ||||||
---|---|---|---|---|---|---|
Time Sent (UTC) | Host ID | Result ID | Result Name | Timestep | CPU Time (sec) | Average (sec/TS) |
16 Nov 2010 23:10:53 | 1045655 | 11867592 | famous_v2wp_1599_200_006689535_5 | 907,946 | 384,773 | 0.4238 |
15 Nov 2010 10:11:53 | 1045655 | 11867592 | famous_v2wp_1599_200_006689535_5 | 898,586 | 380,670 | 0.4236 |
07 Nov 2010 22:22:43 | 1045655 | 11867592 | famous_v2wp_1599_200_006689535_5 | 889,226 | 375,928 | 0.4228 |
03 Nov 2010 23:01:42 | 1045655 | 11867592 | famous_v2wp_1599_200_006689535_5 | 879,866 | 371,855 | 0.4226 |
03 Nov 2010 21:36:00 | 1045655 | 11867592 | famous_v2wp_1599_200_006689535_5 | 870,506 | 367,787 | 0.4225 |
03 Nov 2010 17:49:10 | 1045655 | 11867592 | famous_v2wp_1599_200_006689535_5 | 861,146 | 363,811 | 0.4225 |
02 Nov 2010 00:15:27 | 1045655 | 11867592 | famous_v2wp_1599_200_006689535_5 | 851,786 | 359,788 | 0.4224 |
18 Oct 2010 20:20:23 | 1045655 | 11867592 | famous_v2wp_1599_200_006689535_5 | 842,426 | 355,603 | 0.4221 |
17 Oct 2010 20:14:33 | 1045655 | 11867592 | famous_v2wp_1599_200_006689535_5 | 833,066 | 351,677 | 0.4221 |
16 Oct 2010 04:36:48 | 1045655 | 11867592 | famous_v2wp_1599_200_006689535_5 | 823,706 | 347,358 | 0.4217 |
15 Oct 2010 22:33:01 | 1045655 | 11867592 | famous_v2wp_1599_200_006689535_5 | 814,346 | 343,260 | 0.4215 |
15 Oct 2010 20:27:55 | 1045655 | 11867592 | famous_v2wp_1599_200_006689535_5 | 804,986 | 338,804 | 0.4209 |
15 Oct 2010 19:16:12 | 1045655 | 11867592 | famous_v2wp_1599_200_006689535_5 | 795,626 | 334,837 | 0.4208 |
15 Oct 2010 17:54:10 | 1045655 | 11867592 | famous_v2wp_1599_200_006689535_5 | 786,266 | 330,950 | 0.4209 |
15 Oct 2010 03:41:27 | 1045655 | 11867592 | famous_v2wp_1599_200_006689535_5 | 776,906 | 327,157 | 0.4211 |
14 Oct 2010 23:29:50 | 1045655 | 11867592 | famous_v2wp_1599_200_006689535_5 | 767,546 | 323,354 | 0.4213 |
14 Oct 2010 22:13:28 | 1045655 | 11867592 | famous_v2wp_1599_200_006689535_5 | 758,186 | 319,558 | 0.4215 |
14 Oct 2010 20:57:01 | 1045655 | 11867592 | famous_v2wp_1599_200_006689535_5 | 748,826 | 315,781 | 0.4217 |
14 Oct 2010 19:45:35 | 1045655 | 11867592 | famous_v2wp_1599_200_006689535_5 | 739,466 | 311,996 | 0.4219 |
14 Oct 2010 18:38:56 | 1045655 | 11867592 | famous_v2wp_1599_200_006689535_5 | 730,106 | 308,216 | 0.4222 |
©2024 cpdn.org