Name | famous_ukug_1599_200_006659859_2 |
Workunit | 6863231 |
Created | 10 Jun 2010, 14:56:32 UTC |
Sent | 13 Jul 2010, 9:13:51 UTC |
Report deadline | 12 Oct 2010, 16:41:02 UTC |
Received | 3 Aug 2010, 13:56:02 UTC |
Server state | Over |
Outcome | Computation error |
Client state | Compute error |
Exit status | 22 (0x00000016) Unknown error code |
Computer ID | 913817 |
Run time | 5 days 1 hours 43 min 30 sec |
CPU time | 4 days 11 hours 53 min 4 sec |
Validate state | Invalid |
Credit | 1,729.46 |
Device peak FLOPS | 1.83 GFLOPS |
Application version | UK Met Office FAMOUS v6.11 windows_intelx86 |
Stderr | <core_client_version>6.6.20</core_client_version> <![CDATA[ <message> The device does not recognize the command. (0x16) - exit code 22 (0x16) </message> <stderr_txt> CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... 17:44:11 (5284): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 17:44:13 (5284): No heartbeat from core client for 30 sec - exiting 20:39:58 (3608): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... CPDN Monitor - Quit request from BOINC... 16:03:44 (5276): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 16:03:46 (5276): No heartbeat from core client for 30 sec - exiting No Process Handle Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=5992, selfPID=5992, iMonCtr=1 BUFFIN: Read Failed: No such file or directory BUFFIN: C I/O Error feof - Unit 60 - Return code = 16 BUFFIN: Read Failed: No such file or directory BUFFIN: C I/O Error feof - Unit 61 - Return code = 16 CPDN Monitor - Quit request from BOINC... 12:44:45 (1932): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 12:44:46 (1932): No heartbeat from core client for 30 sec - exiting 13:48:48 (5756): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 13:48:49 (5756): No heartbeat from core client for 30 sec - exiting 13:48:50 (5756): No heartbeat from core client for 30 sec - exiting 14:10:15 (1196): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... BUFFIN: Read Failed: No such file or directory BUFFIN: C I/O Error feof - Unit 60 - Return code = 16 BUFFIN: Read Failed: No such file or directory BUFFIN: C I/O Error feof - Unit 61 - Return code = 16 BUFFIN: Read Failed: No such file or directory BUFFIN: C I/O Error feof - Unit 68 - Return code = 16 BUFFIN: Read Failed: No such file or directory BUFFIN: C I/O Error feof - Unit 69 - Return code = 16 17:04:50 (4144): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 17:04:52 (4144): No heartbeat from core client for 30 sec - exiting 17:04:53 (4144): No heartbeat from core client for 30 sec - exiting BUFFIN: Read Failed: No such file or directory BUFFIN: C I/O Error feof - Unit 60 - Return code = 16 BUFFIN: Read Failed: No such file or directory BUFFIN: C I/O Error feof - Unit 61 - Return code = 16 BUFFIN: Read Failed: No such file or directory BUFFIN: C I/O Error feof - Unit 68 - Return code = 16 BUFFIN: Read Failed: No such file or directory BUFFIN: C I/O Error feof - Unit 69 - Return code = 16 17:09:08 (956): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 17:09:09 (956): No heartbeat from core client for 30 sec - exiting 17:09:10 (956): No heartbeat from core client for 30 sec - exiting 17:09:11 (956): No heartbeat from core client for 30 sec - exiting 17:09:12 (956): No heartbeat from core client for 30 sec - exiting 17:09:13 (956): No heartbeat from core client for 30 sec - exiting 17:09:14 (956): No heartbeat from core client for 30 sec - exiting 17:09:15 (956): No heartbeat from core client for 30 sec - exiting BUFFIN: Read Failed: Result too large BUFFIN: C I/O Error feof - Unit 60 - Return code = 16 BUFFIN: Read Failed: Result too large BUFFIN: C I/O Error feof - Unit 61 - Return code = 16 BUFFIN: Read Failed: Result too large BUFFIN: C I/O Error feof - Unit 68 - Return code = 16 BUFFIN: Read Failed: Result too large BUFFIN: C I/O Error feof - Unit 69 - Return code = 16 17:12:25 (2536): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... BUFFIN: Read Failed: Result too large BUFFIN: C I/O Error feof - Unit 60 - Return code = 16 BUFFIN: Read Failed: Result too large BUFFIN: C I/O Error feof - Unit 61 - Return code = 16 BUFFIN: Read Failed: Result too large BUFFIN: C I/O Error feof - Unit 68 - Return code = 16 BUFFIN: Read Failed: Result too large BUFFIN: C I/O Error feof - Unit 69 - Return code = 16 CPDN Monitor - Quit request from BOINC... 17:14:09 (3420): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 17:14:10 (3420): No heartbeat from core client for 30 sec - exiting 17:14:11 (3420): No heartbeat from core client for 30 sec - exiting 17:38:49 (5912): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 17:38:50 (5912): No heartbeat from core client for 30 sec - exiting 17:38:51 (5912): No heartbeat from core client for 30 sec - exiting CPDN Monitor - Quit request from BOINC... 19:40:13 (556): No heartbeat from core client for 30 sec - exiting 19:40:15 (556): No heartbeat from core client for 30 sec - exiting 19:40:16 (556): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 19:40:17 (556): No heartbeat from core client for 30 sec - exiting 19:40:18 (556): No heartbeat from core client for 30 sec - exiting 19:47:07 (5288): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 19:47:10 (5288): No heartbeat from core client for 30 sec - exiting 19:47:11 (5288): No heartbeat from core client for 30 sec - exiting 19:47:12 (5288): No heartbeat from core client for 30 sec - exiting 19:47:13 (5288): No heartbeat from core client for 30 sec - exiting 19:47:14 (5288): No heartbeat from core client for 30 sec - exiting 19:47:15 (5288): No heartbeat from core client for 30 sec - exiting 19:47:16 (5288): No heartbeat from core client for 30 sec - exiting 20:36:07 (5340): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 20:36:09 (5340): No heartbeat from core client for 30 sec - exiting 20:36:10 (5340): No heartbeat from core client for 30 sec - exiting 20:36:11 (5340): No heartbeat from core client for 30 sec - exiting 20:36:12 (5340): No heartbeat from core client for 30 sec - exiting 20:36:13 (5340): No heartbeat from core client for 30 sec - exiting Model crashed: READHIST: End of file in READ from history file for namelist NLCFILES tmp/pipe_dummy Model crashed: READHIST: End of file in READ from history file for namelist NLCFILES tmp/pipe_dummy Model crashed: READHIST: End of file in READ from history file for namelist NLCFILES tmp/pipe_dummy Model crashed: READHIST: End of file in READ from history file for namelist NLCFILES tmp/pipe_dummy Model crashed: READHIST: End of file in READ from history file for namelist NLCFILES tmp/pipe_dummy Model crashed: READHIST: End of file in READ from history file for namelist NLCFILES tmp/pipe_dummy Sorry, too many model crashes! :-( 14:54:50 (5396): called boinc_finish </stderr_txt> ]]> |
Latest Trickles Received | ||||||
---|---|---|---|---|---|---|
Time Sent (UTC) | Host ID | Result ID | Result Name | Timestep | CPU Time (sec) | Average (sec/TS) |
02 Aug 2010 18:23:02 | 913817 | 11550147 | famous_ukug_1599_200_006659859_2 | 524,186 | 386,275 | 0.7369 |
29 Jul 2010 18:09:20 | 913817 | 11550147 | famous_ukug_1599_200_006659859_2 | 514,826 | 380,113 | 0.7383 |
29 Jul 2010 15:11:14 | 913817 | 11550147 | famous_ukug_1599_200_006659859_2 | 505,466 | 372,866 | 0.7377 |
29 Jul 2010 12:50:07 | 913817 | 11550147 | famous_ukug_1599_200_006659859_2 | 496,106 | 365,994 | 0.7377 |
29 Jul 2010 09:03:09 | 913817 | 11550147 | famous_ukug_1599_200_006659859_2 | 486,746 | 358,811 | 0.7372 |
29 Jul 2010 06:57:15 | 913817 | 11550147 | famous_ukug_1599_200_006659859_2 | 477,386 | 351,612 | 0.7365 |
29 Jul 2010 04:57:51 | 913817 | 11550147 | famous_ukug_1599_200_006659859_2 | 468,026 | 344,443 | 0.7359 |
29 Jul 2010 02:59:04 | 913817 | 11550147 | famous_ukug_1599_200_006659859_2 | 458,666 | 337,231 | 0.7352 |
29 Jul 2010 00:46:59 | 913817 | 11550147 | famous_ukug_1599_200_006659859_2 | 449,306 | 329,919 | 0.7343 |
28 Jul 2010 23:53:40 | 913817 | 11550147 | famous_ukug_1599_200_006659859_2 | 439,946 | 322,577 | 0.7332 |
28 Jul 2010 20:35:48 | 913817 | 11550147 | famous_ukug_1599_200_006659859_2 | 430,586 | 315,300 | 0.7323 |
28 Jul 2010 18:31:06 | 913817 | 11550147 | famous_ukug_1599_200_006659859_2 | 421,226 | 308,079 | 0.7314 |
28 Jul 2010 16:21:20 | 913817 | 11550147 | famous_ukug_1599_200_006659859_2 | 411,866 | 300,739 | 0.7302 |
28 Jul 2010 13:16:34 | 913817 | 11550147 | famous_ukug_1599_200_006659859_2 | 402,506 | 293,398 | 0.7289 |
28 Jul 2010 09:44:59 | 913817 | 11550147 | famous_ukug_1599_200_006659859_2 | 393,146 | 284,942 | 0.7248 |
28 Jul 2010 06:56:09 | 913817 | 11550147 | famous_ukug_1599_200_006659859_2 | 383,786 | 276,537 | 0.7205 |
28 Jul 2010 04:54:42 | 913817 | 11550147 | famous_ukug_1599_200_006659859_2 | 374,426 | 268,395 | 0.7168 |
28 Jul 2010 01:56:32 | 913817 | 11550147 | famous_ukug_1599_200_006659859_2 | 365,066 | 260,151 | 0.7126 |
28 Jul 2010 00:40:43 | 913817 | 11550147 | famous_ukug_1599_200_006659859_2 | 355,706 | 252,931 | 0.7111 |
27 Jul 2010 21:18:29 | 913817 | 11550147 | famous_ukug_1599_200_006659859_2 | 346,346 | 246,132 | 0.7107 |
©2024 cpdn.org