Name | famous_v87o_999_200_006696410_0 |
Workunit | 6899663 |
Created | 26 Aug 2010, 16:26:32 UTC |
Sent | 9 Dec 2010, 22:45:17 UTC |
Report deadline | 11 Mar 2011, 6:12:28 UTC |
Received | 26 Mar 2011, 11:03:44 UTC |
Server state | Over |
Outcome | Computation error |
Client state | Compute error |
Exit status | 25 (0x00000019) Unknown error code |
Computer ID | 1110633 |
Run time | 20 days 20 hours 9 min 17 sec |
CPU time | 17 days 10 hours 9 min 25 sec |
Validate state | Invalid |
Credit | 5,898.48 |
Device peak FLOPS | 1.43 GFLOPS |
Application version | UK Met Office FAMOUS v6.11 windows_intelx86 |
Stderr | <core_client_version>6.10.58</core_client_version> <![CDATA[ <message> The drive cannot locate a specific area or track on the disk. (0x19) - exit code 25 (0x19) </message> <stderr_txt> Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... 14:08:05 (5084): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5228, iMonCtr=1 Model crash detected, will try to restart... BUFFIN: Read Failed: No such file or directory BUFFIN: C I/O Error feof - Unit 60 - Return code = 16 BUFFIN: Read Failed: No such file or directory BUFFIN: C I/O Error feof - Unit 61 - Return code = 16 BUFFIN: Read Failed: No such file or directory BUFFIN: C I/O Error feof - Unit 68 - Return code = 16 BUFFIN: Read Failed: No such file or directory BUFFIN: C I/O Error feof - Unit 69 - Return code = 16 Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... 02:44:04 (5272): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 02:44:05 (5272): No heartbeat from core client for 30 sec - exiting 02:44:06 (5272): No heartbeat from core client for 30 sec - exiting 02:44:07 (5272): No heartbeat from core client for 30 sec - exiting 02:44:08 (5272): No heartbeat from core client for 30 sec - exiting 02:44:09 (5272): No heartbeat from core client for 30 sec - exiting 02:44:10 (5272): No heartbeat from core client for 30 sec - exiting Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4788, iMonCtr=1 Model crash detected, will try to restart... Suspended CPDN Monitor - Suspend request from BOINC... 23:28:40 (5216): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 19:36:02 (3164): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... 08:57:53 (5616): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... 08:23:05 (5844): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... 22:47:52 (5620): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... CPDN Monitor - Quit request from BOINC... 19:46:34 (3460): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... CPDN Monitor - Quit request from BOINC... BUFFIN: Read Failed: No such file or directory BUFFIN: C I/O Error feof - Unit 60 - Return code = 16 BUFFIN: Read Failed: No such file or directory BUFFIN: C I/O Error feof - Unit 61 - Return code = 16 BUFFIN: Read Failed: No such file or directory BUFFIN: C I/O Error feof - Unit 68 - Return code = 16 BUFFIN: Read Failed: No such file or directory BUFFIN: C I/O Error feof - Unit 69 - Return code = 16 Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... BUFFIN: Read Failed: No such file or directory BUFFIN: C I/O Error feof - Unit 60 - Return code = 16 BUFFIN: Read Failed: No such file or directory BUFFIN: C I/O Error feof - Unit 61 - Return code = 16 BUFFIN: Read Failed: No such file or directory BUFFIN: C I/O Error feof - Unit 68 - Return code = 16 BUFFIN: Read Failed: No such file or directory BUFFIN: C I/O Error feof - Unit 69 - Return code = 16 Suspended CPDN Monitor - Suspend request from BOINC... 13:23:25 (5204): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 13:23:26 (5204): No heartbeat from core client for 30 sec - exiting 13:23:27 (5204): No heartbeat from core client for 30 sec - exiting C13:17:18 (5820): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... BUFFIN: Read Failed: No such file or directory BUFFIN: C I/O Error feof - Unit 60 - Return code = 16 BUFFIN: Read Failed: No such file or directory BUFFIN: C I/O Error feof - Unit 61 - Return code = 16 BUFFIN: Read Failed: No such file or directory BUFFIN: C I/O Error feof - Unit 68 - Return code = 16 BUFFIN: Read Failed: No such file or directory BUFFIN: C I/O Error feof - Unit 69 - Return code = 16 Suspended CPDN Monitor - Suspend request from BOINC... BUFFIN: Read Failed: Result too large BUFFIN: C I/O Error feof - Unit 60 - Return code = 16 BUFFIN: Read Failed: Result too large BUFFIN: C I/O Error feof - Unit 61 - Return code = 16 BUFFIN: Read Failed: Result too large BUFFIN: C I/O Error feof - Unit 68 - Return code = 16 BUFFIN: Read Failed: Result too large BUFFIN: C I/O Error feof - Unit 69 - Return code = 16 18:12:19 (5200): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5588, iMonCtr=1 Model crash detected, will try to restart... 18:59:55 (5188): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... BUFFIN: Read Failed: No such file or directory BUFFIN: C I/O Error feof - Unit 60 - Return code = 16 BUFFIN: Read Failed: No such file or directory BUFFIN: C I/O Error feof - Unit 61 - Return code = 16 BUFFIN: Read Failed: No such file or directory BUFFIN: C I/O Error feof - Unit 68 - Return code = 16 BUFFIN: Read Failed: No such file or directory BUFFIN: C I/O Error feof - Unit 69 - Return code = 16 Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5688, iMonCtr=1 Model crash detected, will try to restart... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5688, iMonCtr=1 Model crash detected, will try to restart... Suspended CPDN Monitor - Suspend request from BOINC... 08:23:24 (5348): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 08:23:25 (5348): No heartbeat from core client for 30 sec - exiting 08:23:26 (5348): No heartbeat from core client for 30 sec - exiting 08:23:27 (5348): No heartbeat from core client for 30 sec - exiting 08:23:28 (5348): No heartbeat from core client for 30 sec - exiting Suspended CPDN Monitor - Suspend request from BOINC... 22:05:55 (6028): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... BUFFIN: Read Failed: No such file or directory BUFFIN: C I/O Error feof - Unit 60 - Return code = 16 BUFFIN: Read Failed: No such file or directory BUFFIN: C I/O Error feof - Unit 61 - Return code = 16 BUFFIN: Read Failed: No such file or directory BUFFIN: C I/O Error feof - Unit 68 - Return code = 16 BUFFIN: Read Failed: No such file or directory BUFFIN: C I/O Error feof - Unit 69 - Return code = 16 Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4996, iMonCtr=1 Model crash detected, will try to restart... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=2976, iMonCtr=1 Model crash detected, will try to restart... Suspended CPDN Monitor - Suspend request from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5484, iMonCtr=1 Model crash detected, will try to restart... Suspended CPDN Monitor - Suspend request from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3344, iMonCtr=1 Model crash detected, will try to restart... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5572, iMonCtr=1 Model crash detected, will try to restart... Suspended CPDN Monitor - Suspend request from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5220, iMonCtr=1 Model crash detected, will try to restart... CPDN Monitor - Quit request from BOINC... 11:02:38 (5100): called boinc_finish </stderr_txt> ]]> |
Latest Trickles Received | ||||||
---|---|---|---|---|---|---|
Time Sent (UTC) | Host ID | Result ID | Result Name | Timestep | CPU Time (sec) | Average (sec/TS) |
25 Mar 2011 18:02:49 | 1110633 | 11741819 | famous_v87o_999_200_006696410_0 | 1,787,786 | 1,503,656 | 0.8411 |
25 Mar 2011 16:56:28 | 1110633 | 11741819 | famous_v87o_999_200_006696410_0 | 1,778,426 | 1,496,587 | 0.8415 |
25 Mar 2011 16:56:28 | 1110633 | 11741819 | famous_v87o_999_200_006696410_0 | 1,769,066 | 1,489,395 | 0.8419 |
25 Mar 2011 10:36:05 | 1110633 | 11741819 | famous_v87o_999_200_006696410_0 | 1,759,706 | 1,482,083 | 0.8422 |
25 Mar 2011 08:20:53 | 1110633 | 11741819 | famous_v87o_999_200_006696410_0 | 1,750,346 | 1,474,943 | 0.8427 |
25 Mar 2011 00:00:01 | 1110633 | 11741819 | famous_v87o_999_200_006696410_0 | 1,740,986 | 1,467,603 | 0.8430 |
23 Mar 2011 22:42:19 | 1110633 | 11741819 | famous_v87o_999_200_006696410_0 | 1,731,626 | 1,460,153 | 0.8432 |
23 Mar 2011 20:18:32 | 1110633 | 11741819 | famous_v87o_999_200_006696410_0 | 1,722,266 | 1,452,740 | 0.8435 |
23 Mar 2011 16:40:10 | 1110633 | 11741819 | famous_v87o_999_200_006696410_0 | 1,712,906 | 1,445,573 | 0.8439 |
23 Mar 2011 14:18:34 | 1110633 | 11741819 | famous_v87o_999_200_006696410_0 | 1,703,546 | 1,438,351 | 0.8443 |
23 Mar 2011 11:46:10 | 1110633 | 11741819 | famous_v87o_999_200_006696410_0 | 1,694,186 | 1,431,094 | 0.8447 |
23 Mar 2011 09:25:25 | 1110633 | 11741819 | famous_v87o_999_200_006696410_0 | 1,684,826 | 1,423,729 | 0.8450 |
22 Mar 2011 18:17:46 | 1110633 | 11741819 | famous_v87o_999_200_006696410_0 | 1,675,466 | 1,416,239 | 0.8453 |
22 Mar 2011 14:50:24 | 1110633 | 11741819 | famous_v87o_999_200_006696410_0 | 1,666,106 | 1,408,983 | 0.8457 |
22 Mar 2011 11:53:12 | 1110633 | 11741819 | famous_v87o_999_200_006696410_0 | 1,656,746 | 1,401,517 | 0.8459 |
22 Mar 2011 09:26:04 | 1110633 | 11741819 | famous_v87o_999_200_006696410_0 | 1,647,386 | 1,394,174 | 0.8463 |
21 Mar 2011 20:50:21 | 1110633 | 11741819 | famous_v87o_999_200_006696410_0 | 1,638,026 | 1,386,787 | 0.8466 |
21 Mar 2011 16:40:54 | 1110633 | 11741819 | famous_v87o_999_200_006696410_0 | 1,628,666 | 1,379,485 | 0.8470 |
21 Mar 2011 14:19:28 | 1110633 | 11741819 | famous_v87o_999_200_006696410_0 | 1,619,306 | 1,372,213 | 0.8474 |
21 Mar 2011 12:02:03 | 1110633 | 11741819 | famous_v87o_999_200_006696410_0 | 1,609,946 | 1,365,000 | 0.8479 |
©2024 cpdn.org