Name | famous_unsa_1599_200_006663669_0 |
Workunit | 6867041 |
Created | 10 Jun 2010, 15:30:13 UTC |
Sent | 3 Jul 2010, 23:38:37 UTC |
Report deadline | 3 Oct 2010, 7:05:48 UTC |
Received | 4 Jul 2010, 23:40:11 UTC |
Server state | Over |
Outcome | Computation error |
Client state | Compute error |
Exit status | 3 (0x00000003) Unknown error code |
Computer ID | 1134514 |
Run time | 6 hours 13 min 45 sec |
CPU time | 4 hours 53 min 35 sec |
Validate state | Invalid |
Credit | 123.61 |
Device peak FLOPS | 2.67 GFLOPS |
Application version | UK Met Office FAMOUS v6.11 windows_intelx86 |
Stderr | <core_client_version>6.10.18</core_client_version> <![CDATA[ <message> The system cannot find the path specified. (0x3) - exit code 3 (0x3) </message> <stderr_txt> 17:46:25 (3892): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 17:47:31 (4740): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 17:49:18 (4980): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 17:51:20 (1216): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 17:53:53 (2080): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 17:55:27 (4140): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 17:56:56 (4900): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 18:00:15 (2568): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 18:02:40 (4040): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 18:06:35 (4504): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 18:08:21 (3080): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 18:20:34 (4784): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 19:29:07 (4620): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 19:31:34 (1720): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 19:33:13 (4204): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 19:52:03 (3536): No heartbeat from core client for 30 sec - exiting 19:52:04 (3536): No heartbeat from core client for 30 sec - exiting 19:52:05 (3536): No heartbeat from core client for 30 sec - exiting 19:52:06 (3536): No heartbeat from core client for 30 sec - exiting 19:52:07 (3536): No heartbeat from core client for 30 sec - exiting 19:52:08 (3536): No heartbeat from core client for 30 sec - exiting 19:52:09 (3536): No heartbeat from core client for 30 sec - exiting 19:52:10 (3536): No heartbeat from core client for 30 sec - exiting 19:52:11 (3536): No heartbeat from core client for 30 sec - exiting 19:52:12 (3536): No heartbeat from core client for 30 sec - exiting 19:52:14 (3536): No heartbeat from core client for 30 sec - exiting 19:52:15 (3536): No heartbeat from core client for 30 sec - exiting 19:52:16 (3536): No heartbeat from core client for 30 sec - exiting 19:52:17 (3536): No heartbeat from core client for 30 sec - exiting 19:52:18 (3536): No heartbeat from core client for 30 sec - exiting 19:52:19 (3536): No heartbeat from core client for 30 sec - exiting 19:52:20 (3536): No heartbeat from core client for 30 sec - exiting 19:52:21 (3536): No heartbeat from core client for 30 sec - exiting 19:52:22 (3536): No heartbeat from core client for 30 sec - exiting 19:52:23 (3536): No heartbeat from core client for 30 sec - exiting 19:52:24 (3536): No heartbeat from core client for 30 sec - exiting 19:52:26 (3536): No heartbeat from core client for 30 sec - exiting 19:52:27 (3536): No heartbeat from core client for 30 sec - exiting 19:52:28 (3536): No heartbeat from core client for 30 sec - exiting 19:52:29 (3536): No heartbeat from core client for 30 sec - exiting 19:52:30 (3536): No heartbeat from core client for 30 sec - exiting 19:52:31 (3536): No heartbeat from core client for 30 sec - exiting 19:52:32 (3536): No heartbeat from core client for 30 sec - exiting 19:52:33 (3536): No heartbeat from core client for 30 sec - exiting 19:52:34 (3536): No heartbeat from core client for 30 sec - exiting 19:52:35 (3536): No heartbeat from core client for 30 sec - exiting 19:52:36 (3536): No heartbeat from core client for 30 sec - exiting 19:52:38 (3536): No heartbeat from core client for 30 sec - exiting 19:52:39 (3536): No heartbeat from core client for 30 sec - exiting 19:52:40 (3536): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 19:52:41 (3536): No heartbeat from core client for 30 sec - exiting Controller:: CPDN process is not running, exiting, bRetVal = 0, checkPID=0, selfPID=0, iMonCtr=0 Model crash detected, will try to restart... Controller:: CPDN process is not running, exiting, bRetVal = 0, checkPID=0, selfPID=0, iMonCtr=0 Model crash detected, will try to restart... Controller:: CPDN process is not running, exiting, bRetVal = 0, checkPID=0, selfPID=0, iMonCtr=0 Model crash detected, will try to restart... Controller:: CPDN process is not running, exiting, bRetVal = 0, checkPID=0, selfPID=0, iMonCtr=0 Model crash detected, will try to restart... Controller:: CPDN process is not running, exiting, bRetVal = 0, checkPID=0, selfPID=0, iMonCtr=0 Model crash detected, will try to restart... Controller:: CPDN process is not running, exiting, bRetVal = 0, checkPID=0, selfPID=0, iMonCtr=0 Model crash detected, will try to restart... 19:53:53 (4724): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 20:16:05 (4500): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 20:17:28 (3180): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 20:21:12 (3440): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 20:58:29 (3740): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 21:06:28 (4324): No heartbeat from core client for 30 sec - exiting 21:06:30 (4324): No heartbeat from core client for 30 sec - exiting 21:06:31 (4324): No heartbeat from core client for 30 sec - exiting 21:06:32 (4324): No heartbeat from core client for 30 sec - exiting 21:06:33 (4324): No heartbeat from core client for 30 sec - exiting 21:06:34 (4324): No heartbeat from core client for 30 sec - exiting 21:06:35 (4324): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... BUFFIN: Read Failed: No such file or directory BUFFIN: C I/O Error feof - Unit 60 - Return code = 16 BUFFIN: Read Failed: No such file or directory BUFFIN: C I/O Error feof - Unit 61 - Return code = 16 BUFFIN: Read Failed: No such file or directory BUFFIN: C I/O Error feof - Unit 68 - Return code = 16 BUFFIN: Read Failed: No such file or directory BUFFIN: C I/O Error feof - Unit 69 - Return code = 16 21:07:56 (4236): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 21:07:57 (4236): No heartbeat from core client for 30 sec - exiting 21:07:58 (4236): No heartbeat from core client for 30 sec - exiting 21:07:59 (4236): No heartbeat from core client for 30 sec - exiting 21:08:01 (4236): No heartbeat from core client for 30 sec - exiting 21:08:02 (4236): No heartbeat from core client for 30 sec - exiting 21:08:03 (4236): No heartbeat from core client for 30 sec - exiting 21:08:04 (4236): No heartbeat from core client for 30 sec - exiting 21:08:05 (4236): No heartbeat from core client for 30 sec - exiting 21:08:06 (4236): No heartbeat from core client for 30 sec - exiting 21:08:07 (4236): No heartbeat from core client for 30 sec - exiting BUFFIN: Read Failed: Result too large BUFFIN: C I/O Error feof - Unit 60 - Return code = 16 BUFFIN: Read Failed: Result too large BUFFIN: C I/O Error feof - Unit 61 - Return code = 16 BUFFIN: Read Failed: Result too large BUFFIN: C I/O Error feof - Unit 68 - Return code = 16 BUFFIN: Read Failed: Result too large BUFFIN: C I/O Error feof - Unit 69 - Return code = 16 21:09:29 (3204): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 21:09:30 (3204): No heartbeat from core client for 30 sec - exiting 21:09:31 (3204): No heartbeat from core client for 30 sec - exiting 21:09:32 (3204): No heartbeat from core client for 30 sec - exiting 21:09:33 (3204): No heartbeat from core client for 30 sec - exiting 21:09:34 (3204): No heartbeat from core client for 30 sec - exiting 21:09:35 (3204): No heartbeat from core client for 30 sec - exiting 21:09:37 (3204): No heartbeat from core client for 30 sec - exiting 21:09:38 (3204): No heartbeat from core client for 30 sec - exiting 21:09:39 (3204): No heartbeat from core client for 30 sec - exiting 21:09:40 (3204): No heartbeat from core client for 30 sec - exiting BUFFIN: Read Failed: Result too large BUFFIN: C I/O Error feof - Unit 60 - Return code = 16 BUFFIN: Read Failed: Result too large BUFFIN: C I/O Error feof - Unit 61 - Return code = 16 BUFFIN: Read Failed: Result too large BUFFIN: C I/O Error feof - Unit 68 - Return code = 16 BUFFIN: Read Failed: Result too large BUFFIN: C I/O Error feof - Unit 69 - Return code = 16 21:13:39 (4308): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 21:13:40 (4308): No heartbeat from core client for 30 sec - exiting 21:13:41 (4308): No heartbeat from core client for 30 sec - exiting 21:13:42 (4308): No heartbeat from core client for 30 sec - exiting 21:13:43 (4308): No heartbeat from core client for 30 sec - exiting 21:13:44 (4308): No heartbeat from core client for 30 sec - exiting 21:13:46 (4308): No heartbeat from core client for 30 sec - exiting 21:13:47 (4308): No heartbeat from core client for 30 sec - exiting 21:13:48 (4308): No heartbeat from core client for 30 sec - exiting 21:13:49 (4308): No heartbeat from core client for 30 sec - exiting 21:13:50 (4308): No heartbeat from core client for 30 sec - exiting 21:31:02 (2612): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 21:35:59 (5084): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 21:40:33 (2184): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 21:51:20 (2904): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 21:51:21 (2904): No heartbeat from core client for 30 sec - exiting 21:51:22 (2904): No heartbeat from core client for 30 sec - exiting 21:51:23 (2904): No heartbeat from core client for 30 sec - exiting 21:51:25 (2904): No heartbeat from core client for 30 sec - exiting 21:51:26 (2904): No heartbeat from core client for 30 sec - exiting 21:51:27 (2904): No heartbeat from core client for 30 sec - exiting 21:51:28 (2904): No heartbeat from core client for 30 sec - exiting 21:51:29 (2904): No heartbeat from core client for 30 sec - exiting 21:51:30 (2904): No heartbeat from core client for 30 sec - exiting 21:51:31 (2904): No heartbeat from core client for 30 sec - exiting 22:42:20 (392): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 00:11:08 (4992): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 00:11:09 (4992): No heartbeat from core client for 30 sec - exiting Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=3112, selfPID=3112, iMonCtr=1 </stderr_txt> ]]> |
Latest Trickles Received | ||||||
---|---|---|---|---|---|---|
Time Sent (UTC) | Host ID | Result ID | Result Name | Timestep | CPU Time (sec) | Average (sec/TS) |
04 Jul 2010 23:43:03 | 1048971 | 11569203 | famous_unsa_1599_200_006663669_0 | 37,466 | 14,411 | 0.3846 |
04 Jul 2010 23:43:03 | 1048971 | 11569203 | famous_unsa_1599_200_006663669_0 | 28,106 | 11,153 | 0.3968 |
04 Jul 2010 23:43:03 | 1048971 | 11569203 | famous_unsa_1599_200_006663669_0 | 18,746 | 7,945 | 0.4238 |
04 Jul 2010 23:43:03 | 1048971 | 11569203 | famous_unsa_1599_200_006663669_0 | 9,386 | 4,398 | 0.4686 |
©2024 cpdn.org