Name | famous_vh80_1599_200_006708086_1 |
Workunit | 6911339 |
Created | 26 Aug 2010, 16:59:49 UTC |
Sent | 20 Nov 2010, 23:57:18 UTC |
Report deadline | 20 Feb 2011, 7:24:29 UTC |
Received | 27 Apr 2011, 12:00:10 UTC |
Server state | Over |
Outcome | Computation error |
Client state | Compute error |
Exit status | -226 (0xFFFFFF1E) ERR_TOO_MANY_EXITS |
Computer ID | 1100336 |
Run time | |
CPU time | 14 days 1 hours 0 min 53 sec |
Validate state | Invalid |
Credit | 3,613.24 |
Device peak FLOPS | 1.42 GFLOPS |
Application version | UK Met Office FAMOUS v6.11 windows_intelx86 |
Stderr | <core_client_version>6.2.28</core_client_version> <![CDATA[ <message> too many exit(0)s </message> <stderr_txt> error: cannot delete old C:/ProgramData/BOINC/projects/climateprediction.net/famous_se_6.11_windows_intelx86.dll Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=11744, iMonCtr=1 Model crash detected, will try to restart... CPDN Monitor - Quit request from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5056, iMonCtr=1 Model crash detected, will try to restart... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5620, iMonCtr=1 Model crash detected, will try to restart... 23:35:50 (5628): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 23:35:51 (5628): No heartbeat from core client for 30 sec - exiting Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=13324, iMonCtr=1 Model crash detected, will try to restart... 03:58:06 (4620): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=12372, iMonCtr=1 Model crash detected, will try to restart... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5620, iMonCtr=1 Model crash detected, will try to restart... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5620, iMonCtr=1 Model crash detected, will try to restart... CCController:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5416, iMonCtr=1 Model crash detected, will try to restart... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4296, iMonCtr=1 Model crash detected, will try to restart... 07:44:38 (1816): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 
07:44:45 (1816): No heartbeat from core client for 30 sec - exiting CController:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4176, iMonCtr=1 Model crash detected, will try to restart... 08:10:13 (5328): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 08:10:14 (5328): No heartbeat from core client for 30 sec - exiting 08:10:15 (5328): No heartbeat from core client for 30 sec - exiting 08:10:16 (5328): No heartbeat from core client for 30 sec - exiting 08:10:17 (5328): No heartbeat from core client for 30 sec - exiting Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4384, iMonCtr=1 Model crash detected, will try to restart... 18:09:28 (5320): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 03:00:31 (44752): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5200, iMonCtr=1 Model crash detected, will try to restart... CPDN Monitor - Quit request from BOINC... 01:55:38 (5596): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 01:55:39 (5596): No heartbeat from core client for 30 sec - exiting Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=20492, iMonCtr=1 Model crash detected, will try to restart... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5392, iMonCtr=1 Model crash detected, will try to restart... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=6056, iMonCtr=1 Model crash detected, will try to restart... 19:13:26 (4640): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 
19:13:37 (4640): No heartbeat from core client for 30 sec - exiting 19:13:51 (10656): Can't acquire lockfile (32) - waiting 35s Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=10656, iMonCtr=1 Model crash detected, will try to restart... CPDN Monitor - Quit request from BOINC... BUFFIN: Read Failed: No such file or directory BUFFIN: C I/O Error feof - Unit 60 - Return code = 16 BUFFIN: Read Failed: No such file or directory BUFFIN: C I/O Error feof - Unit 61 - Return code = 16 BUFFIN: Read Failed: No such file or directory BUFFIN: C I/O Error feof - Unit 68 - Return code = 16 BUFFIN: Read Failed: No such file or directory BUFFIN: C I/O Error feof - Unit 69 - Return code = 16 12:45:30 (6020): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 12:45:39 (6020): No heartbeat from core client for 30 sec - exiting Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=44656, iMonCtr=1 Model crash detected, will try to restart... CPDN Monitor - Quit request from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=11088, iMonCtr=1 Model crash detected, will try to restart... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5576, iMonCtr=1 Model crash detected, will try to restart... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4720, iMonCtr=1 Model crash detected, will try to restart... 12:26:42 (4508): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 15:01:17 (14220): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 15:01:27 (14220): No heartbeat from core client for 30 sec - exiting CPDN Monitor - Quit request from BOINC... 
17:33:55 (5112): handle_file_upload_status: can't open boinc_ufs_cpdnout2.zip 17:33:55 (5112): handle_file_upload_status: can't open boinc_ufs_cpdnout3.zip 17:33:55 (5112): handle_file_upload_status: can't open boinc_ufs_cpdnout4.zip 17:33:55 (5112): handle_file_upload_status: can't open boinc_ufs_cpdnout5.zip 17:33:55 (5112): handle_file_upload_status: can't open boinc_ufs_cpdnout6.zip 17:33:55 (5112): handle_file_upload_status: can't open boinc_ufs_cpdnout7.zip 17:33:55 (5112): handle_file_upload_status: can't open boinc_ufs_cpdnout8.zip Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5112, iMonCtr=1 Model crash detected, will try to restart... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5140, iMonCtr=1 Model crash detected, will try to restart... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4364, iMonCtr=1 Model crash detected, will try to restart... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5080, iMonCtr=1 Model crash detected, will try to restart... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5036, iMonCtr=1 Model crash detected, will try to restart... 06:20:00 (5060): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 06:20:15 (5060): No heartbeat from core client for 30 sec - exiting 06:20:16 (5060): No heartbeat from core client for 30 sec - exiting 06:20:18 (5060): No heartbeat from core client for 30 sec - exiting 06:20:19 (5060): No heartbeat from core client for 30 sec - exiting Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4852, iMonCtr=1 Model crash detected, will try to restart... 17:02:02 (4792): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 
17:02:06 (4792): No heartbeat from core client for 30 sec - exiting 17:02:07 (4792): No heartbeat from core client for 30 sec - exiting 17:02:08 (4792): No heartbeat from core client for 30 sec - exiting CPDN Monitor - Quit request from BOINC... 23:46:43 (6668): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4184, iMonCtr=1 Model crash detected, will try to restart... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4296, iMonCtr=1 Model crash detected, will try to restart... CPDN Monitor - Quit request from BOINC... 14:09:51 (4704): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=35232, iMonCtr=1 Model crash detected, will try to restart... 05:42:22 (4048): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=6876, iMonCtr=1 Model crash detected, will try to restart... CPDN Monitor - Quit request from BOINC... 07:37:37 (8152): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... </stderr_txt> ]]> |
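The Exit status field pairs -226 with 0xFFFFFF1E; the two agree when the signed value is read as a 32-bit two's-complement integer. The sketch below shows that conversion and tallies a few recurring messages in a short excerpt of the Stderr field. The names `to_unsigned32` and `tally_messages` and the category labels are hypothetical, chosen for illustration; the message fragments are copied from the log above.

```python
from collections import Counter


def to_unsigned32(code: int) -> str:
    """Format a signed exit status as 32-bit two's-complement hex."""
    return f"0x{code & 0xFFFFFFFF:08X}"


# Fragments copied from the Stderr field; the labels on the left are
# hypothetical names used only in this sketch.
FRAGMENTS = {
    "model_restart": "Model crash detected, will try to restart",
    "no_heartbeat": "No heartbeat from core client",
    "quit_request": "Quit request from BOINC",
}


def tally_messages(stderr_text: str) -> Counter:
    """Count how often each known fragment occurs in the log text."""
    counts = Counter()
    for label, fragment in FRAGMENTS.items():
        counts[label] = stderr_text.count(fragment)
    return counts


# Short excerpt from the start of the log above (not the full Stderr field).
excerpt = (
    "Controller:: CPDN process is not running, exiting, bRetVal = 1, "
    "checkPID=0, selfPID=11744, iMonCtr=1 "
    "Model crash detected, will try to restart... "
    "CPDN Monitor - Quit request from BOINC... "
    "Controller:: CPDN process is not running, exiting, bRetVal = 1, "
    "checkPID=0, selfPID=5056, iMonCtr=1 "
    "Model crash detected, will try to restart..."
)

print(to_unsigned32(-226))  # 0xFFFFFF1E
print(dict(tally_messages(excerpt)))
```

Running `tally_messages` over the full Stderr text instead of the excerpt would give per-category totals for the whole task.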
Latest Trickles Received

Time Sent (UTC) | Host ID | Result ID | Result Name | Timestep | CPU Time (sec) | Average (sec/TS) |
---|---|---|---|---|---|---|
27 Apr 2011 01:50:06 | 1100336 | 11800200 | famous_vh80_1599_200_006708086_1 | 1,095,146 | 1,203,422 | 1.0989 |
26 Apr 2011 10:55:48 | 1100336 | 11800200 | famous_vh80_1599_200_006708086_1 | 1,085,786 | 1,192,831 | 1.0986 |
25 Apr 2011 15:03:42 | 1100336 | 11800200 | famous_vh80_1599_200_006708086_1 | 1,076,426 | 1,182,571 | 1.0986 |
25 Apr 2011 11:21:38 | 1100336 | 11800200 | famous_vh80_1599_200_006708086_1 | 1,067,066 | 1,172,391 | 1.0987 |
25 Apr 2011 01:34:05 | 1100336 | 11800200 | famous_vh80_1599_200_006708086_1 | 1,057,706 | 1,161,363 | 1.0980 |
24 Apr 2011 19:19:25 | 1100336 | 11800200 | famous_vh80_1599_200_006708086_1 | 1,048,346 | 1,149,551 | 1.0965 |
24 Apr 2011 06:31:41 | 1100336 | 11800200 | famous_vh80_1599_200_006708086_1 | 1,038,986 | 1,138,726 | 1.0960 |
24 Apr 2011 02:28:39 | 1100336 | 11800200 | famous_vh80_1599_200_006708086_1 | 1,029,626 | 1,127,250 | 1.0948 |
23 Apr 2011 06:11:16 | 1100336 | 11800200 | famous_vh80_1599_200_006708086_1 | 1,020,266 | 1,116,895 | 1.0947 |
22 Apr 2011 05:24:08 | 1100336 | 11800200 | famous_vh80_1599_200_006708086_1 | 1,010,906 | 1,106,246 | 1.0943 |
22 Apr 2011 00:17:46 | 1100336 | 11800200 | famous_vh80_1599_200_006708086_1 | 1,001,546 | 1,095,610 | 1.0939 |
21 Apr 2011 06:19:08 | 1100336 | 11800200 | famous_vh80_1599_200_006708086_1 | 992,186 | 1,085,260 | 1.0938 |
21 Apr 2011 03:44:02 | 1100336 | 11800200 | famous_vh80_1599_200_006708086_1 | 982,826 | 1,075,229 | 1.0940 |
20 Apr 2011 23:26:17 | 1100336 | 11800200 | famous_vh80_1599_200_006708086_1 | 973,466 | 1,065,346 | 1.0944 |
20 Apr 2011 21:38:10 | 1100336 | 11800200 | famous_vh80_1599_200_006708086_1 | 964,106 | 1,055,107 | 1.0944 |
20 Apr 2011 21:38:10 | 1100336 | 11800200 | famous_vh80_1599_200_006708086_1 | 954,746 | 1,044,895 | 1.0944 |
20 Apr 2011 21:38:10 | 1100336 | 11800200 | famous_vh80_1599_200_006708086_1 | 945,386 | 1,033,976 | 1.0937 |
20 Apr 2011 21:38:10 | 1100336 | 11800200 | famous_vh80_1599_200_006708086_1 | 936,026 | 1,023,540 | 1.0935 |
20 Apr 2011 21:38:10 | 1100336 | 11800200 | famous_vh80_1599_200_006708086_1 | 926,666 | 1,013,263 | 1.0935 |
20 Apr 2011 21:38:10 | 1100336 | 11800200 | famous_vh80_1599_200_006708086_1 | 917,306 | 1,003,268 | 1.0937 |
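The Average (sec/TS) column appears to be CPU Time divided by Timestep, and consecutive trickles in the table are 9,360 timesteps apart. The following is a minimal sketch that checks both on the three most recent rows, assuming the pipe-delimited layout shown above; `parse_trickle` is a hypothetical helper name, not part of any CPDN or BOINC tool.

```python
def parse_trickle(line: str) -> dict:
    """Split one pipe-delimited trickle row into typed fields."""
    cells = [c.strip() for c in line.strip().strip("|").split("|")]
    return {
        "time_sent": cells[0],
        "timestep": int(cells[4].replace(",", "")),
        "cpu_sec": int(cells[5].replace(",", "")),
        "avg": float(cells[6]),
    }


# The three most recent rows, copied from the table above.
raw_rows = [
    "27 Apr 2011 01:50:06 | 1100336 | 11800200 | famous_vh80_1599_200_006708086_1 | 1,095,146 | 1,203,422 | 1.0989 |",
    "26 Apr 2011 10:55:48 | 1100336 | 11800200 | famous_vh80_1599_200_006708086_1 | 1,085,786 | 1,192,831 | 1.0986 |",
    "25 Apr 2011 15:03:42 | 1100336 | 11800200 | famous_vh80_1599_200_006708086_1 | 1,076,426 | 1,182,571 | 1.0986 |",
]

rows = [parse_trickle(r) for r in raw_rows]

# Recompute seconds per timestep and compare with the reported column.
for row in rows:
    recomputed = row["cpu_sec"] / row["timestep"]
    print(row["time_sent"], f"{recomputed:.4f}", row["avg"])

# Timestep gap between consecutive trickles (rows are newest first).
steps = [a["timestep"] - b["timestep"] for a, b in zip(rows, rows[1:])]
print(steps)  # [9360, 9360]
```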