Name | famous_uf01_1799_200_006652284_2 |
Workunit | 6855656 |
Created | 10 Jun 2010, 13:50:04 UTC |
Sent | 19 Aug 2010, 0:29:39 UTC |
Report deadline | 18 Nov 2010, 7:56:50 UTC |
Received | 22 Aug 2010, 19:29:57 UTC |
Server state | Over |
Outcome | Computation error |
Client state | Compute error |
Exit status | -226 (0xFFFFFF1E) ERR_TOO_MANY_EXITS |
Computer ID | 1088185 |
Run time | 1 day 22 hours 47 min 33 sec |
CPU time | 1 day 22 hours 26 min 11 sec |
Validate state | Invalid |
Credit | 895.65 |
Device peak FLOPS | 1.66 GFLOPS |
Application version | UK Met Office FAMOUS v6.11 windows_intelx86 |
Stderr | <core_client_version>6.10.58</core_client_version> <![CDATA[ <message> too many exit(0)s </message> <stderr_txt> Suspended CPDN Monitor - Suspend request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... BUFFIN: Read Failed: No such file or directory BUFFIN: C I/O Error feof - Unit 60 - Return code = 16 BUFFIN: Read Failed: No such file or directory BUFFIN: C I/O Error feof - Unit 61 - Return code = 16 BUFFIN: Read Failed: No such file or directory BUFFIN: C I/O Error feof - Unit 68 - Return code = 16 BUFFIN: Read Failed: No such file or directory BUFFIN: C I/O Error feof - Unit 69 - Return code = 16 CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... 
Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=5988, selfPID=5988, iMonCtr=1 Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=4828, selfPID=4828, iMonCtr=1 Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=2700, selfPID=2700, iMonCtr=1 Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=2928, selfPID=2928, iMonCtr=1 Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=2448, selfPID=2448, iMonCtr=1 Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=6052, selfPID=6052, iMonCtr=1 Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=2804, iMonCtr=1 Model crash detected, will try to restart... Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=5192, selfPID=5192, iMonCtr=1 Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4228, iMonCtr=1 Model crash detected, will try to restart... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5452, iMonCtr=1 Model crash detected, will try to restart... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5768, iMonCtr=1 Model crash detected, will try to restart... Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=5956, selfPID=5956, iMonCtr=1 Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5996, iMonCtr=1 Model crash detected, will try to restart... Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=3056, selfPID=3056, iMonCtr=1 Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=5696, selfPID=5696, iMonCtr=1 Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5740, iMonCtr=1 Model crash detected, will try to restart... 
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5964, iMonCtr=1 Model crash detected, will try to restart... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=752, iMonCtr=1 Model crash detected, will try to restart... Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=5432, selfPID=5432, iMonCtr=1 Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=3052, selfPID=3052, iMonCtr=1 Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4552, iMonCtr=1 Model crash detected, will try to restart... Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=5832, selfPID=5832, iMonCtr=1 Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5396, iMonCtr=1 Model crash detected, will try to restart... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=6028, iMonCtr=1 Model crash detected, will try to restart... Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=5884, selfPID=5884, iMonCtr=1 Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=5136, selfPID=5136, iMonCtr=1 Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=5036, selfPID=5036, iMonCtr=1 Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=3544, selfPID=3544, iMonCtr=1 Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=6044, iMonCtr=1 Model crash detected, will try to restart... 
Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=4820, selfPID=4820, iMonCtr=1 Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5012, iMonCtr=1 Model crash deWorker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=6088, selfPID=6088, iMonCtr=1 Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4100, iMonCtr=1 Model crash detected, will try to restart... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=6008, iMonCtr=1 Model crash detected, will try to restart... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=976, iMonCtr=1 Model crash detected, will try to restart... Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=5320, selfPID=5320, iMonCtr=1 Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=4116, selfPID=4116, iMonCtr=1 Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4720, iMonCtr=1 Model crash detected, will try to restart... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=6076, iMonCtr=1 Model crash detected, will try to restart... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5444, iMonCtr=1 Model crash detected, will try to restart... Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=6040, selfPID=6040, iMonCtr=1 Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=6108, iMonCtr=1 Model crash detected, will try to restart... </stderr_txt> ]]> |
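The Exit status row gives the same code in two forms, -226 and 0xFFFFFF1E. The hex form appears to be the signed value reinterpreted as an unsigned 32-bit integer. A minimal sketch of that conversion follows; the function name `exit_status_hex` is hypothetical and not part of BOINC.

```python
def exit_status_hex(status: int) -> str:
    """Format a signed exit status as an unsigned 32-bit hex string.

    Masking with 0xFFFFFFFF yields the two's-complement bit pattern
    that a 32-bit signed integer would have.
    """
    return f"0x{status & 0xFFFFFFFF:08X}"


# -226 is the value shown in the Exit status row (ERR_TOO_MANY_EXITS).
print(exit_status_hex(-226))  # prints 0xFFFFFF1E
```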
Latest Trickles Received

Time Sent (UTC) | Host ID | Result ID | Result Name | Timestep | CPU Time (sec) | Average (sec/TS) |
---|---|---|---|---|---|---|
22 Aug 2010 01:39:59 | 1088185 | 11512259 | famous_uf01_1799_200_006652284_2 | 271,466 | 165,988 | 0.6115 |
21 Aug 2010 22:01:21 | 1088185 | 11512259 | famous_uf01_1799_200_006652284_2 | 262,106 | 160,221 | 0.6113 |
21 Aug 2010 20:28:32 | 1088185 | 11512259 | famous_uf01_1799_200_006652284_2 | 252,746 | 154,540 | 0.6114 |
21 Aug 2010 18:49:43 | 1088185 | 11512259 | famous_uf01_1799_200_006652284_2 | 243,386 | 148,777 | 0.6113 |
21 Aug 2010 07:40:41 | 1088185 | 11512259 | famous_uf01_1799_200_006652284_2 | 234,026 | 143,011 | 0.6111 |
20 Aug 2010 21:25:46 | 1088185 | 11512259 | famous_uf01_1799_200_006652284_2 | 224,666 | 137,378 | 0.6115 |
20 Aug 2010 19:48:19 | 1088185 | 11512259 | famous_uf01_1799_200_006652284_2 | 215,306 | 131,742 | 0.6119 |
20 Aug 2010 18:13:19 | 1088185 | 11512259 | famous_uf01_1799_200_006652284_2 | 205,946 | 126,113 | 0.6124 |
20 Aug 2010 16:34:55 | 1088185 | 11512259 | famous_uf01_1799_200_006652284_2 | 196,586 | 120,361 | 0.6123 |
20 Aug 2010 15:02:31 | 1088185 | 11512259 | famous_uf01_1799_200_006652284_2 | 187,226 | 114,610 | 0.6121 |
20 Aug 2010 13:24:10 | 1088185 | 11512259 | famous_uf01_1799_200_006652284_2 | 177,866 | 108,853 | 0.6120 |
20 Aug 2010 11:48:06 | 1088185 | 11512259 | famous_uf01_1799_200_006652284_2 | 168,506 | 103,097 | 0.6118 |
20 Aug 2010 10:12:29 | 1088185 | 11512259 | famous_uf01_1799_200_006652284_2 | 159,146 | 97,335 | 0.6116 |
20 Aug 2010 08:36:31 | 1088185 | 11512259 | famous_uf01_1799_200_006652284_2 | 149,786 | 91,573 | 0.6114 |
20 Aug 2010 06:55:58 | 1088185 | 11512259 | famous_uf01_1799_200_006652284_2 | 140,426 | 85,814 | 0.6111 |
20 Aug 2010 05:23:46 | 1088185 | 11512259 | famous_uf01_1799_200_006652284_2 | 131,066 | 80,054 | 0.6108 |
20 Aug 2010 03:47:54 | 1088185 | 11512259 | famous_uf01_1799_200_006652284_2 | 121,706 | 74,288 | 0.6104 |
20 Aug 2010 02:08:41 | 1088185 | 11512259 | famous_uf01_1799_200_006652284_2 | 112,346 | 68,532 | 0.6100 |
20 Aug 2010 00:33:00 | 1088185 | 11512259 | famous_uf01_1799_200_006652284_2 | 102,986 | 62,769 | 0.6095 |
19 Aug 2010 22:50:52 | 1088185 | 11512259 | famous_uf01_1799_200_006652284_2 | 93,626 | 57,020 | 0.6090 |
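The Average (sec/TS) column is consistent with cumulative CPU time divided by the timestep count, rounded to four decimal places. The sketch below checks that assumption against three rows copied from the table; the function name `seconds_per_timestep` is hypothetical, and the formula is inferred from the figures rather than taken from CPDN documentation.

```python
def seconds_per_timestep(cpu_seconds: int, timestep: int) -> float:
    """Average CPU seconds per model timestep, rounded to 4 decimals."""
    return round(cpu_seconds / timestep, 4)


# (timestep, CPU time in seconds) from the newest, second-newest,
# and oldest trickles listed in the table.
rows = [
    (271_466, 165_988),
    (262_106, 160_221),
    (93_626, 57_020),
]

for timestep, cpu_seconds in rows:
    average = seconds_per_timestep(cpu_seconds, timestep)
    print(f"{timestep:>7} timesteps: {average:.4f} sec/TS")
```

For these rows the computed values match the table entries of 0.6115, 0.6113, and 0.6090.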