Name | famous_umcj_999_200_006661806_3 |
Workunit | 6865178 |
Created | 10 Jun 2010, 15:13:44 UTC |
Sent | 8 Jul 2010, 10:06:09 UTC |
Report deadline | 7 Oct 2010, 17:33:20 UTC |
Received | 24 Jul 2010, 12:25:34 UTC |
Server state | Over |
Outcome | Computation error |
Client state | Compute error |
Exit status | 22 (0x00000016) Unknown error code |
Computer ID | 1062154 |
Run time | 9 days 9 hours 55 min 31 sec |
CPU time | 9 days 1 hours 1 min 38 sec |
Validate state | Invalid |
Credit | 3,922.05 |
Device peak FLOPS | 1.98 GFLOPS |
Application version | UK Met Office FAMOUS v6.11 windows_intelx86 |
Stderr | <core_client_version>6.10.18</core_client_version> <![CDATA[ <message> The device does not recognize the command. (0x16) - exit code 22 (0x16) </message> <stderr_txt> CPDN Monitor - Quit request from BOINC... Signal 11 received, exiting... 08:14:32 (7316): called boinc_finish Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=2056, iMonCtr=1 Model crash detected, will try to restart... Signal 4 received, exiting... 08:29:08 (4060): called boinc_finish Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=2056, iMonCtr=1 Model crash detected, will try to restart... Signal 11 received, exiting... 14:41:14 (6568): called boinc_finish Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=2056, iMonCtr=1 Model crash detected, will try to restart... Signal 11 received, exiting... 18:01:41 (5688): called boinc_finish Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=2056, iMonCtr=1 Model crash detected, will try to restart... Signal 4 received, exiting... 18:53:43 (8088): called boinc_finish Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=2056, iMonCtr=1 Model crash detected, will try to restart... Signal 11 received, exiting... 22:52:54 (108): called boinc_finish Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=2056, iMonCtr=1 Model crash detected, will try to restart... Signal 11 received, exiting... 00:19:48 (4832): called boinc_finish Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=2056, iMonCtr=1 Model crash detected, will try to restart... Signal 4 received, exiting... 02:59:44 (5720): called boinc_finish Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=2056, iMonCtr=1 Model crash detected, will try to restart... Signal 11 received, exiting... 03:05:24 (9196): called boinc_finish Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=2056, iMonCtr=1 Model crash detected, will try to restart... Signal 11 received, exiting... 04:21:49 (6972): called boinc_finish Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=2056, iMonCtr=1 Model crash detected, will try to restart... Signal 8 received, exiting... 05:22:15 (8120): called boinc_finish Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=2056, iMonCtr=1 Model crash detected, will try to restart... Signal 11 received, exiting... 07:16:19 (7608): called boinc_finish Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=2056, iMonCtr=1 Model crash detected, will try to restart... CPDN Monitor - Quit request from BOINC... Signal 11 received, exiting... 23:46:19 (5760): called boinc_finish Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=1160, iMonCtr=1 Model crash detected, will try to restart... Signal 11 received, exiting... 23:49:40 (5692): called boinc_finish Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=1160, iMonCtr=1 Model crash detected, will try to restart... Model crashed: P_TH_ADJ : NEGATIVE PRESSURE VALUE CREATED. tmp/pipe_dummy Signal 4 received, exiting... 07:28:33 (6532): called boinc_finish Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=1160, iMonCtr=1 Model crash detected, will try to restart... Signal 4 received, exiting... 13:08:43 (7208): called boinc_finish Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=1160, iMonCtr=1 Model crash detected, will try to restart... Signal 11 received, exiting... 14:11:27 (9480): called boinc_finish Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=1160, iMonCtr=1 Model crash detected, will try to restart... Signal 11 received, exiting... 15:22:41 (244): called boinc_finish Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=1160, iMonCtr=1 Model crash detected, will try to restart... BUFFIN: Read Failed: No such file or directory BUFFIN: C I/O Error feof - Unit 60 - Return code = 16 BUFFIN: Read Failed: No such file or directory BUFFIN: C I/O Error feof - Unit 61 - Return code = 16 BUFFIN: Read Failed: No such file or directory BUFFIN: C I/O Error feof - Unit 68 - Return code = 16 BUFFIN: Read Failed: No such file or directory BUFFIN: C I/O Error feof - Unit 69 - Return code = 16 Signal 4 received, exiting... 15:58:49 (4012): called boinc_finish Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=1160, iMonCtr=1 Model crash detected, will try to restart... Signal 11 received, exiting... 21:56:42 (8068): called boinc_finish Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=1160, iMonCtr=1 Model crash detected, will try to restart... Signal 11 received, exiting... 22:06:13 (3464): called boinc_finish Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=1160, iMonCtr=1 Model crash detected, will try to restart... Signal 4 received, exiting... 02:39:07 (180): called boinc_finish Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=1160, iMonCtr=1 Model crash detected, will try to restart... Signal 4 received, exiting... 10:14:49 (7312): called boinc_finish Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=1160, iMonCtr=1 Model crash detected, will try to restart... Signal 4 received, exiting... 12:17:18 (7740): called boinc_finish Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=1160, iMonCtr=1 Model crash detected, will try to restart... Model crashed: ATM_DYN : INVALID THETA DETECTED. tmp/pipe_dummy Model crashed: ATM_DYN : INVALID THETA DETECTED. tmp/pipe_dummy Signal 11 received, exiting... 04:11:16 (3732): called boinc_finish Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=2232, iMonCtr=1 Model crash detected, will try to restart... Signal 4 received, exiting... 04:28:24 (2716): called boinc_finish Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=2232, iMonCtr=1 Model crash detected, will try to restart... Model crashed: ATM_DYN : INVALID THETA DETECTED. tmp/pipe_dummy CPDN Monitor - Quit request from BOINC... Model crashed: ATM_DYN : INVALID THETA DETECTED. tmp/pipe_dummy Model crashed: ATM_DYN : INVALID THETA DETECTED. tmp/pipe_dummy Model crashed: ATM_DYN : INVALID THETA DETECTED. tmp/pipe_dummy Model crashed: ATM_DYN : INVALID THETA DETECTED. tmp/pipe_dummy Model crashed: ATM_DYN : INVALID THETA DETECTED. tmp/pipe_dummy Model crashed: ATM_DYN : INVALID THETA DETECTED. tmp/pipe_dummy Sorry, too many model crashes! :-( 06:43:38 (2336): called boinc_finish </stderr_txt> ]]> |
Latest Trickles Received | ||||||
---|---|---|---|---|---|---|
Time Sent (UTC) | Host ID | Result ID | Result Name | Timestep | CPU Time (sec) | Average (sec/TS) |
24 Jul 2010 10:19:59 | 1062154 | 11559888 | famous_umcj_999_200_006661806_3 | 1,188,746 | 776,293 | 0.6530 |
24 Jul 2010 08:32:04 | 1062154 | 11559888 | famous_umcj_999_200_006661806_3 | 1,179,386 | 770,155 | 0.6530 |
24 Jul 2010 06:41:20 | 1062154 | 11559888 | famous_umcj_999_200_006661806_3 | 1,170,026 | 763,903 | 0.6529 |
24 Jul 2010 04:36:00 | 1062154 | 11559888 | famous_umcj_999_200_006661806_3 | 1,160,666 | 757,685 | 0.6528 |
24 Jul 2010 02:10:06 | 1062154 | 11559888 | famous_umcj_999_200_006661806_3 | 1,151,306 | 751,566 | 0.6528 |
23 Jul 2010 08:20:59 | 1062154 | 11559888 | famous_umcj_999_200_006661806_3 | 1,141,946 | 744,700 | 0.6521 |
23 Jul 2010 07:21:12 | 1062154 | 11559888 | famous_umcj_999_200_006661806_3 | 1,132,586 | 738,399 | 0.6520 |
23 Jul 2010 05:10:18 | 1062154 | 11559888 | famous_umcj_999_200_006661806_3 | 1,123,226 | 732,277 | 0.6519 |
23 Jul 2010 03:05:55 | 1062154 | 11559888 | famous_umcj_999_200_006661806_3 | 1,113,866 | 726,149 | 0.6519 |
23 Jul 2010 01:53:59 | 1062154 | 11559888 | famous_umcj_999_200_006661806_3 | 1,104,506 | 720,016 | 0.6519 |
23 Jul 2010 01:53:59 | 1062154 | 11559888 | famous_umcj_999_200_006661806_3 | 1,095,146 | 713,873 | 0.6519 |
22 Jul 2010 21:53:23 | 1062154 | 11559888 | famous_umcj_999_200_006661806_3 | 1,085,786 | 707,732 | 0.6518 |
22 Jul 2010 19:45:18 | 1062154 | 11559888 | famous_umcj_999_200_006661806_3 | 1,076,426 | 701,700 | 0.6519 |
22 Jul 2010 17:50:21 | 1062154 | 11559888 | famous_umcj_999_200_006661806_3 | 1,067,066 | 695,649 | 0.6519 |
22 Jul 2010 16:00:47 | 1062154 | 11559888 | famous_umcj_999_200_006661806_3 | 1,057,706 | 689,528 | 0.6519 |
22 Jul 2010 14:18:16 | 1062154 | 11559888 | famous_umcj_999_200_006661806_3 | 1,048,346 | 683,384 | 0.6519 |
22 Jul 2010 12:38:58 | 1062154 | 11559888 | famous_umcj_999_200_006661806_3 | 1,038,986 | 677,237 | 0.6518 |
22 Jul 2010 10:53:24 | 1062154 | 11559888 | famous_umcj_999_200_006661806_3 | 1,029,626 | 671,102 | 0.6518 |
22 Jul 2010 09:03:34 | 1062154 | 11559888 | famous_umcj_999_200_006661806_3 | 1,020,266 | 664,944 | 0.6517 |
22 Jul 2010 07:21:00 | 1062154 | 11559888 | famous_umcj_999_200_006661806_3 | 1,010,906 | 658,799 | 0.6517 |
©2024 cpdn.org