Task 11559888

Name	famous_umcj_999_200_006661806_3
Workunit	6865178
Created	10 Jun 2010, 15:13:44 UTC
Sent	8 Jul 2010, 10:06:09 UTC
Report deadline	7 Oct 2010, 17:33:20 UTC
Received	24 Jul 2010, 12:25:34 UTC
Server state	Over
Outcome	Computation error
Client state	Compute error
Exit status	22 (0x00000016) Unknown error code
Computer ID	1062154
Run time	9 days 9 hours 55 min 31 sec
CPU time	9 days 1 hours 1 min 38 sec
Validate state	Invalid
Credit	3,922.05
Device peak FLOPS	1.98 GFLOPS
Application version	UK Met Office FAMOUS v6.11 windows_intelx86
Stderr	<core_client_version>6.10.18</core_client_version> <![CDATA[ <message> The device does not recognize the command. (0x16) - exit code 22 (0x16) </message> <stderr_txt> CPDN Monitor - Quit request from BOINC... Signal 11 received, exiting... 08:14:32 (7316): called boinc_finish Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=2056, iMonCtr=1 Model crash detected, will try to restart... Signal 4 received, exiting... 08:29:08 (4060): called boinc_finish Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=2056, iMonCtr=1 Model crash detected, will try to restart... Signal 11 received, exiting... 14:41:14 (6568): called boinc_finish Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=2056, iMonCtr=1 Model crash detected, will try to restart... Signal 11 received, exiting... 18:01:41 (5688): called boinc_finish Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=2056, iMonCtr=1 Model crash detected, will try to restart... Signal 4 received, exiting... 18:53:43 (8088): called boinc_finish Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=2056, iMonCtr=1 Model crash detected, will try to restart... Signal 11 received, exiting... 22:52:54 (108): called boinc_finish Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=2056, iMonCtr=1 Model crash detected, will try to restart... Signal 11 received, exiting... 00:19:48 (4832): called boinc_finish Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=2056, iMonCtr=1 Model crash detected, will try to restart... Signal 4 received, exiting... 02:59:44 (5720): called boinc_finish Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=2056, iMonCtr=1 Model crash detected, will try to restart... Signal 11 received, exiting... 03:05:24 (9196): called boinc_finish Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=2056, iMonCtr=1 Model crash detected, will try to restart... Signal 11 received, exiting... 04:21:49 (6972): called boinc_finish Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=2056, iMonCtr=1 Model crash detected, will try to restart... Signal 8 received, exiting... 05:22:15 (8120): called boinc_finish Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=2056, iMonCtr=1 Model crash detected, will try to restart... Signal 11 received, exiting... 07:16:19 (7608): called boinc_finish Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=2056, iMonCtr=1 Model crash detected, will try to restart... CPDN Monitor - Quit request from BOINC... Signal 11 received, exiting... 23:46:19 (5760): called boinc_finish Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=1160, iMonCtr=1 Model crash detected, will try to restart... Signal 11 received, exiting... 23:49:40 (5692): called boinc_finish Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=1160, iMonCtr=1 Model crash detected, will try to restart... Model crashed: P_TH_ADJ : NEGATIVE PRESSURE VALUE CREATED. tmp/pipe_dummy Signal 4 received, exiting... 07:28:33 (6532): called boinc_finish Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=1160, iMonCtr=1 Model crash detected, will try to restart... Signal 4 received, exiting... 13:08:43 (7208): called boinc_finish Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=1160, iMonCtr=1 Model crash detected, will try to restart... Signal 11 received, exiting... 14:11:27 (9480): called boinc_finish Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=1160, iMonCtr=1 Model crash detected, will try to restart... Signal 11 received, exiting... 15:22:41 (244): called boinc_finish Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=1160, iMonCtr=1 Model crash detected, will try to restart... BUFFIN: Read Failed: No such file or directory BUFFIN: C I/O Error feof - Unit 60 - Return code = 16 BUFFIN: Read Failed: No such file or directory BUFFIN: C I/O Error feof - Unit 61 - Return code = 16 BUFFIN: Read Failed: No such file or directory BUFFIN: C I/O Error feof - Unit 68 - Return code = 16 BUFFIN: Read Failed: No such file or directory BUFFIN: C I/O Error feof - Unit 69 - Return code = 16 Signal 4 received, exiting... 15:58:49 (4012): called boinc_finish Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=1160, iMonCtr=1 Model crash detected, will try to restart... Signal 11 received, exiting... 21:56:42 (8068): called boinc_finish Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=1160, iMonCtr=1 Model crash detected, will try to restart... Signal 11 received, exiting... 22:06:13 (3464): called boinc_finish Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=1160, iMonCtr=1 Model crash detected, will try to restart... Signal 4 received, exiting... 02:39:07 (180): called boinc_finish Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=1160, iMonCtr=1 Model crash detected, will try to restart... Signal 4 received, exiting... 10:14:49 (7312): called boinc_finish Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=1160, iMonCtr=1 Model crash detected, will try to restart... Signal 4 received, exiting... 12:17:18 (7740): called boinc_finish Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=1160, iMonCtr=1 Model crash detected, will try to restart... Model crashed: ATM_DYN : INVALID THETA DETECTED. tmp/pipe_dummy Model crashed: ATM_DYN : INVALID THETA DETECTED. tmp/pipe_dummy Signal 11 received, exiting... 04:11:16 (3732): called boinc_finish Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=2232, iMonCtr=1 Model crash detected, will try to restart... Signal 4 received, exiting... 04:28:24 (2716): called boinc_finish Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=2232, iMonCtr=1 Model crash detected, will try to restart... Model crashed: ATM_DYN : INVALID THETA DETECTED. tmp/pipe_dummy CPDN Monitor - Quit request from BOINC... Model crashed: ATM_DYN : INVALID THETA DETECTED. tmp/pipe_dummy Model crashed: ATM_DYN : INVALID THETA DETECTED. tmp/pipe_dummy Model crashed: ATM_DYN : INVALID THETA DETECTED. tmp/pipe_dummy Model crashed: ATM_DYN : INVALID THETA DETECTED. tmp/pipe_dummy Model crashed: ATM_DYN : INVALID THETA DETECTED. tmp/pipe_dummy Model crashed: ATM_DYN : INVALID THETA DETECTED. tmp/pipe_dummy Sorry, too many model crashes! :-( 06:43:38 (2336): called boinc_finish </stderr_txt> ]]>

Latest Trickles Received
Time Sent (UTC)	Host ID	Result ID	Result Name	Timestep	CPU Time (sec)	Average (sec/TS)
24 Jul 2010 10:19:59	1062154	11559888	famous_umcj_999_200_006661806_3	1,188,746	776,293	0.6530
24 Jul 2010 08:32:04	1062154	11559888	famous_umcj_999_200_006661806_3	1,179,386	770,155	0.6530
24 Jul 2010 06:41:20	1062154	11559888	famous_umcj_999_200_006661806_3	1,170,026	763,903	0.6529
24 Jul 2010 04:36:00	1062154	11559888	famous_umcj_999_200_006661806_3	1,160,666	757,685	0.6528
24 Jul 2010 02:10:06	1062154	11559888	famous_umcj_999_200_006661806_3	1,151,306	751,566	0.6528
23 Jul 2010 08:20:59	1062154	11559888	famous_umcj_999_200_006661806_3	1,141,946	744,700	0.6521
23 Jul 2010 07:21:12	1062154	11559888	famous_umcj_999_200_006661806_3	1,132,586	738,399	0.6520
23 Jul 2010 05:10:18	1062154	11559888	famous_umcj_999_200_006661806_3	1,123,226	732,277	0.6519
23 Jul 2010 03:05:55	1062154	11559888	famous_umcj_999_200_006661806_3	1,113,866	726,149	0.6519
23 Jul 2010 01:53:59	1062154	11559888	famous_umcj_999_200_006661806_3	1,104,506	720,016	0.6519
23 Jul 2010 01:53:59	1062154	11559888	famous_umcj_999_200_006661806_3	1,095,146	713,873	0.6519
22 Jul 2010 21:53:23	1062154	11559888	famous_umcj_999_200_006661806_3	1,085,786	707,732	0.6518
22 Jul 2010 19:45:18	1062154	11559888	famous_umcj_999_200_006661806_3	1,076,426	701,700	0.6519
22 Jul 2010 17:50:21	1062154	11559888	famous_umcj_999_200_006661806_3	1,067,066	695,649	0.6519
22 Jul 2010 16:00:47	1062154	11559888	famous_umcj_999_200_006661806_3	1,057,706	689,528	0.6519
22 Jul 2010 14:18:16	1062154	11559888	famous_umcj_999_200_006661806_3	1,048,346	683,384	0.6519
22 Jul 2010 12:38:58	1062154	11559888	famous_umcj_999_200_006661806_3	1,038,986	677,237	0.6518
22 Jul 2010 10:53:24	1062154	11559888	famous_umcj_999_200_006661806_3	1,029,626	671,102	0.6518
22 Jul 2010 09:03:34	1062154	11559888	famous_umcj_999_200_006661806_3	1,020,266	664,944	0.6517
22 Jul 2010 07:21:00	1062154	11559888	famous_umcj_999_200_006661806_3	1,010,906	658,799	0.6517