Task 11519319

Name	famous_ug38_1399_200_006653695_3
Workunit	6857067
Created	10 Jun 2010, 14:02:22 UTC
Sent	21 Aug 2010, 7:28:18 UTC
Report deadline	20 Nov 2010, 14:55:29 UTC
Received	29 Aug 2010, 0:59:21 UTC
Server state	Over
Outcome	Computation error
Client state	Compute error
Exit status	22 (0x00000016) Unknown error code
Computer ID	1095309
Run time	6 days 13 hours 37 min 33 sec
CPU time	6 days 11 hours 16 min 37 sec
Validate state	Invalid
Credit	5,002.91
Device peak FLOPS	2.78 GFLOPS
Application version	UK Met Office FAMOUS v6.11 windows_intelx86
Stderr	<core_client_version>6.10.58</core_client_version> <![CDATA[ <message> The device does not recognize the command. (0x16) - exit code 22 (0x16) </message> <stderr_txt> Suspended CPDN Monitor - Suspend request from BOINC... Signal 11 received, exiting... 10:47:05 (1008): called boinc_finish Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=488, iMonCtr=1 Model crash detected, will try to restart... BUFFIN: Read Failed: No such file or directory BUFFIN: C I/O Error feof - Unit 60 - Return code = 16 BUFFIN: Read Failed: No such file or directory BUFFIN: C I/O Error feof - Unit 61 - Return code = 16 BUFFIN: Read Failed: No such file or directory BUFFIN: C I/O Error feof - Unit 68 - Return code = 16 BUFFIN: Read Failed: No such file or directory BUFFIN: C I/O Error feof - Unit 69 - Return code = 16 Signal 11 received, exiting... 12:23:47 (3508): called boinc_finish Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=488, iMonCtr=1 Model crash detected, will try to restart... Signal 11 received, exiting... 13:17:09 (3820): called boinc_finish Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=488, iMonCtr=1 Model crash detected, will try to restart... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... CPDN Monitor - Quit request from BOINC... Signal 11 received, exiting... 11:05:31 (1916): called boinc_finish Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3576, iMonCtr=1 Model crash detected, will try to restart... Signal 11 received, exiting... 13:40:02 (2364): called boinc_finish Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3576, iMonCtr=1 Model crash detected, will try to restart... Signal 11 received, exiting... 15:17:55 (1252): called boinc_finish Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3576, iMonCtr=1 Model crash detected, will try to restart... Signal 11 received, exiting... 20:32:38 (3596): called boinc_finish Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3576, iMonCtr=1 Model crash detected, will try to restart... Signal 11 received, exiting... 22:28:50 (732): called boinc_finish Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3576, iMonCtr=1 Model crash detected, will try to restart... Signal 11 received, exiting... 00:38:27 (3016): called boinc_finish Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3576, iMonCtr=1 Model crash detected, will try to restart... Signal 11 received, exiting... 01:20:32 (3868): called boinc_finish Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3576, iMonCtr=1 Model crash detected, will try to restart... Signal 11 received, exiting... 03:00:42 (1892): called boinc_finish Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3576, iMonCtr=1 Model crash detected, will try to restart... Signal 11 received, exiting... 03:37:17 (3928): called boinc_finish Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3576, iMonCtr=1 Model crash detected, will try to restart... Suspended CPDN Monitor - Suspend request from BOINC... Signal 11 received, exiting... 13:00:12 (3384): called boinc_finish Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3576, iMonCtr=1 Model crash detected, will try to restart... Signal 11 received, exiting... 16:21:12 (1132): called boinc_finish Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3576, iMonCtr=1 Model crash detected, will try to restart... Signal 11 received, exiting... 16:36:05 (1180): called boinc_finish Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3576, iMonCtr=1 Model crash detected, will try to restart... Signal 11 received, exiting... 22:30:43 (2680): called boinc_finish Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=2548, iMonCtr=1 Model crash detected, will try to restart... Signal 11 received, exiting... 23:36:39 (1976): called boinc_finish Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=2548, iMonCtr=1 Model crash detected, will try to restart... Model crashed: P_TH_ADJ : NEGATIVE PRESSURE VALUE CREATED. tmp/pipe_dummy Model crashed: P_TH_ADJ : NEGATIVE PRESSURE VALUE CREATED. tmp/pipe_dummy Model crashed: P_TH_ADJ : NEGATIVE PRESSURE VALUE CREATED. tmp/pipe_dummy Model crashed: P_TH_ADJ : NEGATIVE PRESSURE VALUE CREATED. tmp/pipe_dummy Model crashed: P_TH_ADJ : NEGATIVE PRESSURE VALUE CREATED. tmp/pipe_dummy Model crashed: P_TH_ADJ : NEGATIVE PRESSURE VALUE CREATED. tmp/pipe_dummy Sorry, too many model crashes! :-( 01:52:36 (2548): called boinc_finish </stderr_txt> ]]>

Latest Trickles Received
Time Sent (UTC)	Host ID	Result ID	Result Name	Timestep	CPU Time (sec)	Average (sec/TS)
29 Aug 2010 00:13:13	1095309	11519319	famous_ug38_1399_200_006653695_3	1,516,346	556,472	0.3670
29 Aug 2010 00:12:28	1095309	11519319	famous_ug38_1399_200_006653695_3	1,506,986	553,119	0.3670
28 Aug 2010 22:17:04	1095309	11519319	famous_ug38_1399_200_006653695_3	1,497,626	549,768	0.3671
28 Aug 2010 21:24:14	1095309	11519319	famous_ug38_1399_200_006653695_3	1,488,266	546,422	0.3672
28 Aug 2010 20:25:59	1095309	11519319	famous_ug38_1399_200_006653695_3	1,478,906	543,070	0.3672
28 Aug 2010 19:32:01	1095309	11519319	famous_ug38_1399_200_006653695_3	1,469,546	539,720	0.3673
28 Aug 2010 18:36:30	1095309	11519319	famous_ug38_1399_200_006653695_3	1,460,186	536,374	0.3673
28 Aug 2010 17:36:12	1095309	11519319	famous_ug38_1399_200_006653695_3	1,450,826	533,027	0.3674
28 Aug 2010 16:43:14	1095309	11519319	famous_ug38_1399_200_006653695_3	1,441,466	529,680	0.3675
28 Aug 2010 15:44:47	1095309	11519319	famous_ug38_1399_200_006653695_3	1,432,106	526,324	0.3675
28 Aug 2010 14:51:39	1095309	11519319	famous_ug38_1399_200_006653695_3	1,422,746	522,960	0.3676
28 Aug 2010 13:53:46	1095309	11519319	famous_ug38_1399_200_006653695_3	1,413,386	519,602	0.3676
28 Aug 2010 12:55:26	1095309	11519319	famous_ug38_1399_200_006653695_3	1,404,026	516,242	0.3677
28 Aug 2010 12:02:11	1095309	11519319	famous_ug38_1399_200_006653695_3	1,394,666	512,880	0.3677
28 Aug 2010 11:03:47	1095309	11519319	famous_ug38_1399_200_006653695_3	1,385,306	509,517	0.3678
28 Aug 2010 10:09:37	1095309	11519319	famous_ug38_1399_200_006653695_3	1,375,946	506,152	0.3679
28 Aug 2010 09:14:21	1095309	11519319	famous_ug38_1399_200_006653695_3	1,366,586	502,795	0.3679
28 Aug 2010 08:20:23	1095309	11519319	famous_ug38_1399_200_006653695_3	1,357,226	499,431	0.3680
28 Aug 2010 08:20:23	1095309	11519319	famous_ug38_1399_200_006653695_3	1,347,866	496,066	0.3680
28 Aug 2010 06:27:33	1095309	11519319	famous_ug38_1399_200_006653695_3	1,338,506	492,702	0.3681