Task 11672757

Name	famous_ub82_1999_200_006647389_6
Workunit	6850761
Created	24 Aug 2010, 10:29:56 UTC
Sent	24 Aug 2010, 11:03:29 UTC
Report deadline	23 Nov 2010, 18:30:40 UTC
Received	16 Sep 2010, 15:01:28 UTC
Server state	Over
Outcome	Computation error
Client state	Compute error
Exit status	22 (0x00000016) Unknown error code
Computer ID	1095977
Run time	15 days 18 hours 25 min 27 sec
CPU time	15 days 6 hours 5 min 50 sec
Validate state	Invalid
Credit	5,188.20
Device peak FLOPS	1.46 GFLOPS
Application version	UK Met Office FAMOUS v6.11 windows_intelx86
Stderr	<core_client_version>6.10.58</core_client_version> <![CDATA[ <message> The device does not recognize the command. (0x16) - exit code 22 (0x16) </message> <stderr_txt> Suspended CPDN Monitor - Suspend request from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4132, iMonCtr=1 Model crash detected, will try to restart... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4728, iMonCtr=1 Model crash detected, will try to restart... BUFFIN: Read Failed: No such file or directory BUFFIN: C I/O Error feof - Unit 60 - Return code = 16 BUFFIN: Read Failed: No such file or directory BUFFIN: C I/O Error feof - Unit 61 - Return code = 16 BUFFIN: Read Failed: No such file or directory BUFFIN: C I/O Error feof - Unit 68 - Return code = 16 BUFFIN: Read Failed: No such file or directory BUFFIN: C I/O Error feof - Unit 69 - Return code = 16 Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5136, iMonCtr=1 Model crash detected, will try to restart... Suspended CPDN Monitor - Suspend request from BOINC... 10:12:47 (5212): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5092, iMonCtr=1 Model crash detected, will try to restart... Suspended CPDN Monitor - Suspend request from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=816, iMonCtr=1 Model crash detected, will try to restart... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=816, iMonCtr=1 Model crash detected, will try to restart... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5368, iMonCtr=1 Model crash detected, will try to restart... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3556, iMonCtr=1 Model crash detected, will try to restart... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3556, iMonCtr=1 Model crash detected, will try to restart... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5112, iMonCtr=1 Model crash detected, will try to restart... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4524, iMonCtr=1 Model crash detected, will try to restart... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4736, iMonCtr=1 Model crash detected, will try to restart... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4560, iMonCtr=1 Model crash detected, will try to restart... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3388, iMonCtr=1 Model crash detected, will try to restart... Suspended CPDN Monitor - Suspend request from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4968, iMonCtr=1 Model crash detected, will try to restart... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4984, iMonCtr=1 Model crash detected, will try to restart... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4796, iMonCtr=1 Model crash detected, will try to restart... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5320, iMonCtr=1 Model crash detected, will try to restart... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3980, iMonCtr=1 Model crash detected, will try to restart... Suspended CPDN Monitor - Suspend request from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3764, iMonCtr=1 Model crash detected, will try to restart... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5928, iMonCtr=1 Model crash detected, will try to restart... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5396, iMonCtr=1 Model crash detected, will try to restart... cpdnmonitor: cannot open input file D:\BOINCdata/projects/climateprediction.net/famous_ub82_1999_200_006647389/dataout/atmos_restart.day after 11 attempts Model crashed: DRLANDF1 : Error in FILE_OPEN. tmp/pipe_dummy cpdnmonitor: cannot open input file D:\BOINCdata/projects/climateprediction.net/famous_ub82_1999_200_006647389/dataout/atmos_restart.day after 11 attempts Model crashed: DRLANDF1 : Error in FILE_OPEN. tmp/pipe_dummy cpdnmonitor: cannot open input file D:\BOINCdata/projects/climateprediction.net/famous_ub82_1999_200_006647389/dataout/atmos_restart.day after 11 attempts Model crashed: DRLANDF1 : Error in FILE_OPEN. tmp/pipe_dummy cpdnmonitor: cannot open input file D:\BOINCdata/projects/climateprediction.net/famous_ub82_1999_200_006647389/dataout/atmos_restart.day after 11 attempts Model crashed: DRLANDF1 : Error in FILE_OPEN. tmp/pipe_dummy cpdnmonitor: cannot open input file D:\BOINCdata/projects/climateprediction.net/famous_ub82_1999_200_006647389/dataout/atmos_restart.day after 11 attempts Model crashed: DRLANDF1 : Error in FILE_OPEN. tmp/pipe_dummy cpdnmonitor: cannot open input file D:\BOINCdata/projects/climateprediction.net/famous_ub82_1999_200_006647389/dataout/atmos_restart.day after 11 attempts Model crashed: DRLANDF1 : Error in FILE_OPEN. tmp/pipe_dummy Sorry, too many model crashes! :-( 16:45:53 (1320): called boinc_finish </stderr_txt> ]]>

Latest Trickles Received
Time Sent (UTC)	Host ID	Result ID	Result Name	Timestep	CPU Time (sec)	Average (sec/TS)
16 Sep 2010 12:26:10	1095977	11672757	famous_ub82_1999_200_006647389_6	1,572,506	1,310,590	0.8334
16 Sep 2010 10:03:39	1095977	11672757	famous_ub82_1999_200_006647389_6	1,563,146	1,302,559	0.8333
16 Sep 2010 07:38:43	1095977	11672757	famous_ub82_1999_200_006647389_6	1,553,786	1,294,542	0.8332
16 Sep 2010 04:23:44	1095977	11672757	famous_ub82_1999_200_006647389_6	1,544,426	1,286,548	0.8330
16 Sep 2010 02:07:01	1095977	11672757	famous_ub82_1999_200_006647389_6	1,535,066	1,278,521	0.8329
15 Sep 2010 23:48:44	1095977	11672757	famous_ub82_1999_200_006647389_6	1,525,706	1,270,555	0.8328
15 Sep 2010 21:31:18	1095977	11672757	famous_ub82_1999_200_006647389_6	1,516,346	1,262,584	0.8326
15 Sep 2010 19:11:43	1095977	11672757	famous_ub82_1999_200_006647389_6	1,506,986	1,254,572	0.8325
15 Sep 2010 13:21:52	1095977	11672757	famous_ub82_1999_200_006647389_6	1,497,626	1,246,573	0.8324
15 Sep 2010 10:52:42	1095977	11672757	famous_ub82_1999_200_006647389_6	1,488,266	1,238,581	0.8322
15 Sep 2010 07:32:39	1095977	11672757	famous_ub82_1999_200_006647389_6	1,478,906	1,230,548	0.8321
15 Sep 2010 05:18:55	1095977	11672757	famous_ub82_1999_200_006647389_6	1,469,546	1,222,551	0.8319
15 Sep 2010 03:04:44	1095977	11672757	famous_ub82_1999_200_006647389_6	1,460,186	1,214,534	0.8318
15 Sep 2010 01:49:10	1095977	11672757	famous_ub82_1999_200_006647389_6	1,450,826	1,206,545	0.8316
14 Sep 2010 22:33:12	1095977	11672757	famous_ub82_1999_200_006647389_6	1,441,466	1,198,633	0.8315
14 Sep 2010 20:06:14	1095977	11672757	famous_ub82_1999_200_006647389_6	1,432,106	1,190,608	0.8314
14 Sep 2010 17:44:54	1095977	11672757	famous_ub82_1999_200_006647389_6	1,422,746	1,182,522	0.8312
14 Sep 2010 15:21:32	1095977	11672757	famous_ub82_1999_200_006647389_6	1,413,386	1,174,455	0.8310
14 Sep 2010 13:01:26	1095977	11672757	famous_ub82_1999_200_006647389_6	1,404,026	1,166,445	0.8308
14 Sep 2010 10:42:06	1095977	11672757	famous_ub82_1999_200_006647389_6	1,394,666	1,158,386	0.8306