Task 11577187

Name	famous_up0m_599_200_006665265_0
Workunit	6868637
Created	10 Jun 2010, 15:44:17 UTC
Sent	17 Jun 2010, 12:02:08 UTC
Report deadline	16 Sep 2010, 19:29:19 UTC
Received	10 Oct 2010, 13:06:02 UTC
Server state	Over
Outcome	Computation error
Client state	Compute error
Exit status	22 (0x00000016) Unknown error code
Computer ID	980556
Run time	8 days 14 hours 16 min 48 sec
CPU time	8 days 7 hours 5 min 44 sec
Validate state	Invalid
Credit	5,157.32
Device peak FLOPS	2.06 GFLOPS
Application version	UK Met Office FAMOUS v6.10 windows_intelx86
Stderr	<core_client_version>6.10.18</core_client_version> <![CDATA[ <message> The device does not recognize the command. (0x16) - exit code 22 (0x16) </message> <stderr_txt> Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=8656, iMonCtr=1 Model crash detected, will try to restart... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=7592, iMonCtr=1 Model crash detected, will try to restart... 07:15:37 (4544): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5136, iMonCtr=1 Model crash detected, will try to restart... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=6136, iMonCtr=1 Model crash detected, will try to restart... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... CPDN Monitor - Quit request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... CPDN Monitor - Quit request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... 10:49:13 (5612): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 11:41:07 (7584): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 11:41:13 (7584): No heartbeat from core client for 30 sec - exiting Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=7380, iMonCtr=1 Model crash detected, will try to restart... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=1516, iMonCtr=1 Model crash detected, will try to restart... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=6984, iMonCtr=1 Model crash detected, will try to restart... BUFFIN: Read Failed: No such file or directory BUFFIN: C I/O Error feof - Unit 60 - Return code = 16 BUFFIN: Read Failed: No such file or directory BUFFIN: C I/O Error feof - Unit 61 - Return code = 16 BUFFIN: Read Failed: No such file or directory BUFFIN: C I/O Error feof - Unit 68 - Return code = 16 BUFFIN: Read Failed: No such file or directory BUFFIN: C I/O Error feof - Unit 69 - Return code = 16 Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... 21:16:39 (4008): No heartbeat from core client for 30 sec - exiting 21:16:40 (4008): No heartbeat from core client for 30 sec - exiting 21:16:41 (4008): No heartbeat from core client for 30 sec - exiting 21:16:42 (4008): No heartbeat from core client for 30 sec - exiting 21:16:43 (4008): No heartbeat from core client for 30 sec - exiting 21:16:44 (4008): No heartbeat from core client for 30 sec - exiting 21:16:45 (4008): No heartbeat from core client for 30 sec - exiting 21:16:46 (4008): No heartbeat from core client for 30 sec - exiting 21:16:47 (4008): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 21:16:48 (4008): No heartbeat from core client for 30 sec - exiting 21:18:22 (5928): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=6728, iMonCtr=1 Model crash detected, will try to restart... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3724, iMonCtr=1 Model crash detected, will try to restart... 18:54:07 (7316): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 18:54:08 (7316): No heartbeat from core client for 30 sec - exiting 18:54:09 (7316): No heartbeat from core client for 30 sec - exiting 18:54:10 (7316): No heartbeat from core client for 30 sec - exiting 18:54:11 (7316): No heartbeat from core client for 30 sec - exiting 18:54:12 (7316): No heartbeat from core client for 30 sec - exiting 18:54:13 (7316): No heartbeat from core client for 30 sec - exiting 18:54:14 (7316): No heartbeat from core client for 30 sec - exiting 18:54:15 (7316): No heartbeat from core client for 30 sec - exiting 18:54:16 (7316): No heartbeat from core client for 30 sec - exiting 18:54:17 (7316): No heartbeat from core client for 30 sec - exiting CPDN Monitor - Quit request from BOINC... BUFFIN: Read Failed: No such file or directory BUFFIN: C I/O Error feof - Unit 60 - Return code = 16 BUFFIN: Read Failed: No such file or directory BUFFIN: C I/O Error feof - Unit 61 - Return code = 16 BUFFIN: Read Failed: No such file or directory BUFFIN: C I/O Error feof - Unit 68 - Return code = 16 BUFFIN: Read Failed: No such file or directory BUFFIN: C I/O Error feof - Unit 69 - Return code = 16 CPDN Monitor - Quit request from BOINC... 21:09:49 (5460): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... Model crashed: P_TH_ADJ : NEGATIVE PRESSURE VALUE CREATED. tmp/pipe_dummy Model crashed: P_TH_ADJ : NEGATIVE PRESSURE VALUE CREATED. tmp/pipe_dummy Model crashed: P_TH_ADJ : NEGATIVE PRESSURE VALUE CREATED. tmp/pipe_dummy Model crashed: P_TH_ADJ : NEGATIVE PRESSURE VALUE CREATED. tmp/pipe_dummy Model crashed: P_TH_ADJ : NEGATIVE PRESSURE VALUE CREATED. tmp/pipe_dummy Model crashed: P_TH_ADJ : NEGATIVE PRESSURE VALUE CREATED. tmp/pipe_dummy Sorry, too many model crashes! :-( 23:04:51 (6896): called boinc_finish </stderr_txt> ]]>

Latest Trickles Received
Time Sent (UTC)	Host ID	Result ID	Result Name	Timestep	CPU Time (sec)	Average (sec/TS)
10 Oct 2010 12:16:51	980556	11577187	famous_up0m_599_200_006665265_0	1,563,146	713,756	0.4566
10 Oct 2010 11:11:07	980556	11577187	famous_up0m_599_200_006665265_0	1,553,786	709,292	0.4565
10 Oct 2010 11:11:07	980556	11577187	famous_up0m_599_200_006665265_0	1,544,426	704,829	0.4564
10 Oct 2010 11:11:07	980556	11577187	famous_up0m_599_200_006665265_0	1,535,066	700,359	0.4562
10 Oct 2010 11:11:07	980556	11577187	famous_up0m_599_200_006665265_0	1,525,706	695,892	0.4561
10 Oct 2010 11:11:07	980556	11577187	famous_up0m_599_200_006665265_0	1,516,346	691,433	0.4560
10 Oct 2010 11:11:07	980556	11577187	famous_up0m_599_200_006665265_0	1,506,986	686,978	0.4559
10 Oct 2010 11:11:07	980556	11577187	famous_up0m_599_200_006665265_0	1,497,626	682,473	0.4557
10 Oct 2010 11:11:07	980556	11577187	famous_up0m_599_200_006665265_0	1,488,266	677,897	0.4555
10 Oct 2010 11:11:07	980556	11577187	famous_up0m_599_200_006665265_0	1,478,906	673,290	0.4553
10 Oct 2010 11:11:07	980556	11577187	famous_up0m_599_200_006665265_0	1,469,546	668,759	0.4551
10 Oct 2010 11:11:07	980556	11577187	famous_up0m_599_200_006665265_0	1,460,186	664,270	0.4549
06 Oct 2010 12:33:13	980556	11577187	famous_up0m_599_200_006665265_0	1,450,826	659,862	0.4548
03 Oct 2010 12:58:00	980556	11577187	famous_up0m_599_200_006665265_0	1,441,466	655,299	0.4546
03 Oct 2010 03:23:02	980556	11577187	famous_up0m_599_200_006665265_0	1,432,106	650,846	0.4545
30 Sep 2010 11:38:58	980556	11577187	famous_up0m_599_200_006665265_0	1,422,746	646,629	0.4545
30 Sep 2010 10:20:23	980556	11577187	famous_up0m_599_200_006665265_0	1,413,386	642,136	0.4543
30 Sep 2010 04:50:48	980556	11577187	famous_up0m_599_200_006665265_0	1,404,026	637,585	0.4541
30 Sep 2010 04:50:48	980556	11577187	famous_up0m_599_200_006665265_0	1,394,666	633,033	0.4539
25 Sep 2010 14:17:03	980556	11577187	famous_up0m_599_200_006665265_0	1,385,306	629,003	0.4541