Task 12745882

Name	hadcm3n_o5em_1900_40_007202337_1
Workunit	7400617
Created	28 Mar 2011, 14:13:32 UTC
Sent	29 Mar 2011, 21:23:48 UTC
Report deadline	29 Jun 2011, 4:50:59 UTC
Received	22 Apr 2011, 19:41:09 UTC
Server state	Over
Outcome	Computation error
Client state	Compute error
Exit status	22 (0x00000016) Unknown error code
Computer ID	1346606
Run time	10 days 1 hours 33 min 1 sec
CPU time	9 days 7 hours 51 min 5 sec
Validate state	Invalid
Credit	6,220.80
Device peak FLOPS	3.01 GFLOPS
Application version	UK Met Office Coupled Model Full Resolution Ocean v6.07 windows_intelx86
Stderr	<core_client_version>6.10.60</core_client_version> <![CDATA[ <message> The device does not recognize the command. (0x16) - exit code 22 (0x16) </message> <stderr_txt> Suspended CPDN Monitor - Suspend request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... CPDN Monitor - Quit request from BOINC... 11:08:05 (6776): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... 16:01:09 (6420): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... CPDN Monitor - Quit request from BOINC... 17:18:55 (6416): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... CPDN Monitor - Quit request from BOINC... 18:33:32 (7016): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... 10:01:04 (4428): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 13:05:13 (6700): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 13:26:10 (5616): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 17:53:26 (7092): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 00:27:33 (4856): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... CPDN Monitor - Quit request from BOINC... 01:15:39 (5892): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=7908, iMonCtr=1 Model crash detected, will try to restart... 02:16:24 (5184): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... CPDN Monitor - Quit request from BOINC... 17:32:52 (6172): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Model crashed: P_TH_ADJ : NEGATIVE PRESSURE VALUE CREATED. tmp/pipe_dummy 2048 Model crashed: P_TH_ADJ : NEGATIVE PRESSURE VALUE CREATED. tmp/pipe_dummy 2048 Model crashed: P_TH_ADJ : NEGATIVE PRESSURE VALUE CREATED. tmp/pipe_dummy 2048 Model crashed: P_TH_ADJ : NEGATIVE PRESSURE VALUE CREATED. tmp/pipe_dummy 2048 Model crashed: P_TH_ADJ : NEGATIVE PRESSURE VALUE CREATED. tmp/pipe_dummy 2048 Model crashed: P_TH_ADJ : NEGATIVE PRESSURE VALUE CREATED. tmp/pipe_dummy 2048 Sorry, too many model crashes! :-( Called boinc_finish </stderr_txt> ]]>

Latest Trickles Received
Time Sent (UTC)	Host ID	Result ID	Result Name	Timestep	CPU Time (sec)	Average (sec/TS)
22 Apr 2011 17:40:47	1134682	12745882	hadcm3n_o5em_1900_40_007202337_1	518,400	840,906	1.6221
22 Apr 2011 05:00:23	1134682	12745882	hadcm3n_o5em_1900_40_007202337_1	492,480	797,247	1.6188
21 Apr 2011 17:28:39	1134682	12745882	hadcm3n_o5em_1900_40_007202337_1	466,560	753,923	1.6159
21 Apr 2011 03:55:03	1134682	12745882	hadcm3n_o5em_1900_40_007202337_1	440,640	710,673	1.6128
20 Apr 2011 18:30:51	1134682	12745882	hadcm3n_o5em_1900_40_007202337_1	414,720	667,903	1.6105
20 Apr 2011 18:30:51	1134682	12745882	hadcm3n_o5em_1900_40_007202337_1	388,800	625,341	1.6084
20 Apr 2011 18:30:51	1134682	12745882	hadcm3n_o5em_1900_40_007202337_1	362,880	582,559	1.6054
20 Apr 2011 18:30:51	1134682	12745882	hadcm3n_o5em_1900_40_007202337_1	336,960	540,051	1.6027
20 Apr 2011 18:30:51	1134682	12745882	hadcm3n_o5em_1900_40_007202337_1	311,040	498,246	1.6019
20 Apr 2011 18:30:51	1134682	12745882	hadcm3n_o5em_1900_40_007202337_1	285,120	457,275	1.6038
20 Apr 2011 18:30:51	1134682	12745882	hadcm3n_o5em_1900_40_007202337_1	259,200	416,584	1.6072
20 Apr 2011 18:30:51	1134682	12745882	hadcm3n_o5em_1900_40_007202337_1	233,280	374,241	1.6043
20 Apr 2011 18:30:50	1134682	12745882	hadcm3n_o5em_1900_40_007202337_1	207,360	333,613	1.6089
20 Apr 2011 18:30:50	1134682	12745882	hadcm3n_o5em_1900_40_007202337_1	181,440	292,847	1.6140
20 Apr 2011 18:30:50	1134682	12745882	hadcm3n_o5em_1900_40_007202337_1	155,520	252,621	1.6244
20 Apr 2011 18:30:50	1134682	12745882	hadcm3n_o5em_1900_40_007202337_1	129,600	213,448	1.6470
20 Apr 2011 18:30:50	1134682	12745882	hadcm3n_o5em_1900_40_007202337_1	103,680	174,155	1.6797
12 Apr 2011 23:50:27	1134682	12745882	hadcm3n_o5em_1900_40_007202337_1	77,760	133,556	1.7175
12 Apr 2011 04:05:26	1134682	12745882	hadcm3n_o5em_1900_40_007202337_1	51,840	90,618	1.7480
10 Apr 2011 06:21:33	1134682	12745882	hadcm3n_o5em_1900_40_007202337_1	25,920	48,875	1.8856