Task 10986647

Name	hadsm3dhet2_jmre_006592492_0
Workunit	6795865
Created	15 Mar 2010, 11:57:29 UTC
Sent	15 Oct 2010, 5:29:24 UTC
Report deadline	27 Sep 2011, 10:49:24 UTC
Received	18 Oct 2010, 18:30:06 UTC
Server state	Over
Outcome	Computation error
Client state	Compute error
Exit status	22 (0x00000016) Unknown error code
Computer ID	1078114
Run time
CPU time	1 days 11 hours 1 min 46 sec
Validate state	Invalid
Credit	1,389.41
Device peak FLOPS	2.98 GFLOPS
Application version	UK Met Office HadSM3 Slab Model v6.08 i686-pc-linux-gnu
Stderr	<core_client_version>6.4.5</core_client_version> <![CDATA[ <message> process exited with code 22 (0x16, -234) </message> <stderr_txt> CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=71537, iMonCtr=1 Model crash detected, will try to restart... Suspended CPDN Monitor - Quit request from BOINC... CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=1548, iMonCtr=1 Model crash detected, will try to restart... CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=1548, iMonCtr=1 Model crash detected, will try to restart... CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=1548, iMonCtr=1 Model crash detected, will try to restart... CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=1548, iMonCtr=1 Model crash detected, will try to restart... CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=1548, iMonCtr=1 Model crash detected, will try to restart... CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=1548, iMonCtr=1 Model crash detected, will try to restart... CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=1548, iMonCtr=1 Model crash detected, will try to restart... CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=1548, iMonCtr=1 Model crash detected, will try to restart... CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=1548, iMonCtr=1 Model crash detected, will try to restart... CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=1548, iMonCtr=1 Model crash detected, will try to restart... CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=1548, iMonCtr=1 Model crash detected, will try to restart... CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=1548, iMonCtr=1 Model crash detected, will try to restart... CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=1548, iMonCtr=1 Model crash detected, will try to restart... CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=1548, iMonCtr=1 Model crash detected, will try to restart... CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=1548, iMonCtr=1 Model crash detected, will try to restart... CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=1548, iMonCtr=1 Model crash detected, will try to restart... CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=1548, iMonCtr=1 Model crash detected, will try to restart... CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=1548, iMonCtr=1 Model crash detected, will try to restart... CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=1548, iMonCtr=1 Model crash detected, will try to restart... CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=1548, iMonCtr=1 Model crash detected, will try to restart... Sorry, too many model crashes! :-( called boinc_finish </stderr_txt> ]]>

Latest Trickles Received
Time Sent (UTC)	Host ID	Result ID	Result Name	Timestep	CPU Time (sec)	Average (sec/TS)
18 Oct 2010 12:55:09	1078114	10986647	hadsm3dhet2_jmre_006592492_0	151,228	135,502	0.8960
18 Oct 2010 10:46:36	1078114	10986647	hadsm3dhet2_jmre_006592492_0	140,426	127,990	0.9114
18 Oct 2010 08:29:18	1078114	10986647	hadsm3dhet2_jmre_006592492_0	129,624	120,007	0.9258
18 Oct 2010 06:08:47	1078114	10986647	hadsm3dhet2_jmre_006592492_0	118,822	111,554	0.9388
18 Oct 2010 04:20:23	1078114	10986647	hadsm3dhet2_jmre_006592492_0	108,020	100,484	0.9302
18 Oct 2010 04:20:23	1078114	10986647	hadsm3dhet2_jmre_006592492_0	97,218	91,252	0.9386
17 Oct 2010 21:30:47	1078114	10986647	hadsm3dhet2_jmre_006592492_0	86,416	80,693	0.9338
17 Oct 2010 18:32:58	1078114	10986647	hadsm3dhet2_jmre_006592492_0	75,614	70,109	0.9272
17 Oct 2010 15:34:48	1078114	10986647	hadsm3dhet2_jmre_006592492_0	64,812	59,538	0.9186
17 Oct 2010 12:35:20	1078114	10986647	hadsm3dhet2_jmre_006592492_0	54,010	48,989	0.9070
17 Oct 2010 09:41:40	1078114	10986647	hadsm3dhet2_jmre_006592492_0	43,208	38,614	0.8937
17 Oct 2010 06:47:07	1078114	10986647	hadsm3dhet2_jmre_006592492_0	32,406	28,309	0.8736
17 Oct 2010 03:52:26	1078114	10986647	hadsm3dhet2_jmre_006592492_0	21,604	17,908	0.8289
17 Oct 2010 03:17:27	1078114	10986647	hadsm3dhet2_jmre_006592492_0	10,802	8,754	0.8104