Task 15897903

Name	hadcm3n_o5x1_1980_40_008386814_4
Workunit	8537673
Created	20 Jul 2013, 0:14:57 UTC
Sent	20 Jul 2013, 0:15:14 UTC
Report deadline	19 Oct 2013, 7:42:25 UTC
Received	23 Sep 2013, 17:14:16 UTC
Server state	Over
Outcome	Computation error
Client state	Compute error
Exit status	193 (0x000000C1) EXIT_SIGNAL
Computer ID	1120700
Run time	37 days 19 hours 44 min 19 sec
CPU time	26 days 14 hours 18 min 43 sec
Validate state	Invalid
Credit	9,331.20
Device peak FLOPS	1.36 GFLOPS
Application version	UK Met Office Coupled Model Full Resolution Ocean v6.07 windows_intelx86
Stderr	<core_client_version>7.0.64</core_client_version> <![CDATA[ <message> (unknown error) - exit code 193 (0xc1) </message> <stderr_txt> CPDN Monitor - Quit request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... CPDN Monitor - Quit request from BOINC... 16:59:00 (2664): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... 17:39:56 (4800): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=8248, iMonCtr=1 Model crash detected, will try to restart... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=8248, iMonCtr=1 Model crash detected, will try to restart... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=8248, iMonCtr=1 Model crash detected, will try to restart... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... 09:40:31 (17812): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 09:40:33 (17812): No heartbeat from core client for 30 sec - exiting 09:40:34 (17812): No heartbeat from core client for 30 sec - exiting CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=1232, iMonCtr=1 Model crash detected, will try to restart... Suspended CPDN Monitor - Suspend request from BOINC... 09:00:56 (4600): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CSuspended CPDN Monitor - Suspend request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=6108, iMonCtr=1 Model crash detected, will try to restart... Suspended CPDN Monitor - Suspend request from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5464, iMonCtr=1 Model crash detected, will try to restart... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3736, iMonCtr=1 Model crash detected, will try to restart... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... 20:57:05 (4608): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3076, iMonCtr=1 Model crash detected, will try to restart... 17:52:25 (4528): No heartbeat from core client for 30 sec - exiting 17:52:27 (4528): No heartbeat from core client for 30 sec - exiting 17:52:28 (4528): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... CPDN Monitor - Quit request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Signal 11 received, exiting... Called boinc_finish </stderr_txt> ]]>

Latest Trickles Received
Time Sent (UTC)	Host ID	Result ID	Result Name	Timestep	CPU Time (sec)	Average (sec/TS)
23 Sep 2013 13:50:21	1120700	15897903	hadcm3n_o5x1_1980_40_008386814_4	777,600	2,297,917	2.9551
21 Sep 2013 10:40:59	1120700	15897903	hadcm3n_o5x1_1980_40_008386814_4	751,680	2,222,044	2.9561
18 Sep 2013 14:17:08	1120700	15897903	hadcm3n_o5x1_1980_40_008386814_4	725,760	2,146,089	2.9570
18 Sep 2013 13:15:32	1120700	15897903	hadcm3n_o5x1_1980_40_008386814_4	699,840	2,070,480	2.9585
18 Sep 2013 13:15:32	1120700	15897903	hadcm3n_o5x1_1980_40_008386814_4	673,920	1,994,587	2.9597
18 Sep 2013 13:15:32	1120700	15897903	hadcm3n_o5x1_1980_40_008386814_4	648,000	1,918,637	2.9609
18 Sep 2013 13:15:32	1120700	15897903	hadcm3n_o5x1_1980_40_008386814_4	622,080	1,842,810	2.9623
08 Sep 2013 01:04:17	1120700	15897903	hadcm3n_o5x1_1980_40_008386814_4	596,160	1,765,673	2.9617
06 Sep 2013 21:26:12	1120700	15897903	hadcm3n_o5x1_1980_40_008386814_4	570,240	1,687,146	2.9587
05 Sep 2013 19:02:00	1120700	15897903	hadcm3n_o5x1_1980_40_008386814_4	544,320	1,609,698	2.9573
04 Sep 2013 15:49:02	1120700	15897903	hadcm3n_o5x1_1980_40_008386814_4	518,400	1,531,709	2.9547
03 Sep 2013 13:02:15	1120700	15897903	hadcm3n_o5x1_1980_40_008386814_4	492,480	1,454,202	2.9528
02 Sep 2013 10:24:45	1120700	15897903	hadcm3n_o5x1_1980_40_008386814_4	466,560	1,376,320	2.9499
01 Sep 2013 06:38:19	1120700	15897903	hadcm3n_o5x1_1980_40_008386814_4	440,640	1,298,346	2.9465
31 Aug 2013 01:56:41	1120700	15897903	hadcm3n_o5x1_1980_40_008386814_4	414,720	1,219,982	2.9417
30 Aug 2013 00:52:54	1120700	15897903	hadcm3n_o5x1_1980_40_008386814_4	388,800	1,142,230	2.9378
28 Aug 2013 06:57:38	1120700	15897903	hadcm3n_o5x1_1980_40_008386814_4	362,880	1,064,567	2.9337
26 Aug 2013 19:16:36	1120700	15897903	hadcm3n_o5x1_1980_40_008386814_4	336,960	988,706	2.9342
25 Aug 2013 17:49:58	1120700	15897903	hadcm3n_o5x1_1980_40_008386814_4	311,040	911,518	2.9305
24 Aug 2013 15:19:40	1120700	15897903	hadcm3n_o5x1_1980_40_008386814_4	285,120	835,118	2.9290