Task 16014219

Name	hadcm3n_n01h_1880_40_008373105_2
Workunit	8523964
Created	13 Sep 2013, 3:07:39 UTC
Sent	13 Sep 2013, 3:21:31 UTC
Report deadline	13 Dec 2013, 10:48:42 UTC
Received	20 Oct 2013, 20:04:55 UTC
Server state	Over
Outcome	Computation error
Client state	Compute error
Exit status	193 (0x000000C1) EXIT_SIGNAL
Computer ID	1122348
Run time	12 days 11 hours 31 min 45 sec
CPU time	12 days 10 hours 35 min 1 sec
Validate state	Invalid
Credit	6,220.80
Device peak FLOPS	2.30 GFLOPS
Application version	UK Met Office Coupled Model Full Resolution Ocean v6.07 windows_intelx86
Stderr	<core_client_version>6.10.58</core_client_version> <![CDATA[ <message> - exit code 193 (0xc1) </message> <stderr_txt> CPDN Monitor - Quit request from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=6064, iMonCtr=1 Model crash detected, will try to restart... 16:28:56 (5588): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 23:05:21 (1780): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 23:07:03 (7532): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 23:11:40 (4804): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 03:04:25 (6880): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5784, iMonCtr=1 Model crash detected, will try to restart... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5784, iMonCtr=1 Model crash detected, will try to restart... CPDN Monitor - Quit request from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5016, iMonCtr=1 Model crash detected, will try to restart... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5016, iMonCtr=1 Model crash detected, will try to restart... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5016, iMonCtr=1 Model crash detected, will try to restart... Suspended CPDN Monitor - Suspend request from BOINC... 04:19:20 (7940): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 05:50:38 (5872): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... CPDN Monitor - Quit request from BOINC... 04:18:24 (4316): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 02:07:04 (3892): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 07:05:35 (5176): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 07:07:21 (3984): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 07:09:02 (7936): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 07:10:41 (5292): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 07:13:02 (5540): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=7092, iMonCtr=1 Model crash detected, will try to restart... CPDN Monitor - Quit request from BOINC... 22:42:15 (9232): No heartbeat from core client for 30 sec - exiting 22:42:17 (9232): No heartbeat from core client for 30 sec - exiting 22:42:18 (9232): No heartbeat from core client for 30 sec - exiting 22:42:19 (9232): No heartbeat from core client for 30 sec - exiting 22:42:20 (9232): No heartbeat from core client for 30 sec - exiting 22:42:21 (9232): No heartbeat from core client for 30 sec - exiting 22:42:22 (9232): No heartbeat from core client for 30 sec - exiting 22:42:23 (9232): No heartbeat from core client for 30 sec - exiting 22:42:24 (9232): No heartbeat from core client for 30 sec - exiting 22:42:25 (9232): No heartbeat from core client for 30 sec - exiting 22:42:26 (9232): No heartbeat from core client for 30 sec - exiting 22:42:27 (9232): No heartbeat from core client for 30 sec - exiting 22:42:28 (9232): No heartbeat from core client for 30 sec - exiting 22:42:29 (9232): No heartbeat from core client for 30 sec - exiting 22:42:30 (9232): No heartbeat from core client for 30 sec - exiting 22:42:31 (9232): No heartbeat from core client for 30 sec - exiting 22:42:32 (9232): No heartbeat from core client for 30 sec - exiting 22:42:33 (9232): No heartbeat from core client for 30 sec - exiting 22:42:34 (9232): No heartbeat from core client for 30 sec - exiting 22:42:35 (9232): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5804, iMonCtr=1 Model crash detected, will try to restart... 03:38:42 (6816): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=8124, iMonCtr=1 Model crash detected, will try to restart... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... Unhandled Exception Detected... - Unhandled Exception Record - Reason: Access Violation (0xc0000005) at address 0x77C7FF6B write attempt to address 0xFFFFFFF8 Engaging BOINC Windows Runtime Debugger... Unhandled Exception Detected... - Unhandled Exception Record - Reason: Access Violation (0xc0000005) at address 0x77243AC3 read attempt to address 0x00000000 Engaging BOINC Windows Runtime Debugger... Cannot serialize file G:\BOINC Data/projects/climateprediction.net/hadcm3n_n01h_1880_40_008373105/dataout/shmem_restart.day Signal 11 received, exiting... Called boinc_finish </stderr_txt> ]]>

Latest Trickles Received
Time Sent (UTC)	Host ID	Result ID	Result Name	Timestep	CPU Time (sec)	Average (sec/TS)
19 Oct 2013 10:35:50	1122348	16014219	hadcm3n_n01h_1880_40_008373105_2	518,400	1,061,573	2.0478
18 Oct 2013 06:59:50	1122348	16014219	hadcm3n_n01h_1880_40_008373105_2	492,480	1,006,907	2.0446
17 Oct 2013 02:40:27	1122348	16014219	hadcm3n_n01h_1880_40_008373105_2	466,560	951,747	2.0399
14 Oct 2013 04:55:23	1122348	16014219	hadcm3n_n01h_1880_40_008373105_2	440,640	896,640	2.0349
11 Oct 2013 11:44:47	1122348	16014219	hadcm3n_n01h_1880_40_008373105_2	414,720	841,167	2.0283
07 Oct 2013 16:53:11	1122348	16014219	hadcm3n_n01h_1880_40_008373105_2	388,800	786,844	2.0238
05 Oct 2013 11:04:43	1122348	16014219	hadcm3n_n01h_1880_40_008373105_2	362,880	731,537	2.0159
04 Oct 2013 09:06:42	1122348	16014219	hadcm3n_n01h_1880_40_008373105_2	336,960	675,692	2.0053
02 Oct 2013 10:36:33	1122348	16014219	hadcm3n_n01h_1880_40_008373105_2	311,040	621,951	1.9996
01 Oct 2013 05:54:25	1122348	16014219	hadcm3n_n01h_1880_40_008373105_2	285,120	566,582	1.9872
30 Sep 2013 05:03:29	1122348	16014219	hadcm3n_n01h_1880_40_008373105_2	259,200	510,348	1.9689
29 Sep 2013 13:20:47	1122348	16014219	hadcm3n_n01h_1880_40_008373105_2	233,280	453,926	1.9458
28 Sep 2013 23:20:49	1122348	16014219	hadcm3n_n01h_1880_40_008373105_2	207,360	399,270	1.9255
28 Sep 2013 07:49:16	1122348	16014219	hadcm3n_n01h_1880_40_008373105_2	181,440	344,987	1.9014
25 Sep 2013 19:32:01	1122348	16014219	hadcm3n_n01h_1880_40_008373105_2	155,520	290,935	1.8707
25 Sep 2013 09:29:28	1122348	16014219	hadcm3n_n01h_1880_40_008373105_2	129,600	247,831	1.9123
24 Sep 2013 09:19:26	1122348	16014219	hadcm3n_n01h_1880_40_008373105_2	103,680	206,061	1.9875
24 Sep 2013 08:18:58	1122348	16014219	hadcm3n_n01h_1880_40_008373105_2	77,760	162,009	2.0834
24 Sep 2013 08:18:58	1122348	16014219	hadcm3n_n01h_1880_40_008373105_2	51,840	109,179	2.1061
21 Sep 2013 16:33:20	1122348	16014219	hadcm3n_n01h_1880_40_008373105_2	25,920	54,765	2.1128