Task 13371115

Name	hadcm3n_o4f5_1940_40_007452253_1
Workunit	7649756
Created	10 Sep 2011, 12:56:15 UTC
Sent	11 Sep 2011, 10:42:31 UTC
Report deadline	11 Dec 2011, 18:09:42 UTC
Received	6 Dec 2011, 13:47:02 UTC
Server state	Over
Outcome	Computation error
Client state	Compute error
Exit status	-1073741819 (0xC0000005) STATUS_ACCESS_VIOLATION
Computer ID	1153453
Run time	10 days 16 hours 16 min 7 sec
CPU time	9 days 15 hours 3 min 44 sec
Validate state	Invalid
Credit	6,220.80
Device peak FLOPS	2.64 GFLOPS
Application version	UK Met Office Coupled Model Full Resolution Ocean v6.07 windows_intelx86
Stderr	<core_client_version>6.12.34</core_client_version> <![CDATA[ <message> - exit code -1073741819 (0xc0000005) </message> <stderr_txt> Suspended CPDN Monitor - Suspend request from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4596, iMonCtr=1 Model crash detected, will try to restart... CPDN Monitor - Quit request from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=6012, iMonCtr=1 Model crash detected, will try to restart... CPDN Monitor - Quit request from BOINC... Atmos Hold Restart file rename failed on atmos_restart.hold Suspended CPDN Monitor - Suspend request from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5136, iMonCtr=1 Model crash detected, will try to restart... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... 19:31:00 (7136): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5656, iMonCtr=1 Model crash detected, will try to restart... CPDN Monitor - Quit request from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=6340, iMonCtr=1 Model crash detected, will try to restart... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=6340, iMonCtr=1 Model crash detected, will try to restart... CPDN Monitor - Quit request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4016, iMonCtr=1 Model crash detected, will try to restart... 00:59:56 (5236): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... CPDN Monitor - Quit request from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4884, iMonCtr=1 Model crash detected, will try to restart... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... 21:23:24 (5272): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4376, iMonCtr=1 Model crash detected, will try to restart... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... 00:35:41 (1820): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... CPDN Monitor - Quit request from BOINC... 17:24:14 (5192): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5216, iMonCtr=1 Model crash detected, will try to restart... CPDN Monitor - Quit request from BOINC... 01:00:00 (3580): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 21:13:52 (5576): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=6544, iMonCtr=1 Model crash detected, will try to restart... Suspended CPDN Monitor - Suspend request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... 13:42:42 (4868): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... CPDN Monitor - Quit request from BOINC... 21:40:18 (6860): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4224, iMonCtr=1 Model crash detected, will try to restart... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4576, iMonCtr=1 Model crash detected, will try to restart... 21:28:12 (4432): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... 21:26:09 (3036): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=1652, iMonCtr=1 Model crash detected, will try to restart... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4604, iMonCtr=1 Model crash detected, will try to restart... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4604, iMonCtr=1 Model crash detected, will try to restart... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4604, iMonCtr=1 Model crash detected, will try to restart... CPDN Monitor - Quit request from BOINC... C23:18:28 (5948): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4660, iMonCtr=1 Model crash detected, will try to restart... CPDN Monitor - Quit request from BOINC... Unhandled Exception Detected... - Unhandled Exception Record - Reason: Access Violation (0xc0000005) at address 0x777DB84B write attempt to address 0x4346869F Engaging BOINC Windows Runtime Debugger... Unhandled Exception Detected... - Unhandled Exception Record - Reason: Access Violation (0xc0000005) at address 0x76EC3A93 read attempt to address 0x00000000 Engaging BOINC Windows Runtime Debugger... </stderr_txt> ]]>

Latest Trickles Received
Time Sent (UTC)	Host ID	Result ID	Result Name	Timestep	CPU Time (sec)	Average (sec/TS)
04 Dec 2011 04:38:57	1153453	13371115	hadcm3n_o4f5_1940_40_007452253_1	518,400	830,424	1.6019
23 Nov 2011 15:43:57	1153453	13371115	hadcm3n_o4f5_1940_40_007452253_1	492,480	788,769	1.6016
19 Nov 2011 16:30:45	1153453	13371115	hadcm3n_o4f5_1940_40_007452253_1	466,560	746,036	1.5990
16 Nov 2011 14:41:55	1153453	13371115	hadcm3n_o4f5_1940_40_007452253_1	440,640	703,362	1.5962
15 Nov 2011 22:40:05	1153453	13371115	hadcm3n_o4f5_1940_40_007452253_1	414,720	662,589	1.5977
06 Nov 2011 12:40:36	1153453	13371115	hadcm3n_o4f5_1940_40_007452253_1	388,800	620,280	1.5954
04 Nov 2011 16:04:03	1153453	13371115	hadcm3n_o4f5_1940_40_007452253_1	362,880	577,533	1.5915
31 Oct 2011 19:42:53	1153453	13371115	hadcm3n_o4f5_1940_40_007452253_1	336,960	535,289	1.5886
31 Oct 2011 18:41:36	1153453	13371115	hadcm3n_o4f5_1940_40_007452253_1	311,040	494,306	1.5892
31 Oct 2011 17:27:37	1153453	13371115	hadcm3n_o4f5_1940_40_007452253_1	285,120	452,270	1.5862
31 Oct 2011 15:23:14	1153453	13371115	hadcm3n_o4f5_1940_40_007452253_1	259,200	410,753	1.5847
16 Oct 2011 05:49:44	1153453	13371115	hadcm3n_o4f5_1940_40_007452253_1	233,280	367,613	1.5758
14 Oct 2011 14:04:19	1153453	13371115	hadcm3n_o4f5_1940_40_007452253_1	207,360	325,407	1.5693
09 Oct 2011 12:45:05	1153453	13371115	hadcm3n_o4f5_1940_40_007452253_1	181,440	284,037	1.5655
05 Oct 2011 12:49:06	1153453	13371115	hadcm3n_o4f5_1940_40_007452253_1	155,520	243,330	1.5646
01 Oct 2011 10:40:07	1153453	13371115	hadcm3n_o4f5_1940_40_007452253_1	129,600	203,213	1.5680
27 Sep 2011 13:31:12	1153453	13371115	hadcm3n_o4f5_1940_40_007452253_1	103,680	162,922	1.5714
19 Sep 2011 14:01:39	1153453	13371115	hadcm3n_o4f5_1940_40_007452253_1	77,760	120,911	1.5549
17 Sep 2011 13:57:55	1153453	13371115	hadcm3n_o4f5_1940_40_007452253_1	51,840	80,627	1.5553
14 Sep 2011 15:43:43	1153453	13371115	hadcm3n_o4f5_1940_40_007452253_1	25,920	40,656	1.5685