Task 15874174

Name	hadcm3n_o1kv_2100_40_008270693_4
Workunit	8425817
Created	30 Jun 2013, 13:55:25 UTC
Sent	30 Jun 2013, 14:13:59 UTC
Report deadline	29 Sep 2013, 21:41:10 UTC
Received	24 Sep 2013, 18:21:33 UTC
Server state	Over
Outcome	Computation error
Client state	Compute error
Exit status	-1073741819 (0xC0000005) STATUS_ACCESS_VIOLATION
Computer ID	1065739
Run time	38 days 4 hours 19 min 46 sec
CPU time	34 days 14 hours 46 min 16 sec
Validate state	Invalid
Credit	12,441.60
Device peak FLOPS	2.91 GFLOPS
Application version	UK Met Office Coupled Model Full Resolution Ocean v6.07 windows_intelx86
Stderr	<core_client_version>7.0.64</core_client_version>
<![CDATA[
<message>
(unknown error) - exit code -1073741819 (0xc0000005)
</message>
<stderr_txt>
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
CPDN Monitor - Quit request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
16:07:53 (3616): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
10:11:13 (4068): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3696, iMonCtr=1
Model crash detected, will try to restart...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
CPDN Monitor - Quit request from BOINC...
13:45:57 (1828): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=2476, iMonCtr=1
Model crash detected, will try to restart...
CPDN Monitor - Quit request from BOINC...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=2728, iMonCtr=1
Model crash detected, will try to restart...
CPDN Monitor - Quit request from BOINC...
Signal 22 received, exiting...
Called boinc_finish
CPDN Monitor - Quit request from BOINC...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=2936, iMonCtr=1
Model crash detected, will try to restart...
CPDN Monitor - Quit request from BOINC...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=2136, iMonCtr=1
Model crash detected, will try to restart...
CPDN Monitor - Quit request from BOINC...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=2756, iMonCtr=1
Model crash detected, will try to restart...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=2756, iMonCtr=1
Model crash detected, will try to restart...
Sorry, too many model crashes! :-(
Called boinc_finish
CPDN Monitor - Quit request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
22:20:07 (3588): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
Atmos Hold Restart file rename failed on atmos_restart.hold
Suspended CPDN Monitor - Suspend request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3120, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3100, iMonCtr=1
Model crash detected, will try to restart...
Suspended CPDN Monitor - Suspend request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=2212, iMonCtr=1
Model crash detected, will try to restart...
CPDN Monitor - Quit request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
CPDN Monitor - Quit request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4424, iMonCtr=1
Model crash detected, will try to restart...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Unhandled Exception Detected...
- Unhandled Exception Record -
Reason: Access Violation (0xc0000005) at address 0x76F8C3EB write attempt to address 0xFFFFFFF8
Engaging BOINC Windows Runtime Debugger...
Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=4504, selfPID=4504, iMonCtr=1
</stderr_txt>
]]>
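
The exit status above appears in two forms: the signed decimal -1073741819 and the hexadecimal 0xC0000005. They are the same 32-bit value read as signed and as unsigned. A minimal sketch of the conversion follows; the function name `ntstatus_hex` is hypothetical and not part of BOINC or CPDN.

```python
def ntstatus_hex(exit_code: int) -> str:
    """Reinterpret a signed 32-bit exit code as an unsigned hex string."""
    # Masking with 0xFFFFFFFF maps a negative value onto its
    # two's-complement unsigned 32-bit equivalent.
    return f"0x{exit_code & 0xFFFFFFFF:08X}"


print(ntstatus_hex(-1073741819))  # 0xC0000005
```

On Windows, 0xC0000005 is the code for STATUS_ACCESS_VIOLATION, which matches the "Access Violation (0xc0000005)" reason at the end of the stderr log.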

Latest Trickles Received
Time Sent (UTC)	Host ID	Result ID	Result Name	Timestep	CPU Time (sec)	Average (sec/TS)
25 Sep 2013 09:22:52	1065739	15874174	hadcm3n_o1kv_2100_40_008270693_4	1,036,800	2,990,767	2.8846
23 Sep 2013 15:29:02	1065739	15874174	hadcm3n_o1kv_2100_40_008270693_4	1,010,880	2,947,725	2.9160
21 Sep 2013 20:06:05	1065739	15874174	hadcm3n_o1kv_2100_40_008270693_4	984,960	2,907,961	2.9524
20 Sep 2013 19:26:20	1065739	15874174	hadcm3n_o1kv_2100_40_008270693_4	959,040	2,864,653	2.9870
20 Sep 2013 06:38:31	1065739	15874174	hadcm3n_o1kv_2100_40_008270693_4	933,120	2,821,752	3.0240
18 Sep 2013 18:12:56	1065739	15874174	hadcm3n_o1kv_2100_40_008270693_4	907,200	2,778,587	3.0628
17 Sep 2013 20:18:56	1065739	15874174	hadcm3n_o1kv_2100_40_008270693_4	881,280	2,735,815	3.1044
16 Sep 2013 13:48:26	1065739	15874174	hadcm3n_o1kv_2100_40_008270693_4	855,360	2,692,952	3.1483
14 Sep 2013 15:39:43	1065739	15874174	hadcm3n_o1kv_2100_40_008270693_4	829,440	2,650,279	3.1953
11 Sep 2013 16:23:00	1065739	15874174	hadcm3n_o1kv_2100_40_008270693_4	803,520	2,607,632	3.2453
10 Sep 2013 13:15:44	1065739	15874174	hadcm3n_o1kv_2100_40_008270693_4	777,600	2,562,893	3.2959
09 Sep 2013 10:44:06	1065739	15874174	hadcm3n_o1kv_2100_40_008270693_4	751,680	2,517,124	3.3487
06 Sep 2013 15:49:12	1065739	15874174	hadcm3n_o1kv_2100_40_008270693_4	725,760	2,471,865	3.4059
05 Sep 2013 15:25:50	1065739	15874174	hadcm3n_o1kv_2100_40_008270693_4	699,840	2,425,071	3.4652
14 Aug 2013 16:27:28	1065739	15874174	hadcm3n_o1kv_2100_40_008270693_4	673,920	1,174,300	1.7425
14 Aug 2013 16:27:28	1065739	15874174	hadcm3n_o1kv_2100_40_008270693_4	648,000	1,126,076	1.7378
29 Jul 2013 13:55:43	1065739	15874174	hadcm3n_o1kv_2100_40_008270693_4	622,080	1,083,740	1.7421
26 Jul 2013 14:44:09	1065739	15874174	hadcm3n_o1kv_2100_40_008270693_4	596,160	1,036,780	1.7391
23 Jul 2013 22:16:24	1065739	15874174	hadcm3n_o1kv_2100_40_008270693_4	570,240	990,191	1.7364
23 Jul 2013 21:48:19	1065739	15874174	hadcm3n_o1kv_2100_40_008270693_4	544,320	945,415	1.7369
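
The Average (sec/TS) column in the table above is consistent with cumulative CPU time divided by cumulative timestep, rounded to four decimals. That reading is inferred from the numbers, not from CPDN documentation. A short sketch that checks it against three rows; the helper name `seconds_per_timestep` is hypothetical.

```python
def seconds_per_timestep(timestep: str, cpu_seconds: str) -> float:
    """Cumulative CPU seconds per timestep, from comma-grouped table values."""
    ts = int(timestep.replace(",", ""))
    cpu = int(cpu_seconds.replace(",", ""))
    return round(cpu / ts, 4)


# (Timestep, CPU Time) pairs copied from the trickle table.
rows = [
    ("1,036,800", "2,990,767"),
    ("673,920", "1,174,300"),
    ("544,320", "945,415"),
]
for ts, cpu in rows:
    print(ts, seconds_per_timestep(ts, cpu))
```

Under this reading the value is a running average, so the jump from 1.7425 to 3.4652 between timesteps 673,920 and 699,840 comes from the CPU time added in that single interval and carries into every later row.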