Task 15895675

Name	hadcm3n_u0ch_2020_40_008339134_3
Workunit	8489995
Created	18 Jul 2013, 13:31:58 UTC
Sent	18 Jul 2013, 13:35:31 UTC
Report deadline	17 Oct 2013, 21:02:42 UTC
Received	20 Sep 2013, 12:20:40 UTC
Server state	Over
Outcome	Computation error
Client state	Compute error
Exit status	-1073741819 (0xC0000005) STATUS_ACCESS_VIOLATION
Computer ID	1269121
Run time	15 days 10 hours 11 min 48 sec
CPU time	13 days 13 hours 11 min 12 sec
Validate state	Invalid
Credit	3,110.40
Device peak FLOPS	1.46 GFLOPS
Application version	UK Met Office Coupled Model Full Resolution Ocean v6.07 windows_intelx86
Stderr	<core_client_version>7.0.64</core_client_version> <![CDATA[ <message> (unknown error) - exit code -1073741819 (0xc0000005) </message> <stderr_txt> CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... 21:58:01 (5376): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... 12:55:21 (5436): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5960, iMonCtr=1 Model crash detected, will try to restart... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5960, iMonCtr=1 Model crash detected, will try to restart... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5960, iMonCtr=1 Model crash detected, will try to restart... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5960, iMonCtr=1 Model crash detected, will try to restart... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5960, iMonCtr=1 Model crash detected, will try to restart... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5960, iMonCtr=1 Model crash detected, will try to restart... Sorry, too many model crashes! :-( Called boinc_finish 15:28:26 (6132): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... CPDN Monitor - Quit request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... 08:39:23 (3708): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... CPDN Monitor - Quit request from BOINC... BUFFIN: C I/O Error feof - Unit 63 - Return code = 16 BUFFIN: C I/O Error feof - Unit 64 - Return code = 16 BUFFIN: C I/O Error feof - Unit 65 - Return code = 16 BUFFIN: C I/O Error feof - Unit 66 - Return code = 16 BUFFIN: C I/O Error feof - Unit 67 - Return code = 16 BUFFIN: C I/O Error feof - Unit 68 - Return code = 16 BUFFIN: C I/O Error feof - Unit 69 - Return code = 16 Error converting file to netcdf: dataout/u0chko.pjm3c10 Error converting file to netcdf: dataout/u0chko.pim3c10 Error converting file to netcdf: dataout/u0chko.pfm3c10 Error converting file to netcdf: dataout/u0chka.phm3c10 Error converting file to netcdf: dataout/u0chka.pgm3c10 Error converting file to netcdf: dataout/u0chka.pem3c10 Error converting file to netcdf: dataout/u0chka.pdm3c10 08:16:38 (5936): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 08:16:39 (5936): No heartbeat from core client for 30 sec - exiting 08:16:40 (5936): No heartbeat from core client for 30 sec - exiting 08:22:13 (4116): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... 12:27:59 (4936): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... 03:19:21 (5156): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Unhandled Exception Detected... - Unhandled Exception Record - Reason: Access Violation (0xc0000005) at address 0x7C95101A write attempt to address 0x40A26306 Engaging BOINC Windows Runtime Debugger... </stderr_txt> ]]>

Latest Trickles Received
Time Sent (UTC)	Host ID	Result ID	Result Name	Timestep	CPU Time (sec)	Average (sec/TS)
20 Sep 2013 12:25:35	1269121	15895675	hadcm3n_u0ch_2020_40_008339134_3	259,200	1,170,661	4.5164
18 Sep 2013 15:00:19	1269121	15895675	hadcm3n_u0ch_2020_40_008339134_3	233,280	1,089,385	4.6699
15 Sep 2013 16:56:27	1269121	15895675	hadcm3n_u0ch_2020_40_008339134_3	207,360	1,008,248	4.8623
14 Sep 2013 16:49:59	1269121	15895675	hadcm3n_u0ch_2020_40_008339134_3	181,440	926,985	5.1090
10 Sep 2013 20:17:28	1269121	15895675	hadcm3n_u0ch_2020_40_008339134_3	155,520	848,736	5.4574
08 Sep 2013 04:25:28	1269121	15895675	hadcm3n_u0ch_2020_40_008339134_3	129,600	768,461	5.9295
14 Aug 2013 15:28:54	1269121	15895675	hadcm3n_u0ch_2020_40_008339134_3	103,680	322,149	3.1071
14 Aug 2013 15:28:54	1269121	15895675	hadcm3n_u0ch_2020_40_008339134_3	77,760	240,779	3.0964
14 Aug 2013 15:28:54	1269121	15895675	hadcm3n_u0ch_2020_40_008339134_3	51,840	161,793	3.1210
14 Aug 2013 15:28:54	1269121	15895675	hadcm3n_u0ch_2020_40_008339134_3	25,920	81,003	3.1251