Task 13416009

Name	hadcm3n_u4bf_1980_40_007460793_2
Workunit	7658296
Created	23 Sep 2011, 21:20:40 UTC
Sent	23 Sep 2011, 21:33:12 UTC
Report deadline	24 Dec 2011, 5:00:23 UTC
Received	29 Oct 2011, 16:53:00 UTC
Server state	Over
Outcome	Computation error
Client state	Compute error
Exit status	193 (0x000000C1) EXIT_SIGNAL
Computer ID	985784
Run time	23 days 0 hours 10 min 47 sec
CPU time	21 days 23 hours 8 min 5 sec
Validate state	Invalid
Credit	12,441.60
Device peak FLOPS	2.57 GFLOPS
Application version	UK Met Office Coupled Model Full Resolution Ocean v6.07 windows_intelx86
Stderr	<core_client_version>6.10.18</core_client_version> <![CDATA[ <message> - exit code 193 (0xc1) </message> <stderr_txt> CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... 16:01:43 (2052): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... CPDN Monitor - Quit request from BOINC... 16:01:41 (3044): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... CPDN Monitor - Quit request from BOINC... 16:01:42 (2676): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 16:01:43 (2676): No heartbeat from core client for 30 sec - exiting CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... 16:01:43 (2180): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 16:01:44 (2180): No heartbeat from core client for 30 sec - exiting 16:01:45 (2180): No heartbeat from core client for 30 sec - exiting CPDN Monitor - Quit request from BOINC... 16:01:40 (2988): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... CPDN Monitor - Quit request from BOINC... 16:01:41 (2980): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... 
CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... 16:02:01 (1688): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 16:01:53 (3508): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=2356, iMonCtr=1 Model crash detected, will try to restart... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=2356, iMonCtr=1 Model crash detected, will try to restart... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=1708, iMonCtr=1 Model crash detected, will try to restart... CPDN Monitor - Quit request from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=2584, iMonCtr=1 Model crash detected, will try to restart... CPDN Monitor - Quit request from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=2576, iMonCtr=1 Model crash detected, will try to restart... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=2576, iMonCtr=1 Model crash detected, will try to restart... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=2552, iMonCtr=1 Model crash detected, will try to restart... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=1864, iMonCtr=1 Model crash detected, will try to restart... 
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=1864, iMonCtr=1 Model crash detected, will try to restart... Unhandled Exception Detected... - Unhandled Exception Record - Reason: Access Violation (0xc0000005) at address 0x76F73A93 read attempt to address 0x4031809C Engaging BOINC Windows Runtime Debugger... Unhandled Exception Detected... - Unhandled Exception Record - Reason: Access Violation (0xc0000005) at address 0x76F77353 read attempt to address 0xFFFFFFF8 Engaging BOINC Windows Runtime Debugger... Cannot serialize file C:\ProgramData\BOINC/projects/climateprediction.net/hadcm3n_u4bf_1980_40_007460793/dataout/shmem_restart.day Signal 11 received, exiting... Called boinc_finish </stderr_txt> ]]>
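
The Stderr field above repeats a handful of message types many times. A minimal sketch, assuming the stderr text has already been copied into a string, of tallying how often each type occurs. The label names in `PATTERNS` and the `tally` helper are hypothetical and are not part of BOINC or CPDN; the excerpt is a few messages selected from the log, not the full output.

```python
import re

# A few messages copied from the Stderr field above (not the full log).
excerpt = (
    "CPDN Monitor - Quit request from BOINC... "
    "16:01:43 (2052): No heartbeat from core client for 30 sec - exiting "
    "CPDN Monitor - No 'heartbeat' from BOINC... "
    "Model crash detected, will try to restart... "
    "Signal 11 received, exiting..."
)

# Hypothetical labels mapped to literal fragments that appear in the log.
PATTERNS = {
    "quit_request": r"Quit request from BOINC",
    "no_heartbeat": r"No heartbeat from core client",
    "model_crash": r"Model crash detected",
    "access_violation": r"Access Violation",
    "signal": r"Signal \d+ received",
}

def tally(text):
    """Count non-overlapping occurrences of each pattern in text."""
    return {name: len(re.findall(pattern, text)) for name, pattern in PATTERNS.items()}

if __name__ == "__main__":
    for name, count in tally(excerpt).items():
        print(f"{name}: {count}")
```

Running `tally` on the full stderr text instead of the excerpt would give the counts for this task.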

Latest Trickles Received
Time Sent (UTC)	Host ID	Result ID	Result Name	Timestep	CPU Time (sec)	Average (sec/TS)
31 Oct 2011 18:35:21	985784	13416009	hadcm3n_u4bf_1980_40_007460793_2	1,036,800	1,897,027	1.8297
31 Oct 2011 18:16:27	985784	13416009	hadcm3n_u4bf_1980_40_007460793_2	1,010,880	1,849,725	1.8298
31 Oct 2011 17:33:00	985784	13416009	hadcm3n_u4bf_1980_40_007460793_2	984,960	1,802,941	1.8305
31 Oct 2011 16:59:43	985784	13416009	hadcm3n_u4bf_1980_40_007460793_2	959,040	1,756,190	1.8312
31 Oct 2011 15:52:22	985784	13416009	hadcm3n_u4bf_1980_40_007460793_2	933,120	1,708,797	1.8313
31 Oct 2011 14:01:07	985784	13416009	hadcm3n_u4bf_1980_40_007460793_2	907,200	1,661,969	1.8320
31 Oct 2011 14:01:06	985784	13416009	hadcm3n_u4bf_1980_40_007460793_2	881,280	1,614,564	1.8321
31 Oct 2011 14:01:06	985784	13416009	hadcm3n_u4bf_1980_40_007460793_2	855,360	1,567,367	1.8324
31 Oct 2011 14:01:06	985784	13416009	hadcm3n_u4bf_1980_40_007460793_2	829,440	1,520,569	1.8332
31 Oct 2011 14:01:06	985784	13416009	hadcm3n_u4bf_1980_40_007460793_2	803,520	1,473,753	1.8341
31 Oct 2011 14:01:06	985784	13416009	hadcm3n_u4bf_1980_40_007460793_2	777,600	1,427,102	1.8353
31 Oct 2011 14:01:05	985784	13416009	hadcm3n_u4bf_1980_40_007460793_2	751,680	1,380,264	1.8362
31 Oct 2011 14:01:05	985784	13416009	hadcm3n_u4bf_1980_40_007460793_2	725,760	1,332,992	1.8367
18 Oct 2011 21:42:22	985784	13416009	hadcm3n_u4bf_1980_40_007460793_2	699,840	1,285,581	1.8370
17 Oct 2011 22:22:39	985784	13416009	hadcm3n_u4bf_1980_40_007460793_2	673,920	1,238,800	1.8382
16 Oct 2011 21:55:49	985784	13416009	hadcm3n_u4bf_1980_40_007460793_2	648,000	1,191,740	1.8391
16 Oct 2011 07:41:01	985784	13416009	hadcm3n_u4bf_1980_40_007460793_2	622,080	1,144,412	1.8397
15 Oct 2011 18:33:27	985784	13416009	hadcm3n_u4bf_1980_40_007460793_2	596,160	1,097,420	1.8408
15 Oct 2011 04:28:36	985784	13416009	hadcm3n_u4bf_1980_40_007460793_2	570,240	1,050,518	1.8422
14 Oct 2011 15:51:11	985784	13416009	hadcm3n_u4bf_1980_40_007460793_2	544,320	1,003,258	1.8431
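
The Average (sec/TS) column appears to be the cumulative CPU time divided by the timestep count. A minimal sketch that checks this against three rows copied from the table; the `seconds_per_timestep` helper is a hypothetical name, and the formula is inferred from the figures rather than confirmed by the project.

```python
# Rows copied from the trickle table: (timestep, cpu_time_sec, reported_average)
trickles = [
    (1_036_800, 1_897_027, 1.8297),
    (1_010_880, 1_849_725, 1.8298),
    (544_320, 1_003_258, 1.8431),
]

def seconds_per_timestep(cpu_time_sec, timestep):
    """Assumed formula: cumulative CPU seconds divided by timesteps completed."""
    return cpu_time_sec / timestep

if __name__ == "__main__":
    for timestep, cpu_time_sec, reported in trickles:
        computed = seconds_per_timestep(cpu_time_sec, timestep)
        print(f"{timestep:>9,}  computed={computed:.4f}  reported={reported:.4f}")
```

For these three rows the computed value matches the reported average to four decimal places.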