Task 16273384

Name	hadcm3n_7wt5_1980_40_008453804_2
Workunit	8604660
Created	17 Jan 2014, 19:20:46 UTC
Sent	17 Jan 2014, 19:20:55 UTC
Report deadline	19 Apr 2014, 2:48:06 UTC
Received	14 Feb 2014, 19:19:08 UTC
Server state	Over
Outcome	Computation error
Client state	Compute error
Exit status	193 (0x000000C1) EXIT_SIGNAL
Computer ID	1212841
Run time	6 days 0 hours 56 min 26 sec
CPU time	5 days 23 hours 2 min 52 sec
Validate state	Invalid
Credit	6,220.80
Device peak FLOPS	2.90 GFLOPS
Application version	UK Met Office Coupled Model Full Resolution Ocean v6.07 windows_intelx86
Stderr	<core_client_version>7.2.33</core_client_version> <![CDATA[ <message> (unknown error) - exit code 193 (0xc1) </message> <stderr_txt> Suspended CPDN Monitor - Suspend request from BOINC... 19:11:46 (4336): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... ControlleSuspended CPDN Monitor - Suspend request from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=916, iMonCtr=1 Model crash detected, will try to restart... 18:55:26 (2984): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3856, iMonCtr=1 Model crash detected, will try to restart... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5824, iMonCtr=1 Model crash detected, will try to restart... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4576, iMonCtr=1 Model crash detected, will try to restart... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4680, iMonCtr=1 Model crash detected, will try to restart... 12:59:08 (224): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4584, iMonCtr=1 Model crash detected, will try to restart... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4584, iMonCtr=1 Model crash detected, will try to restart... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4584, iMonCtr=1 Model crash detected, will try to restart... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4728, iMonCtr=1 Model crash detected, will try to restart... Suspended CPDN Monitor - Suspend request from BOINC... CController:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4104, iMonCtr=1 Model crash detected, will try to restart... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4992, iMonCtr=1 Model crash detected, will try to restart... Suspended CPDN Monitor - Suspend request from BOINC... Unhandled Exception Detected... - Unhandled Exception Record - Reason: Access Violation (0xc0000005) at address 0x77033AC3 read attempt to address 0x404F80F1 Engaging BOINC Windows Runtime Debugger... Unhandled Exception Detected... - Unhandled Exception Record - Reason: Access Violation (0xc0000005) at address 0x77983AC3 read attempt to address 0x404F80F1 Engaging BOINC Windows Runtime Debugger... Cannot serialize file C:\ProgramData\BOINC/projects/climateprediction.net/hadcm3n_7wt5_1980_40_008453804/dataout/shmem_restart.day Signal 11 received, exiting... Called boinc_finish </stderr_txt> ]]>

Latest Trickles Received
Time Sent (UTC)	Host ID	Result ID	Result Name	Timestep	CPU Time (sec)	Average (sec/TS)
12 Feb 2014 20:25:27	1212841	16273384	hadcm3n_7wt5_1980_40_008453804_2	518,400	510,113	0.9840
09 Feb 2014 20:09:38	1212841	16273384	hadcm3n_7wt5_1980_40_008453804_2	492,480	483,904	0.9826
09 Feb 2014 12:56:24	1212841	16273384	hadcm3n_7wt5_1980_40_008453804_2	466,560	457,945	0.9815
08 Feb 2014 18:42:29	1212841	16273384	hadcm3n_7wt5_1980_40_008453804_2	440,640	432,719	0.9820
08 Feb 2014 12:11:30	1212841	16273384	hadcm3n_7wt5_1980_40_008453804_2	414,720	409,582	0.9876
07 Feb 2014 19:07:46	1212841	16273384	hadcm3n_7wt5_1980_40_008453804_2	388,800	384,247	0.9883
03 Feb 2014 21:21:56	1212841	16273384	hadcm3n_7wt5_1980_40_008453804_2	362,880	358,220	0.9872
02 Feb 2014 10:28:46	1212841	16273384	hadcm3n_7wt5_1980_40_008453804_2	336,960	333,233	0.9889
31 Jan 2014 20:54:31	1212841	16273384	hadcm3n_7wt5_1980_40_008453804_2	311,040	307,416	0.9883
30 Jan 2014 19:08:48	1212841	16273384	hadcm3n_7wt5_1980_40_008453804_2	285,120	280,771	0.9847
26 Jan 2014 21:55:35	1212841	16273384	hadcm3n_7wt5_1980_40_008453804_2	259,200	254,547	0.9820
26 Jan 2014 14:23:08	1212841	16273384	hadcm3n_7wt5_1980_40_008453804_2	233,280	227,804	0.9765
25 Jan 2014 23:10:11	1212841	16273384	hadcm3n_7wt5_1980_40_008453804_2	207,360	201,972	0.9740
25 Jan 2014 16:17:24	1212841	16273384	hadcm3n_7wt5_1980_40_008453804_2	181,440	177,522	0.9784
25 Jan 2014 09:27:17	1212841	16273384	hadcm3n_7wt5_1980_40_008453804_2	155,520	153,002	0.9838
24 Jan 2014 19:14:21	1212841	16273384	hadcm3n_7wt5_1980_40_008453804_2	129,600	127,106	0.9808
20 Jan 2014 19:57:12	1212841	16273384	hadcm3n_7wt5_1980_40_008453804_2	103,680	100,914	0.9733
19 Jan 2014 15:16:49	1212841	16273384	hadcm3n_7wt5_1980_40_008453804_2	77,760	75,811	0.9749
18 Jan 2014 20:37:16	1212841	16273384	hadcm3n_7wt5_1980_40_008453804_2	51,840	50,966	0.9831
18 Jan 2014 13:38:39	1212841	16273384	hadcm3n_7wt5_1980_40_008453804_2	25,920	25,928	1.0003