Task 16040187

Name	hadcm3n_ob3r_1900_40_008469514_0
Workunit	8620353
Created	27 Sep 2013, 9:48:48 UTC
Sent	3 Oct 2013, 15:16:03 UTC
Report deadline	2 Jan 2014, 22:43:14 UTC
Received	6 Nov 2013, 18:27:36 UTC
Server state	Over
Outcome	Computation error
Client state	Compute error
Exit status	193 (0x000000C1) EXIT_SIGNAL
Computer ID	1239712
Run time	19 days 0 hours 0 min 25 sec
CPU time	17 days 21 hours 4 min 9 sec
Validate state	Invalid
Credit	6,220.80
Device peak FLOPS	1.41 GFLOPS
Application version	UK Met Office Coupled Model Full Resolution Ocean v6.07 windows_intelx86
Stderr	<core_client_version>7.0.64</core_client_version>
<![CDATA[
<message>
(unknown error) - exit code 193 (0xc1)
</message>
<stderr_txt>
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5844, iMonCtr=1
Model crash detected, will try to restart...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4736, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=6180, iMonCtr=1
Model crash detected, will try to restart...
CPDN Monitor - Quit request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=8116, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=6460, iMonCtr=1
Model crash detected, will try to restart...
CPDN Monitor - Quit request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=7196, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5624, iMonCtr=1
Model crash detected, will try to restart...
Suspended CPDN Monitor - Suspend request from BOINC...
CPDN Monitor - Quit request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=6556, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=6760, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=6760, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5132, iMonCtr=1
Model crash detected, will try to restart...
CPDN Monitor - Quit request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=6904, iMonCtr=1
Model crash detected, will try to restart...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=6152, iMonCtr=1
Model crash detected, will try to restart...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=8360, iMonCtr=1
Model crash detected, will try to restart...
CPDN Monitor - Quit request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=6340, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3400, iMonCtr=1
Model crash detected, will try to restart...
CPDN Monitor - Quit request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=1772, iMonCtr=1
Model crash detected, will try to restart...
Suspended CPDN Monitor - Suspend request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=6728, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=6752, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5568, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=2804, iMonCtr=1
Model crash detected, will try to restart...
CPDN Monitor - Quit request from BOINC...
Signal 11 received, exiting...
Called boinc_finish
</stderr_txt>
]]>

Latest Trickles Received
Time Sent (UTC)	Host ID	Result ID	Result Name	Timestep	CPU Time (sec)	Average (sec/TS)
06 Nov 2013 18:31:12	1239712	16040187	hadcm3n_ob3r_1900_40_008469514_0	518,400	1,544,640	2.9796
05 Nov 2013 09:33:17	1239712	16040187	hadcm3n_ob3r_1900_40_008469514_0	492,480	1,462,825	2.9703
04 Nov 2013 00:16:18	1239712	16040187	hadcm3n_ob3r_1900_40_008469514_0	466,560	1,382,292	2.9627
02 Nov 2013 16:55:46	1239712	16040187	hadcm3n_ob3r_1900_40_008469514_0	440,640	1,301,143	2.9528
31 Oct 2013 17:21:00	1239712	16040187	hadcm3n_ob3r_1900_40_008469514_0	414,720	1,224,341	2.9522
29 Oct 2013 14:06:10	1239712	16040187	hadcm3n_ob3r_1900_40_008469514_0	388,800	1,145,722	2.9468
27 Oct 2013 20:33:21	1239712	16040187	hadcm3n_ob3r_1900_40_008469514_0	362,880	1,064,825	2.9344
25 Oct 2013 17:59:25	1239712	16040187	hadcm3n_ob3r_1900_40_008469514_0	336,960	989,217	2.9357
24 Oct 2013 15:00:44	1239712	16040187	hadcm3n_ob3r_1900_40_008469514_0	311,040	918,566	2.9532
23 Oct 2013 08:46:43	1239712	16040187	hadcm3n_ob3r_1900_40_008469514_0	285,120	839,079	2.9429
21 Oct 2013 08:48:32	1239712	16040187	hadcm3n_ob3r_1900_40_008469514_0	259,200	763,044	2.9438
19 Oct 2013 16:06:12	1239712	16040187	hadcm3n_ob3r_1900_40_008469514_0	233,280	680,462	2.9169
18 Oct 2013 03:48:46	1239712	16040187	hadcm3n_ob3r_1900_40_008469514_0	207,360	599,019	2.8888
16 Oct 2013 18:41:32	1239712	16040187	hadcm3n_ob3r_1900_40_008469514_0	181,440	523,068	2.8829
15 Oct 2013 08:20:40	1239712	16040187	hadcm3n_ob3r_1900_40_008469514_0	155,520	443,928	2.8545
13 Oct 2013 20:20:39	1239712	16040187	hadcm3n_ob3r_1900_40_008469514_0	129,600	373,259	2.8801
12 Oct 2013 23:30:12	1239712	16040187	hadcm3n_ob3r_1900_40_008469514_0	103,680	306,203	2.9533
11 Oct 2013 18:22:41	1239712	16040187	hadcm3n_ob3r_1900_40_008469514_0	77,760	238,484	3.0669
08 Oct 2013 18:20:24	1239712	16040187	hadcm3n_ob3r_1900_40_008469514_0	51,840	161,566	3.1166
05 Oct 2013 12:15:15	1239712	16040187	hadcm3n_ob3r_1900_40_008469514_0	25,920	78,263	3.0194
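
The Average (sec/TS) column above appears to be the cumulative CPU time divided by the cumulative timestep count; this reading is inferred from the figures in the table, not from any project documentation. A minimal sketch that rechecks three of the rows under that assumption (the names `TRICKLES` and `sec_per_timestep` are illustrative, not part of BOINC or CPDN):

```python
# Rows copied from the trickle table: (timestep, cpu_time_sec, reported_avg).
# Assumption: Average (sec/TS) = cumulative CPU time / cumulative timesteps.
TRICKLES = [
    (518_400, 1_544_640, 2.9796),  # 06 Nov 2013 18:31:12
    (492_480, 1_462_825, 2.9703),  # 05 Nov 2013 09:33:17
    (25_920, 78_263, 3.0194),      # 05 Oct 2013 12:15:15
]


def sec_per_timestep(cpu_time_sec: int, timestep: int) -> float:
    """Return cumulative CPU seconds per model timestep."""
    return cpu_time_sec / timestep


for timestep, cpu_time_sec, reported in TRICKLES:
    computed = sec_per_timestep(cpu_time_sec, timestep)
    # Print the recomputed value next to the value shown in the table.
    print(f"{timestep:>9,d}  computed={computed:.4f}  reported={reported:.4f}")
```

Each trickle in the table advances the timestep count by 25,920, so the same check can be applied to any other row by adding its three values to `TRICKLES`.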