Task 15686299

Name	hadcm3n_u7dd_2020_40_008337568_1
Workunit	8488429
Created	27 Mar 2013, 3:23:56 UTC
Sent	27 Mar 2013, 3:24:10 UTC
Report deadline	26 Jun 2013, 10:51:21 UTC
Received	13 May 2013, 10:38:16 UTC
Server state	Over
Outcome	Computation error
Client state	Compute error
Exit status	193 (0x000000C1) EXIT_SIGNAL
Computer ID	1146914
Run time	11 days 19 hours 1 min 24 sec
CPU time	11 days 12 hours 38 min 14 sec
Validate state	Invalid
Credit	6,220.80
Device peak FLOPS	2.42 GFLOPS
Application version	UK Met Office Coupled Model Full Resolution Ocean v6.07 windows_intelx86
Stderr	<core_client_version>7.0.28</core_client_version> <![CDATA[ <message> - exit code 193 (0xc1) </message> <stderr_txt> Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=6056, iMonCtr=1 Model crash detected, will try to restart... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5144, iMonCtr=1 Model crash detected, will try to restart... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5144, iMonCtr=1 Model crash detected, will try to restart... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5144, iMonCtr=1 Model crash detected, will try to restart... 13:08:02 (2608): No heartbeat from core client for 30 sec - exiting 13:08:03 (2608): No heartbeat from core client for 30 sec - exiting 13:08:04 (2608): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=6120, iMonCtr=1 Model crash detected, will try to restart... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5932, iMonCtr=1 Model crash detected, will try to restart... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5432, iMonCtr=1 Model crash detected, will try to restart... 09:58:58 (5924): No heartbeat from core client for 30 sec - exiting 09:58:59 (5924): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4328, iMonCtr=1 Model crash detected, will try to restart... Suspended CPDN Monitor - Suspend request from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=6008, iMonCtr=1 Model crash detected, will try to restart... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4752, iMonCtr=1 Model crash detected, will try to restart... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4752, iMonCtr=1 Model crash detected, will try to restart... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5100, iMonCtr=1 Model crash detected, will try to restart... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=2204, iMonCtr=1 Model crash detected, will try to restart... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5844, iMonCtr=1 Model crash detected, will try to restart... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5844, iMonCtr=1 Model crash detected, will try to restart... 11:04:02 (3588): No heartbeat from core client for 30 sec - exiting 11:04:04 (3588): No heartbeat from core client for 30 sec - exiting 11:04:05 (3588): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5692, iMonCtr=1 Model crash detected, will try to restart... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5788, iMonCtr=1 Model crash detected, will try to restart... Suspended CPDN Monitor - Suspend request from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5536, iMonCtr=1 Model crash detected, will try to restart... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5536, iMonCtr=1 Model crash detected, will try to restart... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5536, iMonCtr=1 Model crash detected, will try to restart... Suspended CPDN Monitor - Suspend request from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=6036, iMonCtr=1 Model crash detected, will try to restart... Signal 11 received, exiting... Called boinc_finish </stderr_txt> ]]>

Latest Trickles Received
Time Sent (UTC)	Host ID	Result ID	Result Name	Timestep	CPU Time (sec)	Average (sec/TS)
13 May 2013 07:33:06	1146914	15686299	hadcm3n_u7dd_2020_40_008337568_1	518,400	995,889	1.9211
10 May 2013 11:07:07	1146914	15686299	hadcm3n_u7dd_2020_40_008337568_1	492,480	947,970	1.9249
09 May 2013 09:21:13	1146914	15686299	hadcm3n_u7dd_2020_40_008337568_1	466,560	896,248	1.9210
08 May 2013 00:21:41	1146914	15686299	hadcm3n_u7dd_2020_40_008337568_1	440,640	845,614	1.9191
06 May 2013 03:42:09	1146914	15686299	hadcm3n_u7dd_2020_40_008337568_1	414,720	795,834	1.9190
01 May 2013 03:07:23	1146914	15686299	hadcm3n_u7dd_2020_40_008337568_1	388,800	744,713	1.9154
29 Apr 2013 05:56:53	1146914	15686299	hadcm3n_u7dd_2020_40_008337568_1	362,880	694,510	1.9139
26 Apr 2013 07:22:28	1146914	15686299	hadcm3n_u7dd_2020_40_008337568_1	336,960	639,450	1.8977
25 Apr 2013 05:30:58	1146914	15686299	hadcm3n_u7dd_2020_40_008337568_1	311,040	588,370	1.8916
23 Apr 2013 02:32:01	1146914	15686299	hadcm3n_u7dd_2020_40_008337568_1	285,120	536,944	1.8832
20 Apr 2013 06:48:14	1146914	15686299	hadcm3n_u7dd_2020_40_008337568_1	259,200	488,370	1.8841
19 Apr 2013 03:01:43	1146914	15686299	hadcm3n_u7dd_2020_40_008337568_1	233,280	439,274	1.8830
18 Apr 2013 01:53:03	1146914	15686299	hadcm3n_u7dd_2020_40_008337568_1	207,360	389,527	1.8785
16 Apr 2013 01:27:07	1146914	15686299	hadcm3n_u7dd_2020_40_008337568_1	181,440	341,395	1.8816
11 Apr 2013 06:23:15	1146914	15686299	hadcm3n_u7dd_2020_40_008337568_1	155,520	292,152	1.8785
08 Apr 2013 03:40:11	1146914	15686299	hadcm3n_u7dd_2020_40_008337568_1	129,600	247,572	1.9103
06 Apr 2013 05:07:11	1146914	15686299	hadcm3n_u7dd_2020_40_008337568_1	103,680	194,702	1.8779
03 Apr 2013 07:04:47	1146914	15686299	hadcm3n_u7dd_2020_40_008337568_1	77,760	139,854	1.7985
01 Apr 2013 11:35:30	1146914	15686299	hadcm3n_u7dd_2020_40_008337568_1	51,840	92,156	1.7777
30 Mar 2013 00:31:38	1146914	15686299	hadcm3n_u7dd_2020_40_008337568_1	25,920	45,940	1.7724