Task 15925251

Name	hadcm3n_4guw_1940_40_008307048_3
Workunit	8458183
Created	18 Aug 2013, 13:23:45 UTC
Sent	18 Aug 2013, 13:24:48 UTC
Report deadline	17 Nov 2013, 20:51:59 UTC
Received	30 Sep 2013, 16:52:20 UTC
Server state	Over
Outcome	Computation error
Client state	Compute error
Exit status	10780 (0x00002A1C) Unknown error code
Computer ID	459222
Run time	7 days 18 hours 40 min 45 sec
CPU time	7 days 16 hours 54 min 47 sec
Validate state	Invalid
Credit	9,953.28
Device peak FLOPS	3.27 GFLOPS
Application version	UK Met Office Coupled Model Full Resolution Ocean v6.07 windows_intelx86
Stderr	<core_client_version>7.0.64</core_client_version> <![CDATA[ <message> (unknown error) - exit code 10780 (0x2a1c) </message> <stderr_txt> Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=10752, iMonCtr=1 Model crash detected, will try to restart... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=10752, iMonCtr=1 Model crash detected, will try to restart... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=10752, iMonCtr=1 Model crash detected, will try to restart... Suspended CPDN Monitor - Suspend request from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=12160, iMonCtr=1 Model crash detected, will try to restart... Suspended CPDN Monitor - Suspend request from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=11236, iMonCtr=1 Model crash detected, will try to restart... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=10556, iMonCtr=1 Model crash detected, will try to restart... C19:11:33 (4820): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 19:11:34 (4820): No heartbeat from core client for 30 sec - exiting 19:11:35 (4820): No heartbeat from core client for 30 sec - exiting 19:11:36 (4820): No heartbeat from core client for 30 sec - exiting 19:11:37 (4820): No heartbeat from core client for 30 sec - exiting 19:11:38 (4820): No heartbeat from core client for 30 sec - exiting 19:11:39 (4820): No heartbeat from core client for 30 sec - exiting 19:11:40 (4820): No heartbeat from core client for 30 sec - exiting 19:11:41 (4820): No heartbeat from core client for 30 sec - exiting 19:11:42 (4820): No heartbeat from core client for 30 sec - exiting 19:11:43 (4820): No heartbeat from core client for 30 sec - exiting Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=1668, iMonCtr=1 Model crash detected, will try to restart... 17:29:47 (10028): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=6780, iMonCtr=1 Model crash detected, will try to restart... </stderr_txt> ]]>

Latest Trickles Received
Time Sent (UTC)	Host ID	Result ID	Result Name	Timestep	CPU Time (sec)	Average (sec/TS)
30 Sep 2013 15:54:53	459222	15925251	hadcm3n_4guw_1940_40_008307048_3	829,440	662,915	0.7992
29 Sep 2013 13:30:52	459222	15925251	hadcm3n_4guw_1940_40_008307048_3	803,520	640,509	0.7971
28 Sep 2013 20:19:45	459222	15925251	hadcm3n_4guw_1940_40_008307048_3	777,600	619,207	0.7963
28 Sep 2013 14:28:36	459222	15925251	hadcm3n_4guw_1940_40_008307048_3	751,680	598,505	0.7962
28 Sep 2013 08:39:40	459222	15925251	hadcm3n_4guw_1940_40_008307048_3	725,760	577,705	0.7960
27 Sep 2013 16:37:24	459222	15925251	hadcm3n_4guw_1940_40_008307048_3	699,840	556,059	0.7946
26 Sep 2013 15:44:28	459222	15925251	hadcm3n_4guw_1940_40_008307048_3	673,920	533,712	0.7920
25 Sep 2013 09:23:31	459222	15925251	hadcm3n_4guw_1940_40_008307048_3	648,000	513,225	0.7920
23 Sep 2013 18:07:02	459222	15925251	hadcm3n_4guw_1940_40_008307048_3	622,080	492,131	0.7911
23 Sep 2013 15:18:41	459222	15925251	hadcm3n_4guw_1940_40_008307048_3	596,160	469,738	0.7879
19 Sep 2013 16:21:36	459222	15925251	hadcm3n_4guw_1940_40_008307048_3	570,240	448,508	0.7865
17 Sep 2013 19:58:47	459222	15925251	hadcm3n_4guw_1940_40_008307048_3	544,320	426,626	0.7838
16 Sep 2013 19:54:52	459222	15925251	hadcm3n_4guw_1940_40_008307048_3	518,400	405,418	0.7821
15 Sep 2013 18:16:45	459222	15925251	hadcm3n_4guw_1940_40_008307048_3	492,480	384,482	0.7807
07 Sep 2013 08:13:57	459222	15925251	hadcm3n_4guw_1940_40_008307048_3	466,560	363,295	0.7787
06 Sep 2013 15:44:11	459222	15925251	hadcm3n_4guw_1940_40_008307048_3	440,640	342,874	0.7781
05 Sep 2013 15:50:58	459222	15925251	hadcm3n_4guw_1940_40_008307048_3	414,720	322,747	0.7782
03 Sep 2013 19:10:13	459222	15925251	hadcm3n_4guw_1940_40_008307048_3	388,800	302,938	0.7792
02 Sep 2013 16:38:21	459222	15925251	hadcm3n_4guw_1940_40_008307048_3	362,880	282,289	0.7779
01 Sep 2013 16:37:27	459222	15925251	hadcm3n_4guw_1940_40_008307048_3	336,960	261,975	0.7775