Task 15279553

Name	hadcm3n_z926_1880_40_008200457_1
Workunit	8355581
Created	13 Sep 2012, 7:14:43 UTC
Sent	14 Sep 2012, 2:43:23 UTC
Report deadline	14 Dec 2012, 10:10:34 UTC
Received	25 Sep 2012, 2:22:42 UTC
Server state	Over
Outcome	Computation error
Client state	Compute error
Exit status	193 (0x000000C1) EXIT_SIGNAL
Computer ID	775427
Run time	6 days 6 hours 11 min 43 sec
CPU time	5 days 21 hours 3 min 7 sec
Validate state	Invalid
Credit	3,110.40
Device peak FLOPS	2.32 GFLOPS
Application version	UK Met Office Coupled Model Full Resolution Ocean v6.07 windows_intelx86
Stderr	<core_client_version>7.0.28</core_client_version>
<![CDATA[
<message>
 - exit code 193 (0xc1)
</message>
<stderr_txt>
Suspended CPDN Monitor - Suspend request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=8036, iMonCtr=1
Model crash detected, will try to restart...
CPDN Monitor - Quit request from BOINC...
22:14:12 (10828): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=6120, iMonCtr=1
Model crash detected, will try to restart...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=8920, iMonCtr=1
Model crash detected, will try to restart...
CSuspended CPDN Monitor - Suspend request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=6080, iMonCtr=1
Model crash detected, will try to restart...
14:50:19 (4456): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
18:50:55 (7156): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
16:22:51 (2072): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
17:24:29 (6996): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
20:55:48 (3916): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=9852, iMonCtr=1
Model crash detected, will try to restart...
08:01:46 (4208): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
08:01:47 (4208): No heartbeat from core client for 30 sec - exiting
08:01:48 (4208): No heartbeat from core client for 30 sec - exiting
08:01:49 (4208): No heartbeat from core client for 30 sec - exiting
08:01:50 (4208): No heartbeat from core client for 30 sec - exiting
08:01:51 (4208): No heartbeat from core client for 30 sec - exiting
08:01:52 (4208): No heartbeat from core client for 30 sec - exiting
08:01:53 (4208): No heartbeat from core client for 30 sec - exiting
14:26:20 (8056): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
15:27:55 (3880): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
Signal 11 received, exiting...
Called boinc_finish
</stderr_txt>
]]>

Latest Trickles Received
Time Sent (UTC)	Host ID	Result ID	Result Name	Timestep	CPU Time (sec)	Average (sec/TS)
25 Sep 2012 01:26:21	775427	15279553	hadcm3n_z926_1880_40_008200457_1	259,200	507,778	1.9590
23 Sep 2012 23:27:46	775427	15279553	hadcm3n_z926_1880_40_008200457_1	233,280	454,227	1.9471
22 Sep 2012 22:08:27	775427	15279553	hadcm3n_z926_1880_40_008200457_1	207,360	402,129	1.9393
22 Sep 2012 01:41:25	775427	15279553	hadcm3n_z926_1880_40_008200457_1	181,440	350,376	1.9311
21 Sep 2012 02:31:04	775427	15279553	hadcm3n_z926_1880_40_008200457_1	155,520	301,559	1.9390
20 Sep 2012 02:40:14	775427	15279553	hadcm3n_z926_1880_40_008200457_1	129,600	254,605	1.9645
18 Sep 2012 18:33:43	775427	15279553	hadcm3n_z926_1880_40_008200457_1	103,680	202,396	1.9521
17 Sep 2012 15:11:42	775427	15279553	hadcm3n_z926_1880_40_008200457_1	77,760	153,460	1.9735
16 Sep 2012 15:51:14	775427	15279553	hadcm3n_z926_1880_40_008200457_1	51,840	103,247	1.9916
16 Sep 2012 01:07:42	775427	15279553	hadcm3n_z926_1880_40_008200457_1	25,920	52,288	2.0173
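
The Average (sec/TS) column in the trickle table appears to be the cumulative CPU time divided by the cumulative timestep count, rounded to four decimals. The page does not state this formula, so treat it as an assumption; the sketch below simply recomputes the value for each row copied from the table. The names `TRICKLES` and `sec_per_timestep` are illustrative and not part of BOINC or CPDN.

```python
# Rows copied from the trickle table above:
# (timestep, cumulative CPU time in seconds, reported average in sec/TS)
TRICKLES = [
    (259_200, 507_778, 1.9590),
    (233_280, 454_227, 1.9471),
    (207_360, 402_129, 1.9393),
    (181_440, 350_376, 1.9311),
    (155_520, 301_559, 1.9390),
    (129_600, 254_605, 1.9645),
    (103_680, 202_396, 1.9521),
    (77_760, 153_460, 1.9735),
    (51_840, 103_247, 1.9916),
    (25_920, 52_288, 2.0173),
]


def sec_per_timestep(cpu_time_sec, timestep):
    """Assumed formula: cumulative CPU seconds / cumulative timesteps, 4 decimals."""
    return round(cpu_time_sec / timestep, 4)


for timestep, cpu_time_sec, reported in TRICKLES:
    computed = sec_per_timestep(cpu_time_sec, timestep)
    print(f"{timestep:>7,}  {cpu_time_sec:>7,}  "
          f"computed={computed:.4f}  reported={reported:.4f}")
```

Under that assumption the recomputed values can be compared row by row with the reported column. Consecutive rows in the table are 25,920 timesteps apart.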