Task 16831802

Name	hadcm3n_8cij_1980_40_008725638_3
Workunit	8871616
Created	30 Jul 2014, 23:31:30 UTC
Sent	30 Jul 2014, 23:33:23 UTC
Report deadline	30 Oct 2014, 7:00:34 UTC
Received	7 Oct 2014, 3:55:17 UTC
Server state	Over
Outcome	Computation error
Client state	Compute error
Exit status	193 (0x000000C1) EXIT_SIGNAL
Computer ID	1281494
Run time	6 days 3 hours 24 min 27 sec
CPU time	5 days 10 hours 34 min 2 sec
Validate state	Invalid
Credit	3,110.40
Device peak FLOPS	3.12 GFLOPS
Application version	UK Met Office Coupled Model Full Resolution Ocean v6.07 windows_intelx86
Stderr	<core_client_version>7.2.33</core_client_version> <![CDATA[ <message> (unknown error) - exit code 193 (0xc1) </message> <stderr_txt> CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... 14:11:03 (6496): No heartbeat from core client for 30 sec - exiting 14:11:04 (6496): No heartbeat from core client for 30 sec - exiting 14:11:05 (6496): No heartbeat from core client for 30 sec - exiting 14:11:06 (6496): No heartbeat from core client for 30 sec - exiting 14:11:07 (6496): No heartbeat from core client for 30 sec - exiting 14:11:08 (6496): No heartbeat from core client for 30 sec - exiting 14:11:09 (6496): No heartbeat from core client for 30 sec - exiting 14:11:10 (6496): No heartbeat from core client for 30 sec - exiting 14:11:11 (6496): No heartbeat from core client for 30 sec - exiting 14:11:12 (6496): No heartbeat from core client for 30 sec - exiting 14:11:13 (6496): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 14:14:13 (4348): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... 17:25:37 (6892): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... CPDN Monitor - Quit request from BOINC... 17:18:44 (6184): No heartbeat from core client for 30 sec - exiting 17:18:45 (6184): No heartbeat from core client for 30 sec - exiting 17:18:46 (6184): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... CPDN Monitor - Quit request from BOINC... 17:08:34 (6944): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3044, iMonCtr=1 Model crash detected, will try to restart... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=6452, iMonCtr=1 Model crash detected, will try to restart... 14:02:08 (6520): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=6944, iMonCtr=1 Model crash detected, will try to restart... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4776, iMonCtr=1 Model crash detected, will try to restart... 17:28:09 (6756): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... CPDN Monitor - Quit request from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=7956, iMonCtr=1 Model crash detected, will try to restart... CPDN Monitor - Quit request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4292, iMonCtr=1 Model crash detected, will try to restart... 14:23:27 (6572): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... Atmos Hold Restart file rename failed on atmos_restart.hold CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... Signal 11 received, exiting... Called boinc_finish </stderr_txt> ]]>

Latest Trickles Received
Time Sent (UTC)	Host ID	Result ID	Result Name	Timestep	CPU Time (sec)	Average (sec/TS)
07 Oct 2014 03:59:56	1281494	16831802	hadcm3n_8cij_1980_40_008725638_3	259,200	470,037	1.8134
04 Oct 2014 04:11:16	1281494	16831802	hadcm3n_8cij_1980_40_008725638_3	233,280	421,673	1.8076
21 Sep 2014 22:50:55	1281494	16831802	hadcm3n_8cij_1980_40_008725638_3	207,360	373,362	1.8005
20 Sep 2014 01:43:08	1281494	16831802	hadcm3n_8cij_1980_40_008725638_3	181,440	325,757	1.7954
17 Sep 2014 21:21:53	1281494	16831802	hadcm3n_8cij_1980_40_008725638_3	155,520	279,838	1.7994
07 Sep 2014 22:31:30	1281494	16831802	hadcm3n_8cij_1980_40_008725638_3	129,600	232,571	1.7945
31 Aug 2014 02:42:26	1281494	16831802	hadcm3n_8cij_1980_40_008725638_3	103,680	185,893	1.7929
25 Aug 2014 00:45:26	1281494	16831802	hadcm3n_8cij_1980_40_008725638_3	77,760	138,293	1.7785
18 Aug 2014 21:26:32	1281494	16831802	hadcm3n_8cij_1980_40_008725638_3	51,840	91,970	1.7741
07 Aug 2014 01:56:13	1281494	16831802	hadcm3n_8cij_1980_40_008725638_3	25,920	46,426	1.7911