Task 16291392

Name	hadcm3n_of6v_1900_40_008474810_3
Workunit	8625649
Created	16 Feb 2014, 10:23:00 UTC
Sent	16 Feb 2014, 10:23:04 UTC
Report deadline	18 May 2014, 17:50:15 UTC
Received	14 Aug 2014, 18:50:07 UTC
Server state	Over
Outcome	Computation error
Client state	Compute error
Exit status	193 (0x000000C1) EXIT_SIGNAL
Computer ID	1125445
Run time	19 days 15 hours 41 min 4 sec
CPU time	17 days 6 hours 49 min 18 sec
Validate state	Invalid
Credit	9,331.20
Device peak FLOPS	2.68 GFLOPS
Application version	UK Met Office Coupled Model Full Resolution Ocean v6.07 windows_intelx86
Stderr	<core_client_version>7.2.42</core_client_version>
<![CDATA[
<message>
(unknown error) - exit code 193 (0xc1)
</message>
<stderr_txt>
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=6128, iMonCtr=1
Model crash detected, will try to restart...
Suspended CPDN Monitor - Suspend request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5924, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5252, iMonCtr=1
Model crash detected, will try to restart...
Suspended CPDN Monitor - Suspend request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3708, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3788, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3536, iMonCtr=1
Model crash detected, will try to restart...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=6360, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=7152, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=7152, iMonCtr=1
Model crash detected, will try to restart...
Suspended CPDN Monitor - Suspend request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3220, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=1976, iMonCtr=1
Model crash detected, will try to restart...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
CPDN Monitor - Quit request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4004, iMonCtr=1
Model crash detected, will try to restart...
08:09:33 (3360): No heartbeat from core client for 30 sec - exiting
08:09:34 (3360): No heartbeat from core client for 30 sec - exiting
08:09:35 (3360): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3408, iMonCtr=1
Model crash detected, will try to restart...
Signal 11 received, exiting...
Called boinc_finish
</stderr_txt>
]]>

Latest Trickles Received
Time Sent (UTC)	Host ID	Result ID	Result Name	Timestep	CPU Time (sec)	Average (sec/TS)
14 Aug 2014 18:50:20	1125445	16291392	hadcm3n_of6v_1900_40_008474810_3	777,600	1,493,353	1.9205
14 Aug 2014 18:50:20	1125445	16291392	hadcm3n_of6v_1900_40_008474810_3	751,680	1,442,923	1.9196
14 Aug 2014 18:50:20	1125445	16291392	hadcm3n_of6v_1900_40_008474810_3	725,760	1,392,141	1.9182
14 Aug 2014 18:50:19	1125445	16291392	hadcm3n_of6v_1900_40_008474810_3	699,840	1,341,952	1.9175
26 Jul 2014 16:50:16	1125445	16291392	hadcm3n_of6v_1900_40_008474810_3	673,920	1,291,162	1.9159
20 Jul 2014 15:41:28	1125445	16291392	hadcm3n_of6v_1900_40_008474810_3	648,000	1,240,389	1.9142
13 Jul 2014 18:01:34	1125445	16291392	hadcm3n_of6v_1900_40_008474810_3	622,080	1,189,614	1.9123
05 Jul 2014 11:51:51	1125445	16291392	hadcm3n_of6v_1900_40_008474810_3	596,160	1,139,367	1.9112
24 Jun 2014 15:08:25	1125445	16291392	hadcm3n_of6v_1900_40_008474810_3	570,240	1,088,716	1.9092
19 Jun 2014 22:54:34	1125445	16291392	hadcm3n_of6v_1900_40_008474810_3	544,320	1,038,158	1.9073
18 Jun 2014 19:23:53	1125445	16291392	hadcm3n_of6v_1900_40_008474810_3	518,400	987,714	1.9053
10 Jun 2014 14:00:06	1125445	16291392	hadcm3n_of6v_1900_40_008474810_3	492,480	939,842	1.9084
10 Jun 2014 09:01:50	1125445	16291392	hadcm3n_of6v_1900_40_008474810_3	466,560	897,445	1.9235
09 Jun 2014 16:03:59	1125445	16291392	hadcm3n_of6v_1900_40_008474810_3	440,640	853,766	1.9376
29 May 2014 17:23:19	1125445	16291392	hadcm3n_of6v_1900_40_008474810_3	414,720	802,948	1.9361
24 May 2014 15:15:42	1125445	16291392	hadcm3n_of6v_1900_40_008474810_3	388,800	751,786	1.9336
18 May 2014 17:18:06	1125445	16291392	hadcm3n_of6v_1900_40_008474810_3	362,880	701,042	1.9319
12 May 2014 17:31:41	1125445	16291392	hadcm3n_of6v_1900_40_008474810_3	336,960	651,103	1.9323
04 May 2014 09:15:51	1125445	16291392	hadcm3n_of6v_1900_40_008474810_3	311,040	600,185	1.9296
01 May 2014 14:04:36	1125445	16291392	hadcm3n_of6v_1900_40_008474810_3	285,120	549,037	1.9256
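
The Average (sec/TS) column in the trickle table appears to be the cumulative CPU time divided by the timestep count, rounded to four decimal places. The sketch below checks that reading against the first and last rows of the table; the function names `parse_count` and `average_sec_per_timestep` are hypothetical helpers for illustration, not part of BOINC or CPDN.

```python
def parse_count(text: str) -> int:
    """Convert a comma-grouped figure such as '1,493,353' to an int."""
    return int(text.replace(",", ""))


def average_sec_per_timestep(cpu_time_sec: int, timestep: int) -> float:
    """Assumed derivation of the Average column: CPU seconds per timestep."""
    return round(cpu_time_sec / timestep, 4)


# (Timestep, CPU Time) pairs copied from the newest and oldest trickles above.
rows = [
    ("777,600", "1,493,353"),
    ("285,120", "549,037"),
]

for timestep_text, cpu_text in rows:
    avg = average_sec_per_timestep(parse_count(cpu_text), parse_count(timestep_text))
    print(f"{timestep_text} timesteps: {avg:.4f} sec/TS")
```

Both rows reproduce the values listed in the table (1.9205 and 1.9256), which supports the assumed formula for this task.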