Task 15775676

Name	hadcm3n_4gnr_1940_40_008311677_1
Workunit	8462812
Created	11 May 2013, 2:06:16 UTC
Sent	11 May 2013, 13:44:23 UTC
Report deadline	10 Aug 2013, 21:11:34 UTC
Received	21 May 2013, 10:59:40 UTC
Server state	Over
Outcome	Computation error
Client state	Compute error
Exit status	22 (0x00000016) Unknown error code
Computer ID	1238071
Run time	8 days 5 hours 43 min 44 sec
CPU time	7 days 13 hours 52 min 5 sec
Validate state	Invalid
Credit	3,110.40
Device peak FLOPS	2.54 GFLOPS
Application version	UK Met Office Coupled Model Full Resolution Ocean v6.07 i686-apple-darwin
Stderr	<core_client_version>7.0.31</core_client_version>
<![CDATA[
<message>
process exited with code 22 (0x16, -234)
</message>
<stderr_txt>
Suspended CPDN Monitor - Suspend request from BOINC...
hadcm3n_6.07_i686-apple-darwin(808,0xa077f540) malloc: * error for object 0x801004: incorrect checksum for freed object - object was probably modified after being freed.
* set a breakpoint in malloc_error_break to debug
hadcm3n_6.07_i686-apple-darwin(808,0xa077f540) malloc: * error for object 0x801000: incorrect checksum for freed object - object was probably modified after being freed.
* set a breakpoint in malloc_error_break to debug
Worker:: CPDN process is not running, exiting, bRetVal = 0, checkPID=0, selfPID=0, iMonCtr=0
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=808, iMonCtr=1
Model crash detected, will try to restart...
Worker:: CPDN process is not running, exiting, bRetVal = 0, checkPID=0, selfPID=0, iMonCtr=0
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=808, iMonCtr=1
Model crash detected, will try to restart...
Worker:: CPDN process is not running, exiting, bRetVal = 0, checkPID=0, selfPID=0, iMonCtr=0
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=808, iMonCtr=1
Model crash detected, will try to restart...
Worker:: CPDN process is not running, exiting, bRetVal = 0, checkPID=0, selfPID=0, iMonCtr=0
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=808, iMonCtr=1
Model crash detected, will try to restart...
Worker:: CPDN process is not running, exiting, bRetVal = 0, checkPID=0, selfPID=0, iMonCtr=0
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=808, iMonCtr=1
Model crash detected, will try to restart...
Worker:: CPDN process is not running, exiting, bRetVal = 0, checkPID=0, selfPID=0, iMonCtr=0
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=808, iMonCtr=1
Model crash detected, will try to restart...
Sorry, too many model crashes! :-(
Called boinc_finish
</stderr_txt>
]]>
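
The exit message above gives the same code in three forms: 22, 0x16, and -234. One reading consistent with those numbers (an assumption, not something this page states) is that the third value is the low byte of the exit code with all higher bits set, read as a signed integer, which works out to 22 - 256. A minimal Python sketch of that arithmetic, with a hypothetical helper name:

```python
def describe_exit_code(code: int) -> tuple[int, str, int]:
    """Return (decimal, hex string, sign-extended form) for an exit code.

    Assumption: the third value in the log is the low byte with every
    higher bit set, i.e. (code & 0xFF) - 256. This matches the numbers
    shown for this task but is not confirmed by the page itself.
    """
    low_byte = code & 0xFF
    return code, hex(code), low_byte - 256


# Values taken from the <message> line of this task's stderr.
print(describe_exit_code(22))
```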

Latest Trickles Received
Time Sent (UTC)	Host ID	Result ID	Result Name	Timestep	CPU Time (sec)	Average (sec/TS)
19 May 2013 18:21:55	1238071	15775676	hadcm3n_4gnr_1940_40_008311677_1	259,200	650,955	2.5114
18 May 2013 22:08:06	1238071	15775676	hadcm3n_4gnr_1940_40_008311677_1	233,280	585,111	2.5082
18 May 2013 02:26:14	1238071	15775676	hadcm3n_4gnr_1940_40_008311677_1	207,360	519,978	2.5076
17 May 2013 06:44:33	1238071	15775676	hadcm3n_4gnr_1940_40_008311677_1	181,440	454,862	2.5070
16 May 2013 11:53:28	1238071	15775676	hadcm3n_4gnr_1940_40_008311677_1	155,520	389,870	2.5069
15 May 2013 15:47:59	1238071	15775676	hadcm3n_4gnr_1940_40_008311677_1	129,600	324,788	2.5061
14 May 2013 19:43:44	1238071	15775676	hadcm3n_4gnr_1940_40_008311677_1	103,680	259,819	2.5060
14 May 2013 00:10:26	1238071	15775676	hadcm3n_4gnr_1940_40_008311677_1	77,760	194,664	2.5034
13 May 2013 04:41:19	1238071	15775676	hadcm3n_4gnr_1940_40_008311677_1	51,840	129,489	2.4979
12 May 2013 10:16:33	1238071	15775676	hadcm3n_4gnr_1940_40_008311677_1	25,920	65,346	2.5211
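
The Average (sec/TS) column appears to be the cumulative CPU time divided by the cumulative timestep count, rounded to four decimals. The short Python check below reproduces it for a few rows copied from the table; the function name is hypothetical and the rounding rule is an assumption inferred from the figures.

```python
def average_sec_per_timestep(cpu_time_sec: int, timestep: int) -> float:
    """Cumulative CPU seconds per model timestep, rounded to 4 decimals."""
    return round(cpu_time_sec / timestep, 4)


# (timestep, CPU time in seconds, reported average) copied from the table.
trickles = [
    (259_200, 650_955, 2.5114),
    (233_280, 585_111, 2.5082),
    (51_840, 129_489, 2.4979),
    (25_920, 65_346, 2.5211),
]

for timestep, cpu_time, reported in trickles:
    computed = average_sec_per_timestep(cpu_time, timestep)
    print(timestep, computed, reported)
```

Each trickle in the table advances the timestep count by 25,920, so the average is a running figure over the whole task rather than a per-trickle rate.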