Task 15785521

Name	hadcm3n_4ke1_1940_40_008306256_3
Workunit	8457391
Created	15 May 2013, 18:56:06 UTC
Sent	15 May 2013, 18:56:47 UTC
Report deadline	15 Aug 2013, 2:23:58 UTC
Received	25 May 2013, 5:35:53 UTC
Server state	Over
Outcome	Computation error
Client state	Compute error
Exit status	22 (0x00000016) Unknown error code
Computer ID	1189922
Run time	8 days 14 hours 16 min 51 sec
CPU time	1 days 10 hours 31 min 52 sec
Validate state	Invalid
Credit	9,642.24
Device peak FLOPS	4.22 GFLOPS
Application version	UK Met Office Coupled Model Full Resolution Ocean v6.07 windows_intelx86
Stderr	<core_client_version>7.0.64</core_client_version> <![CDATA[ <message> The device does not recognize the command. (0x16) - exit code 22 (0x16) </message> <stderr_txt> Model crashed: P_TH_ADJ : NEGATIVE PRESSURE VALUE CREATED. tmp/pipe_dummy 2048 Signal 11 received, exiting... Called boinc_finish Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=12596, iMonCtr=1 Model crash detected, will try to restart... Suspended CPDN Monitor - Suspend request from BOINC... Signal 11 received, exiting... Called boinc_finish Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=12596, iMonCtr=1 Model crash detected, will try to restart... Signal 11 received, exiting... Called boinc_finish Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=12596, iMonCtr=1 Model crash detected, will try to restart... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=12596, iMonCtr=1 Model crash detected, will try to restart... 13:21:59 (3300): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 13:22:00 (3300): No heartbeat from core client for 30 sec - exiting 13:22:01 (3300): No heartbeat from core client for 30 sec - exiting 13:22:02 (3300): No heartbeat from core client for 30 sec - exiting 13:22:03 (3300): No heartbeat from core client for 30 sec - exiting 13:22:04 (3300): No heartbeat from core client for 30 sec - exiting 13:22:05 (3300): No heartbeat from core client for 30 sec - exiting 13:22:06 (3300): No heartbeat from core client for 30 sec - exiting 13:22:07 (3300): No heartbeat from core client for 30 sec - exiting 13:22:08 (3300): No heartbeat from core client for 30 sec - exiting 13:22:09 (3300): No heartbeat from core client for 30 sec - exiting 08:35:08 (4316): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5852, iMonCtr=1 Model crash detected, will try to restart... Model crashed: ATM_DYN : INVALID THETA DETECTED. tmp/pipe_dummy 2048 Model crashed: ATM_DYN : INVALID THETA DETECTED. tmp/pipe_dummy 2048 Model crashed: ATM_DYN : INVALID THETA DETECTED. tmp/pipe_dummy 2048 Model crashed: ATM_DYN : INVALID THETA DETECTED. tmp/pipe_dummy 2048 Model crashed: ATM_DYN : INVALID THETA DETECTED. tmp/pipe_dummy 2048 Model crashed: ATM_DYN : INVALID THETA DETECTED. tmp/pipe_dummy 2048 Sorry, too many model crashes! :-( Called boinc_finish </stderr_txt> ]]>

Latest Trickles Received
Time Sent (UTC)	Host ID	Result ID	Result Name	Timestep	CPU Time (sec)	Average (sec/TS)
25 May 2013 01:34:35	1189922	15785521	hadcm3n_4ke1_1940_40_008306256_3	803,520	133,758	0.1665
24 May 2013 17:48:03	1189922	15785521	hadcm3n_4ke1_1940_40_008306256_3	777,600	133,910	0.1722
23 May 2013 15:55:58	1189922	15785521	hadcm3n_4ke1_1940_40_008306256_3	751,680	110,430	0.1469
23 May 2013 09:23:19	1189922	15785521	hadcm3n_4ke1_1940_40_008306256_3	725,760	87,049	0.1199
23 May 2013 02:51:00	1189922	15785521	hadcm3n_4ke1_1940_40_008306256_3	699,840	63,746	0.0911
22 May 2013 20:17:08	1189922	15785521	hadcm3n_4ke1_1940_40_008306256_3	673,920	40,203	0.0597
22 May 2013 13:20:03	1189922	15785521	hadcm3n_4ke1_1940_40_008306256_3	648,000	16,930	0.0261
22 May 2013 06:52:35	1189922	15785521	hadcm3n_4ke1_1940_40_008306256_3	622,080	62,580	0.1006
22 May 2013 00:30:07	1189922	15785521	hadcm3n_4ke1_1940_40_008306256_3	596,160	39,708	0.0666
21 May 2013 17:59:44	1189922	15785521	hadcm3n_4ke1_1940_40_008306256_3	570,240	16,641	0.0292
21 May 2013 11:24:00	1189922	15785521	hadcm3n_4ke1_1940_40_008306256_3	544,320	300,121	0.5514
21 May 2013 04:51:37	1189922	15785521	hadcm3n_4ke1_1940_40_008306256_3	518,400	276,829	0.5340
20 May 2013 22:23:53	1189922	15785521	hadcm3n_4ke1_1940_40_008306256_3	492,480	253,752	0.5153
20 May 2013 15:48:41	1189922	15785521	hadcm3n_4ke1_1940_40_008306256_3	466,560	230,458	0.4940
20 May 2013 09:16:05	1189922	15785521	hadcm3n_4ke1_1940_40_008306256_3	440,640	207,204	0.4702
20 May 2013 02:43:47	1189922	15785521	hadcm3n_4ke1_1940_40_008306256_3	414,720	183,943	0.4435
19 May 2013 20:19:47	1189922	15785521	hadcm3n_4ke1_1940_40_008306256_3	388,800	160,867	0.4138
19 May 2013 13:53:54	1189922	15785521	hadcm3n_4ke1_1940_40_008306256_3	362,880	137,985	0.3802
19 May 2013 07:26:45	1189922	15785521	hadcm3n_4ke1_1940_40_008306256_3	336,960	114,928	0.3411
19 May 2013 01:04:25	1189922	15785521	hadcm3n_4ke1_1940_40_008306256_3	311,040	92,052	0.2959