Task 17353422

Name	hadcm3n_x1ie_1940_40_009149225_0
Workunit	9279561
Created	6 Nov 2014, 12:47:29 UTC
Sent	6 Nov 2014, 12:53:00 UTC
Report deadline	5 Feb 2015, 20:20:11 UTC
Received	9 Oct 2015, 1:30:56 UTC
Server state	Over
Outcome	Computation error
Client state	Compute error
Exit status	22 (0x00000016) Unknown error code
Computer ID	958040
Run time	20 days 9 hours 56 min 29 sec
CPU time	20 days 9 hours 56 min 29 sec
Validate state	Invalid
Credit	11,819.52
Device peak FLOPS	2.21 GFLOPS
Application version	UK Met Office Coupled Model Full Resolution Ocean v6.07 windows_intelx86
Stderr	<core_client_version>6.4.7</core_client_version>
<![CDATA[
<message>
The device does not recognize the command. (0x16) - exit code 22 (0x16)
</message>
<stderr_txt>
CPDN Monitor - Quit request from BOINC...
15:44:58 (4460): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
01:31:31 (5272): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
06:36:05 (5308): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=1328, iMonCtr=1
Model crash detected, will try to restart...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=1328, iMonCtr=1
Model crash detected, will try to restart...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=1328, iMonCtr=1
Model crash detected, will try to restart...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=1328, iMonCtr=1
Model crash detected, will try to restart...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=1328, iMonCtr=1
Model crash detected, will try to restart...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=1328, iMonCtr=1
Model crash detected, will try to restart...
Sorry, too many model crashes! :-(
Called boinc_finish
</stderr_txt>
]]>

Latest Trickles Received
Time Sent (UTC)	Host ID	Result ID	Result Name	Timestep	CPU Time (sec)	Average (sec/TS)
08 Oct 2015 21:40:12	958040	17353422	hadcm3n_x1ie_1940_40_009149225_0	984,960	1,762,117	1.7890
08 Oct 2015 01:36:56	958040	17353422	hadcm3n_x1ie_1940_40_009149225_0	959,040	1,714,839	1.7881
07 Oct 2015 07:37:53	958040	17353422	hadcm3n_x1ie_1940_40_009149225_0	933,120	1,667,776	1.7873
06 Oct 2015 14:00:37	958040	17353422	hadcm3n_x1ie_1940_40_009149225_0	907,200	1,621,643	1.7875
05 Oct 2015 19:21:58	958040	17353422	hadcm3n_x1ie_1940_40_009149225_0	881,280	1,574,923	1.7871
04 Oct 2015 23:54:26	958040	17353422	hadcm3n_x1ie_1940_40_009149225_0	855,360	1,528,224	1.7866
04 Oct 2015 06:02:02	958040	17353422	hadcm3n_x1ie_1940_40_009149225_0	829,440	1,481,920	1.7867
03 Oct 2015 12:28:32	958040	17353422	hadcm3n_x1ie_1940_40_009149225_0	803,520	1,436,145	1.7873
02 Oct 2015 19:25:03	958040	17353422	hadcm3n_x1ie_1940_40_009149225_0	777,600	1,390,766	1.7885
18 Jun 2015 13:07:13	958040	17353422	hadcm3n_x1ie_1940_40_009149225_0	751,680	1,344,320	1.7884
17 Jun 2015 18:26:08	958040	17353422	hadcm3n_x1ie_1940_40_009149225_0	725,760	1,297,199	1.7874
17 Jun 2015 05:39:31	958040	17353422	hadcm3n_x1ie_1940_40_009149225_0	699,840	1,254,181	1.7921
16 Jun 2015 12:21:46	958040	17353422	hadcm3n_x1ie_1940_40_009149225_0	673,920	1,207,714	1.7921
15 Jun 2015 17:59:13	958040	17353422	hadcm3n_x1ie_1940_40_009149225_0	648,000	1,160,589	1.7910
14 Jun 2015 23:16:15	958040	17353422	hadcm3n_x1ie_1940_40_009149225_0	622,080	1,112,671	1.7886
14 Jun 2015 04:44:17	958040	17353422	hadcm3n_x1ie_1940_40_009149225_0	596,160	1,064,960	1.7864
13 Jun 2015 10:34:45	958040	17353422	hadcm3n_x1ie_1940_40_009149225_0	570,240	1,016,892	1.7833
12 Jun 2015 17:26:20	958040	17353422	hadcm3n_x1ie_1940_40_009149225_0	544,320	970,157	1.7823
11 Jun 2015 23:11:12	958040	17353422	hadcm3n_x1ie_1940_40_009149225_0	518,400	921,989	1.7785
11 Jun 2015 05:02:30	958040	17353422	hadcm3n_x1ie_1940_40_009149225_0	492,480	874,878	1.7765
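
The "Average (sec/TS)" column appears to be the cumulative CPU time divided by the cumulative timestep count, rounded to four decimal places. This is inferred from the figures in the table, not from any statement by the project. A minimal Python sketch that checks this assumption against three rows copied from the table (the function name `seconds_per_timestep` is hypothetical):

```python
# Rows copied from the trickle table: (timestep, CPU time in seconds, reported average).
trickles = [
    (984_960, 1_762_117, 1.7890),
    (959_040, 1_714_839, 1.7881),
    (492_480, 874_878, 1.7765),
]


def seconds_per_timestep(timestep, cpu_seconds):
    """Assumed formula: cumulative CPU seconds / cumulative timesteps, to 4 decimals."""
    return round(cpu_seconds / timestep, 4)


for timestep, cpu_seconds, reported in trickles:
    computed = seconds_per_timestep(timestep, cpu_seconds)
    print(f"{timestep:>9,}  computed={computed:.4f}  reported={reported:.4f}")

# Gap between the two most recent trickles, in timesteps.
timestep_gap = trickles[0][0] - trickles[1][0]
print(f"timesteps between the two latest trickles: {timestep_gap:,}")
```

For these rows the computed values agree with the reported column, and the two latest trickles are 25,920 timesteps apart.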