Task 13956694

Name	hadcm3n_o3u8_1940_40_007693976_1
Workunit	7849084
Created	23 Jan 2012, 19:59:13 UTC
Sent	23 Jan 2012, 20:03:53 UTC
Report deadline	24 Apr 2012, 3:31:04 UTC
Received	16 Feb 2012, 8:51:14 UTC
Server state	Over
Outcome	Computation error
Client state	Compute error
Exit status	22 (0x00000016) Unknown error code
Computer ID	959555
Run time	15 days 2 hours 11 min 54 sec
CPU time	14 days 17 hours 35 min
Validate state	Invalid
Credit	6,842.88
Device peak FLOPS	2.65 GFLOPS
Application version	UK Met Office Coupled Model Full Resolution Ocean v6.07 windows_intelx86
Stderr	<core_client_version>6.10.58</core_client_version> <![CDATA[ <message> The device does not recognize the command. (0x16) - exit code 22 (0x16) </message> <stderr_txt> 10:00:35 (3752): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... 14:41:30 (3220): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... 15:40:29 (4640): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... CPDN Monitor - Quit request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... CPDN Monitor - Quit request from BOINC... 14:12:53 (3472): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... Signal 22 received, exiting... Called boinc_finish Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3844, iMonCtr=1 Model crash detected, will try to restart... Signal 22 received, exiting... Called boinc_finish Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3844, iMonCtr=1 Model crash detected, will try to restart... Signal 22 received, exiting... Called boinc_finish Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3844, iMonCtr=1 Model crash detected, will try to restart... Signal 22 received, exiting... Called boinc_finish Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3844, iMonCtr=1 Model crash detected, will try to restart... Signal 22 received, exiting... Called boinc_finish Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3844, iMonCtr=1 Model crash detected, will try to restart... Signal 22 received, exiting... Called boinc_finish CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... Signal 22 received, exiting... Called boinc_finish Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5984, iMonCtr=1 Model crash detected, will try to restart... Sorry, too many model crashes! :-( Called boinc_finish </stderr_txt> ]]>

Latest Trickles Received
Time Sent (UTC)	Host ID	Result ID	Result Name	Timestep	CPU Time (sec)	Average (sec/TS)
16 Feb 2012 08:54:06	959555	13956694	hadcm3n_o3u8_1940_40_007693976_1	570,240	1,284,909	2.2533
15 Feb 2012 06:23:19	959555	13956694	hadcm3n_o3u8_1940_40_007693976_1	544,320	1,226,709	2.2537
14 Feb 2012 13:26:40	959555	13956694	hadcm3n_o3u8_1940_40_007693976_1	518,400	1,167,993	2.2531
14 Feb 2012 07:08:19	959555	13956694	hadcm3n_o3u8_1940_40_007693976_1	492,480	1,109,137	2.2521
13 Feb 2012 08:00:33	959555	13956694	hadcm3n_o3u8_1940_40_007693976_1	466,560	1,050,394	2.2514
13 Feb 2012 08:00:33	959555	13956694	hadcm3n_o3u8_1940_40_007693976_1	440,640	991,722	2.2506
13 Feb 2012 08:00:33	959555	13956694	hadcm3n_o3u8_1940_40_007693976_1	414,720	932,969	2.2496
13 Feb 2012 08:00:33	959555	13956694	hadcm3n_o3u8_1940_40_007693976_1	388,800	874,403	2.2490
13 Feb 2012 08:00:33	959555	13956694	hadcm3n_o3u8_1940_40_007693976_1	362,880	815,837	2.2482
09 Feb 2012 14:21:59	959555	13956694	hadcm3n_o3u8_1940_40_007693976_1	336,960	757,384	2.2477
09 Feb 2012 14:21:59	959555	13956694	hadcm3n_o3u8_1940_40_007693976_1	311,040	698,598	2.2460
09 Feb 2012 14:21:59	959555	13956694	hadcm3n_o3u8_1940_40_007693976_1	285,120	639,802	2.2440
09 Feb 2012 14:21:59	959555	13956694	hadcm3n_o3u8_1940_40_007693976_1	259,200	581,280	2.2426
09 Feb 2012 14:21:59	959555	13956694	hadcm3n_o3u8_1940_40_007693976_1	233,280	522,768	2.2409
09 Feb 2012 14:21:59	959555	13956694	hadcm3n_o3u8_1940_40_007693976_1	207,360	464,617	2.2406
09 Feb 2012 14:21:59	959555	13956694	hadcm3n_o3u8_1940_40_007693976_1	181,440	406,454	2.2402
09 Feb 2012 14:21:59	959555	13956694	hadcm3n_o3u8_1940_40_007693976_1	155,520	348,310	2.2396
09 Feb 2012 14:21:59	959555	13956694	hadcm3n_o3u8_1940_40_007693976_1	129,600	290,294	2.2399
03 Feb 2012 06:30:17	959555	13956694	hadcm3n_o3u8_1940_40_007693976_1	103,680	232,224	2.2398
02 Feb 2012 15:50:35	959555	13956694	hadcm3n_o3u8_1940_40_007693976_1	77,760	174,107	2.2390