Task 13356429

Name	hadcm3n_t3u0_1940_40_007446193_0
Workunit	7643696
Created	9 Sep 2011, 14:59:37 UTC
Sent	17 Sep 2011, 10:42:57 UTC
Report deadline	17 Dec 2011, 18:10:08 UTC
Received	16 Oct 2011, 16:32:09 UTC
Server state	Over
Outcome	Computation error
Client state	Compute error
Exit status	22 (0x00000016) Unknown error code
Computer ID	1107675
Run time	11 days 3 hours 28 min 19 sec
CPU time	9 days 17 hours 52 min 37 sec
Validate state	Invalid
Credit	3,732.48
Device peak FLOPS	2.52 GFLOPS
Application version	UK Met Office Coupled Model Full Resolution Ocean v6.07 windows_intelx86
Stderr	<core_client_version>6.12.34</core_client_version> <![CDATA[ <message> The device does not recognize the command. (0x16) - exit code 22 (0x16) </message> <stderr_txt> Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=2640, iMonCtr=1 Model crash detected, will try to restart... BUFFOUT: C I/O Error - Return code = 32 Model crashed: WRITDUMP: BAD BUFFOUT OF DATA tmp/pipe_dummy 2048 CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=2516, iMonCtr=1 Model crash detected, will try to restart... CPDN Monitor - Quit request from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=2932, iMonCtr=1 Model crash detected, will try to restart... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=2932, iMonCtr=1 Model crash detected, will try to restart... Suspended CPDN Monitor - Suspend request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... Signal 22 received, exiting... Called boinc_finish Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=2724, iMonCtr=1 Model crash detected, will try to restart... Signal 22 received, exiting... Called boinc_finish Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=2724, iMonCtr=1 Model crash detected, will try to restart... Signal 22 received, exiting... Called boinc_finish Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=2724, iMonCtr=1 Model crash detected, will try to restart... Signal 22 received, exiting... Called boinc_finish Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=2724, iMonCtr=1 Model crash detected, will try to restart... CPDN Monitor - Quit request from BOINC... Signal 22 received, exiting... Called boinc_finish Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=1280, iMonCtr=1 Model crash detected, will try to restart... Signal 22 received, exiting... Called boinc_finish Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=1280, iMonCtr=1 Model crash detected, will try to restart... Sorry, too many model crashes! :-( Called boinc_finish </stderr_txt> ]]>

Latest Trickles Received
Time Sent (UTC)	Host ID	Result ID	Result Name	Timestep	CPU Time (sec)	Average (sec/TS)
16 Oct 2011 13:10:19	1107675	13356429	hadcm3n_t3u0_1940_40_007446193_0	311,040	840,103	2.7009
14 Oct 2011 15:35:54	1107675	13356429	hadcm3n_t3u0_1940_40_007446193_0	285,120	770,148	2.7011
13 Oct 2011 12:49:40	1107675	13356429	hadcm3n_t3u0_1940_40_007446193_0	259,200	702,238	2.7093
08 Oct 2011 17:25:06	1107675	13356429	hadcm3n_t3u0_1940_40_007446193_0	233,280	630,612	2.7032
02 Oct 2011 17:40:46	1107675	13356429	hadcm3n_t3u0_1940_40_007446193_0	207,360	560,557	2.7033
30 Sep 2011 21:37:43	1107675	13356429	hadcm3n_t3u0_1940_40_007446193_0	181,440	488,290	2.6912
28 Sep 2011 21:43:49	1107675	13356429	hadcm3n_t3u0_1940_40_007446193_0	155,520	420,811	2.7058
26 Sep 2011 22:00:28	1107675	13356429	hadcm3n_t3u0_1940_40_007446193_0	129,600	351,358	2.7111
24 Sep 2011 05:34:41	1107675	13356429	hadcm3n_t3u0_1940_40_007446193_0	103,680	278,367	2.6849
22 Sep 2011 22:17:08	1107675	13356429	hadcm3n_t3u0_1940_40_007446193_0	77,760	211,120	2.7150
20 Sep 2011 15:32:29	1107675	13356429	hadcm3n_t3u0_1940_40_007446193_0	51,840	141,719	2.7338
18 Sep 2011 21:49:14	1107675	13356429	hadcm3n_t3u0_1940_40_007446193_0	25,920	70,735	2.7290