Task 13103914

Name	hadcm3n_yd8t_1900_40_007350023_1
Workunit	7547453
Created	6 Jul 2011, 14:02:58 UTC
Sent	17 Jul 2011, 6:34:52 UTC
Report deadline	16 Oct 2011, 14:02:03 UTC
Received	26 Aug 2011, 7:50:38 UTC
Server state	Over
Outcome	Computation error
Client state	Compute error
Exit status	22 (0x00000016) Unknown error code
Computer ID	1311971
Run time	7 days 16 hours 40 min 17 sec
CPU time	7 days 15 hours 43 min 36 sec
Validate state	Invalid
Credit	4,665.60
Device peak FLOPS	2.54 GFLOPS
Application version	UK Met Office Coupled Model Full Resolution Ocean v6.07 windows_intelx86
Stderr	<core_client_version>6.12.33</core_client_version>
<![CDATA[
<message>
The device does not recognize the command. (0x16) - exit code 22 (0x16)
</message>
<stderr_txt>
BUFFOUT: C I/O Error - Return code = 32
Model crashed: WRITDUMP: BAD BUFFOUT OF DATA tmp/pipe_dummy 2048
22:59:44 (1360): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
BUFFOUT: C I/O Error - Return code = 32
Model crashed: WRITDUMP: BAD BUFFOUT OF DATA tmp/pipe_dummy 2048
BUFFOUT: C I/O Error - Return code = 32
Model crashed: WRITHEAD: I/O error tmp/pipe_dummy 2048
23:06:08 (4724): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
BUFFOUT: C I/O Error - Return code = 32
Model crashed: WRITHEAD: I/O error tmp/pipe_dummy 2048
forrtl: There is not enough space on the disk.
23:17:06 (3096): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
forrtl: There is not enough space on the disk.
23:22:40 (3632): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Quit request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3172, iMonCtr=1
Model crash detected, will try to restart...
Signal 22 received, exiting...
Called boinc_finish
CPDN Monitor - Quit request from BOINC...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4144, iMonCtr=1
Model crash detected, will try to restart...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4144, iMonCtr=1
Model crash detected, will try to restart...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4144, iMonCtr=1
Model crash detected, will try to restart...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4144, iMonCtr=1
Model crash detected, will try to restart...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4144, iMonCtr=1
Model crash detected, will try to restart...
Sorry, too many model crashes! :-(
Called boinc_finish
</stderr_txt>
]]>

Latest Trickles Received
Time Sent (UTC)	Host ID	Result ID	Result Name	Timestep	CPU Time (sec)	Average (sec/TS)
26 Aug 2011 00:36:44	1070959	13103914	hadcm3n_yd8t_1900_40_007350023_1	388,800	636,388	1.6368
25 Aug 2011 12:48:59	1070959	13103914	hadcm3n_yd8t_1900_40_007350023_1	362,880	593,814	1.6364
25 Aug 2011 01:09:54	1070959	13103914	hadcm3n_yd8t_1900_40_007350023_1	336,960	552,016	1.6382
24 Aug 2011 13:36:32	1070959	13103914	hadcm3n_yd8t_1900_40_007350023_1	311,040	510,742	1.6420
24 Aug 2011 02:07:25	1070959	13103914	hadcm3n_yd8t_1900_40_007350023_1	285,120	469,457	1.6465
23 Aug 2011 14:38:43	1070959	13103914	hadcm3n_yd8t_1900_40_007350023_1	259,200	428,186	1.6520
23 Aug 2011 03:12:54	1070959	13103914	hadcm3n_yd8t_1900_40_007350023_1	233,280	386,930	1.6587
22 Aug 2011 15:41:33	1070959	13103914	hadcm3n_yd8t_1900_40_007350023_1	207,360	345,777	1.6675
22 Aug 2011 03:22:10	1070959	13103914	hadcm3n_yd8t_1900_40_007350023_1	181,440	303,799	1.6744
21 Aug 2011 15:48:06	1070959	13103914	hadcm3n_yd8t_1900_40_007350023_1	155,520	262,236	1.6862
03 Aug 2011 02:33:33	1070959	13103914	hadcm3n_yd8t_1900_40_007350023_1	129,600	218,772	1.6881
02 Aug 2011 14:22:28	1070959	13103914	hadcm3n_yd8t_1900_40_007350023_1	103,680	174,982	1.6877
25 Jul 2011 17:36:37	1070959	13103914	hadcm3n_yd8t_1900_40_007350023_1	77,760	131,262	1.6880
25 Jul 2011 17:15:54	1070959	13103914	hadcm3n_yd8t_1900_40_007350023_1	51,840	87,586	1.6895
25 Jul 2011 16:26:35	1070959	13103914	hadcm3n_yd8t_1900_40_007350023_1	25,920	43,884	1.6931
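
The "Average (sec/TS)" column in the trickle table appears to be the cumulative CPU time divided by the cumulative timestep count. The sketch below checks that reading against three rows copied from the table. The function name `seconds_per_timestep` and the `TRICKLES` list are illustrative only and are not part of BOINC or the CPDN software.

```python
# Rows copied from the trickle table: (timestep, cpu_time_sec, reported_average).
# Assumption: the reported average is cpu_time_sec / timestep, rounded to 4 places.
TRICKLES = [
    (388_800, 636_388, 1.6368),
    (362_880, 593_814, 1.6364),
    (25_920, 43_884, 1.6931),
]


def seconds_per_timestep(cpu_time_sec, timestep):
    """Return the cumulative average CPU seconds spent per model timestep."""
    return cpu_time_sec / timestep


for timestep, cpu_time_sec, reported in TRICKLES:
    computed = round(seconds_per_timestep(cpu_time_sec, timestep), 4)
    status = "matches" if computed == reported else "differs"
    print(f"timestep {timestep}: computed {computed}, reported {reported} ({status})")
```

Under this assumption the gradual fall from 1.6931 to 1.6368 sec/TS reflects a slowly improving cumulative average over the run, not the speed of any single interval between trickles.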