Task 13014627

Name	hadcm3n_t2jq_1940_40_007311787_0
Workunit	7509217
Created	28 Jun 2011, 0:37:41 UTC
Sent	28 Jun 2011, 0:37:51 UTC
Report deadline	27 Sep 2011, 8:05:02 UTC
Received	4 Nov 2011, 0:22:40 UTC
Server state	Over
Outcome	Computation error
Client state	Compute error
Exit status	193 (0x000000C1) EXIT_SIGNAL
Computer ID	1141592
Run time	26 days 8 hours 7 min 29 sec
CPU time	23 days 11 hours 21 min 30 sec
Validate state	Invalid
Credit	9,331.20
Device peak FLOPS	1.87 GFLOPS
Application version	UK Met Office Coupled Model Full Resolution Ocean v6.07 windows_intelx86
Stderr	<core_client_version>6.10.60</core_client_version> <![CDATA[ <message> - exit code 193 (0xc1) </message> <stderr_txt> CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5860, iMonCtr=1 Model crash detected, will try to restart... Suspended CPDN Monitor - Suspend request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5012, iMonCtr=1 Model crash detected, will try to restart... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=2792, iMonCtr=1 Model crash detected, will try to restart... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=2160, iMonCtr=1 Model crash detected, will try to restart... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3684, iMonCtr=1 Model crash detected, will try to restart... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4972, iMonCtr=1 Model crash detected, will try to restart... CPDN Monitor - Quit request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... BUFFOUT: C I/O Error - Return code = 32 Model crashed: WRITHEAD: I/O error tmp/pipe_dummy 2048 BUFFOUT: C I/O Error - Return code = 32 Model crashed: WRITDUMP: BAD BUFFOUT OF DATA tmp/pipe_dummy 2048 BUFFOUT: C I/O Error - Return code = 32 Model crashed: WRITDUMP: BAD BUFFOUT OF DATA tmp/pipe_dummy 2048 BUFFOUT: C I/O Error - Return code = 32 Model crashed: WRITDUMP: BAD BUFFOUT OF DATA tmp/pipe_dummy 2048 forrtl: There is not enough space on the disk. Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3100, iMonCtr=1 Model crash detected, will try to restart... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3100, iMonCtr=1 Model crash detected, will try to restart... CPDN Monitor - Quit request from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3132, iMonCtr=1 Model crash detected, will try to restart... CPDN Monitor - Quit request from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=6884, iMonCtr=1 Model crash detected, will try to restart... CPDN Monitor - Quit request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3492, iMonCtr=1 Model crash detected, will try to restart... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=7160, iMonCtr=1 Model crash detected, will try to restart... BUFFOUT: C I/O Error - Return code = 32 Model crashed: STWORK : Error in PP_FILE tmp/pipe_dummy 2048 forrtl: There is not enough space on the disk. CPDN Monitor - Quit request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CCPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=2472, iMonCtr=1 Model crash detected, will try to restart... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4564, iMonCtr=1 Model crash detected, will try to restart... Signal 11 received, exiting... Called boinc_finish </stderr_txt> ]]>

Latest Trickles Received
Time Sent (UTC)	Host ID	Result ID	Result Name	Timestep	CPU Time (sec)	Average (sec/TS)
03 Nov 2011 18:57:31	1141592	13014627	hadcm3n_t2jq_1940_40_007311787_0	777,600	2,028,086	2.6081
31 Oct 2011 19:23:39	1141592	13014627	hadcm3n_t2jq_1940_40_007311787_0	751,680	1,961,375	2.6093
31 Oct 2011 17:23:05	1141592	13014627	hadcm3n_t2jq_1940_40_007311787_0	725,760	1,893,658	2.6092
31 Oct 2011 17:23:05	1141592	13014627	hadcm3n_t2jq_1940_40_007311787_0	699,840	1,825,773	2.6088
16 Oct 2011 21:55:48	1141592	13014627	hadcm3n_t2jq_1940_40_007311787_0	673,920	1,757,507	2.6079
11 Oct 2011 00:39:13	1141592	13014627	hadcm3n_t2jq_1940_40_007311787_0	648,000	1,690,313	2.6085
09 Oct 2011 17:43:23	1141592	13014627	hadcm3n_t2jq_1940_40_007311787_0	622,080	1,632,046	2.6235
06 Oct 2011 04:23:51	1141592	13014627	hadcm3n_t2jq_1940_40_007311787_0	596,160	1,574,834	2.6416
28 Sep 2011 06:58:41	1141592	13014627	hadcm3n_t2jq_1940_40_007311787_0	570,240	1,507,041	2.6428
24 Sep 2011 22:32:17	1141592	13014627	hadcm3n_t2jq_1940_40_007311787_0	544,320	1,438,397	2.6426
23 Sep 2011 05:54:35	1141592	13014627	hadcm3n_t2jq_1940_40_007311787_0	518,400	1,368,277	2.6394
18 Sep 2011 20:58:32	1141592	13014627	hadcm3n_t2jq_1940_40_007311787_0	492,480	1,307,881	2.6557
11 Sep 2011 20:28:44	1141592	13014627	hadcm3n_t2jq_1940_40_007311787_0	466,560	1,239,776	2.6573
08 Sep 2011 18:17:13	1141592	13014627	hadcm3n_t2jq_1940_40_007311787_0	440,640	1,171,132	2.6578
05 Sep 2011 03:14:12	1141592	13014627	hadcm3n_t2jq_1940_40_007311787_0	414,720	1,102,949	2.6595
04 Sep 2011 04:13:51	1141592	13014627	hadcm3n_t2jq_1940_40_007311787_0	388,800	1,034,795	2.6615
03 Sep 2011 08:08:55	1141592	13014627	hadcm3n_t2jq_1940_40_007311787_0	362,880	966,147	2.6624
02 Sep 2011 11:50:38	1141592	13014627	hadcm3n_t2jq_1940_40_007311787_0	336,960	896,873	2.6617
01 Sep 2011 15:46:17	1141592	13014627	hadcm3n_t2jq_1940_40_007311787_0	311,040	827,699	2.6611
31 Aug 2011 19:29:02	1141592	13014627	hadcm3n_t2jq_1940_40_007311787_0	285,120	758,304	2.6596