Task 14104955

Name	hadcm3n_t3zy_1940_40_007753639_0
Workunit	7908748
Created	17 Feb 2012, 11:33:12 UTC
Sent	17 Feb 2012, 11:33:28 UTC
Report deadline	18 May 2012, 19:00:39 UTC
Received	6 Mar 2012, 17:00:22 UTC
Server state	Over
Outcome	Computation error
Client state	Compute error
Exit status	22 (0x00000016) Unknown error code
Computer ID	1196001
Run time	8 days 21 hours 6 min 5 sec
CPU time	7 days 17 hours 50 min 10 sec
Validate state	Invalid
Credit	4,665.60
Device peak FLOPS	2.34 GFLOPS
Application version	UK Met Office Coupled Model Full Resolution Ocean v6.07 windows_intelx86
Stderr	<core_client_version>6.12.34</core_client_version> <![CDATA[ <message> El dispositivo no reconoce el comando. (0x16) - exit code 22 (0x16) </message> <stderr_txt> CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... CPDN Monitor - Quit request from BOINC... 08:56:50 (3472): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... 10:08:36 (3144): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... 09:47:50 (4904): No heartbeat from core client for 30 sec - exiting 09:47:51 (4904): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... 07:49:11 (3564): No heartbeat from core client for 30 sec - exiting 07:49:12 (3564): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 14:29:40 (4368): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=6852, iMonCtr=1 Model crash detected, will try to restart... 09:27:05 (3652): No heartbeat from core client for 30 sec - exiting 09:27:06 (3652): No heartbeat from core client for 30 sec - exiting 09:27:07 (3652): No heartbeat from core client for 30 sec - exiting 09:27:08 (3652): No heartbeat from core client for 30 sec - exiting 09:27:09 (3652): No heartbeat from core client for 30 sec - exiting 09:27:10 (3652): No heartbeat from core client for 30 sec - exiting 09:27:11 (3652): No heartbeat from core client for 30 sec - exiting 09:27:12 (3652): No heartbeat from core client for 30 sec - exiting 09:27:13 (3652): No heartbeat from core client for 30 sec - exiting 09:27:14 (3652): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... Signal 22 received, exiting... Called boinc_finish Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=7164, iMonCtr=1 Model crash detected, will try to restart... Signal 22 received, exiting... Called boinc_finish Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=7164, iMonCtr=1 Model crash detected, will try to restart... Signal 22 received, exiting... Called boinc_finish Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=7164, iMonCtr=1 Model crash detected, will try to restart... Signal 22 received, exiting... Called boinc_finish Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=7164, iMonCtr=1 Model crash detected, will try to restart... Signal 22 received, exiting... Called boinc_finish Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=7164, iMonCtr=1 Model crash detected, will try to restart... Signal 22 received, exiting... Called boinc_finish Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=7164, iMonCtr=1 Model crash detected, will try to restart... Sorry, too many model crashes! :-( Called boinc_finish </stderr_txt> ]]>

Latest Trickles Received
Time Sent (UTC)	Host ID	Result ID	Result Name	Timestep	CPU Time (sec)	Average (sec/TS)
05 Mar 2012 16:08:34	1196001	14104955	hadcm3n_t3zy_1940_40_007753639_0	388,800	639,595	1.6450
04 Mar 2012 15:25:41	1196001	14104955	hadcm3n_t3zy_1940_40_007753639_0	362,880	596,989	1.6451
03 Mar 2012 10:15:32	1196001	14104955	hadcm3n_t3zy_1940_40_007753639_0	336,960	551,035	1.6353
01 Mar 2012 21:44:11	1196001	14104955	hadcm3n_t3zy_1940_40_007753639_0	311,040	506,476	1.6283
01 Mar 2012 07:14:13	1196001	14104955	hadcm3n_t3zy_1940_40_007753639_0	285,120	464,392	1.6288
29 Feb 2012 12:10:19	1196001	14104955	hadcm3n_t3zy_1940_40_007753639_0	259,200	426,392	1.6450
28 Feb 2012 08:51:38	1196001	14104955	hadcm3n_t3zy_1940_40_007753639_0	233,280	385,036	1.6505
25 Feb 2012 14:52:17	1196001	14104955	hadcm3n_t3zy_1940_40_007753639_0	207,360	341,634	1.6475
24 Feb 2012 13:54:28	1196001	14104955	hadcm3n_t3zy_1940_40_007753639_0	181,440	299,438	1.6503
23 Feb 2012 12:08:45	1196001	14104955	hadcm3n_t3zy_1940_40_007753639_0	155,520	256,206	1.6474
22 Feb 2012 11:00:46	1196001	14104955	hadcm3n_t3zy_1940_40_007753639_0	129,600	212,944	1.6431
21 Feb 2012 09:48:19	1196001	14104955	hadcm3n_t3zy_1940_40_007753639_0	103,680	169,079	1.6308
20 Feb 2012 01:14:23	1196001	14104955	hadcm3n_t3zy_1940_40_007753639_0	77,760	126,095	1.6216
19 Feb 2012 12:16:59	1196001	14104955	hadcm3n_t3zy_1940_40_007753639_0	51,840	83,916	1.6188
18 Feb 2012 12:24:05	1196001	14104955	hadcm3n_t3zy_1940_40_007753639_0	25,920	42,287	1.6314