Task 15915452

Name	hadcm3n_3i90_2020_40_008366494_1
Workunit	8517353
Created	14 Aug 2013, 11:40:02 UTC
Sent	14 Aug 2013, 17:57:13 UTC
Report deadline	14 Nov 2013, 1:24:24 UTC
Received	4 Oct 2013, 15:22:43 UTC
Server state	Over
Outcome	Computation error
Client state	Compute error
Exit status	22 (0x00000016) Unknown error code
Computer ID	1135209
Run time	20 days 11 hours 28 min 29 sec
CPU time	19 days 19 hours 31 min 33 sec
Validate state	Invalid
Credit	12,130.56
Device peak FLOPS	2.93 GFLOPS
Application version	UK Met Office Coupled Model Full Resolution Ocean v6.07 windows_intelx86
Stderr	<core_client_version>7.2.5</core_client_version> <![CDATA[ <message> El dispositivo no reconoce el comando. (0x16) - exit code 22 (0x16) </message> <stderr_txt> Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=6036, iMonCtr=1 Model crash detected, will try to restart... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=6036, iMonCtr=1 Model crash detected, will try to restart... Suspended CPDN Monitor - Suspend request from BOINC... 12:56:46 (5932): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 13:19:35 (9208): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 13:34:08 (10836): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 13:38:53 (10576): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 09:40:49 (5860): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 09:56:16 (8592): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 10:03:08 (11920): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 10:23:31 (6064): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 12:20:40 (5612): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 13:45:15 (5372): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=6092, iMonCtr=1 Model crash detected, will try to restart... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=6056, iMonCtr=1 Model crash detected, will try to restart... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5728, iMonCtr=1 Model crash detected, will try to restart... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5224, iMonCtr=1 Model crash detected, will try to restart... Ocean Restart file copy failed on 3i90ko.dap83n0 CSignal 22 received, exiting... Called boinc_finish Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=1340, iMonCtr=1 Model crash detected, will try to restart... Signal 22 received, exiting... Called boinc_finish Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=1340, iMonCtr=1 Model crash detected, will try to restart... Signal 22 received, exiting... Called boinc_finish Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=1340, iMonCtr=1 Model crash detected, will try to restart... Signal 22 received, exiting... Called boinc_finish Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=1340, iMonCtr=1 Model crash detected, will try to restart... Signal 22 received, exiting... Called boinc_finish Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=1340, iMonCtr=1 Model crash detected, will try to restart... Signal 22 received, exiting... Called boinc_finish Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=1340, iMonCtr=1 Model crash detected, will try to restart... Sorry, too many model crashes! :-( Called boinc_finish </stderr_txt> ]]>

Latest Trickles Received
Time Sent (UTC)	Host ID	Result ID	Result Name	Timestep	CPU Time (sec)	Average (sec/TS)
04 Oct 2013 12:33:36	1135209	15915452	hadcm3n_3i90_2020_40_008366494_1	1,010,880	1,702,657	1.6843
03 Oct 2013 15:13:24	1135209	15915452	hadcm3n_3i90_2020_40_008366494_1	984,960	1,654,867	1.6801
02 Oct 2013 17:12:46	1135209	15915452	hadcm3n_3i90_2020_40_008366494_1	959,040	1,607,248	1.6759
01 Oct 2013 18:37:18	1135209	15915452	hadcm3n_3i90_2020_40_008366494_1	933,120	1,559,322	1.6711
30 Sep 2013 18:18:30	1135209	15915452	hadcm3n_3i90_2020_40_008366494_1	907,200	1,511,513	1.6661
27 Sep 2013 20:59:49	1135209	15915452	hadcm3n_3i90_2020_40_008366494_1	881,280	1,464,129	1.6614
27 Sep 2013 06:26:23	1135209	15915452	hadcm3n_3i90_2020_40_008366494_1	855,360	1,416,254	1.6557
26 Sep 2013 08:22:02	1135209	15915452	hadcm3n_3i90_2020_40_008366494_1	829,440	1,369,018	1.6505
25 Sep 2013 09:50:48	1135209	15915452	hadcm3n_3i90_2020_40_008366494_1	803,520	1,321,233	1.6443
25 Sep 2013 09:13:57	1135209	15915452	hadcm3n_3i90_2020_40_008366494_1	777,600	1,273,403	1.6376
23 Sep 2013 14:30:35	1135209	15915452	hadcm3n_3i90_2020_40_008366494_1	751,680	1,225,971	1.6310
20 Sep 2013 16:09:02	1135209	15915452	hadcm3n_3i90_2020_40_008366494_1	725,760	1,178,826	1.6243
19 Sep 2013 18:18:25	1135209	15915452	hadcm3n_3i90_2020_40_008366494_1	699,840	1,132,384	1.6181
18 Sep 2013 20:44:03	1135209	15915452	hadcm3n_3i90_2020_40_008366494_1	673,920	1,085,812	1.6112
18 Sep 2013 07:42:43	1135209	15915452	hadcm3n_3i90_2020_40_008366494_1	648,000	1,040,805	1.6062
18 Sep 2013 07:42:43	1135209	15915452	hadcm3n_3i90_2020_40_008366494_1	648,000	1,040,805	1.6062
17 Sep 2013 12:32:02	1135209	15915452	hadcm3n_3i90_2020_40_008366494_1	622,080	1,000,344	1.6081
16 Sep 2013 16:08:56	1135209	15915452	hadcm3n_3i90_2020_40_008366494_1	596,160	959,699	1.6098
13 Sep 2013 20:28:40	1135209	15915452	hadcm3n_3i90_2020_40_008366494_1	570,240	918,768	1.6112
12 Sep 2013 12:30:41	1135209	15915452	hadcm3n_3i90_2020_40_008366494_1	544,320	877,925	1.6129