Task 15900207

Name	hadcm3n_o0x1_2020_40_008401760_0
Workunit	8552616
Created	22 Jul 2013, 14:21:49 UTC
Sent	22 Jul 2013, 14:24:10 UTC
Report deadline	21 Oct 2013, 21:51:21 UTC
Received	14 Aug 2013, 16:54:46 UTC
Server state	Over
Outcome	Computation error
Client state	Compute error
Exit status	22 (0x00000016) Unknown error code
Computer ID	1123512
Run time	6 days 18 hours 11 min 46 sec
CPU time	6 days 1 hours 36 min 5 sec
Validate state	Invalid
Credit	6,531.84
Device peak FLOPS	4.36 GFLOPS
Application version	UK Met Office Coupled Model Full Resolution Ocean v6.07 windows_intelx86
Stderr	<core_client_version>7.2.5</core_client_version> <![CDATA[ <message> Urządzenie nie rozpoznaje polecenia. (0x16) - exit code 22 (0x16) </message> <stderr_txt> Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... 18:39:36 (2488): No heartbeat from core client for 30 sec - exiting Suspended CPDN Monitor - No 'heartbeat' from BOINC... 18:39:37 (2488): No heartbeat from core client for 30 sec - exiting 18:39:38 (2488): No heartbeat from core client for 30 sec - exiting 18:39:39 (2488): No heartbeat from core client for 30 sec - exiting Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4644, iMonCtr=1 Model crash detected, will try to restart... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4644, iMonCtr=1 Model crash detected, will try to restart... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... 00:02:07 (2924): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... Signal 22 received, exiting... Called boinc_finish Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3464, iMonCtr=1 Model crash detected, will try to restart... Signal 22 received, exiting... Called boinc_finish Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3464, iMonCtr=1 Model crash detected, will try to restart... Signal 22 received, exiting... Called boinc_finish Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3464, iMonCtr=1 Model crash detected, will try to restart... Signal 22 received, exiting... Called boinc_finish Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3464, iMonCtr=1 Model crash detected, will try to restart... Signal 22 received, exiting... Called boinc_finish Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3464, iMonCtr=1 Model crash detected, will try to restart... Signal 22 received, exiting... Called boinc_finish Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3464, iMonCtr=1 Model crash detected, will try to restart... Sorry, too many model crashes! :-( Called boinc_finish </stderr_txt> ]]>

Latest Trickles Received
Time Sent (UTC)	Host ID	Result ID	Result Name	Timestep	CPU Time (sec)	Average (sec/TS)
14 Aug 2013 16:57:52	1123512	15900207	hadcm3n_o0x1_2020_40_008401760_0	544,320	516,192	0.9483
14 Aug 2013 16:57:52	1123512	15900207	hadcm3n_o0x1_2020_40_008401760_0	518,400	493,516	0.9520
14 Aug 2013 16:57:52	1123512	15900207	hadcm3n_o0x1_2020_40_008401760_0	492,480	470,794	0.9560
14 Aug 2013 16:57:52	1123512	15900207	hadcm3n_o0x1_2020_40_008401760_0	466,560	448,180	0.9606
14 Aug 2013 16:57:52	1123512	15900207	hadcm3n_o0x1_2020_40_008401760_0	440,640	425,576	0.9658
14 Aug 2013 16:57:52	1123512	15900207	hadcm3n_o0x1_2020_40_008401760_0	414,720	402,616	0.9708
14 Aug 2013 16:57:52	1123512	15900207	hadcm3n_o0x1_2020_40_008401760_0	388,800	380,245	0.9780
14 Aug 2013 16:57:52	1123512	15900207	hadcm3n_o0x1_2020_40_008401760_0	362,880	358,657	0.9884
14 Aug 2013 16:57:52	1123512	15900207	hadcm3n_o0x1_2020_40_008401760_0	336,960	333,358	0.9893
14 Aug 2013 16:57:52	1123512	15900207	hadcm3n_o0x1_2020_40_008401760_0	311,040	311,395	1.0011
14 Aug 2013 16:57:52	1123512	15900207	hadcm3n_o0x1_2020_40_008401760_0	285,120	285,841	1.0025
26 Jul 2013 00:24:06	1123512	15900207	hadcm3n_o0x1_2020_40_008401760_0	259,200	259,690	1.0019
25 Jul 2013 16:08:28	1123512	15900207	hadcm3n_o0x1_2020_40_008401760_0	233,280	233,103	0.9992
25 Jul 2013 08:39:57	1123512	15900207	hadcm3n_o0x1_2020_40_008401760_0	207,360	206,983	0.9982
25 Jul 2013 00:41:05	1123512	15900207	hadcm3n_o0x1_2020_40_008401760_0	181,440	181,214	0.9988
24 Jul 2013 15:39:08	1123512	15900207	hadcm3n_o0x1_2020_40_008401760_0	155,520	155,588	1.0004
24 Jul 2013 08:15:32	1123512	15900207	hadcm3n_o0x1_2020_40_008401760_0	129,600	129,781	1.0014
24 Jul 2013 00:40:16	1123512	15900207	hadcm3n_o0x1_2020_40_008401760_0	103,680	104,012	1.0032
23 Jul 2013 22:13:55	1123512	15900207	hadcm3n_o0x1_2020_40_008401760_0	77,760	77,636	0.9984
23 Jul 2013 22:06:36	1123512	15900207	hadcm3n_o0x1_2020_40_008401760_0	51,840	51,654	0.9964