Task 15484294

Name	hadcm3n_o3q8_1980_40_008182401_4
Workunit	8337525
Created	19 Dec 2012, 18:00:26 UTC
Sent	19 Dec 2012, 18:00:36 UTC
Report deadline	21 Mar 2013, 1:27:47 UTC
Received	9 Jan 2013, 17:06:03 UTC
Server state	Over
Outcome	Computation error
Client state	Compute error
Exit status	22 (0x00000016) Unknown error code
Computer ID	1183081
Run time	9 days 15 hours 25 min 5 sec
CPU time	8 days 13 hours 53 min 31 sec
Validate state	Invalid
Credit	6,531.84
Device peak FLOPS	3.05 GFLOPS
Application version	UK Met Office Coupled Model Full Resolution Ocean v6.07 windows_intelx86
Stderr	<core_client_version>7.0.28</core_client_version> <![CDATA[ <message> The device does not recognize the command. (0x16) - exit code 22 (0x16) </message> <stderr_txt> Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=1168, iMonCtr=1 Model crash detected, will try to restart... Suspended CPDN Monitor - Suspend request from BOINC... 12:17:34 (2692): No heartbeat from core client for 30 sec - exiting 12:17:35 (2692): No heartbeat from core client for 30 sec - exiting 12:17:36 (2692): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 09:12:24 (3120): No heartbeat from core client for 30 sec - exiting 09:12:25 (3120): No heartbeat from core client for 30 sec - exiting 09:12:26 (3120): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=2996, iMonCtr=1 Model crash detected, will try to restart... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Model crashed: TEMPHIST: Failed in OPEN of history file tmp/pipe_dummy 2048 Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=2972, iMonCtr=1 Model crash detected, will try to restart... Signal 22 received, exiting... Called boinc_finish Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3248, iMonCtr=1 Model crash detected, will try to restart... Signal 22 received, exiting... Called boinc_finish Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3248, iMonCtr=1 Model crash detected, will try to restart... Signal 22 received, exiting... Called boinc_finish Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3248, iMonCtr=1 Model crash detected, will try to restart... Signal 22 received, exiting... Called boinc_finish Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3248, iMonCtr=1 Model crash detected, will try to restart... Signal 22 received, exiting... Called boinc_finish Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3248, iMonCtr=1 Model crash detected, will try to restart... Signal 22 received, exiting... Called boinc_finish Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3248, iMonCtr=1 Model crash detected, will try to restart... Sorry, too many model crashes! :-( Called boinc_finish </stderr_txt> ]]>

Latest Trickles Received
Time Sent (UTC)	Host ID	Result ID	Result Name	Timestep	CPU Time (sec)	Average (sec/TS)
08 Jan 2013 21:37:15	1183081	15484294	hadcm3n_o3q8_1980_40_008182401_4	544,320	737,540	1.3550
07 Jan 2013 17:52:41	1183081	15484294	hadcm3n_o3q8_1980_40_008182401_4	518,400	702,368	1.3549
06 Jan 2013 16:12:45	1183081	15484294	hadcm3n_o3q8_1980_40_008182401_4	492,480	668,110	1.3566
05 Jan 2013 17:26:25	1183081	15484294	hadcm3n_o3q8_1980_40_008182401_4	466,560	640,497	1.3728
04 Jan 2013 18:47:17	1183081	15484294	hadcm3n_o3q8_1980_40_008182401_4	440,640	605,769	1.3747
03 Jan 2013 14:30:41	1183081	15484294	hadcm3n_o3q8_1980_40_008182401_4	414,720	567,307	1.3679
02 Jan 2013 17:42:31	1183081	15484294	hadcm3n_o3q8_1980_40_008182401_4	388,800	530,501	1.3645
01 Jan 2013 14:30:44	1183081	15484294	hadcm3n_o3q8_1980_40_008182401_4	362,880	495,927	1.3666
31 Dec 2012 16:46:09	1183081	15484294	hadcm3n_o3q8_1980_40_008182401_4	336,960	460,602	1.3669
30 Dec 2012 20:24:26	1183081	15484294	hadcm3n_o3q8_1980_40_008182401_4	311,040	425,828	1.3690
30 Dec 2012 00:27:17	1183081	15484294	hadcm3n_o3q8_1980_40_008182401_4	285,120	390,666	1.3702
29 Dec 2012 13:46:43	1183081	15484294	hadcm3n_o3q8_1980_40_008182401_4	259,200	355,654	1.3721
28 Dec 2012 15:56:05	1183081	15484294	hadcm3n_o3q8_1980_40_008182401_4	233,280	317,445	1.3608
27 Dec 2012 18:19:30	1183081	15484294	hadcm3n_o3q8_1980_40_008182401_4	207,360	280,117	1.3509
26 Dec 2012 21:46:44	1183081	15484294	hadcm3n_o3q8_1980_40_008182401_4	181,440	245,190	1.3514
26 Dec 2012 11:44:35	1183081	15484294	hadcm3n_o3q8_1980_40_008182401_4	155,520	210,453	1.3532
25 Dec 2012 11:38:08	1183081	15484294	hadcm3n_o3q8_1980_40_008182401_4	129,600	175,593	1.3549
23 Dec 2012 22:48:39	1183081	15484294	hadcm3n_o3q8_1980_40_008182401_4	103,680	141,019	1.3601
22 Dec 2012 20:34:20	1183081	15484294	hadcm3n_o3q8_1980_40_008182401_4	77,760	104,032	1.3379
21 Dec 2012 22:21:49	1183081	15484294	hadcm3n_o3q8_1980_40_008182401_4	51,840	66,455	1.2819