Task 15999662

Name	hadcm3n_zhno_1920_40_008315949_4
Workunit	8467084
Created	1 Sep 2013, 22:43:57 UTC
Sent	1 Sep 2013, 22:53:12 UTC
Report deadline	2 Dec 2013, 6:20:23 UTC
Received	21 Dec 2013, 0:21:25 UTC
Server state	Over
Outcome	Computation error
Client state	Compute error
Exit status	22 (0x00000016) Unknown error code
Computer ID	1281494
Run time	8 days 3 hours 46 min 4 sec
CPU time	7 days 22 hours 32 min 11 sec
Validate state	Invalid
Credit	5,598.72
Device peak FLOPS	3.16 GFLOPS
Application version	UK Met Office Coupled Model Full Resolution Ocean v6.07 windows_intelx86
Stderr	<core_client_version>7.0.64</core_client_version> <![CDATA[ <message> The device does not recognize the command. (0x16) - exit code 22 (0x16) </message> <stderr_txt> CPDN Monitor - Quit request from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=6360, iMonCtr=1 Model crash detected, will try to restart... CPDN Monitor - Quit request from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=7076, iMonCtr=1 Model crash detected, will try to restart... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5976, iMonCtr=1 Model crash detected, will try to restart... Suspended CPDN Monitor - Suspend request from BOINC... 15:20:29 (5372): No heartbeat from core client for 30 sec - exiting 15:20:30 (5372): No heartbeat from core client for 30 sec - exiting 15:20:31 (5372): No heartbeat from core client for 30 sec - exiting 15:20:32 (5372): No heartbeat from core client for 30 sec - exiting 15:20:33 (5372): No heartbeat from core client for 30 sec - exiting 15:20:34 (5372): No heartbeat from core client for 30 sec - exiting 15:20:36 (5372): No heartbeat from core client for 30 sec - exiting 15:20:37 (5372): No heartbeat from core client for 30 sec - exiting 15:20:38 (5372): No heartbeat from core client for 30 sec - exiting 15:20:39 (5372): No heartbeat from core client for 30 sec - exiting 15:20:40 (5372): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=1904, iMonCtr=1 Model crash detected, will try to restart... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=2084, iMonCtr=1 Model crash detected, will try to restart... CPDN Monitor - Quit request from BOINC... 17:44:49 (5576): No heartbeat from core client for 30 sec - exiting 17:44:50 (5576): No heartbeat from core client for 30 sec - exiting 17:44:51 (5576): No heartbeat from core client for 30 sec - exiting 17:44:52 (5576): No heartbeat from core client for 30 sec - exiting 17:44:54 (5576): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 17:44:55 (5576): No heartbeat from core client for 30 sec - exiting CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... 13:05:17 (1868): No heartbeat from core client for 30 sec - exiting 13:05:18 (1868):CPDN Monitor - No 'heartbeat' from BOINC... No heartbeat from core client for 30 sec - exiting CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... 17:21:41 (4248): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... 12:28:42 (4784): No heartbeat from core client for 30 sec - exiting 12:28:43 (4784): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... 13:39:22 (5136): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 15:04:39 (5804): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... SETPOS: Unit 63 to Word Address -198 Failed with Error Code -1 Model crashed: SETPOS: Unit 63 to Word Address -198 Failed with Error Code -1 SETPOS: Unit 63 to Word Address -198 Failed with Error Code -1 Model crashed: SETPOS: Unit 63 to Word Address -198 Failed with Error Code -1 SETPOS: Unit 63 to Word Address -198 Failed with Error Code -1 Model crashed: SETPOS: Unit 63 to Word Address -198 Failed with Error Code -1 SETPOS: Unit 63 to Word Address -198 Failed with Error Code -1 Model crashed: SETPOS: Unit 63 to Word Address -198 Failed with Error Code -1 SETPOS: Unit 63 to Word Address -198 Failed with Error Code -1 Model crashed: SETPOS: Unit 63 to Word Address -198 Failed with Error Code -1 SETPOS: Unit 63 to Word Address -198 Failed with Error Code -1 Model crashed: SETPOS: Unit 63 to Word Address -198 Failed with Error Code -1 Sorry, too many model crashes! :-( Called boinc_finish </stderr_txt> ]]>

Latest Trickles Received
Time Sent (UTC)	Host ID	Result ID	Result Name	Timestep	CPU Time (sec)	Average (sec/TS)
20 Dec 2013 23:24:07	1281494	15999662	hadcm3n_zhno_1920_40_008315949_4	466,560	683,899	1.4658
15 Dec 2013 22:10:53	1281494	15999662	hadcm3n_zhno_1920_40_008315949_4	440,640	656,058	1.4889
10 Dec 2013 02:29:17	1281494	15999662	hadcm3n_zhno_1920_40_008315949_4	414,720	627,637	1.5134
07 Dec 2013 20:33:37	1281494	15999662	hadcm3n_zhno_1920_40_008315949_4	388,800	599,672	1.5424
03 Dec 2013 01:17:10	1281494	15999662	hadcm3n_zhno_1920_40_008315949_4	362,880	571,356	1.5745
08 Nov 2013 23:49:08	1281494	15999662	hadcm3n_zhno_1920_40_008315949_4	336,960	543,675	1.6135
04 Nov 2013 00:26:21	1281494	15999662	hadcm3n_zhno_1920_40_008315949_4	311,040	515,900	1.6586
02 Nov 2013 22:49:34	1281494	15999662	hadcm3n_zhno_1920_40_008315949_4	285,120	487,481	1.7097
29 Oct 2013 01:34:40	1281494	15999662	hadcm3n_zhno_1920_40_008315949_4	259,200	457,256	1.7641
26 Oct 2013 00:38:25	1281494	15999662	hadcm3n_zhno_1920_40_008315949_4	233,280	418,676	1.7947
14 Oct 2013 01:03:17	1281494	15999662	hadcm3n_zhno_1920_40_008315949_4	207,360	377,529	1.8206
05 Oct 2013 05:22:37	1281494	15999662	hadcm3n_zhno_1920_40_008315949_4	181,440	334,219	1.8420
30 Sep 2013 01:47:05	1281494	15999662	hadcm3n_zhno_1920_40_008315949_4	155,520	287,444	1.8483
26 Sep 2013 23:19:21	1281494	15999662	hadcm3n_zhno_1920_40_008315949_4	129,600	239,254	1.8461
21 Sep 2013 00:12:49	1281494	15999662	hadcm3n_zhno_1920_40_008315949_4	103,680	191,057	1.8428
15 Sep 2013 01:52:12	1281494	15999662	hadcm3n_zhno_1920_40_008315949_4	77,760	142,903	1.8377
13 Sep 2013 22:34:17	1281494	15999662	hadcm3n_zhno_1920_40_008315949_4	51,840	94,744	1.8276
03 Sep 2013 00:21:56	1281494	15999662	hadcm3n_zhno_1920_40_008315949_4	25,920	46,988	1.8128