Task 15728923

Name	hadcm3n_zera_1960_40_008350335_0
Workunit	8501196
Created	17 Apr 2013, 18:50:05 UTC
Sent	17 Apr 2013, 18:51:59 UTC
Report deadline	18 Jul 2013, 2:19:10 UTC
Received	30 Apr 2013, 17:06:38 UTC
Server state	Over
Outcome	Computation error
Client state	Compute error
Exit status	22 (0x00000016) Unknown error code
Computer ID	1236068
Run time	10 days 22 hours 24 min 48 sec
CPU time	10 days 16 hours 2 min 31 sec
Validate state	Invalid
Credit	4,043.52
Device peak FLOPS	2.18 GFLOPS
Application version	UK Met Office Coupled Model Full Resolution Ocean v6.07 windows_intelx86
Stderr	<core_client_version>7.0.28</core_client_version> <![CDATA[ <message> The device does not recognize the command. (0x16) - exit code 22 (0x16) </message> <stderr_txt> Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... CPDN Monitor - Quit request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... CPDN Monitor - Quit request from BOINC... 14:54:14 (6352): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 06:53:16 (4344): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 23:52:16 (6668): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 15:51:14 (5108): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 06:50:15 (4752): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 23:49:10 (3872): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... 16:48:09 (988): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 09:47:07 (5116): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... CPDN Monitor - Quit request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Signal 11 received, exiting... Called boinc_finish Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3824, iMonCtr=1 Model crash detected, will try to restart... Signal 11 received, exiting... Called boinc_finish Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3824, iMonCtr=1 Model crash detected, will try to restart... Signal 11 received, exiting... Called boinc_finish Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3824, iMonCtr=1 Model crash detected, will try to restart... Signal 11 received, exiting... Called boinc_finish Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3824, iMonCtr=1 Model crash detected, will try to restart... Signal 11 received, exiting... Called boinc_finish Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3824, iMonCtr=1 Model crash detected, will try to restart... Signal 11 received, exiting... Called boinc_finish Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3824, iMonCtr=1 Model crash detected, will try to restart... Sorry, too many model crashes! :-( Called boinc_finish </stderr_txt> ]]>

Latest Trickles Received
Time Sent (UTC)	Host ID	Result ID	Result Name	Timestep	CPU Time (sec)	Average (sec/TS)
30 Apr 2013 06:19:37	1236068	15728923	hadcm3n_zera_1960_40_008350335_0	336,960	921,481	2.7347
29 Apr 2013 10:18:11	1236068	15728923	hadcm3n_zera_1960_40_008350335_0	311,040	850,495	2.7344
28 Apr 2013 14:05:25	1236068	15728923	hadcm3n_zera_1960_40_008350335_0	285,120	779,430	2.7337
27 Apr 2013 18:54:01	1236068	15728923	hadcm3n_zera_1960_40_008350335_0	259,200	708,536	2.7335
26 Apr 2013 16:21:47	1236068	15728923	hadcm3n_zera_1960_40_008350335_0	233,280	637,869	2.7343
25 Apr 2013 20:12:30	1236068	15728923	hadcm3n_zera_1960_40_008350335_0	207,360	568,080	2.7396
25 Apr 2013 00:19:12	1236068	15728923	hadcm3n_zera_1960_40_008350335_0	181,440	498,563	2.7478
24 Apr 2013 04:13:39	1236068	15728923	hadcm3n_zera_1960_40_008350335_0	155,520	428,401	2.7546
23 Apr 2013 09:14:12	1236068	15728923	hadcm3n_zera_1960_40_008350335_0	129,600	358,892	2.7692
22 Apr 2013 12:20:04	1236068	15728923	hadcm3n_zera_1960_40_008350335_0	103,680	288,422	2.7818
20 Apr 2013 15:10:38	1236068	15728923	hadcm3n_zera_1960_40_008350335_0	77,760	216,229	2.7807
19 Apr 2013 11:54:44	1236068	15728923	hadcm3n_zera_1960_40_008350335_0	51,840	143,384	2.7659
18 Apr 2013 16:31:05	1236068	15728923	hadcm3n_zera_1960_40_008350335_0	25,920	72,101	2.7817