Task 15898835

Name	hadcm3n_n52u_1960_40_008378502_2
Workunit	8529361
Created	21 Jul 2013, 10:08:50 UTC
Sent	21 Jul 2013, 10:08:52 UTC
Report deadline	20 Oct 2013, 17:36:03 UTC
Received	29 Jul 2013, 12:47:44 UTC
Server state	Over
Outcome	Computation error
Client state	Compute error
Exit status	22 (0x00000016) Unknown error code
Computer ID	1380666
Run time	6 days 21 hours 52 min 17 sec
CPU time	6 days 11 hours 26 min 37 sec
Validate state	Invalid
Credit	3,732.48
Device peak FLOPS	2.17 GFLOPS
Application version	UK Met Office Coupled Model Full Resolution Ocean v6.07 windows_intelx86
Stderr	<core_client_version>7.0.64</core_client_version>
<![CDATA[
<message>
The device does not recognize the command. (0x16) - exit code 22 (0x16)
</message>
<stderr_txt>
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Quit request from BOINC...
03:17:05 (2808): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
21:45:49 (6176): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
01:34:27 (6996): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
05:39:08 (6260): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
14:04:28 (2088): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Quit request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Quit request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
14:16:32 (3788): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
13:17:42 (5816): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
02:40:55 (5516): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
08:26:03 (2592): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
01:59:05 (6388): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
15:25:58 (2252): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=6580, iMonCtr=1
Model crash detected, will try to restart...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=6580, iMonCtr=1
Model crash detected, will try to restart...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=6580, iMonCtr=1
Model crash detected, will try to restart...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=6580, iMonCtr=1
Model crash detected, will try to restart...
CPDN Monitor - Quit request from BOINC...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=6868, iMonCtr=1
Model crash detected, will try to restart...
Signal 22 received, exiting...
Called boinc_finish
Suspended CPDN Monitor - Suspend request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=6868, iMonCtr=1
Model crash detected, will try to restart...
Sorry, too many model crashes! :-(
Called boinc_finish
</stderr_txt>
]]>

Latest Trickles Received
Time Sent (UTC)	Host ID	Result ID	Result Name	Timestep	CPU Time (sec)	Average (sec/TS)
29 Jul 2013 12:53:51	1286664	15898835	hadcm3n_n52u_1960_40_008378502_2	311,040	539,327	1.7339
29 Jul 2013 12:53:51	1286664	15898835	hadcm3n_n52u_1960_40_008378502_2	285,120	492,352	1.7268
29 Jul 2013 12:53:51	1286664	15898835	hadcm3n_n52u_1960_40_008378502_2	259,200	444,345	1.7143
26 Jul 2013 19:45:14	1286664	15898835	hadcm3n_n52u_1960_40_008378502_2	233,280	399,585	1.7129
26 Jul 2013 06:26:37	1286664	15898835	hadcm3n_n52u_1960_40_008378502_2	207,360	354,751	1.7108
25 Jul 2013 17:33:40	1286664	15898835	hadcm3n_n52u_1960_40_008378502_2	181,440	310,405	1.7108
25 Jul 2013 03:42:40	1286664	15898835	hadcm3n_n52u_1960_40_008378502_2	155,520	265,708	1.7085
24 Jul 2013 13:47:50	1286664	15898835	hadcm3n_n52u_1960_40_008378502_2	129,600	221,043	1.7056
24 Jul 2013 00:40:20	1286664	15898835	hadcm3n_n52u_1960_40_008378502_2	103,680	176,424	1.7016
23 Jul 2013 22:02:50	1286664	15898835	hadcm3n_n52u_1960_40_008378502_2	77,760	131,915	1.6964
23 Jul 2013 21:47:33	1286664	15898835	hadcm3n_n52u_1960_40_008378502_2	51,840	87,932	1.6962
23 Jul 2013 21:31:21	1286664	15898835	hadcm3n_n52u_1960_40_008378502_2	25,920	43,988	1.6971