|
Name | hadcm3n_yb53_1900_40_007347297_1 |
Workunit | 7544727 |
Created | 6 Jul 2011, 13:43:15 UTC |
Sent | 19 Jul 2011, 2:19:03 UTC |
Report deadline | 18 Oct 2011, 9:46:14 UTC |
Received | 16 Sep 2011, 2:20:24 UTC |
Server state | Over |
Outcome | Computation error |
Client state | Compute error |
Exit status | 22 (0x00000016) Unknown error code |
Computer ID | 1078039 |
Run time | 40 days 5 hours 54 min 15 sec |
CPU time | 36 days 14 hours 20 min 28 sec |
Validate state | Invalid |
Credit | 10,575.36 |
Device peak FLOPS | 1.63 GFLOPS |
Application version | UK Met Office Coupled Model Full Resolution Ocean v6.07 windows_intelx86 |
Stderr | <core_client_version>6.12.33</core_client_version> <![CDATA[ <message> The device does not recognize the command. (0x16) - exit code 22 (0x16) </message> <stderr_txt> 01:26:48 (6560): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 10:14:12 (4696): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... CPDN Monitor - Quit request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... 13:12:30 (5444): No heartbeat from core client for 30 sec - exiting 13:12:31 (5444): No heartbeat from core client for 30 sec - exiting 13:12:32 (5444): No heartbeat from core client for 30 sec - exiting 13:12:33 (5444): No heartbeat from core client for 30 sec - exiting 13:12:34 (5444): No heartbeat from core client for 30 sec - exiting 13:12:35 (5444): No heartbeat from core client for 30 sec - exiting 13:12:36 (5444): No heartbeat from core client for 30 sec - exiting 13:12:37 (5444): No heartbeat from core client for 30 sec - exiting 13:12:38 (5444): No heartbeat from core client for 30 sec - exiting 13:12:39 (5444): No heartbeat from core client for 30 sec - exiting 13:12:41 (5444): No heartbeat from core client for 30 sec - exiting 13:12:42 (5444): No heartbeat from core client for 30 sec - exiting 13:12:43 (5444): No heartbeat from core client for 30 sec - exiting 13:12:44 (5444): No heartbeat from core client for 30 sec - exiting 13:12:45 (5444): No heartbeat from core client for 30 sec - exiting 13:12:46 (5444): No heartbeat from core client for 30 sec - exiting 13:12:47 (5444): No heartbeat from core client for 30 sec - exiting 13:12:48 (5444): No heartbeat from core client for 30 sec - exiting 13:12:49 (5444): No heartbeat from core client for 30 sec - exiting 13:12:50 (5444): No heartbeat from core client for 30 sec - exiting 13:12:51 (5444): No heartbeat from core client for 30 sec - exiting 13:12:53 (5444): No heartbeat from core client for 30 sec - exiting 13:12:54 (5444): No heartbeat from core client for 30 sec - exiting 13:12:55 (5444): No heartbeat from core client for 30 sec - exiting 13:12:56 (5444): No heartbeat from core client for 30 sec - exiting 13:12:57 (5444): No heartbeat from core client for 30 sec - exiting 13:12:58 (5444): No heartbeat from core client for 30 sec - exiting 13:12:59 (5444): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 13:37:25 (5680): No heartbeat from core client for 30 sec - exiting 13:37:27 (5680): No heartbeat from core client for 30 sec - exiting 13:37:28 (5680): No heartbeat from core client for 30 sec - exiting 13:37:29 (5680): No heartbeat from core client for 30 sec - exiting 13:37:30 (5680): No heartbeat from core client for 30 sec - exiting 13:37:31 (5680): No heartbeat from core client for 30 sec - exiting 13:37:32 (5680): No heartbeat from core client for 30 sec - exiting 13:37:33 (5680): No heartbeat from core client for 30 sec - exiting 13:37:34 (5680): No heartbeat from core client for 30 sec - exiting 13:37:35 (5680): No heartbeat from core client for 30 sec - exiting 13:37:37 (5680): No heartbeat from core client for 30 sec - exiting 13:37:38 (5680): No heartbeat from core client for 30 sec - exiting 13:37:39 (5680): No heartbeat from core client for 30 sec - exiting 13:37:40 (5680): No heartbeat from core client for 30 sec - exiting 13:37:41 (5680): No heartbeat from core client for 30 sec - exiting 13:37:42 (5680): No heartbeat from core client for 30 sec - exiting 13:37:43 (5680): No heartbeat from core client for 30 sec - exiting 13:37:44 (5680): No heartbeat from core client for 30 sec - exiting 13:37:45 (5680): No heartbeat from core client for 30 sec - exiting 13:37:46 (5680): No heartbeat from core client for 30 sec - exiting 13:37:48 (5680): No heartbeat from core client for 30 sec - exiting 13:37:49 (5680): No heartbeat from core client for 30 sec - exiting 13:37:50 (5680): No heartbeat from core client for 30 sec - exiting 13:37:51 (5680): No heartbeat from core client for 30 sec - exiting 13:37:52 (5680): No heartbeat from core client for 30 sec - exiting 13:37:53 (5680): No heartbeat from core client for 30 sec - exiting 13:37:54 (5680): No heartbeat from core client for 30 sec - exiting 13:37:55 (5680): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4700, iMonCtr=1 Model crash detected, will try to restart... Suspended CPDN Monitor - Suspend request from BOINC... 08:14:41 (5840): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 09:13:03 (4828): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 09:34:52 (5348): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 10:09:07 (664): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 16:54:11 (2200): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 20:49:45 (5348): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... 07:08:13 (6064): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 23:57:43 (3296): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... CPDN Monitor - Quit request from BOINC... 19:21:48 (3680): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 06:24:00 (2044): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 06:29:14 (4204): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... 16:52:03 (4772): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 23:54:28 (3432): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 00:50:05 (6064): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 03:22:12 (2588): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... CPDN Monitor - Quit request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... CPDN Monitor - Quit request from BOINC... 01:43:41 (3640): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 14:34:51 (1740): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... 01:06:56 (6176): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 02:50:15 (4920): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 02:23:17 (2084): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... 11:41:42 (5940): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... 18:49:17 (5300): No heartbeat from core client for 30 sec - exiting 18:49:18 (5300): No heartbeat from core client for 30 sec - exiting 18:49:19 (5300): No heartbeat from core client for 30 sec - exiting 18:49:20 (5300): No heartbeat from core client for 30 sec - exiting 18:49:21 (5300): No heartbeat from core client for 30 sec - exiting 18:49:22 (5300): No heartbeat from core client for 30 sec - exiting 18:49:23 (5300): No heartbeat from core client for 30 sec - exiting 18:49:24 (5300): No heartbeat from core client for 30 sec - exiting 18:49:25 (5300): No heartbeat from core client for 30 sec - exiting 18:49:26 (5300): No heartbeat from core client for 30 sec - exiting 18:49:27 (5300): No heartbeat from core client for 30 sec - exiting 18:49:28 (5300): No heartbeat from core client for 30 sec - exiting 18:49:29 (5300): No heartbeat from core client for 30 sec - exiting 18:49:30 (5300): No heartbeat from core client for 30 sec - exiting 18:49:31 (5300): No heartbeat from core client for 30 sec - exiting 18:49:32 (5300): No heartbeat from core client for 30 sec - exiting 18:49:33 (5300): No heartbeat from core client for 30 sec - exiting 18:49:34 (5300): No heartbeat from core client for 30 sec - exiting 18:49:35 (5300): No heartbeat from core client for 30 sec - exiting 18:49:36 (5300): No heartbeat from core client for 30 sec - exiting 18:49:37 (5300): No heartbeat from core client for 30 sec - exiting 18:49:38 (5300): No heartbeat from core client for 30 sec - exiting 18:49:39 (5300): No heartbeat from core client for 30 sec - exiting 18:49:40 (5300): No heartbeat from core client for 30 sec - exiting 18:49:41 (5300): No heartbeat from core client for 30 sec - exiting 18:49:42 (5300): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 18:51:53 (4752): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Model crashed: ATM_DYN : INVALID THETA DETECTED. tmp/pipe_dummy 2048 Model crashed: ATM_DYN : INVALID THETA DETECTED. tmp/pipe_dummy 2048 Model crashed: ATM_DYN : INVALID THETA DETECTED. tmp/pipe_dummy 2048 Model crashed: ATM_DYN : INVALID THETA DETECTED. tmp/pipe_dummy 2048 Model crashed: ATM_DYN : INVALID THETA DETECTED. tmp/pipe_dummy 2048 Model crashed: ATM_DYN : INVALID THETA DETECTED. tmp/pipe_dummy 2048 Sorry, too many model crashes! :-( Called boinc_finish </stderr_txt> ]]> |
Latest Trickles Received | ||||||
---|---|---|---|---|---|---|
Time Sent (UTC) | Host ID | Result ID | Result Name | Timestep | CPU Time (sec) | Average (sec/TS) |
15 Sep 2011 11:25:01 | 1078039 | 13098461 | hadcm3n_yb53_1900_40_007347297_1 | 881,280 | 3,223,000 | 3.6572 |
14 Sep 2011 07:35:14 | 1078039 | 13098461 | hadcm3n_yb53_1900_40_007347297_1 | 855,360 | 3,136,645 | 3.6670 |
12 Sep 2011 02:32:26 | 1078039 | 13098461 | hadcm3n_yb53_1900_40_007347297_1 | 829,440 | 3,039,634 | 3.6647 |
02 Sep 2011 22:04:16 | 1078039 | 13098461 | hadcm3n_yb53_1900_40_007347297_1 | 803,520 | 2,913,002 | 3.6253 |
01 Sep 2011 06:58:19 | 1078039 | 13098461 | hadcm3n_yb53_1900_40_007347297_1 | 777,600 | 2,791,384 | 3.5897 |
31 Aug 2011 05:10:27 | 1078039 | 13098461 | hadcm3n_yb53_1900_40_007347297_1 | 751,680 | 2,704,502 | 3.5979 |
30 Aug 2011 03:32:52 | 1078039 | 13098461 | hadcm3n_yb53_1900_40_007347297_1 | 725,760 | 2,618,700 | 3.6082 |
29 Aug 2011 02:34:31 | 1078039 | 13098461 | hadcm3n_yb53_1900_40_007347297_1 | 699,840 | 2,534,190 | 3.6211 |
27 Aug 2011 01:31:00 | 1078039 | 13098461 | hadcm3n_yb53_1900_40_007347297_1 | 673,920 | 2,452,587 | 3.6393 |
26 Aug 2011 01:54:10 | 1078039 | 13098461 | hadcm3n_yb53_1900_40_007347297_1 | 648,000 | 2,370,324 | 3.6579 |
19 Aug 2011 15:43:34 | 1078039 | 13098461 | hadcm3n_yb53_1900_40_007347297_1 | 622,080 | 2,278,473 | 3.6627 |
17 Aug 2011 20:38:15 | 1078039 | 13098461 | hadcm3n_yb53_1900_40_007347297_1 | 596,160 | 2,135,722 | 3.5825 |
16 Aug 2011 01:46:07 | 1078039 | 13098461 | hadcm3n_yb53_1900_40_007347297_1 | 570,240 | 1,993,618 | 3.4961 |
14 Aug 2011 10:51:45 | 1078039 | 13098461 | hadcm3n_yb53_1900_40_007347297_1 | 544,320 | 1,896,249 | 3.4837 |
13 Aug 2011 12:04:05 | 1078039 | 13098461 | hadcm3n_yb53_1900_40_007347297_1 | 518,400 | 1,816,022 | 3.5031 |
12 Aug 2011 10:33:35 | 1078039 | 13098461 | hadcm3n_yb53_1900_40_007347297_1 | 492,480 | 1,729,811 | 3.5124 |
11 Aug 2011 08:29:16 | 1078039 | 13098461 | hadcm3n_yb53_1900_40_007347297_1 | 466,560 | 1,641,873 | 3.5191 |
07 Aug 2011 12:22:47 | 1078039 | 13098461 | hadcm3n_yb53_1900_40_007347297_1 | 440,640 | 1,552,383 | 3.5230 |
06 Aug 2011 02:20:47 | 1078039 | 13098461 | hadcm3n_yb53_1900_40_007347297_1 | 414,720 | 1,460,739 | 3.5222 |
04 Aug 2011 23:55:03 | 1078039 | 13098461 | hadcm3n_yb53_1900_40_007347297_1 | 388,800 | 1,370,413 | 3.5247 |
©2024 climateprediction.net