Name | hadcm3n_o5ve_1900_40_007440438_3 |
Workunit | 7637941 |
Created | 5 Sep 2011, 22:02:45 UTC |
Sent | 5 Sep 2011, 22:07:59 UTC |
Report deadline | 6 Dec 2011, 5:35:10 UTC |
Received | 21 Sep 2011, 0:41:56 UTC |
Server state | Over |
Outcome | Computation error |
Client state | Compute error |
Exit status | 22 (0x00000016) Unknown error code |
Computer ID | 1122757 |
Run time | 10 days 21 hours 34 min 40 sec |
CPU time | 10 days 17 hours 41 min 56 sec |
Validate state | Invalid |
Credit | 3,421.44 |
Device peak FLOPS | 1.67 GFLOPS |
Application version | UK Met Office Coupled Model Full Resolution Ocean v6.07 windows_intelx86 |
Stderr | <core_client_version>6.12.34</core_client_version> <![CDATA[ <message> The device does not recognize the command. (0x16) - exit code 22 (0x16) </message> <stderr_txt> 02:23:46 (736): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 02:24:04 (736): No heartbeat from core client for 30 sec - exiting 02:24:05 (736): No heartbeat from core client for 30 sec - exiting 02:24:06 (736): No heartbeat from core client for 30 sec - exiting 02:24:07 (736): No heartbeat from core client for 30 sec - exiting 02:24:08 (736): No heartbeat from core client for 30 sec - exiting 02:24:09 (736): No heartbeat from core client for 30 sec - exiting 02:24:10 (736): No heartbeat from core client for 30 sec - exiting 02:24:11 (736): No heartbeat from core client for 30 sec - exiting 02:24:12 (736): No heartbeat from core client for 30 sec - exiting Atmos Hold Restart file rename failed on atmos_restart.hold Suspended CPDN Monitor - Suspend request from BOINC... 14:27:34 (4180): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... Atmos Hold Restart file rename failed on atmos_restart.hold 17:00:43 (6812): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... CPDN Monitor - Quit request from BOINC... 21:04:34 (1728): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 21:04:44 (1728): No heartbeat from core client for 30 sec - exiting 21:04:45 (1728): No heartbeat from core client for 30 sec - exiting 21:04:46 (1728): No heartbeat from core client for 30 sec - exiting 21:04:47 (1728): No heartbeat from core client for 30 sec - exiting 21:04:48 (1728): No heartbeat from core client for 30 sec - exiting 21:04:49 (1728): No heartbeat from core client for 30 sec - exiting 21:04:50 (1728): No heartbeat from core client for 30 sec - exiting 21:04:52 (1728): No heartbeat from core client for 30 sec - exiting 21:04:53 (1728): No heartbeat from core client for 30 sec - exiting 21:04:54 (1728): No heartbeat from core client for 30 sec - exiting 21:04:55 (1728): No heartbeat from core client for 30 sec - exiting 21:04:56 (1728): No heartbeat from core client for 30 sec - exiting 21:04:57 (1728): No heartbeat from core client for 30 sec - exiting 21:04:58 (1728): No heartbeat from core client for 30 sec - exiting 21:04:59 (1728): No heartbeat from core client for 30 sec - exiting 21:05:00 (1728): No heartbeat from core client for 30 sec - exiting 21:05:01 (1728): No heartbeat from core client for 30 sec - exiting 21:05:02 (1728): No heartbeat from core client for 30 sec - exiting 21:05:04 (1728): No heartbeat from core client for 30 sec - exiting 21:05:05 (1728): No heartbeat from core client for 30 sec - exiting 23:40:02 (5448): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 23:44:13 (3696): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 23:44:20 (3696): No heartbeat from core client for 30 sec - exiting 23:44:21 (3696): No heartbeat from core client for 30 sec - exiting 23:44:22 (3696): No heartbeat from core client for 30 sec - exiting 23:44:23 (3696): No heartbeat from core client for 30 sec - exiting 23:44:24 (3696): No heartbeat from core client for 30 sec - exiting 23:44:25 (3696): No heartbeat from core client for 30 sec - exiting 23:44:27 (3696): No heartbeat from core client for 30 sec - exiting 23:56:55 (5652): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 23:57:10 (5652): No heartbeat from core client for 30 sec - exiting 23:57:11 (5652): No heartbeat from core client for 30 sec - exiting 23:57:12 (5652): No heartbeat from core client for 30 sec - exiting 23:57:14 (5652): No heartbeat from core client for 30 sec - exiting 23:57:15 (5652): No heartbeat from core client for 30 sec - exiting 23:57:16 (5652): No heartbeat from core client for 30 sec - exiting 23:57:17 (5652): No heartbeat from core client for 30 sec - exiting 23:57:18 (5652): No heartbeat from core client for 30 sec - exiting 23:57:19 (5652): No heartbeat from core client for 30 sec - exiting 23:57:20 (5652): No heartbeat from core client for 30 sec - exiting 00:07:08 (6016): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 00:09:20 (6844): No heartbeat from core client for 30 sec - exiting 00:09:29 (6844): No heartbeat from core client for 30 sec - exiting 00:09:31 (6844): No heartbeat from core client for 30 sec - exiting 00:09:32 (6844): No heartbeat from core client for 30 sec - exiting 00:09:33 (6844): No heartbeat from core client for 30 sec - exiting 00:09:34 (6844): No heartbeat from core client for 30 sec - exiting 00:09:35 (6844): No heartbeat from core client for 30 sec - exiting 00:09:36 (6844): No heartbeat from core client for 30 sec - exiting 00:09:37 (6844): No heartbeat from core client for 30 sec - exiting 00:09:38 (6844): No heartbeat from core client for 30 sec - exiting 00:09:39 (6844): No heartbeat from core client for 30 sec - exiting 00:09:40 (6844): No heartbeat from core client for 30 sec - exiting 00:09:42 (6844): No heartbeat from core client for 30 sec - exiting 00:09:43 (6844): No heartbeat from core client for 30 sec - exiting 00:09:44 (6844): No heartbeat from core client for 30 sec - exiting 00:09:45 (6844): No heartbeat from core client for 30 sec - exiting 00:09:46 (6844): No heartbeat from core client for 30 sec - exiting 00:09:47 (6844): No heartbeat from core client for 30 sec - exiting 00:09:48 (6844): No heartbeat from core client for 30 sec - exiting 00:09:49 (6844): No heartbeat from core client for 30 sec - exiting 00:09:50 (6844): No heartbeat from core client for 30 sec - exiting 00:09:51 (6844): No heartbeat from core client for 30 sec - exiting 00:09:52 (6844): No heartbeat from core client for 30 sec - exiting 00:09:54 (6844): No heartbeat from core client for 30 sec - exiting 00:10:27 (6844): No heartbeat from core client for 30 sec - exiting 00:10:28 (6844): No heartbeat from core client for 30 sec - exiting 00:10:30 (6844): No heartbeat from core client for 30 sec - exiting 00:10:31 (6844): No heartbeat from core client for 30 sec - exiting 00:10:32 (6844): No heartbeat from core client for 30 sec - exiting 00:10:33 (6844): No heartbeat from core client for 30 sec - exiting 00:10:34 (6844): No heartbeat from core client for 30 sec - exiting 00:10:35 (6844): No heartbeat from core client for 30 sec - exiting 00:10:36 (6844): No heartbeat from core client for 30 sec - exiting 00:10:37 (6844): No heartbeat from core client for 30 sec - exiting 00:10:38 (6844): No heartbeat from core client for 30 sec - exiting 00:10:39 (6844): No heartbeat from core client for 30 sec - exiting 00:10:40 (6844): No heartbeat from core client for 30 sec - exiting 00:10:42 (6844): No heartbeat from core client for 30 sec - exiting 00:10:43 (6844): No heartbeat from core client for 30 sec - exiting 00:10:44 (6844): No heartbeat from core client for 30 sec - exiting 00:10:45 (6844): No heartbeat from core client for 30 sec - exiting 00:10:46 (6844): No heartbeat from core client for 30 sec - exiting 00:10:47 (6844): No heartbeat from core client for 30 sec - exiting 00:10:48 (6844): No heartbeat from core client for 30 sec - exiting 00:10:49 (6844): No heartbeat from core client for 30 sec - exiting 00:10:50 (6844): No heartbeat from core client for 30 sec - exiting 00:10:51 (6844): No heartbeat from core client for 30 sec - exiting 00:10:52 (6844): No heartbeat from core client for 30 sec - exiting 00:10:54 (6844): No heartbeat from core client for 30 sec - exiting 00:10:55 (6844): No heartbeat from core client for 30 sec - exiting 00:10:56 (6844): No heartbeat from core client for 30 sec - exiting 00:10:57 (6844): No heartbeat from core client for 30 sec - exiting 00:10:58 (6844): No heartbeat from core client for 30 sec - exiting 00:10:59 (6844): No heartbeat from core client for 30 sec - exiting 00:11:00 (6844): No heartbeat from core client for 30 sec - exiting 00:11:34 (6844): No heartbeat from core client for 30 sec - exiting 00:11:35 (6844): No heartbeat from core client for 30 sec - exiting 00:11:36 (6844): No heartbeat from core client for 30 sec - exiting 00:11:37 (6844): No heartbeat from core client for 30 sec - exiting 00:11:38 (6844): No heartbeat from core client for 30 sec - exiting 00:11:39 (6844): No heartbeat from core client for 30 sec - exiting 00:11:41 (6844): No heartbeat from core client for 30 sec - exiting 00:11:42 (6844): No heartbeat from core client for 30 sec - exiting 00:11:43 (6844): No heartbeat from core client for 30 sec - exiting 00:11:44 (6844): No heartbeat from core client for 30 sec - exiting 00:11:45 (6844): No heartbeat from core client for 30 sec - exiting 00:11:46 (6844): No heartbeat from core client for 30 sec - exiting 00:11:47 (6844): No heartbeat from core client for 30 sec - exiting 00:11:48 (6844): No heartbeat from core client for 30 sec - exiting 00:11:49 (6844): No heartbeat from core client for 30 sec - exiting 00:11:50 (6844): No heartbeat from core client for 30 sec - exiting 00:11:51 (6844): No heartbeat from core client for 30 sec - exiting 00:11:53 (6844): No heartbeat from core client for 30 sec - exiting 00:11:54 (6844): No heartbeat from core client for 30 sec - exiting 00:11:55 (6844): No heartbeat from core client for 30 sec - exiting 00:11:56 (6844): No heartbeat from core client for 30 sec - exiting 00:11:57 (6844): No heartbeat from core client for 30 sec - exiting 00:11:58 (6844): No heartbeat from core client for 30 sec - exiting 00:11:59 (6844): No heartbeat from core client for 30 sec - exiting 00:12:00 (6844): No heartbeat from core client for 30 sec - exiting 00:12:01 (6844): No heartbeat from core client for 30 sec - exiting 00:12:02 (6844): No heartbeat from core client for 30 sec - exiting 00:12:03 (6844): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 17:03:21 (4788): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 17:06:47 (3432): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 17:09:59 (5840): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 16:59:06 (3900): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 16:59:09 (3900): No heartbeat from core client for 30 sec - exiting 16:59:10 (3900): No heartbeat from core client for 30 sec - exiting 16:59:11 (3900): No heartbeat from core client for 30 sec - exiting 16:59:12 (3900): No heartbeat from core client for 30 sec - exiting 16:59:13 (3900): No heartbeat from core client for 30 sec - exiting 16:59:14 (3900): No heartbeat from core client for 30 sec - exiting 16:59:15 (3900): No heartbeat from core client for 30 sec - exiting 16:59:16 (3900): No heartbeat from core client for 30 sec - exiting 16:59:17 (3900): No heartbeat from core client for 30 sec - exiting 16:59:18 (3900): No heartbeat from core client for 30 sec - exiting 16:59:20 (3900): No heartbeat from core client for 30 sec - exiting 16:59:21 (3900): No heartbeat from core client for 30 sec - exiting 16:59:22 (3900): No heartbeat from core client for 30 sec - exiting 16:59:23 (3900): No heartbeat from core client for 30 sec - exiting 16:59:24 (3900): No heartbeat from core client for 30 sec - exiting 16:59:25 (3900): No heartbeat from core client for 30 sec - exiting 16:59:26 (3900): No heartbeat from core client for 30 sec - exiting 16:59:27 (3900): No heartbeat from core client for 30 sec - exiting 16:59:28 (3900): No heartbeat from core client for 30 sec - exiting 16:59:29 (3900): No heartbeat from core client for 30 sec - exiting 16:59:30 (3900): No heartbeat from core client for 30 sec - exiting 16:59:32 (3900): No heartbeat from core client for 30 sec - exiting 16:59:33 (3900): No heartbeat from core client for 30 sec - exiting 16:59:34 (3900): No heartbeat from core client for 30 sec - exiting 16:59:35 (3900): No heartbeat from core client for 30 sec - exiting 16:59:36 (3900): No heartbeat from core client for 30 sec - exiting 16:59:37 (3900): No heartbeat from core client for 30 sec - exiting 16:59:38 (3900): No heartbeat from core client for 30 sec - exiting 16:59:39 (3900): No heartbeat from core client for 30 sec - exiting 16:59:40 (3900): No heartbeat from core client for 30 sec - exiting 16:59:41 (3900): No heartbeat from core client for 30 sec - exiting 16:59:42 (3900): No heartbeat from core client for 30 sec - exiting 16:59:43 (3900): No heartbeat from core client for 30 sec - exiting 16:59:44 (3900): No heartbeat from core client for 30 sec - exiting 16:59:45 (3900): No heartbeat from core client for 30 sec - exiting 16:59:46 (3900): No heartbeat from core client for 30 sec - exiting 16:59:47 (3900): No heartbeat from core client for 30 sec - exiting 16:59:48 (3900): No heartbeat from core client for 30 sec - exiting 16:59:49 (3900): No heartbeat from core client for 30 sec - exiting 16:59:50 (3900): No heartbeat from core client for 30 sec - exiting 16:59:51 (3900): No heartbeat from core client for 30 sec - exiting 16:59:52 (3900): No heartbeat from core client for 30 sec - exiting 16:59:53 (3900): No heartbeat from core client for 30 sec - exiting 16:59:54 (3900): No heartbeat from core client for 30 sec - exiting 16:59:55 (3900): No heartbeat from core client for 30 sec - exiting 16:59:56 (3900): No heartbeat from core client for 30 sec - exiting 16:59:57 (3900): No heartbeat from core client for 30 sec - exiting 16:59:58 (3900): No heartbeat from core client for 30 sec - exiting 16:59:59 (3900): No heartbeat from core client for 30 sec - exiting 17:00:00 (3900): No heartbeat from core client for 30 sec - exiting 17:00:01 (3900): No heartbeat from core client for 30 sec - exiting 17:00:02 (3900): No heartbeat from core client for 30 sec - exiting 17:00:03 (3900): No heartbeat from core client for 30 sec - exiting 17:00:04 (3900): No heartbeat from core client for 30 sec - exiting 17:00:05 (3900): No heartbeat from core client for 30 sec - exiting 17:00:06 (3900): No heartbeat from core client for 30 sec - exiting 17:00:07 (3900): No heartbeat from core client for 30 sec - exiting 17:00:08 (3900): No heartbeat from core client for 30 sec - exiting 17:00:09 (3900): No heartbeat from core client for 30 sec - exiting 17:00:10 (3900): No heartbeat from core client for 30 sec - exiting 17:00:11 (3900): No heartbeat from core client for 30 sec - exiting 17:00:12 (3900): No heartbeat from core client for 30 sec - exiting 17:00:13 (3900): No heartbeat from core client for 30 sec - exiting 17:00:14 (3900): No heartbeat from core client for 30 sec - exiting 17:00:15 (3900): No heartbeat from core client for 30 sec - exiting 17:00:16 (3900): No heartbeat from core client for 30 sec - exiting 17:00:17 (3900): No heartbeat from core client for 30 sec - exiting 17:00:18 (3900): No heartbeat from core client for 30 sec - exiting 17:00:19 (3900): No heartbeat from core client for 30 sec - exiting 17:00:20 (3900): No heartbeat from core client for 30 sec - exiting 17:00:21 (3900): No heartbeat from core client for 30 sec - exiting 17:02:08 (4012): No heartbeat from core client for 30 sec - exiting 17:02:29 (4012): No heartbeat from core client for 30 sec - exiting 17:02:30 (4012): No heartbeat from core client for 30 sec - exiting 17:02:31 (4012): No heartbeat from core client for 30 sec - exiting 17:02:32 (4012): No heartbeat from core client for 30 sec - exiting 17:03:06 (4012): No heartbeat from core client for 30 sec - exiting 17:03:07 (4012): No heartbeat from core client for 30 sec - exiting 17:03:08 (4012): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... 02:23:28 (6204): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 02:23:32 (6204): No heartbeat from core client for 30 sec - exiting 02:23:33 (6204): No heartbeat from core client for 30 sec - exiting 02:23:35 (6204): No heartbeat from core client for 30 sec - exiting 16:59:20 (6836): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... 17:00:21 (6364): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 17:00:26 (6364): No heartbeat from core client for 30 sec - exiting 17:00:27 (6364): No heartbeat from core client for 30 sec - exiting 17:00:28 (6364): No heartbeat from core client for 30 sec - exiting 17:00:29 (6364): No heartbeat from core client for 30 sec - exiting 17:00:30 (6364): No heartbeat from core client for 30 sec - exiting 17:00:31 (6364): No heartbeat from core client for 30 sec - exiting 17:00:32 (6364): No heartbeat from core client for 30 sec - exiting 17:00:34 (6364): No heartbeat from core client for 30 sec - exiting 17:00:35 (6364): No heartbeat from core client for 30 sec - exiting 17:00:36 (6364): No heartbeat from core client for 30 sec - exiting Signal 22 received, exiting... Called boinc_finish Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3696, iMonCtr=1 Model crash detected, will try to restart... Signal 22 received, exiting... Called boinc_finish Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3696, iMonCtr=1 Model crash detected, will try to restart... Signal 22 received, exiting... Called boinc_finish Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3696, iMonCtr=1 Model crash detected, will try to restart... Signal 22 received, exiting... Called boinc_finish Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3696, iMonCtr=1 Model crash detected, will try to restart... Signal 22 received, exiting... Called boinc_finish Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3696, iMonCtr=1 Model crash detected, will try to restart... Signal 22 received, exiting... Called boinc_finish Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3696, iMonCtr=1 Model crash detected, will try to restart... Sorry, too many model crashes! :-( Called boinc_finish </stderr_txt> ]]> |
Latest Trickles Received | ||||||
---|---|---|---|---|---|---|
Time Sent (UTC) | Host ID | Result ID | Result Name | Timestep | CPU Time (sec) | Average (sec/TS) |
20 Sep 2011 04:19:44 | 1122757 | 13338819 | hadcm3n_o5ve_1900_40_007440438_3 | 285,120 | 857,591 | 3.0078 |
19 Sep 2011 05:56:06 | 1122757 | 13338819 | hadcm3n_o5ve_1900_40_007440438_3 | 259,200 | 778,580 | 3.0038 |
18 Sep 2011 08:01:01 | 1122757 | 13338819 | hadcm3n_o5ve_1900_40_007440438_3 | 233,280 | 702,033 | 3.0094 |
17 Sep 2011 10:10:57 | 1122757 | 13338819 | hadcm3n_o5ve_1900_40_007440438_3 | 207,360 | 624,390 | 3.0111 |
16 Sep 2011 12:34:20 | 1122757 | 13338819 | hadcm3n_o5ve_1900_40_007440438_3 | 181,440 | 546,399 | 3.0115 |
15 Sep 2011 14:24:10 | 1122757 | 13338819 | hadcm3n_o5ve_1900_40_007440438_3 | 155,520 | 469,014 | 3.0158 |
14 Sep 2011 16:24:33 | 1122757 | 13338819 | hadcm3n_o5ve_1900_40_007440438_3 | 129,600 | 390,867 | 3.0159 |
13 Sep 2011 17:04:08 | 1122757 | 13338819 | hadcm3n_o5ve_1900_40_007440438_3 | 103,680 | 313,377 | 3.0225 |
12 Sep 2011 19:07:46 | 1122757 | 13338819 | hadcm3n_o5ve_1900_40_007440438_3 | 77,760 | 235,208 | 3.0248 |
11 Sep 2011 21:56:18 | 1122757 | 13338819 | hadcm3n_o5ve_1900_40_007440438_3 | 51,840 | 157,115 | 3.0308 |
10 Sep 2011 22:54:54 | 1122757 | 13338819 | hadcm3n_o5ve_1900_40_007440438_3 | 25,920 | 79,278 | 3.0586 |
©2024 cpdn.org