Name | hadcm3n_4ioj_1940_40_008303390_0 |
Workunit | 8454525 |
Created | 6 Feb 2013, 22:40:21 UTC |
Sent | 6 Feb 2013, 22:40:31 UTC |
Report deadline | 9 May 2013, 6:07:42 UTC |
Received | 25 Feb 2013, 6:08:17 UTC |
Server state | Over |
Outcome | Computation error |
Client state | Compute error |
Exit status | 22 (0x00000016) Unknown error code |
Computer ID | 1100141 |
Run time | 4 days 13 hours 42 min 59 sec |
CPU time | 3 days 5 hours 38 min 12 sec |
Validate state | Invalid |
Credit | 2,177.28 |
Device peak FLOPS | 2.68 GFLOPS |
Application version | UK Met Office Coupled Model Full Resolution Ocean v6.07 windows_intelx86 |
Stderr | <core_client_version>6.10.58</core_client_version> <![CDATA[ <message> The device does not recognize the command. (0x16) - exit code 22 (0x16) </message> <stderr_txt> CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... 00:09:29 (12664): No heartbeat from core client for 30 sec - exiting 00:09:30 (12664): No heartbeat from core client for 30 sec - exiting 00:09:31 (12664): No heartbeat from core client for 30 sec - exiting 00:09:32 (12664): No heartbeat from core client for 30 sec - exiting 00:09:33 (12664): No heartbeat from core client for 30 sec - exiting 00:09:34 (12664): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... 22:59:20 (13092): No heartbeat from core client for 30 sec - exiting 22:59:21 (13092): No heartbeat from core client for 30 sec - exiting 22:59:22 (13092): No heartbeat from core client for 30 sec - exiting 22:59:23 (13092): No heartbeat from core client for 30 sec - exiting 22:59:24 (13092): No heartbeat from core client for 30 sec - exiting 22:59:25 (13092): No heartbeat from core client for 30 sec - exiting 22:59:26 (13092): No heartbeat from core client for 30 sec - exiting 22:59:27 (13092): No heartbeat from core client for 30 sec - exiting 22:59:28 (13092): No heartbeat from core client for 30 sec - exiting 22:59:29 (13092): No heartbeat from core client for 30 sec - exiting 22:59:30 (13092): No heartbeat from core client for 30 sec - exiting 22:59:31 (13092): No heartbeat from core client for 30 sec - exiting 22:59:32 (13092): No heartbeat from core client for 30 sec - exiting 22:59:33 (13092): No heartbeat from core client for 30 sec - exiting 22:59:34 (13092): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 10:48:36 (12388): No heartbeat from core client for 30 sec - exiting 10:48:37 (12388): No heartbeat from core client for 30 sec - exiting 10:48:38 (12388): No heartbeat from core client for 30 sec - exiting 10:48:39 (12388): No heartbeat from core client for 30 sec - exiting 10:48:40 (12388): No heartbeat from core client for 30 sec - exiting 10:48:41 (12388): No heartbeat from core client for 30 sec - exiting 10:48:42 (12388): No heartbeat from core client for 30 sec - exiting 10:48:43 (12388): No heartbeat from core client for 30 sec - exiting 10:48:45 (12388): No heartbeat from core client for 30 sec - exiting 10:48:46 (12388): No heartbeat from core client for 30 sec - exiting 10:48:47 (12388): No heartbeat from core client for 30 sec - exiting 10:48:48 (12388): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 17:59:51 (12252): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 20:22:59 (12372): No heartbeat from core client for 30 sec - exiting 20:23:01 (12372): No heartbeat from core client for 30 sec - exiting 20:23:02 (12372): No heartbeat from core client for 30 sec - exiting 20:23:03 (12372): No heartbeat from core client for 30 sec - exiting 20:23:04 (12372): No heartbeat from core client for 30 sec - exiting 20:23:05 (12372): No heartbeat from core client for 30 sec - exiting 20:23:06 (12372): No heartbeat from core client for 30 sec - exiting 20:23:07 (12372): No heartbeat from core client for 30 sec - exiting 20:23:08 (12372): No heartbeat from core client for 30 sec - exiting 20:23:09 (12372): No heartbeat from core client for 30 sec - exiting 20:23:10 (12372): No heartbeat from core client for 30 sec - exiting 20:23:12 (12372): No heartbeat from core client for 30 sec - exiting 20:23:13 (12372): No heartbeat from core client for 30 sec - exiting 20:23:14 (12372): No heartbeat from core client for 30 sec - exiting 20:23:15 (12372): No heartbeat from core client for 30 sec - exiting 20:23:16 (12372): No heartbeat from core client for 30 sec - exiting 20:23:17 (12372): No heartbeat from core client for 30 sec - exiting 20:23:18 (12372): No heartbeat from core client for 30 sec - exiting 20:23:19 (12372): No heartbeat from core client for 30 sec - exiting 20:23:20 (12372): No heartbeat from core client for 30 sec - exiting 20:23:21 (12372): No heartbeat from core client for 30 sec - exiting 20:23:23 (12372): No heartbeat from core client for 30 sec - exiting 20:23:24 (12372): No heartbeat from core client for 30 sec - exiting 20:23:25 (12372): No heartbeat from core client for 30 sec - exiting 20:23:26 (12372): No heartbeat from core client for 30 sec - exiting 20:23:27 (12372): No heartbeat from core client for 30 sec - exiting 20:23:28 (12372): No heartbeat from core client for 30 sec - exiting 20:23:29 (12372): No heartbeat from core client for 30 sec - exiting 20:23:30 (12372): No heartbeat from core client for 30 sec - exiting 20:23:31 (12372): No heartbeat from core client for 30 sec - exiting 20:23:32 (12372): No heartbeat from core client for 30 sec - exiting 20:23:33 (12372): No heartbeat from core client for 30 sec - exiting 20:23:35 (12372): No heartbeat from core client for 30 sec - exiting 20:23:36 (12372): No heartbeat from core client for 30 sec - exiting 20:23:37 (12372): No heartbeat from core client for 30 sec - exiting 20:23:38 (12372): No heartbeat from core client for 30 sec - exiting 20:23:39 (12372): No heartbeat from core client for 30 sec - exiting 20:23:40 (12372): No heartbeat from core client for 30 sec - exiting 20:23:41 (12372): No heartbeat from core client for 30 sec - exiting 20:23:42 (12372): No heartbeat from core client for 30 sec - exiting 20:23:43 (12372): No heartbeat from core client for 30 sec - exiting 20:23:44 (12372): No heartbeat from core client for 30 sec - exiting 20:23:45 (12372): No heartbeat from core client for 30 sec - exiting 20:23:47 (12372): No heartbeat from core client for 30 sec - exiting 20:23:48 (12372): No heartbeat from core client for 30 sec - exiting 20:23:49 (12372): No heartbeat from core client for 30 sec - exiting 20:23:50 (12372): No heartbeat from core client for 30 sec - exiting 20:23:51 (12372): No heartbeat from core client for 30 sec - exiting 20:23:52 (12372): No heartbeat from core client for 30 sec - exiting 20:23:53 (12372): No heartbeat from core client for 30 sec - exiting 20:23:54 (12372): No heartbeat from core client for 30 sec - exiting 20:23:55 (12372): No heartbeat from core client for 30 sec - exiting 20:23:56 (12372): No heartbeat from core client for 30 sec - exiting 20:23:57 (12372): No heartbeat from core client for 30 sec - exiting 20:23:59 (12372): No heartbeat from core client for 30 sec - exiting 20:24:00 (12372): No heartbeat from core client for 30 sec - exiting 20:24:01 (12372): No heartbeat from core client for 30 sec - exiting 20:24:02 (12372): No heartbeat from core client for 30 sec - exiting 20:24:03 (12372): No heartbeat from core client for 30 sec - exiting 20:24:04 (12372): No heartbeat from core client for 30 sec - exiting 20:24:05 (12372): No heartbeat from core client for 30 sec - exiting 20:24:06 (12372): No heartbeat from core client for 30 sec - exiting 20:24:07 (12372): No heartbeat from core client for 30 sec - exiting 20:24:08 (12372): No heartbeat from core client for 30 sec - exiting 20:24:09 (12372): No heartbeat from core client for 30 sec - exiting 20:24:11 (12372): No heartbeat from core client for 30 sec - exiting 20:24:12 (12372): No heartbeat from core client for 30 sec - exiting 20:24:13 (12372): No heartbeat from core client for 30 sec - exiting 20:24:14 (12372): No heartbeat from core client for 30 sec - exiting 20:24:15 (12372): No heartbeat from core client for 30 sec - exiting 20:24:16 (12372): No heartbeat from core client for 30 sec - exiting 20:24:17 (12372): No heartbeat from core client for 30 sec - exiting 20:24:18 (12372): No heartbeat from core client for 30 sec - exiting 20:24:19 (12372): No heartbeat from core client for 30 sec - exiting 20:24:20 (12372): No heartbeat from core client for 30 sec - exiting 20:24:21 (12372): No heartbeat from core client for 30 sec - exiting 20:24:23 (12372): No heartbeat from core client for 30 sec - exiting 20:24:24 (12372): No heartbeat from core client for 30 sec - exiting 20:24:25 (12372): No heartbeat from core client for 30 sec - exiting 20:24:26 (12372): No heartbeat from core client for 30 sec - exiting 20:24:27 (12372): No heartbeat from core client for 30 sec - exiting 20:24:28 (12372): No heartbeat from core client for 30 sec - exiting 20:24:29 (12372): No heartbeat from core client for 30 sec - exiting 20:24:30 (12372): No heartbeat from core client for 30 sec - exiting 20:24:31 (12372): No heartbeat from core client for 30 sec - exiting 20:24:32 (12372): No heartbeat from core client for 30 sec - exiting 20:24:33 (12372): No heartbeat from core client for 30 sec - exiting 20:24:35 (12372): No heartbeat from core client for 30 sec - exiting 20:24:36 (12372): No heartbeat from core client for 30 sec - exiting 20:24:37 (12372): No heartbeat from core client for 30 sec - exiting 20:24:38 (12372): No heartbeat from core client for 30 sec - exiting 20:24:39 (12372): No heartbeat from core client for 30 sec - exiting 20:24:40 (12372): No heartbeat from core client for 30 sec - exiting 20:24:41 (12372): No heartbeat from core client for 30 sec - exiting 20:24:42 (12372): No heartbeat from core client for 30 sec - exiting 20:24:43 (12372): No heartbeat from core client for 30 sec - exiting 20:24:44 (12372): No heartbeat from core client for 30 sec - exiting 20:24:45 (12372): No heartbeat from core client for 30 sec - exiting 20:24:47 (12372): No heartbeat from core client for 30 sec - exiting 20:24:48 (12372): No heartbeat from core client for 30 sec - exiting 20:24:49 (12372): No heartbeat from core client for 30 sec - exiting 20:24:50 (12372): No heartbeat from core client for 30 sec - exiting 20:24:51 (12372): No heartbeat from core client for 30 sec - exiting 20:24:52 (12372): No heartbeat from core client for 30 sec - exiting 20:24:53 (12372): No heartbeat from core client for 30 sec - exiting 20:24:54 (12372): No heartbeat from core client for 30 sec - exiting 20:24:55 (12372): No heartbeat from core client for 30 sec - exiting 20:24:56 (12372): No heartbeat from core client for 30 sec - exiting 20:24:57 (12372): No heartbeat from core client for 30 sec - exiting 20:24:59 (12372): No heartbeat from core client for 30 sec - exiting 20:25:00 (12372): No heartbeat from core client for 30 sec - exiting 20:25:01 (12372): No heartbeat from core client for 30 sec - exiting 20:25:02 (12372): No heartbeat from core client for 30 sec - exiting 20:25:03 (12372): No heartbeat from core client for 30 sec - exiting 20:25:04 (12372): No heartbeat from core client for 30 sec - exiting 20:25:05 (12372): No heartbeat from core client for 30 sec - exiting 20:25:06 (12372): No heartbeat from core client for 30 sec - exiting 20:25:07 (12372): No heartbeat from core client for 30 sec - exiting 20:25:08 (12372): No heartbeat from core client for 30 sec - exiting 20:25:10 (12372): No heartbeat from core client for 30 sec - exiting 20:25:11 (12372): No heartbeat from core client for 30 sec - exiting 20:25:12 (12372): No heartbeat from core client for 30 sec - exiting 20:25:13 (12372): No heartbeat from core client for 30 sec - exiting 20:25:14 (12372): No heartbeat from core client for 30 sec - exiting 20:25:15 (12372): No heartbeat from core client for 30 sec - exiting 20:25:16 (12372): No heartbeat from core client for 30 sec - exiting 20:25:17 (12372): No heartbeat from core client for 30 sec - exiting 20:25:18 (12372): No heartbeat from core client for 30 sec - exiting 20:25:19 (12372): No heartbeat from core client for 30 sec - exiting 20:25:20 (12372): No heartbeat from core client for 30 sec - exiting 20:25:22 (12372): No heartbeat from core client for 30 sec - exiting 20:25:23 (12372): No heartbeat from core client for 30 sec - exiting 20:25:24 (12372): No heartbeat from core client for 30 sec - exiting 20:25:25 (12372): No heartbeat from core client for 30 sec - exiting 20:25:26 (12372): No heartbeat from core client for 30 sec - exiting 20:25:27 (12372): No heartbeat from core client for 30 sec - exiting 20:25:28 (12372): No heartbeat from core client for 30 sec - exiting 20:25:29 (12372): No heartbeat from core client for 30 sec - exiting 20:25:30 (12372): No heartbeat from core client for 30 sec - exiting 20:25:31 (12372): No heartbeat from core client for 30 sec - exiting 20:25:32 (12372): No heartbeat from core client for 30 sec - exiting 20:25:34 (12372): No heartbeat from core client for 30 sec - exiting 20:25:35 (12372): No heartbeat from core client for 30 sec - exiting 20:25:36 (12372): No heartbeat from core client for 30 sec - exiting 20:25:37 (12372): No heartbeat from core client for 30 sec - exiting 20:25:38 (12372): No heartbeat from core client for 30 sec - exiting 20:25:39 (12372): No heartbeat from core client for 30 sec - exiting 20:25:40 (12372): No heartbeat from core client for 30 sec - exiting 20:25:41 (12372): No heartbeat from core client for 30 sec - exiting 20:25:42 (12372): No heartbeat from core client for 30 sec - exiting 20:25:43 (12372): No heartbeat from core client for 30 sec - exiting 20:25:44 (12372): No heartbeat from core client for 30 sec - exiting 20:25:46 (12372): No heartbeat from core client for 30 sec - exiting 20:25:47 (12372): No heartbeat from core client for 30 sec - exiting 20:25:48 (12372): No heartbeat from core client for 30 sec - exiting 20:25:49 (12372): No heartbeat from core client for 30 sec - exiting 20:25:50 (12372): No heartbeat from core client for 30 sec - exiting 20:25:51 (12372): No heartbeat from core client for 30 sec - exiting 20:25:52 (12372): No heartbeat from core client for 30 sec - exiting 20:25:53 (12372): No heartbeat from core client for 30 sec - exiting 20:25:54 (12372): No heartbeat from core client for 30 sec - exiting 20:25:55 (12372): No heartbeat from core client for 30 sec - exiting 20:25:56 (12372): No heartbeat from core client for 30 sec - exiting 20:25:58 (12372): No heartbeat from core client for 30 sec - exiting 20:25:59 (12372): No heartbeat from core client for 30 sec - exiting 20:26:00 (12372): No heartbeat from core client for 30 sec - exiting 20:26:01 (12372): No heartbeat from core client for 30 sec - exiting 20:26:02 (12372): No heartbeat from core client for 30 sec - exiting 20:26:03 (12372): No heartbeat from core client for 30 sec - exiting 20:26:04 (12372): No heartbeat from core client for 30 sec - exiting 20:26:05 (12372): No heartbeat from core client for 30 sec - exiting 20:26:06 (12372): No heartbeat from core client for 30 sec - exiting 20:26:07 (12372): No heartbeat from core client for 30 sec - exiting 20:26:08 (12372): No heartbeat from core client for 30 sec - exiting 20:26:10 (12372): No heartbeat from core client for 30 sec - exiting 20:26:11 (12372): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 21:01:24 (9376): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 21:14:31 (7464): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 21:15:09 (10516): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... Signal 22 received, exiting... Called boinc_finish Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=12260, iMonCtr=1 Model crash detected, will try to restart... Signal 22 received, exiting... Called boinc_finish Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=12260, iMonCtr=1 Model crash detected, will try to restart... Signal 22 received, exiting... Called boinc_finish Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=12260, iMonCtr=1 Model crash detected, will try to restart... Signal 22 received, exiting... Called boinc_finish Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=12260, iMonCtr=1 Model crash detected, will try to restart... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=12260, iMonCtr=1 Model crash detected, will try to restart... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=12260, iMonCtr=1 Model crash detected, will try to restart... Signal 22 received, exiting... Called boinc_finish Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=13256, iMonCtr=1 Model crash detected, will try to restart... Sorry, too many model crashes! :-( Called boinc_finish </stderr_txt> ]]> |
Latest Trickles Received | ||||||
---|---|---|---|---|---|---|
Time Sent (UTC) | Host ID | Result ID | Result Name | Timestep | CPU Time (sec) | Average (sec/TS) |
24 Feb 2013 18:41:46 | 1100141 | 15588981 | hadcm3n_4ioj_1940_40_008303390_0 | 181,440 | 261,468 | 1.4411 |
24 Feb 2013 02:32:57 | 1100141 | 15588981 | hadcm3n_4ioj_1940_40_008303390_0 | 155,520 | 224,825 | 1.4456 |
23 Feb 2013 21:30:00 | 1100141 | 15588981 | hadcm3n_4ioj_1940_40_008303390_0 | 129,600 | 187,721 | 1.4485 |
23 Feb 2013 21:30:00 | 1100141 | 15588981 | hadcm3n_4ioj_1940_40_008303390_0 | 103,680 | 150,524 | 1.4518 |
20 Feb 2013 23:52:39 | 1100141 | 15588981 | hadcm3n_4ioj_1940_40_008303390_0 | 77,760 | 112,926 | 1.4522 |
18 Feb 2013 19:42:35 | 1100141 | 15588981 | hadcm3n_4ioj_1940_40_008303390_0 | 51,840 | 75,426 | 1.4550 |
17 Feb 2013 10:52:20 | 1100141 | 15588981 | hadcm3n_4ioj_1940_40_008303390_0 | 25,920 | 37,771 | 1.4572 |
©2024 cpdn.org