Name | hadcm3n_8c35_1980_40_008725084_0 |
Workunit | 8871062 |
Created | 23 Apr 2014, 13:23:28 UTC |
Sent | 1 May 2014, 14:26:55 UTC |
Report deadline | 31 Jul 2014, 21:54:06 UTC |
Received | 23 May 2014, 12:01:30 UTC |
Server state | Over |
Outcome | Computation error |
Client state | Compute error |
Exit status | 22 (0x00000016) Unknown error code |
Computer ID | 1251442 |
Run time | 9 days 1 hour 46 min 7 sec |
CPU time | 6 days 13 hours 56 min 40 sec |
Validate state | Invalid |
Credit | 2,177.28 |
Device peak FLOPS | 1.96 GFLOPS |
Application version | UK Met Office Coupled Model Full Resolution Ocean v6.07 windows_intelx86 |
Stderr | <core_client_version>7.2.33</core_client_version> <![CDATA[ <message> The device does not recognize the command. (0x16) - exit code 22 (0x16) </message> <stderr_txt> CPDN Monitor - Quit request from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4984, iMonCtr=1 Model crash detected, will try to restart... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=6932, iMonCtr=1 Model crash detected, will try to restart... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=6932, iMonCtr=1 Model crash detected, will try to restart... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=6932, iMonCtr=1 Model crash detected, will try to restart... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5332, iMonCtr=1 Model crash detected, will try to restart... CPDN Monitor - Quit request from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=7532, iMonCtr=1 Model crash detected, will try to restart... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=6756, iMonCtr=1 Model crash detected, will try to restart... CPDN Monitor - Quit request from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=2024, iMonCtr=1 Model crash detected, will try to restart... CPDN Monitor - Quit request from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=6404, iMonCtr=1 Model crash detected, will try to restart... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4896, iMonCtr=1 Model crash detected, will try to restart... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... 
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=6140, iMonCtr=1 Model crash detected, will try to restart... CPDN Monitor - Quit request from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=7480, iMonCtr=1 Model crash detected, will try to restart... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=2084, iMonCtr=1 Model crash detected, will try to restart... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=6796, iMonCtr=1 Model crash detected, will try to restart... 09:00:02 (6464): No heartbeat from core client for 30 sec - exiting 09:00:04 (6464): No heartbeat from core client for 30 sec - exiting 09:00:05 (6464): No heartbeat from core client for 30 sec - exiting 09:00:06 (6464): No heartbeat from core client for 30 sec - exiting 09:00:07 (6464): No heartbeat from core client for 30 sec - exiting 09:00:09 (6464): No heartbeat from core client for 30 sec - exiting 09:00:10 (6464): No heartbeat from core client for 30 sec - exiting 09:00:11 (6464): No heartbeat from core client for 30 sec - exiting 09:00:12 (6464): No heartbeat from core client for 30 sec - exiting 09:00:13 (6464): No heartbeat from core client for 30 sec - exiting 09:00:14 (6464): No heartbeat from core client for 30 sec - exiting 09:00:15 (6464): No heartbeat from core client for 30 sec - exiting 09:00:16 (6464): No heartbeat from core client for 30 sec - exiting 09:00:17 (6464): No heartbeat from core client for 30 sec - exiting 09:00:18 (6464): No heartbeat from core client for 30 sec - exiting 09:00:19 (6464): No heartbeat from core client for 30 sec - exiting 09:00:21 (6464): No heartbeat from 
core client for 30 sec - exiting 09:00:22 (6464): No heartbeat from core client for 30 sec - exiting 09:00:23 (6464): No heartbeat from core client for 30 sec - exiting 09:00:24 (6464): No heartbeat from core client for 30 sec - exiting 09:00:25 (6464): No heartbeat from core client for 30 sec - exiting 09:00:26 (6464): No heartbeat from core client for 30 sec - exiting 09:00:27 (6464): No heartbeat from core client for 30 sec - exiting 09:00:28 (6464): No heartbeat from core client for 30 sec - exiting 09:00:29 (6464): No heartbeat from core client for 30 sec - exiting 09:00:30 (6464): No heartbeat from core client for 30 sec - exiting 09:00:31 (6464): No heartbeat from core client for 30 sec - exiting 09:00:33 (6464): No heartbeat from core client for 30 sec - exiting 09:00:34 (6464): No heartbeat from core client for 30 sec - exiting 09:00:35 (6464): No heartbeat from core client for 30 sec - exiting 09:00:36 (6464): No heartbeat from core client for 30 sec - exiting 09:00:37 (6464): No heartbeat from core client for 30 sec - exiting 09:00:38 (6464): No heartbeat from core client for 30 sec - exiting 09:00:39 (6464): No heartbeat from core client for 30 sec - exiting 09:00:40 (6464): No heartbeat from core client for 30 sec - exiting 09:00:41 (6464): No heartbeat from core client for 30 sec - exiting 09:00:42 (6464): No heartbeat from core client for 30 sec - exiting 09:00:44 (6464): No heartbeat from core client for 30 sec - exiting 09:00:45 (6464): No heartbeat from core client for 30 sec - exiting 09:00:46 (6464): No heartbeat from core client for 30 sec - exiting 09:00:47 (6464): No heartbeat from core client for 30 sec - exiting 09:00:48 (6464): No heartbeat from core client for 30 sec - exiting 09:00:49 (6464): No heartbeat from core client for 30 sec - exiting 09:00:50 (6464): No heartbeat from core client for 30 sec - exiting 09:00:51 (6464): No heartbeat from core client for 30 sec - exiting 09:00:52 (6464): No heartbeat from core client for 30 sec - 
exiting 09:00:53 (6464): No heartbeat from core client for 30 sec - exiting 09:00:54 (6464): No heartbeat from core client for 30 sec - exiting 09:00:56 (6464): No heartbeat from core client for 30 sec - exiting 09:00:57 (6464): No heartbeat from core client for 30 sec - exiting 09:00:58 (6464): No heartbeat from core client for 30 sec - exiting 09:00:59 (6464): No heartbeat from core client for 30 sec - exiting 09:01:00 (6464): No heartbeat from core client for 30 sec - exiting 09:01:01 (6464): No heartbeat from core client for 30 sec - exiting 09:01:02 (6464): No heartbeat from core client for 30 sec - exiting 09:01:03 (6464): No heartbeat from core client for 30 sec - exiting 09:01:04 (6464): No heartbeat from core client for 30 sec - exiting 09:01:05 (6464): No heartbeat from core client for 30 sec - exiting 09:01:06 (6464): No heartbeat from core client for 30 sec - exiting 09:01:08 (6464): No heartbeat from core client for 30 sec - exiting 09:01:09 (6464): No heartbeat from core client for 30 sec - exiting 09:01:10 (6464): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... CPDN Monitor - Quit request from BOINC... 18:58:06 (6720): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3192, iMonCtr=1 Model crash detected, will try to restart... 
10:20:49 (6300): No heartbeat from core client for 30 sec - exiting 10:20:50 (6300): No heartbeat from core client for 30 sec - exiting 10:20:51 (6300): No heartbeat from core client for 30 sec - exiting 10:20:52 (6300): No heartbeat from core client for 30 sec - exiting 10:20:53 (6300): No heartbeat from core client for 30 sec - exiting 10:20:55 (6300): No heartbeat from core client for 30 sec - exiting 10:20:56 (6300): No heartbeat from core client for 30 sec - exiting 10:20:57 (6300): No heartbeat from core client for 30 sec - exiting 10:20:58 (6300): No heartbeat from core client for 30 sec - exiting 10:20:59 (6300): No heartbeat from core client for 30 sec - exiting 10:21:00 (6300): No heartbeat from core client for 30 sec - exiting 10:21:01 (6300): No heartbeat from core client for 30 sec - exiting 10:21:02 (6300): No heartbeat from core client for 30 sec - exiting 10:21:03 (6300): No heartbeat from core client for 30 sec - exiting 10:21:04 (6300): No heartbeat from core client for 30 sec - exiting 10:21:05 (6300): No heartbeat from core client for 30 sec - exiting 10:21:07 (6300): No heartbeat from core client for 30 sec - exiting 10:21:08 (6300): No heartbeat from core client for 30 sec - exiting 10:21:09 (6300): No heartbeat from core client for 30 sec - exiting 10:21:10 (6300): No heartbeat from core client for 30 sec - exiting 10:21:11 (6300): No heartbeat from core client for 30 sec - exiting 10:21:12 (6300): No heartbeat from core client for 30 sec - exiting 10:21:13 (6300): No heartbeat from core client for 30 sec - exiting 10:21:14 (6300): No heartbeat from core client for 30 sec - exiting 10:21:15 (6300): No heartbeat from core client for 30 sec - exiting 10:21:16 (6300): No heartbeat from core client for 30 sec - exiting 10:21:17 (6300): No heartbeat from core client for 30 sec - exiting 10:21:19 (6300): No heartbeat from core client for 30 sec - exiting 10:21:20 (6300): No heartbeat from core client for 30 sec - exiting 10:21:21 (6300): No 
heartbeat from core client for 30 sec - exiting 10:21:22 (6300): No heartbeat from core client for 30 sec - exiting 10:21:23 (6300): No heartbeat from core client for 30 sec - exiting 10:21:24 (6300): No heartbeat from core client for 30 sec - exiting 10:21:25 (6300): No heartbeat from core client for 30 sec - exiting 10:21:26 (6300): No heartbeat from core client for 30 sec - exiting 10:21:27 (6300): No heartbeat from core client for 30 sec - exiting 10:21:28 (6300): No heartbeat from core client for 30 sec - exiting 10:21:30 (6300): No heartbeat from core client for 30 sec - exiting 10:21:31 (6300): No heartbeat from core client for 30 sec - exiting 10:21:32 (6300): No heartbeat from core client for 30 sec - exiting 10:21:33 (6300): No heartbeat from core client for 30 sec - exiting 10:21:34 (6300): No heartbeat from core client for 30 sec - exiting 10:21:35 (6300): No heartbeat from core client for 30 sec - exiting 10:21:36 (6300): No heartbeat from core client for 30 sec - exiting 10:21:37 (6300): No heartbeat from core client for 30 sec - exiting 10:21:38 (6300): No heartbeat from core client for 30 sec - exiting 10:21:39 (6300): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 
10:21:40 (6300): No heartbeat from core client for 30 sec - exiting 10:21:42 (6300): No heartbeat from core client for 30 sec - exiting 10:21:43 (6300): No heartbeat from core client for 30 sec - exiting 10:21:44 (6300): No heartbeat from core client for 30 sec - exiting 10:21:45 (6300): No heartbeat from core client for 30 sec - exiting 10:21:46 (6300): No heartbeat from core client for 30 sec - exiting 10:21:47 (6300): No heartbeat from core client for 30 sec - exiting 10:21:48 (6300): No heartbeat from core client for 30 sec - exiting 10:21:49 (6300): No heartbeat from core client for 30 sec - exiting 10:21:50 (6300): No heartbeat from core client for 30 sec - exiting Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5732, iMonCtr=1 Model crash detected, will try to restart... 13:02:29 (5888): No heartbeat from core client for 30 sec - exiting 13:02:30 (5888): No heartbeat from core client for 30 sec - exiting 13:02:31 (5888): No heartbeat from core client for 30 sec - exiting 13:02:32 (5888): No heartbeat from core client for 30 sec - exiting 13:02:33 (5888): No heartbeat from core client for 30 sec - exiting 13:02:34 (5888): No heartbeat from core client for 30 sec - exiting 13:02:35 (5888): No heartbeat from core client for 30 sec - exiting 13:02:36 (5888): No heartbeat from core client for 30 sec - exiting 13:02:37 (5888): No heartbeat from core client for 30 sec - exiting 13:02:39 (5888): No heartbeat from core client for 30 sec - exiting 13:02:40 (5888): No heartbeat from core client for 30 sec - exiting 13:02:41 (5888): No heartbeat from core client for 30 sec - exiting 13:02:42 (5888): No heartbeat from core client for 30 sec - exiting 13:02:43 (5888): No heartbeat from core client for 30 sec - exiting 13:02:44 (5888): No heartbeat from core client for 30 sec - exiting 13:02:45 (5888): No heartbeat from core client for 30 sec - exiting 13:02:46 (5888): No heartbeat from core client for 30 sec - exiting 13:02:47 (5888): 
No heartbeat from core client for 30 sec - exiting 13:02:48 (5888): No heartbeat from core client for 30 sec - exiting 13:02:49 (5888): No heartbeat from core client for 30 sec - exiting 13:02:51 (5888): No heartbeat from core client for 30 sec - exiting 13:02:52 (5888): No heartbeat from core client for 30 sec - exiting 13:02:53 (5888): No heartbeat from core client for 30 sec - exiting 13:02:54 (5888): No heartbeat from core client for 30 sec - exiting 13:02:55 (5888): No heartbeat from core client for 30 sec - exiting 13:02:56 (5888): No heartbeat from core client for 30 sec - exiting 13:02:57 (5888): No heartbeat from core client for 30 sec - exiting 13:02:58 (5888): No heartbeat from core client for 30 sec - exiting 13:02:59 (5888): No heartbeat from core client for 30 sec - exiting 13:03:00 (5888): No heartbeat from core client for 30 sec - exiting 13:03:02 (5888): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 13:03:03 (5888): No heartbeat from core client for 30 sec - exiting 13:03:04 (5888): No heartbeat from core client for 30 sec - exiting 13:03:05 (5888): No heartbeat from core client for 30 sec - exiting 13:03:06 (5888): No heartbeat from core client for 30 sec - exiting 13:03:07 (5888): No heartbeat from core client for 30 sec - exiting 13:03:08 (5888): No heartbeat from core client for 30 sec - exiting CPDN Monitor - Quit request from BOINC... 
23:33:28 (1908): No heartbeat from core client for 30 sec - exiting 23:33:29 (1908): No heartbeat from core client for 30 sec - exiting 23:33:30 (1908): No heartbeat from core client for 30 sec - exiting 23:33:31 (1908): No heartbeat from core client for 30 sec - exiting 23:33:32 (1908): No heartbeat from core client for 30 sec - exiting 23:33:33 (1908): No heartbeat from core client for 30 sec - exiting 23:33:34 (1908): No heartbeat from core client for 30 sec - exiting 23:33:35 (1908): No heartbeat from core client for 30 sec - exiting 23:33:36 (1908): No heartbeat from core client for 30 sec - exiting 23:33:37 (1908): No heartbeat from core client for 30 sec - exiting 23:33:38 (1908): No heartbeat from core client for 30 sec - exiting 23:33:39 (1908): No heartbeat from core client for 30 sec - exiting 23:33:40 (1908): No heartbeat from core client for 30 sec - exiting 23:33:41 (1908): No heartbeat from core client for 30 sec - exiting 23:33:42 (1908): No heartbeat from core client for 30 sec - exiting 23:33:43 (1908): No heartbeat from core client for 30 sec - exiting 23:33:44 (1908): No heartbeat from core client for 30 sec - exiting 23:33:45 (1908): No heartbeat from core client for 30 sec - exiting 23:33:46 (1908): No heartbeat from core client for 30 sec - exiting 23:33:47 (1908): No heartbeat from core client for 30 sec - exiting 23:33:48 (1908): No heartbeat from core client for 30 sec - exiting 23:33:49 (1908): No heartbeat from core client for 30 sec - exiting 23:33:50 (1908): No heartbeat from core client for 30 sec - exiting 23:33:51 (1908): No heartbeat from core client for 30 sec - exiting 23:33:52 (1908): No heartbeat from core client for 30 sec - exiting 23:33:53 (1908): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=8072, iMonCtr=1 Model crash detected, will try to restart... 
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=2008, iMonCtr=1 Model crash detected, will try to restart... CPDN Monitor - Quit request from BOINC... 19:31:09 (4308): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4612, iMonCtr=1 Model crash detected, will try to restart... CPDN Monitor - Quit request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... 13:07:34 (5868): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=7188, iMonCtr=1 Model crash detected, will try to restart... 10:52:18 (4516): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 10:52:19 (4516): No heartbeat from core client for 30 sec - exiting 10:52:20 (4516): No heartbeat from core client for 30 sec - exiting 10:52:21 (4516): No heartbeat from core client for 30 sec - exiting 10:52:22 (4516): No heartbeat from core client for 30 sec - exiting 10:52:23 (4516): No heartbeat from core client for 30 sec - exiting 10:52:24 (4516): No heartbeat from core client for 30 sec - exiting 10:52:26 (4516): No heartbeat from core client for 30 sec - exiting 10:52:27 (4516): No heartbeat from core client for 30 sec - exiting 10:52:28 (4516): No heartbeat from core client for 30 sec - exiting 11:20:14 (6096): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... CPDN Monitor - Quit request from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4712, iMonCtr=1 Model crash detected, will try to restart... 
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4712, iMonCtr=1 Model crash detected, will try to restart... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4712, iMonCtr=1 Model crash detected, will try to restart... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4712, iMonCtr=1 Model crash detected, will try to restart... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4712, iMonCtr=1 Model crash detected, will try to restart... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4712, iMonCtr=1 Model crash detected, will try to restart... Sorry, too many model crashes! :-( Called boinc_finish </stderr_txt> ]]> |
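The stderr text above is long and repetitive, so a tally by message type can make it easier to read. The following is a minimal sketch, not part of the CPDN or BOINC tooling: the labels in `PATTERNS` and the function name `summarize_stderr` are hypothetical, and the patterns are simply the message fragments visible in this log. Whitespace is matched loosely because the dump wraps some messages across lines.

```python
import re

# Hypothetical labels mapped to message fragments copied from the stderr text.
# \s+ is used where the dump may wrap a message across a line break.
PATTERNS = {
    "quit_request": r"CPDN Monitor - Quit request from BOINC",
    "suspend_request": r"CPDN Monitor - Suspend request from BOINC",
    "model_crash": r"Model crash detected, will try to restart",
    "no_heartbeat": r"No heartbeat from\s+core client for 30 sec\s+-\s+exiting",
    "monitor_no_heartbeat": r"CPDN Monitor - No 'heartbeat' from BOINC",
}


def summarize_stderr(text):
    """Return a dict of label -> number of occurrences in the stderr text."""
    return {label: len(re.findall(pattern, text)) for label, pattern in PATTERNS.items()}


# Short sample assembled from messages that appear in the log above.
sample = (
    "CPDN Monitor - Quit request from BOINC... "
    "Controller:: CPDN process is not running, exiting, bRetVal = 1, "
    "checkPID=0, selfPID=4984, iMonCtr=1 "
    "Model crash detected, will try to restart... "
    "09:00:02 (6464): No heartbeat from core client for 30 sec - exiting "
    "09:00:04 (6464): No heartbeat from\ncore client for 30 sec - exiting "
    "CPDN Monitor - No 'heartbeat' from BOINC... "
)

counts = summarize_stderr(sample)
for label, n in counts.items():
    print(f"{label:22s} {n}")
```

To tally the full log, the contents of the `<stderr_txt>` element would be passed to `summarize_stderr` in place of `sample`.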
Latest Trickles Received

Time Sent (UTC) | Host ID | Result ID | Result Name | Timestep | CPU Time (sec) | Average (sec/TS) |
---|---|---|---|---|---|---|
23 May 2014 01:59:11 | 1251442 | 16589941 | hadcm3n_8c35_1980_40_008725084_0 | 181,440 | 560,905 | 3.0914 |
20 May 2014 01:55:15 | 1251442 | 16589941 | hadcm3n_8c35_1980_40_008725084_0 | 155,520 | 483,148 | 3.1067 |
16 May 2014 18:23:53 | 1251442 | 16589941 | hadcm3n_8c35_1980_40_008725084_0 | 129,600 | 395,863 | 3.0545 |
14 May 2014 01:20:10 | 1251442 | 16589941 | hadcm3n_8c35_1980_40_008725084_0 | 103,680 | 308,147 | 2.9721 |
10 May 2014 18:07:45 | 1251442 | 16589941 | hadcm3n_8c35_1980_40_008725084_0 | 77,760 | 231,509 | 2.9772 |
07 May 2014 17:33:07 | 1251442 | 16589941 | hadcm3n_8c35_1980_40_008725084_0 | 51,840 | 152,802 | 2.9476 |
05 May 2014 04:37:59 | 1251442 | 16589941 | hadcm3n_8c35_1980_40_008725084_0 | 25,920 | 75,790 | 2.9240 |
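The Average (sec/TS) column appears to be the cumulative CPU time divided by the timestep count at each trickle. This is an inference from the figures, not something the page states. The sketch below recomputes the ratio from the Timestep and CPU Time values in the table and compares it with the reported average; the function name `sec_per_timestep` is made up for illustration.

```python
# (timestep, cpu_time_sec, reported_avg) copied from the trickle table above.
trickles = [
    (181_440, 560_905, 3.0914),
    (155_520, 483_148, 3.1067),
    (129_600, 395_863, 3.0545),
    (103_680, 308_147, 2.9721),
    (77_760, 231_509, 2.9772),
    (51_840, 152_802, 2.9476),
    (25_920, 75_790, 2.9240),
]


def sec_per_timestep(cpu_sec, timestep):
    """Assumed definition: cumulative CPU seconds per model timestep."""
    return round(cpu_sec / timestep, 4)


computed = [sec_per_timestep(cpu, ts) for ts, cpu, _ in trickles]

for (ts, cpu, reported), value in zip(trickles, computed):
    print(f"{ts:>7} steps  {value:.4f} s/TS  (reported {reported:.4f})")
```

Under this assumption the recomputed values agree with the reported column to four decimal places. The timesteps also advance by 25,920 between consecutive trickles.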
©2024 cpdn.org