Name | hadcm3n_yevx_1900_40_007526073_2 |
Workunit | 7723548 |
Created | 3 Nov 2011, 23:10:52 UTC |
Sent | 3 Nov 2011, 23:25:44 UTC |
Report deadline | 3 Feb 2012, 6:52:55 UTC |
Received | 18 Nov 2011, 18:52:54 UTC |
Server state | Over |
Outcome | Computation error |
Client state | Compute error |
Exit status | 22 (0x00000016) Unknown error code |
Computer ID | 1122757 |
Run time | 7 days 15 hours 52 min 17 sec |
CPU time | 7 days 12 hours 13 min 12 sec |
Validate state | Invalid |
Credit | 2,488.32 |
Device peak FLOPS | 1.70 GFLOPS |
Application version | UK Met Office Coupled Model Full Resolution Ocean v6.07 windows_intelx86 |
Stderr | <core_client_version>6.12.34</core_client_version> <![CDATA[ <message> The device does not recognize the command. (0x16) - exit code 22 (0x16) </message> <stderr_txt> 23:03:18 (5276): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 23:03:43 (5276): No heartbeat from core client for 30 sec - exiting 23:03:44 (5276): No heartbeat from core client for 30 sec - exiting 23:03:45 (5276): No heartbeat from core client for 30 sec - exiting 23:03:46 (5276): No heartbeat from core client for 30 sec - exiting 23:03:47 (5276): No heartbeat from core client for 30 sec - exiting 23:03:49 (5276): No heartbeat from core client for 30 sec - exiting 23:03:50 (5276): No heartbeat from core client for 30 sec - exiting 23:03:51 (5276): No heartbeat from core client for 30 sec - exiting 23:03:52 (5276): No heartbeat from core client for 30 sec - exiting 23:03:53 (5276): No heartbeat from core client for 30 sec - exiting 23:03:54 (5276): No heartbeat from core client for 30 sec - exiting 16:35:34 (824): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 16:35:48 (824): No heartbeat from core client for 30 sec - exiting 16:35:49 (824): No heartbeat from core client for 30 sec - exiting 16:35:50 (824): No heartbeat from core client for 30 sec - exiting 16:35:51 (824): No heartbeat from core client for 30 sec - exiting 16:35:53 (824): No heartbeat from core client for 30 sec - exiting 16:35:54 (824): No heartbeat from core client for 30 sec - exiting 16:35:55 (824): No heartbeat from core client for 30 sec - exiting 16:35:56 (824): No heartbeat from core client for 30 sec - exiting 16:35:57 (824): No heartbeat from core client for 30 sec - exiting 16:35:58 (824): No heartbeat from core client for 30 sec - exiting 16:35:59 (824): No heartbeat from core client for 30 sec - exiting 16:36:00 (824): No heartbeat from core client for 30 sec - exiting 16:36:01 (824): No heartbeat from core client for 30 sec - exiting 16:36:02 (824): No heartbeat from core client for 30 sec - exiting 16:36:03 (824): No heartbeat from core client for 30 sec - exiting 16:36:05 (824): No heartbeat from core client for 30 sec - exiting 16:36:06 (824): No heartbeat from core client for 30 sec - exiting 16:36:07 (824): No heartbeat from core client for 30 sec - exiting 16:36:08 (824): No heartbeat from core client for 30 sec - exiting 16:36:09 (824): No heartbeat from core client for 30 sec - exiting 16:36:14 (824): No heartbeat from core client for 30 sec - exiting 16:36:15 (824): No heartbeat from core client for 30 sec - exiting 16:36:16 (824): No heartbeat from core client for 30 sec - exiting 16:36:17 (824): No heartbeat from core client for 30 sec - exiting 16:36:18 (824): No heartbeat from core client for 30 sec - exiting 16:36:19 (824): No heartbeat from core client for 30 sec - exiting 16:36:21 (824): No heartbeat from core client for 30 sec - exiting 16:36:22 (824): No heartbeat from core client for 30 sec - exiting 16:36:23 (824): No heartbeat from core client for 30 sec - exiting 16:36:24 (824): No heartbeat from core client for 30 sec - exiting 16:36:25 (824): No heartbeat from core client for 30 sec - exiting 16:36:26 (824): No heartbeat from core client for 30 sec - exiting 16:36:27 (824): No heartbeat from core client for 30 sec - exiting 16:36:28 (824): No heartbeat from core client for 30 sec - exiting 16:36:29 (824): No heartbeat from core client for 30 sec - exiting 16:36:30 (824): No heartbeat from core client for 30 sec - exiting 16:36:31 (824): No heartbeat from core client for 30 sec - exiting 16:36:33 (824): No heartbeat from core client for 30 sec - exiting 16:36:34 (824): No heartbeat from core client for 30 sec - exiting 16:36:35 (824): No heartbeat from core client for 30 sec - exiting 16:36:36 (824): No heartbeat from core client for 30 sec - exiting 02:30:40 (7160): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 02:37:45 (4932): No heartbeat from core client for 30 sec - exiting 02:38:30 (4932): No heartbeat from core client for 30 sec - exiting 02:38:31 (4932): No heartbeat from core client for 30 sec - exiting 02:38:32 (4932): No heartbeat from core client for 30 sec - exiting 02:38:33 (4932): No heartbeat from core client for 30 sec - exiting 02:38:34 (4932): No heartbeat from core client for 30 sec - exiting 02:38:35 (4932): No heartbeat from core client for 30 sec - exiting 02:38:36 (4932): No heartbeat from core client for 30 sec - exiting 02:38:37 (4932): No heartbeat from core client for 30 sec - exiting 02:38:38 (4932): No heartbeat from core client for 30 sec - exiting 02:38:39 (4932): No heartbeat from core client for 30 sec - exiting 02:38:41 (4932): No heartbeat from core client for 30 sec - exiting 02:38:42 (4932): No heartbeat from core client for 30 sec - exiting 02:38:43 (4932): No heartbeat from core client for 30 sec - exiting 02:38:44 (4932): No heartbeat from core client for 30 sec - exiting 02:38:45 (4932): No heartbeat from core client for 30 sec - exiting 02:38:46 (4932): No heartbeat from core client for 30 sec - exiting 02:38:47 (4932): No heartbeat from core client for 30 sec - exiting 02:38:48 (4932): No heartbeat from core client for 30 sec - exiting 02:38:49 (4932): No heartbeat from core client for 30 sec - exiting 02:38:50 (4932): No heartbeat from core client for 30 sec - exiting 02:38:51 (4932): No heartbeat from core client for 30 sec - exiting 02:38:53 (4932): No heartbeat from core client for 30 sec - exiting 02:38:54 (4932): No heartbeat from core client for 30 sec - exiting 02:38:55 (4932): No heartbeat from core client for 30 sec - exiting 02:38:56 (4932): No heartbeat from core client for 30 sec - exiting 02:38:57 (4932): No heartbeat from core client for 30 sec - exiting 02:38:58 (4932): No heartbeat from core client for 30 sec - exiting 02:38:59 (4932): No heartbeat from core client for 30 sec - exiting 02:39:00 (4932): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 03:23:00 (4452): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 03:46:31 (3860): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 03:48:50 (3956): No heartbeat from core client for 30 sec - exiting 03:48:51 (3956): No heartbeat from core client for 30 sec - exiting 03:48:52 (3956): No heartbeat from core client for 30 sec - exiting 03:48:53 (3956): No heartbeat from core client for 30 sec - exiting 03:48:54 (3956): No heartbeat from core client for 30 sec - exiting 03:48:55 (3956): No heartbeat from core client for 30 sec - exiting 03:48:56 (3956): No heartbeat from core client for 30 sec - exiting 03:48:58 (3956): No heartbeat from core client for 30 sec - exiting 03:48:59 (3956): No heartbeat from core client for 30 sec - exiting 03:49:00 (3956): No heartbeat from core client for 30 sec - exiting 03:49:01 (3956): No heartbeat from core client for 30 sec - exiting 03:49:02 (3956): No heartbeat from core client for 30 sec - exiting 03:49:03 (3956): No heartbeat from core client for 30 sec - exiting 03:49:04 (3956): No heartbeat from core client for 30 sec - exiting 03:49:05 (3956): No heartbeat from core client for 30 sec - exiting 03:49:06 (3956): No heartbeat from core client for 30 sec - exiting 03:49:07 (3956): No heartbeat from core client for 30 sec - exiting 03:49:08 (3956): No heartbeat from core client for 30 sec - exiting 03:49:10 (3956): No heartbeat from core client for 30 sec - exiting 03:49:11 (3956): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 05:32:40 (6088): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 05:32:57 (6088): No heartbeat from core client for 30 sec - exiting 05:32:58 (6088): No heartbeat from core client for 30 sec - exiting 05:32:59 (6088): No heartbeat from core client for 30 sec - exiting 05:33:00 (6088): No heartbeat from core client for 30 sec - exiting 05:33:01 (6088): No heartbeat from core client for 30 sec - exiting 05:33:03 (6088): No heartbeat from core client for 30 sec - exiting 05:33:04 (6088): No heartbeat from core client for 30 sec - exiting 05:33:05 (6088): No heartbeat from core client for 30 sec - exiting 05:33:06 (6088): No heartbeat from core client for 30 sec - exiting 05:33:07 (6088): No heartbeat from core client for 30 sec - exiting 12:09:40 (7016): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 12:10:03 (7016): No heartbeat from core client for 30 sec - exiting 12:10:04 (7016): No heartbeat from core client for 30 sec - exiting 12:10:06 (7016): No heartbeat from core client for 30 sec - exiting 12:10:07 (7016): No heartbeat from core client for 30 sec - exiting 06:41:04 (6740): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 07:16:48 (4908): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... Atmos Hold Restart file rename failed on atmos_restart.hold 07:22:37 (4896): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 07:22:38 (4896): No heartbeat from core client for 30 sec - exiting 07:22:39 (4896): No heartbeat from core client for 30 sec - exiting 07:22:40 (4896): No heartbeat from core client for 30 sec - exiting 07:22:41 (4896): No heartbeat from core client for 30 sec - exiting 07:22:42 (4896): No heartbeat from core client for 30 sec - exiting 07:22:43 (4896): No heartbeat from core client for 30 sec - exiting 07:22:45 (4896): No heartbeat from core client for 30 sec - exiting 07:22:46 (4896): No heartbeat from core client for 30 sec - exiting 07:22:47 (4896): No heartbeat from core client for 30 sec - exiting 07:22:48 (4896): No heartbeat from core client for 30 sec - exiting Atmos Hold Restart file rename failed on atmos_restart.hold CPDN Monitor - Quit request from BOINC... 22:22:11 (5780): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 01:16:26 (2916): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 01:16:34 (2916): No heartbeat from core client for 30 sec - exiting 22:46:00 (4736): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 22:46:17 (4736): No heartbeat from core client for 30 sec - exiting 22:46:18 (4736): No heartbeat from core client for 30 sec - exiting 22:46:19 (4736): No heartbeat from core client for 30 sec - exiting 22:46:20 (4736): No heartbeat from core client for 30 sec - exiting 22:46:21 (4736): No heartbeat from core client for 30 sec - exiting 22:46:22 (4736): No heartbeat from core client for 30 sec - exiting 22:46:23 (4736): No heartbeat from core client for 30 sec - exiting 22:46:25 (4736): No heartbeat from core client for 30 sec - exiting 22:46:26 (4736): No heartbeat from core client for 30 sec - exiting 22:46:27 (4736): No heartbeat from core client for 30 sec - exiting 22:46:28 (4736): No heartbeat from core client for 30 sec - exiting 22:46:29 (4736): No heartbeat from core client for 30 sec - exiting 22:46:30 (4736): No heartbeat from core client for 30 sec - exiting 22:46:31 (4736): No heartbeat from core client for 30 sec - exiting 22:46:32 (4736): No heartbeat from core client for 30 sec - exiting 22:46:33 (4736): No heartbeat from core client for 30 sec - exiting 22:46:34 (4736): No heartbeat from core client for 30 sec - exiting 22:46:36 (4736): No heartbeat from core client for 30 sec - exiting 22:46:37 (4736): No heartbeat from core client for 30 sec - exiting 22:46:38 (4736): No heartbeat from core client for 30 sec - exiting 22:46:39 (4736): No heartbeat from core client for 30 sec - exiting 22:46:40 (4736): No heartbeat from core client for 30 sec - exiting 22:46:41 (4736): No heartbeat from core client for 30 sec - exiting 22:46:42 (4736): No heartbeat from core client for 30 sec - exiting 22:46:43 (4736): No heartbeat from core client for 30 sec - exiting 22:46:44 (4736): No heartbeat from core client for 30 sec - exiting 22:46:45 (4736): No heartbeat from core client for 30 sec - exiting 22:46:46 (4736): No heartbeat from core client for 30 sec - exiting 22:46:48 (4736): No heartbeat from core client for 30 sec - exiting 22:46:49 (4736): No heartbeat from core client for 30 sec - exiting 22:46:50 (4736): No heartbeat from core client for 30 sec - exiting 22:46:51 (4736): No heartbeat from core client for 30 sec - exiting 22:46:52 (4736): No heartbeat from core client for 30 sec - exiting 22:46:53 (4736): No heartbeat from core client for 30 sec - exiting 22:46:54 (4736): No heartbeat from core client for 30 sec - exiting 22:46:55 (4736): No heartbeat from core client for 30 sec - exiting 22:46:56 (4736): No heartbeat from core client for 30 sec - exiting 22:46:57 (4736): No heartbeat from core client for 30 sec - exiting 22:46:58 (4736): No heartbeat from core client for 30 sec - exiting 22:47:00 (4736): No heartbeat from core client for 30 sec - exiting 22:47:01 (4736): No heartbeat from core client for 30 sec - exiting 22:47:02 (4736): No heartbeat from core client for 30 sec - exiting 15:25:11 (6660): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 10:20:29 (5644): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... Signal 22 received, exiting... Called boinc_finish Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4132, iMonCtr=1 Model crash detected, will try to restart... Signal 22 received, exiting... Called boinc_finish Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4132, iMonCtr=1 Model crash detected, will try to restart... Signal 22 received, exiting... Called boinc_finish Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4132, iMonCtr=1 Model crash detected, will try to restart... Signal 22 received, exiting... Called boinc_finish Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4132, iMonCtr=1 Model crash detected, will try to restart... Signal 22 received, exiting... Called boinc_finish Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4132, iMonCtr=1 Model crash detected, will try to restart... Signal 22 received, exiting... Called boinc_finish Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4132, iMonCtr=1 Model crash detected, will try to restart... Sorry, too many model crashes! :-( Called boinc_finish </stderr_txt> ]]> |
Latest Trickles Received | ||||||
---|---|---|---|---|---|---|
Time Sent (UTC) | Host ID | Result ID | Result Name | Timestep | CPU Time (sec) | Average (sec/TS) |
17 Nov 2011 15:30:10 | 1122757 | 13591360 | hadcm3n_yevx_1900_40_007526073_2 | 207,360 | 618,816 | 2.9843 |
16 Nov 2011 16:47:43 | 1122757 | 13591360 | hadcm3n_yevx_1900_40_007526073_2 | 181,440 | 537,542 | 2.9626 |
15 Nov 2011 18:12:30 | 1122757 | 13591360 | hadcm3n_yevx_1900_40_007526073_2 | 155,520 | 456,177 | 2.9332 |
15 Nov 2011 16:50:24 | 1122757 | 13591360 | hadcm3n_yevx_1900_40_007526073_2 | 129,600 | 395,238 | 3.0497 |
15 Nov 2011 16:50:24 | 1122757 | 13591360 | hadcm3n_yevx_1900_40_007526073_2 | 103,680 | 320,466 | 3.0909 |
15 Nov 2011 16:50:24 | 1122757 | 13591360 | hadcm3n_yevx_1900_40_007526073_2 | 77,760 | 240,761 | 3.0962 |
15 Nov 2011 16:50:24 | 1122757 | 13591360 | hadcm3n_yevx_1900_40_007526073_2 | 51,840 | 159,766 | 3.0819 |
15 Nov 2011 16:50:24 | 1122757 | 13591360 | hadcm3n_yevx_1900_40_007526073_2 | 25,920 | 79,693 | 3.0746 |
©2024 cpdn.org