Name | hadcm3n_80bc_1980_40_008458347_3 |
Workunit | 8609203 |
Created | 10 Mar 2014, 4:23:47 UTC |
Sent | 10 Mar 2014, 4:25:45 UTC |
Report deadline | 9 Jun 2014, 11:52:56 UTC |
Received | 11 Mar 2014, 14:03:11 UTC |
Server state | Over |
Outcome | Computation error |
Client state | Compute error |
Exit status | 22 (0x00000016) Unknown error code |
Computer ID | 1264979 |
Run time | 13 hours 32 min 4 sec |
CPU time | 10 hours 13 min 35 sec |
Validate state | Invalid |
Credit | 311.04 |
Device peak FLOPS | 2.85 GFLOPS |
Application version | UK Met Office Coupled Model Full Resolution Ocean v6.07 windows_intelx86 |
Stderr | <core_client_version>7.0.28</core_client_version> <![CDATA[ <message> The device does not recognize the command. (0x16) - exit code 22 (0x16) </message> <stderr_txt> Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... CPDN Monitor - Quit request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... CPDN Monitor - Quit request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... 10:00:08 (4348): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 10:00:09 (4348): No heartbeat from core client for 30 sec - exiting 10:00:10 (4348): No heartbeat from core client for 30 sec - exiting 10:00:11 (4348): No heartbeat from core client for 30 sec - exiting 10:00:12 (4348): No heartbeat from core client for 30 sec - exiting 10:00:13 (4348): No heartbeat from core client for 30 sec - exiting 10:00:14 (4348): No heartbeat from core client for 30 sec - exiting 10:01:53 (2544): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 10:01:55 (2544): No heartbeat from core client for 30 sec - exiting 10:01:56 (2544): No heartbeat from core client for 30 sec - exiting 10:01:57 (2544): No heartbeat from core client for 30 sec - exiting 10:01:58 (2544): No heartbeat from core client for 30 sec - exiting 10:01:59 (2544): No heartbeat from core client for 30 sec - exiting 10:02:00 (2544): No heartbeat from core client for 30 sec - exiting 10:02:01 (2544): No heartbeat from core client for 30 sec - exiting 10:02:02 (2544): No heartbeat from core client for 30 sec - exiting 10:02:03 (2544): No heartbeat from core client for 30 sec - exiting 10:02:04 (2544): No heartbeat from core client for 30 sec - exiting Suspended CPDN Monitor - Suspend request from BOINC... 11:05:20 (10308): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 11:06:44 (10084): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 11:09:21 (6504): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 11:09:24 (6504): No heartbeat from core client for 30 sec - exiting 11:09:25 (6504): No heartbeat from core client for 30 sec - exiting 11:09:26 (6504): No heartbeat from core client for 30 sec - exiting Suspended CPDN Monitor - Suspend request from BOINC... 12:16:44 (10612): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 12:16:45 (10612): No heartbeat from core client for 30 sec - exiting 12:16:46 (10612): No heartbeat from core client for 30 sec - exiting 12:17:57 (10908): No heartbeat from core client for 30 sec - exiting 12:18:00 (10908): No heartbeat from core client for 30 sec - exiting 12:18:01 (10908): No heartbeat from core client for 30 sec - exiting 12:18:02 (10908): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... Atmos Hold Restart file rename failed on atmos_restart.hold 12:25:06 (8908): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 12:25:07 (8908): No heartbeat from core client for 30 sec - exiting 12:25:08 (8908): No heartbeat from core client for 30 sec - exiting 12:25:09 (8908): No heartbeat from core client for 30 sec - exiting 12:26:03 (9616): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 12:28:52 (10932): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 12:28:54 (10932): No heartbeat from core client for 30 sec - exiting 12:28:55 (10932): No heartbeat from core client for 30 sec - exiting 12:28:56 (10932): No heartbeat from core client for 30 sec - exiting 12:28:57 (10932): No heartbeat from core client for 30 sec - exiting 12:28:58 (10932): No heartbeat from core client for 30 sec - exiting 12:28:59 (10932): No heartbeat from core client for 30 sec - exiting 12:29:00 (10932): No heartbeat from core client for 30 sec - exiting 12:29:01 (10932): No heartbeat from core client for 30 sec - exiting 12:29:02 (10932): No heartbeat from core client for 30 sec - exiting 12:29:03 (10932): No heartbeat from core client for 30 sec - exiting 12:30:47 (8320): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 13:01:27 (4708): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 13:02:39 (8788): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 13:02:48 (8788): No heartbeat from core client for 30 sec - exiting 13:02:49 (8788): No heartbeat from core client for 30 sec - exiting 13:02:50 (8788): No heartbeat from core client for 30 sec - exiting 13:03:29 (1160): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 13:04:15 (7488): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 13:04:47 (2940): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... forrtl: Access is denied. Suspended CPDN Monitor - Suspend request from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=10168, iMonCtr=1 Model crash detected, will try to restart... CPDN Monitor - Quit request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... CPDN Monitor - Quit request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... 03:08:34 (3412): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 03:08:35 (3412): No heartbeat from core client for 30 sec - exiting 03:09:17 (1120): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 03:09:26 (1120): No heartbeat from core client for 30 sec - exiting 03:09:27 (1120): No heartbeat from core client for 30 sec - exiting 03:09:28 (1120): No heartbeat from core client for 30 sec - exiting 03:09:29 (1120): No heartbeat from core client for 30 sec - exiting 03:09:30 (1120): No heartbeat from core client for 30 sec - exiting 03:09:31 (1120): No heartbeat from core client for 30 sec - exiting 03:09:32 (1120): No heartbeat from core client for 30 sec - exiting 03:09:33 (1120): No heartbeat from core client for 30 sec - exiting 03:09:34 (1120): No heartbeat from core client for 30 sec - exiting 03:09:35 (1120): No heartbeat from core client for 30 sec - exiting Suspended CPDN Monitor - Suspend request from BOINC... 03:11:48 (11428): No heartbeat from core client for 30 sec - exiting Suspended CPDN Monitor - No 'heartbeat' from BOINC... 03:11:49 (11428): No heartbeat from core client for 30 sec - exiting 03:11:50 (11428): No heartbeat from core client for 30 sec - exiting 03:11:51 (11428): No heartbeat from core client for 30 sec - exiting 03:11:52 (11428): No heartbeat from core client for 30 sec - exiting 03:29:39 (9628): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 03:30:19 (10680): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 03:30:20 (10680): No heartbeat from core client for 30 sec - exiting 03:30:21 (10680): No heartbeat from core client for 30 sec - exiting 03:30:22 (10680): No heartbeat from core client for 30 sec - exiting 03:30:23 (10680): No heartbeat from core client for 30 sec - exiting 03:30:24 (10680): No heartbeat from core client for 30 sec - exiting 03:30:25 (10680): No heartbeat from core client for 30 sec - exiting 03:30:26 (10680): No heartbeat from core client for 30 sec - exiting 03:30:27 (10680): No heartbeat from core client for 30 sec - exiting 03:30:28 (10680): No heartbeat from core client for 30 sec - exiting 03:30:29 (10680): No heartbeat from core client for 30 sec - exiting 03:41:37 (2664): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 03:43:43 (1232): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 03:46:27 (6644): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 03:48:18 (11132): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 03:58:03 (11768): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 03:58:04 (11768): No heartbeat from core client for 30 sec - exiting 03:58:05 (11768): No heartbeat from core client for 30 sec - exiting 03:58:06 (11768): No heartbeat from core client for 30 sec - exiting 03:58:07 (11768): No heartbeat from core client for 30 sec - exiting 03:58:08 (11768): No heartbeat from core client for 30 sec - exiting 03:58:09 (11768): No heartbeat from core client for 30 sec - exiting 03:58:10 (11768): No heartbeat from core client for 30 sec - exiting 03:58:11 (11768): No heartbeat from core client for 30 sec - exiting 03:58:12 (11768): No heartbeat from core client for 30 sec - exiting 03:58:13 (11768): No heartbeat from core client for 30 sec - exiting Suspended CPDN Monitor - Suspend request from BOINC... 04:34:55 (8672): No heartbeat from core client for 30 sec - exiting 04:34:56 (8672): No heartbeat from core client for 30 sec - exiting 04:34:57 (8672): No heartbeat from core client for 30 sec - exiting 04:34:58 (8672): No heartbeat from core client for 30 sec - exiting 04:34:59 (8672): No heartbeat from core client for 30 sec - exiting 04:35:00 (8672): No heartbeat from core client for 30 sec - exiting 04:35:01 (8672): No heartbeat from core client for 30 sec - exiting 04:35:02 (8672): No heartbeat from core client for 30 sec - exiting 04:35:03 (8672): No heartbeat from core client for 30 sec - exiting 04:35:04 (8672): No heartbeat from core client for 30 sec - exiting 04:35:35 (8672): No heartbeat from core client for 30 sec - exiting 04:35:36 (8672): No heartbeat from core client for 30 sec - exiting 04:35:37 (8672): No heartbeat from core client for 30 sec - exiting 04:35:38 (8672): No heartbeat from core client for 30 sec - exiting 04:35:39 (8672): No heartbeat from core client for 30 sec - exiting 04:35:40 (8672): No heartbeat from core client for 30 sec - exiting 04:49:41 (8672): No heartbeat from core client for 30 sec - exiting 04:49:42 (8672): No heartbeat from core client for 30 sec - exiting 04:49:43 (8672): No heartbeat from core client for 30 sec - exiting 04:49:44 (8672): No heartbeat from core client for 30 sec - exiting 04:49:45 (8672): No heartbeat from core client for 30 sec - exiting 04:49:46 (8672): No heartbeat from core client for 30 sec - exiting 04:49:47 (8672): No heartbeat from core client for 30 sec - exiting 04:49:48 (8672): No heartbeat from core client for 30 sec - exiting 04:49:49 (8672): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 05:03:23 (6596): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 05:22:46 (9016): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 05:23:59 (7072): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 05:28:44 (1828): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 05:28:51 (1828): No heartbeat from core client for 30 sec - exiting 05:28:52 (1828): No heartbeat from core client for 30 sec - exiting 05:28:53 (1828): No heartbeat from core client for 30 sec - exiting 05:28:54 (1828): No heartbeat from core client for 30 sec - exiting 05:28:55 (1828): No heartbeat from core client for 30 sec - exiting 05:28:56 (1828): No heartbeat from core client for 30 sec - exiting 05:28:57 (1828): No heartbeat from core client for 30 sec - exiting 05:28:58 (1828):Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... 06:16:43 (4328): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 06:16:44 (4328): No heartbeat from core client for 30 sec - exiting Suspended CPDN Monitor - Suspend request from BOINC... CPDN Monitor - Quit request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... CPDN Monitor - Quit request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Signal 22 received, exiting... Called boinc_finish Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5696, iMonCtr=1 Model crash detected, will try to restart... Signal 22 received, exiting... Called boinc_finish Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5696, iMonCtr=1 Model crash detected, will try to restart... Signal 22 received, exiting... Called boinc_finish Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5696, iMonCtr=1 Model crash detected, will try to restart... Signal 22 received, exiting... Called boinc_finish Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5696, iMonCtr=1 Model crash detected, will try to restart... Signal 22 received, exiting... Called boinc_finish Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5696, iMonCtr=1 Model crash detected, will try to restart... Signal 22 received, exiting... Called boinc_finish Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5696, iMonCtr=1 Model crash detected, will try to restart... Sorry, too many model crashes! :-( Called boinc_finish </stderr_txt> ]]> |
Latest Trickles Received | ||||||
---|---|---|---|---|---|---|
Time Sent (UTC) | Host ID | Result ID | Result Name | Timestep | CPU Time (sec) | Average (sec/TS) |
11 Mar 2014 10:30:18 | 1264979 | 16363347 | hadcm3n_80bc_1980_40_008458347_3 | 25,920 | 35,098 | 1.3541 |
©2024 cpdn.org