Name | hadcm3n_4e17_1940_40_008302403_0 |
Workunit | 8453538 |
Created | 6 Feb 2013, 18:55:43 UTC |
Sent | 6 Feb 2013, 18:56:18 UTC |
Report deadline | 9 May 2013, 2:23:29 UTC |
Received | 5 Jun 2013, 14:04:54 UTC |
Server state | Over |
Outcome | Computation error |
Client state | Compute error |
Exit status | 25 (0x00000019) Unknown error code |
Computer ID | 1179386 |
Run time | 6 days 18 hours 47 min 52 sec |
CPU time | 6 days 18 hours 16 min 15 sec |
Validate state | Invalid |
Credit | 4,976.64 |
Device peak FLOPS | 2.47 GFLOPS |
Application version | UK Met Office Coupled Model Full Resolution Ocean v6.07 windows_intelx86 |
Stderr | <core_client_version>7.0.64</core_client_version> <![CDATA[ <message> The drive cannot locate a specific area or track on the disk. (0x19) - exit code 25 (0x19) </message> <stderr_txt> CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... 14:37:57 (9100): No heartbeat from core client for 30 sec - exiting 14:37:58 (9100): No heartbeat from core client for 30 sec - exiting 14:37:59 (9100): No heartbeat from core client for 30 sec - exiting 14:38:00 (9100): No heartbeat from core client for 30 sec - exiting 14:38:01 (9100): No heartbeat from core client for 30 sec - exiting 14:38:02 (9100): No heartbeat from core client for 30 sec - exiting 14:38:03 (9100): No heartbeat from core client for 30 sec - exiting 14:38:04 (9100): No heartbeat from core client for 30 sec - exiting 14:38:05 (9100): No heartbeat from core client for 30 sec - exiting 14:38:06 (9100): No heartbeat from core client for 30 sec - exiting 14:38:07 (9100): No heartbeat from core client for 30 sec - exiting 14:38:09 (9100): No heartbeat from core client for 30 sec - exiting 14:38:10 (9100): No heartbeat from core client for 30 sec - exiting 14:38:11 (9100): No heartbeat from core client for 30 sec - exiting 14:38:12 (9100): No heartbeat from core client for 30 sec - exiting 14:38:13 (9100): No heartbeat from core client for 30 sec - exiting 14:38:14 (9100): No heartbeat from core client for 30 sec - exiting 14:38:15 (9100): No heartbeat from core client for 30 sec - 
exiting 14:38:16 (9100): No heartbeat from core client for 30 sec - exiting 14:38:17 (9100): No heartbeat from core client for 30 sec - exiting 14:38:18 (9100): No heartbeat from core client for 30 sec - exiting 14:38:20 (9100): No heartbeat from core client for 30 sec - exiting 14:38:21 (9100): No heartbeat from core client for 30 sec - exiting 14:38:22 (9100): No heartbeat from core client for 30 sec - exiting 14:38:23 (9100): No heartbeat from core client for 30 sec - exiting 14:38:24 (9100): No heartbeat from core client for 30 sec - exiting 14:38:25 (9100): No heartbeat from core client for 30 sec - exiting 14:38:26 (9100): No heartbeat from core client for 30 sec - exiting 14:38:27 (9100): No heartbeat from core client for 30 sec - exiting 14:38:28 (9100): No heartbeat from core client for 30 sec - exiting 14:38:29 (9100): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 14:38:30 (9100): No heartbeat from core client for 30 sec - exiting CPDN Monitor - Quit request from BOINC... 09:05:29 (1808): No heartbeat from core client for 30 sec - exiting 09:05:30 (1808): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... CPDN Monitor - Quit request from BOINC... 
15:08:02 (5128): No heartbeat from core client for 30 sec - exiting 15:08:03 (5128): No heartbeat from core client for 30 sec - exiting 15:08:04 (5128): No heartbeat from core client for 30 sec - exiting 15:08:05 (5128): No heartbeat from core client for 30 sec - exiting 15:08:06 (5128): No heartbeat from core client for 30 sec - exiting 15:08:08 (5128): No heartbeat from core client for 30 sec - exiting 15:08:09 (5128): No heartbeat from core client for 30 sec - exiting 15:08:10 (5128): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 15:19:40 (3044): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... CPDN Monitor - Quit request from BOINC... 22:03:42 (6112): No heartbeat from core client for 30 sec - exiting 22:03:43 (6112): No heartbeat from core client for 30 sec - exiting 22:03:44 (6112): No heartbeat from core client for 30 sec - exiting 22:03:45 (6112): No heartbeat from core client for 30 sec - exiting 22:03:46 (6112): No heartbeat from core client for 30 sec - exiting 22:03:47 (6112): No heartbeat from core client for 30 sec - exiting 22:03:48 (6112): No heartbeat from core client for 30 sec - exiting 22:03:49 (6112): No heartbeat from core client for 30 sec - exiting 22:03:50 (6112): No heartbeat from core client for 30 sec - exiting 22:03:51 (6112): No heartbeat from core client for 30 sec - exiting 22:03:52 (6112): No heartbeat from core client for 30 sec - exiting 22:03:53 (6112): No heartbeat from core client for 30 sec - exiting 22:03:54 (6112): No heartbeat from core client for 30 sec - exiting 22:03:55 (6112): No heartbeat from core client for 30 sec - exiting 22:03:56 (6112): No heartbeat from core client for 30 sec - exiting 22:03:57 (6112): No heartbeat from core client for 30 sec - exiting 22:03:58 (6112): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 
22:04:32 (8788): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... CPDN Monitor - Quit request from BOINC... 07:31:57 (864): No heartbeat from core client for 30 sec - exiting 07:31:58 (864): No heartbeat from core client for 30 sec - exiting 07:31:59 (864): No heartbeat from core client for 30 sec - exiting 07:32:01 (864): No heartbeat from core client for 30 sec - exiting 07:32:02 (864): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... 21:14:47 (11640): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... 21:05:55 (11668): No heartbeat from core client for 30 sec - exiting 21:05:56 (11668): No heartbeat from core client for 30 sec - exiting 21:05:57 (11668): No heartbeat from core client for 30 sec - exiting 21:05:58 (11668): No heartbeat from core client for 30 sec - exiting 21:05:59 (11668): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 21:06:00 (11668): No heartbeat from core client for 30 sec - exiting CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... 10:52:19 (2708): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... CPDN Monitor - Quit request from BOINC... 
03:00:58 (7148): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=11180, iMonCtr=1 Model crash detected, will try to restart... 23:07:52 (8124): No heartbeat from core client for 30 sec - exiting 23:07:53 (8124): No heartbeat from core client for 30 sec - exiting 23:07:54 (8124): No heartbeat from core client for 30 sec - exiting 23:07:55 (8124): No heartbeat from core client for 30 sec - exiting 23:07:56 (8124): No heartbeat from core client for 30 sec - exiting 23:07:57 (8124): No heartbeat from core client for 30 sec - exiting 23:07:58 (8124): No heartbeat from core client for 30 sec - exiting 23:07:59 (8124): No heartbeat from core client for 30 sec - exiting 23:08:00 (8124): No heartbeat from core client for 30 sec - exiting 23:08:01 (8124): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 23:12:50 (5880): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... Atmos Hold Restart file rename failed on atmos_restart.hold CPDN Monitor - Quit request from BOINC... 15:59:33 (6404): No heartbeat from core client for 30 sec - exiting 15:59:34 (6404): No heartbeat from core client for 30 sec - exiting 15:59:35 (6404): No heartbeat from core client for 30 sec - exiting 15:59:36 (6404): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... 
09:43:19 (6280): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... 14:35:18 (2148): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 14:35:19 (2148): No heartbeat from core client for 30 sec - exiting 14:35:20 (2148): No heartbeat from core client for 30 sec - exiting 14:35:21 (2148): No heartbeat from core client for 30 sec - exiting 14:35:22 (2148): No heartbeat from core client for 30 sec - exiting 14:35:23 (2148): No heartbeat from core client for 30 sec - exiting 14:35:24 (2148): No heartbeat from core client for 30 sec - exiting 14:35:25 (2148): No heartbeat from core client for 30 sec - exiting 14:35:26 (2148): No heartbeat from core client for 30 sec - exiting 14:35:27 (2148): No heartbeat from core client for 30 sec - exiting 14:35:28 (2148): No heartbeat from core client for 30 sec - exiting CPDN Monitor - Quit request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... 20:51:01 (8080): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... 12:49:41 (6980): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... CPDN Monitor - Quit request from BOINC... 08:14:42 (8920): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... CPDN Monitor - Quit request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... 08:57:31 (7556): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 
CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... </stderr_txt> ]]> |
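The Stderr field above is long and repetitive, so a tally of message types can make it easier to read. The following is a minimal sketch, not part of the original page: the `SAMPLE` text is a short excerpt copied from the log, the labels in `PATTERNS` are hypothetical names chosen for illustration, and treating the number in parentheses as a process ID is an assumption based on the log's appearance.

```python
import re
from collections import Counter

# Short excerpt copied from the Stderr field; a real run would pass the full text.
SAMPLE = (
    "CPDN Monitor - Quit request from BOINC... "
    "14:37:57 (9100): No heartbeat from core client for 30 sec - exiting "
    "14:37:58 (9100): No heartbeat from core client for 30 sec - exiting "
    "CPDN Monitor - No 'heartbeat' from BOINC... "
    "Suspended CPDN Monitor - Suspend request from BOINC... "
)

# Hypothetical labels mapped to the literal message fragments seen in the log.
PATTERNS = {
    "quit_request": r"CPDN Monitor - Quit request from BOINC",
    "suspend_request": r"CPDN Monitor - Suspend request from BOINC",
    "monitor_no_heartbeat": r"CPDN Monitor - No 'heartbeat' from BOINC",
    # The captured group is the number in parentheses (assumed to be a process ID).
    "client_no_heartbeat": r"\d{2}:\d{2}:\d{2} \((\d+)\): No heartbeat from core client",
}


def tally(text):
    """Count how many times each message type occurs in the log text."""
    counts = Counter()
    for label, pattern in PATTERNS.items():
        counts[label] = len(re.findall(pattern, text))
    return counts


def heartbeat_ids(text):
    """Return the distinct parenthesised numbers on 'No heartbeat' lines, sorted."""
    found = re.findall(PATTERNS["client_no_heartbeat"], text)
    return sorted({int(value) for value in found})


if __name__ == "__main__":
    for label, count in tally(SAMPLE).items():
        print(f"{label}: {count}")
    print("ids:", heartbeat_ids(SAMPLE))
```

Run against the full log, the number of distinct IDs would give a rough count of how many separate runs of the model lost contact with the client, under the process-ID assumption stated above.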
Latest Trickles Received

Time Sent (UTC) | Host ID | Result ID | Result Name | Timestep | CPU Time (sec) | Average (sec/TS) |
---|---|---|---|---|---|---|
04 Jun 2013 18:51:13 | 1179386 | 15587759 | hadcm3n_4e17_1940_40_008302403_0 | 414,720 | 573,428 | 1.3827 |
04 Jun 2013 17:23:58 | 1179386 | 15587759 | hadcm3n_4e17_1940_40_008302403_0 | 388,800 | 545,229 | 1.4023 |
03 Jun 2013 03:39:29 | 1179386 | 15587759 | hadcm3n_4e17_1940_40_008302403_0 | 362,880 | 513,307 | 1.4145 |
30 May 2013 20:49:43 | 1179386 | 15587759 | hadcm3n_4e17_1940_40_008302403_0 | 336,960 | 479,485 | 1.4230 |
27 May 2013 01:16:20 | 1179386 | 15587759 | hadcm3n_4e17_1940_40_008302403_0 | 311,040 | 447,121 | 1.4375 |
17 May 2013 00:42:14 | 1179386 | 15587759 | hadcm3n_4e17_1940_40_008302403_0 | 285,120 | 414,778 | 1.4547 |
28 Apr 2013 04:24:44 | 1179386 | 15587759 | hadcm3n_4e17_1940_40_008302403_0 | 259,200 | 377,941 | 1.4581 |
27 Apr 2013 03:01:14 | 1179386 | 15587759 | hadcm3n_4e17_1940_40_008302403_0 | 233,280 | 343,997 | 1.4746 |
06 Apr 2013 03:41:36 | 1179386 | 15587759 | hadcm3n_4e17_1940_40_008302403_0 | 207,360 | 310,376 | 1.4968 |
03 Apr 2013 02:12:58 | 1179386 | 15587759 | hadcm3n_4e17_1940_40_008302403_0 | 181,440 | 268,796 | 1.4815 |
01 Apr 2013 23:30:03 | 1179386 | 15587759 | hadcm3n_4e17_1940_40_008302403_0 | 155,520 | 230,609 | 1.4828 |
17 Mar 2013 02:27:35 | 1179386 | 15587759 | hadcm3n_4e17_1940_40_008302403_0 | 129,600 | 194,953 | 1.5043 |
05 Mar 2013 05:19:54 | 1179386 | 15587759 | hadcm3n_4e17_1940_40_008302403_0 | 103,680 | 158,291 | 1.5267 |
02 Mar 2013 19:39:29 | 1179386 | 15587759 | hadcm3n_4e17_1940_40_008302403_0 | 77,760 | 119,175 | 1.5326 |
02 Mar 2013 05:26:47 | 1179386 | 15587759 | hadcm3n_4e17_1940_40_008302403_0 | 51,840 | 80,071 | 1.5446 |
17 Feb 2013 18:44:20 | 1179386 | 15587759 | hadcm3n_4e17_1940_40_008302403_0 | 25,920 | 37,748 | 1.4563 |
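The Average (sec/TS) column appears to be the cumulative CPU time divided by the cumulative timestep count. The sketch below checks that reading against three rows copied from the table; the function name and the rounding tolerance are illustrative assumptions, not part of the CPDN site.

```python
# Rows copied from the trickle table: (timestep, cpu_time_sec, reported_average).
ROWS = [
    (414_720, 573_428, 1.3827),
    (388_800, 545_229, 1.4023),
    (25_920, 37_748, 1.4563),
]


def sec_per_timestep(cpu_time_sec, timestep):
    """Cumulative CPU seconds spent per model timestep."""
    return cpu_time_sec / timestep


def matches_reported(rows, tolerance=1e-4):
    """True if every recomputed average agrees with the reported value.

    The tolerance allows for the table rounding to four decimal places.
    """
    return all(
        abs(sec_per_timestep(cpu, ts) - reported) < tolerance
        for ts, cpu, reported in rows
    )


if __name__ == "__main__":
    for ts, cpu, reported in ROWS:
        print(f"{ts:>7} timesteps: computed {sec_per_timestep(cpu, ts):.4f}, reported {reported}")
    print("all rows consistent:", matches_reported(ROWS))
```

Because the figure is cumulative, it changes slowly from one trickle to the next; dividing the difference in CPU time between two consecutive rows by the 25,920-timestep spacing would give the rate for that interval alone.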
©2024 cpdn.org