Name | hadcm3n_od7n_1900_40_008472246_2 |
Workunit | 8623085 |
Created | 13 Oct 2013, 13:51:37 UTC |
Sent | 13 Oct 2013, 13:52:08 UTC |
Report deadline | 12 Jan 2014, 21:19:19 UTC |
Received | 29 Oct 2013, 0:53:45 UTC |
Server state | Over |
Outcome | Computation error |
Client state | Compute error |
Exit status | 193 (0x000000C1) EXIT_SIGNAL |
Computer ID | 1072815 |
Run time | 5 days 16 hours 37 min 10 sec |
CPU time | 5 days 10 hours 43 min 24 sec |
Validate state | Invalid |
Credit | 3,110.40 |
Device peak FLOPS | 2.68 GFLOPS |
Application version | UK Met Office Coupled Model Full Resolution Ocean v6.07 windows_intelx86 |
Stderr | <core_client_version>7.0.64</core_client_version> <![CDATA[ <message> (unknown error) - exit code 193 (0xc1) </message> <stderr_txt> Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... 18:51:39 (6032): No heartbeat from core client for 30 sec - exiting 18:51:40 (6032): No heartbeat from core client for 30 sec - exiting 18:51:41 (6032): No heartbeat from core client for 30 sec - exiting 18:51:42 (6032): No heartbeat from core client for 30 sec - exiting 18:51:43 (6032): No heartbeat from core client for 30 sec - exiting 18:51:44 (6032): No heartbeat from core client for 30 sec - exiting 18:51:45 (6032): No heartbeat from core client for 30 sec - exiting 18:51:46 (6032): No heartbeat from core client for 30 sec - exiting 18:51:47 (6032): No heartbeat from core client for 30 sec - exiting 18:51:48 (6032): No heartbeat from core client for 30 sec - exiting 18:51:49 (6032): No heartbeat from core client for 30 sec - exiting 18:51:50 (6032): No heartbeat from core client for 30 sec - exiting 18:51:51 (6032): No heartbeat from core client for 30 sec - exiting 18:51:52 (6032): No heartbeat from core client for 30 sec - exiting 18:51:53 (6032): No heartbeat from core client for 30 sec - exiting 18:51:54 (6032): No heartbeat from core client for 30 sec - exiting 18:51:55 (6032): No heartbeat from core client for 30 sec - exiting 18:51:56 (6032): No heartbeat from core client for 30 sec - exiting 18:51:57 (6032): No heartbeat from core client for 30 sec - exiting 18:51:58 (6032): No heartbeat from core client for 30 sec - exiting 18:51:59 (6032): No heartbeat from core client for 30 sec - exiting 18:52:00 (6032): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 18:53:41 (6676): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... 20:32:24 (1120): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... 20:03:02 (6404): No heartbeat from core client for 30 sec - exiting 20:03:03 (6404): No heartbeat from core client for 30 sec - exiting 20:03:04 (6404): No heartbeat from core client for 30 sec - exiting 20:03:05 (6404): No heartbeat from core client for 30 sec - exiting 20:03:06 (6404): No heartbeat from core client for 30 sec - exiting 20:03:07 (6404): No heartbeat from core client for 30 sec - exiting 20:03:08 (6404): No heartbeat from core client for 30 sec - exiting 20:03:09 (6404): No heartbeat from core client for 30 sec - exiting 20:03:10 (6404): No heartbeat from core client for 30 sec - exiting 20:03:11 (6404): No heartbeat from core client for 30 sec - exiting 20:03:12 (6404): No heartbeat from core client for 30 sec - exiting 20:03:13 (6404): No heartbeat from core client for 30 sec - exiting 20:03:14 (6404): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=6556, iMonCtr=1 Model crash detected, will try to restart... 08:54:24 (3540): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... 11:55:57 (2772): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 11:56:30 (5808): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... 00:59:59 (6672): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... CPDN Monitor - Quit request from BOINC... 19:55:09 (5784): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=1980, iMonCtr=1 Model crash detected, will try to restart... 22:00:14 (4284): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=1288, iMonCtr=1 Model crash detected, will try to restart... 19:39:45 (7120): No heartbeat from core client for 30 sec - exiting 19:39:46 (7120): No heartbeat from core client for 30 sec - exiting 19:39:47 (7120): No heartbeat from core client for 30 sec - exiting 19:39:48 (7120): No heartbeat from core client for 30 sec - exiting 19:39:49 (7120): No heartbeat from core client for 30 sec - exiting 19:39:50 (7120): No heartbeat from core client for 30 sec - exiting 19:39:51 (7120): No heartbeat from core client for 30 sec - exiting 19:39:52 (7120): No heartbeat from core client for 30 sec - exiting 19:39:53 (7120): No heartbeat from core client for 30 sec - exiting 19:39:54 (7120): No heartbeat from core client for 30 sec - exiting 19:39:55 (7120): No heartbeat from core client for 30 sec - exiting 19:39:56 (7120): No heartbeat from core client for 30 sec - exiting 19:39:57 (7120): No heartbeat from core client for 30 sec - exiting 19:39:58 (7120): No heartbeat from core client for 30 sec - exiting 19:39:59 (7120): No heartbeat from core client for 30 sec - exiting 19:40:00 (7120): No heartbeat from core client for 30 sec - exiting 19:40:01 (7120): No heartbeat from core client for 30 sec - exiting 19:40:02 (7120): No heartbeat from core client for 30 sec - exiting 19:40:03 (7120): No heartbeat from core client for 30 sec - exiting 19:40:04 (7120): No heartbeat from core client for 30 sec - exiting 19:40:05 (7120): No heartbeat from core client for 30 sec - exiting 19:40:06 (7120): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... 19:42:16 (5676): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... 18:25:07 (5832): No heartbeat from core client for 30 sec - exiting 18:25:08 (5832): No heartbeat from core client for 30 sec - exiting 18:25:09 (5832): No heartbeat from core client for 30 sec - exiting 18:25:10 (5832): No heartbeat from core client for 30 sec - exiting 18:25:11 (5832): No heartbeat from core client for 30 sec - exiting 18:25:12 (5832): No heartbeat from core client for 30 sec - exiting 18:25:13 (5832): No heartbeat from core client for 30 sec - exiting 18:25:14 (5832): No heartbeat from core client for 30 sec - exiting 18:25:15 (5832): No heartbeat from core client for 30 sec - exiting 18:25:16 (5832): No heartbeat from core client for 30 sec - exiting 18:25:17 (5832): No heartbeat from core client for 30 sec - exiting 18:25:18 (5832): No heartbeat from core client for 30 sec - exiting 18:25:19 (5832): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 18:25:58 (3268): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... CPDN Monitor - Quit request from BOINC... 18:51:14 (6800): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... 20:03:09 (5048): No heartbeat from core client for 30 sec - exiting 20:03:10 (5048): No heartbeat from core client for 30 sec - exiting 20:03:11 (5048): No heartbeat from core client for 30 sec - exiting 20:03:12 (5048): No heartbeat from core client for 30 sec - exiting 20:03:13 (5048): No heartbeat from core client for 30 sec - exiting 20:03:14 (5048): No heartbeat from core client for 30 sec - exiting 20:03:15 (5048): No heartbeat from core client for 30 sec - exiting 20:03:16 (5048): No heartbeat from core client for 30 sec - exiting 20:03:17 (5048): No heartbeat from core client for 30 sec - exiting 20:03:18 (5048): No heartbeat from core client for 30 sec - exiting 20:03:19 (5048): No heartbeat from core client for 30 sec - exiting 20:03:20 (5048): No heartbeat from core client for 30 sec - exiting 20:03:21 (5048): No heartbeat from core client for 30 sec - exiting 20:03:22 (5048): No heartbeat from core client for 30 sec - exiting 20:03:23 (5048): No heartbeat from core client for 30 sec - exiting 20:03:24 (5048): No heartbeat from core client for 30 sec - exiting 20:03:25 (5048): No heartbeat from core client for 30 sec - exiting 20:03:26 (5048): No heartbeat from core client for 30 sec - exiting 20:03:27 (5048): No heartbeat from core client for 30 sec - exiting 20:03:28 (5048): No heartbeat from core client for 30 sec - exiting 20:03:29 (5048): No heartbeat from core client for 30 sec - exiting 20:03:30 (5048): No heartbeat from core client for 30 sec - exiting 20:03:31 (5048): No heartbeat from core client for 30 sec - exiting 20:03:32 (5048): No heartbeat from core client for 30 sec - exiting 20:03:33 (5048): No heartbeat from core client for 30 sec - exiting 20:03:34 (5048): No heartbeat from core client for 30 sec - exiting 20:03:35 (5048): No heartbeat from core client for 30 sec - exiting 20:03:36 (5048): No heartbeat from core client for 30 sec - exiting 20:03:37 (5048): No heartbeat from core client for 30 sec - exiting 20:03:38 (5048): No heartbeat from core client for 30 sec - exiting 20:03:39 (5048): No heartbeat from core client for 30 sec - exiting 20:03:40 (5048): No heartbeat from core client for 30 sec - exiting 20:03:41 (5048): No heartbeat from core client for 30 sec - exiting 20:03:42 (5048): No heartbeat from core client for 30 sec - exiting 20:03:43 (5048): No heartbeat from core client for 30 sec - exiting 20:03:44 (5048): No heartbeat from core client for 30 sec - exiting 20:03:45 (5048): No heartbeat from core client for 30 sec - exiting 20:03:46 (5048): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... 20:07:32 (5788): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... 12:06:58 (1564): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... 00:59:55 (5980): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... 12:23:19 (6456): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 12:24:08 (856): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... CPDN Monitor - Quit request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... CPDN Monitor - Quit request from BOINC... 18:24:54 (2084): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... 18:26:03 (5468): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Signal 11 received, exiting... Called boinc_finish </stderr_txt> ]]> |
Latest Trickles Received | ||||||
---|---|---|---|---|---|---|
Time Sent (UTC) | Host ID | Result ID | Result Name | Timestep | CPU Time (sec) | Average (sec/TS) |
28 Oct 2013 23:53:50 | 1072815 | 16066991 | hadcm3n_od7n_1900_40_008472246_2 | 259,200 | 470,594 | 1.8156 |
27 Oct 2013 13:30:36 | 1072815 | 16066991 | hadcm3n_od7n_1900_40_008472246_2 | 233,280 | 422,110 | 1.8095 |
26 Oct 2013 12:59:06 | 1072815 | 16066991 | hadcm3n_od7n_1900_40_008472246_2 | 207,360 | 375,246 | 1.8096 |
24 Oct 2013 19:41:16 | 1072815 | 16066991 | hadcm3n_od7n_1900_40_008472246_2 | 181,440 | 328,803 | 1.8122 |
21 Oct 2013 23:13:47 | 1072815 | 16066991 | hadcm3n_od7n_1900_40_008472246_2 | 155,520 | 280,992 | 1.8068 |
20 Oct 2013 12:00:52 | 1072815 | 16066991 | hadcm3n_od7n_1900_40_008472246_2 | 129,600 | 233,755 | 1.8037 |
19 Oct 2013 11:41:14 | 1072815 | 16066991 | hadcm3n_od7n_1900_40_008472246_2 | 103,680 | 187,352 | 1.8070 |
18 Oct 2013 12:31:48 | 1072815 | 16066991 | hadcm3n_od7n_1900_40_008472246_2 | 77,760 | 141,078 | 1.8143 |
16 Oct 2013 22:02:57 | 1072815 | 16066991 | hadcm3n_od7n_1900_40_008472246_2 | 51,840 | 94,123 | 1.8156 |
14 Oct 2013 20:23:21 | 1072815 | 16066991 | hadcm3n_od7n_1900_40_008472246_2 | 25,920 | 46,494 | 1.7938 |
©2024 cpdn.org