Name | hadcm3n_4au5_1940_40_008308822_0 |
Workunit | 8459957 |
Created | 7 Feb 2013, 18:50:01 UTC |
Sent | 7 Feb 2013, 19:45:43 UTC |
Report deadline | 10 May 2013, 3:12:54 UTC |
Received | 19 Feb 2013, 20:39:13 UTC |
Server state | Over |
Outcome | Computation error |
Client state | Compute error |
Exit status | 139 (0x0000008B) Unknown error code |
Computer ID | 1219011 |
Run time | 5 days 5 hours 11 min 39 sec |
CPU time | 4 days 23 hours 9 min 36 sec |
Validate state | Invalid |
Credit | 4,354.56 |
Device peak FLOPS | 4.00 GFLOPS |
Application version | UK Met Office Coupled Model Full Resolution Ocean v6.07 i686-pc-linux-gnu |
Stderr | <core_client_version>7.0.28</core_client_version> <![CDATA[ <message> process got signal 11 </message> <stderr_txt>
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Quit request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Quit request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Quit request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Quit request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Quit request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
00:53:40 (3484): No heartbeat from core client for 30 sec - exiting
Suspended CPDN Monitor - No 'heartbeat' from BOINC...
00:57:32 (14694): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
00:57:33 (14694): No heartbeat from core client for 30 sec - exiting
00:57:34 (14694): No heartbeat from core client for 30 sec - exiting
00:57:35 (14694): No heartbeat from core client for 30 sec - exiting
00:57:36 (14694): No heartbeat from core client for 30 sec - exiting
00:57:37 (14694): No heartbeat from core client for 30 sec - exiting
00:57:38 (14694): No heartbeat from core client for 30 sec - exiting
00:57:39 (14694): No heartbeat from core client for 30 sec - exiting
00:57:40 (14694): No heartbeat from core client for 30 sec - exiting
00:57:41 (14694): No heartbeat from core client for 30 sec - exiting
00:57:42 (14694): No heartbeat from core client for 30 sec - exiting
00:57:43 (14694): No heartbeat from core client for 30 sec - exiting
00:57:44 (14694): No heartbeat from core client for 30 sec - exiting
00:57:45 (14694): No heartbeat from core client for 30 sec - exiting
01:30:57 (14932): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
01:30:58 (14932): No heartbeat from core client for 30 sec - exiting
01:30:59 (14932): No heartbeat from core client for 30 sec - exiting
01:31:00 (14932): No heartbeat from core client for 30 sec - exiting
01:31:01 (14932): No heartbeat from core client for 30 sec - exiting
01:31:02 (14932): No heartbeat from core client for 30 sec - exiting
01:31:03 (14932): No heartbeat from core client for 30 sec - exiting
01:31:04 (14932): No heartbeat from core client for 30 sec - exiting
01:31:05 (14932): No heartbeat from core client for 30 sec - exiting
01:31:06 (14932): No heartbeat from core client for 30 sec - exiting
01:31:07 (14932): No heartbeat from core client for 30 sec - exiting
01:31:08 (14932): No heartbeat from core client for 30 sec - exiting
01:31:09 (14932): No heartbeat from core client for 30 sec - exiting
01:31:10 (14932): No heartbeat from core client for 30 sec - exiting
01:31:11 (14932): No heartbeat from core client for 30 sec - exiting
01:31:12 (14932): No heartbeat from core client for 30 sec - exiting
01:31:13 (14932): No heartbeat from core client for 30 sec - exiting
01:31:14 (14932): No heartbeat from core client for 30 sec - exiting
01:31:15 (14932): No heartbeat from core client for 30 sec - exiting
01:31:16 (14932): No heartbeat from core client for 30 sec - exiting
01:31:17 (14932): No heartbeat from core client for 30 sec - exiting
01:31:18 (14932): No heartbeat from core client for 30 sec - exiting
01:31:19 (14932): No heartbeat from core client for 30 sec - exiting
01:31:20 (14932): No heartbeat from core client for 30 sec - exiting
01:31:21 (14932): No heartbeat from core client for 30 sec - exiting
01:31:22 (14932): No heartbeat from core client for 30 sec - exiting
01:31:23 (14932): No heartbeat from core client for 30 sec - exiting
01:31:24 (14932): No heartbeat from core client for 30 sec - exiting
01:31:25 (14932): No heartbeat from core client for 30 sec - exiting
01:31:26 (14932): No heartbeat from core client for 30 sec - exiting
01:31:27 (14932): No heartbeat from core client for 30 sec - exiting
01:31:28 (14932): No heartbeat from core client for 30 sec - exiting
01:31:29 (14932): No heartbeat from core client for 30 sec - exiting
01:31:30 (14932): No heartbeat from core client for 30 sec - exiting
01:31:31 (14932): No heartbeat from core client for 30 sec - exiting
01:31:32 (14932): No heartbeat from core client for 30 sec - exiting
01:31:33 (14932): No heartbeat from core client for 30 sec - exiting
01:31:34 (14932): No heartbeat from core client for 30 sec - exiting
01:31:35 (14932): No heartbeat from core client for 30 sec - exiting
01:31:36 (14932): No heartbeat from core client for 30 sec - exiting
01:31:37 (14932): No heartbeat from core client for 30 sec - exiting
01:31:38 (14932): No heartbeat from core client for 30 sec - exiting
01:31:39 (14932): No heartbeat from core client for 30 sec - exiting
01:31:40 (14932): No heartbeat from core client for 30 sec - exiting
01:31:41 (14932): No heartbeat from core client for 30 sec - exiting
01:31:42 (14932): No heartbeat from core client for 30 sec - exiting
01:31:43 (14932): No heartbeat from core client for 30 sec - exiting
01:31:44 (14932): No heartbeat from core client for 30 sec - exiting
01:34:51 (16550): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
01:34:52 (16550): No heartbeat from core client for 30 sec - exiting
01:34:53 (16550): No heartbeat from core client for 30 sec - exiting
01:34:54 (16550): No heartbeat from core client for 30 sec - exiting
01:34:55 (16550): No heartbeat from core client for 30 sec - exiting
01:34:56 (16550): No heartbeat from core client for 30 sec - exiting
01:34:57 (16550): No heartbeat from core client for 30 sec - exiting
01:34:58 (16550): No heartbeat from core client for 30 sec - exiting
01:34:59 (16550): No heartbeat from core client for 30 sec - exiting
01:35:00 (16550): No heartbeat from core client for 30 sec - exiting
01:35:01 (16550): No heartbeat from core client for 30 sec - exiting
01:35:02 (16550): No heartbeat from core client for 30 sec - exiting
01:35:03 (16550): No heartbeat from core client for 30 sec - exiting
01:35:04 (16550): No heartbeat from core client for 30 sec - exiting
Atmos Hold Restart file rename failed on atmos_restart.hold
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Quit request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Quit request from BOINC...
</stderr_txt> ]]> |
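The exit status 139 and the `process got signal 11` message in the stderr appear to describe the same event. Under the common Unix shell convention, a process terminated by signal N is reported with status 128 + N, so 139 would correspond to signal 11 (SIGSEGV, a segmentation fault). Whether the BOINC client derives the value this way is an assumption here. The sketch below only illustrates the arithmetic, and `decode_exit_status` is a hypothetical helper name, not part of BOINC or CPDN.

```python
import signal
from typing import Optional


def decode_exit_status(status: int) -> Optional[signal.Signals]:
    """Map a shell-style exit status (128 + N) to signal N, if there is one.

    Assumption: the status follows the 128 + signal-number convention.
    Returns None when the status does not correspond to a known signal.
    """
    if status <= 128:
        return None
    try:
        return signal.Signals(status - 128)
    except ValueError:
        # Not a signal number defined on this platform.
        return None


sig = decode_exit_status(139)  # 139 == 0x8B, the value shown in the table
print(sig.name if sig else "not a signal-derived status")
```

On a typical Linux host this reports SIGSEGV, which is consistent with the `process got signal 11` message recorded by the client.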
Latest Trickles Received

Time Sent (UTC) | Host ID | Result ID | Result Name | Timestep | CPU Time (sec) | Average (sec/TS) |
---|---|---|---|---|---|---|
19 Feb 2013 19:42:43 | 1219011 | 15595726 | hadcm3n_4au5_1940_40_008308822_0 | 362,880 | 428,853 | 1.1818 |
18 Feb 2013 03:02:00 | 1219011 | 15595726 | hadcm3n_4au5_1940_40_008308822_0 | 336,960 | 400,257 | 1.1878 |
17 Feb 2013 18:11:37 | 1219011 | 15595726 | hadcm3n_4au5_1940_40_008308822_0 | 311,040 | 369,973 | 1.1895 |
14 Feb 2013 00:53:55 | 1219011 | 15595726 | hadcm3n_4au5_1940_40_008308822_0 | 285,120 | 341,203 | 1.1967 |
13 Feb 2013 13:20:13 | 1219011 | 15595726 | hadcm3n_4au5_1940_40_008308822_0 | 259,200 | 309,376 | 1.1936 |
13 Feb 2013 03:35:15 | 1219011 | 15595726 | hadcm3n_4au5_1940_40_008308822_0 | 233,280 | 276,072 | 1.1834 |
12 Feb 2013 05:43:58 | 1219011 | 15595726 | hadcm3n_4au5_1940_40_008308822_0 | 207,360 | 242,739 | 1.1706 |
11 Feb 2013 04:00:10 | 1219011 | 15595726 | hadcm3n_4au5_1940_40_008308822_0 | 181,440 | 212,916 | 1.1735 |
10 Feb 2013 17:16:21 | 1219011 | 15595726 | hadcm3n_4au5_1940_40_008308822_0 | 155,520 | 180,950 | 1.1635 |
09 Feb 2013 16:45:57 | 1219011 | 15595726 | hadcm3n_4au5_1940_40_008308822_0 | 129,600 | 147,857 | 1.1409 |
09 Feb 2013 07:06:25 | 1219011 | 15595726 | hadcm3n_4au5_1940_40_008308822_0 | 103,680 | 117,918 | 1.1373 |
08 Feb 2013 22:29:02 | 1219011 | 15595726 | hadcm3n_4au5_1940_40_008308822_0 | 77,760 | 88,372 | 1.1365 |
08 Feb 2013 13:55:25 | 1219011 | 15595726 | hadcm3n_4au5_1940_40_008308822_0 | 51,840 | 58,691 | 1.1322 |
08 Feb 2013 05:20:50 | 1219011 | 15595726 | hadcm3n_4au5_1940_40_008308822_0 | 25,920 | 29,137 | 1.1241 |
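The Average (sec/TS) column appears to be the cumulative CPU time divided by the cumulative timestep count, rounded to four decimal places. This is inferred from the figures in the table rather than from any CPDN documentation. A minimal sketch that checks the relationship against three rows copied from the table follows; `average_sec_per_timestep` is a hypothetical helper name.

```python
# Rows copied from the trickle table:
# (time sent, timestep, CPU time in seconds, reported average in sec/TS)
TRICKLES = [
    ("19 Feb 2013 19:42:43", 362_880, 428_853, 1.1818),
    ("14 Feb 2013 00:53:55", 285_120, 341_203, 1.1967),
    ("08 Feb 2013 05:20:50", 25_920, 29_137, 1.1241),
]


def average_sec_per_timestep(cpu_time_sec: int, timestep: int) -> float:
    """Cumulative CPU seconds per model timestep (assumed definition)."""
    return cpu_time_sec / timestep


for sent, timestep, cpu_time, reported in TRICKLES:
    computed = average_sec_per_timestep(cpu_time, timestep)
    # A difference below 5e-5 means the values agree after rounding to 4 places.
    agrees = abs(computed - reported) < 5e-5
    print(f"{sent}: computed {computed:.4f}, reported {reported:.4f}, agrees={agrees}")
```

The timestep column advances by 25,920 per trickle, so the last row received (362,880) corresponds to 14 trickles.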
©2024 cpdn.org