Name | hadcm3s_1re6_2000_2_008939784_1 |
Workunit | 9083959 |
Created | 8 Sep 2014, 18:23:46 UTC |
Sent | 8 Sep 2014, 18:44:52 UTC |
Report deadline | 9 Dec 2014, 2:12:03 UTC |
Received | 16 Oct 2014, 0:19:45 UTC |
Server state | Over |
Outcome | Computation error |
Client state | Compute error |
Exit status | 9 (0x00000009) Unknown error code |
Computer ID | 976228 |
Run time | 4 days 11 hours 11 min 32 sec |
CPU time | 3 days 10 hours 57 min 34 sec |
Validate state | Invalid |
Credit | 622.08 |
Device peak FLOPS | 2.83 GFLOPS |
Application version | UK Met Office HadCM3 short v7.24 i686-apple-darwin |
Stderr | <core_client_version>6.6.29</core_client_version> <![CDATA[ <message> process exited with code 9 (0x9, -247) </message> <stderr_txt> 01:00:59 (94774): No heartbeat from client for 30 sec - exiting 01:00:59 (94774): timer handler: client dead, exiting CPDN Monitor - No 'heartbeat' from BOINC... 01:07:40 (95494): No heartbeat from client for 30 sec - exiting 01:07:41 (95494): timer handler: client dead, exiting CPDN Monitor - No 'heartbeat' from BOINC... 01:14:00 (95576): No heartbeat from client for 30 sec - exiting 01:14:00 (95576): timer handler: client dead, exiting CPDN Monitor - No 'heartbeat' from BOINC... 01:19:41 (95631): No heartbeat from client for 30 sec - exiting 01:19:41 (95631): timer handler: client dead, exiting CPDN Monitor - No 'heartbeat' from BOINC... 01:25:49 (95708): No heartbeat from client for 30 sec - exiting 01:25:49 (95708): timer handler: client dead, exiting CPDN Monitor - No 'heartbeat' from BOINC... 01:31:51 (95762): No heartbeat from client for 30 sec - exiting 01:31:51 (95762): timer handler: client dead, exiting CPDN Monitor - No 'heartbeat' from BOINC... 01:37:58 (95847): No heartbeat from client for 30 sec - exiting 01:37:58 (95847): timer handler: client dead, exiting CPDN Monitor - No 'heartbeat' from BOINC... 01:44:04 (95906): No heartbeat from client for 30 sec - exiting 01:44:04 (95906): timer handler: client dead, exiting CPDN Monitor - No 'heartbeat' from BOINC... 02:01:31 (95983): No heartbeat from client for 30 sec - exiting 02:01:32 (95983): timer handler: client dead, exiting CPDN Monitor - No 'heartbeat' from BOINC... 02:07:13 (96120): No heartbeat from client for 30 sec - exiting 02:07:13 (96120): timer handler: client dead, exiting CPDN Monitor - No 'heartbeat' from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... 00:57:23 (96198): No heartbeat from client for 30 sec - exiting 00:57:24 (96198): timer handler: client dead, exiting CPDN Monitor - No 'heartbeat' from BOINC... 01:08:39 (9859): No heartbeat from client for 30 sec - exiting 01:08:39 (9859): timer handler: client dead, exiting CPDN Monitor - No 'heartbeat' from BOINC... 01:20:19 (9957): No heartbeat from client for 30 sec - exiting 01:20:19 (9957): timer handler: client dead, exiting CPDN Monitor - No 'heartbeat' from BOINC... 01:26:01 (10061): No heartbeat from client for 30 sec - exiting 01:26:01 (10061): timer handler: client dead, exiting CPDN Monitor - No 'heartbeat' from BOINC... 01:31:57 (10118): No heartbeat from client for 30 sec - exiting 01:31:57 (10118): timer handler: client dead, exiting CPDN Monitor - No 'heartbeat' from BOINC... 01:37:58 (10195): No heartbeat from client for 30 sec - exiting 01:37:59 (10195): timer handler: client dead, exiting CPDN Monitor - No 'heartbeat' from BOINC... 01:44:01 (10252): No heartbeat from client for 30 sec - exiting 01:44:01 (10252): timer handler: client dead, exiting CPDN Monitor - No 'heartbeat' from BOINC... 01:50:03 (10323): No heartbeat from client for 30 sec - exiting 01:50:03 (10323): timer handler: client dead, exiting CPDN Monitor - No 'heartbeat' from BOINC... 02:07:00 (10385): No heartbeat from client for 30 sec - exiting 02:07:00 (10385): timer handler: client dead, exiting CPDN Monitor - No 'heartbeat' from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... 00:02:34 (10527): No heartbeat from client for 30 sec - exiting 00:02:34 (10527): timer handler: client dead, exiting CPDN Monitor - No 'heartbeat' from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... SIGSEGV: segmentation violation Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... 01:04:26 (52203): No heartbeat from client for 30 sec - exiting 01:04:26 (52203): timer handler: client dead, exiting CPDN Monitor - No 'heartbeat' from BOINC... 01:15:56 (56933): No heartbeat from client for 30 sec - exiting 01:15:56 (56933): timer handler: client dead, exiting CPDN Monitor - No 'heartbeat' from BOINC... 01:22:02 (57033): No heartbeat from client for 30 sec - exiting 01:22:02 (57033): timer handler: client dead, exiting CPDN Monitor - No 'heartbeat' from BOINC... 01:27:56 (57101): No heartbeat from client for 30 sec - exiting 01:27:56 (57101): timer handler: client dead, exiting CPDN Monitor - No 'heartbeat' from BOINC... 01:33:54 (57163): No heartbeat from client for 30 sec - exiting 01:33:55 (57163): timer handler: client dead, exiting CPDN Monitor - No 'heartbeat' from BOINC... 01:39:55 (57232): No heartbeat from client for 30 sec - exiting 01:39:55 (57232): timer handler: client dead, exiting CPDN Monitor - No 'heartbeat' from BOINC... 01:45:57 (57295): No heartbeat from client for 30 sec - exiting 01:45:57 (57295): timer handler: client dead, exiting CPDN Monitor - No 'heartbeat' from BOINC... 01:51:56 (57362): No heartbeat from client for 30 sec - exiting 01:51:56 (57362): timer handler: client dead, exiting CPDN Monitor - No 'heartbeat' from BOINC... 01:57:42 (57428): No heartbeat from client for 30 sec - exiting 01:57:43 (57428): timer handler: client dead, exiting CPDN Monitor - No 'heartbeat' from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... 00:46:33 (57492): No heartbeat from client for 30 sec - exiting 00:46:33 (57492): timer handler: client dead, exiting CPDN Monitor - No 'heartbeat' from BOINC... 01:09:01 (68273): No heartbeat from client for 30 sec - exiting 01:09:01 (68273): timer handler: client dead, exiting CPDN Monitor - No 'heartbeat' from BOINC... 01:15:09 (68441): No heartbeat from client for 30 sec - exiting 01:15:09 (68441): timer handler: client dead, exiting CPDN Monitor - No 'heartbeat' from BOINC... 01:26:19 (68517): No heartbeat from client for 30 sec - exiting 01:26:19 (68517): timer handler: client dead, exiting CPDN Monitor - No 'heartbeat' from BOINC... 01:32:18 (68607): No heartbeat from client for 30 sec - exiting 01:32:18 (68607): timer handler: client dead, exiting CPDN Monitor - No 'heartbeat' from BOINC... 01:38:17 (68685): No heartbeat from client for 30 sec - exiting 01:38:17 (68685): timer handler: client dead, exiting CPDN Monitor - No 'heartbeat' from BOINC... 01:44:15 (68740): No heartbeat from client for 30 sec - exiting 01:44:16 (68740): timer handler: client dead, exiting CPDN Monitor - No 'heartbeat' from BOINC... 01:48:43 (68816): No heartbeat from client for 30 sec - exiting 01:48:43 (68816): timer handler: client dead, exiting CPDN Monitor - No 'heartbeat' from BOINC... 01:50:23 (68864): No heartbeat from client for 30 sec - exiting 01:50:23 (68864): timer handler: client dead, exiting CPDN Monitor - No 'heartbeat' from BOINC... 02:01:48 (68885): No heartbeat from client for 30 sec - exiting 02:01:48 (68885): timer handler: client dead, exiting CPDN Monitor - No 'heartbeat' from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... </stderr_txt> ]]> |
Latest Trickles Received | ||||||
---|---|---|---|---|---|---|
Time Sent (UTC) | Host ID | Result ID | Result Name | Timestep | CPU Time (sec) | Average (sec/TS) |
13 Oct 2014 07:54:31 | 976228 | 16980400 | hadcm3s_1re6_2000_2_008939784_1 | 51,840 | 149,490 | 2.8837 |
12 Oct 2014 06:31:14 | 976228 | 16980400 | hadcm3s_1re6_2000_2_008939784_1 | 25,920 | 74,778 | 2.8850 |
©2024 climateprediction.net