Name | hadam3p_saf_1czf_1986_1_006944659_0 |
Workunit | 7147975 |
Created | 22 Nov 2010, 15:59:37 UTC |
Sent | 10 Mar 2011, 9:00:45 UTC |
Report deadline | 20 Feb 2012, 14:20:45 UTC |
Received | 20 May 2011, 12:27:20 UTC |
Server state | Over |
Outcome | Computation error |
Client state | Compute error |
Exit status | -226 (0xFFFFFF1E) ERR_TOO_MANY_EXITS |
Computer ID | 1110741 |
Run time | 2 days 22 hours 4 min 31 sec |
CPU time | 2 days 11 hours 12 min 20 sec |
Validate state | Invalid |
Credit | 1,870.33 |
Device peak FLOPS | 3.26 GFLOPS |
Application version | UK Met Office HadAM3P-HadRM3P Southern Africa v6.08 windows_intelx86 |
Stderr | <core_client_version>6.12.26</core_client_version> <![CDATA[ <message> too many exit(0)s </message> <stderr_txt> Suspended CPDN Monitor - Suspend request from BOINC... 10:17:15 (2364): No heartbeat from core client for 30 sec - exiting Suspended CPDN Monitor - No 'heartbeat' from BOINC... 10:17:30 (2364): No heartbeat from core client for 30 sec - exiting Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... CPDN Monitor - Quit request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Quit request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... 
17:13:10 (5252): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 17:13:13 (5252): No heartbeat from core client for 30 sec - exiting 17:13:16 (5252): No heartbeat from core client for 30 sec - exiting 17:13:17 (5252): No heartbeat from core client for 30 sec - exiting 17:13:18 (5252): No heartbeat from core client for 30 sec - exiting 17:13:19 (5252): No heartbeat from core client for 30 sec - exiting 17:13:20 (5252): No heartbeat from core client for 30 sec - exiting 17:13:21 (5252): No heartbeat from core client for 30 sec - exiting 17:13:22 (5252): No heartbeat from core client for 30 sec - exiting 17:13:23 (5252): No heartbeat from core client for 30 sec - exiting 17:13:24 (5252): No heartbeat from core client for 30 sec - exiting 17:13:25 (5252): No heartbeat from core client for 30 sec - exiting 17:13:26 (5252): No heartbeat from core client for 30 sec - exiting 17:13:27 (5252): No heartbeat from core client for 30 sec - exiting 17:13:38 (18920): Can't acquire lockfile (32) - waiting 35s 17:14:46 (18920): No heartbeat from core client for 30 sec - exiting 17:14:47 (18920): No heartbeat from core client for 30 sec - exiting 17:14:48 (18920): No heartbeat from core client for 30 sec - exiting 17:14:49 (18920): No heartbeat from core client for 30 sec - exiting 17:14:50 (18920): No heartbeat from core client for 30 sec - exiting 17:14:51 (18920): No heartbeat from core client for 30 sec - exiting 17:14:52 (18920): No heartbeat from core client for 30 sec - exiting 17:14:53 (18920): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 
17:17:38 (7108): No heartbeat from core client for 30 sec - exiting 17:18:10 (7108): No heartbeat from core client for 30 sec - exiting 17:18:11 (7108): No heartbeat from core client for 30 sec - exiting 17:18:12 (7108): No heartbeat from core client for 30 sec - exiting 17:18:13 (7108): No heartbeat from core client for 30 sec - exiting 17:18:14 (7108): No heartbeat from core client for 30 sec - exiting 17:18:15 (7108): No heartbeat from core client for 30 sec - exiting 17:18:16 (7108): No heartbeat from core client for 30 sec - exiting 17:18:48 (7108): No heartbeat from core client for 30 sec - exiting 17:18:49 (7108): No heartbeat from core client for 30 sec - exiting 17:18:50 (7108): No heartbeat from core client for 30 sec - exiting 17:18:51 (7108): No heartbeat from core client for 30 sec - exiting 17:21:09 (7108): No heartbeat from core client for 30 sec - exiting 17:21:10 (7108): No heartbeat from core client for 30 sec - exiting 17:21:11 (7108): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 
17:36:14 (10012): Can't acquire lockfile (32) - waiting 35s 18:02:11 (5820): No heartbeat from core client for 30 sec - exiting 18:02:12 (5820): No heartbeat from core client for 30 sec - exiting 18:02:13 (5820): No heartbeat from core client for 30 sec - exiting 18:02:14 (5820): No heartbeat from core client for 30 sec - exiting 18:02:15 (5820): No heartbeat from core client for 30 sec - exiting 18:02:16 (5820): No heartbeat from core client for 30 sec - exiting 18:02:17 (5820): No heartbeat from core client for 30 sec - exiting 18:02:18 (5820): No heartbeat from core client for 30 sec - exiting 18:02:19 (5820): No heartbeat from core client for 30 sec - exiting 18:02:20 (5820): No heartbeat from core client for 30 sec - exiting 18:02:21 (5820): No heartbeat from core client for 30 sec - exiting 18:02:22 (5820): No heartbeat from core client for 30 sec - exiting 18:02:23 (5820): No heartbeat from core client for 30 sec - exiting 18:02:24 (5820): No heartbeat from core client for 30 sec - exiting 18:02:25 (5820): No heartbeat from core client for 30 sec - exiting 18:02:26 (5820): No heartbeat from core client for 30 sec - exiting 18:02:27 (5820): No heartbeat from core client for 30 sec - exiting 18:02:28 (5820): No heartbeat from core client for 30 sec - exiting 18:02:29 (5820): No heartbeat from core client for 30 sec - exiting 18:02:30 (5820): No heartbeat from core client for 30 sec - exiting 18:02:31 (5820): No heartbeat from core client for 30 sec - exiting 18:02:32 (5820): No heartbeat from core client for 30 sec - exiting 18:02:33 (5820): No heartbeat from core client for 30 sec - exiting 18:02:34 (5820): No heartbeat from core client for 30 sec - exiting 18:02:35 (5820): No heartbeat from core client for 30 sec - exiting 18:02:36 (5820): No heartbeat from core client for 30 sec - exiting 18:02:37 (5820): No heartbeat from core client for 30 sec - exiting 18:02:38 (5820): No heartbeat from core client for 30 sec - exiting 18:02:39 (5820): No heartbeat from 
core client for 30 sec - exiting 18:02:40 (5820): No heartbeat from core client for 30 sec - exiting 18:02:41 (5820): No heartbeat from core client for 30 sec - exiting 18:02:42 (5820): No heartbeat from core client for 30 sec - exiting 18:02:43 (5820): No heartbeat from core client for 30 sec - exiting 18:02:44 (5820): No heartbeat from core client for 30 sec - exiting 18:02:45 (5820): No heartbeat from core client for 30 sec - exiting 18:02:46 (5820): No heartbeat from core client for 30 sec - exiting 18:02:47 (5820): No heartbeat from core client for 30 sec - exiting 18:02:48 (5820): No heartbeat from core client for 30 sec - exiting 18:02:49 (5820): No heartbeat from core client for 30 sec - exiting 18:02:50 (5820): No heartbeat from core client for 30 sec - exiting 18:02:51 (5820): No heartbeat from core client for 30 sec - exiting 18:02:52 (5820): No heartbeat from core client for 30 sec - exiting 18:02:53 (5820): No heartbeat from core client for 30 sec - exiting 18:02:54 (5820): No heartbeat from core client for 30 sec - exiting 18:02:55 (5820): No heartbeat from core client for 30 sec - exiting 18:02:56 (5820): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Quit request from BOINC... 00:48:28 (10904): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... 02:19:10 (3192): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... 08:11:47 (11664): No heartbeat from core client for 30 sec - exiting Suspended CPDN Monitor - No 'heartbeat' from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... 
20:46:53 (7128): No heartbeat from core client for 30 sec - exiting Suspended CPDN Monitor - No 'heartbeat' from BOINC... 20:46:58 (7128): No heartbeat from core client for 30 sec - exiting 20:46:59 (7128): No heartbeat from core client for 30 sec - exiting 20:47:00 (7128): No heartbeat from core client for 30 sec - exiting 20:47:01 (7128): No heartbeat from core client for 30 sec - exiting Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... CPDN Monitor - Quit request from BOINC... 
15:03:00 (12848): No heartbeat from core client for 30 sec - exiting 15:03:01 (12848): No heartbeat from core client for 30 sec - exiting 15:03:02 (12848): No heartbeat from core client for 30 sec - exiting 15:03:03 (12848): No heartbeat from core client for 30 sec - exiting 15:03:04 (12848): No heartbeat from core client for 30 sec - exiting 15:03:05 (12848): No heartbeat from core client for 30 sec - exiting 15:03:06 (12848): No heartbeat from core client for 30 sec - exiting 15:03:07 (12848): No heartbeat from core client for 30 sec - exiting 15:03:08 (12848): No heartbeat from core client for 30 sec - exiting 15:03:09 (12848): No heartbeat from core client for 30 sec - exiting 15:03:10 (12848): No heartbeat from core client for 30 sec - exiting 15:03:11 (12848): No heartbeat from core client for 30 sec - exiting 15:03:12 (12848): No heartbeat from core client for 30 sec - exiting 15:03:13 (12848): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... CPDN Monitor - Quit request from BOINC... 18:22:25 (10260): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... 02:16:24 (7260): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... 
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5188, iMonCtr=2 Model crash detected, will try to restart... </stderr_txt> ]]> |
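
The Exit status row above pairs a signed value with a hexadecimal one. A minimal sketch of how the two forms relate, assuming the status is stored as a 32-bit two's-complement integer (the helper name `to_unsigned32_hex` is hypothetical and not part of BOINC):

```python
def to_unsigned32_hex(code: int) -> str:
    """Render a signed exit status as an 8-digit unsigned 32-bit hex string.

    Assumption: the status is a 32-bit two's-complement value, so masking
    with 0xFFFFFFFF yields the unsigned form shown in parentheses.
    """
    return f"0x{code & 0xFFFFFFFF:08X}"


if __name__ == "__main__":
    # -226 is the ERR_TOO_MANY_EXITS status reported for this task.
    print(to_unsigned32_hex(-226))  # prints 0xFFFFFF1E
```

The mask is what turns the negative Python integer into the unsigned value; without it, formatting `-226` as hex would give `-E2`.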
Latest Trickles Received

Time Sent (UTC) | Host ID | Result ID | Result Name | Timestep | CPU Time (sec) | Average (sec/TS) |
---|---|---|---|---|---|---|
19 May 2011 12:26:43 | 1110741 | 12225146 | hadam3p_saf_1czf_1986_1_006944659_0 | 115,296 | 213,415 | 1.8510 |
17 May 2011 21:26:29 | 1110741 | 12225146 | hadam3p_saf_1czf_1986_1_006944659_0 | 103,776 | 191,958 | 1.8497 |
17 May 2011 09:19:55 | 1110741 | 12225146 | hadam3p_saf_1czf_1986_1_006944659_0 | 92,256 | 171,282 | 1.8566 |
16 May 2011 20:53:50 | 1110741 | 12225146 | hadam3p_saf_1czf_1986_1_006944659_0 | 80,736 | 151,057 | 1.8710 |
15 May 2011 22:06:41 | 1110741 | 12225146 | hadam3p_saf_1czf_1986_1_006944659_0 | 69,216 | 129,233 | 1.8671 |
10 May 2011 13:09:45 | 1110741 | 12225146 | hadam3p_saf_1czf_1986_1_006944659_0 | 57,696 | 108,919 | 1.8878 |
01 May 2011 14:59:32 | 1110741 | 12225146 | hadam3p_saf_1czf_1986_1_006944659_0 | 46,176 | 90,533 | 1.9606 |
01 May 2011 00:04:25 | 1110741 | 12225146 | hadam3p_saf_1czf_1986_1_006944659_0 | 34,656 | 68,743 | 1.9836 |
30 Apr 2011 15:23:52 | 1110741 | 12225146 | hadam3p_saf_1czf_1986_1_006944659_0 | 23,136 | 46,301 | 2.0013 |
27 Apr 2011 10:04:53 | 1110741 | 12225146 | hadam3p_saf_1czf_1986_1_006944659_0 | 11,616 | 23,405 | 2.0149 |
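
The Average (sec/TS) column appears to be the cumulative CPU time divided by the timestep count. That relation is inferred from the column label and the figures, not stated by the page. A small sketch that recomputes the column from the rows above (the function name and the tuple layout are illustrative assumptions):

```python
# (timestep, cpu_time_sec, reported_average) copied from the trickle table.
TRICKLES = [
    (115296, 213415, 1.8510),
    (103776, 191958, 1.8497),
    (92256, 171282, 1.8566),
    (80736, 151057, 1.8710),
    (69216, 129233, 1.8671),
    (57696, 108919, 1.8878),
    (46176, 90533, 1.9606),
    (34656, 68743, 1.9836),
    (23136, 46301, 2.0013),
    (11616, 23405, 2.0149),
]


def average_sec_per_timestep(cpu_time_sec: int, timestep: int) -> float:
    """Cumulative CPU seconds per model timestep, rounded to 4 decimals."""
    return round(cpu_time_sec / timestep, 4)


if __name__ == "__main__":
    for timestep, cpu_sec, reported in TRICKLES:
        computed = average_sec_per_timestep(cpu_sec, timestep)
        # Flag any row whose recomputed average differs from the page.
        status = "ok" if abs(computed - reported) < 1e-4 else "MISMATCH"
        print(f"{timestep:>7} {cpu_sec:>7} {computed:.4f} {reported:.4f} {status}")
```

Because the CPU time is cumulative, the average falls from about 2.01 to about 1.85 sec/TS over the run, which means the later timesteps were computed faster than the early ones.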
©2024 climateprediction.net