Name | hadam3p_saf_2979_1960_1_007233653_0 |
Workunit | 7431893 |
Created | 29 Apr 2011, 4:50:32 UTC |
Sent | 1 May 2011, 21:30:16 UTC |
Report deadline | 13 Apr 2012, 2:50:16 UTC |
Received | 3 Jun 2011, 14:04:44 UTC |
Server state | Over |
Outcome | Computation error |
Client state | Compute error |
Exit status | -226 (0xFFFFFF1E) ERR_TOO_MANY_EXITS |
Computer ID | 1035012 |
Run time | 2 days 18 hours 59 min 12 sec |
CPU time | 2 days 16 hours 4 min 59 sec |
Validate state | Invalid |
Credit | 1,122.82 |
Device peak FLOPS | 2.39 GFLOPS |
Application version | UK Met Office HadAM3P-HadRM3P Southern Africa v6.09 windows_intelx86 |
Stderr | <core_client_version>6.10.18</core_client_version> <![CDATA[ <message> too many exit(0)s </message> <stderr_txt> CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=4000, selfPID=4000, iMonCtr=2 CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... 06:53:49 (5172): No heartbeat from core client for 30 sec - exiting 06:53:50 (5172): No heartbeat from core client for 30 sec - exiting 06:53:51 (5172): No heartbeat from core client for 30 sec - exiting 06:53:52 (5172): No heartbeat from core client for 30 sec - exiting 06:53:53 (5172): No heartbeat from core client for 30 sec - exiting 06:53:54 (5172): No heartbeat from core client for 30 sec - exiting 06:53:55 (5172): No heartbeat from core client for 30 sec - exiting 06:53:56 (5172): No heartbeat from core client for 30 sec - exiting 06:53:57 (5172): No heartbeat from core client for 30 sec - exiting 06:53:58 (5172): No heartbeat from core client for 30 sec - exiting 06:53:59 (5172): No heartbeat from core client for 30 sec - exiting 06:54:00 (5172): No heartbeat from core client for 30 sec - exiting 06:54:01 (5172): No heartbeat from core client for 30 sec - exiting 06:54:02 (5172): No heartbeat from core client for 30 sec - exiting 06:54:03 (5172): No heartbeat from core client for 30 sec - exiting 06:54:04 (5172): No heartbeat from core client for 30 sec - exiting 06:54:05 (5172): No heartbeat from core client for 30 sec - exiting 06:54:06 (5172): No heartbeat from core client for 30 sec - exiting 06:54:07 (5172): No heartbeat from core client for 30 sec - exiting 06:54:08 (5172): No heartbeat from core client for 30 sec - exiting 06:54:09 (5172): No heartbeat from core client for 30 sec - exiting 06:54:10 (5172): No heartbeat from core client for 30 sec - exiting 06:54:11 (5172): No heartbeat from core client for 30 sec - exiting 06:54:12 (5172): No heartbeat from core client for 30 sec - exiting 06:54:13 (5172): No heartbeat from core client for 30 sec - exiting 06:54:14 (5172): No heartbeat from core client for 30 sec - exiting 06:54:15 (5172): No heartbeat from core client for 30 sec - exiting 06:54:16 (5172): No heartbeat from core client for 30 sec - exiting 06:54:17 (5172): No heartbeat from core client for 30 sec - exiting 06:54:18 (5172): No heartbeat from core client for 30 sec - exiting 06:54:19 (5172): No heartbeat from core client for 30 sec - exiting 06:54:20 (5172): No heartbeat from core client for 30 sec - exiting 06:54:21 (5172): No heartbeat from core client for 30 sec - exiting 06:54:22 (5172): No heartbeat from core client for 30 sec - exiting 06:54:23 (5172): No heartbeat from core client for 30 sec - exiting 06:54:24 (5172): No heartbeat from core client for 30 sec - exiting 06:54:25 (5172): No heartbeat from core client for 30 sec - exiting 06:54:26 (5172): No heartbeat from core client for 30 sec - exiting 06:54:27 (5172): No heartbeat from core client for 30 sec - exiting 06:54:28 (5172): No heartbeat from core client for 30 sec - exiting 06:54:29 (5172): No heartbeat from core client for 30 sec - exiting 06:54:30 (5172): No heartbeat from core client for 30 sec - exiting 06:54:31 (5172): No heartbeat from core client for 30 sec - exiting 06:54:32 (5172): No heartbeat from core client for 30 sec - exiting 06:54:33 (5172): No heartbeat from core client for 30 sec - exiting 06:54:34 (5172): No heartbeat from core client for 30 sec - exiting 06:54:35 (5172): No heartbeat from core client for 30 sec - exiting 06:54:36 (5172): No heartbeat from core client for 30 sec - exiting 06:54:37 (5172): No heartbeat from core client for 30 sec - exiting 06:54:38 (5172): No heartbeat from core client for 30 sec - exiting 06:54:39 (5172): No heartbeat from core client for 30 sec - exiting 06:54:40 (5172): No heartbeat from core client for 30 sec - exiting 06:54:41 (5172): No heartbeat from core client for 30 sec - exiting 06:54:42 (5172): No heartbeat from core client for 30 sec - exiting 06:54:43 (5172): No heartbeat from core client for 30 sec - exiting 06:54:44 (5172): No heartbeat from core client for 30 sec - exiting 06:54:45 (5172): No heartbeat from core client for 30 sec - exiting 06:54:46 (5172): No heartbeat from core client for 30 sec - exiting 06:54:47 (5172): No heartbeat from core client for 30 sec - exiting 06:54:48 (5172): No heartbeat from core client for 30 sec - exiting 06:54:49 (5172): No heartbeat from core client for 30 sec - exiting 06:54:50 (5172): No heartbeat from core client for 30 sec - exiting 06:54:51 (5172): No heartbeat from core client for 30 sec - exiting 06:54:52 (5172): No heartbeat from core client for 30 sec - exiting 06:54:53 (5172): No heartbeat from core client for 30 sec - exiting 06:54:54 (5172): No heartbeat from core client for 30 sec - exiting 06:54:55 (5172): No heartbeat from core client for 30 sec - exiting 06:54:56 (5172): No heartbeat from core client for 30 sec - exiting 06:54:57 (5172): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=1644, selfPID=1644, iMonCtr=2 06:42:22 (3544): No heartbeat from core client for 30 sec - exiting 06:42:23 (3544): No heartbeat from core client for 30 sec - exiting 06:42:24 (3544): No heartbeat from core client for 30 sec - exiting 06:42:25 (3544): No heartbeat from core client for 30 sec - exiting 06:42:26 (3544): No heartbeat from core client for 30 sec - exiting 06:42:27 (3544): No heartbeat from core client for 30 sec - exiting 06:42:28 (3544): No heartbeat from core client for 30 sec - exiting 06:42:29 (3544): No heartbeat from core client for 30 sec - exiting 06:42:30 (3544): No heartbeat from core client for 30 sec - exiting 06:42:31 (3544): No heartbeat from core client for 30 sec - exiting 06:42:32 (3544): No heartbeat from core client for 30 sec - exiting 06:42:33 (3544): No heartbeat from core client for 30 sec - exiting 06:42:34 (3544): No heartbeat from core client for 30 sec - exiting 06:42:35 (3544): No heartbeat from core client for 30 sec - exiting 06:42:36 (3544): No heartbeat from core client for 30 sec - exiting 06:42:37 (3544): No heartbeat from core client for 30 sec - exiting 06:42:38 (3544): No heartbeat from core client for 30 sec - exiting 06:42:39 (3544): No heartbeat from core client for 30 sec - exiting 06:42:40 (3544): No heartbeat from core client for 30 sec - exiting 06:42:41 (3544): No heartbeat from core client for 30 sec - exiting 06:42:42 (3544): No heartbeat from core client for 30 sec - exiting 06:42:43 (3544): No heartbeat from core client for 30 sec - exiting 06:42:44 (3544): No heartbeat from core client for 30 sec - exiting 06:42:45 (3544): No heartbeat from core client for 30 sec - exiting 06:42:46 (3544): No heartbeat from core client for 30 sec - exiting 06:42:47 (3544): No heartbeat from core client for 30 sec - exiting 06:42:48 (3544): No heartbeat from core client for 30 sec - exiting 06:42:49 (3544): No heartbeat from core client for 30 sec - exiting 06:42:50 (3544): No heartbeat from core client for 30 sec - exiting 06:42:51 (3544): No heartbeat from core client for 30 sec - exiting 06:42:52 (3544): No heartbeat from core client for 30 sec - exiting 06:42:53 (3544): No heartbeat from core client for 30 sec - exiting 06:42:54 (3544): No heartbeat from core client for 30 sec - exiting 06:42:55 (3544): No heartbeat from core client for 30 sec - exiting 06:42:56 (3544): No heartbeat from core client for 30 sec - exiting 06:42:57 (3544): No heartbeat from core client for 30 sec - exiting 06:42:58 (3544): No heartbeat from core client for 30 sec - exiting 06:42:59 (3544): No heartbeat from core client for 30 sec - exiting 06:43:00 (3544): No heartbeat from core client for 30 sec - exiting 06:43:01 (3544): No heartbeat from core client for 30 sec - exiting 06:43:02 (3544): No heartbeat from core client for 30 sec - exiting 06:43:03 (3544): No heartbeat from core client for 30 sec - exiting 06:43:04 (3544): No heartbeat from core client for 30 sec - exiting 06:43:05 (3544): No heartbeat from core client for 30 sec - exiting 06:43:06 (3544): No heartbeat from core client for 30 sec - exiting 06:43:07 (3544): No heartbeat from core client for 30 sec - exiting 06:43:08 (3544): No heartbeat from core client for 30 sec - exiting 06:43:09 (3544): No heartbeat from core client for 30 sec - exiting 06:43:10 (3544): No heartbeat from core client for 30 sec - exiting 06:43:11 (3544): No heartbeat from core client for 30 sec - exiting 06:43:12 (3544): No heartbeat from core client for 30 sec - exiting 06:43:13 (3544): No heartbeat from core client for 30 sec - exiting 06:43:14 (3544): No heartbeat from core client for 30 sec - exiting 06:43:15 (3544): No heartbeat from core client for 30 sec - exiting 06:43:16 (3544): No heartbeat from core client for 30 sec - exiting 06:43:17 (3544): No heartbeat from core client for 30 sec - exiting 06:43:18 (3544): No heartbeat from core client for 30 sec - exiting 06:43:19 (3544): No heartbeat from core client for 30 sec - exiting 06:43:20 (3544): No heartbeat from core client for 30 sec - exiting 06:43:21 (3544): No heartbeat from core client for 30 sec - exiting 06:43:22 (3544): No heartbeat from core client for 30 sec - exiting 06:43:23 (3544): No heartbeat from core client for 30 sec - exiting 06:43:24 (3544): No heartbeat from core client for 30 sec - exiting 06:43:25 (3544): No heartbeat from core client for 30 sec - exiting 06:43:26 (3544): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=7292, selfPID=1500, iMonCtr=1 Model crash detected, will try to restart... 09:17:14 (4424): No heartbeat from core client for 30 sec - exiting 09:17:15 (4424): No heartbeat from core client for 30 sec - exiting 09:17:16 (4424): No heartbeat from core client for 30 sec - exiting 09:17:17 (4424): No heartbeat from core client for 30 sec - exiting 09:17:18 (4424): No heartbeat from core client for 30 sec - exiting 09:17:19 (4424): No heartbeat from core client for 30 sec - exiting 09:17:20 (4424): No heartbeat from core client for 30 sec - exiting 09:17:21 (4424): No heartbeat from core client for 30 sec - exiting 09:17:22 (4424): No heartbeat from core client for 30 sec - exiting 09:17:23 (4424): No heartbeat from core client for 30 sec - exiting 09:17:24 (4424): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=3848, selfPID=3848, iMonCtr=2 GCPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=2960, selfPID=2960, iMonCtr=2 </stderr_txt> ]]> |
Latest Trickles Received | ||||||
---|---|---|---|---|---|---|
Time Sent (UTC) | Host ID | Result ID | Result Name | Timestep | CPU Time (sec) | Average (sec/TS) |
01 Jun 2011 08:42:49 | 1035012 | 12844134 | hadam3p_saf_2979_1960_1_007233653_0 | 69,216 | 210,367 | 3.0393 |
27 May 2011 20:45:25 | 1035012 | 12844134 | hadam3p_saf_2979_1960_1_007233653_0 | 57,696 | 174,292 | 3.0209 |
27 May 2011 12:21:28 | 1035012 | 12844134 | hadam3p_saf_2979_1960_1_007233653_0 | 46,176 | 146,944 | 3.1823 |
26 May 2011 18:58:04 | 1035012 | 12844134 | hadam3p_saf_2979_1960_1_007233653_0 | 34,656 | 116,364 | 3.3577 |
06 May 2011 06:42:25 | 1035012 | 12844134 | hadam3p_saf_2979_1960_1_007233653_0 | 23,136 | 78,055 | 3.3737 |
04 May 2011 17:38:25 | 1035012 | 12844134 | hadam3p_saf_2979_1960_1_007233653_0 | 11,616 | 39,457 | 3.3968 |
©2024 cpdn.org