Name | hadam3p_saf_0ycp_1985_1_007453724_2 |
Workunit | 7651227 |
Created | 12 Sep 2011, 15:09:27 UTC |
Sent | 12 Sep 2011, 15:10:23 UTC |
Report deadline | 24 Aug 2012, 20:30:23 UTC |
Received | 11 Oct 2011, 11:21:46 UTC |
Server state | Over |
Outcome | Computation error |
Client state | Compute error |
Exit status | -226 (0xFFFFFF1E) ERR_TOO_MANY_EXITS |
Computer ID | 1157721 |
Run time | 2 days 16 hours 40 min 11 sec |
CPU time | 2 days 2 hours 4 min 38 sec |
Validate state | Invalid |
Credit | 1,122.82 |
Device peak FLOPS | 2.49 GFLOPS |
Application version | UK Met Office HadAM3P-HadRM3P Southern Africa v6.09 windows_intelx86 |
Stderr | <core_client_version>6.12.34</core_client_version> <![CDATA[ <message> too many exit(0)s </message> <stderr_txt> Suspended CPDN Monitor - Suspend request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=9904, selfPID=9904, iMonCtr=2 CPDN Monitor - Quit request from BOINC... Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=4432, selfPID=4432, iMonCtr=2 Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=6680, selfPID=8544, iMonCtr=1 Model crash detected, will try to restart... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=4732, selfPID=4732, iMonCtr=2 CPDN Monitor - Quit request from BOINC... Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5144, iMonCtr=2 Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=4860, selfPID=840, iMonCtr=1 Model crash detected, will try to restart... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=7460, selfPID=7460, iMonCtr=2 CPDN Monitor - Quit request from BOINC... Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=8076, selfPID=8076, iMonCtr=2 CPDN Monitor - Quit request from BOINC... Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=6036, selfPID=6036, iMonCtr=2 CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=8324, selfPID=8324, iMonCtr=2 10:28:29 (3816): No heartbeat from core client for 30 sec - exiting 10:28:30 (3816): No heartbeat from core client for 30 sec - exiting 10:28:31 (3816): No heartbeat from core client for 30 sec - exiting 10:28:32 (3816): No heartbeat from core client for 30 sec - exiting 10:28:33 (3816): No heartbeat from core client for 30 sec - exiting 10:28:34 (3816): No heartbeat from core client for 30 sec - exiting 10:28:35 (3816): No heartbeat from core client for 30 sec - exiting 10:28:36 (3816): No heartbeat from core client for 30 sec - exiting 10:28:37 (3816): No heartbeat from core client for 30 sec - exiting 10:28:38 (3816): No heartbeat from core client for 30 sec - exiting 10:28:39 (3816): No heartbeat from core client for 30 sec - exiting 10:28:40 (3816): No heartbeat from core client for 30 sec - exiting 10:28:41 (3816): No heartbeat from core client for 30 sec - exiting 10:28:42 (3816): No heartbeat from core client for 30 sec - exiting 10:28:43 (3816): No heartbeat from core client for 30 sec - exiting 10:28:44 (3816): No heartbeat from core client for 30 sec - exiting 10:28:45 (3816): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 10:28:46 (3816): No heartbeat from core client for 30 sec - exiting 10:28:47 (3816): No heartbeat from core client for 30 sec - exiting 10:28:48 (3816): No heartbeat from core client for 30 sec - exiting 10:28:49 (3816): No heartbeat from core client for 30 sec - exiting CPDN Monitor - Quit request from BOINC... Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=5784, selfPID=5784, iMonCtr=2 18:53:20 (8572): No heartbeat from core client for 30 sec - exiting 18:53:21 (8572): No heartbeat from core client for 30 sec - exiting 18:53:22 (8572): No heartbeat from core client for 30 sec - exiting 18:53:23 (8572): No heartbeat from core client for 30 sec - exiting 18:53:24 (8572): No heartbeat from core client for 30 sec - exiting 18:53:25 (8572): No heartbeat from core client for 30 sec - exiting 18:53:26 (8572): No heartbeat from core client for 30 sec - exiting 18:53:27 (8572): No heartbeat from core client for 30 sec - exiting 18:53:28 (8572): No heartbeat from core client for 30 sec - exiting 18:53:29 (8572): No heartbeat from core client for 30 sec - exiting 18:53:30 (8572): No heartbeat from core client for 30 sec - exiting 18:53:31 (8572): No heartbeat from core client for 30 sec - exiting 18:53:32 (8572): No heartbeat from core client for 30 sec - exiting 18:53:33 (8572): No heartbeat from core client for 30 sec - exiting 18:53:34 (8572): No heartbeat from core client for 30 sec - exiting 18:53:35 (8572): No heartbeat from core client for 30 sec - exiting 18:53:36 (8572): No heartbeat from core client for 30 sec - exiting 18:53:37 (8572): No heartbeat from core client for 30 sec - exiting 18:53:38 (8572): No heartbeat from core client for 30 sec - exiting 18:53:39 (8572): No heartbeat from core client for 30 sec - exiting 18:53:40 (8572): No heartbeat from core client for 30 sec - exiting 18:53:41 (8572): No heartbeat from core client for 30 sec - exiting 18:53:42 (8572): No heartbeat from core client for 30 sec - exiting 18:53:43 (8572): No heartbeat from core client for 30 sec - exiting 18:53:44 (8572): No heartbeat from core client for 30 sec - exiting 18:53:45 (8572): No heartbeat from core client for 30 sec - exiting 18:53:46 (8572): No heartbeat from core client for 30 sec - exiting 18:53:47 (8572): No heartbeat from core client for 30 sec - exiting 18:53:48 (8572): No heartbeat from core client for 30 sec - exiting 18:53:49 (8572): No heartbeat from core client for 30 sec - exiting 18:53:50 (8572): No heartbeat from core client for 30 sec - exiting 18:53:51 (8572): No heartbeat from core client for 30 sec - exiting 18:53:52 (8572): No heartbeat from core client for 30 sec - exiting 18:53:53 (8572): No heartbeat from core client for 30 sec - exiting 18:53:54 (8572): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 19:09:28 (6376): No heartbeat from core client for 30 sec - exiting 19:09:29 (6376): No heartbeat from core client for 30 sec - exiting 19:09:30 (6376): No heartbeat from core client for 30 sec - exiting 19:09:31 (6376): No heartbeat from core client for 30 sec - exiting 19:09:32 (6376): No heartbeat from core client for 30 sec - exiting 19:09:33 (6376): No heartbeat from core client for 30 sec - exiting 19:09:34 (6376): No heartbeat from core client for 30 sec - exiting 19:09:36 (6376): No heartbeat from core client for 30 sec - exiting 19:09:37 (6376): No heartbeat from core client for 30 sec - exiting 19:09:38 (6376): No heartbeat from core client for 30 sec - exiting 19:09:39 (6376): No heartbeat from core client for 30 sec - exiting 19:09:40 (6376): No heartbeat from core client for 30 sec - exiting 19:09:41 (6376): No heartbeat from core client for 30 sec - exiting 19:09:42 (6376): No heartbeat from core client for 30 sec - exiting 19:09:43 (6376): No heartbeat from core client for 30 sec - exiting 19:09:44 (6376): No heartbeat from core client for 30 sec - exiting 19:09:45 (6376): No heartbeat from core client for 30 sec - exiting 19:09:46 (6376): No heartbeat from core client for 30 sec - exiting 19:09:47 (6376): No heartbeat from core client for 30 sec - exiting 19:09:48 (6376): No heartbeat from core client for 30 sec - exiting 19:09:49 (6376): No heartbeat from core client for 30 sec - exiting 19:09:50 (6376): No heartbeat from core client for 30 sec - exiting 19:09:51 (6376): No heartbeat from core client for 30 sec - exiting 19:09:52 (6376): No heartbeat from core client for 30 sec - exiting 19:09:53 (6376): No heartbeat from core client for 30 sec - exiting 19:09:54 (6376): No heartbeat from core client for 30 sec - exiting 19:09:55 (6376): No heartbeat from core client for 30 sec - exiting 19:09:56 (6376): No heartbeat from core client for 30 sec - exiting 19:09:57 (6376): No heartbeat from core client for 30 sec - exiting 19:09:58 (6376): No heartbeat from core client for 30 sec - exiting 19:09:59 (6376): No heartbeat from core client for 30 sec - exiting 19:10:00 (6376): No heartbeat from core client for 30 sec - exiting 19:10:01 (6376): No heartbeat from core client for 30 sec - exiting 19:10:02 (6376): No heartbeat from core client for 30 sec - exiting 19:10:03 (6376): No heartbeat from core client for 30 sec - exiting 19:10:04 (6376): No heartbeat from core client for 30 sec - exiting 19:10:05 (6376): No heartbeat from core client for 30 sec - exiting 19:10:06 (6376): No heartbeat from core client for 30 sec - exiting 19:10:07 (6376): No heartbeat from core client for 30 sec - exiting 19:10:08 (6376): No heartbeat from core client for 30 sec - exiting 19:10:09 (6376): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... CPDN Monitor - Quit request from BOINC... Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=8972, selfPID=8972, iMonCtr=2 CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=8380, selfPID=8380, iMonCtr=2 CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=6424, selfPID=6424, iMonCtr=2 Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=4332, selfPID=3816, iMonCtr=1 Model crash detected, will try to restart... CPDN Monitor - Quit request from BOINC... Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=7944, selfPID=7944, iMonCtr=2 00:20:04 (5980): No heartbeat from core client for 30 sec - exiting 00:20:05 (5980): No heartbeat from core client for 30 sec - exiting 00:20:06 (5980): No heartbeat from core client for 30 sec - exiting 00:20:07 (5980): No heartbeat from core client for 30 sec - exiting 00:20:08 (5980): No heartbeat from core client for 30 sec - exiting 00:20:09 (5980): No heartbeat from core client for 30 sec - exiting 00:20:10 (5980): No heartbeat from core client for 30 sec - exiting 00:20:11 (5980): No heartbeat from core client for 30 sec - exiting 00:20:12 (5980): No heartbeat from core client for 30 sec - exiting 00:20:13 (5980): No heartbeat from core client for 30 sec - exiting 00:20:14 (5980): No heartbeat from core client for 30 sec - exiting 00:20:15 (5980): No heartbeat from core client for 30 sec - exiting 00:20:16 (5980): No heartbeat from core client for 30 sec - exiting 00:20:17 (5980): No heartbeat from core client for 30 sec - exiting 00:20:18 (5980): No heartbeat from core client for 30 sec - exiting 00:20:19 (5980): No heartbeat from core client for 30 sec - exiting 00:20:20 (5980): No heartbeat from core client for 30 sec - exiting 00:20:21 (5980): No heartbeat from core client for 30 sec - exiting 00:20:22 (5980): No heartbeat from core client for 30 sec - exiting 00:20:23 (5980): No heartbeat from core client for 30 sec - exiting 00:20:24 (5980): No heartbeat from core client for 30 sec - exiting 00:20:25 (5980): No heartbeat from core client for 30 sec - exiting 00:20:26 (5980): No heartbeat from core client for 30 sec - exiting 00:20:27 (5980): No heartbeat from core client for 30 sec - exiting 00:20:28 (5980): No heartbeat from core client for 30 sec - exiting 00:20:29 (5980): No heartbeat from core client for 30 sec - exiting 00:20:30 (5980): No heartbeat from core client for 30 sec - exiting 00:20:31 (5980): No heartbeat from core client for 30 sec - exiting 00:20:32 (5980): No heartbeat from core client for 30 sec - exiting 00:20:33 (5980): No heartbeat from core client for 30 sec - exiting 00:20:34 (5980): No heartbeat from core client for 30 sec - exiting 00:20:35 (5980): No heartbeat from core client for 30 sec - exiting 00:20:36 (5980): No heartbeat from core client for 30 sec - exiting 00:20:37 (5980): No heartbeat from core client for 30 sec - exiting 00:20:38 (5980): No heartbeat from core client for 30 sec - exiting 00:20:39 (5980): No heartbeat from core client for 30 sec - exiting 00:20:40 (5980): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=8324, selfPID=8324, iMonCtr=2 CPDN Monitor - Quit request from BOINC... 17:57:33 (9952): No heartbeat from core client for 30 sec - exiting 17:57:34 (9952): No heartbeat from core client for 30 sec - exiting 17:57:35 (9952): No heartbeat from core client for 30 sec - exiting 17:57:36 (9952): No heartbeat from core client for 30 sec - exiting 17:57:37 (9952): No heartbeat from core client for 30 sec - exiting 17:57:38 (9952): No heartbeat from core client for 30 sec - exiting 17:57:39 (9952): No heartbeat from core client for 30 sec - exiting 17:57:40 (9952): No heartbeat from core client for 30 sec - exiting 17:57:41 (9952): No heartbeat from core client for 30 sec - exiting 17:57:42 (9952): No heartbeat from core client for 30 sec - exiting 17:57:43 (9952): No heartbeat from core client for 30 sec - exiting 17:57:44 (9952): No heartbeat from core client for 30 sec - exiting 17:57:45 (9952): No heartbeat from core client for 30 sec - exiting 17:57:46 (9952): No heartbeat from core client for 30 sec - exiting 17:57:47 (9952): No heartbeat from core client for 30 sec - exiting 17:57:48 (9952): No heartbeat from core client for 30 sec - exiting 17:57:49 (9952): No heartbeat from core client for 30 sec - exiting 17:57:50 (9952): No heartbeat from core client for 30 sec - exiting 17:57:51 (9952): No heartbeat from core client for 30 sec - exiting 17:57:52 (9952): No heartbeat from core client for 30 sec - exiting 17:57:53 (9952): No heartbeat from core client for 30 sec - exiting 17:57:54 (9952): No heartbeat from core client for 30 sec - exiting 17:57:55 (9952): No heartbeat from core client for 30 sec - exiting 17:57:56 (9952): No heartbeat from core client for 30 sec - exiting 17:57:57 (9952): No heartbeat from core client for 30 sec - exiting 17:57:58 (9952): No heartbeat from core client for 30 sec - exiting 17:57:59 (9952): No heartbeat from core client for 30 sec - exiting 17:58:00 (9952): No heartbeat from core client for 30 sec - exiting 17:58:01 (9952): No heartbeat from core client for 30 sec - exiting 17:58:02 (9952): No heartbeat from core client for 30 sec - exiting 17:58:03 (9952): No heartbeat from core client for 30 sec - exiting 17:58:04 (9952): No heartbeat from core client for 30 sec - exiting 17:58:05 (9952): No heartbeat from core client for 30 sec - exiting 17:58:06 (9952): No heartbeat from core client for 30 sec - exiting 17:58:07 (9952): No heartbeat from core client for 30 sec - exiting 17:58:08 (9952): No heartbeat from core client for 30 sec - exiting 17:58:09 (9952): No heartbeat from core client for 30 sec - exiting 17:58:10 (9952): No heartbeat from core client for 30 sec - exiting 17:58:11 (9952): No heartbeat from core client for 30 sec - exiting 17:58:12 (9952): No heartbeat from core client for 30 sec - exiting 17:58:13 (9952): No heartbeat from core client for 30 sec - exiting 17:58:14 (9952): No heartbeat from core client for 30 sec - exiting 17:58:15 (9952): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 17:58:16 (9952): No heartbeat from core client for 30 sec - exiting 17:58:17 (9952): No heartbeat from core client for 30 sec - exiting CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... C16:51:19 (820): No heartbeat from core client for 30 sec - exiting 16:51:20 (820): No heartbeat from core client for 30 sec - exiting 16:51:21 (820): No heartbeat from core client for 30 sec - exiting 16:51:22 (820): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... CPDN Monitor - Quit request from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=7480, iMonCtr=2 Model crash detected, will try to restart... Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=10032, iMonCtr=2 Leaving CPDN_Main::Monitor... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=8132, selfPID=5072, iMonCtr=1 Model crash detected, will try to restart... </stderr_txt> ]]> |
Latest Trickles Received | ||||||
---|---|---|---|---|---|---|
Time Sent (UTC) | Host ID | Result ID | Result Name | Timestep | CPU Time (sec) | Average (sec/TS) |
10 Oct 2011 10:41:48 | 1157721 | 13379142 | hadam3p_saf_0ycp_1985_1_007453724_2 | 69,216 | 158,199 | 2.2856 |
09 Oct 2011 15:43:09 | 1157721 | 13379142 | hadam3p_saf_0ycp_1985_1_007453724_2 | 57,696 | 131,846 | 2.2852 |
08 Oct 2011 10:51:24 | 1157721 | 13379142 | hadam3p_saf_0ycp_1985_1_007453724_2 | 46,176 | 105,700 | 2.2891 |
06 Oct 2011 02:21:56 | 1157721 | 13379142 | hadam3p_saf_0ycp_1985_1_007453724_2 | 34,677 | 80,088 | 2.3095 |
05 Oct 2011 23:34:24 | 1157721 | 13379142 | hadam3p_saf_0ycp_1985_1_007453724_2 | 34,666 | 79,567 | 2.2952 |
05 Oct 2011 19:09:21 | 1157721 | 13379142 | hadam3p_saf_0ycp_1985_1_007453724_2 | 34,656 | 79,212 | 2.2857 |
01 Oct 2011 16:40:38 | 1157721 | 13379142 | hadam3p_saf_0ycp_1985_1_007453724_2 | 23,137 | 53,174 | 2.2982 |
01 Oct 2011 13:38:00 | 1157721 | 13379142 | hadam3p_saf_0ycp_1985_1_007453724_2 | 23,136 | 52,807 | 2.2825 |
30 Sep 2011 10:17:37 | 1157721 | 13379142 | hadam3p_saf_0ycp_1985_1_007453724_2 | 11,616 | 26,680 | 2.2968 |
©2024 cpdn.org