Name | hadam3p_eu_6aw8_2007_1_008123926_0 |
Workunit | 8279040 |
Created | 9 Aug 2012, 18:40:04 UTC |
Sent | 9 Aug 2012, 18:41:41 UTC |
Report deadline | 23 Jul 2013, 0:01:41 UTC |
Received | 9 Sep 2012, 17:47:25 UTC |
Server state | Over |
Outcome | Computation error |
Client state | Compute error |
Exit status | -226 (0xFFFFFF1E) ERR_TOO_MANY_EXITS |
Computer ID | 1075496 |
Run time | 2 days 12 hours 26 min 56 sec |
CPU time | 2 days 5 hours 40 min 15 sec |
Validate state | Invalid |
Credit | 1,194.02 |
Device peak FLOPS | 2.79 GFLOPS |
Application version | UK Met Office HadAM3P-HadRM3P Europe v6.09 windows_intelx86 |
Stderr | <core_client_version>7.0.28</core_client_version> <![CDATA[ <message> too many exit(0)s </message> <stderr_txt> CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=1116, selfPID=1116, iMonCtr=2 CPDN Monitor - Quit request from BOINC... Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=1852, selfPID=1852, iMonCtr=2 CPDN Monitor - Quit request from BOINC... Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=4056, selfPID=4056, iMonCtr=2 CPDN Monitor - Quit request from BOINC... Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=2376, selfPID=2376, iMonCtr=2 CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=3952, selfPID=3952, iMonCtr=2 CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3884, iMonCtr=2 Model crash detected, will try to restart... Leaving CPDN_Main::Monitor... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=1532, selfPID=1532, iMonCtr=2 17:59:07 (3888): No heartbeat from core client for 30 sec - exiting 17:59:08 (3888): No heartbeat from core client for 30 sec - exiting 17:59:09 (3888): No heartbeat from core client for 30 sec - exiting 17:59:11 (3888): No heartbeat from core client for 30 sec - exiting 17:59:12 (3888): No heartbeat from core client for 30 sec - exiting 17:59:13 (3888): No heartbeat from core client for 30 sec - exiting 17:59:14 (3888): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=3020, selfPID=3020, iMonCtr=2 CPDN Monitor - Quit request from BOINC... Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=3928, selfPID=3928, iMonCtr=2 CPDN Monitor - Quit request from BOINC... Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=4012, selfPID=4012, iMonCtr=2 CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=4952, selfPID=4952, iMonCtr=2 CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=4856, selfPID=4856, iMonCtr=2 CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=2964, selfPID=1580, iMonCtr=1 Model crash detected, will try to restart... 17:43:41 (3680): No heartbeat from core client for 30 sec - exiting 17:43:42 (3680): No heartbeat from core client for 30 sec - exiting 17:43:43 (3680): No heartbeat from core client for 30 sec - exiting 17:43:45 (3680): No heartbeat from core client for 30 sec - exiting 17:43:46 (3680): No heartbeat from core client for 30 sec - exiting 17:43:47 (3680): No heartbeat from core client for 30 sec - exiting 17:43:48 (3680): No heartbeat from core client for 30 sec - exiting 17:43:49 (3680): No heartbeat from core client for 30 sec - exiting 17:43:50 (3680): No heartbeat from core client for 30 sec - exiting 17:43:51 (3680): No heartbeat from core client for 30 sec - exiting 17:43:52 (3680): No heartbeat from core client for 30 sec - exiting 17:43:53 (3680): No heartbeat from core client for 30 sec - exiting 17:43:54 (3680): No heartbeat from core client for 30 sec - exiting 17:43:55 (3680): No heartbeat from core client for 30 sec - exiting 17:43:56 (3680): No heartbeat from core client for 30 sec - exiting 17:43:57 (3680): No heartbeat from core client for 30 sec - exiting 17:43:58 (3680): No heartbeat from core client for 30 sec - exiting 17:43:59 (3680): No heartbeat from core client for 30 sec - exiting 17:44:00 (3680): No heartbeat from core client for 30 sec - exiting 17:44:01 (3680): No heartbeat from core client for 30 sec - exiting 17:44:02 (3680): No heartbeat from core client for 30 sec - exiting 17:44:03 (3680): No heartbeat from core client for 30 sec - exiting 17:44:04 (3680): No heartbeat from core client for 30 sec - exiting 17:44:05 (3680): No heartbeat from core client for 30 sec - exiting 17:44:06 (3680): No heartbeat from core client for 30 sec - exiting 17:44:07 (3680): No heartbeat from core client for 30 sec - exiting 17:44:08 (3680): No heartbeat from core client for 30 sec - exiting 17:44:09 (3680): No heartbeat from core client for 30 sec - exiting 17:44:10 (3680): No heartbeat from core client for 30 sec - exiting 17:44:11 (3680): No heartbeat from core client for 30 sec - exiting 17:44:12 (3680): No heartbeat from core client for 30 sec - exiting 17:44:13 (3680): No heartbeat from core client for 30 sec - exiting 17:44:14 (3680): No heartbeat from core client for 30 sec - exiting 17:44:15 (3680): No heartbeat from core client for 30 sec - exiting 17:44:16 (3680): No heartbeat from core client for 30 sec - exiting 17:44:17 (3680): No heartbeat from core client for 30 sec - exiting 17:44:18 (3680): No heartbeat from core client for 30 sec - exiting 17:44:19 (3680): No heartbeat from core client for 30 sec - exiting 17:44:20 (3680): No heartbeat from core client for 30 sec - exiting 17:44:21 (3680): No heartbeat from core client for 30 sec - exiting 17:44:22 (3680): No heartbeat from core client for 30 sec - exiting 17:44:23 (3680): No heartbeat from core client for 30 sec - exiting 17:44:24 (3680): No heartbeat from core client for 30 sec - exiting 17:44:25 (3680): No heartbeat from core client for 30 sec - exiting 17:44:26 (3680): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 19:15:26 (2180): No heartbeat from core client for 30 sec - exiting 19:15:27 (2180): No heartbeat from core client for 30 sec - exiting 19:15:28 (2180): No heartbeat from core client for 30 sec - exiting 19:15:29 (2180): No heartbeat from core client for 30 sec - exiting 19:15:30 (2180): No heartbeat from core client for 30 sec - exiting 19:15:31 (2180): No heartbeat from core client for 30 sec - exiting 19:15:32 (2180): No heartbeat from core client for 30 sec - exiting 19:15:33 (2180): No heartbeat from core client for 30 sec - exiting 19:15:34 (2180): No heartbeat from core client for 30 sec - exiting 19:15:35 (2180): No heartbeat from core client for 30 sec - exiting 19:15:36 (2180): No heartbeat from core client for 30 sec - exiting 19:15:37 (2180): No heartbeat from core client for 30 sec - exiting 19:15:38 (2180): No heartbeat from core client for 30 sec - exiting 19:15:39 (2180): No heartbeat from core client for 30 sec - exiting 19:15:40 (2180): No heartbeat from core client for 30 sec - exiting 19:15:41 (2180): No heartbeat from core client for 30 sec - exiting 19:15:42 (2180): No heartbeat from core client for 30 sec - exiting 19:15:43 (2180): No heartbeat from core client for 30 sec - exiting 19:15:44 (2180): No heartbeat from core client for 30 sec - exiting 19:15:45 (2180): No heartbeat from core client for 30 sec - exiting 19:15:46 (2180): No heartbeat from core client for 30 sec - exiting 19:15:47 (2180): No heartbeat from core client for 30 sec - exiting 19:15:48 (2180): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 19:15:49 (2180): No heartbeat from core client for 30 sec - exiting CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5040, iMonCtr=2 Model crash detected, will try to restart... Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4468, iMonCtr=2 </stderr_txt> ]]> |
Latest Trickles Received | ||||||
---|---|---|---|---|---|---|
Time Sent (UTC) | Host ID | Result ID | Result Name | Timestep | CPU Time (sec) | Average (sec/TS) |
09 Sep 2012 06:34:33 | 1075496 | 15082471 | hadam3p_eu_6aw8_2007_1_008123926_0 | 69,216 | 166,805 | 2.4099 |
08 Sep 2012 20:41:48 | 1075496 | 15082471 | hadam3p_eu_6aw8_2007_1_008123926_0 | 57,696 | 139,144 | 2.4117 |
02 Sep 2012 14:36:53 | 1075496 | 15082471 | hadam3p_eu_6aw8_2007_1_008123926_0 | 46,176 | 111,554 | 2.4158 |
16 Aug 2012 13:41:55 | 1075496 | 15082471 | hadam3p_eu_6aw8_2007_1_008123926_0 | 34,656 | 83,676 | 2.4145 |
13 Aug 2012 19:36:58 | 1075496 | 15082471 | hadam3p_eu_6aw8_2007_1_008123926_0 | 23,136 | 55,894 | 2.4159 |
10 Aug 2012 10:32:03 | 1075496 | 15082471 | hadam3p_eu_6aw8_2007_1_008123926_0 | 11,616 | 28,135 | 2.4221 |
©2024 climateprediction.net