Name | hadam3p_pnw_33k3_1975_1_007369160_0 |
Workunit | 7566590 |
Created | 24 Jul 2011, 11:46:34 UTC |
Sent | 26 Jul 2011, 18:30:53 UTC |
Report deadline | 7 Jul 2012, 23:50:53 UTC |
Received | 29 Aug 2011, 12:39:52 UTC |
Server state | Over |
Outcome | Computation error |
Client state | Compute error |
Exit status | -177 (0xFFFFFF4F) ERR_RSC_LIMIT_EXCEEDED |
Computer ID | 833883 |
Run time | 7 days 18 hours 44 min 54 sec |
CPU time | 5 days 18 hours 57 min 34 sec |
Validate state | Invalid |
Credit | 1,754.30 |
Device peak FLOPS | 1.19 GFLOPS |
Application version | UK Met Office HadAM3P-HadRM3P Pacific North West v6.09 windows_intelx86 |
Stderr | <core_client_version>6.10.18</core_client_version> <![CDATA[ <message> Maximum elapsed time exceeded </message> <stderr_txt> CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=5816, selfPID=5816, iMonCtr=2 CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=4372, selfPID=4372, iMonCtr=2 CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... 09:23:51 (4388): No heartbeat from core client for 30 sec - exiting 09:23:52 (4388): No heartbeat from core client for 30 sec - exiting 09:23:53 (4388): No heartbeat from core client for 30 sec - exiting 09:23:54 (4388): No heartbeat from core client for 30 sec - exiting 09:23:55 (4388): No heartbeat from core client for 30 sec - exiting 09:23:56 (4388): No heartbeat from core client for 30 sec - exiting 09:23:57 (4388): No heartbeat from core client for 30 sec - exiting 09:23:58 (4388): No heartbeat from core client for 30 sec - exiting 09:23:59 (4388): No heartbeat from core client for 30 sec - exiting 09:24:01 (4388): No heartbeat from core client for 30 sec - exiting 09:24:02 (4388): No heartbeat from core client for 30 sec - exiting 09:24:03 (4388): No heartbeat from core client for 30 sec - exiting 09:24:04 (4388): No heartbeat from core client for 30 sec - exiting 09:24:05 (4388): No heartbeat from core client for 30 sec - exiting 09:24:06 (4388): No heartbeat from core client for 30 sec - exiting 09:24:07 (4388): No heartbeat from core client for 30 sec - exiting 09:24:08 (4388): No heartbeat from core client for 30 sec - exiting 09:24:09 (4388): No heartbeat from core client for 30 sec - exiting 09:24:10 (4388): No heartbeat from core client for 30 sec - exiting 09:24:12 (4388): No heartbeat from core client for 30 sec - exiting 09:24:13 (4388): No heartbeat from core client for 30 sec - exiting 09:24:14 (4388): No heartbeat from core client for 30 sec - exiting 09:24:15 (4388): No heartbeat from core client for 30 sec - exiting 09:24:16 (4388): No heartbeat from core client for 30 sec - exiting 09:24:17 (4388): No heartbeat from core client for 30 sec - exiting 09:24:18 (4388): No heartbeat from core client for 30 sec - exiting 09:24:19 (4388): No heartbeat from core client for 30 sec - exiting 09:24:20 (4388): No heartbeat from core client for 30 sec - exiting 09:24:21 (4388): No heartbeat from core client for 30 sec - exiting 09:24:22 (4388): No heartbeat from core client for 30 sec - exiting 09:24:24 (4388): No heartbeat from core client for 30 sec - exiting 09:24:25 (4388): No heartbeat from core client for 30 sec - exiting 09:24:26 (4388): No heartbeat from core client for 30 sec - exiting 09:24:27 (4388): No heartbeat from core client for 30 sec - exiting 09:24:28 (4388): No heartbeat from core client for 30 sec - exiting 09:24:29 (4388): No heartbeat from core client for 30 sec - exiting 09:24:30 (4388): No heartbeat from core client for 30 sec - exiting 09:24:31 (4388): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5712, iMonCtr=2 Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=6588, iMonCtr=2 Model crash detected, will try to restart... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=4152, selfPID=4152, iMonCtr=2 CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... GController:: CPDN process is not running, exiting, bRetVal = 1, checkPID=3836, selfPID=7128, iMonCtr=1 Model crash detected, will try to restart... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Abort request from BOINC... Regional yearly means requires 12 input files got 7 Called boinc_finish </stderr_txt> ]]> |
Latest Trickles Received | ||||||
---|---|---|---|---|---|---|
Time Sent (UTC) | Host ID | Result ID | Result Name | Timestep | CPU Time (sec) | Average (sec/TS) |
28 Aug 2011 21:44:12 | 833883 | 13156791 | hadam3p_pnw_33k3_1975_1_007369160_0 | 80,736 | 491,340 | 6.0858 |
25 Aug 2011 20:15:15 | 833883 | 13156791 | hadam3p_pnw_33k3_1975_1_007369160_0 | 69,216 | 426,432 | 6.1609 |
23 Aug 2011 14:18:06 | 833883 | 13156791 | hadam3p_pnw_33k3_1975_1_007369160_0 | 57,696 | 352,684 | 6.1128 |
20 Aug 2011 02:38:27 | 833883 | 13156791 | hadam3p_pnw_33k3_1975_1_007369160_0 | 46,176 | 283,085 | 6.1306 |
01 Aug 2011 02:00:55 | 833883 | 13156791 | hadam3p_pnw_33k3_1975_1_007369160_0 | 34,656 | 210,060 | 6.0613 |
30 Jul 2011 16:42:06 | 833883 | 13156791 | hadam3p_pnw_33k3_1975_1_007369160_0 | 23,141 | 139,767 | 6.0398 |
30 Jul 2011 15:30:04 | 833883 | 13156791 | hadam3p_pnw_33k3_1975_1_007369160_0 | 23,138 | 138,961 | 6.0057 |
30 Jul 2011 14:27:11 | 833883 | 13156791 | hadam3p_pnw_33k3_1975_1_007369160_0 | 23,136 | 138,238 | 5.9750 |
28 Jul 2011 19:59:34 | 833883 | 13156791 | hadam3p_pnw_33k3_1975_1_007369160_0 | 11,616 | 69,201 | 5.9574 |
©2024 climateprediction.net