Name | hadam3p_pnw_2yl6_1990_1_007370365_0 |
Workunit | 7567795 |
Created | 24 Jul 2011, 18:06:34 UTC |
Sent | 26 Jul 2011, 16:55:47 UTC |
Report deadline | 7 Jul 2012, 22:15:47 UTC |
Received | 1 Aug 2011, 16:43:41 UTC |
Server state | Over |
Outcome | Computation error |
Client state | Compute error |
Exit status | -177 (0xFFFFFF4F) ERR_RSC_LIMIT_EXCEEDED |
Computer ID | 1004263 |
Run time | 3 days 10 hours 22 min 15 sec |
CPU time | 3 days 3 hours 18 min 21 sec |
Validate state | Invalid |
Credit | 2,254.93 |
Device peak FLOPS | 2.69 GFLOPS |
Application version | UK Met Office HadAM3P-HadRM3P Pacific North West v6.09 windows_intelx86 |
Stderr | <core_client_version>6.12.33</core_client_version> <![CDATA[ <message> Maximum elapsed time exceeded </message> <stderr_txt> 18:56:52 (3448): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=7048, selfPID=7048, iMonCtr=2 CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=5208, selfPID=5208, iMonCtr=2 CPDN Monitor - Quit request from BOINC... Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=6192, selfPID=6192, iMonCtr=2 CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=2160, selfPID=2160, iMonCtr=2 CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=1136, selfPID=1136, iMonCtr=2 CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=2980, selfPID=2980, iMonCtr=2 CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=4188, selfPID=4188, iMonCtr=2 CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... 01:20:57 (2932): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 01:20:59 (2932): No heartbeat from core client for 30 sec - exiting 01:21:00 (2932): No heartbeat from core client for 30 sec - exiting 01:21:01 (2932): No heartbeat from core client for 30 sec - exiting 01:21:02 (2932): No heartbeat from core client for 30 sec - exiting 01:21:03 (2932): No heartbeat from core client for 30 sec - exiting 01:21:04 (2932): No heartbeat from core client for 30 sec - exiting 01:21:05 (2932): No heartbeat from core client for 30 sec - exiting 01:21:06 (2932): No heartbeat from core client for 30 sec - exiting 05:19:47 (4916): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... 14:16:44 (3848): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 18:16:55 (7080): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 21:15:19 (1416): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=4644, selfPID=4644, iMonCtr=2 21:15:35 (1416): No heartbeat from core client for 30 sec - exiting CPDN Monitor - Abort request from BOINC... Regional yearly means requires 12 input files got 9 Called boinc_finish </stderr_txt> ]]> |
Latest Trickles Received | ||||||
---|---|---|---|---|---|---|
Time Sent (UTC) | Host ID | Result ID | Result Name | Timestep | CPU Time (sec) | Average (sec/TS) |
01 Aug 2011 17:07:54 | 1004263 | 13158054 | hadam3p_pnw_2yl6_1990_1_007370365_0 | 103,776 | 261,191 | 2.5169 |
01 Aug 2011 17:07:54 | 1004263 | 13158054 | hadam3p_pnw_2yl6_1990_1_007370365_0 | 92,256 | 232,941 | 2.5249 |
01 Aug 2011 17:07:54 | 1004263 | 13158054 | hadam3p_pnw_2yl6_1990_1_007370365_0 | 80,736 | 204,691 | 2.5353 |
01 Aug 2011 17:07:54 | 1004263 | 13158054 | hadam3p_pnw_2yl6_1990_1_007370365_0 | 69,216 | 175,678 | 2.5381 |
01 Aug 2011 17:07:54 | 1004263 | 13158054 | hadam3p_pnw_2yl6_1990_1_007370365_0 | 57,696 | 146,289 | 2.5355 |
01 Aug 2011 17:07:54 | 1004263 | 13158054 | hadam3p_pnw_2yl6_1990_1_007370365_0 | 46,176 | 116,906 | 2.5317 |
01 Aug 2011 17:07:54 | 1004263 | 13158054 | hadam3p_pnw_2yl6_1990_1_007370365_0 | 34,656 | 87,505 | 2.5250 |
29 Jul 2011 05:27:01 | 1004263 | 13158054 | hadam3p_pnw_2yl6_1990_1_007370365_0 | 23,136 | 58,650 | 2.5350 |
28 Jul 2011 11:47:44 | 1004263 | 13158054 | hadam3p_pnw_2yl6_1990_1_007370365_0 | 11,616 | 29,637 | 2.5514 |
©2024 cpdn.org