Name | hadam3p_anz_n7d4_2012_1_008597160_1 |
Workunit | 8743672 |
Created | 1 Apr 2014, 3:58:12 UTC |
Sent | 1 Apr 2014, 4:06:45 UTC |
Report deadline | 14 Mar 2015, 9:26:45 UTC |
Received | 9 Apr 2014, 15:36:18 UTC |
Server state | Over |
Outcome | Computation error |
Client state | Compute error |
Exit status | 0 (0x00000000) |
Computer ID | 1211747 |
Run time | 3 days 20 hours 2 min 14 sec |
CPU time | 3 days 9 hours 50 min 15 sec |
Validate state | Invalid |
Credit | 4,981.10 |
Device peak FLOPS | 3.91 GFLOPS |
Application version | UK Met Office HadAM3P-HadRM3P Australia New Zealand v6.10 windows_intelx86 |
Stderr | <core_client_version>7.2.42</core_client_version> <![CDATA[ <stderr_txt> 18:28:19 (1400): Can't acquire lockfile (32) - waiting 35s 18:28:41 (5060): No heartbeat from core client for 30 sec - exiting 18:28:42 (5060): No heartbeat from core client for 30 sec - exiting 18:28:43 (5060): No heartbeat from core client for 30 sec - exiting 18:28:44 (5060): No heartbeat from core client for 30 sec - exiting 18:28:45 (5060): No heartbeat from core client for 30 sec - exiting 18:28:46 (5060): No heartbeat from core client for 30 sec - exiting 18:28:47 (5060): No heartbeat from core client for 30 sec - exiting 18:28:48 (5060): No heartbeat from core client for 30 sec - exiting 18:28:49 (5060): No heartbeat from core client for 30 sec - exiting 18:28:50 (5060): No heartbeat from core client for 30 sec - exiting 18:28:51 (5060): No heartbeat from core client for 30 sec - exiting 18:28:52 (5060): No heartbeat from core client for 30 sec - exiting 18:28:53 (5060): No heartbeat from core client for 30 sec - exiting 18:28:54 (5060): No heartbeat from core client for 30 sec - exiting 18:28:55 (5060): No heartbeat from core client for 30 sec - exiting 18:28:56 (5060): No heartbeat from core client for 30 sec - exiting 18:28:57 (5060): No heartbeat from core client for 30 sec - exiting 18:28:58 (5060): No heartbeat from core client for 30 sec - exiting 18:28:59 (5060): No heartbeat from core client for 30 sec - exiting 18:29:00 (5060): No heartbeat from core client for 30 sec - exiting 18:29:01 (5060): No heartbeat from core client for 30 sec - exiting 18:29:02 (5060): No heartbeat from core client for 30 sec - exiting 18:29:03 (5060): No heartbeat from core client for 30 sec - exiting 18:29:04 (5060): No heartbeat from core client for 30 sec - exiting 18:29:05 (5060): No heartbeat from core client for 30 sec - exiting 18:29:06 (5060): No heartbeat from core client for 30 sec - exiting 18:29:07 (5060): No heartbeat from core client for 30 sec - exiting 18:29:08 (5060): No heartbeat from core client for 30 sec - exiting 18:29:09 (5060): No heartbeat from core client for 30 sec - exiting 18:29:10 (5060): No heartbeat from core client for 30 sec - exiting 18:29:11 (5060): No heartbeat from core client for 30 sec - exiting 18:29:12 (5060): No heartbeat from core client for 30 sec - exiting 18:29:13 (5060): No heartbeat from core client for 30 sec - exiting 18:29:14 (5060): No heartbeat from core client for 30 sec - exiting 18:29:15 (5060): No heartbeat from core client for 30 sec - exiting 18:29:16 (5060): No heartbeat from core client for 30 sec - exiting 18:29:17 (5060): No heartbeat from core client for 30 sec - exiting 18:29:18 (5060): No heartbeat from core client for 30 sec - exiting 18:29:19 (5060): No heartbeat from core client for 30 sec - exiting 18:29:20 (5060): No heartbeat from core client for 30 sec - exiting 18:29:21 (5060): No heartbeat from core client for 30 sec - exiting 18:29:22 (5060): No heartbeat from core client for 30 sec - exiting 18:29:23 (5060): No heartbeat from core client for 30 sec - exiting 18:29:24 (5060): No heartbeat from core client for 30 sec - exiting 18:29:25 (5060): No heartbeat from core client for 30 sec - exiting 18:29:26 (5060): No heartbeat from core client for 30 sec - exiting 18:29:27 (5060): No heartbeat from core client for 30 sec - exiting 18:29:28 (5060): No heartbeat from core client for 30 sec - exiting 18:29:29 (5060): No heartbeat from core client for 30 sec - exiting 18:29:30 (5060): No heartbeat from core client for 30 sec - exiting 18:29:31 (5060): No heartbeat from core client for 30 sec - exiting 18:29:32 (5060): No heartbeat from core client for 30 sec - exiting 18:29:33 (5060): No heartbeat from core client for 30 sec - exiting 18:29:34 (5060): No heartbeat from core client for 30 sec - exiting 18:29:35 (5060): No heartbeat from core client for 30 sec - exiting 18:29:36 (5060): No heartbeat from core client for 30 sec - exiting 18:29:37 (5060): No heartbeat from core client for 30 sec - exiting 18:29:38 (5060): No heartbeat from core client for 30 sec - exiting 18:29:39 (5060): No heartbeat from core client for 30 sec - exiting 18:29:40 (5060): No heartbeat from core client for 30 sec - exiting 18:29:40 (7028): Can't set up shared mem: -1. Will run in standalone mode. 18:29:41 (5060): No heartbeat from core client for 30 sec - exiting 18:29:42 (5060): No heartbeat from core client for 30 sec - exiting 18:29:42 (7048): Can't set up shared mem: -1. Will run in standalone mode. 18:29:43 (5060): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=5456, selfPID=5456, iMonCtr=2 CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=3488, selfPID=3488, iMonCtr=2 CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=6676, selfPID=6676, iMonCtr=2 CPDN Monitor - Quit request from BOINC... Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=6456, selfPID=6456, iMonCtr=2 CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=2284, selfPID=2284, iMonCtr=2 CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=7420, selfPID=7420, iMonCtr=2 CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... 17:36:56 (5780): No heartbeat from core client for 30 sec - exiting 17:36:57 (5780): No heartbeat from core client for 30 sec - exiting 17:36:58 (5780): No heartbeat from core client for 30 sec - exiting 17:36:59 (5780): No heartbeat from core client for 30 sec - exiting 17:37:00 (5780): No heartbeat from core client for 30 sec - exiting 17:37:01 (5780): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=2936, selfPID=2936, iMonCtr=2 CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=4836, selfPID=4836, iMonCtr=2 CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=6300, selfPID=6300, iMonCtr=2 CPDN Monitor - Quit request from BOINC... 23:05:01 (5784): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 23:16:23 (6000): Can't acquire lockfile (32) - waiting 35s 23:16:50 (460): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... Global Worker:: CPDN process is not running, exiting, bRetVal = 0, checkPID=0, selfPID=6152, iMonCtr=1 Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=0, iMonCtr=2 Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=6732, selfPID=5848, iMonCtr=1 Model crash detected, will try to restart... Leaving CPDN_Main::Monitor... Called boinc_finish </stderr_txt> <message> upload failure: <file_xfer_error> <file_name>hadam3p_anz_n7d4_2012_1_008597160_1_11.zip</file_name> <error_code>-161 (not found)</error_code> </file_xfer_error> <file_xfer_error> <file_name>hadam3p_anz_n7d4_2012_1_008597160_1_12.zip</file_name> <error_code>-161 (not found)</error_code> </file_xfer_error> </message> ]]> |
Latest Trickles Received | ||||||
---|---|---|---|---|---|---|
Time Sent (UTC) | Host ID | Result ID | Result Name | Timestep | CPU Time (sec) | Average (sec/TS) |
09 Apr 2014 01:58:26 | 1211747 | 16430521 | hadam3p_anz_n7d4_2012_1_008597160_1 | 115,499 | 284,926 | 2.4669 |
08 Apr 2014 16:09:32 | 1211747 | 16430521 | hadam3p_anz_n7d4_2012_1_008597160_1 | 103,979 | 256,542 | 2.4672 |
07 Apr 2014 02:59:38 | 1211747 | 16430521 | hadam3p_anz_n7d4_2012_1_008597160_1 | 92,459 | 227,627 | 2.4619 |
06 Apr 2014 08:19:10 | 1211747 | 16430521 | hadam3p_anz_n7d4_2012_1_008597160_1 | 80,939 | 199,250 | 2.4617 |
05 Apr 2014 17:45:40 | 1211747 | 16430521 | hadam3p_anz_n7d4_2012_1_008597160_1 | 69,419 | 171,437 | 2.4696 |
05 Apr 2014 05:19:30 | 1211747 | 16430521 | hadam3p_anz_n7d4_2012_1_008597160_1 | 57,899 | 143,582 | 2.4799 |
04 Apr 2014 15:23:27 | 1211747 | 16430521 | hadam3p_anz_n7d4_2012_1_008597160_1 | 46,379 | 115,453 | 2.4893 |
03 Apr 2014 22:02:42 | 1211747 | 16430521 | hadam3p_anz_n7d4_2012_1_008597160_1 | 34,859 | 87,046 | 2.4971 |
02 Apr 2014 22:46:26 | 1211747 | 16430521 | hadam3p_anz_n7d4_2012_1_008597160_1 | 23,339 | 58,450 | 2.5044 |
02 Apr 2014 02:30:51 | 1211747 | 16430521 | hadam3p_anz_n7d4_2012_1_008597160_1 | 11,819 | 30,114 | 2.5479 |
©2025 cpdn.org