Name | hadam3p_anz_l1ku_2012_1_009300829_0 |
Workunit | 9385017 |
Created | 17 Dec 2014, 19:04:22 UTC |
Sent | 29 Dec 2014, 23:52:25 UTC |
Report deadline | 12 Dec 2015, 5:12:25 UTC |
Received | 7 Jan 2015, 21:02:37 UTC |
Server state | Over |
Outcome | Computation error |
Client state | Compute error |
Exit status | 0 (0x00000000) |
Computer ID | 1341143 |
Run time | 5 days 16 hours 55 min 37 sec |
CPU time | 5 days 9 hours 16 min 33 sec |
Validate state | Invalid |
Credit | 4,484.28 |
Device peak FLOPS | 3.45 GFLOPS |
Application version | UK Met Office HadAM3P-HadRM3P Australia New Zealand v6.10 windows_intelx86 |
Stderr | <core_client_version>7.4.36</core_client_version> <![CDATA[ <stderr_txt> CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=42376, selfPID=42376, iMonCtr=2 CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... 22:14:58 (58884): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 22:36:21 (45316): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 22:45:34 (56788): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 22:58:13 (62076): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 23:33:50 (12152): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 23:46:46 (1392): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 23:50:24 (46272): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=59276, selfPID=59276, iMonCtr=2 23:54:54 (62236): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 00:02:24 (59868): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 
00:05:00 (61960): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 00:21:05 (61380): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 00:29:58 (59056): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 00:32:37 (60784): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=56664, selfPID=56664, iMonCtr=2 00:38:52 (61616): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 00:44:19 (53472): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 01:08:45 (61672): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 01:11:20 (62412): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 01:15:15 (57780): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 01:35:15 (61452): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 01:41:49 (60912): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 02:33:47 (59976): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 02:44:49 (46484): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 02:46:50 (57332): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 02:50:57 (59444): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 02:56:16 (62364): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 
03:00:12 (61380): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 03:05:20 (53648): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 03:08:23 (58700): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 03:09:35 (57828): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 03:12:55 (61812): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=61164, selfPID=61164, iMonCtr=2 03:17:14 (26832): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 03:20:21 (60960): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 03:23:18 (61536): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 03:26:49 (60872): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 03:28:02 (62328): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 03:31:12 (55744): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 03:32:13 (62428): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=59912, selfPID=59912, iMonCtr=2 03:34:49 (62132): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 03:37:21 (61268): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 03:40:01 (60888): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 
03:42:14 (62260): No heartbeat from core client for 30 sec - exiting 03:42:15 (62260): No heartbeat from core client for 30 sec - exiting 03:42:16 (62260): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=14756, selfPID=14756, iMonCtr=2 03:42:17 (62260): No heartbeat from core client for 30 sec - exiting 03:44:15 (53488): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 03:45:09 (60100): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... Global Worker:: CPDN process is not running, exiting, bRetVal = 0, checkPID=0, selfPID=61812, iMonCtr=1 Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=5396, selfPID=5396, iMonCtr=2 Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=5396, selfPID=26832, iMonCtr=1 Model crash detected, will try to restart... Leaving CPDN_Main::Monitor... Called boinc_finish </stderr_txt> <message> upload failure: <file_xfer_error> <file_name>hadam3p_anz_l1ku_2012_1_009300829_0_10.zip</file_name> <error_code>-161 (not found)</error_code> </file_xfer_error> <file_xfer_error> <file_name>hadam3p_anz_l1ku_2012_1_009300829_0_11.zip</file_name> <error_code>-161 (not found)</error_code> </file_xfer_error> <file_xfer_error> <file_name>hadam3p_anz_l1ku_2012_1_009300829_0_12.zip</file_name> <error_code>-161 (not found)</error_code> </file_xfer_error> </message> ]]> |
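
The stderr output above is a long run of repeated monitor messages, so a tally by message type can be easier to read than the raw text. The following is a minimal sketch, not part of BOINC or CPDN: the function name `summarize_stderr`, the category labels, and the regular expression are illustrative assumptions based only on the message strings visible in this log. It counts messages and does not interpret their cause.

```python
import re
from collections import Counter

# Pattern for lines such as:
#   22:14:58 (58884): No heartbeat from core client for 30 sec - exiting
# The shape (HH:MM:SS, PID in parentheses) is inferred from the log above.
HEARTBEAT = re.compile(
    r"(\d{2}:\d{2}:\d{2}) \((\d+)\): "
    r"No heartbeat from core client for 30 sec - exiting"
)

def summarize_stderr(text):
    """Count the message types seen in a CPDN stderr dump (hypothetical helper)."""
    counts = Counter()
    counts["quit_request"] = text.count("CPDN Monitor - Quit request from BOINC")
    counts["no_heartbeat"] = len(HEARTBEAT.findall(text))
    counts["process_not_running"] = text.count("CPDN process is not running")
    counts["model_crash"] = text.count("Model crash detected")
    return counts

# Short excerpt copied from the stderr text above, used only to exercise the function.
sample = (
    "CPDN Monitor - Quit request from BOINC... "
    "22:14:58 (58884): No heartbeat from core client for 30 sec - exiting "
    "CPDN Monitor - No 'heartbeat' from BOINC... "
    "Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, "
    "checkPID=42376, selfPID=42376, iMonCtr=2 "
    "Model crash detected, will try to restart..."
)

summary = summarize_stderr(sample)
for kind, n in summary.items():
    print(f"{kind}: {n}")
```

Passing the full stderr text instead of `sample` would give the totals for this task.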

Latest Trickles Received

Time Sent (UTC) | Host ID | Result ID | Result Name | Timestep | CPU Time (sec) | Average (sec/TS) |
---|---|---|---|---|---|---|
08 Jan 2015 16:21:51 | 1341143 | 17587040 | hadam3p_anz_l1ku_2012_1_009300829_0 | 103,979 | 450,019 | 4.3280 |
08 Jan 2015 16:21:51 | 1341143 | 17587040 | hadam3p_anz_l1ku_2012_1_009300829_0 | 92,459 | 399,699 | 4.3230 |
08 Jan 2015 16:21:51 | 1341143 | 17587040 | hadam3p_anz_l1ku_2012_1_009300829_0 | 80,939 | 349,553 | 4.3187 |
05 Jan 2015 09:02:49 | 1341143 | 17587040 | hadam3p_anz_l1ku_2012_1_009300829_0 | 69,419 | 299,404 | 4.3130 |
04 Jan 2015 17:59:48 | 1341143 | 17587040 | hadam3p_anz_l1ku_2012_1_009300829_0 | 57,899 | 249,599 | 4.3109 |
04 Jan 2015 03:37:25 | 1341143 | 17587040 | hadam3p_anz_l1ku_2012_1_009300829_0 | 46,379 | 199,287 | 4.2969 |
03 Jan 2015 11:54:42 | 1341143 | 17587040 | hadam3p_anz_l1ku_2012_1_009300829_0 | 34,859 | 149,466 | 4.2877 |
02 Jan 2015 21:12:49 | 1341143 | 17587040 | hadam3p_anz_l1ku_2012_1_009300829_0 | 23,339 | 100,171 | 4.2920 |
02 Jan 2015 06:25:56 | 1341143 | 17587040 | hadam3p_anz_l1ku_2012_1_009300829_0 | 11,819 | 50,468 | 4.2701 |
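
The Average (sec/TS) column appears to be the cumulative CPU time divided by the cumulative timestep count, rounded to four decimal places. That relationship is an inference from the numbers in the table, not something the page states. A minimal sketch that checks it against the rows above (the helper name `average_sec_per_timestep` is hypothetical):

```python
# (timestep, cpu_time_sec, reported_average) copied from the trickle table above
trickles = [
    (103_979, 450_019, 4.3280),
    (92_459, 399_699, 4.3230),
    (80_939, 349_553, 4.3187),
    (69_419, 299_404, 4.3130),
    (57_899, 249_599, 4.3109),
    (46_379, 199_287, 4.2969),
    (34_859, 149_466, 4.2877),
    (23_339, 100_171, 4.2920),
    (11_819, 50_468, 4.2701),
]

def average_sec_per_timestep(cpu_time_sec, timestep):
    """Assumed formula: cumulative CPU seconds per cumulative model timestep."""
    return cpu_time_sec / timestep

# Compare the computed value with the reported one for each row.
for timestep, cpu_time_sec, reported in trickles:
    computed = average_sec_per_timestep(cpu_time_sec, timestep)
    print(f"{timestep:>7} {computed:.4f} (reported {reported:.4f})")
```

Under this assumption, the slow rise from 4.2701 to 4.3280 reflects the running average over the whole task, not the cost of the most recent timesteps alone.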
©2024 cpdn.org