Name | hadam3p_anz_m1bu_2012_1_009304032_0 |
Workunit | 9388220 |
Created | 17 Dec 2014, 19:29:10 UTC |
Sent | 23 Dec 2014, 19:01:32 UTC |
Report deadline | 6 Dec 2015, 0:21:32 UTC |
Received | 13 Jan 2015, 16:45:29 UTC |
Server state | Over |
Outcome | Computation error |
Client state | Compute error |
Exit status | 0 (0x00000000) |
Computer ID | 1299079 |
Run time | 19 hours 40 min 31 sec |
CPU time | 18 hours 24 min 39 sec |
Validate state | Invalid |
Credit | 509.72 |
Device peak FLOPS | 3.18 GFLOPS |
Application version | UK Met Office HadAM3P-HadRM3P Australia New Zealand v6.10 windows_intelx86 |
Stderr | <core_client_version>7.2.42</core_client_version> <![CDATA[ <stderr_txt> Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... CPDN Monitor - Quit request from BOINC... 02:03:43 (22836): No heartbeat from core client for 30 sec - exiting 02:03:44 (22836): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=35524, selfPID=35524, iMonCtr=2 12:22:11 (1696): No heartbeat from core client for 30 sec - exiting 12:22:12 (1696): No heartbeat from core client for 30 sec - exiting 12:22:13 (1696): No heartbeat from core client for 30 sec - exiting 12:22:14 (1696): No heartbeat from core client for 30 sec - exiting 12:22:15 (1696): No heartbeat from core client for 30 sec - exiting 12:22:16 (1696): No heartbeat from core client for 30 sec - exiting 12:22:17 (1696): No heartbeat from core client for 30 sec - exiting 12:22:18 (1696): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... 15:47:59 (6032): No heartbeat from core client for 30 sec - exiting 15:48:00 (6032): No heartbeat from core client for 30 sec - exiting 15:48:01 (6032): No heartbeat from core client for 30 sec - exiting 15:48:02 (6032): No heartbeat from core client for 30 sec - exiting 15:48:03 (6032): No heartbeat from core client for 30 sec - exiting 15:48:04 (6032): No heartbeat from core client for 30 sec - exiting 15:48:05 (6032): No heartbeat from core client for 30 sec - exiting 15:48:06 (6032): No heartbeat from core client for 30 sec - exiting 15:48:07 (6032): No heartbeat from core client for 30 sec - exiting 15:48:08 (6032): No heartbeat from core client for 30 sec - exiting 15:48:09 (6032): No heartbeat from core client for 30 sec - exiting 15:48:10 (6032): No heartbeat from core client for 30 sec - exiting 15:48:11 (6032): No heartbeat from core client for 30 sec - exiting 15:48:12 (6032): No heartbeat from core client for 30 sec - exiting 15:48:13 (6032): No heartbeat from core client for 30 sec - exiting 15:48:14 (6032): No heartbeat from core client for 30 sec - exiting 15:48:15 (6032): No heartbeat from core client for 30 sec - exiting 15:48:16 (6032): No heartbeat from core client for 30 sec - exiting 15:48:17 (6032): No heartbeat from core client for 30 sec - exiting 15:48:18 (6032): No heartbeat from core client for 30 sec - exiting 15:48:19 (6032): No heartbeat from core client for 30 sec - exiting 15:48:20 (6032): No heartbeat from core client for 30 sec - exiting 15:48:21 (6032): No heartbeat from core client for 30 sec - exiting 15:48:22 (6032): No heartbeat from core client for 30 sec - exiting 15:48:23 (6032): No heartbeat from core client for 30 sec - exiting 15:48:24 (6032): No heartbeat from core client for 30 sec - exiting 15:48:25 (6032): No heartbeat from core client for 30 sec - exiting 15:48:26 (6032): No heartbeat from core client for 30 sec - exiting 15:48:27 (6032): No heartbeat from core client for 30 sec - exiting 15:48:28 (6032): No heartbeat from core client for 30 sec - exiting 15:48:29 (6032): No heartbeat from core client for 30 sec - exiting 15:48:30 (6032): No heartbeat from core client for 30 sec - exiting 15:48:31 (6032): No heartbeat from core client for 30 sec - exiting 15:48:32 (6032): No heartbeat from core client for 30 sec - exiting 15:48:33 (6032): No heartbeat from core client for 30 sec - exiting 15:48:34 (6032): No heartbeat from core client for 30 sec - exiting 15:48:35 (6032): No heartbeat from core client for 30 sec - exiting 15:48:36 (6032): No heartbeat from core client for 30 sec - exiting 15:48:37 (6032): No heartbeat from core client for 30 sec - exiting 15:48:38 (6032): No heartbeat from core client for 30 sec - exiting 15:48:39 (6032): No heartbeat from core client for 30 sec - exiting 15:48:40 (6032): No heartbeat from core client for 30 sec - exiting 15:48:41 (6032): No heartbeat from core client for 30 sec - exiting 15:48:42 (6032): No heartbeat from core client for 30 sec - exiting 15:48:43 (6032): No heartbeat from core client for 30 sec - exiting 15:48:44 (6032): No heartbeat from core client for 30 sec - exiting 15:48:45 (6032): No heartbeat from core client for 30 sec - exiting 15:48:46 (6032): No heartbeat from core client for 30 sec - exiting 15:48:47 (6032): No heartbeat from core client for 30 sec - exiting 15:48:48 (6032): No heartbeat from core client for 30 sec - exiting 15:48:49 (6032): No heartbeat from core client for 30 sec - exiting 15:48:50 (6032): No heartbeat from core client for 30 sec - exiting 15:48:51 (6032): No heartbeat from core client for 30 sec - exiting 15:48:52 (6032): No heartbeat from core client for 30 sec - exiting 15:48:53 (6032): No heartbeat from core client for 30 sec - exiting 15:48:54 (6032): No heartbeat from core client for 30 sec - exiting 15:48:55 (6032): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=6716, selfPID=6716, iMonCtr=2 CPDN Monitor - Quit request from BOINC... 17:20:34 (9916): No heartbeat from core client for 30 sec - exiting 17:20:35 (9916): No heartbeat from core client for 30 sec - exiting 17:20:36 (9916): No heartbeat from core client for 30 sec - exiting 17:20:37 (9916): No heartbeat from core client for 30 sec - exiting 17:20:38 (9916): No heartbeat from core client for 30 sec - exiting 17:20:39 (9916): No heartbeat from core client for 30 sec - exiting 17:20:40 (9916): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... CPDN Monitor - Quit request from BOINC... Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=9916, selfPID=9916, iMonCtr=2 19:04:12 (14636): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... CPDN Monitor - Quit request from BOINC... 21:52:06 (17772): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... CPDN Monitor - Quit request from BOINC... 04:02:02 (33108): No heartbeat from core client for 30 sec - exiting 04:02:03 (33108): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 14:23:41 (6552): No heartbeat from core client for 30 sec - exiting 14:23:42 (6552): No heartbeat from core client for 30 sec - exiting 14:23:43 (6552): No heartbeat from core client for 30 sec - exiting 14:23:44 (6552): No heartbeat from core client for 30 sec - exiting 14:23:45 (6552): No heartbeat from core client for 30 sec - exiting 14:23:46 (6552): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=3748, selfPID=3748, iMonCtr=2 17:40:59 (6032): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... CPDN Monitor - Quit request from BOINC... 22:17:27 (9544): No heartbeat from core client for 30 sec - exiting 22:17:28 (9544): No heartbeat from core client for 30 sec - exiting 22:17:29 (9544): No heartbeat from core client for 30 sec - exiting 22:17:30 (9544): No heartbeat from core client for 30 sec - exiting 22:17:31 (9544): No heartbeat from core client for 30 sec - exiting 22:17:32 (9544): No heartbeat from core client for 30 sec - exiting 22:17:33 (9544): No heartbeat from core client for 30 sec - exiting 22:17:34 (9544): No heartbeat from core client for 30 sec - exiting 22:17:35 (9544): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 22:37:11 (7172): No heartbeat from core client for 30 sec - exiting 22:37:12 (7172): No heartbeat from core client for 30 sec - exiting 22:37:13 (7172): No heartbeat from core client for 30 sec - exiting 22:37:14 (7172): No heartbeat from core client for 30 sec - exiting 22:37:15 (7172): No heartbeat from core client for 30 sec - exiting 22:37:16 (7172): No heartbeat from core client for 30 sec - exiting 22:37:17 (7172): No heartbeat from core client for 30 sec - exiting 22:37:18 (7172): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... 21:41:53 (17612): No heartbeat from core client for 30 sec - exiting 21:41:54 (17612): No heartbeat from core client for 30 sec - exiting 21:41:55 (17612): No heartbeat from core client for 30 sec - exiting 21:41:56 (17612): No heartbeat from core client for 30 sec - exiting 21:41:57 (17612): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=20828, selfPID=20828, iMonCtr=2 CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... 00:17:56 (25188): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... 13:17:19 (1648): No heartbeat from core client for 30 sec - exiting 13:17:20 (1648): No heartbeat from core client for 30 sec - exiting 13:17:21 (1648): No heartbeat from core client for 30 sec - exiting 13:17:22 (1648): No heartbeat from core client for 30 sec - exiting 13:17:23 (1648): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=6468, selfPID=6468, iMonCtr=2 CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=10988, selfPID=10988, iMonCtr=2 18:58:16 (18612): No heartbeat from core client for 30 sec - exiting 18:58:17 (18612): No heartbeat from core client for 30 sec - exiting 18:58:18 (18612): No heartbeat from core client for 30 sec - exiting 18:58:19 (18612): No heartbeat from core client for 30 sec - exiting 18:58:20 (18612): No heartbeat from core client for 30 sec - exiting 18:58:21 (18612): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=6124, selfPID=6124, iMonCtr=2 22:00:31 (16756): No heartbeat from core client for 30 sec - exiting 22:00:32 (16756): No heartbeat from core client for 30 sec - exiting 22:00:33 (16756): No heartbeat from core client for 30 sec - exiting 22:00:34 (16756): No heartbeat from core client for 30 sec - exiting 22:00:35 (16756): No heartbeat from core client for 30 sec - exiting 22:00:36 (16756): No heartbeat from core client for 30 sec - exiting 22:00:37 (16756): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=22336, selfPID=22336, iMonCtr=2 CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=19880, selfPID=19880, iMonCtr=2 02:58:24 (27288): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=29524, selfPID=29524, iMonCtr=2 CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=30376, selfPID=30376, iMonCtr=2 CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=36400, selfPID=36400, iMonCtr=2 CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=6632, selfPID=6632, iMonCtr=2 CPDN Monitor - Quit request from BOINC... Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=9616, selfPID=9616, iMonCtr=2 CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=18856, selfPID=18856, iMonCtr=2 CCPDN Monitor - Quit request from BOINC... 01:41:28 (10068): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... 07:22:38 (27344): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 07:22:40 (27344): No heartbeat from core client for 30 sec - exiting CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=21624, selfPID=21624, iMonCtr=2 CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=31500, selfPID=31500, iMonCtr=2 CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=30076, selfPID=30076, iMonCtr=2 CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=3268, selfPID=3268, iMonCtr=2 CPDN Monitor - Quit request from BOINC... Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=7076, selfPID=7076, iMonCtr=2 09:07:36 (9676): No heartbeat from core client for 30 sec - exiting 09:07:37 (9676): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... 20:49:51 (3436): No heartbeat from core client for 30 sec - exiting 20:49:52 (3436): No heartbeat from core client for 30 sec - exiting 20:49:53 (3436): No heartbeat from core client for 30 sec - exiting 20:49:54 (3436): No heartbeat from core client for 30 sec - exiting 20:49:55 (3436): No heartbeat from core client for 30 sec - exiting 20:49:56 (3436): No heartbeat from core client for 30 sec - exiting 20:49:57 (3436): No heartbeat from core client for 30 sec - exiting 20:49:58 (3436): No heartbeat from core client for 30 sec - exiting 20:50:00 (3436): No heartbeat from core client for 30 sec - exiting 20:50:01 (3436): No heartbeat from core client for 30 sec - exiting 20:50:02 (3436): No heartbeat from core client for 30 sec - exiting 20:50:03 (3436): No heartbeat from core client for 30 sec - exiting 20:50:04 (3436): No heartbeat from core client for 30 sec - exiting 20:50:05 (3436): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... Global Worker:: CPDN process is not running, exiting, bRetVal = 0, checkPID=0, selfPID=0, iMonCtr=1 Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=0, iMonCtr=2 Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=5060, selfPID=4840, iMonCtr=1 Model crash detected, will try to restart... Leaving CPDN_Main::Monitor... Called boinc_finish </stderr_txt> <message> upload failure: <file_xfer_error> <file_name>hadam3p_anz_m1bu_2012_1_009304032_0_2.zip</file_name> <error_code>-161 (not found)</error_code> </file_xfer_error> <file_xfer_error> <file_name>hadam3p_anz_m1bu_2012_1_009304032_0_3.zip</file_name> <error_code>-161 (not found)</error_code> </file_xfer_error> <file_xfer_error> <file_name>hadam3p_anz_m1bu_2012_1_009304032_0_4.zip</file_name> <error_code>-161 (not found)</error_code> </file_xfer_error> <file_xfer_error> <file_name>hadam3p_anz_m1bu_2012_1_009304032_0_5.zip</file_name> <error_code>-161 (not found)</error_code> </file_xfer_error> <file_xfer_error> <file_name>hadam3p_anz_m1bu_2012_1_009304032_0_6.zip</file_name> <error_code>-161 (not found)</error_code> </file_xfer_error> <file_xfer_error> <file_name>hadam3p_anz_m1bu_2012_1_009304032_0_7.zip</file_name> <error_code>-161 (not found)</error_code> </file_xfer_error> <file_xfer_error> <file_name>hadam3p_anz_m1bu_2012_1_009304032_0_8.zip</file_name> <error_code>-161 (not found)</error_code> </file_xfer_error> <file_xfer_error> <file_name>hadam3p_anz_m1bu_2012_1_009304032_0_9.zip</file_name> <error_code>-161 (not found)</error_code> </file_xfer_error> <file_xfer_error> <file_name>hadam3p_anz_m1bu_2012_1_009304032_0_10.zip</file_name> <error_code>-161 (not found)</error_code> </file_xfer_error> <file_xfer_error> <file_name>hadam3p_anz_m1bu_2012_1_009304032_0_11.zip</file_name> <error_code>-161 (not found)</error_code> </file_xfer_error> <file_xfer_error> <file_name>hadam3p_anz_m1bu_2012_1_009304032_0_12.zip</file_name> <error_code>-161 (not found)</error_code> </file_xfer_error> </message> ]]> |
Latest Trickles Received | ||||||
---|---|---|---|---|---|---|
Time Sent (UTC) | Host ID | Result ID | Result Name | Timestep | CPU Time (sec) | Average (sec/TS) |
01 Jan 2015 16:53:56 | 1299079 | 17590265 | hadam3p_anz_m1bu_2012_1_009304032_0 | 11,819 | 36,909 | 3.1229 |
©2024 cpdn.org