Name | hadam3p_pnw_puxj_2013_1_009987292_2 |
Workunit | 9993650 |
Created | 11 Jul 2015, 19:15:32 UTC |
Sent | 14 Jul 2015, 19:29:30 UTC |
Report deadline | 26 Jun 2016, 0:49:30 UTC |
Received | 29 Jul 2015, 19:42:59 UTC |
Server state | Over |
Outcome | Computation error |
Client state | Compute error |
Exit status | 0 (0x00000000) |
Computer ID | 1349053 |
Run time | 4 days 2 hours 35 min 2 sec |
CPU time | 3 days 7 hours 42 min 6 sec |
Validate state | Invalid |
Credit | 1,258.08 |
Device peak FLOPS | 2.39 GFLOPS |
Application version | UK Met Office HadAM3P-HadRM3P Pacific North West v7.27 windows_intelx86 |
Stderr | <core_client_version>6.6.28</core_client_version> <![CDATA[ <stderr_txt> Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... CPDN Monitor - Quit request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... 18:05:27 (3888): No heartbeat from client for 30 sec - exiting 18:05:27 (3888): timer handler: client dead, exiting CPDN Monitor - No 'heartbeat' from BOINC... 19:04:05 (5056): No heartbeat from client for 30 sec - exiting 19:04:05 (5056): timer handler: client dead, exiting CPDN Monitor - No 'heartbeat' from BOINC... 20:02:44 (3696): No heartbeat from client for 30 sec - exiting 20:02:44 (3696): timer handler: client dead, exiting CPDN Monitor - No 'heartbeat' from BOINC... 21:01:23 (5736): No heartbeat from client for 30 sec - exiting 21:01:23 (5736): timer handler: client dead, exiting CPDN Monitor - No 'heartbeat' from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... 01:59:46 (1432): No heartbeat from client for 30 sec - exiting 01:59:46 (1432): timer handler: client dead, exiting Suspended CPDN Monitor - No 'heartbeat' from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... 14:09:00 (4688): No heartbeat from client for 30 sec - exiting 14:09:00 (4688): timer handler: client dead, exiting CPDN Monitor - No 'heartbeat' from BOINC... 15:07:31 (5644): No heartbeat from client for 30 sec - exiting 15:07:31 (5644): timer handler: client dead, exiting CPDN Monitor - No 'heartbeat' from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... 16:06:07 (4664): No heartbeat from client for 30 sec - exiting 16:06:07 (4664): timer handler: client dead, exiting Suspended CPDN Monitor - No 'heartbeat' from BOINC... 
Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... 18:04:45 (5160): No heartbeat from client for 30 sec - exiting 18:04:45 (5160): timer handler: client dead, exiting CPDN Monitor - No 'heartbeat' from BOINC... 20:03:10 (4152): No heartbeat from client for 30 sec - exiting 20:03:10 (4152): timer handler: client dead, exiting CPDN Monitor - No 'heartbeat' from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... 22:01:30 (5756): No heartbeat from client for 30 sec - exiting 22:01:30 (5756): timer handler: client dead, exiting Suspended CPDN Monitor - No 'heartbeat' from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... 07:07:17 (3220): No heartbeat from client for 30 sec - exiting 07:07:17 (3220): timer handler: client dead, exiting Suspended CPDN Monitor - No 'heartbeat' from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... 08:05:45 (1528): No heartbeat from client for 30 sec - exiting 08:05:45 (1528): timer handler: client dead, exiting Suspended CPDN Monitor - No 'heartbeat' from BOINC... 11:04:34 (3960): No heartbeat from client for 30 sec - exiting 11:04:34 (3960): timer handler: client dead, exiting CPDN Monitor - No 'heartbeat' from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... 14:03:32 (2204): No heartbeat from client for 30 sec - exiting 14:03:32 (2204): timer handler: client dead, exiting Suspended CPDN Monitor - No 'heartbeat' from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... 
Suspended CPDN Monitor - Suspend request from BOINC... 17:02:30 (2156): No heartbeat from client for 30 sec - exiting 17:02:30 (2156): timer handler: client dead, exiting CPDN Monitor - No 'heartbeat' from BOINC... 18:01:02 (4564): No heartbeat from client for 30 sec - exiting 18:01:02 (4564): timer handler: client dead, exiting CPDN Monitor - No 'heartbeat' from BOINC... 18:59:33 (5596): No heartbeat from client for 30 sec - exiting 18:59:33 (5596): timer handler: client dead, exiting CPDN Monitor - No 'heartbeat' from BOINC... 19:58:06 (5332): No heartbeat from client for 30 sec - exiting 19:58:06 (5332): timer handler: client dead, exiting CPDN Monitor - No 'heartbeat' from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... 01:57:03 (5368): No heartbeat from client for 30 sec - exiting 01:57:03 (5368): timer handler: client dead, exiting CPDN Monitor - No 'heartbeat' from BOINC... CPDN Monitor - Quit request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... 17:26:02 (548): No heartbeat from client for 30 sec - exiting 17:26:02 (548): timer handler: client dead, exiting Suspended CPDN Monitor - No 'heartbeat' from BOINC... 18:24:48 (444): No heartbeat from client for 30 sec - exiting 18:24:48 (444): timer handler: client dead, exiting CPDN Monitor - No 'heartbeat' from BOINC... Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=5888, selfPID=5888, iMonCtr=2 19:23:33 (2368): No heartbeat from client for 30 sec - exiting 19:23:33 (2368): timer handler: client dead, exiting CPDN Monitor - No 'heartbeat' from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... 20:22:29 (5968): No heartbeat from client for 30 sec - exiting 20:22:29 (5968): timer handler: client dead, exiting Suspended CPDN Monitor - No 'heartbeat' from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Quit request from BOINC... 
Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... 07:10:11 (4448): No heartbeat from client for 30 sec - exiting 07:10:11 (4448): timer handler: client dead, exiting Suspended CPDN Monitor - No 'heartbeat' from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... 08:09:11 (4192): No heartbeat from client for 30 sec - exiting 08:09:11 (4192): timer handler: client dead, exiting Suspended CPDN Monitor - No 'heartbeat' from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... 20:07:55 (1560): No heartbeat from client for 30 sec - exiting 20:07:55 (1560): timer handler: client dead, exiting CPDN Monitor - No 'heartbeat' from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... CPDN Monitor - Quit request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... 19:20:53 (4832): No heartbeat from client for 30 sec - exiting 19:20:53 (4832): timer handler: client dead, exiting CPDN Monitor - No 'heartbeat' from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... 20:19:46 (4776): No heartbeat from client for 30 sec - exiting 20:19:47 (4776): timer handler: client dead, exiting Suspended CPDN Monitor - No 'heartbeat' from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Quit request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... 
Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... 19:08:46 (3928): No heartbeat from client for 30 sec - exiting 19:08:46 (3928): timer handler: client dead, exiting CPDN Monitor - No 'heartbeat' from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... 21:07:13 (5688): No heartbeat from client for 30 sec - exiting 21:07:13 (5688): timer handler: client dead, exiting Suspended CPDN Monitor - No 'heartbeat' from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... 23:06:12 (4732): No heartbeat from client for 30 sec - exiting 23:06:12 (4732): timer handler: client dead, exiting Suspended CPDN Monitor - No 'heartbeat' from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... 17:12:21 (3648): No heartbeat from client for 30 sec - exiting 17:12:21 (3648): timer handler: client dead, exiting Suspended CPDN Monitor - No 'heartbeat' from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... 20:10:48 (432): No heartbeat from client for 30 sec - exiting 20:10:48 (432): timer handler: client dead, exiting CPDN Monitor - No 'heartbeat' from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... 21:09:41 (5908): No heartbeat from client for 30 sec - exiting 21:09:41 (5908): timer handler: client dead, exiting Suspended CPDN Monitor - No 'heartbeat' from BOINC... 
Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Quit request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Quit request from BOINC... 14:09:35 (4104): No heartbeat from client for 30 sec - exiting 14:09:35 (4104): timer handler: client dead, exiting CPDN Monitor - No 'heartbeat' from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... 18:08:17 (5068): No heartbeat from client for 30 sec - exiting 18:08:17 (5068): timer handler: client dead, exiting CPDN Monitor - No 'heartbeat' from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Quit request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... 
Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... 09:11:07 (1252): No heartbeat from client for 30 sec - exiting 09:11:07 (1252): timer handler: client dead, exiting Suspended CPDN Monitor - No 'heartbeat' from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... 12:09:50 (808): No heartbeat from client for 30 sec - exiting 12:09:50 (808): timer handler: client dead, exiting CPDN Monitor - No 'heartbeat' from BOINC... 15:25:53 (4616): No heartbeat from client for 30 sec - exiting 15:25:53 (4616): timer handler: client dead, exiting CPDN Monitor - No 'heartbeat' from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Quit request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... 07:02:52 (5984): No heartbeat from client for 30 sec - exiting 07:02:52 (5984): timer handler: client dead, exiting Suspended CPDN Monitor - No 'heartbeat' from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Signal 4 received, exiting... 
12:42:18 (3072): called boinc_finish(193) Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4252, iMonCtr=2 Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=3072, selfPID=4848, iMonCtr=1 Model crash detected, will try to restart... Leaving CPDN_Main::Monitor... 12:42:32 (4848): called boinc_finish(0) </stderr_txt> <message> <file_xfer_error> <file_name>hadam3p_pnw_puxj_2013_1_009987292_2_6.zip</file_name> <error_code>-161</error_code> </file_xfer_error> <file_xfer_error> <file_name>hadam3p_pnw_puxj_2013_1_009987292_2_7.zip</file_name> <error_code>-161</error_code> </file_xfer_error> <file_xfer_error> <file_name>hadam3p_pnw_puxj_2013_1_009987292_2_8.zip</file_name> <error_code>-161</error_code> </file_xfer_error> <file_xfer_error> <file_name>hadam3p_pnw_puxj_2013_1_009987292_2_9.zip</file_name> <error_code>-161</error_code> </file_xfer_error> <file_xfer_error> <file_name>hadam3p_pnw_puxj_2013_1_009987292_2_10.zip</file_name> <error_code>-161</error_code> </file_xfer_error> <file_xfer_error> <file_name>hadam3p_pnw_puxj_2013_1_009987292_2_11.zip</file_name> <error_code>-161</error_code> </file_xfer_error> <file_xfer_error> <file_name>hadam3p_pnw_puxj_2013_1_009987292_2_12.zip</file_name> <error_code>-161</error_code> </file_xfer_error> <file_xfer_error> <file_name>hadam3p_pnw_puxj_2013_1_009987292_2_13.zip</file_name> <error_code>-161</error_code> </file_xfer_error> <file_xfer_error> <file_name>hadam3p_pnw_puxj_2013_1_009987292_2_14.zip</file_name> <error_code>-161</error_code> </file_xfer_error> <file_xfer_error> <file_name>hadam3p_pnw_puxj_2013_1_009987292_2_15.zip</file_name> <error_code>-161</error_code> </file_xfer_error> <file_xfer_error> <file_name>hadam3p_pnw_puxj_2013_1_009987292_2_16.zip</file_name> <error_code>-161</error_code> </file_xfer_error> <file_xfer_error> <file_name>hadam3p_pnw_puxj_2013_1_009987292_2_17.zip</file_name> <error_code>-161</error_code> </file_xfer_error> <file_xfer_error> 
<file_name>hadam3p_pnw_puxj_2013_1_009987292_2_18.zip</file_name> <error_code>-161</error_code> </file_xfer_error> </message> ]]> |
Latest Trickles Received

Time Sent (UTC) | Host ID | Result ID | Result Name | Timestep | CPU Time (sec) | Average (sec/TS) |
---|---|---|---|---|---|---|
29 Jul 2015 06:37:14 | 1349053 | 18695315 | hadam3p_pnw_puxj_2013_1_009987292_2 | 57,899 | 269,145 | 4.6485 |
27 Jul 2015 23:06:32 | 1349053 | 18695315 | hadam3p_pnw_puxj_2013_1_009987292_2 | 46,379 | 215,650 | 4.6497 |
25 Jul 2015 17:37:31 | 1349053 | 18695315 | hadam3p_pnw_puxj_2013_1_009987292_2 | 34,859 | 161,720 | 4.6393 |
23 Jul 2015 02:07:15 | 1349053 | 18695315 | hadam3p_pnw_puxj_2013_1_009987292_2 | 23,339 | 107,802 | 4.6190 |
19 Jul 2015 18:23:52 | 1349053 | 18695315 | hadam3p_pnw_puxj_2013_1_009987292_2 | 11,819 | 54,323 | 4.5962 |
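
The Average (sec/TS) column above appears consistent with dividing the cumulative CPU time by the timestep reached. The sketch below checks that reading against the five trickle rows; the function name `average_sec_per_timestep` and the tuple layout are illustrative assumptions, not part of the CPDN server code.

```python
# Trickle rows copied from the table above, oldest first:
# (timestep, cumulative CPU time in seconds, reported average in sec/TS)
trickles = [
    (11_819, 54_323, 4.5962),
    (23_339, 107_802, 4.6190),
    (34_859, 161_720, 4.6393),
    (46_379, 215_650, 4.6497),
    (57_899, 269_145, 4.6485),
]


def average_sec_per_timestep(cpu_seconds, timestep):
    """Assumed definition: cumulative CPU seconds divided by timesteps completed."""
    return cpu_seconds / timestep


# Compare the recomputed average with the value reported in the table.
for timestep, cpu_seconds, reported in trickles:
    computed = average_sec_per_timestep(cpu_seconds, timestep)
    print(f"timestep {timestep:>6}: computed {computed:.4f}, reported {reported:.4f}")
```

Under this assumption, each recomputed value agrees with the reported one to four decimal places, so the column can be treated as a running average over the whole run rather than a per-interval rate.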