Name | hadam3p_saf_1h5u_1977_1_006959674_1 |
Workunit | 7162990 |
Created | 23 Aug 2012, 18:14:17 UTC |
Sent | 23 Aug 2012, 18:16:27 UTC |
Report deadline | 5 Aug 2013, 23:36:27 UTC |
Received | 31 Aug 2012, 17:24:20 UTC |
Server state | Over |
Outcome | Computation error |
Client state | Compute error |
Exit status | 0 (0x00000000) |
Computer ID | 1224497 |
Run time | 2 days 3 hours 21 min 26 sec |
CPU time | 8 hours 38 min 25 sec |
Validate state | Invalid |
Credit | 1,496.58 |
Device peak FLOPS | 3.69 GFLOPS |
Application version | UK Met Office HadAM3P-HadRM3P Southern Africa v6.09 windows_intelx86 |
Stderr | <core_client_version>7.0.28</core_client_version> <![CDATA[ <stderr_txt> Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... 22:28:00 (3456): No heartbeat from core client for 30 sec - exiting 22:28:01 (3456): No heartbeat from core client for 30 sec - exiting 22:28:02 (3456): No heartbeat from core client for 30 sec - exiting 22:28:03 (3456): No heartbeat from core client for 30 sec - exiting 22:28:04 (3456): No heartbeat from core client for 30 sec - exiting 22:28:05 (3456): No heartbeat from core client for 30 sec - exiting 22:28:06 (3456): No heartbeat from core client for 30 sec - exiting 22:28:07 (3456): No heartbeat from core client for 30 sec - exiting 22:28:08 (3456): No heartbeat from core client for 30 sec - exiting 22:28:09 (3456): No heartbeat from core client for 30 sec - exiting 22:28:10 (3456): No heartbeat from core client for 30 sec - exiting 22:28:11 (3456): No heartbeat from core client for 30 sec - exiting 22:28:12 (3456): No heartbeat from core client for 30 sec - exiting 22:28:13 (3456): No heartbeat from core client for 30 sec - exiting 22:28:14 (3456): No heartbeat from core client for 30 sec - exiting 22:28:15 (3456): No heartbeat from core client for 30 sec - exiting 22:28:16 (3456): No heartbeat from core client for 30 sec - exiting 22:28:17 (3456): No heartbeat from core client for 30 sec - exiting 22:28:18 (3456): No heartbeat from core client for 30 sec - exiting 22:28:19 (3456): No heartbeat from core client for 30 sec - exiting 22:28:20 (3456): No heartbeat from core client for 30 sec - exiting 22:28:21 (3456): No heartbeat from core client for 30 sec - exiting 22:28:22 (3456): No heartbeat from core client for 30 sec - exiting 22:28:23 (3456): No heartbeat from core client for 30 sec - exiting 22:28:24 (3456): No heartbeat from core client for 30 sec - exiting 22:28:25 (3456): No heartbeat from core client for 30 sec - exiting 22:28:27 (3456): No heartbeat from core 
client for 30 sec - exiting 22:28:28 (3456): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 22:28:29 (3456): No heartbeat from core client for 30 sec - exiting 22:28:30 (3456): No heartbeat from core client for 30 sec - exiting 05:35:16 (5004): No heartbeat from core client for 30 sec - exiting 05:35:17 (5004): No heartbeat from core client for 30 sec - exiting 05:35:18 (5004): No heartbeat from core client for 30 sec - exiting 05:35:19 (5004): No heartbeat from core client for 30 sec - exiting 05:35:20 (5004): No heartbeat from core client for 30 sec - exiting 05:35:21 (5004): No heartbeat from core client for 30 sec - exiting 05:35:22 (5004): No heartbeat from core client for 30 sec - exiting 05:35:23 (5004): No heartbeat from core client for 30 sec - exiting 05:35:25 (5004): No heartbeat from core client for 30 sec - exiting 05:35:26 (5004): No heartbeat from core client for 30 sec - exiting 05:35:27 (5004): No heartbeat from core client for 30 sec - exiting 05:35:28 (5004): No heartbeat from core client for 30 sec - exiting 05:35:29 (5004): No heartbeat from core client for 30 sec - exiting 05:35:30 (5004): No heartbeat from core client for 30 sec - exiting 05:35:31 (5004): No heartbeat from core client for 30 sec - exiting 05:35:32 (5004): No heartbeat from core client for 30 sec - exiting 05:35:33 (5004): No heartbeat from core client for 30 sec - exiting 05:35:34 (5004): No heartbeat from core client for 30 sec - exiting 05:35:35 (5004): No heartbeat from core client for 30 sec - exiting 05:35:37 (5004): No heartbeat from core client for 30 sec - exiting 05:35:38 (5004): No heartbeat from core client for 30 sec - exiting 05:35:39 (5004): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=3068, selfPID=4484, iMonCtr=1 Model crash detected, will try to restart... 
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=5536, selfPID=4968, iMonCtr=1 Model crash detected, will try to restart... Leaving CPDN_Main::Monitor... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... 07:07:50 (4828): No heartbeat from core client for 30 sec - exiting 07:07:51 (4828): No heartbeat from core client for 30 sec - exiting 07:07:52 (4828): No heartbeat from core client for 30 sec - exiting 07:07:53 (4828): No heartbeat from core client for 30 sec - exiting 07:07:54 (4828): No heartbeat from core client for 30 sec - exiting 07:07:55 (4828): No heartbeat from core client for 30 sec - exiting 07:07:56 (4828): No heartbeat from core client for 30 sec - exiting 07:07:57 (4828): No heartbeat from core client for 30 sec - exiting 07:07:58 (4828): No heartbeat from core client for 30 sec - exiting 07:08:00 (4828): No heartbeat from core client for 30 sec - exiting 07:08:01 (4828): No heartbeat from core client for 30 sec - exiting 07:08:02 (4828): No heartbeat from core client for 30 sec - exiting 07:08:03 (4828): No heartbeat from core client for 30 sec - exiting 07:08:04 (4828): No heartbeat from core client for 30 sec - exiting 07:08:05 (4828): No heartbeat from core client for 30 sec - exiting 07:08:06 (4828): No heartbeat from core client for 30 sec - exiting 07:08:07 (4828): No heartbeat from core client for 30 sec - exiting 07:08:08 (4828): No heartbeat from core client for 30 sec - exiting 07:08:09 (4828): No heartbeat from core client for 30 sec - exiting 07:08:10 (4828): No heartbeat from core client for 30 sec - exiting 07:08:12 (4828): No heartbeat from core client for 30 sec - exiting 07:08:13 (4828): No heartbeat from core client for 30 sec - exiting 07:08:14 (4828): No heartbeat from core client for 30 sec - exiting 07:08:15 (4828): No heartbeat from core client for 30 sec - exiting 07:08:16 (4828): No heartbeat from core client for 30 sec - exiting CPDN 
Monitor - No 'heartbeat' from BOINC... 07:08:17 (4828): No heartbeat from core client for 30 sec - exiting Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... 09:24:10 (4616): No heartbeat from core client for 30 sec - exiting 09:24:12 (4616): No heartbeat from core client for 30 sec - exiting 09:24:13 (4616): No heartbeat from core client for 30 sec - exiting 09:24:14 (4616): No heartbeat from core client for 30 sec - exiting 09:24:15 (4616): No heartbeat from core client for 30 sec - exiting 09:24:16 (4616): No heartbeat from core client for 30 sec - exiting 09:24:17 (4616): No heartbeat from core client for 30 sec - exiting 09:24:18 (4616): No heartbeat from core client for 30 sec - exiting 09:24:19 (4616): No heartbeat from core client for 30 sec - exiting 09:24:20 (4616): No heartbeat from core client for 30 sec - exiting 09:24:21 (4616): No heartbeat from core client for 30 sec - exiting 09:24:22 (4616): No heartbeat from core client for 30 sec - exiting 09:24:23 (4616): No heartbeat from core client for 30 sec - exiting 09:24:24 (4616): No heartbeat from core client for 30 sec - exiting 09:24:26 (4616): No heartbeat from core client for 30 sec - exiting 09:24:27 (4616): No heartbeat from core client for 30 sec - exiting 09:24:28 (4616): No heartbeat from core client for 30 sec - exiting 09:24:29 (4616): No heartbeat from core client for 30 sec - exiting 09:24:30 (4616): No heartbeat from core client for 30 sec - exiting 09:24:31 (4616): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 21:28:00 (5276): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... 
09:28:02 (4744): No heartbeat from core client for 30 sec - exiting 09:28:03 (4744): No heartbeat from core client for 30 sec - exiting 09:28:04 (4744): No heartbeat from core client for 30 sec - exiting 09:28:05 (4744): No heartbeat from core client for 30 sec - exiting 09:28:06 (4744): No heartbeat from core client for 30 sec - exiting 09:28:07 (4744): No heartbeat from core client for 30 sec - exiting 09:28:08 (4744): No heartbeat from core client for 30 sec - exiting 09:28:09 (4744): No heartbeat from core client for 30 sec - exiting 09:28:10 (4744): No heartbeat from core client for 30 sec - exiting 09:28:12 (4744): No heartbeat from core client for 30 sec - exiting 09:28:13 (4744): No heartbeat from core client for 30 sec - exiting 09:28:14 (4744): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=5512, selfPID=5024, iMonCtr=1 Model crash detected, will try to restart... Leaving CPDN_Main::Monitor... Called boinc_finish Global Worker:: CPDN process is not running, exiting, bRetVal = 0, checkPID=0, selfPID=3536, iMonCtr=1 Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=3688, selfPID=3688, iMonCtr=2 Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=3688, selfPID=1156, iMonCtr=1 Model crash detected, will try to restart... Leaving CPDN_Main::Monitor... 
Called boinc_finish </stderr_txt> <message> upload failure: <file_xfer_error> <file_name>hadam3p_saf_1h5u_1977_1_006959674_1_9.zip</file_name> <error_code>-161</error_code> </file_xfer_error> <file_xfer_error> <file_name>hadam3p_saf_1h5u_1977_1_006959674_1_10.zip</file_name> <error_code>-161</error_code> </file_xfer_error> <file_xfer_error> <file_name>hadam3p_saf_1h5u_1977_1_006959674_1_11.zip</file_name> <error_code>-161</error_code> </file_xfer_error> <file_xfer_error> <file_name>hadam3p_saf_1h5u_1977_1_006959674_1_12.zip</file_name> <error_code>-161</error_code> </file_xfer_error> </message> ]]> |

Latest Trickles Received

Time Sent (UTC) | Host ID | Result ID | Result Name | Timestep | CPU Time (sec) | Average (sec/TS) |
---|---|---|---|---|---|---|
30 Aug 2012 09:51:17 | 1224497 | 15179865 | hadam3p_saf_1h5u_1977_1_006959674_1 | 92,256 | 150,385 | 1.6301 |
30 Aug 2012 01:34:13 | 1224497 | 15179865 | hadam3p_saf_1h5u_1977_1_006959674_1 | 80,736 | 132,600 | 1.6424 |
29 Aug 2012 16:23:00 | 1224497 | 15179865 | hadam3p_saf_1h5u_1977_1_006959674_1 | 69,216 | 113,907 | 1.6457 |
29 Aug 2012 08:15:18 | 1224497 | 15179865 | hadam3p_saf_1h5u_1977_1_006959674_1 | 57,696 | 95,478 | 1.6548 |
29 Aug 2012 03:03:57 | 1224497 | 15179865 | hadam3p_saf_1h5u_1977_1_006959674_1 | 46,176 | 76,678 | 1.6606 |
28 Aug 2012 01:24:13 | 1224497 | 15179865 | hadam3p_saf_1h5u_1977_1_006959674_1 | 34,656 | 57,295 | 1.6532 |
27 Aug 2012 09:51:35 | 1224497 | 15179865 | hadam3p_saf_1h5u_1977_1_006959674_1 | 23,136 | 38,538 | 1.6657 |
24 Aug 2012 06:50:45 | 1224497 | 15179865 | hadam3p_saf_1h5u_1977_1_006959674_1 | 11,616 | 19,429 | 1.6726 |
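
The Average (sec/TS) column in the trickle table appears to be the cumulative CPU time divided by the timestep count, rounded to four decimal places. That relationship is inferred from the figures themselves and is not stated on the page. A minimal sketch that recomputes the column under that assumption (the names `TRICKLES` and `seconds_per_timestep` are illustrative, not part of any BOINC or CPDN API):

```python
# Trickle rows copied from the table above:
# (timestep, cumulative CPU time in seconds, reported average in sec/TS).
TRICKLES = [
    (92_256, 150_385, 1.6301),
    (80_736, 132_600, 1.6424),
    (69_216, 113_907, 1.6457),
    (57_696, 95_478, 1.6548),
    (46_176, 76_678, 1.6606),
    (34_656, 57_295, 1.6532),
    (23_136, 38_538, 1.6657),
    (11_616, 19_429, 1.6726),
]


def seconds_per_timestep(timestep: int, cpu_seconds: int) -> float:
    """Assumed formula: cumulative CPU seconds / timesteps, rounded to 4 places."""
    return round(cpu_seconds / timestep, 4)


if __name__ == "__main__":
    for timestep, cpu_seconds, reported in TRICKLES:
        computed = seconds_per_timestep(timestep, cpu_seconds)
        print(f"{timestep:>7,} | {cpu_seconds:>8,} | {computed:.4f} | reported {reported:.4f}")
```

Under this assumption the recomputed values agree with every row of the reported column, for example 150,385 / 92,256 ≈ 1.6301 for the most recent trickle.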