Name | hadam3p_eu_w7rp_1996_1_006805421_1 |
Workunit | 7008737 |
Created | 7 Mar 2012, 15:53:07 UTC |
Sent | 7 Mar 2012, 15:59:37 UTC |
Report deadline | 17 Feb 2013, 21:19:37 UTC |
Received | 11 Mar 2012, 10:16:54 UTC |
Server state | Over |
Outcome | Computation error |
Client state | Compute error |
Exit status | 0 (0x00000000) |
Computer ID | 1203717 |
Run time | 1 days 9 hours 49 min 41 sec |
CPU time | 1 days 5 hours 36 min 36 sec |
Validate state | Invalid |
Credit | 995.30 |
Device peak FLOPS | 3.49 GFLOPS |
Application version | UK Met Office HadAM3P-HadRM3P Europe v6.09 windows_intelx86 |
Stderr | <core_client_version>6.12.34</core_client_version> <![CDATA[ <stderr_txt> Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... CPDN Monitor - Quit request from BOINC... 21:24:50 (804): Can't acquire lockfile (32) - waiting 35s 21:25:17 (5696): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 21:25:25 (804): Can't set up shared mem: -1. Will run in standalone mode. 21:25:27 (5276): Can't set up shared mem: -1. Will run in standalone mode. 21:25:27 (4204): Can't set up shared mem: -1. Will run in standalone mode. 23:13:51 (5580): Can't acquire lockfile (32) - waiting 35s Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5276, iMonCtr=2 23:14:26 (5580): Can't acquire lockfile (32) - exiting 23:14:26 (5580): Error: µ{§ÇµLªk¦s¨úÀɮסA¦]¬°ÀÉ®×¥¿¥Ñ¥t¤@­Óµ{§Ç¨Ï¥Î¡C (0x20) 23:59:43 (292): Can't acquire lockfile (32) - waiting 35s 00:00:18 (292): Can't acquire lockfile (32) - exiting 00:00:18 (292): Error: µ{§ÇµLªk¦s¨úÀɮסA¦]¬°ÀÉ®×¥¿¥Ñ¥t¤@­Óµ{§Ç¨Ï¥Î¡C (0x20) 00:01:38 (1080): Can't acquire lockfile (32) - waiting 35s 00:02:13 (1080): Can't acquire lockfile (32) - exiting 00:02:13 (1080): Error: µ{§ÇµLªk¦s¨úÀɮסA¦]¬°ÀÉ®×¥¿¥Ñ¥t¤@­Óµ{§Ç¨Ï¥Î¡C (0x20) 02:13:55 (4532): Can't acquire lockfile (32) - waiting 35s 02:14:30 (4532): Can't acquire lockfile (32) - exiting 02:14:30 (4532): Error: µ{§ÇµLªk¦s¨úÀɮסA¦]¬°ÀÉ®×¥¿¥Ñ¥t¤@­Óµ{§Ç¨Ï¥Î¡C (0x20) 04:23:37 (4308): Can't acquire lockfile (32) - waiting 35s 04:24:12 (4308): Can't acquire lockfile (32) - exiting 04:24:12 (4308): Error: µ{§ÇµLªk¦s¨úÀɮסA¦]¬°ÀÉ®×¥¿¥Ñ¥t¤@­Óµ{§Ç¨Ï¥Î¡C (0x20) 06:42:18 (3028): Can't acquire lockfile (32) - waiting 35s 06:42:53 (3028): Can't acquire lockfile (32) - exiting 06:42:53 (3028): Error: µ{§ÇµLªk¦s¨úÀɮסA¦]¬°ÀÉ®×¥¿¥Ñ¥t¤@­Óµ{§Ç¨Ï¥Î¡C (0x20) 06:43:03 (760): Can't acquire lockfile (32) - waiting 35s 07:44:47 (3268): Can't acquire lockfile (32) - waiting 35s 07:45:27 (3268): Can't acquire lockfile (32) - exiting 07:45:27 (3268): Error: µ{§ÇµLªk¦s¨úÀɮסA¦]¬°ÀÉ®×¥¿¥Ñ¥t¤@­Óµ{§Ç¨Ï¥Î¡C (0x20) Suspended CPDN Monitor - Suspend request from BOINC... CPDN Monitor - Quit request from BOINC... 19:09:22 (4216): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 21:28:17 (2852): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 21:31:02 (3032): Can't acquire lockfile (32) - waiting 35s Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... CPDN Monitor - Quit request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... CPDN Monitor - Quit request from BOINC... 23:32:39 (4860): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... CPDN Monitor - Quit request from BOINC... Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=6952, selfPID=6952, iMonCtr=2 08:54:10 (3224): No heartbeat from core client for 30 sec - exiting 08:54:11 (3224): No heartbeat from core client for 30 sec - exiting 08:54:12 (3224): No heartbeat from core client for 30 sec - exiting 08:54:13 (3224): No heartbeat from core client for 30 sec - exiting 08:54:14 (3224): No heartbeat from core client for 30 sec - exiting 08:54:15 (3224): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... CPDN Monitor - Quit request from BOINC... Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=3260, selfPID=3260, iMonCtr=2 CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=2600, selfPID=1660, iMonCtr=1 Model crash detected, will try to restart... Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=2600, selfPID=2600, iMonCtr=2 Leaving CPDN_Main::Monitor... Called boinc_finish </stderr_txt> <message> upload failure: <file_xfer_error> <file_name>hadam3p_eu_w7rp_1996_1_006805421_1_6.zip</file_name> <error_code>-161</error_code> </file_xfer_error> <file_xfer_error> <file_name>hadam3p_eu_w7rp_1996_1_006805421_1_7.zip</file_name> <error_code>-161</error_code> </file_xfer_error> <file_xfer_error> <file_name>hadam3p_eu_w7rp_1996_1_006805421_1_8.zip</file_name> <error_code>-161</error_code> </file_xfer_error> <file_xfer_error> <file_name>hadam3p_eu_w7rp_1996_1_006805421_1_9.zip</file_name> <error_code>-161</error_code> </file_xfer_error> <file_xfer_error> <file_name>hadam3p_eu_w7rp_1996_1_006805421_1_10.zip</file_name> <error_code>-161</error_code> </file_xfer_error> <file_xfer_error> <file_name>hadam3p_eu_w7rp_1996_1_006805421_1_11.zip</file_name> <error_code>-161</error_code> </file_xfer_error> <file_xfer_error> <file_name>hadam3p_eu_w7rp_1996_1_006805421_1_12.zip</file_name> <error_code>-161</error_code> </file_xfer_error> </message> ]]> |
Latest Trickles Received | ||||||
---|---|---|---|---|---|---|
Time Sent (UTC) | Host ID | Result ID | Result Name | Timestep | CPU Time (sec) | Average (sec/TS) |
11 Mar 2012 04:40:29 | 1203717 | 14229485 | hadam3p_eu_w7rp_1996_1_006805421_1 | 57,696 | 102,297 | 1.7730 |
10 Mar 2012 13:57:46 | 1203717 | 14229485 | hadam3p_eu_w7rp_1996_1_006805421_1 | 46,176 | 75,288 | 1.6305 |
10 Mar 2012 10:54:18 | 1203717 | 14229485 | hadam3p_eu_w7rp_1996_1_006805421_1 | 34,656 | 61,213 | 1.7663 |
10 Mar 2012 10:54:18 | 1203717 | 14229485 | hadam3p_eu_w7rp_1996_1_006805421_1 | 23,136 | 42,068 | 1.8183 |
09 Mar 2012 13:47:56 | 1203717 | 14229485 | hadam3p_eu_w7rp_1996_1_006805421_1 | 11,616 | 16,059 | 1.3825 |
©2024 cpdn.org