Name | hadam3p_eu_r7w6_2013_1_008755116_1 |
Workunit | 8901094 |
Created | 2 Jun 2014, 21:48:11 UTC |
Sent | 2 Jun 2014, 21:48:13 UTC |
Report deadline | 16 May 2015, 3:08:13 UTC |
Received | 4 Jun 2014, 2:08:17 UTC |
Server state | Over |
Outcome | Computation error |
Client state | Compute error |
Exit status | 0 (0x00000000) |
Computer ID | 1174283 |
Run time | 1 days 3 hours 56 min 24 sec |
CPU time | 1 days 1 hours 18 min 31 sec |
Validate state | Invalid |
Credit | 597.84 |
Device peak FLOPS | 3.29 GFLOPS |
Application version | UK Met Office HadAM3P-HadRM3P Europe v6.09 windows_intelx86 |
Stderr | <core_client_version>6.12.34</core_client_version> <![CDATA[ <stderr_txt> 02:48:09 (21568): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 02:48:24 (21568): No heartbeat from core client for 30 sec - exiting 02:48:25 (21568): No heartbeat from core client for 30 sec - exiting 02:48:26 (21568): No heartbeat from core client for 30 sec - exiting 02:48:27 (21568): No heartbeat from core client for 30 sec - exiting 02:48:28 (21568): No heartbeat from core client for 30 sec - exiting 02:48:29 (21568): No heartbeat from core client for 30 sec - exiting 02:48:30 (21568): No heartbeat from core client for 30 sec - exiting 02:48:31 (21568): No heartbeat from core client for 30 sec - exiting 02:48:32 (21568): No heartbeat from core client for 30 sec - exiting 02:48:33 (21568): No heartbeat from core client for 30 sec - exiting 02:48:34 (21568): No heartbeat from core client for 30 sec - exiting 02:48:35 (21568): No heartbeat from core client for 30 sec - exiting 02:48:36 (21568): No heartbeat from core client for 30 sec - exiting 02:48:37 (21568): No heartbeat from core client for 30 sec - exiting 02:48:38 (21568): No heartbeat from core client for 30 sec - exiting 02:48:39 (21568): No heartbeat from core client for 30 sec - exiting 02:48:40 (21568): No heartbeat from core client for 30 sec - exiting 02:48:41 (21568): No heartbeat from core client for 30 sec - exiting 02:48:42 (21568): No heartbeat from core client for 30 sec - exiting 02:48:43 (21568): No heartbeat from core client for 30 sec - exiting 02:48:44 (21568): No heartbeat from core client for 30 sec - exiting 02:48:45 (21568): No heartbeat from core client for 30 sec - exiting 02:48:46 (21568): No heartbeat from core client for 30 sec - exiting 02:48:47 (21568): No heartbeat from core client for 30 sec - exiting 02:48:48 (21568): No heartbeat from core client for 30 sec - exiting 02:48:49 (21568): No heartbeat from core client for 30 sec - exiting 02:48:50 (21568): No heartbeat from core client for 30 sec - exiting 02:48:51 (21568): No heartbeat from core client for 30 sec - exiting 02:48:52 (21568): No heartbeat from core client for 30 sec - exiting 02:48:53 (21568): No heartbeat from core client for 30 sec - exiting 02:48:54 (21568): No heartbeat from core client for 30 sec - exiting 02:48:55 (21568): No heartbeat from core client for 30 sec - exiting 02:48:56 (21568): No heartbeat from core client for 30 sec - exiting 02:48:57 (21568): No heartbeat from core client for 30 sec - exiting 02:48:58 (21568): No heartbeat from core client for 30 sec - exiting 02:48:59 (21568): No heartbeat from core client for 30 sec - exiting 02:49:00 (21568): No heartbeat from core client for 30 sec - exiting 02:49:01 (21568): No heartbeat from core client for 30 sec - exiting 02:49:02 (21568): No heartbeat from core client for 30 sec - exiting 02:49:03 (21568): No heartbeat from core client for 30 sec - exiting 02:49:04 (21568): No heartbeat from core client for 30 sec - exiting 02:49:05 (21568): No heartbeat from core client for 30 sec - exiting 02:49:06 (21568): No heartbeat from core client for 30 sec - exiting 02:49:07 (21568): No heartbeat from core client for 30 sec - exiting 02:49:08 (21568): No heartbeat from core client for 30 sec - exiting 02:49:09 (21568): No heartbeat from core client for 30 sec - exiting 02:49:10 (21568): No heartbeat from core client for 30 sec - exiting 02:49:11 (21568): No heartbeat from core client for 30 sec - exiting 02:49:12 (21568): No heartbeat from core client for 30 sec - exiting 02:49:13 (21568): No heartbeat from core client for 30 sec - exiting 02:49:14 (21568): No heartbeat from core client for 30 sec - exiting 02:49:15 (21568): No heartbeat from core client for 30 sec - exiting 02:49:16 (21568): No heartbeat from core client for 30 sec - exiting 02:49:17 (21568): No heartbeat from core client for 30 sec - exiting 02:49:18 (21568): No heartbeat from core client for 30 sec - exiting 02:49:19 (21568): No heartbeat from core client for 30 sec - exiting 02:49:20 (21568): No heartbeat from core client for 30 sec - exiting 02:49:21 (21568): No heartbeat from core client for 30 sec - exiting 02:49:22 (21568): No heartbeat from core client for 30 sec - exiting 02:49:23 (21568): No heartbeat from core client for 30 sec - exiting 02:49:24 (21568): No heartbeat from core client for 30 sec - exiting 02:49:25 (21568): No heartbeat from core client for 30 sec - exiting 02:49:26 (21568): No heartbeat from core client for 30 sec - exiting 02:49:27 (21568): No heartbeat from core client for 30 sec - exiting 02:49:28 (21568): No heartbeat from core client for 30 sec - exiting 02:49:29 (21568): No heartbeat from core client for 30 sec - exiting 02:49:30 (21568): No heartbeat from core client for 30 sec - exiting 02:49:31 (21568): No heartbeat from core client for 30 sec - exiting 02:49:32 (21568): No heartbeat from core client for 30 sec - exiting 02:49:33 (21568): No heartbeat from core client for 30 sec - exiting 02:49:34 (21568): No heartbeat from core client for 30 sec - exiting Model crashed: READHIST: End of file in READ from history file for namelist NLIHISTO tmp/xaakm.pipe_dummy 2048 Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=24228, selfPID=24228, iMonCtr=2 Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=24228, selfPID=8032, iMonCtr=1 Model crash detected, will try to restart... Leaving CPDN_Main::Monitor... Called boinc_finish </stderr_txt> <message> upload failure: <file_xfer_error> <file_name>hadam3p_eu_r7w6_2013_1_008755116_1_4.zip</file_name> <error_code>-161</error_code> </file_xfer_error> <file_xfer_error> <file_name>hadam3p_eu_r7w6_2013_1_008755116_1_5.zip</file_name> <error_code>-161</error_code> </file_xfer_error> <file_xfer_error> <file_name>hadam3p_eu_r7w6_2013_1_008755116_1_6.zip</file_name> <error_code>-161</error_code> </file_xfer_error> <file_xfer_error> <file_name>hadam3p_eu_r7w6_2013_1_008755116_1_7.zip</file_name> <error_code>-161</error_code> </file_xfer_error> <file_xfer_error> <file_name>hadam3p_eu_r7w6_2013_1_008755116_1_8.zip</file_name> <error_code>-161</error_code> </file_xfer_error> <file_xfer_error> <file_name>hadam3p_eu_r7w6_2013_1_008755116_1_9.zip</file_name> <error_code>-161</error_code> </file_xfer_error> <file_xfer_error> <file_name>hadam3p_eu_r7w6_2013_1_008755116_1_10.zip</file_name> <error_code>-161</error_code> </file_xfer_error> <file_xfer_error> <file_name>hadam3p_eu_r7w6_2013_1_008755116_1_11.zip</file_name> <error_code>-161</error_code> </file_xfer_error> <file_xfer_error> <file_name>hadam3p_eu_r7w6_2013_1_008755116_1_12.zip</file_name> <error_code>-161</error_code> </file_xfer_error> </message> ]]> |
Latest Trickles Received | ||||||
---|---|---|---|---|---|---|
Time Sent (UTC) | Host ID | Result ID | Result Name | Timestep | CPU Time (sec) | Average (sec/TS) |
03 Jun 2014 22:09:05 | 1174283 | 16659479 | hadam3p_eu_r7w6_2013_1_008755116_1 | 34,656 | 77,173 | 2.2268 |
03 Jun 2014 13:27:35 | 1174283 | 16659479 | hadam3p_eu_r7w6_2013_1_008755116_1 | 23,136 | 51,300 | 2.2173 |
03 Jun 2014 06:00:58 | 1174283 | 16659479 | hadam3p_eu_r7w6_2013_1_008755116_1 | 11,616 | 25,390 | 2.1858 |
©2024 climateprediction.net