climateprediction.net home page
Task 13462131

Task 13462131

Name hadam3p_eu_60wv_2005_1_007463196_2
Workunit 7660699
Created 5 Oct 2011, 7:36:30 UTC
Sent 5 Oct 2011, 7:45:26 UTC
Report deadline 16 Sep 2012, 13:05:26 UTC
Received 18 Oct 2011, 21:38:08 UTC
Server state Over
Outcome Computation error
Client state Compute error
Exit status 0 (0x00000000)
Computer ID 1161013
Run time 2 days 6 hours 35 min 45 sec
CPU time 2 days 3 hours 19 min 43 sec
Validate state Invalid
Credit 796.79
Device peak FLOPS 2.80 GFLOPS
Application version UK Met Office HadAM3P-HadRM3P Europe v6.09
windows_intelx86
Stderr
<core_client_version>6.10.58</core_client_version>
<![CDATA[
<stderr_txt>
Suspended CPDN Monitor - Suspend request from BOINC...
08:57:00 (6044): No heartbeat from core client for 30 sec - exiting
Suspended CPDN Monitor - No 'heartbeat' from BOINC...
08:57:01 (6044): No heartbeat from core client for 30 sec - exiting
08:57:02 (6044): No heartbeat from core client for 30 sec - exiting
08:57:04 (6044): No heartbeat from core client for 30 sec - exiting
08:57:05 (6044): No heartbeat from core client for 30 sec - exiting
08:57:06 (6044): No heartbeat from core client for 30 sec - exiting
08:57:07 (6044): No heartbeat from core client for 30 sec - exiting
08:57:08 (6044): No heartbeat from core client for 30 sec - exiting
08:57:09 (6044): No heartbeat from core client for 30 sec - exiting
08:57:10 (6044): No heartbeat from core client for 30 sec - exiting
08:57:11 (6044): No heartbeat from core client for 30 sec - exiting
08:57:12 (6044): No heartbeat from core client for 30 sec - exiting
08:57:13 (6044): No heartbeat from core client for 30 sec - exiting
08:57:14 (6044): No heartbeat from core client for 30 sec - exiting
08:57:16 (6044): No heartbeat from core client for 30 sec - exiting
08:57:17 (6044): No heartbeat from core client for 30 sec - exiting
08:57:18 (6044): No heartbeat from core client for 30 sec - exiting
08:57:19 (6044): No heartbeat from core client for 30 sec - exiting
08:57:20 (6044): No heartbeat from core client for 30 sec - exiting
08:57:21 (6044): No heartbeat from core client for 30 sec - exiting
08:57:22 (6044): No heartbeat from core client for 30 sec - exiting
Suspended CPDN Monitor - Suspend request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
16:30:51 (2384): Can't acquire lockfile (32) - waiting 35s
CPDN Monitor - Quit request from BOINC...
Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=3240, selfPID=3240, iMonCtr=2
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=6292, selfPID=6292, iMonCtr=2
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=2188, selfPID=2188, iMonCtr=2
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
18:36:12 (5056): Can't acquire lockfile (32) - waiting 35s
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=1128, selfPID=1128, iMonCtr=2
CPDN Monitor - Quit request from BOINC...
CNo Process Handle
Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=5836, selfPID=3284, iMonCtr=1
No Process Handle
Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=5836, selfPID=5836, iMonCtr=1
CPDN Monitor - Quit request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
10:33:09 (6936): Can't acquire lockfile (32) - waiting 35s
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
14:20:25 (7748): Can't acquire lockfile (32) - waiting 35s
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=6904, selfPID=6904, iMonCtr=2
CPDN Monitor - Quit request from BOINC...
17:59:52 (6836): Can't acquire lockfile (32) - waiting 35s
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=620, selfPID=4724, iMonCtr=1
Model crash detected, will try to restart...
Leaving CPDN_Main::Monitor...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=1060, selfPID=1060, iMonCtr=2
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=2892, selfPID=2892, iMonCtr=2
CPDN Monitor - Quit request from BOINC...
10:38:22 (5980): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=2248, selfPID=2248, iMonCtr=2
15:07:16 (7976): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
15:20:10 (6020): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
15:27:41 (2448): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
15:38:55 (8056): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=6796, selfPID=6796, iMonCtr=2
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...

Model crashed: 
Leaving CPDN_Main::Monitor...
Called boinc_finish

</stderr_txt>
<message>
<file_xfer_error>
  <file_name>hadam3p_eu_60wv_2005_1_007463196_2_5.zip</file_name>
  <error_code>-161</error_code>
</file_xfer_error>
<file_xfer_error>
  <file_name>hadam3p_eu_60wv_2005_1_007463196_2_6.zip</file_name>
  <error_code>-161</error_code>
</file_xfer_error>
<file_xfer_error>
  <file_name>hadam3p_eu_60wv_2005_1_007463196_2_7.zip</file_name>
  <error_code>-161</error_code>
</file_xfer_error>
<file_xfer_error>
  <file_name>hadam3p_eu_60wv_2005_1_007463196_2_8.zip</file_name>
  <error_code>-161</error_code>
</file_xfer_error>
<file_xfer_error>
  <file_name>hadam3p_eu_60wv_2005_1_007463196_2_9.zip</file_name>
  <error_code>-161</error_code>
</file_xfer_error>
<file_xfer_error>
  <file_name>hadam3p_eu_60wv_2005_1_007463196_2_10.zip</file_name>
  <error_code>-161</error_code>
</file_xfer_error>
<file_xfer_error>
  <file_name>hadam3p_eu_60wv_2005_1_007463196_2_11.zip</file_name>
  <error_code>-161</error_code>
</file_xfer_error>
<file_xfer_error>
  <file_name>hadam3p_eu_60wv_2005_1_007463196_2_12.zip</file_name>
  <error_code>-161</error_code>
</file_xfer_error>

</message>
]]>
Latest Trickles Received
Time Sent (UTC) Host ID Result ID Result Name Timestep CPU Time (sec) Average (sec/TS)
15 Oct 2011 21:26:53 1161013 13462131 hadam3p_eu_60wv_2005_1_007463196_2 46,189 145,935 3.1595
15 Oct 2011 17:30:50 1161013 13462131 hadam3p_eu_60wv_2005_1_007463196_2 46,181 145,422 3.1490
14 Oct 2011 17:55:52 1161013 13462131 hadam3p_eu_60wv_2005_1_007463196_2 46,176 144,912 3.1383
13 Oct 2011 11:22:57 1161013 13462131 hadam3p_eu_60wv_2005_1_007463196_2 34,656 109,750 3.1668
12 Oct 2011 09:51:10 1161013 13462131 hadam3p_eu_60wv_2005_1_007463196_2 23,136 78,769 3.4046
10 Oct 2011 18:29:22 1161013 13462131 hadam3p_eu_60wv_2005_1_007463196_2 11,651 39,008 3.3480
10 Oct 2011 18:29:22 1161013 13462131 hadam3p_eu_60wv_2005_1_007463196_2 11,640 38,373 3.2966
10 Oct 2011 17:26:16 1161013 13462131 hadam3p_eu_60wv_2005_1_007463196_2 11,629 37,800 3.2505
06 Oct 2011 08:22:51 1161013 13462131 hadam3p_eu_60wv_2005_1_007463196_2 11,616 37,278 3.2092


©2024 climateprediction.net