climateprediction.net home page
Task 12285581

Task 12285581

Name hadam3p_pnw_zojb_1982_1_007000719_0
Workunit 7204035
Created 24 Nov 2010, 11:31:24 UTC
Sent 26 Jan 2011, 22:57:06 UTC
Report deadline 9 Jan 2012, 4:17:06 UTC
Received 23 Feb 2011, 14:50:51 UTC
Server state Over
Outcome Didn't need
Client state Compute error
Exit status 0 (0x00000000)
Computer ID 1064106
Run time 1 days 10 hours 19 min 59 sec
CPU time 1 days 1 hours 46 min 3 sec
Validate state Invalid
Credit 753.03
Device peak FLOPS 2.74 GFLOPS
Application version UK Met Office HadAM3P-HadRM3P Pacific North West v6.08
windows_intelx86
Stderr
<core_client_version>6.10.18</core_client_version>
<![CDATA[
<stderr_txt>
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
22:09:56 (258620): No heartbeat from core client for 30 sec - exiting
22:09:57 (258620): No heartbeat from core client for 30 sec - exiting
22:09:59 (258620): No heartbeat from core client for 30 sec - exiting
22:10:00 (258620): No heartbeat from core client for 30 sec - exiting
22:10:01 (258620): No heartbeat from core client for 30 sec - exiting
22:10:02 (258620): No heartbeat from core client for 30 sec - exiting
22:10:03 (258620): No heartbeat from core client for 30 sec - exiting
22:10:04 (258620): No heartbeat from core client for 30 sec - exiting
22:10:05 (258620): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
22:10:06 (258620): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=91952, selfPID=93940, iMonCtr=1
Model crash detected, will try to restart...
23:30:50 (5600): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=1148, selfPID=2848, iMonCtr=1
Model crash detected, will try to restart...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
14:46:14 (5296): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
14:46:24 (8040): Can't acquire lockfile (32) - waiting 35s
14:46:59 (8040): Can't acquire lockfile (32) - exiting
14:46:59 (8040): Error: The process cannot access the file because it is being used by another process. (0x20)
14:47:00 (9372): Can't acquire lockfile (32) - waiting 35s
14:47:35 (9372): Can't acquire lockfile (32) - exiting
14:47:35 (9372): Error: The process cannot access the file because it is being used by another process. (0x20)
14:47:35 (5500): Can't acquire lockfile (32) - waiting 35s
14:48:10 (5500): Can't acquire lockfile (32) - exiting
14:48:10 (5500): Error: The process cannot access the file because it is being used by another process. (0x20)
14:48:10 (8356): Can't acquire lockfile (32) - waiting 35s
CPDN Monitor - Quit request from BOINC...
Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=2496, selfPID=2496, iMonCtr=2
CPDN Monitor - Quit request from BOINC...
Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=4700, selfPID=4700, iMonCtr=2
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=4604, selfPID=4604, iMonCtr=2
CPDN Monitor - Quit request from BOINC...
Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=5292, selfPID=5292, iMonCtr=2
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=4456, selfPID=8172, iMonCtr=1
Model crash detected, will try to restart...
Leaving CPDN_Main::Monitor...
Regional yearly means requires 12 input files got 3
12:47:50 (8172): called boinc_finish

</stderr_txt>
<message>
<file_xfer_error>
  <file_name>hadam3p_pnw_zojb_1982_1_007000719_0_4.zip</file_name>
  <error_code>-161</error_code>
</file_xfer_error>
<file_xfer_error>
  <file_name>hadam3p_pnw_zojb_1982_1_007000719_0_5.zip</file_name>
  <error_code>-161</error_code>
</file_xfer_error>
<file_xfer_error>
  <file_name>hadam3p_pnw_zojb_1982_1_007000719_0_6.zip</file_name>
  <error_code>-161</error_code>
</file_xfer_error>
<file_xfer_error>
  <file_name>hadam3p_pnw_zojb_1982_1_007000719_0_7.zip</file_name>
  <error_code>-161</error_code>
</file_xfer_error>
<file_xfer_error>
  <file_name>hadam3p_pnw_zojb_1982_1_007000719_0_8.zip</file_name>
  <error_code>-161</error_code>
</file_xfer_error>
<file_xfer_error>
  <file_name>hadam3p_pnw_zojb_1982_1_007000719_0_9.zip</file_name>
  <error_code>-161</error_code>
</file_xfer_error>
<file_xfer_error>
  <file_name>hadam3p_pnw_zojb_1982_1_007000719_0_10.zip</file_name>
  <error_code>-161</error_code>
</file_xfer_error>
<file_xfer_error>
  <file_name>hadam3p_pnw_zojb_1982_1_007000719_0_11.zip</file_name>
  <error_code>-161</error_code>
</file_xfer_error>
<file_xfer_error>
  <file_name>hadam3p_pnw_zojb_1982_1_007000719_0_12.zip</file_name>
  <error_code>-161</error_code>
</file_xfer_error>

</message>
]]>
Latest Trickles Received
Time Sent (UTC) Host ID Result ID Result Name Timestep CPU Time (sec) Average (sec/TS)
18 Feb 2011 12:42:29 1064106 12285581 hadam3p_pnw_zojb_1982_1_007000719_0 34,656 90,198 2.6027
11 Feb 2011 12:56:45 1064106 12285581 hadam3p_pnw_zojb_1982_1_007000719_0 23,136 60,073 2.5965
10 Feb 2011 04:14:06 1064106 12285581 hadam3p_pnw_zojb_1982_1_007000719_0 11,616 30,178 2.5980


©2024 cpdn.org