climateprediction.net home page
Task 12304612

Task 12304612

Name hadam3p_saf_1xj0_1971_1_007019284_0
Workunit 7222600
Created 24 Nov 2010, 15:51:06 UTC
Sent 15 Jan 2011, 15:22:12 UTC
Report deadline 28 Dec 2011, 20:42:12 UTC
Received 13 Feb 2011, 13:23:42 UTC
Server state Over
Outcome No reply
Client state Compute error
Exit status 0 (0x00000000)
Computer ID 1101865
Run time 1 days 7 hours 10 min 37 sec
CPU time 23 hours 56 min 41 sec
Validate state Invalid
Credit 188.44
Device peak FLOPS 1.74 GFLOPS
Application version UK Met Office HadAM3P-HadRM3P Southern Africa v6.08
windows_intelx86
Stderr
<core_client_version>6.10.56</core_client_version>
<![CDATA[
<stderr_txt>
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=11492, selfPID=11492, iMonCtr=2
CPDN Monitor - Quit request from BOINC...
Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=14420, selfPID=14420, iMonCtr=2
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=4232, selfPID=4232, iMonCtr=2
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4608, iMonCtr=2
Model crash detected, will try to restart...
CPDN Monitor - Quit request from BOINC...
No Process Handle
Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=6688, selfPID=6688, iMonCtr=2
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
13:28:25 (6692): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
14:24:12 (7716): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
14:24:13 (7716): No heartbeat from core client for 30 sec - exiting
14:24:14 (7716): No heartbeat from core client for 30 sec - exiting
14:24:15 (7716): No heartbeat from core client for 30 sec - exiting
14:24:16 (7716): No heartbeat from core client for 30 sec - exiting
14:24:17 (7716): No heartbeat from core client for 30 sec - exiting
14:24:18 (7716): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - Quit request from BOINC...
15:11:42 (7580): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
15:11:43 (7580): No heartbeat from core client for 30 sec - exiting
15:11:44 (7580): No heartbeat from core client for 30 sec - exiting
15:11:45 (7580): No heartbeat from core client for 30 sec - exiting
15:11:46 (7580): No heartbeat from core client for 30 sec - exiting
15:11:47 (7580): No heartbeat from core client for 30 sec - exiting
15:11:48 (7580): No heartbeat from core client for 30 sec - exiting
15:11:49 (7580): No heartbeat from core client for 30 sec - exiting
15:11:50 (7580): No heartbeat from core client for 30 sec - exiting
15:11:51 (7580): No heartbeat from core client for 30 sec - exiting
15:11:52 (7580): No heartbeat from core client for 30 sec - exiting
15:11:53 (7580): No heartbeat from core client for 30 sec - exiting
15:11:54 (7580): No heartbeat from core client for 30 sec - exiting
15:11:55 (7580): No heartbeat from core client for 30 sec - exiting
15:11:56 (7580): No heartbeat from core client for 30 sec - exiting
15:11:57 (7580): No heartbeat from core client for 30 sec - exiting
15:11:59 (7580): No heartbeat from core client for 30 sec - exiting
15:12:00 (7580): No heartbeat from core client for 30 sec - exiting
Global Worker:: CPDN process is not running, exiting, bRetVal = 0, checkPID=0, selfPID=0, iMonCtr=0
CPDN Monitor - Quit request from BOINC...
No Process Handle
Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=3940, selfPID=3940, iMonCtr=2
CPDN Monitor - Quit request from BOINC...
Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=5964, selfPID=5964, iMonCtr=2
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=7104, selfPID=7104, iMonCtr=2
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=10112, iMonCtr=2
Model crash detected, will try to restart...
CPDN Monitor - Quit request from BOINC...
Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=3088, selfPID=3088, iMonCtr=2
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=14268, selfPID=14268, iMonCtr=2
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=6480, selfPID=6480, iMonCtr=2
CPDN Monitor - Quit request from BOINC...
Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=6212, selfPID=6212, iMonCtr=2
17:36:26 (4756): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
CPDN Monitor - Quit request from BOINC...
Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPICPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=2580, selfPID=2580, iMonCtr=2
21:56:18 (6496): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
21:56:19 (6496): No heartbeat from core client for 30 sec - exiting
21:56:20 (6496): No heartbeat from core client for 30 sec - exiting
21:56:21 (6496): No heartbeat from core client for 30 sec - exiting
21:56:22 (6496): No heartbeat from core client for 30 sec - exiting
21:56:23 (6496): No heartbeat from core client for 30 sec - exiting
21:56:24 (6496): No heartbeat from core client for 30 sec - exiting
21:56:25 (6496): No heartbeat from core client for 30 sec - exiting
21:56:26 (6496): No heartbeat from core client for 30 sec - exiting
21:56:27 (6496): No heartbeat from core client for 30 sec - exiting
Global Worker:: CPDN process is not running, exiting, bRetVal = 0, checkPID=0, selfPID=0, iMonCtr=0
Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=512, selfPID=512, iMonCtr=2
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=512, selfPID=5152, iMonCtr=1
Model crash detected, will try to restart...

</stderr_txt>
<message>
<file_xfer_error>
  <file_name>hadam3p_saf_1xj0_1971_1_007019284_0_2.zip</file_name>
  <error_code>-161</error_code>
</file_xfer_error>
<file_xfer_error>
  <file_name>hadam3p_saf_1xj0_1971_1_007019284_0_3.zip</file_name>
  <error_code>-161</error_code>
</file_xfer_error>
<file_xfer_error>
  <file_name>hadam3p_saf_1xj0_1971_1_007019284_0_4.zip</file_name>
  <error_code>-161</error_code>
</file_xfer_error>
<file_xfer_error>
  <file_name>hadam3p_saf_1xj0_1971_1_007019284_0_5.zip</file_name>
  <error_code>-161</error_code>
</file_xfer_error>
<file_xfer_error>
  <file_name>hadam3p_saf_1xj0_1971_1_007019284_0_6.zip</file_name>
  <error_code>-161</error_code>
</file_xfer_error>
<file_xfer_error>
  <file_name>hadam3p_saf_1xj0_1971_1_007019284_0_7.zip</file_name>
  <error_code>-161</error_code>
</file_xfer_error>
<file_xfer_error>
  <file_name>hadam3p_saf_1xj0_1971_1_007019284_0_8.zip</file_name>
  <error_code>-161</error_code>
</file_xfer_error>
<file_xfer_error>
  <file_name>hadam3p_saf_1xj0_1971_1_007019284_0_9.zip</file_name>
  <error_code>-161</error_code>
</file_xfer_error>
<file_xfer_error>
  <file_name>hadam3p_saf_1xj0_1971_1_007019284_0_10.zip</file_name>
  <error_code>-161</error_code>
</file_xfer_error>
<file_xfer_error>
  <file_name>hadam3p_saf_1xj0_1971_1_007019284_0_11.zip</file_name>
  <error_code>-161</error_code>
</file_xfer_error>
<file_xfer_error>
  <file_name>hadam3p_saf_1xj0_1971_1_007019284_0_12.zip</file_name>
  <error_code>-161</error_code>
</file_xfer_error>

</message>
]]>
Latest Trickles Received
Time Sent (UTC) Host ID Result ID Result Name Timestep CPU Time (sec) Average (sec/TS)
06 Feb 2011 16:44:54 1101865 12304612 hadam3p_saf_1xj0_1971_1_007019284_0 11,616 66,993 5.7673


©2024 cpdn.org