climateprediction.net home page
Task 12320576

Name hadam3p_saf_213p_1996_1_007033517_0
Workunit 7236833
Created 25 Nov 2010, 10:09:56 UTC
Sent 2 Jan 2011, 8:23:31 UTC
Report deadline 15 Dec 2011, 13:43:31 UTC
Received 11 Jan 2011, 11:22:15 UTC
Server state Over
Outcome Didn't need
Client state Compute error
Exit status 0 (0x00000000)
Computer ID 1118275
Run time 4 days 15 hours 51 min 37 sec
CPU time 2 days 21 hours 32 min 26 sec
Validate state Invalid
Credit 1,496.58
Device peak FLOPS 2.29 GFLOPS
Application version UK Met Office HadAM3P-HadRM3P Southern Africa v6.08 windows_intelx86
Stderr
<core_client_version>6.10.58</core_client_version>
<![CDATA[
<stderr_txt>
CPDN Monitor - Quit request from BOINC...
Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=7624, selfPID=7624, iMonCtr=2
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=1032, iMonCtr=2
Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=6060, iMonCtr=2
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=4896, selfPID=1416, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=2888, selfPID=5524, iMonCtr=1
Model crash detected, will try to restart...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
18:54:42 (7356): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5084, iMonCtr=2
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=8104, iMonCtr=2
Model crash detected, will try to restart...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4812, iMonCtr=2
Suspended CPDN Monitor - Suspend request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=7624, selfPID=7624, iMonCtr=2
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5360, iMonCtr=2
Model crash detected, will try to restart...
Leaving CPDN_Main::Monitor...
CPDN Monitor - Quit request from BOINC...
15:55:32 (1440): Can't acquire lockfile (32) - waiting 35s
Regional Worker:: CPDN process is not running, exiting, bRetVal = 0, checkPID=0, selfPID=0, iMonCtr=0
Global Worker:: CPDN process is not running, exiting, bRetVal = 0, checkPID=0, selfPID=0, iMonCtr=0
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
Global Worker:: CPDN process is not running, exiting, bRetVal = 0, checkPID=0, selfPID=7312, iMonCtr=1
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=6172, selfPID=7712, iMonCtr=1
Model crash detected, will try to restart...
Leaving CPDN_Main::Monitor...
21:39:43 (7712): called boinc_finish

</stderr_txt>
<message>
<file_xfer_error>
  <file_name>hadam3p_saf_213p_1996_1_007033517_0_9.zip</file_name>
  <error_code>-161</error_code>
</file_xfer_error>
<file_xfer_error>
  <file_name>hadam3p_saf_213p_1996_1_007033517_0_10.zip</file_name>
  <error_code>-161</error_code>
</file_xfer_error>
<file_xfer_error>
  <file_name>hadam3p_saf_213p_1996_1_007033517_0_11.zip</file_name>
  <error_code>-161</error_code>
</file_xfer_error>
<file_xfer_error>
  <file_name>hadam3p_saf_213p_1996_1_007033517_0_12.zip</file_name>
  <error_code>-161</error_code>
</file_xfer_error>

</message>
]]>
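The `<message>` block above lists four output files that failed to transfer, all with error code -161. As a rough sketch, such a block could be pulled into (file name, error code) pairs with Python's standard XML parser. The `transfer_errors` helper and the `MESSAGE` sample (shortened to two entries) are hypothetical names for illustration. Code -161 is believed to correspond to `ERR_NOT_FOUND` in BOINC's error-number list, but that reading is an assumption and is not stated on this page.

```python
import xml.etree.ElementTree as ET

# Shortened copy of the <message> block from the task page (two of four entries).
MESSAGE = """<message>
<file_xfer_error>
  <file_name>hadam3p_saf_213p_1996_1_007033517_0_9.zip</file_name>
  <error_code>-161</error_code>
</file_xfer_error>
<file_xfer_error>
  <file_name>hadam3p_saf_213p_1996_1_007033517_0_10.zip</file_name>
  <error_code>-161</error_code>
</file_xfer_error>
</message>"""


def transfer_errors(message_xml):
    """Return a list of (file_name, error_code) tuples, one per file_xfer_error."""
    root = ET.fromstring(message_xml)
    return [
        (entry.findtext("file_name"), int(entry.findtext("error_code")))
        for entry in root.iter("file_xfer_error")
    ]


errors = transfer_errors(MESSAGE)
for name, code in errors:
    print(name, code)
```

The parser only works on the `<message>` element by itself; the enclosing stderr text is not well-formed XML and would need to be stripped first.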
Latest Trickles Received
Time Sent (UTC)       Host ID  Result ID  Result Name                          Timestep  CPU Time (sec)  Average (sec/TS)
11 Jan 2011 11:26:40  1118275  12320576   hadam3p_saf_213p_1996_1_007033517_0  92,256    240,103         2.6026
10 Jan 2011 12:53:02  1118275  12320576   hadam3p_saf_213p_1996_1_007033517_0  80,736    210,775         2.6107
08 Jan 2011 12:46:19  1118275  12320576   hadam3p_saf_213p_1996_1_007033517_0  69,216    181,662         2.6246
07 Jan 2011 14:24:02  1118275  12320576   hadam3p_saf_213p_1996_1_007033517_0  57,696    152,378         2.6410
05 Jan 2011 15:53:31  1118275  12320576   hadam3p_saf_213p_1996_1_007033517_0  46,176    120,816         2.6164
04 Jan 2011 13:15:33  1118275  12320576   hadam3p_saf_213p_1996_1_007033517_0  34,656    90,765          2.6190
03 Jan 2011 14:50:59  1118275  12320576   hadam3p_saf_213p_1996_1_007033517_0  23,136    59,683          2.5797
03 Jan 2011 02:15:25  1118275  12320576   hadam3p_saf_213p_1996_1_007033517_0  11,616    29,707          2.5574
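
The Average (sec/TS) column in the trickle list above appears to be the cumulative CPU time divided by the cumulative timestep count, rounded to four decimals. That relationship is inferred from the numbers, not documented on this page. A minimal sketch that recomputes it, with `TRICKLES` and `seconds_per_timestep` as hypothetical names:

```python
# (timestep, cpu_time_sec, reported_average) copied from the trickle rows above.
TRICKLES = [
    (92256, 240103, 2.6026),
    (80736, 210775, 2.6107),
    (69216, 181662, 2.6246),
    (57696, 152378, 2.6410),
    (46176, 120816, 2.6164),
    (34656, 90765, 2.6190),
    (23136, 59683, 2.5797),
    (11616, 29707, 2.5574),
]


def seconds_per_timestep(cpu_time_sec, timestep):
    """Cumulative CPU seconds per model timestep, rounded to 4 decimals."""
    return round(cpu_time_sec / timestep, 4)


for timestep, cpu_time, reported in TRICKLES:
    computed = seconds_per_timestep(cpu_time, timestep)
    print(f"{timestep:>6}  computed={computed:.4f}  reported={reported:.4f}")
```

Each trickle in this list arrives 11,520 timesteps after the previous one, so the average can be tracked at a fixed interval as the run progresses.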