Task 11965627

Name hadam3p_saf_v671_1973_1_006742565_0
Workunit 6945909
Created 4 Nov 2010, 0:10:31 UTC
Sent 5 Nov 2010, 18:12:57 UTC
Report deadline 18 Oct 2011, 23:32:57 UTC
Received 8 Nov 2010, 16:07:37 UTC
Server state Over
Outcome Computation error
Client state Compute error
Exit status 0 (0x00000000)
Computer ID 1112183
Run time 22 hours 42 min 42 sec
CPU time 21 hours 29 min 59 sec
Validate state Invalid
Credit 375.31
Device peak FLOPS 1.89 GFLOPS
Application version UK Met Office HadAM3P-HadRM3P Southern Africa v6.05 (windows_intelx86)
Stderr
<core_client_version>6.10.58</core_client_version>
<![CDATA[
<stderr_txt>
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=2136, selfPID=2136, iMonCtr=2
CPDN Monitor - Quit request from BOINC...
No Process Handle
Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=1824, selfPID=1824, iMonCtr=1
No Process Handle
Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=1824, selfPID=0, iMonCtr=1
CPDN Monitor - Quit request from BOINC...
Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=1604, selfPID=1604, iMonCtr=2
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=680, selfPID=680, iMonCtr=2
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=1648, selfPID=1648, iMonCtr=2
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=576, selfPID=576, iMonCtr=2
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=3508, selfPID=3508, iMonCtr=2
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=3680, selfPID=3680, iMonCtr=2
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=716, selfPID=716, iMonCtr=2
CPDN Monitor - Quit request from BOINC...
Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=4084, selfPID=4084, iMonCtr=2
CPDN Monitor - Quit request from BOINC...
12:03:46 (2436): Can't acquire lockfile (32) - waiting 35s
12:05:40 (996): Can't acquire lockfile (32) - waiting 35s
12:10:35 (3716): Can't acquire lockfile (32) - waiting 35s
12:18:27 (4004): Can't acquire lockfile (32) - waiting 35s
12:22:11 (2756): Can't acquire lockfile (32) - waiting 35s
CPDN Monitor - Quit request from BOINC...
12:38:39 (2252): Can't acquire lockfile (32) - waiting 35s
12:50:12 (2324): Can't acquire lockfile (32) - waiting 35s
12:05:14 (2760): No heartbeat from core client for 30 sec - exiting
12:05:16 (2760): No heartbeat from core client for 30 sec - exiting
12:05:17 (2760): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
12:05:18 (2760): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - Quit request from BOINC...
12:08:59 (404): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
CPDN Monitor - Quit request from BOINC...
12:16:44 (3096): No heartbeat from core client for 30 sec - exiting
12:16:45 (3096): No heartbeat from core client for 30 sec - exiting
12:16:47 (3096): No heartbeat from core client for 30 sec - exiting
12:16:48 (3096): No heartbeat from core client for 30 sec - exiting
12:16:49 (3096): No heartbeat from core client for 30 sec - exiting
12:16:50 (3096): No heartbeat from core client for 30 sec - exiting
12:16:51 (3096): No heartbeat from core client for 30 sec - exiting
12:16:52 (3096): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
CPDN Monitor - Quit request from BOINC...
13:13:27 (1132): Can't acquire lockfile (32) - waiting 35s
CPDN Monitor - Quit request from BOINC...
13:17:09 (1848): Can't acquire lockfile (32) - waiting 35s
CPDN Monitor - Quit request from BOINC...
12:24:39 (3280): No heartbeat from core client for 30 sec - exiting
12:24:40 (3280): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
12:24:42 (3280): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=3592, selfPID=3592, iMonCtr=2
CPDN Monitor - Quit request from BOINC...
14:23:57 (684): Can't acquire lockfile (32) - waiting 35s
14:27:37 (3292): Can't acquire lockfile (32) - waiting 35s
CPDN Monitor - Quit request from BOINC...
14:29:31 (3352): Can't acquire lockfile (32) - waiting 35s
CPDN Monitor - Quit request from BOINC...
14:52:01 (3512): Can't acquire lockfile (32) - waiting 35s
14:53:21 (2176): Can't acquire lockfile (32) - waiting 35s
14:55:37 (1648): Can't acquire lockfile (32) - waiting 35s
14:58:04 (2624): Can't acquire lockfile (32) - waiting 35s
14:58:47 (3588): Can't acquire lockfile (32) - waiting 35s
15:06:56 (3904): Can't acquire lockfile (32) - waiting 35s
CPDN Monitor - Quit request from BOINC...
Regional Worker:: CPDN process is not running, exiting, bRetVal = 0, checkPID=0, selfPID=0, iMonCtr=0
Global Worker:: CPDN process is not running, exiting, bRetVal = 0, checkPID=0, selfPID=0, iMonCtr=0
15:22:20 (3616): Can't acquire lockfile (32) - waiting 35s
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
15:27:45 (2672): Can't acquire lockfile (32) - waiting 35s
14:32:48 (2672): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
15:36:33 (428): Can't acquire lockfile (32) - waiting 35s
15:39:01 (3024): Can't acquire lockfile (32) - waiting 35s
14:43:55 (3796): No heartbeat from core client for 30 sec - exiting
14:43:57 (3796): No heartbeat from core client for 30 sec - exiting
14:43:59 (3796): No heartbeat from core client for 30 sec - exiting
14:44:00 (3796): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
14:47:49 (3996): No heartbeat from core client for 30 sec - exiting
14:47:50 (3996): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
14:47:52 (3996): No heartbeat from core client for 30 sec - exiting
14:51:37 (404): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=3096, selfPID=3096, iMonCtr=2
CPDN Monitor - Quit request from BOINC...
15:51:34 (2588): Can't acquire lockfile (32) - waiting 35s
CPDN Monitor - Quit request from BOINC...
15:03:11 (4000): No heartbeat from core client for 30 sec - exiting
15:03:12 (4000): No heartbeat from core client for 30 sec - exiting
15:03:15 (4000): No heartbeat from core client for 30 sec - exiting
15:03:16 (4000): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
15:03:18 (4000): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - Quit request from BOINC...
16:01:25 (3484): Can't acquire lockfile (32) - waiting 35s
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=3368, selfPID=3368, iMonCtr=2
CPDN Monitor - Quit request from BOINC...
Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=3228, selfPID=3228, iMonCtr=2
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=2844, selfPID=4048, iMonCtr=1
Model crash detected, will try to restart...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=2924, selfPID=2924, iMonCtr=2
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
Global Worker:: CPDN process is not running, exiting, bRetVal = 0, checkPID=0, selfPID=0, iMonCtr=1
Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=0, iMonCtr=2
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=2412, selfPID=1196, iMonCtr=1
Model crash detected, will try to restart...
Leaving CPDN_Main::Monitor...
16:36:48 (1196): called boinc_finish

</stderr_txt>
<message>
<file_xfer_error>
  <file_name>hadam3p_saf_v671_1973_1_006742565_0_3.zip</file_name>
  <error_code>-161</error_code>
</file_xfer_error>
<file_xfer_error>
  <file_name>hadam3p_saf_v671_1973_1_006742565_0_4.zip</file_name>
  <error_code>-161</error_code>
</file_xfer_error>
<file_xfer_error>
  <file_name>hadam3p_saf_v671_1973_1_006742565_0_5.zip</file_name>
  <error_code>-161</error_code>
</file_xfer_error>
<file_xfer_error>
  <file_name>hadam3p_saf_v671_1973_1_006742565_0_6.zip</file_name>
  <error_code>-161</error_code>
</file_xfer_error>
<file_xfer_error>
  <file_name>hadam3p_saf_v671_1973_1_006742565_0_7.zip</file_name>
  <error_code>-161</error_code>
</file_xfer_error>
<file_xfer_error>
  <file_name>hadam3p_saf_v671_1973_1_006742565_0_8.zip</file_name>
  <error_code>-161</error_code>
</file_xfer_error>
<file_xfer_error>
  <file_name>hadam3p_saf_v671_1973_1_006742565_0_9.zip</file_name>
  <error_code>-161</error_code>
</file_xfer_error>
<file_xfer_error>
  <file_name>hadam3p_saf_v671_1973_1_006742565_0_10.zip</file_name>
  <error_code>-161</error_code>
</file_xfer_error>
<file_xfer_error>
  <file_name>hadam3p_saf_v671_1973_1_006742565_0_11.zip</file_name>
  <error_code>-161</error_code>
</file_xfer_error>
<file_xfer_error>
  <file_name>hadam3p_saf_v671_1973_1_006742565_0_12.zip</file_name>
  <error_code>-161</error_code>
</file_xfer_error>

</message>
]]>
Latest Trickles Received
Time Sent (UTC)      | Host ID | Result ID | Result Name                         | Timestep | CPU Time (sec) | Average (sec/TS)
07 Nov 2010 22:40:48 | 1112183 | 11965627  | hadam3p_saf_v671_1973_1_006742565_0 | 23,136   | 74,346         | 3.2134
06 Nov 2010 06:40:41 | 1112183 | 11965627  | hadam3p_saf_v671_1973_1_006742565_0 | 11,616   | 37,886         | 3.2615

