climateprediction.net home page
Task 11687281

Name hadam3p_saf_v8dk_1967_1_006684486_0
Workunit 6887739
Created 26 Aug 2010, 12:11:03 UTC
Sent 29 Aug 2010, 15:43:20 UTC
Report deadline 11 Aug 2011, 21:03:20 UTC
Received 5 Jan 2011, 18:25:55 UTC
Server state Over
Outcome Computation error
Client state Compute error
Exit status 0 (0x00000000)
Computer ID 1072771
Run time 3 days 22 hours 11 min 11 sec
CPU time 4 days 0 hours 56 min 37 sec
Validate state Invalid
Credit 1,870.33
Device peak FLOPS 2.60 GFLOPS
Application version UK Met Office HadAM3P-HadRM3P Southern Africa v6.05 windows_intelx86
Stderr
<core_client_version>6.10.18</core_client_version>
<![CDATA[
<stderr_txt>
Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4600, iMonCtr=2
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=4624, selfPID=4412, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=164, selfPID=968, iMonCtr=1
Model crash detected, will try to restart...
Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4236, iMonCtr=2
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=4148, selfPID=3988, iMonCtr=1
Model crash detected, will try to restart...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=5632, selfPID=5632, iMonCtr=2
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=4688, selfPID=3704, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=4272, selfPID=1268, iMonCtr=1
Model crash detected, will try to restart...
Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4540, iMonCtr=2
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4224, iMonCtr=2
Model crash detected, will try to restart...
CPDN Monitor - Quit request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=4296, selfPID=3840, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=4580, selfPID=3604, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=4484, selfPID=3956, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4812, iMonCtr=2
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=4492, selfPID=1004, iMonCtr=1
Model crash detected, will try to restart...
Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4100, iMonCtr=2
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=4704, selfPID=3204, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=4936, selfPID=4760, iMonCtr=1
Model crash detected, will try to restart...
11:11:11 (324): Can't acquire lockfile (32) - waiting 35s
11:11:28 (2308): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=4592, selfPID=4592, iMonCtr=2
CPDN Monitor - Quit request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=4124, selfPID=1788, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3688, iMonCtr=2
Model crash detected, will try to restart...
GController:: CPDN process is not running, exiting, bRetVal = 1, checkPID=4896, selfPID=4756, iMonCtr=1
Model crash detected, will try to restart...
Leaving CPDN_Main::Monitor...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=4860, selfPID=3328, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=4892, selfPID=4248, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=4496, selfPID=3400, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=4504, selfPID=3792, iMonCtr=1
Model crash detected, will try to restart...
Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4316, iMonCtr=2
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=3964, selfPID=3880, iMonCtr=1
Model crash detected, will try to restart...
Leaving CPDN_Main::Monitor...
Global Worker:: CPDNController:: CPDN process is not running, exiting, bRetVal = 1, checkPID=5044, selfPID=3288, iMonCtr=1
Model crash detected, will try to restart...
Leaving CPDN_Main::Monitor...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=4392, selfPID=2320, iMonCtr=1
Model crash detected, will try to restart...
Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4948, iMonCtr=2
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=4980, selfPID=4832, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=3292, selfPID=4044, iMonCtr=1
Model crash detected, will try to restart...
Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4700, iMonCtr=2
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=4728, selfPID=2124, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=3904, selfPID=2588, iMonCtr=1
Model crash detected, will try to restart...
Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5148, iMonCtr=2
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=5404, selfPID=5088, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=4296, selfPID=2492, iMonCtr=1
Model crash detected, will try to restart...
Leaving CPDN_Main::Monitor...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=4420, selfPID=3864, iMonCtr=1
Model crash detected, will try to restart...
Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4516, iMonCtr=2
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=4616, selfPID=4320, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=4448, selfPID=3912, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=5128, selfPID=3860, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=4460, selfPID=3688, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=4272, selfPID=3072, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3204, iMonCtr=2
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=4508, selfPID=2300, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=4888, selfPID=4016, iMonCtr=1
Model crash detected, will try to restart...
Leaving CPDN_Main::Monitor...
Colobal Worker:: CPDn process is Pnot running, exitinunning, exiting, bRetVal = 1, checkPID=0,, iMoPnCt4r=2, 
iMonCtr=2
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=4924, selfPID=644, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=4392, selfPID=964, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=4996, selfPID=3708, iMonCtr=1
Model crash detected, will try to restart...
Leaving CPDN_Main::Monitor...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=4640, selfPID=3412, iMonCtr=1
Model crash detected, will try to restart...
Leaving CPDN_Main::Monitor...
GController:: CPDN process is not running, exiting, bRetVal = 1, checkPID=4684, selfPID=3564, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=4212, selfPID=3804, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=4664, selfPID=3776, iMonCtr=1
Model crash detected, will try to restart...
CPDN Monitor - Quit request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=2388, iMonCtr=2
Model crash detected, will try to restart...
Leaving CPDN_Main::Monitor...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=5072, selfPID=4112, iMonCtr=1
Model crash detected, will try to restart...
Leaving CPDN_Main::Monitor...
19:19:06 (4112): called boinc_finish

</stderr_txt>
<message>
<file_xfer_error>
  <file_name>hadam3p_saf_v8dk_1967_1_006684486_0_11.zip</file_name>
  <error_code>-161</error_code>
</file_xfer_error>
<file_xfer_error>
  <file_name>hadam3p_saf_v8dk_1967_1_006684486_0_12.zip</file_name>
  <error_code>-161</error_code>
</file_xfer_error>

</message>
]]>
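The stderr output above repeats a small set of message types (worker or controller exits, restart attempts, quit requests from BOINC). A minimal sketch of tallying them follows. The `tally` function, its category labels, and the `SAMPLE` excerpt are illustrative assumptions and not part of any CPDN or BOINC tooling; lines that arrived interleaved or garbled in the log are simply left uncounted.

```python
import re
from collections import Counter

# Short excerpt copied from the stderr output above (not the full log).
SAMPLE = """\
Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4600, iMonCtr=2
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=4624, selfPID=4412, iMonCtr=1
Model crash detected, will try to restart...
CPDN Monitor - Quit request from BOINC...
"""

# Matches a cleanly written "process is not running" line and captures its source.
EXIT_RE = re.compile(
    r"^(Global Worker|Regional Worker|Controller):: CPDN process is not running"
)


def tally(text):
    """Count message types in a stderr dump (hypothetical helper)."""
    counts = Counter()
    for line in text.splitlines():
        match = EXIT_RE.match(line)
        if match:
            counts[match.group(1) + " exit"] += 1
        elif line.startswith("Model crash detected"):
            counts["restart attempt"] += 1
        elif line.startswith("CPDN Monitor - Quit request"):
            counts["quit request"] += 1
    return counts


if __name__ == "__main__":
    for label, n in sorted(tally(SAMPLE).items()):
        print(label, n)
```

Running `tally` on the full stderr text instead of `SAMPLE` would give a rough picture of how often the model was restarted over the life of the task.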
Latest Trickles Received
Time Sent (UTC)       Host ID  Result ID  Result Name                          Timestep  CPU Time (sec)  Average (sec/TS)
01 Jan 2011 11:58:33  1072771  11687281   hadam3p_saf_v8dk_1967_1_006684486_0   115,296         320,136            2.7766
01 Jan 2011 08:39:06  1072771  11687281   hadam3p_saf_v8dk_1967_1_006684486_0   103,776         286,030            2.7562
24 Dec 2010 16:42:45  1072771  11687281   hadam3p_saf_v8dk_1967_1_006684486_0    92,256         254,237            2.7558
10 Dec 2010 18:11:59  1072771  11687281   hadam3p_saf_v8dk_1967_1_006684486_0    80,777         222,996            2.7606
08 Dec 2010 19:50:19  1072771  11687281   hadam3p_saf_v8dk_1967_1_006684486_0    80,754         222,395            2.7540
06 Dec 2010 19:39:53  1072771  11687281   hadam3p_saf_v8dk_1967_1_006684486_0    80,736         221,780            2.7470
26 Nov 2010 21:04:47  1072771  11687281   hadam3p_saf_v8dk_1967_1_006684486_0    69,216         189,271            2.7345
14 Nov 2010 11:36:51  1072771  11687281   hadam3p_saf_v8dk_1967_1_006684486_0    57,696         157,651            2.7324
17 Oct 2010 18:02:33  1072771  11687281   hadam3p_saf_v8dk_1967_1_006684486_0    46,176         125,868            2.7258
04 Oct 2010 18:55:04  1072771  11687281   hadam3p_saf_v8dk_1967_1_006684486_0    34,656          94,799            2.7354
23 Sep 2010 19:34:39  1072771  11687281   hadam3p_saf_v8dk_1967_1_006684486_0    23,136          63,974            2.7651
07 Sep 2010 19:09:40  1072771  11687281   hadam3p_saf_v8dk_1967_1_006684486_0    11,616          31,335            2.6976
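
The Average (sec/TS) column in the trickle table above appears to be the cumulative CPU time divided by the timestep count, rounded to four decimal places. The sketch below checks that reading against three rows copied from the table. The function name `avg_sec_per_timestep` and the `ROWS` list are illustrative assumptions, not part of the project's software.

```python
def avg_sec_per_timestep(cpu_time_sec, timestep):
    """Return cumulative CPU seconds per model timestep, rounded to 4 places."""
    return round(cpu_time_sec / timestep, 4)


# (timestep, CPU time in seconds, reported average) copied from the table.
ROWS = [
    (115296, 320136, 2.7766),
    (103776, 286030, 2.7562),
    (11616, 31335, 2.6976),
]

if __name__ == "__main__":
    for timestep, cpu, reported in ROWS:
        computed = avg_sec_per_timestep(cpu, timestep)
        print(timestep, computed, "matches" if computed == reported else "differs")
```

For the rows checked here the computed values agree with the reported ones, which is consistent with the average being computed over the whole run so far and not per trickle interval.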