climateprediction.net home page
Posts by Skip Da Shu

Posts by Skip Da Shu

11) Message boards : Number crunching : Where do all the errors come from? (Message 31571)
Posted 3 Dec 2007 by Profile Skip Da Shu
Post:
If the answer to Mike\'s question is positive, have you recently run stability checks on the machine? Unless I looked at the wrong entries, the machine had 10 Models in WinXP SP1, none of which were successful. (Yes, three show \'Success\' but they, too, failed; boinc entries must be taken with a grain of salt.)

24 hours of dual Prime-95 wouldn\'t hurt. Just to be sure.


The problem appears to be the IA32 libs. However, it appears I tested stable under WinXP SP1 with FSB set at 216 and default vcore. Then at some point unknown bumped the vcore a notch and the FSB to 218 and didn\'t run a full test (OCCT or Prime95). I haven\'t got Prime95 installed yet but backed the FSB down to 215 and left vcore up 1 notch just to be safe until I can find/install Prime95 or another stress tester. Thanx
12) Message boards : Number crunching : Where do all the errors come from? (Message 31570)
Posted 3 Dec 2007 by Profile Skip Da Shu
Post:

Did you install IA32 support in Ubuntu? (I gather that Ubuntu 64-bit does NOT support 32-bit apps by default).

I used a package manager and found an IA32 library noted as \'shared 32bit libs for AMD64 system\'. Installed that and it has resolved the code 22 on QMC and E@H WUs. Assuming it\'ll do the same for CPDN but have a couple more hours before I can get another one to verify.

Thank you VERY much.

UPDATE: A CPDN HadSM3 Slab WU has now running for about 5 minutes. :-)
13) Message boards : Number crunching : Where do all the errors come from? (Message 31563)
Posted 2 Dec 2007 by Profile Skip Da Shu
Post:
Well ya\'ll didn\'t make me feel all warm and fuzzy about solving my errors... let\'s take a run at it.

Today (after a few days of winding down WUs on the machine) I formated the HDD and installed Xubuntu v7.10 (64bit) on this machine. It\'s been running WinXP for some time with multiple projects on it. It\'s an AMD X2 4200+ with 2 x 256MB of PC4000 RAM. It\'s a dedicated number cruncher as is normally \"headless\".

I installed the v5 stdc++ libs (Gutsy comes with v6) required by several project apps (QMC, E&H, Lieden, WCG and perhaps CPDN). Use the package install to get the AMD64 version of BOINC v5.10.8 up and running as a daemon. I encountered these errors:

Sat 01 Dec 2007 06:24:50 PM CST|QMC@HOME|Reason: Unrecoverable error for result three_ad_anthracene.3996_0 (process exited with code 22 (0x16, -234))

Sat 01 Dec 2007 06:25:47 PM CST|climateprediction.net|Reason: Unrecoverable error for result hadsm3fub_0107_005913005_1 (process exited with code 22 (0x16, -234))

Sat 01 Dec 2007 06:25:51 PM CST|Einstein@Home|Reason: Unrecoverable error for result h1_0666.20_S5R2__265_S5R3a_1 (process exited with code 22 (0x16, -234))

Sat 01 Dec 2007 06:25:53 PM CST|World Community Grid|Reason: Unrecoverable error for result dddt0201k0629_ZINC06913243-0000_00_0 (process exited with code 22 (0x16, -234))


One thing that makes me think it\'s app dependent is that one of WCGs other apps runs fine.

Any thoughts?

PS: I see \"execv: No such file or directory\" in this returned result http://climateapps2.oucs.ox.ac.uk/cpdnboinc/result.php?resultid=7013096
14) Questions and Answers : Wish list : Merging old dead computers (Message 27807)
Posted 9 Apr 2007 by Profile Skip Da Shu
Post:
Deleting won\'t happen, because the workunits do not get purged from the database. The units crunched on your machine are still being used for science, so they can not remove it. This project is different from others that have a separate database for the units, CPDN does not. It keeps it on the online database.

They auto-hide, so just ignore them.


Look at 50038
15) Questions and Answers : Wish list : Merging old dead computers (Message 27805)
Posted 9 Apr 2007 by Profile Skip Da Shu
Post:
I have 17 computers listed that I no longer have and / or they no longer crunch. 11 (about to be 12) of these have not contacted the server in over 13 months. We should have the ability to delete them after this much time. If that\'s not possible will you set up an email address where we can send you machine numbers and you delete them from the database? -- Skip
16) Questions and Answers : Preferences : Unable to merge computers (Message 20068)
Posted 9 Feb 2006 by Profile Skip Da Shu
Post:
... I don\'t think that facility was created...

Is not host merging and deleting a stock part of BOINC? Does CPDN have/run no jobs that clean up / remove / purge old returned and/or old unreturned work from the ques/dBs? Is this what you are refering to as not being created?


But I\'m sure they know about the \'problem\'.


Do I detect a \'wee bit\' of sarcasm here? Not much of a \'problem\' when you only have 3 computers listed and only 1 that does anything... I\'d guess that it\'s really 1 computer (why will they not merge? They appear identical)


At the moment, they are busy with the science, which is what their funders want.


Ahhh the old \'science card\'. Well let me give you the stock reply also then... there is NO science without the crunchers.
17) Questions and Answers : Preferences : CPU speed + errors + continuous trickling (Message 20064)
Posted 9 Feb 2006 by Profile Skip Da Shu
Post:
If set to blank screen, there should be no calculations done. You could verify that by watching Task Manager when Blank screen kicks in.


Wait a minute here... how do you see the task manager after the screen goes blank??!

General comment on Memtest86+ & Prime95: I find memtest to be a good final memory test. If I can get it to run w/o erros I don\'t have memory errors with anything I run. In fact Test5 on AMD XPs (especially dual channel) will find things nothing else does. So, great tool and a prerequesit for any sort of stable OC\'ing set up.

However, I have some reservations about the absoluteness of Prime95. I personally believe CPDN to be a better FPU / CPU tester than Prime95. If P95 runs for an hour then the machine is ready to be tested with CPDN. Suspend all the other projects and if it runs overnight it\'s \'good to go\'.

I\'d be curious to see the General Prefs massic80 is using. Also has either machine been tweaked at all? I\'d better read on.
18) Questions and Answers : Preferences : Unable to merge computers (Message 20062)
Posted 9 Feb 2006 by Profile Skip Da Shu
Post:
Sorry, no way around it of which I\'m aware.

In time, the machine count can become a meaningless indicator. Unfortunate, because it renders comparative stats questionable to meaningless.


So with a big stick in hand, who\'s arm do we need to twist to get some clean-up done?

I\'ve got old machines whose last activity was 2004. Machines with no id, no units, no ip, no nothing... a trash entry. Machines who finished all work in 2005 or have only past due w/u\'s from 2005... all forms of trash. I\'ve taken to hidding them as it\'s such a mess.

Who do we need to write to get some action in this area?

19) Message boards : Number crunching : To Completion Time (Message 16466)
Posted 6 Oct 2005 by Profile Skip Da Shu
Post:
... text deleted...
If you suspended the slab unit now and set no new work for CP. I would estimate that around 1400 of 3600 hours to 28 Feb would be taken completing the sulphur model. That\'s about 39% of your time rather than the set 25%. It will then catch up on work on other projects before giving CP more time to do the slab before its deadline.

If you continue doing the slab, CP will end up taking much more of the time available than 39%. So if you want to avoid this, and keep resource usage more in line with what you have set, it could be sensible to suspend the slab model for 5 months. Since you are only 9 hours into the slab, this is what I would recommend.
... text deleted

Good clear thinking! I\'m now upset with myself that I didn\'t think of suspending the slab and running the other one 1st. Duh. Thanx much Crandles.
20) Message boards : Number crunching : To Completion Time (Message 16445)
Posted 5 Oct 2005 by Profile Skip Da Shu
Post:
Are the estimated time to completion predictions for AMD based Windows machines close to reality for the sulfer cycle WUs?

One of my machines is working a slab WU (1.75% complete) with 9 hours crunched and an estimated remaining of about 334 hours. Since CPDN gets 25% of the machines time, that\'s 6 hours per day or about 55 days to complete. Well before the 9/2006 deadline. However, 55 days from now will put us into early December. Still no problem... but... I have a sulfer cycle 4.19 WU sitting in que behind it with a deadline of 2/28/2006 that has not started and has an estimated remaining of about 1543! If I did the math right, 1543 at 6 per day is around 257 days. And this thing will not start till December 1st! Unless someone tells me that the estimated completion time will run down way faster than the actual time crunched... I need to abort this WU before it starts.

For some reason when I upgraded from 4.45 to 4.72 I downloaded both of these WUs.
21) Message boards : Number crunching : Announcement: Database residual problem - misallocated WUs (Message 14285)
Posted 11 Jul 2005 by Profile Skip Da Shu
Post:
>
> Greetings from Andorra.
>
Josgre62,
Howdy from Austin, Texas. I want to see if I can persuade you to turning a small 5% or 10% of your CPU time to doing SETI for a week or two. We have a little cross team situation that you could make a big difference on ;-)

See <a href="http://www.boincsynergy.com/forums/viewtopic.php?p=11539#11539/href">this</a> thread on the BOINC Synergy forums for more info or you can get a hold of me at my <a href="http://home.austin.rr.com/skipsjunk/html/contact.html">website</a>. Thanx, Skip

<img src="http://www.boincsynergy.com/images/stats/comb-134.jpg">
<br>Click <a href="http://home.austin.rr.com/skipsjunk/html/boincdv.html">HERE</a> for the most current version of BoincDV.
22) Questions and Answers : Windows : Duplicate Computer Names/Merge hosts problem (Message 12369)
Posted 7 May 2005 by Profile Skip Da Shu
Post:
&gt; With the projects one and only programmer so busy with the science part, it is
&gt; unlikely that problems such as this will be even thought about anytime soon.
&gt;
&gt; The 'glitch' was a change in the way that BOINC checks the processors internal
&gt; type info and 'publishes' it. I think it was an attempt to fix an earlier
&gt; problem with extracting the data, and it has created another problem.

CPDN has an additional problem that didn't show up on the other projects. I changed CPUs in one of my machines and was able to merge it back with the old host on LHC (and I think on all the projects except CPDN, will confirm). It doesn't seem to want to merge "AuthenticAMD AMD Athlon(tm) XP 3200+" and "AuthenticAMD AMD Athlon(tm)" even though everything else is the same.
23) Questions and Answers : Windows : Best way to remove old machine from project.... (Message 12368)
Posted 7 May 2005 by Profile Skip Da Shu
Post:
Latest DEV version of BOINC (4.37) has option, by project, to set "No New Work". Otherwise you can also create a different profile (venue) and set it to 0 days between connects. Assign old machine to that profile and it'll get no further work.
24) Questions and Answers : Windows : Climate prediction trying to update by connecting to SETI server?! (Message 10710)
Posted 11 Mar 2005 by Profile Skip Da Shu
Post:
Exit BOINC and go to your BOINC folder/directory.
Open Windows Exporer and on the file Account_ClimatePrediction.net.xml right click.
Select Open with...
Select NotePad

Check and correct as needed these 3 lines:

http://climateprediction.net/
5xxxxxxxxxxxxxxxxxxxxxxxxxxxxxx0
climateprediction.net

I think what you have is cross-linked the projects... I've done it before but I don't know how I got there. You might also want to check the account....xml files for the other projects you are attached to.

Hope this solves it for you, Skip
25) Questions and Answers : Windows : alpha versions 4.2x require signed files (Message 10451)
Posted 6 Mar 2005 by Profile Skip Da Shu
Post:
&gt; There is a workaround for this... Check <a> href="http://setiweb.ssl.berkeley.edu/forum_thread.php?id=11887"&gt;this</a> out
&gt; for a full explaination.

Yea, but the link is to the "all my servers are down" site ;-)
26) Questions and Answers : Windows : alpha versions 4.2x require signed files (Message 10029)
Posted 26 Feb 2005 by Profile Skip Da Shu
Post:
&gt; &gt; Any chance CPDN will be correcting this soon?
&gt;
&gt; Yes. Where "soon" is an unknown period of time. They are a bit short staffed
&gt; at the moment, and are
&gt; concentrating on getting the Sulphur Cycle model finished. Which is in a few
&gt; weeks.
&gt;
&gt; BOINC v4.19 is it until you get a "message from the server". And 4.19 has
&gt; known problems with certain proxies.
&gt;

OK, I'll just suspend CPDN for awhile and check back later. Thanx.
27) Questions and Answers : Windows : alpha versions 4.2x require signed files (Message 10012)
Posted 25 Feb 2005 by Profile Skip Da Shu
Post:
I can not get a new work unit on two of my machines because the input files are not signed as required by all versions of BOINC &gt; 4.19.

Any chance CPDN will be correcting this soon?

Thanx, Skip

PS: See LHC messages for lots of details
http://lhcathome.cern.ch/forum_thread.php?id=1141
28) Questions and Answers : Windows : BOINC icon not showing in systray (sometimes) (Message 10011)
Posted 25 Feb 2005 by Profile Skip Da Shu
Post:
Instead of rebooting you can usually just logoff and logon and your systray icons will all show up... usually ;-)

29) Message boards : Number crunching : Bad for CPU to run 100%, 24/7 @ 62 degrees? (Message 10009)
Posted 25 Feb 2005 by Profile Skip Da Shu
Post:
&gt; &gt; Could I please add my name to Graham's question - how do all of you
&gt; measure
&gt; &gt; the temperature? Should everyone running cpdn 24/7 be doing this,
&gt; whatever it
&gt; &gt; is?
&gt; &gt;
&gt; &gt; I just occasionally feel the temp of the air coming out of the back with
&gt; my
&gt; &gt; hand. I clean inside and out from time to time with a mini paint brush
&gt; and
&gt; &gt; vacuum cleaner.
&gt; &gt;
&gt; &gt;
&gt; &gt;
&gt; Try SpeedFan, it's what I use.
&gt; http://www.almico.com/speedfan.php

Or MotherBoardMonitor, MBM, last version I got was MBM5370.exe
It will monitor fans speeds, temps, etc. etc. with various options for display and alarms and the ability to adjust how often it polls for the data. On my "warmer" machines I run it from startup as a systray icon and let it keep logs.

Skip
30) Questions and Answers : Getting started : Deleting a host (Message 4802)
Posted 29 Sep 2004 by Profile Skip Da Shu
Post:
I have retired host #19182 as it was a 199Mhz P1 w/ 64Mb running Win98. It shows TC = 0, RAC = 0 and "No trickles!" but when I try to delete it it says "error: existing results". Can ya'll delete it for me or do something so I can delete it?

Thanx, Skip Da Shu
<img>


Previous 20 · Next 20

©2024 climateprediction.net