climateprediction.net home page
Posts by KeeperC

Posts by KeeperC

1) Message boards : Number crunching : Unofficial BOINC Wiki closing 2006-03-31 (Message 21035)
Posted 4 Mar 2006 by KeeperC
Post:
Paul,

I have no insight into the reasons you are quitting. What I can see is the contribution you have made to the benefit of all and your commitment to improving BOINC and the BOINC community and strengthening both.

Thanks for all your contributions.
2) Message boards : Number crunching : WUs constantly failing (Message 20098)
Posted 10 Feb 2006 by KeeperC
Post:
[url=http://climateapps2.oucs.ox.ac.uk/cpdnboinc/result.php?resultid=1612048]This[\\url]
result and [url=http://climateapps2.oucs.ox.ac.uk/cpdnboinc/result.php?resultid=1351239]this[\\url] one, both on the same machine, failed at exactly the same point. The machine cannot get past this point in sulphur, despite having many successful slab models to its credit. Two of my other machines have also failed on Sulphur, though less repeatably. I must say that I find this problem quite frustrating. I know the team is focused on the new experiments, but if this undiagnosed problem persists with the coupled model, it will begin to sap my (considerable) commitment to this project. :(

Edit: Sorry, can\'t remember how to put in links but you have the URLs at least.
3) Message boards : Number crunching : Sulphur units constantly failing (Message 19987)
Posted 5 Feb 2006 by KeeperC
Post:
I hate to be brusk, but are any of the 4.22 models completing?


I have had two machines having problems with 4.22 but two others not showing any signs of stress. So far, one completed 4.22 model and the next most complete nearing the end of phase 3.

I don\'t take any precautions - no back-ups, never stop before defrag nor before shutdown. Internet on always.

4) Message boards : Number crunching : Hypothetical? funding plan (Message 19765)
Posted 29 Jan 2006 by KeeperC
Post:

Lets see; if 10% of the 45000 participants pledge at least £10, and 10% of those pledge at least £50, and 10% of those pledge £100 then the total raised would be over £65,000.


Perhaps someone from the project could put this in context. What is the budget of the project? For how many years? How does it break-down into sub-projects, staff/overhead/hardware, etc?

Perhaps the team could also suggest possible costs (including full oncosts and overheads) of a new research student? A programmer? etc?
5) Message boards : Number crunching : WUs constantly failing (Message 19688)
Posted 27 Jan 2006 by KeeperC
Post:

One of my machines has crashed out three times over the last 10 days or so. In each case its -161. I won\'t have access to the machine again until the weekend, but I\'ll look in the yabsd file then.

This is due to a batch of bad WU\'s sent out previously.
Its been resolved. any new WU\'s you get will be ok.


I don\'t think this is the case. If you look at the machine (325133), you will see that the most recent crash was on a model issued on 16th Jan, long after the batch of bad WUs was resolved.
6) Message boards : Number crunching : WUs constantly failing (Message 19650)
Posted 25 Jan 2006 by KeeperC
Post:

One of my machines has crashed out three times over the last 10 days or so. In each case its -161. I won\'t have access to the machine again until the weekend, but I\'ll look in the yabsd file then.
7) Message boards : Number crunching : Results (Message 18537)
Posted 21 Dec 2005 by KeeperC
Post:
What has happenned to the results on your account, mine has vanished. All the trickles from stage two and those just from stage three gone. Has the moderator decided to stop them showing or is it a fault?


Same problem here. The main account page does not show any trickle information anymore (since Dec. 20th). It is very complicated now to check which computer reported a trickle.

PLEASE BRING THIS INFORMATION BACK!!!


Carl has temporarily taken this information of the main account page. The reason for this was that the db queries for trickle information from increasingly many users were overloading the servers. By taking the info off, he could keep the servers online.

Trickle info is still available from each individual result page but this does not give an overview.

If the team can optimize the queries concerned, they plan to restore trickle info to the main page, but this may be some time coming.
8) Message boards : Number crunching : Caught in apparent CP processing loop after 6hrs on WU (Message 17115)
Posted 10 Nov 2005 by KeeperC
Post:
CPDN\'s checkpoints every 144 timesteps (3 model days) and 4 minutes CPU time isn\'t going to be long enough to complete that unless your secs/TS is under 1.7 (which is extremely unlikely with a P4 2.8GHz).


In other words, CPDN saves its progress less frequently than every five minutes. If you exit before it does a save (checkpoint) you will lose all progress back to the previous checkpoint. If you do this every time CPDN runs, you will seem to be in a perpetual loop, reprocessing the same time period.

Two solutions: 1. As Geophi suggests, keep model in memory when inactive. This ensures progress is not lost even though no checkpoint has been saved to disk. 2. set a timeslice greater than required to guarantee at least one checkpoint per timeslice.

If you just do 2. you will always lose some work - back to the last checkpoint. So it is always worth doing 1 as well. I would recommend both.
9) Message boards : Number crunching : question on backing up (Message 16308)
Posted 28 Sep 2005 by KeeperC
Post:
Backup the whole BOINC folder. There are some BOINC files that log progress and you will crash if they get out of line with the application.

There is advice on an automated backup in the Wiki.


You need to stop processing before making the back-up to ensure that the log files Andrew mentions stay consistent.
10) Questions and Answers : Windows : BOINK Seems to crash (Message 16262)
Posted 26 Sep 2005 by KeeperC
Post:
(why doesn\'t un-install delete the BOINK directory and files (Yawn))


Upgrading BOINC from 4.19 to 4.45 required a full uninstall of the old version followed by new install of the new. Leaving the BOINC directory in place allowed users to upgrade midway through a model without losing their work.
11) Questions and Answers : Windows : Cannot get work units (Message 16188)
Posted 23 Sep 2005 by KeeperC
Post:
I think BOINC is insisting on downloading a sulphur model, and the problem is associated with these models. If you detatch and re-attach, you may just get a slab model instead, and there is no problem with these.

You may subsequently need to merge computers because this process will create the newly attached computer as a separate machine from the old one. Merging is easy from your account pages (my computers)
12) Questions and Answers : Windows : Got a computing error and after that I wasn\'t able to do anything, neither downloading nor computing work (Message 16187)
Posted 23 Sep 2005 by KeeperC
Post:
The problem only seems to occur with sulphur models. Since no model is currently running, try detatching from the project and attaching again. If you are assigned a slab model instead of a sulphur one, the download will go fine.

Do you know what caused your first model to crash, the one you thought was running finely?
13) Message boards : Number crunching : Sulphur Download? (Message 16157)
Posted 21 Sep 2005 by KeeperC
Post:
Carl said


there are only 5K sulphur experiments and it looks like they\'ve all been handed out! I may regen some more as well as other workunits that have 3 uploader URL\'s (BOINC will now \"round robin\" uploaders upon a failure if there is more than one uploader URL in a workunit -- AMEN! :)


Since then more have been generated but I don\'t know if they have all been handed out. Project stats indicate there are 5598 trickling hosts. This is slightly up on a couple of days ago, but I wouldn\'t want to draw strong conclusions from this.


My computer tried to download two sulphur models last night. Both errored out during the download of the zip file with \"file not found on server\". I emailed Carl detailed error messages today and his response was:

hmmm, looks like boinc deleted the workunit files but is still trying to send out results for that workunit. I\'ll turn off the workunits/results that are pointing to missing files.


14) Message boards : Number crunching : RAC (Or lack thereof) (Message 15993)
Posted 14 Sep 2005 by KeeperC
Post:

For comparison with all those HT Pentiums, my stock Athlon XP3000+ is currently doing 4.83 sec/ts

With the Slab model it ran around 2.98 sec/ts

That is a 62% slow down.
15) Message boards : Number crunching : Announcement: Database residual problem - misallocated WUs (Message 12717)
Posted 20 May 2005 by KeeperC
Post:

I received an "unsent" workunit to process - 2nf4_300144914_0, result ID 838087.
The machine in question is 18108.

I have now aborted and am runing normally with a new wu.
16) Message boards : Number crunching : New Work Request Before Phase Two (Message 12282)
Posted 4 May 2005 by KeeperC
Post:
My Linux client (4.19) with HADSM 4.13, downloaded its new model at the end of Phase 1, too. At first, I thought the current model must have crashed, but it hasn't. The new model is just sitting, waiting patiently for the current model to finish. I hope the transition at the end of phase 3, due in three days or so, will go smoothly.
17) Questions and Answers : Windows : Answers for heat related problems here. (Message 11906)
Posted 18 Apr 2005 by KeeperC
Post:
> Next I raised the laptop
> by 1/2 inch at the 4 corners. Here is the the surprise, not ONE DEGREE of
> running temperature change seen over the next hour of operation.

Bones,

The heat from my Dell laptop (D800, 1.6Ghz, 1MB) split the wooden top of my solid cherry desk. :( I bought a passive cooler to raise it up to keep the desk rather than the computer cool!
18) Message boards : Cafe CPDN : CPDN WOWmugs are available to buy! (Message 10875)
Posted 14 Mar 2005 by KeeperC
Post:
> I just got my CPDN mug (going away present from the CPDN crew - thanks!) ---
> it's pretty cool. It contained a CPDN pamphlet which I guess the OU shop pops
> in as a nice bonus/propaganda piece. It was very well packed (fitted
> styrofoam case -- oh dear is that a poor ecological choice though ;-) and
> survived the approx. 4000 mile trip from Oxford to Philadelphia just fine.
>
>

Mine just arrived today, too. Very nice!
19) Message boards : Cafe CPDN : CPDN WOWmugs are available to buy! (Message 10695)
Posted 11 Mar 2005 by KeeperC
Post:
> My mugs have just arrived - I'm drinking from one now.
>

Oh goodie - perhaps mine will arrive tonight!
20) Message boards : Cafe CPDN : CPDN WOWmugs are available to buy! (Message 10694)
Posted 11 Mar 2005 by KeeperC
Post:
[Deleted duplicate post]


Next 20

©2024 climateprediction.net