climateprediction.net home page
Posts by old_user5994

Posts by old_user5994

21) Message boards : Number crunching : Model crashes (Message 33432)
Posted 18 Apr 2008 by Profile old_user5994
Post:
Paul, I haven\'t found \'The device does not recognize the command\' in the Unofficial Boinc Wiki, maybe because as far as I know the phrase never appears in the BOINC manager messages, only on the task\'s web page as an \'explanation\' (!!) of why it crashed.

http://www.boinc-wiki.info/Category:BOINC_Message

(Still an invaluable resource.)

Some of these messages rival ancient Taoist sayings. You might approach an approximate understanding if you retired to a mountain-top then meditated and lived on water and air until age 120.



Fading memory says that I never saw that message ...

If I never had it reported to me ... well, I could only research what and when someone, like you, reported a message ...

Sadly, I lost my mind and had to quit ...

More happily, though I have not found my mind again, or my axe (Nancy seems to hide it on me every time I find it again, and how can you be an insane axe killer without an axe is beyond me ... but I digress) ... I can at least get about pretty well most days ...

Anyway, you did get an answer and hopefully you are on your way again ...

Just one note, some systems seem to be resistant to running some of the projects and we cannot always figure out what the \"magic\" might be ... SO, it is possible that the system just will not run CPDN ...

For a time MY Mac Pro was eating models and it turns out that someone found an issue with the application and now I have 3 models to plow through ... another 900 hours for one ... :)

Anyway, good luck ...

You learned the first lesson of BOINC... ask when you don\'t know ... :)

Ignore the idiots that may show up to give you a hard time ...

For what it is worth, run through the forums and find the good guys and gals and if needed ask directly and most will try to point you right ...
22) Message boards : Number crunching : Model crashes (Message 33426)
Posted 18 Apr 2008 by Profile old_user5994
Post:
Side comment, there is the \"Unofficial BOINC Wiki\" which at one point had a whole section where you could look up each and every message and get an explanation of what that message meant and if it was status, error, or something else ...

It has been two years since I worked on the UBW so I am not sure how many messages will not be there at all because they were added later ...

But, the UBW was at one point the \"encyclopedia\" of BOINC ... and though some say the content is dated, I am willing to bet that it is not as dated as they say ... the core of BOINC has not changed in the two years I was not attached ...

Anyway, just another place to look for information ...
23) Message boards : Number crunching : Redundancy (Message 33415)
Posted 18 Apr 2008 by Profile old_user5994
Post:
I\'m aware that the vocabulary is just part of the overall problem.

In BOINC beta I argued for BOINC COnsole and Plug-in for the BOINC Manager and Science Applications because we could then use the game console that people use as a mental model...

I was shot down because, as I have also long argued, the geeks in us cannot seem to grasp the fact that not everyone is as fascinated with this stuff as we might be ...
24) Message boards : Number crunching : Redundancy (Message 33407)
Posted 18 Apr 2008 by Profile old_user5994
Post:
And the original poster has been noticeably conspicuous by his absence since his post.


So, you don\'t like talking to me ... <sob> ... :)
25) Message boards : Number crunching : Redundancy (Message 33405)
Posted 18 Apr 2008 by Profile old_user5994
Post:
It is not really the fault of the participants to not know. If you look at the \"simple\" interface, how would they know?

But, even with the more complex interface, it still takes some training and time before you know what is going on ...

Even though I was once pretty competent with BOINC two years ago, there is a surprisigly large amount of information that we take for granted that we will understand ...

Though Rummy took a lot of flack for his statement, it is true that there are the known knowns, the unknown knowns, the known unknowns and the unknown unknowns ...

The last one is the killer, most people don\'t see or know (or maybe even care) how different CPDN is from other BOINC projects. Though I have to say that with the LARGE field out there it is getting to be a little less alone for strangeness in the way it does business ...
26) Message boards : Number crunching : Redundancy (Message 33391)
Posted 17 Apr 2008 by Profile old_user5994
Post:
I know that reliable volunteer computers for CPDN are scarce, so I don\'t want to be crunching redundant work.


A *LOT* of science is about doing the same experiment over and over again.

The reason we don\'t have \"Cold Fusion\" powered cars is quite simply that no one could duplicate the experimental results. Thusly, Cold Fusion by that process is not possible.

Other reasons are the ones provided by Les and Iain, ....
27) Message boards : Number crunching : Cooling a new quad (Message 33373)
Posted 16 Apr 2008 by Profile old_user5994
Post:
The \"Best\" place is over at SaH in NC where the rabid OC guys hang out.

If needs be, attach and run a couple tasks so you can post ...

There are constant discussion on the best fans and all the other details.

You will get a lot more input over there ...
28) Questions and Answers : Macintosh : Shared Memory, other thread locked and pinned (Message 33347)
Posted 14 Apr 2008 by Profile old_user5994
Post:
Iain,

Yes, 5 hours on both roughly ... so, looking good ... by now I would have expected them to die ... I got some other project things going on, so it looks like they will be in the \"hopper\" for now as will have to catch CPU when then can ... though I did not want 3 CPDN to run ... ah well ...

luck of the draw I suppose ...

Oh, and all three were running at the same time ... though I stil have 12.14 G Free RAM ... now what to do with that space??? :)
29) Questions and Answers : Macintosh : Shared Memory, other thread locked and pinned (Message 33340)
Posted 14 Apr 2008 by Profile old_user5994
Post:
Iain,

Well, even with a resource share of 5,000 I could not induce it to pull another model ...

On an 8 CPU system ...

I don\'t know how to fiddle the parameters to get it to pull another model ...

Maybe if I suspend all other projects?

{edit}Dng nab it .. pulled TWO !!!!!{/edit}

{edit 2}It looks like the two of them are running ... both have 30 min on their clocks ... {/edit 2}
30) Questions and Answers : Macintosh : Shared Memory, other thread locked and pinned (Message 33332)
Posted 13 Apr 2008 by Profile old_user5994
Post:
The shared memory situation can be improved by using the spyhill patch (see the error-code-six sticky at the top of this forum).

But it appears that it probably won\'t resolve the slab model problem which you are experiencing, all it will do is increase the number of Boinc tasks that you can run simultaneously.


I did not have an issue with any other project running on the 8+ G memory before the patch, or after, ...

I *DO* recall a massive allocation of memory though I am loath to try it again and to \"burn through\" models just for the heck of it ... :)

So, we wait in patience ... :)

Hey, my disk drive is now testing block number -2,009,510,784 and counting down to -13,64,672,512
31) Questions and Answers : Macintosh : Shared Memory, other thread locked and pinned (Message 33327)
Posted 13 Apr 2008 by Profile old_user5994
Post:
maybe *@home applications are not able to deal with more then 2GB of memory or Mac OSX has a bug.



Now there is a happy thought ...

It could be the size though ...

I am running TechTool Pro 4.6.1 on a 1.5 TB disk and it is curently counting NEGATIVE blocks at the far end ... someone used a unsigned long to store the block count (it will always be 0 or more) and read it out as a signed int ...

One of the reasons I do not like C as a programming language. Loose typing is what gets you bugs like this ...

WIth stronger typing the compiler tells you about errors like this so you have to expecially use a type-case to convert from one type to another and then it is on the programmer\'s head if he makes the wrong choices ... but, when you have to do it explicitly and cannot get away with it ... well, thinking about changing from one type to another always made me think of the boundary conditions ...
32) Questions and Answers : Macintosh : Shared Memory, other thread locked and pinned (Message 33324)
Posted 13 Apr 2008 by Profile old_user5994
Post:
Hmmm, mine don\'t even have that much in the std error listed ...

To *ME* this looks more and more like a permissions problem ...

In your case it may be something with the file system. Mine, who knows as there are no messages in the returned data file. Are you running Tiger or Leopard?

My Mac Pro is:

Model Name: Mac Pro
Model Identifier: MacPro3,1
Processor Name: Quad-Core Intel Xeon
Processor Speed: 3.2 GHz
Number Of Processors: 2
Total Number Of Cores: 8
L2 Cache (per processor): 12 MB
Memory: 16 GB
Bus Speed: 1.6 GHz
Boot ROM Version: MP31.006C.B05
SMC Version: 1.25f4

The memory sticks are all from the same vendor and look to be pretty close in batch # so should be close running mates ... no errors reported so running well ...

I am running Leopard with all the latest patches ...
33) Questions and Answers : Macintosh : Shared Memory, other thread locked and pinned (Message 33321)
Posted 13 Apr 2008 by Profile old_user5994
Post:

Tolu is going to have a look at the slabs-on-Leopard issue next week to see if he can find out what is going wrong there.

cool!

It could be as simple as permissions/security is different ...

I THINK there is also some subtle differences in disk layout ... though I cannot be sure.

I *DO* know that even the change in the version of Mac Pro I have, for example, broke the TechTools 4.6.1 toolkit so it will not boot off of the CD/DVD ... they asked me about the memory speed ... not sure what that might have to do with it ... but ...

Of course, I did leave the debris of half a dozen models in my account for my Mac Pro so he can look at the log there ...

I am still in the middle of a long disk test so if it requires a re-boot I can\'t help for a couple days ... but, should he want to test something easy I can try it ...

He can PM me, or if he is a packrat like me he may even have my e-mail address from days of old (it has not changed) ... or, if he posts here ... I am \"watching\" this thread ...
34) Questions and Answers : Macintosh : Shared Memory, other thread locked and pinned (Message 33310)
Posted 12 Apr 2008 by Profile old_user5994
Post:
Well, you guys have some time to figure it out... :)

I have one model running and I think I want to wait till it is done before I do much of anything ... :)

I would hate to blow up a perfectly good running model.
35) Questions and Answers : Macintosh : Shared Memory, other thread locked and pinned (Message 33303)
Posted 12 Apr 2008 by Profile old_user5994
Post:
Not sure I know anything about Macs either ... :)

It could be ... I know a number of things are different in Leopard and that is what I am running on both Macs... of course, there is no model for the G5 ... so, I will have to do other things over there ... :)

I am in the middle of a days long test so, can\'t stop to reboot ... but I could try that ... increase the numbers again ... with 16G of ram it is not like I will run out of it soon ... I am only using a little bit of it at the moment ...
36) Questions and Answers : Macintosh : Shared Memory, other thread locked and pinned (Message 33291)
Posted 11 Apr 2008 by Profile old_user5994
Post:
Just as a follow on note, it does appear that all the other models were \"Slabs\" and I now have a coupled model on the Mac Pro and it has 67 hours of runtime on its clock ... so, that looks decent.

A side observation, not sure what it MIGHT mean, and not sure you want me to \"blow through\" a bunch of models to test (but will if you ask), my memory usage is LOW, I mean, only a couple Gig out of the 16 is being used ... my recollection was that earlier that HUGE chunks of memory were allocated and locked on earlier runs.

I only had 8 G then, but I recall it ALL being consumed. So, the \"shared memory\" may be some OTHER memory allocation issue on the Mac Pro (Intel) machines with the slab models.

Again, it is YOUR models ... hate to blow them up just for laughs...

One OTHER note, though I did have ECC errors they were all corrected so not sure if that is relevant to the discussion or not, but, for completeness, thought I would mention it. It just seems to me, if you want Mac machines to participate and run the slab models, in that Iain points out that this seems to be a \'standard\" problem ...

And I do have the 16M shared memory allocation ... should you wish to check my numbers I will be glad to post them too ...
37) Questions and Answers : Macintosh : Shared Memory, other thread locked and pinned (Message 33255)
Posted 8 Apr 2008 by Profile old_user5994
Post:
It took me longer than it should... but, it does look like you are correct. The other models may have all been Slabs (or I did not have the \"fix\" in place.

BUt, I have a HadCM3 model with an hour on the clock. Which is a step ahead because the other models never got started. Of course, it also says 1089:29:47 to go ...

We will see how it goes.
38) Questions and Answers : Macintosh : Shared Memory, other thread locked and pinned (Message 33251)
Posted 7 Apr 2008 by Profile old_user5994
Post:
Well, I have a new Mac Pro with 8 CPU and tried to run CPDN and the models die. I do have the \"fix\" in place to up the shared memory.

I have 12G Main memory, 2.73 TB free disk space (though it is on a HW RAID 5 array ...)

Memory says it is healthy (I did find one stick reporting correctable ECC errors - it is out now) ...

Not sure what else to test. Any thoughts? I can post reports if you tell me what you want to see ...
39) Message boards : Number crunching : Sudden Dramatic Plunge in Credit? (Message 33230)
Posted 6 Apr 2008 by Profile old_user5994
Post:
Since I admitted I am still alive and breathing on other sites ...

If I may chime in here ...

The historical reason for the credit staying was that SAH classic teams sometimes had one or two people that had large amounts of credit and they effectively became dictators on how the team was run. Do it our way or we will leave. was their threat.

IN that teams were intended to foster some level of competition, this led to the \"newer\" idea that credit should stay regardless of the people coming and going.

I am not making a negative diatribe against CPDN, just the historical observation.

Much as I like my accumulation sometimes I wish we could grow up a little bit ... :)

For me, the point of the credit is the personal accomplishment for each project and a footnote that \"I did that\" ... for others it seems to be a size contest, sadly.

In the long run, I think those teams would have been better off saying good by to the people causing problems ... 10 small contributers can be far more important than one big one.
40) Message boards : Number crunching : Unofficial BOINC Wiki closing 2006-03-31 (Message 20767)
Posted 26 Feb 2006 by Profile old_user5994
Post:
Ladies and Gentlemen,

The military has a axiom that you should never reinforce defeats. Looking back at my participation in the BOINC world I can only conclude that I have been doing just that.

So, it is time to cease. I had thought to explain the motivations that have brought me to this point but have decided that for the most part it would just be another wasted effort. Those that need to heed the words will see no need. And those that agree with them have no power. So, not much point in long explanations. But, in short, I find I can no longer be a part of a system such as I see today. Dissent is suppressed, good ideas are ignored or marginalized, valuable work is discarded, participants are abused; all this, and more.

It is said that for evil to triumph, all that is needed is for good people to do nothing. For a long time I have tried to combat the problems I have seen with virtually no success. In that nothing else I have tried has been effective I have but one last option and that is to cease supporting activities run by people I can no long trust.

With this in mind I have run all my computers out of work, removed BOINC and shut down the machines. However, I will be funding the BOINC Wiki site until the end of the quarter, 2006-03-31, at which point the site will vanish and the domain will become available.

For those that contributed, and those that used the Wiki, I am truly grateful.

If anyone wishes to preserve the Unofficial BOINC Wiki, I have a number of files that may be of interest:

http://boinc-doc.net/sites.zip My working files of the old site and the Wiki including the code development for the Google search (263M) {not really necessary, unless you like history}.
http://boinc-doc.net/log-files.zip the log file samples for adding messages along with my GREP tool (33.5M).
http://boinc-doc.net/misc.zip Just what it says... all the other stuff includes samples and snippets of material I had not gotten to yet (13.1M).
http://boinc-doc.net/BoincViewLogs.zip For those that might be interested in a data-set for analysis (5.8M).

http://boinc-doc.net/images.zip The images directory of the Wiki site with the image files needed (37M).
http://boinc-doc.net/db118605226.csv.zip Wiki database in a CSV dump (7.2M).
http://boinc-doc.net/db118605226.sql.zip WIki database in a SQL Insert dump (8M).

The explanation on how I implemented the Google search is here: http://meta.wikimedia.org/wiki/Google_Search

Paul


Previous 20 · Next 20

©2024 climateprediction.net