Message boards :
Number crunching :
New error message
Message board moderation
Author | Message |
---|---|
Send message Joined: 5 Sep 04 Posts: 7629 Credit: 24,240,330 RAC: 0 |
Well, I haven't seen it before. Model crashed: P_TH_ADJ : NEGATIVE PRESSURE VALUE CREATED. task = t817 |
Send message Joined: 15 Jan 11 Posts: 175 Credit: 6,242,691 RAC: 699 |
Interesting. Since one could surmise that -ve pressures would be associated with either -ve volumes or -ve temperatures, perhaps the conditions simulate instants before the Big Bang. |
Send message Joined: 7 Aug 04 Posts: 2169 Credit: 64,555,907 RAC: 5,858 |
Obviously an unstable atmosphere result. We used to get those with the old hadcm3's back when we started running them. Could be parameters, except every one of that computer's models have crashed with that error. My guess is the CPU and/or memory is being stressed too much and throwing errors. Time for a prime95 run for that computer. It's been a long time since we've had to recommend that. I guess that shows that most people running the models nowadays have distributed computing experience of some type. |
Send message Joined: 15 May 09 Posts: 4350 Credit: 16,558,487 RAC: 4,810 |
Obviously an unstable atmosphere result. We used to get those with the old hadcm3's back when we started running them. Could be parameters, except every one of that computer's models have crashed with that error. Especially as this is with several different task types. Presumably, -ve pressure is the only impossible climate checked for though I seem to remember in the past, it was always, "Invalid Theta?" |
Send message Joined: 5 Sep 04 Posts: 7629 Credit: 24,240,330 RAC: 0 |
It could be that the new message is for something that gets checked for first. Or it could be that the modelling is getting more detailed, and there are new messages. Doesn't negative pressure occur in tornadoes/typhoons/hurricanes? If so, no big bang. Except from electrical activity in the clouds. |
Send message Joined: 7 Aug 04 Posts: 2169 Credit: 64,555,907 RAC: 5,858 |
P = (rho * R * T) / V in the atmosphere. rho = density of gas (can't be negative, near 0 in outer space) R = a constant that is positive T = temperature of that gas in deg Kelvin (can't be negative in any real atmosphere) V = volume of that gas So, it is a test for negative pressures, but may as well be a check for negative absolute temperatures. The following are comments on the purpose of the P_TH_ADJ subroutine in model code: CLL SUBROUTINE P_TH_ADJ ------------------------------------------- PTHADJ1A.3 CLL PTHADJ1A.4 CLL PURPOSE: CALCULATES ADDS SURFACE PRESSURE INCREMENTS USING PTHADJ1A.5 CLL EQUATION (27). CALCULATES AND ADDS POTENTIAL TEMPERATURE PTHADJ1A.6 CLL INCREMENTS USING EQUATION (28). PTHADJ1A.7 We used to get these error messages with the hadcm3's back in the day. Using google search will bring up a bunch of model results, and a few threads in this forum where those errors are mentioned. Negative theta detected is a much more common error message, for whatever reason. |
Send message Joined: 5 Sep 04 Posts: 7629 Credit: 24,240,330 RAC: 0 |
I wonder if that computer is grossly over clocked? That may account for everything failing soon after starting, and may even cause a negative value in variables that can't be negative, due to the same incorrect value being returned from the fpu. I think that I'll ask about this tomorrow, and also see if that person can be sent an email. |
Send message Joined: 5 Sep 04 Posts: 7629 Credit: 24,240,330 RAC: 0 |
That error means that one or more of the needed files for those tasks couldn't be found on the server. There's no other reports yet, so it may be an isolated incident. What happens depends on the message in the Status column of the Tasks tab. If they show as an error, then Abort them. |
Send message Joined: 5 Aug 04 Posts: 1496 Credit: 95,522,203 RAC: 0 |
I had one similar to that today, Les, also for a Wah2 Eu task. Fourteen lines of the 'Parse' error Misty mentioned. Next contact was normal and a new task snagged. I tend not to worry about one-off messages on my machines -- until they recur, or someone else posts about the problem. Jim "We have met the enemy and he is us." -- Pogo Greetings from coastal Washington state, the scenic US Pacific Northwest. |
Send message Joined: 31 Dec 07 Posts: 1152 Credit: 22,108,055 RAC: 2,609 |
They will continue to show as �in progress� on the account page until they time out. |
Send message Joined: 1 Sep 04 Posts: 161 Credit: 81,421,805 RAC: 1,225 |
Misty - I haven't done this for a while, but it used to work. Wait until all of your current models have completed (you have no work). REMOVE the CPDN project from your computer. Wait a few minutes. ADD the CPDN project back to your computer. CPDN will realize you can't possibly have any models on your computer and will abort all the models in the CPDN database that are currently in-progress on your computer. |
Send message Joined: 5 Sep 04 Posts: 7629 Credit: 24,240,330 RAC: 0 |
Misty That sort of thing is called a ghost model, and lots of people have them in their collection. Like Casper, they're friendly, and don't hurt anything, so the simplest idea is to just ignore them. |
Send message Joined: 18 Jul 13 Posts: 438 Credit: 24,541,631 RAC: 1,870 |
I recently had few of these ghost ones and did as WB8ILI suggested here. Detached and re-attached to the CPDN project. Besides getting rid of ghosts one frees WU's for others to crunch. In my case they were too old and labelled as No-Resubmission, but for newer it should work just fine. |
Send message Joined: 5 Jul 09 Posts: 63 Credit: 6,091,274 RAC: 0 |
I recently had few of these ghost ones and did as WB8ILI suggested here. Detached and re-attached to the CPDN project. Besides getting rid of ghosts one frees WU's for others to crunch. In my case they were too old and labelled as No-Resubmission, but for newer it should work just fine. It also clears out any old files left over from previous CPND projects etc. Kevin |
Send message Joined: 15 May 09 Posts: 4350 Credit: 16,558,487 RAC: 4,810 |
Wait until all of your current models have completed (you have no work). Is the important bit. Otherwise you lose real work! Depending on the speed of your computer and boinc settings etc. you may take a long time to reach the no work position unless you set the project to, "no new tasks." |
Send message Joined: 1 Sep 04 Posts: 161 Credit: 81,421,805 RAC: 1,225 |
I know it has been posted many times that one missing result is no big deal. But, I am always afraid there is some scientist in Australia or New Zealand just waiting for this last result to complete a study. So, I like to get my ghosts back into the queue. Sometimes I do have to wait 10 days or more before I have no work left before removing the project from the computer. |
Send message Joined: 5 Sep 04 Posts: 7629 Credit: 24,240,330 RAC: 0 |
I suspect that there's no problem for the scientists, either with models like this, or with those that are "being sat on" by people so that their computer always has work way into the future. If they don't get sufficient results back in what they consider to be a reasonable amount of time, then they just need to issue a new batch. With batch numbers now included in the model names, this makes it even easier for them to write simple scripts to find out what is happening. |
Send message Joined: 15 Jan 06 Posts: 637 Credit: 26,751,529 RAC: 653 |
If they don't get sufficient results back in what they consider to be a reasonable amount of time, then they just need to issue a new batch. It seems to me that is the ultimate argument for shorter deadlines. Otherwise, the project will issue multiple copies of the same work just in the hope of getting one back in time. The "hoarders", if I may call them that, then may never know that their work doesn't count, and hence have no incentive to change their buffer sizes, or whatever else it takes. But the bigger problem is that the work could have been sent out to machines that don't delay so long to begin with. At least one project on World Community Grid has a "reliable machine" designation for those that return the results early, which is a shorter time period than the formal deadline. It would appear to me that the scientists would be insisting that some such approach is implemented here. Or else they are forced to just issue multiple work as indicated above, assuming they are really not willing to delay their publications a year. |
Send message Joined: 5 Sep 04 Posts: 7629 Credit: 24,240,330 RAC: 0 |
I recently suggested shorter times, but didn't get anywhere, which is why I think what I said in my post may be "how things are". Given that we have a "distributed input" from all over the planet, it may be simpler than trying to get the work to reliable computers. Although it should be possible to write scripts to look for hoarders and serial crashers, implementing a way for fast, reliable machines to get more of the work load may be tricky. It would also explain the lack of work for Linux computers, where there's an increasing number of 64 biter's who don't bother checking even once to see if things are OK with the tasks that they get. Anyway, it's only a problem for/with some computers. And the owners will get their credit without knowing that they're wasting electricity. |
Send message Joined: 22 Feb 15 Posts: 3 Credit: 1,065,624 RAC: 0 |
Got this popup today when i got home. http://imgur.com/cxboWF7 |
©2024 climateprediction.net