climateprediction.net home page
Rampant compute errors

Rampant compute errors

Message boards : Number crunching : Rampant compute errors
Message board moderation

To post messages, you must log in.

AuthorMessage
old_user506792

Send message
Joined: 12 Mar 08
Posts: 1
Credit: 1,806,577
RAC: 0
Message 36767 - Posted: 22 Apr 2009, 6:14:17 UTC

I've noticed that all my new tasks are ending in compute errors. Normally I would suspect that something is wrong with my machine (I'm resetting the project just in case - since I have no tasks, I figure it can't hurt!), but looking into these tasks, it seems that other people are having the same problem with those tasks. Are there just a bunch of bad tasks out there, or is something else going on?



ID: 36767 · Report as offensive     Reply Quote
Les Bayliss
Volunteer moderator

Send message
Joined: 5 Sep 04
Posts: 7629
Credit: 24,240,330
RAC: 0
Message 36768 - Posted: 22 Apr 2009, 6:48:34 UTC
Last modified: 22 Apr 2009, 6:49:00 UTC

If you look at the Task details page for your failed models, and click on stderr out, you'll see the problem.
This one for instance.

Mac computers have problems with the hadcm3 models because of stack space.

There's a thread here which discusses the problem. Near the start is a link to a post that explains how to increase the stack space.
Backups: Here
ID: 36768 · Report as offensive     Reply Quote
Profile Thyme Lawn
Volunteer moderator

Send message
Joined: 5 Aug 04
Posts: 1283
Credit: 15,824,334
RAC: 0
Message 36774 - Posted: 22 Apr 2009, 11:29:55 UTC - in response to Message 36768.  

Mac computers have problems with the hadcm3 models because of stack space.

A more likely cause is described here. A compiler version change means there's a library incompatibility between version 6 of the HadCM3 application and older model types which has yet to be resolved. HadAM3P probably uses the same compiler version as HadCM3.
"The ultimate test of a moral society is the kind of world that it leaves to its children." - Dietrich Bonhoeffer
ID: 36774 · Report as offensive     Reply Quote
old_user453270

Send message
Joined: 23 May 07
Posts: 1
Credit: 218,432
RAC: 0
Message 36776 - Posted: 22 Apr 2009, 12:05:38 UTC - in response to Message 36767.  

I've noticed that all my new tasks are ending in compute errors. Normally I would suspect that something is wrong with my machine (I'm resetting the project just in case - since I have no tasks, I figure it can't hurt!), but looking into these tasks, it seems that other people are having the same problem with those tasks. Are there just a bunch of bad tasks out there, or is something else going on?

Same here. Already some weeks I have this problem. I now detached from the project because after so long a time I figure that it should be remedied. I already changed back to BOINC version 6.2.15, deleted the whole BOINC dir, reattached to projects. But to no avail.

Frans.
ID: 36776 · Report as offensive     Reply Quote
Profile mo.v
Volunteer moderator
Avatar

Send message
Joined: 29 Sep 04
Posts: 2363
Credit: 14,611,758
RAC: 0
Message 36784 - Posted: 23 Apr 2009, 11:23:26 UTC

Hi Frans, welcome to the forum.

The HadSM and HadSM MH models should not have this problem amd the HadAM models will also probably run fine on your machine. You can select these models in the ClimatePrediction preferences in your account.

I agree that this problem has been known for quite a long time and ideally should have already been fixed. However, CPDN only has two programmers (Tolu and Milo) and it's a massive project. They simply haven't got enough time to do everything we and they and the researchers need. Last week Milo had to go into work during his Easter holiday to sort out the big server problem.
Cpdn news
ID: 36784 · Report as offensive     Reply Quote

Message boards : Number crunching : Rampant compute errors

©2024 cpdn.org