Problem with Workunit?

Chidge
Joined: 7 Jun 06
Posts: 5
Credit: 3,228,997
RAC: 0
Message 25350 - Posted: 29 Nov 2006, 18:05:10 UTC
Last modified: 29 Nov 2006, 18:12:11 UTC

Hi,

I have a hadcm3 WU which is sending trickles as normal, but they aren't being shown as received.

clicky

Percentage completed is increasing normally as well - currently 50.734%.

Do I have to abort? thanks :)
ID: 25350
astroWX
Volunteer moderator
Joined: 5 Aug 04
Posts: 1496
Credit: 95,522,203
RAC: 0
Message 25351 - Posted: 29 Nov 2006, 20:21:31 UTC

Hi, Chidge, welcome to the forum.

Hmmm. There was a problem with logging Trickles from Friday through the weekend. It's been resolved.

Given that your Trickles are going up normally, they should be in the database and should be credited sooner or later. (After the weekend outage, my Trickles took a day longer to appear than some others who posted.)

No need to abort the Run. These things tend to come out right in the end.
"We have met the enemy and he is us." -- Pogo
Greetings from coastal Washington state, the scenic US Pacific Northwest.
ID: 25351
Les Bayliss
Volunteer moderator
Joined: 5 Sep 04
Posts: 7629
Credit: 24,240,330
RAC: 0
Message 25352 - Posted: 29 Nov 2006, 20:44:23 UTC

Hi Chidge

Did you receive something similar to this for your last upload:
29/11/2006 7:58:28 PM|climateprediction.net|Scheduler request to http://climateapps2.oucs.ox.ac.uk/cpdnboinc_cgi/cgi succeeded


If so, then the trickle should be on the upload server, but possibly not yet processed and transferred to the trickle server.

Also, have a look in the folder for the cpdn project, and see if the trickle_up icon is still there, but "greyed".
I've got one like this from an upload an hour ago.

ID: 25352
MikeMarsUK
Volunteer moderator
Joined: 13 Jan 06
Posts: 1498
Credit: 15,613,038
RAC: 0
Message 25354 - Posted: 29 Nov 2006, 21:02:03 UTC


The other possibility is a 'looping' model. To work out if this is happening, keep making a note of the 'model date' (displayed on the 'show graphics' screen). If this is cycling between 1st Dec and some arbitrary date in the next year, but never reaching the next year's 1st Dec, then it may be a looping model.
I'm a volunteer and my views are my own.
ID: 25354
Chidge
Joined: 7 Jun 06
Posts: 5
Credit: 3,228,997
RAC: 0
Message 25359 - Posted: 29 Nov 2006, 23:35:45 UTC
Last modified: 29 Nov 2006, 23:42:34 UTC

Hi,

thank you for all the replies :)

this is the last trickle up message I have:

29/11/2006 14:59:20|climateprediction.net|Sending scheduler request to http://climateapps2.oucs.ox.ac.uk/cpdnboinc_cgi/cgi
29/11/2006 14:59:20|climateprediction.net|Reason: To send trickle-up message
29/11/2006 14:59:20|climateprediction.net|(not requesting new work or reporting completed tasks)
29/11/2006 14:59:25|climateprediction.net|Scheduler request succeeded

I looked for the trickle_up icon in the cpdn folder within boinc but couldn't find anything - might not be looking in the right place? :P

As for a 'looping model', I'm currently at 24/05/2002 (50.92%), so I will keep checking to see if that is increasing normally.

Finally though if I scroll to the top of the messages I have:
21/11/2006 18:25:37|climateprediction.net|Task hadcm3lbm_b5ed_05289480_1 is 134362.43 days overdue.
21/11/2006 18:25:37|climateprediction.net|You may not get credit for it. Consider aborting it.

Looking at the report deadline, it says 1901 - that can't be right?

Incidentally I have been making backups of my cpdn folder - the most recent I have is from the beginning of November
ID: 25359
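For anyone who wants to check their own message log the way Les suggests, here is a minimal Python sketch. It assumes the lines look exactly like the ones Chidge pasted (timestamp, project and text separated by "|", with a 24-hour dd/mm/yyyy timestamp); the function names are made up for illustration and are not part of BOINC. Note that "succeeded" only tells you the scheduler request went through; as Les says, the trickle may still be waiting to be processed on the server side.

```python
from datetime import datetime

def parse_line(line):
    """Split one 'timestamp|project|text' message line into its three fields."""
    stamp, project, text = line.split("|", 2)
    return datetime.strptime(stamp, "%d/%m/%Y %H:%M:%S"), project, text

def trickle_results(lines):
    """Return (sent_at, succeeded) for each scheduler request sent for a trickle-up."""
    results = []
    sent_at, is_trickle = None, False
    for line in lines:
        when, _project, text = parse_line(line)
        if text.startswith("Sending scheduler request"):
            # A new request begins; remember when it was sent.
            sent_at, is_trickle = when, False
        elif "trickle-up" in text:
            # The 'Reason:' line says this request carries a trickle.
            is_trickle = True
        elif text.startswith("Scheduler request") and sent_at is not None:
            # Outcome line for the request currently being tracked.
            if is_trickle:
                results.append((sent_at, "succeeded" in text))
            sent_at, is_trickle = None, False
    return results

# The four lines from the post above.
log = [
    "29/11/2006 14:59:20|climateprediction.net|Sending scheduler request to http://climateapps2.oucs.ox.ac.uk/cpdnboinc_cgi/cgi",
    "29/11/2006 14:59:20|climateprediction.net|Reason: To send trickle-up message",
    "29/11/2006 14:59:20|climateprediction.net|(not requesting new work or reporting completed tasks)",
    "29/11/2006 14:59:25|climateprediction.net|Scheduler request succeeded",
]

for sent_at, ok in trickle_results(log):
    print(sent_at, "succeeded" if ok else "did not succeed")
```

The 12-hour "7:58:28 PM" style in Les's example line would need a different timestamp format string, so treat the parser as specific to the log style shown here.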
old_user202664
Joined: 13 Oct 06
Posts: 60
Credit: 7,893
RAC: 0
Message 25362 - Posted: 30 Nov 2006, 0:17:56 UTC

I'm not 100% sure, but I think I've read about a crash or bug that makes BOINC set the deadline to 1901. But apart from annoying messages, I don't think it hurts...
ID: 25362
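As general background on why 1901 in particular tends to turn up in bogus dates: the earliest moment a signed 32-bit count of seconds since 1 January 1970 can represent falls in December 1901. Nothing in this thread confirms that this is what happened to the deadline here, so take the short Python sketch below as an illustration of the arithmetic only, not a diagnosis.

```python
from datetime import datetime, timedelta, timezone

# Unix time counts seconds from this instant.
EPOCH = datetime(1970, 1, 1, tzinfo=timezone.utc)

# The most negative value a signed 32-bit integer can hold is -2**31.
earliest = EPOCH + timedelta(seconds=-2**31)

print(earliest.isoformat())  # 1901-12-13T20:45:52+00:00
```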
mo.v
Volunteer moderator
Joined: 29 Sep 04
Posts: 2363
Credit: 14,611,758
RAC: 0
Message 25364 - Posted: 30 Nov 2006, 2:18:47 UTC

Don't abort it unless it turns out to be a looper! The deadline date won't be imposed as long as the computer trickles fairly regularly.
ID: 25364
astroWX
Volunteer moderator
Joined: 5 Aug 04
Posts: 1496
Credit: 95,522,203
RAC: 0
Message 25365 - Posted: 30 Nov 2006, 2:40:35 UTC

The calendar resets to 1901 when the motherboard battery dies or the BIOS jumper is grounded.

If you're not running other boinc clients, it's no big deal.
"We have met the enemy and he is us." -- Pogo
Greetings from coastal Washington state, the scenic US Pacific Northwest.
ID: 25365
Chidge
Joined: 7 Jun 06
Posts: 5
Credit: 3,228,997
RAC: 0
Message 25369 - Posted: 30 Nov 2006, 10:34:23 UTC
Last modified: 30 Nov 2006, 10:48:50 UTC

Hi,

I'm thinking my model might be looping then.

Checking it now it says 03/02/2002 50.728%
Yesterday it was 24/05/2002 50.92%

I have a backup from when the model was at 38% (not sure about the model date)
Would it be worth using this or will it fail at the same point again?

thanks for all your help
ID: 25369
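The check MikeMarsUK describes (note the model date each time you look, and watch whether it ever goes backwards) can be written down as a minimal Python sketch. The function name and the data layout are made up for illustration, and the dates are read as dd/mm/yyyy, matching the log timestamps in this thread. A single backward step is only a hint; it is the repeated cycling without ever reaching the next 1st Dec that points to a looping model.

```python
from datetime import date

def find_regressions(observations):
    """Return pairs of consecutive checks where the model date went backwards.

    observations: list of (checked_on, model_date) tuples, in the order
    the checks were made.
    """
    regressions = []
    for earlier, later in zip(observations, observations[1:]):
        if later[1] < earlier[1]:
            regressions.append((earlier, later))
    return regressions

# The two readings reported in this thread.
observations = [
    (date(2006, 11, 29), date(2002, 5, 24)),  # 24/05/2002 at 50.92%
    (date(2006, 11, 30), date(2002, 2, 3)),   # 03/02/2002 at 50.728%
]

for earlier, later in find_regressions(observations):
    print(f"Model date fell back from {earlier[1]} to {later[1]}")
```

Adding a new reading to the list each day and re-running makes it easy to see whether the model date keeps falling back.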
astroWX
Volunteer moderator
Joined: 5 Aug 04
Posts: 1496
Credit: 95,522,203
RAC: 0
Message 25377 - Posted: 30 Nov 2006, 18:00:06 UTC

Carl posted that it was most important to get past 50% (the "hindcast" part).

12% is a lot of rerun (for which you won't receive credits) and it might fail at the same place. Some are known to continue on. A friend had the problem and her retries failed. (She had a backup which involved less rerun, so she retried at least twice before performing the mercy-killing.) So, in the end, it's your call.

"We have met the enemy and he is us." -- Pogo
Greetings from coastal Washington state, the scenic US Pacific Northwest.
ID: 25377
MikeMarsUK
Volunteer moderator
Joined: 13 Jan 06
Posts: 1498
Credit: 15,613,038
RAC: 0
Message 25382 - Posted: 30 Nov 2006, 20:14:41 UTC


In general, the odds of getting past the looping point are extremely low, so it'd be best to zap this model and start with another one. On the other hand, if you're going to crash anywhere, at 50.x% is the very best place to do it, since it's just after the upload!

I'm a volunteer and my views are my own.
ID: 25382
