climateprediction.net home page
daily message timeouts

daily message timeouts

Questions and Answers : Windows : daily message timeouts
Message board moderation

To post messages, you must log in.

AuthorMessage
old_user216094

Send message
Joined: 30 Dec 06
Posts: 3
Credit: 476,808
RAC: 0
Message 29616 - Posted: 19 Jul 2007, 8:52:06 UTC

This might or might not be a problem at all, not quit sure.

At least 6 or 7 times a day, in the \'messages\' portion of BOINC, the model states that it restarts w/a \'message timeout\' error. Pretty sure it shouldn\'t have to restart that many times per day and I also don\'t see anything uploading to the server for at least a month. The model itself has about 40% completed however...

If anyone\'s had this issue, please let me know.

Thanks
ID: 29616 · Report as offensive     Reply Quote
Profile MikeMarsUK
Volunteer moderator
Avatar

Send message
Joined: 13 Jan 06
Posts: 1498
Credit: 15,613,038
RAC: 0
Message 29619 - Posted: 19 Jul 2007, 12:05:28 UTC


Is anything else happening on the PC when that happens, for example, something using 100% of CPU time for a while?

I'm a volunteer and my views are my own.
News and Announcements and FAQ
ID: 29619 · Report as offensive     Reply Quote
old_user216094

Send message
Joined: 30 Dec 06
Posts: 3
Credit: 476,808
RAC: 0
Message 29649 - Posted: 21 Jul 2007, 3:25:01 UTC - in response to Message 29619.  

It seems to pretty much happen when the PC\'s idle. No screensaver or power saving mode is ever applied. Just a run-of-the-mill Dell PC, no overclocking or anything done. I set it up on another Dell almost identical, and there\'s very few restarts, plus it at least uploads a trickle a few times a month. This current PC hasn\'t uploaded since Jul. 1st.

ID: 29649 · Report as offensive     Reply Quote
Les Bayliss
Volunteer moderator

Send message
Joined: 5 Sep 04
Posts: 7629
Credit: 24,240,330
RAC: 0
Message 29650 - Posted: 21 Jul 2007, 4:39:09 UTC


Please copy 20 lines or so from the Messages tab, and paste them here.
Start from well before a timeout message, and continue until well after, so that we can get an idea of what is happening.


Backups: Here
ID: 29650 · Report as offensive     Reply Quote
Profile MikeMarsUK
Volunteer moderator
Avatar

Send message
Joined: 13 Jan 06
Posts: 1498
Credit: 15,613,038
RAC: 0
Message 29651 - Posted: 21 Jul 2007, 7:22:31 UTC


One additional question ... which firewall do you use? I noticed a few days ago that someone was getting the same messages in their log, and they said there was a problem with the firewall at the time.

I'm a volunteer and my views are my own.
News and Announcements and FAQ
ID: 29651 · Report as offensive     Reply Quote
old_user216094

Send message
Joined: 30 Dec 06
Posts: 3
Credit: 476,808
RAC: 0
Message 29665 - Posted: 23 Jul 2007, 0:45:37 UTC - in response to Message 29651.  

Here\'s what little content I currently have in the mess. folder. I\'ve restarted it recently and apparently that resets the log.

Also, I\'ve gotten rid of all software firewalls, the only thing I use is a common linksys WRT54G broadband router. Can\'t see it causing a model restart...is it an internal computation restart or something to do w/the network connection? I\"ll try to post more logs a week or so down the road.

Thanks for the responses.
================================================================

7/20/2007 7:35:55 PM||Starting BOINC client version 5.8.15 for windows_intelx86
7/20/2007 7:35:55 PM||log flags: task, file_xfer, sched_ops
7/20/2007 7:35:55 PM||Libraries: libcurl/7.16.0 OpenSSL/0.9.8a zlib/1.2.3
7/20/2007 7:35:55 PM||Data directory: C:\\Program Files\\BOINC
7/20/2007 7:35:55 PM||Processor: 2 GenuineIntel Intel(R) Pentium(R) 4 CPU 2.80GHz [x86 Family 15 Model 3 Stepping 4] [fpu tsc sse sse2 mmx]
7/20/2007 7:35:55 PM||Memory: 1.50 GB physical, 5.26 GB virtual
7/20/2007 7:35:55 PM||Disk: 232.88 GB total, 162.85 GB free
7/20/2007 7:35:55 PM|climateprediction.net|URL: http://climateprediction.net/; Computer ID: 573373; location: home; project prefs: default
7/20/2007 7:35:55 PM||General prefs: from climateprediction.net (last modified 2007-03-20 06:00:41)
7/20/2007 7:35:55 PM||Host location: home
7/20/2007 7:35:55 PM||General prefs: no separate prefs for home; using your defaults
7/20/2007 7:35:55 PM||Reading preferences override file
7/20/2007 7:35:56 PM|climateprediction.net|Restarting task hadcm3ohe_27qb_05751669_0 using hadcm3 version 515
7/21/2007 7:45:20 AM||Restarting hadcm3ohe_27qb_05751669_0 - message timeout
7/21/2007 8:03:17 AM||Restarting hadcm3ohe_27qb_05751669_0 - message timeout
7/21/2007 8:03:17 AM|climateprediction.net|Restarting task hadcm3ohe_27qb_05751669_0 using hadcm3 version 515
7/22/2007 6:44:55 AM||Restarting hadcm3ohe_27qb_05751669_0 - message timeout
7/22/2007 6:48:07 AM||Restarting hadcm3ohe_27qb_05751669_0 - message timeout

==========================================================================
ID: 29665 · Report as offensive     Reply Quote
Les Bayliss
Volunteer moderator

Send message
Joined: 5 Sep 04
Posts: 7629
Credit: 24,240,330
RAC: 0
Message 29666 - Posted: 23 Jul 2007, 1:18:56 UTC

OK, that\'s enough to go on. At least for someone who has seen it before, which isn\'t me.

For future reference, all of the logs are archived in the BOINC folder:
stdoutdae.txt contains ALL of the messages, and
stderrdae.txt contains only the error messages.



Backups: Here
ID: 29666 · Report as offensive     Reply Quote
Profile astroWX
Volunteer moderator

Send message
Joined: 5 Aug 04
Posts: 1496
Credit: 95,522,203
RAC: 0
Message 29667 - Posted: 23 Jul 2007, 3:28:50 UTC

BUFFIN: Read Failed: Permission denied
BUFFIN: C I/O Error - Return code = 32

Model crashed: umshell1.f: STWORK : I/O error - PP fixed length header GA
Fatal crash! :-(


Must confess I don\'t know what this means but it makes me wonder what you may have done with the machine, and/or to the boinc/CPDN folder, prior to this incident.

This isn\'t likely to be helpful, but it seems to relate to the \'permissions\' thing:
http://www.openldap.org/lists/openldap-devel/200501/msg00006.html

"We have met the enemy and he is us." -- Pogo
Greetings from coastal Washington state, the scenic US Pacific Northwest.
ID: 29667 · Report as offensive     Reply Quote
Profile Thyme Lawn
Volunteer moderator

Send message
Joined: 5 Aug 04
Posts: 1283
Credit: 15,824,334
RAC: 0
Message 29673 - Posted: 23 Jul 2007, 7:07:27 UTC - in response to Message 29667.  

BUFFIN: Read Failed: Permission denied
BUFFIN: C I/O Error - Return code = 32

Model crashed: umshell1.f: STWORK : I/O error - PP fixed length header GA
Fatal crash! :-(


Must confess I don\'t know what this means but it makes me wonder what you may have done with the machine, and/or to the boinc/CPDN folder, prior to this incident.

Have a look in the directory C:\\Program Files\\BOINC\\projects\\climateprediction.net\\hadcm3ohe_27qb_05751669_0\\dataout.

Does it have any zero length files or more than one *.pd* file? That would point towards a known (but very rare) problem that affected 5.15 (caused by shutting down BOINC in the small window between end of year post-processing and the next checkpoint).
"The ultimate test of a moral society is the kind of world that it leaves to its children." - Dietrich Bonhoeffer
ID: 29673 · Report as offensive     Reply Quote

Questions and Answers : Windows : daily message timeouts

©2024 climateprediction.net