climateprediction.net home page
error code -161 on suphur cycle WU, now all work fails

error code -161 on suphur cycle WU, now all work fails

Questions and Answers : Windows : error code -161 on suphur cycle WU, now all work fails
Message board moderation

To post messages, you must log in.

AuthorMessage
Thunder

Send message
Joined: 1 Sep 04
Posts: 42
Credit: 6,475,117
RAC: 0
Message 16244 - Posted: 25 Sep 2005, 16:04:38 UTC
Last modified: 25 Sep 2005, 16:11:03 UTC

I got this error from this WU:

BAD WU

Edit: I\'ve tried every combination of bbcode tags I can find to get the error code to display. I can\'t quote it and have it display. Please look at the WU itself... I\'m on my 9th edit of the post and am giving up. :P


<core_client_version>4.45</core_client_version>
<message><file_xfer_error>
  <file_name>46ii_100295338_1_1.zip</file_name>
  <error_code>-161</error_code>
  <error_message></error_message>
</file_xfer_error>
<file_xfer_error>
  <file_name>46ii_100295338_1_2.zip</file_name>
  <error_code>-161</error_code>
  <error_message></error_message>
</file_xfer_error>
<file_xfer_error>
  <file_name>46ii_100295338_1_3.zip</file_name>
  <error_code>-161</error_code>
  <error_message></error_message>
</file_xfer_error>
<file_xfer_error>
  <file_name>46ii_100295338_1_4.zip</file_name>
  <error_code>-161</error_code>
  <error_message></error_message>
</file_xfer_error>
<file_xfer_error>
  <file_name>46ii_100295338_1_5.zip</file_name>
  <error_code>-161</error_code>
  <error_message></error_message>
</file_xfer_error>

</message>



and now all further WU\'s error out with:

<core_client_version>4.45</core_client_version>
<message> - exit code -1073741819 (0xc0000005)
</message>

I\'ve tried restarting BOINC and then resetting CPDN on this machine, but I won\'t know for about 10 hours (when communciation is no longer deferred) if it made any difference).

Obviously, the BOINC version, you all can see. The installation is running Einstein@Home as well. I\'m confident the computer is stable (not overclocked, etc.) and well maintained.

Any idea what I can do about this? (I know there are lots of other posts that appear to relate to this problem, but none seemed exactly the same and none seemed to offer any solution or ideas)
ID: 16244 · Report as offensive     Reply Quote
Les Bayliss
Volunteer moderator

Send message
Joined: 5 Sep 04
Posts: 7629
Credit: 24,240,330
RAC: 0
Message 16245 - Posted: 25 Sep 2005, 16:58:45 UTC

The secret to posting these errors is to paste, and then edit to change the arrow brackets to square brackets.

The only person I know who has fixed a problem with the error code 1073741819 is <a href=\"http://climateapps2.oucs.ox.ac.uk/cpdnboinc/forum_thread.php?id=3229\"> here.</a> And in his case it appears to have been a graphics card problem.

I\'ve seen reports of error 161, but I\'m not sure what it means.

There are also reports starting to show up that Bern is down yet again.
Have a look in client_state.xml for the model name, and find where the zips are mentioned.
Just before each one is a line about the destination uploader. If it\'s Bern, it will be something like: unib.ch, in which case you\'ll just have to wait until they are back up again.

ID: 16245 · Report as offensive     Reply Quote
Thunder

Send message
Joined: 1 Sep 04
Posts: 42
Credit: 6,475,117
RAC: 0
Message 16246 - Posted: 25 Sep 2005, 18:16:19 UTC - in response to Message 16245.  

The secret to posting these errors is to paste, and then edit to change the arrow brackets to square brackets.


Okay, thanks, I\'ll make a point of using a search/replace in a text editor before I post \'em. :)

The only person I know who has fixed a problem with the error code 1073741819 is <a href=\"http://climateapps2.oucs.ox.ac.uk/cpdnboinc/forum_thread.php?id=3229\"> here.</a> And in his case it appears to have been a graphics card problem.


Unfortunately, I\'m sure that\'s not my case. I\'m running BOINC as a service and never display the graphics (screensaver disabled, managed from boincview on another machine)


Have a look in client_state.xml for the model name, and find where the zips are mentioned.
Just before each one is a line about the destination uploader. If it\'s Bern, it will be something like: unib.ch, in which case you\'ll just have to wait until they are back up again.



Okay, I suppose this will be informative as a \'post-mortem\', but the client has terminated/abandoned the run, so it\'s too late to do anything about it now.

I was depressed enough over yet another failed run (my 45th) that I stopped to add up how much processing time was devoted to runs that failed for some reason. 36,032,432 seconds or 417 cpu/days of time that didn\'t produce anything useful. :P I also have 14,189,264.00 seconds towards 2 results that appear to be \'unknown\' though, I think they completed. Thankfully I do at least have 82,252,941 seconds (952 cpu/days) that produced 13 complete runs at least!

\'Tis a bit frustrating to think that 4/5ths of my models are going to error out and therefore nearly 1/3rd of my computing time will be wasted. :\\
ID: 16246 · Report as offensive     Reply Quote
Thunder

Send message
Joined: 1 Sep 04
Posts: 42
Credit: 6,475,117
RAC: 0
Message 16511 - Posted: 9 Oct 2005, 21:54:26 UTC - in response to Message 16246.  

Well, this is frustrating....

The same computer just got to the end of phase 1 of another sulpher cycle unit and had the exact same error.

Here\'s the result:

http://climateapps2.oucs.ox.ac.uk/cpdnboinc/result.php?resultid=1102562

I can\'t find any reference to what an error -161 is anywhere, but it\'s evidently going to keep happening. :(

Any ideas before I have to take yet another machine off CPDN because it just won\'t run correctly?
ID: 16511 · Report as offensive     Reply Quote
Profile geophi
Volunteer moderator

Send message
Joined: 7 Aug 04
Posts: 2181
Credit: 64,766,246
RAC: 653
Message 16512 - Posted: 9 Oct 2005, 22:36:24 UTC

Sorry, I\'m not able to help you with the 161 error, but it doesn\'t look like an end of phase error as you were 14 trickles into the phase, and the original failure where you were 16 trickles into the phase.
ID: 16512 · Report as offensive     Reply Quote
Thunder

Send message
Joined: 1 Sep 04
Posts: 42
Credit: 6,475,117
RAC: 0
Message 16514 - Posted: 10 Oct 2005, 2:52:49 UTC - in response to Message 16512.  

Sorry, I\'m not able to help you with the 161 error, but it doesn\'t look like an end of phase error as you were 14 trickles into the phase, and the original failure where you were 16 trickles into the phase.


Hrmm.... true.

Well, I\'ll make one more check tomorrow to make sure the hardware is working fine, but I checked it with memtest86 and SuperPi after the last failed WU and both completed without so much as a hiccup. Scandisk found no problems, etc.

This is a stock HP/Compaq business desktop... no overclock, nothing but factory original and it\'s only a few months old. This thing runs stable as a rock on anything but CPDN, it seems. :(
ID: 16514 · Report as offensive     Reply Quote
Profile geophi
Volunteer moderator

Send message
Joined: 7 Aug 04
Posts: 2181
Credit: 64,766,246
RAC: 653
Message 16515 - Posted: 10 Oct 2005, 3:13:09 UTC - in response to Message 16514.  

This is a stock HP/Compaq business desktop... no overclock, nothing but factory original and it\'s only a few months old. This thing runs stable as a rock on anything but CPDN, it seems. :(

I had -1073741819 errors for awhile on one PC. Updated the BIOS and they went away. The BIOS update supposedly fixed some memory incompability...even though Prime95 and memtest ran fine.

Then again, maybe it was some odd software incompatability...
ID: 16515 · Report as offensive     Reply Quote
Thunder

Send message
Joined: 1 Sep 04
Posts: 42
Credit: 6,475,117
RAC: 0
Message 16517 - Posted: 10 Oct 2005, 5:21:53 UTC - in response to Message 16515.  

I had -1073741819 errors for awhile on one PC. Updated the BIOS and they went away. The BIOS update supposedly fixed some memory incompability...even though Prime95 and memtest ran fine.


I\'ll check and see if there are any newer BIOS updates beyond what it came with tomorrow.
ID: 16517 · Report as offensive     Reply Quote
Thunder

Send message
Joined: 1 Sep 04
Posts: 42
Credit: 6,475,117
RAC: 0
Message 16534 - Posted: 10 Oct 2005, 17:51:36 UTC - in response to Message 16517.  

I\'ll check and see if there are any newer BIOS updates beyond what it came with tomorrow.


The only updates beyond what I have are only recommended for some specific hardware issues (none of which apply to this system) and the recommendation is to leave the BIOS I have on the system. Since I take a definite approach of \'if it ain\'t broke, don\'t fix it\', I\'m leaving that alone.

I now see that there are posts over on the other message boards (the CPDN classic ones that I can\'t seem to ever get my account to work on, so I can\'t post to them) about many, many others experiencing this -161 error with suphur_cycles, so I\'m filing this away in the catagory of \"CPDN\'s problem, not mine\".
ID: 16534 · Report as offensive     Reply Quote
Les Bayliss
Volunteer moderator

Send message
Joined: 5 Sep 04
Posts: 7629
Credit: 24,240,330
RAC: 0
Message 16536 - Posted: 10 Oct 2005, 19:16:40 UTC

To post on the community board, you have to register there separately.
These boards are at Oxford Uni, and the other is at the Open University at Milton Keynes, about 40 miles to the NE.

ID: 16536 · Report as offensive     Reply Quote
Thunder

Send message
Joined: 1 Sep 04
Posts: 42
Credit: 6,475,117
RAC: 0
Message 16606 - Posted: 14 Oct 2005, 15:03:42 UTC - in response to Message 16511.  

Well, now another WU just failed with error code -5 (0xfffffffb).

I\'m going to go nuts if I have to try to figure out the meaning of another error code so I\'m pulling this one off CPDN as well. (Now the 5th machine of 9 that I\'ve had to pull off CPDN because it just won\'t run on them).

The really weird part is that the only machines I have that seem to run CPDN reliably are the 2 that I cobbled together myself. Every single stock PC that I use from Compaq/HP and IBM all error more than they work reliably. All of them run every other BOINC project perfectly and all complete every stress test I throw at them perfectly (Memtest86+, SuperPi and Prime95 torture tests), but they just can\'t seem to do CPDN. :(
ID: 16606 · Report as offensive     Reply Quote
racinjimy

Send message
Joined: 19 Apr 05
Posts: 53
Credit: 6,325,436
RAC: 0
Message 16625 - Posted: 15 Oct 2005, 6:03:20 UTC - in response to Message 16606.  

The really weird part is that the only machines I have that seem to run CPDN reliably are the 2 that I cobbled together myself. Every single stock PC that I use from Compaq/HP and IBM all error more than they work reliably.


hmmmmmmm

maybe not so wierd after all

I think the machines you put together probably use better hardware than the HP/Compaq junk
ID: 16625 · Report as offensive     Reply Quote
Thunder

Send message
Joined: 1 Sep 04
Posts: 42
Credit: 6,475,117
RAC: 0
Message 17066 - Posted: 8 Nov 2005, 21:39:06 UTC - in response to Message 16606.  

Anyone yet have any idea what these -161 errors are or mean?

I just had another one fail with it:

http://climateapps2.oucs.ox.ac.uk/cpdnboinc/result.php?resultid=1136194
ID: 17066 · Report as offensive     Reply Quote
Les Bayliss
Volunteer moderator

Send message
Joined: 5 Sep 04
Posts: 7629
Credit: 24,240,330
RAC: 0
Message 17069 - Posted: 8 Nov 2005, 22:38:21 UTC
Last modified: 8 Nov 2005, 22:38:57 UTC

Your original post mentioned 4.45
Assuming that this the BOINC version, there is a bug in it.
I posted a long reply to another person about this yesterday.
Rather than re-type it all again, you can read what I said <a href=\"http://climateapps2.oucs.ox.ac.uk/cpdnboinc/forum_thread.php?id=3412\"> here.</a>
Some of it may be useful to you.

ID: 17069 · Report as offensive     Reply Quote
Profile Thyme Lawn
Volunteer moderator

Send message
Joined: 5 Aug 04
Posts: 1283
Credit: 15,824,334
RAC: 0
Message 17076 - Posted: 9 Nov 2005, 7:58:16 UTC - in response to Message 17066.  

Anyone yet have any idea what these -161 errors are or mean?

The -161 errors are not the real problem. They indicate that the application has finished running and BOINC is attempting to upload result files that haven\'t been created.

The -161 errors are masking the real problem, which might be revealed by looking at the stdoutdae.txt and stderrdae.txt files in the BOINC directory.
"The ultimate test of a moral society is the kind of world that it leaves to its children." - Dietrich Bonhoeffer
ID: 17076 · Report as offensive     Reply Quote
Thunder

Send message
Joined: 1 Sep 04
Posts: 42
Credit: 6,475,117
RAC: 0
Message 17090 - Posted: 9 Nov 2005, 16:52:16 UTC - in response to Message 17069.  

Your original post mentioned 4.45
Assuming that this the BOINC version, there is a bug in it.


Thank you for the reply Les, but I\'ve been using the custom compliled 4.45b client since about the 3rd time I had my clients cease processing CPDN due to the \'failure to exit\' benchmark bug, so I\'m sure that\'s not the problem.

I\'m just now opening and looking at the files that Thyme suggested to see what I can find there.
ID: 17090 · Report as offensive     Reply Quote
Thunder

Send message
Joined: 1 Sep 04
Posts: 42
Credit: 6,475,117
RAC: 0
Message 17092 - Posted: 9 Nov 2005, 17:05:47 UTC - in response to Message 17076.  
Last modified: 9 Nov 2005, 17:06:40 UTC

The -161 errors are not the real problem. They indicate that the application has finished running and BOINC is attempting to upload result files that haven\'t been created.

The -161 errors are masking the real problem, which might be revealed by looking at the stdoutdae.txt and stderrdae.txt files in the BOINC directory.


No Thyme, I\'m afraid I must disagree... there are no other errors in either of those files that indicate anything beyond exactly what appears in the result that I linked to. Just in case, I\'m copy/pasting (replacing the \'evil characters that won\'t display\' with [ and so forth) the exact output here:

From stderrdae:

2005-11-07 21:37:48 [climateprediction.net] Unrecoverable error for result sulphur_480b_000297275_0 ([file_xfer_error]
[file_name]sulphur_480b_000297275_0_1.zip[/file_name]
[error_code]-161[/error_code]
[error_message][/error_message]
[/file_xfer_error]
[file_xfer_error]
[file_name]sulphur_480b_000297275_0_2.zip[/file_name]
[error_code]-161[/error_code]
[error_message][/error_message]
[/file_xfer_error]
[file_xfer_error]
[file_name]sulphur_480b_000297275_0_3.zip[/file_name]
[error_code]-161[/error_code]
[error_message][/error_message]
[/file_xfer_error]
[file_xfer_error]
[file_name]sulphur_480b_000297275_0_4.zip[/file_name]
[error_code]-161[/error_code]
[error_message][/error_message]
[/file_xfer_error]
[file_xfer_error]
[file_name]sulphur_480b_000297275_0_5.zip[/file_name]
[error_code]-161[/error_code]
[error_message][/error_message]
[/file_xfer_error]
)


and from stdoutdae:

2005-11-07 21:37:48 [climateprediction.net] Unrecoverable error for result sulphur_480b_000297275_0 ([file_xfer_error]
[file_name]sulphur_480b_000297275_0_1.zip[/file_name]
[error_code]-161[/error_code]
[error_message][/error_message]
[/file_xfer_error]
[file_xfer_error]
[file_name]sulphur_480b_000297275_0_2.zip[/file_name]
[error_code]-161[/error_code]
[error_message][/error_message]
[/file_xfer_error]
[file_xfer_error]
[file_name]sulphur_480b_000297275_0_3.zip[/file_name]
[error_code]-161[/error_code]
[error_message][/error_message]
[/file_xfer_error]
[file_xfer_error]
[file_name]sulphur_480b_000297275_0_4.zip[/file_name]
[error_code]-161[/error_code]
[error_message][/error_message]
[/file_xfer_error]
[file_xfer_error]
[file_name]sulphur_480b_000297275_0_5.zip[/file_name]
[error_code]-161[/error_code]
[error_message][/error_message]
[/file_xfer_error]
)


I\'ll swear on as big a stack of bibles as you\'d care to put before me that there is absolutely nothing immediately before, nor after those errors that indicate anything other than the normal operation of the client (pausing, switching, downloading, uploading stuff, etc.)
ID: 17092 · Report as offensive     Reply Quote
Profile Thyme Lawn
Volunteer moderator

Send message
Joined: 5 Aug 04
Posts: 1283
Credit: 15,824,334
RAC: 0
Message 17113 - Posted: 10 Nov 2005, 8:50:36 UTC - in response to Message 17092.  

I\'ll swear on as big a stack of bibles as you\'d care to put before me that there is absolutely nothing immediately before, nor after those errors that indicate anything other than the normal operation of the client (pausing, switching, downloading, uploading stuff, etc.)

OK, but that doesn\'t take away the fact that the the error messages are caused by BOINC attempting to upload result files that don\'t exist. Which can only happen if it thinks the application has completed.
"The ultimate test of a moral society is the kind of world that it leaves to its children." - Dietrich Bonhoeffer
ID: 17113 · Report as offensive     Reply Quote

Questions and Answers : Windows : error code -161 on suphur cycle WU, now all work fails

©2024 cpdn.org