Message boards : Number crunching : constant computation errors
Author | Message |
---|---|
pfschumacher Send message Joined: 23 Nov 05 Posts: 6 Credit: 1,446,040 RAC: 0 |
i run the same work on several computers, rosetta at home. its my only project. on one computer which i check once a day, i will see all downloaded files have suffered a computation error. i update, get new work, it runs while i am watching it and then again the next day, all errors. i am sure someone needs more information to answer this question, but is there some basic wrongness you might know about? none of my other computers suffer computation errors, ever. not one time. thanks. |
Evan Send message Joined: 23 Dec 05 Posts: 268 Credit: 402,585 RAC: 0 |
Do you mean the latest problem with validate errors? This has been now corrected. |
pfschumacher Send message Joined: 23 Nov 05 Posts: 6 Credit: 1,446,040 RAC: 0 |
Do you mean the latest problem with validate errors? This has been now corrected. no. i mean nearly every time i check my computer, which for that particular computer is about one time a day, all the work units are noted "computation error." they are not submitted, and i dont have new work lined up when i notice it. i feel im wasting units and a good hard working computer, but i dont know why they all total out with errors. this has been happening for a month or so. |
Snags Send message Joined: 22 Feb 07 Posts: 198 Credit: 2,888,320 RAC: 0 |
Which of your five computers are you referring to? Is it 91041? If I go back far enough in the tasks list I see a lot of tasks reported with 0 CPU time and one of the following messages: <message> Input file minirosetta_1.97_windows_intelx86.exe missing or invalid: -163 </message> or <message> app_version download error: couldn't get input files: <file_xfer_error> <file_name>minirosetta_1.97_windows_intelx86.exe</file_name> <error_code>-120</error_code> <error_message>signature verification failed</error_message> </file_xfer_error> </message> More recently this computer has completed several tasks successfully and a few ended with a different client error: -1073741819 (0xc0000005). Here's a link to one of those tasks:280033729 From your posts it appears than when you manually update all the files get downloaded and the program runs fine but when you leave the computer on its own it rejects the .exe file thus the tasks can't run. Sorry if I'm asking the obvious but when you manually update do you get any popups from your antivirus or firewall programs? Are you running the exact same versions and using the exact same settings on all your machines? I'm guessing there is a difference somewhere that would explain why this machine is blocking the files and the other machines aren't. As for the third error message here's the entry from the BOINC FAQ Service. Someone more knowledgeable than I will have to guide you from here. Snags |
mikey Send message Joined: 5 Jan 06 Posts: 1895 Credit: 9,208,737 RAC: 3,249 |
Do you mean the latest problem with validate errors? This has been now corrected. I have found that sometimes just shutting down the pc and after an hour or so restarting it fixes some of these unknown errors. I don't know why, it just sometimes works. Other than that follow Snagletooth's suggestions. |
pfschumacher Send message Joined: 23 Nov 05 Posts: 6 Credit: 1,446,040 RAC: 0 |
Which of your five computers are you referring to? Is it 91041? If I go back far enough in the tasks list I see a lot of tasks reported with 0 CPU time and one of the following messages: snagle, i never receive pop ups from my virus program, and i keep the boinc updated to its latest version. i just updated the version recently but i recall these errors regardless of what version i have been using. your insight makes sense to me, and yes the computer you mentioned is the one in question. why would it reject the .exe files, i cannot say. i dont know enough about computers to answer that question, but it appears that is precisely what is happening. thank you for the reply, ill try "someone more knowledgable" if they can be found, and let you know of their response. paul |
LizzieBarry Send message Joined: 25 Feb 08 Posts: 76 Credit: 201,862 RAC: 0 |
snagle, I never receive pop ups from my virus program, and I keep the boinc updated to its latest version. I just updated the version recently but I recall these errors regardless of what version I have been using. Go into your antivirus program and check the quarantine. Norton, Nod32 and Kaspersky are all capable of isolating minirosetta_1.97_windows_intelx86.exe |
Mod.Sense Volunteer moderator Send message Joined: 22 Aug 06 Posts: 4018 Credit: 0 RAC: 0 |
Is this a new problem for that host? You said it's been occurring for a month or so, but is that when you attached the host to Rosetta? Some corporate firewalls will filter .exe downloads over http, which is what BOINC does, at least when working with R@h. Are your other machines on the same network? Rosetta Moderator: Mod.Sense |
mikey Send message Joined: 5 Jan 06 Posts: 1895 Credit: 9,208,737 RAC: 3,249 |
snagle, i never receive pop ups from my virus program, and i keep the boinc updated to its latest version. i just updated the version recently but i recall these errors regardless of what version i have been using. One thing you can do is exclude the Boinc folders from the anti-virus as far as scanning and keeping track of goes. Within your anti-virus program there should be a way to exclude certain folders, lots of us have done this. |
pfschumacher Send message Joined: 23 Nov 05 Posts: 6 Credit: 1,446,040 RAC: 0 |
Is this a new problem for that host? You said it's been occurring for a month or so, but is that when you attached the host to Rosetta? Some corporate firewalls will filter .exe downloads over http, which is what BOINC does, at least when working with R@h. thank you lizzie and mod. boinc is allowed and okay by my virus program (mcafee) and i have been attached to to this host for over a year. if by that you mean that this computer is boinc-ing and running rosetta. i have two other machines on this network. i have not had an error for a few days now, and i have never reset this project. i should maybe try that if i get more errors. otherwise from my end i am not sure how to proceed. |
pfschumacher Send message Joined: 23 Nov 05 Posts: 6 Credit: 1,446,040 RAC: 0 |
Is this a new problem for that host? You said it's been occurring for a month or so, but is that when you attached the host to Rosetta? Some corporate firewalls will filter .exe downloads over http, which is what BOINC does, at least when working with R@h. to update, i have not had a single computation error since i changed my cpu usage settings from 100% to 90%. coincidental or not, it seems to be working. |
Snags Send message Joined: 22 Feb 07 Posts: 198 Credit: 2,888,320 RAC: 0 |
to update, i have not had a single computation error since i changed my cpu usage settings from 100% to 90%. coincidental or not, it seems to be working. ? While there have not been missing file errors in the last few days, 3 out of 10 results returned so far today have ended with client errors: 280379185 280341063 280316245 - exit code -1073741819 (0xc0000005) The Windows Runtime Debugger is engaged so perhaps someone will be able to spot a clue in the stderr out. You can try the BOINC FAQ Service link I posted earlier for some ideas. Again, someone else will have to guide you as far as checking drivers, etc. Perhaps someone has insight into the intermittent nature of these errors. If I'm reading the numbers right there's not a lot of memory available, if any of it's bad would that cause the occasional error or would every task fail? Snags |
Greg_BE Send message Joined: 30 May 06 Posts: 5691 Credit: 5,859,226 RAC: 0 |
to update, i have not had a single computation error since i changed my cpu usage settings from 100% to 90%. coincidental or not, it seems to be working. Someone else posted a question along the same lines. Take a look at my message here and see if that helps with your problems. |
macko Send message Joined: 25 Jun 09 Posts: 32 Credit: 153,495 RAC: 0 |
|
mikey Send message Joined: 5 Jan 06 Posts: 1895 Credit: 9,208,737 RAC: 3,249 |
Hi all, Have you blown out the cobwebs in your machine lately? That sounds like a heat related problem, lowering the cpu usage lowers the heat output of the cpu is my thinking. |
Message boards :
Number crunching :
constant computation errors
©2024 University of Washington
https://www.bakerlab.org