Message boards : Number crunching : Can't report to server on one machine
Author | Message |
---|---|
Plasmon_attack Send message Joined: 2 May 10 Posts: 13 Credit: 15,451,384 RAC: 0 |
Hi Everyone, recently one of my computers (Win7 64-bit, dual quad core xeons, 6 GB ram, hyperthreaded, boinc 6.10.58) suddenly started having problems communicating with the server. It's still crunching work but says, "Scheduler request failed: Couldn't connect to server." Other computers on the same network are working fine. I've uninstalled and reinstalled, made sure to delete the Boinc directory, and yet when I reinstall it's like nothing happened because the same work units are ready to report, the project is still attached, etc. The error is, "Boinc couldn't do internet communication, and no default connection is selected. Please connect to the internet, or select a default connection using advanced/options/connection." I don't need a proxy or anything for my network. Any idea what to put in? Is there a better way to reinstall? I have like 300 completed workunits piled up and would like to report them. Thanks! Tony |
mikey Send message Joined: 5 Jan 06 Posts: 1895 Credit: 9,178,442 RAC: 3,202 |
Hi Everyone, recently one of my computers (Win7 64-bit, dual quad core xeons, 6 GB ram, hyperthreaded, boinc 6.10.58) suddenly started having problems communicating with the server. It's still crunching work but says, "Scheduler request failed: Couldn't connect to server." Other computers on the same network are working fine. Does the pc otherwise connect to the net just fine? And you are seeing the exact same units because that is a 'feature' that was turned on recently here at Rosie, it has been at some other projects for awhile now. If you are using a wireless connection try running a temporary wire to the machine and see if that fixes it. |
Mod.Sense Volunteer moderator Send message Joined: 22 Aug 06 Posts: 4018 Credit: 0 RAC: 0 |
Have you rebooted the PC? Have you tried to PING boinc.bakerlab.org from that machine? (perhaps the machine is having trouble reaching a DNS server to resolve the domain name) Rosetta Moderator: Mod.Sense |
Plasmon_attack Send message Joined: 2 May 10 Posts: 13 Credit: 15,451,384 RAC: 0 |
Yes, I tried rebooting first, and several times, and between installations of different versions of the BOINC client. It appears that our network admins may have disabled ping as I can't ping anything (even Google) from any computers here (note one is a laptop and ping works fine at home). Note, the network is working as there are seven other computers on the same network that are able to reach the server without an issue. There was a power outage that crashed the machine (yes it's on a backup but the outage was too long) and this computer hasn't been able to connect. I guess, is there a better way to uninstall Boinc? Clearly the uninstall doesn't do it totally because, once reinstalled, it knows it's connected to Rosetta AND still knows what work units are completed and not completed. Is there a way to wipe it off completely and start over? Thanks |
TPCBF Send message Joined: 29 Nov 10 Posts: 111 Credit: 5,085,161 RAC: 1,288 |
Yes, I tried rebooting first, and several times, and between installations of different versions of the BOINC client.I had the same problem with one of my laptops. All the sudden it wouldn't connect back to the server, getting the same error messages you mentioned. Nothing changed on the laptop itself, nothing changed on the LAN/firewall, which would effect all of my roughly one dozen PCs crunching. I had to uninstall BOINC (running only R@H) and manually deleted both the C:Program filesBOINC folder as well as the C:Documents and SettingsAll UsersApplication DataBOINC folder. Can it be you deleted only the first one but not the second one (where the "Application Data" folder is a "hidden" folder)? Once I did that, I was able to re-install (actually downloaded the binary installer from World Community Grid) and re-attach to the R@H project. Lost the three finished WU in the process though... This was on Windows XPSP3 though, on Windows 7, the second folder should of course be in "C:Users..." Ralf |
Murasaki Send message Joined: 20 Apr 06 Posts: 303 Credit: 511,418 RAC: 0 |
A few things to try:
|
TPCBF Send message Joined: 29 Nov 10 Posts: 111 Credit: 5,085,161 RAC: 1,288 |
As I mentioned, I had the very same issue and in my case *Check that the computer has a network connection - are you able to use an internet browser/send emails/view files on other machines on the network?Network and Internet connection was just fine, no problems what so ever. Only BOINC had connectivity issues, for whatever reason... *In BOINC "Advanced View" check whether you have "Network activity always available" selected in the Activity menu.Didn't make a difference. All other clients have the default of "...based on preferences" and work just fine *Attach the machine to another BOINC project and see if you can communicate with them.Catch22, you can't do that when BOINC claims that it doesn't have Internet connectivity. It's not a R@H specific issue, rather a BOINC related issue. I tried at first to re-install the client downloaded from the World Community Grid and it wouldn't work either. As I mentioned before, I had to uninstall the program as well as delete the application data folder before a re-install would work just fine as if nothing ever happened. This is definitely a snag in the BOINC software, as it happened to my on a host that was doing at that time nothing but just crunching for R@H and out of nowhere experienced those issues... Ralf |
Plasmon_attack Send message Joined: 2 May 10 Posts: 13 Credit: 15,451,384 RAC: 0 |
Umm, yes, I did check that the computer has internet access and that BOINC is allowed to do network access before asking for help. @Wolf...your situation and suggestions are a lot closer to what I think the issue is, namely, uninstall doesn't fully uninstall and leaves 'ghosts' behind that keep a fresh install from functioning. I've tried what you said and am a little further, but not quite there. I was able to locate program data in a the 'program data' folder. I also found a leftover directory in program files (x86). ProgramData has the project specific data, work units, a lot of .xml files, etc., and I wiped it out. A fresh install now prompts to setup a new project. However, it still can't communicate with the servers. It says they're temporarily unavailable (but I tried several projects) and then it gives the same network error. For Win7 are there any other hidden places the program could be leaving traces that remind if of this broken state? I've looked in the users directories (and I have hidden fiels showing) but I'm not finding anything that looks boinc related. Sorry about your 3 workunits. I've been trying to fix this for a while as all 16 cores were crunching and lost ~400 workunits :( |
dcdc Send message Joined: 3 Nov 05 Posts: 1832 Credit: 119,668,178 RAC: 10,680 |
There are a few registry entries, but I don't think they contain anything that would affect network access - AFAIK they just contain the locations of the boinc folders, but MS would be able to clarify that. Could the firewall have been set to block BOINC? Umm, yes, I did check that the computer has internet access and that BOINC is allowed to do network access before asking for help. |
Plasmon_attack Send message Joined: 2 May 10 Posts: 13 Credit: 15,451,384 RAC: 0 |
Ok, got it, I worked backwards through all versions of the BOINC client and found that 6.6.38 works. I'm back up, though I think the server is mad at me for losing the ~400 units that were completed so it only gave me one to start with (for 16 cores!). I guess eventually it'll recover. |
Murasaki Send message Joined: 20 Apr 06 Posts: 303 Credit: 511,418 RAC: 0 |
Umm, yes, I did check that the computer has internet access and that BOINC is allowed to do network access before asking for help. When someone asks for technical assistance there are two choices. One, assume it is something complex and start with the most complicated solutions. Two, decide to rule out the obvious possibilities first. Going with the second option does result in some people thinking you are an idiot for suggesting the obvious, but it is surprising how often the simple things are overlooked. |
dcdc Send message Joined: 3 Nov 05 Posts: 1832 Credit: 119,668,178 RAC: 10,680 |
Ok, got it, I worked backwards through all versions of the BOINC client and found that 6.6.38 works. I'm back up, though I think the server is mad at me for losing the ~400 units that were completed so it only gave me one to start with (for 16 cores!). I guess eventually it'll recover. good to hear it's up and running again ;) |
Message boards :
Number crunching :
Can't report to server on one machine
©2024 University of Washington
https://www.bakerlab.org