BOINC backing off too long

Message boards : Number crunching : BOINC backing off too long

To post messages, you must log in.

AuthorMessage
Profile Gerry Rough
Avatar

Send message
Joined: 2 Jan 06
Posts: 111
Credit: 1,389,340
RAC: 0
Message 32960 - Posted: 20 Dec 2006, 14:35:57 UTC

This is a continuation of another thread.

I have been having problems with my laptop backing off comunications with BOINC for a week. This happens when I am working on my laptop but not connected to the internet for hours at a time. When BOINC fails to find a scheduler four consecutive times, it backs off for an entire week. Here are the final messages:

12/20/2006 8:36:33 AM|rosetta@home|Scheduler request failed: couldn't resolve host name
12/20/2006 8:36:33 AM|rosetta@home|Deferring scheduler requests for 1 minutes and 0 seconds
12/20/2006 8:37:33 AM|rosetta@home|Fetching scheduler list
12/20/2006 8:37:38 AM|rosetta@home|Scheduler list fetch failed: http error
12/20/2006 8:37:38 AM|rosetta@home|4 consecutive failures fetching scheduler list - deferring 604800 seconds
12/20/2006 8:37:38 AM|rosetta@home|Deferring scheduler requests for 1 weeks, 0 days, 0 hours, 0 minutes and 0 seconds

This is the BOINC version of evil itself!!

I did change the settings to => Run based on preferences, but that didn't help. The messages above are the latest result from that experiment. I didn't used to have this problem, not sure when it started

(Click for detailed stats)
ID: 32960 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
FluffyChicken
Avatar

Send message
Joined: 1 Nov 05
Posts: 1260
Credit: 369,635
RAC: 0
Message 32962 - Posted: 20 Dec 2006, 14:51:57 UTC

Just thought, since this is a boinc problem you could use the new help system
http://boinc.berkeley.edu/help.php

If you want to go to the man at the top, look for David Anderson ;-)

You'll need skype (which is free).

all I can think of is there is a boinc bug occuring.

(i.e could be fixed by using the current development version 5.8.x)
Team mauisun.org
ID: 32962 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Profile Feet1st
Avatar

Send message
Joined: 30 Dec 05
Posts: 1755
Credit: 4,690,520
RAC: 0
Message 32969 - Posted: 20 Dec 2006, 15:51:52 UTC

So what BOINC version is doing that?

If you basically only have network access when you are at the PC, you might work around the problem by suspending network activity when you leave your machine. Then get connected and select that the network is always available. This is done in your Activity tab.
Add this signature to your EMail:
Running Microsoft's "System Idle Process" will never help cure cancer, AIDS nor Alzheimer's. But running Rosetta@home just might!
https://boinc.bakerlab.org/rosetta/
ID: 32969 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Profile Gerry Rough
Avatar

Send message
Joined: 2 Jan 06
Posts: 111
Credit: 1,389,340
RAC: 0
Message 32972 - Posted: 20 Dec 2006, 17:13:53 UTC - in response to Message 32969.  
Last modified: 20 Dec 2006, 17:14:53 UTC

So what BOINC version is doing that?

If you basically only have network access when you are at the PC, you might work around the problem by suspending network activity when you leave your machine. Then get connected and select that the network is always available. This is done in your Activity tab.


The version is 5.4.9. Actually, I have network access when I am at home and not at school doing homework and such for class. I am suspending network activity as you suggest, and seeing what happens over the next five or six hours. If this works, I will have to remember to go to the activity dialog when I return and plug in again.


(Click for detailed stats)
ID: 32972 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Profile Jack Shaftoe
Avatar

Send message
Joined: 30 Apr 06
Posts: 115
Credit: 1,307,916
RAC: 0
Message 32991 - Posted: 20 Dec 2006, 22:59:05 UTC - in response to Message 32972.  

So what BOINC version is doing that?


The version is 5.4.9.


You might try 5.4.11 Gerry and see if the problem persists.
ID: 32991 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Profile Gerry Rough
Avatar

Send message
Joined: 2 Jan 06
Posts: 111
Credit: 1,389,340
RAC: 0
Message 33005 - Posted: 21 Dec 2006, 1:58:22 UTC - in response to Message 32991.  

So what BOINC version is doing that?


The version is 5.4.9.


You might try 5.4.11 Gerry and see if the problem persists.


It seems that Feet's idea seems to have done the trick: I've been watching to see if the same problem has been replicated, and it has not returned after nearly twelve hours of crunching. I think I will have to remember to go to the activity dialog when I return and plug in again. Apart from that, if I remember correctly, the only difference between 5.4.9 and 5.4.11 is a bug fix to a Norton Antivirus problem.

(Click for detailed stats)
ID: 33005 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Astro
Avatar

Send message
Joined: 2 Oct 05
Posts: 987
Credit: 500,253
RAC: 0
Message 33008 - Posted: 21 Dec 2006, 2:12:51 UTC - in response to Message 33005.  

the only difference between 5.4.9 and 5.4.11 is a bug fix to a Norton Antivirus problem.

Close.

Mcafee
ID: 33008 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Profile Feet1st
Avatar

Send message
Joined: 30 Dec 05
Posts: 1755
Credit: 4,690,520
RAC: 0
Message 33050 - Posted: 21 Dec 2006, 16:10:12 UTC

By disabling the network access, you're just preventing BOINC from making those attempts to connect, and thus avoid the 4 strikes and you're out for a week. I still do not understand that one. Mine does not behave that way. You might flag the project for "no new tasks", work through what you've got, get it all reported, and then reset the project. Perhaps there's some historical data that makes BOINC think your history warrents such a one week delay. The project reset should get you back to settings with no such history (i.e. more like everyone else).

If that doesn't change it, since you have one more thing to remember, you want to insulate yourself a bit so that you continue to crunch even if you forget once and a while to enable BOINC to connect to the network. You can do this simply by increasing your "cache" of work. This is done in General Preferences by increasing the "connect to network every ... days" setting. If you have 1.0 days of work for example, then you would have time to forget one day when you get back from class, and still have work to keep you busy through morning.
Add this signature to your EMail:
Running Microsoft's "System Idle Process" will never help cure cancer, AIDS nor Alzheimer's. But running Rosetta@home just might!
https://boinc.bakerlab.org/rosetta/
ID: 33050 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Profile Gerry Rough
Avatar

Send message
Joined: 2 Jan 06
Posts: 111
Credit: 1,389,340
RAC: 0
Message 33052 - Posted: 21 Dec 2006, 16:18:11 UTC - in response to Message 33050.  

By disabling the network access, you're just preventing BOINC from making those attempts to connect, and thus avoid the 4 strikes and you're out for a week. I still do not understand that one. Mine does not behave that way. You might flag the project for "no new tasks", work through what you've got, get it all reported, and then reset the project. Perhaps there's some historical data that makes BOINC think your history warrents such a one week delay. The project reset should get you back to settings with no such history (i.e. more like everyone else).

If that doesn't change it, since you have one more thing to remember, you want to insulate yourself a bit so that you continue to crunch even if you forget once and a while to enable BOINC to connect to the network. You can do this simply by increasing your "cache" of work. This is done in General Preferences by increasing the "connect to network every ... days" setting. If you have 1.0 days of work for example, then you would have time to forget one day when you get back from class, and still have work to keep you busy through morning.


Will try and see what happens. :)


(Click for detailed stats)
ID: 33052 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Profile Gerry Rough
Avatar

Send message
Joined: 2 Jan 06
Posts: 111
Credit: 1,389,340
RAC: 0
Message 33683 - Posted: 29 Dec 2006, 15:38:20 UTC

Actually, after I did as you suggested, I thought the problem might be fixed after I unplugged R@H for about 7 hours to see what the messages were going to look like. Everything seems to work fine: it just kept backing off for a few hours at most, usually 30 minutes to a couple of hours. My cache, by the way, has been 1 day for probably over a year now. Sure enough, now that E@H is off line, now the problem is back with E@H: 4 failures and now I can't crunch for a week unless I reset or manually communicate like below in this thread.

I was thinking this morning that it might be that once boinc sees that the servers are not communicating, as with E@H now, they go into a "project is down" mode, meaning they query the server once per week until the project is up again, much like P@H. Don't know why this is so for my computer in such a short period of time, though. About the only thing I can think of is to uninstal/reinstall. But then again, since there is a new version coming out soon, I think I will wait to do that when I do the upgrade on that host. Any other suggestions?

(Click for detailed stats)
ID: 33683 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
FluffyChicken
Avatar

Send message
Joined: 1 Nov 05
Posts: 1260
Credit: 369,635
RAC: 0
Message 33684 - Posted: 29 Dec 2006, 15:53:17 UTC - in response to Message 33683.  

Actually, after I did as you suggested, I thought the problem might be fixed after I unplugged R@H for about 7 hours to see what the messages were going to look like. Everything seems to work fine: it just kept backing off for a few hours at most, usually 30 minutes to a couple of hours. My cache, by the way, has been 1 day for probably over a year now. Sure enough, now that E@H is off line, now the problem is back with E@H: 4 failures and now I can't crunch for a week unless I reset or manually communicate like below in this thread.

I was thinking this morning that it might be that once boinc sees that the servers are not communicating, as with E@H now, they go into a "project is down" mode, meaning they query the server once per week until the project is up again, much like P@H. Don't know why this is so for my computer in such a short period of time, though. About the only thing I can think of is to uninstal/reinstall. But then again, since there is a new version coming out soon, I think I will wait to do that when I do the upgrade on that host. Any other suggestions?


Give 5.8.0 a go now ;-)

Team mauisun.org
ID: 33684 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Profile Gerry Rough
Avatar

Send message
Joined: 2 Jan 06
Posts: 111
Credit: 1,389,340
RAC: 0
Message 33687 - Posted: 29 Dec 2006, 16:23:09 UTC - in response to Message 33684.  

Actually, after I did as you suggested, I thought the problem might be fixed after I unplugged R@H for about 7 hours to see what the messages were going to look like. Everything seems to work fine: it just kept backing off for a few hours at most, usually 30 minutes to a couple of hours. My cache, by the way, has been 1 day for probably over a year now. Sure enough, now that E@H is off line, now the problem is back with E@H: 4 failures and now I can't crunch for a week unless I reset or manually communicate like below in this thread.

I was thinking this morning that it might be that once boinc sees that the servers are not communicating, as with E@H now, they go into a "project is down" mode, meaning they query the server once per week until the project is up again, much like P@H. Don't know why this is so for my computer in such a short period of time, though. About the only thing I can think of is to uninstal/reinstall. But then again, since there is a new version coming out soon, I think I will wait to do that when I do the upgrade on that host. Any other suggestions?


Give 5.8.0 a go now ;-)


I'm sort of queezy about test versions. Is the new version pretty stable now?


(Click for detailed stats)
ID: 33687 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Astro
Avatar

Send message
Joined: 2 Oct 05
Posts: 987
Credit: 500,253
RAC: 0
Message 33688 - Posted: 29 Dec 2006, 16:27:02 UTC

I'm an official alpha tester, I've not seen anything that would be considered a "showstopper". I am checking out an issue with 5.7.5 for linux though.

At worst you just uninstall and reinstall the previous version. No work should be lost, so you risk practically nothing in giving it a "go" as it were.

tony
ID: 33688 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Tom Philippart
Avatar

Send message
Joined: 29 May 06
Posts: 183
Credit: 834,667
RAC: 0
Message 33691 - Posted: 29 Dec 2006, 16:29:57 UTC

I run 5.8.0 since it was released, I didn't notice any bugs so far. It's stable.
ID: 33691 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote

Message boards : Number crunching : BOINC backing off too long



©2024 University of Washington
https://www.bakerlab.org