Rosetta keeps preempting

Message boards : Number crunching : Rosetta keeps preempting

To post messages, you must log in.

Previous · 1 · 2

AuthorMessage
Profile Feet1st
Avatar

Send message
Joined: 30 Dec 05
Posts: 1755
Credit: 4,690,520
RAC: 0
Message 15428 - Posted: 3 May 2006, 20:17:49 UTC - in response to Message 15426.  

I am running two projects... computer with 2 processors, one processor for each

You are thinking the way *I* do :) You are thinking that if each project has a 50% share, that the scheduler should just run a WU from each all the time, one on each CPU.

Unfortunately, BOINC doesn't see things as clearly as that. I don't fully understand why. Suffice it to say BOINC MAY decide it would LOVE to have TWO climate WUs running (for a short time), and pull down another 1yr WU!! I have MY climate set to "No new work" (Projects tab) to avoid ever having more than 1 climate WU on a PC.

BOINC actually does a very good job of meeting your objectives for resource share. You just have to look at it over the course of 100hrs of runtime or so, rather than 100 minutes.

In short, if you just leave it alone, and let it run 2 Rosetta WUs for a (short) while, it will tally up debt to climate and then run climate more to make up for it. But don't expect it to just lock on running one on each CPU. The scheduler doesn't think that way. To get close though, just leave everything alone (i.e. don't suspend WUs or projects) for a day or so, and you'll find more often then not it does just that, one project on each CPU.

...another solution would be to double your R@H resource share, so it gets 2/3rds and climate gets 1/3rd ;)
Add this signature to your EMail:
Running Microsoft's "System Idle Process" will never help cure cancer, AIDS nor Alzheimer's. But running Rosetta@home just might!
https://boinc.bakerlab.org/rosetta/
ID: 15428 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Moderator9
Volunteer moderator

Send message
Joined: 22 Jan 06
Posts: 1014
Credit: 0
RAC: 0
Message 15436 - Posted: 3 May 2006, 20:57:11 UTC - in response to Message 15426.  


Moderator9
ROSETTA@home FAQ
Moderator Contact
ID: 15436 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Rusty Lafavour

Send message
Joined: 12 Apr 06
Posts: 4
Credit: 26,391
RAC: 0
Message 15442 - Posted: 3 May 2006, 21:33:34 UTC

Thanks for the responses. Even though time is of the essence for me on the CPN. I will let what happens, happen for a couple days to see the final results of both projects. I feel good helping out on both projects, running 24/7 to help
ID: 15442 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Profile Rebirther
Avatar

Send message
Joined: 17 Sep 05
Posts: 116
Credit: 41,315
RAC: 0
Message 15446 - Posted: 3 May 2006, 21:45:44 UTC

The problem is downloading new files. So Boinc is switching projects (WUs). My experiences are to download WUs for some days and stop getting new work. So you can see that CPDN is running all the time on one CPU and Rosetta on the other. You can try this. It helped me ;)
ID: 15446 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Rollo

Send message
Joined: 2 Jan 06
Posts: 21
Credit: 106,369
RAC: 0
Message 15458 - Posted: 3 May 2006, 23:26:06 UTC
Last modified: 3 May 2006, 23:28:26 UTC

What about setting no new work for CPDN, so you have only one model running, and than increase the resource share of CPDN to more than 50%, lets say 60%. Since you have only one CPDN model, boinc cannot run that model on both cores simultaneously. So you can run CPDN effectivly at maximum of 50% share. This is lower than your setting of 60%. Therefore CPDN should always run and prevent rosetta from starting on both cores.
ID: 15458 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Rusty Lafavour

Send message
Joined: 12 Apr 06
Posts: 4
Credit: 26,391
RAC: 0
Message 15465 - Posted: 4 May 2006, 0:37:14 UTC


ID: 15465 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Profile David@home
Avatar

Send message
Joined: 7 Oct 05
Posts: 29
Credit: 185,330
RAC: 0
Message 15498 - Posted: 4 May 2006, 11:44:50 UTC

Just trying to figure out why my Rosetta WUs only run for about three hours rather than the default of four hours.

Using the default settings for Target CPU run time each WU seems to run for about three hours. This raises a couple of questions:

1) For the model steps that have not been run are these assigned to another WU so all combinations are covered or are they simply not available in the results database?

2) Why does Rosetta decide not to run the fourth hour? Is it because it thinks it can not complete the fourth hour in the available Switch between applications every 60 minutes slot?

Thanks





ID: 15498 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Astro
Avatar

Send message
Joined: 2 Oct 05
Posts: 987
Credit: 500,253
RAC: 0
Message 15499 - Posted: 4 May 2006, 11:49:48 UTC
Last modified: 4 May 2006, 11:51:37 UTC

I use the default "not selected" and I see the same thing on 3 puters. Don't know why. my amd64 3700's will usually to 20 models/hour, so that shouldn't make it stop at 3 hours.
ID: 15499 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Profile Feet1st
Avatar

Send message
Joined: 30 Dec 05
Posts: 1755
Credit: 4,690,520
RAC: 0
Message 15510 - Posted: 4 May 2006, 15:41:33 UTC - in response to Message 15498.  

Just trying to figure out why my Rosetta WUs only run for about three hours rather than the default of four hours.

Using the default settings for Target CPU run time each WU seems to run for about three hours. This raises a couple of questions:

1) For the model steps that have not been run are these assigned to another WU so all combinations are covered or are they simply not available in the results database?

2) Why does Rosetta decide not to run the fourth hour? Is it because it thinks it can not complete the fourth hour in the available Switch between applications every 60 minutes slot?


The switch between applications is basically a guideline more than a rule. The "switch time" is really just telling BOINC how frequently you want it to reconsider what should be running. Often times it decides to continue running what it has going already. But I believe it ALSO reconsiders when a WU completes.

As for why they complete is 3 hours timeframe rather than 4... check how many models you got crunched in what time. Divide it out and figure out how many minutes per model, and if you add one more model to the existing runtime, if you cross over the 4hr preference, then you've got it with #2, "it thinks it can not complete" another full model in under your 4hr objective. So, it ends now instead.

The client application (Rosetta in our case-at-hand) does not KNOW when BOINC may rip the CPU away from it. It is blissfully unaware of the "switch between application every" setting. This is why the new checkpoint is such a huge great deal!

As for point #1, it's really a huge space, the number of models POSSIBLE on a WU. And so they randomize it. The project isn't depending upon you to crunch these 20 given models or whatever. It's trying to get the install-base to crunch on the order of a million models at random points in a space where there are at least hundreds of millions of models possible.

As you might expect, with this randomness there is a small amount of duplication where two clients strike upon the same starting value and essentially crunch the same exact model. But the space it so vast that this duplication is still less than 5%. If they had 100x more crunching power and were going for 100,000,000 models instead, then they might have to assign it out to eliminate duplication, but we're not there yet.

(all that info. is based on other posts I've read. I don't really have any other knowledge about it).
Add this signature to your EMail:
Running Microsoft's "System Idle Process" will never help cure cancer, AIDS nor Alzheimer's. But running Rosetta@home just might!
https://boinc.bakerlab.org/rosetta/
ID: 15510 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
AMD_is_logical

Send message
Joined: 20 Dec 05
Posts: 299
Credit: 31,460,681
RAC: 0
Message 15517 - Posted: 4 May 2006, 16:28:40 UTC - in response to Message 15498.  

Using the default settings for Target CPU run time each WU seems to run for about three hours.


Those WUs probably have their default set to three hours.

ID: 15517 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Previous · 1 · 2

Message boards : Number crunching : Rosetta keeps preempting



©2024 University of Washington
https://www.bakerlab.org