Comments/questions on Rosetta@home journal

Author	Message
Feet1st Send message Joined: 30 Dec 05 Posts: 1755 Credit: 4,690,520 RAC: 0	Message 13189 - Posted: 7 Apr 2006, 21:37:00 UTC - in response to Message 13184. I always do a nice suspend and wait until it clears from memory dag was talking about suspending the projects in the BOINC manager I presume. You might want to try using the hybernate shutdown on your portable. This basically pushes everything in memory out to disk, leaves the programs active and when you power up again, everything is where you left off. Not sure how that would work for BOINC. But it sure works great for word processors, browsers etc. Add this signature to your EMail: Running Microsoft's "System Idle Process" will never help cure cancer, AIDS nor Alzheimer's. But running Rosetta@home just might! https://boinc.bakerlab.org/rosetta/ ID: 13189 · Rating: 0 · rate: /

dag Send message Joined: 16 Dec 05 Posts: 106 Credit: 1,000,020 RAC: 0	Message 13245 - Posted: 8 Apr 2006, 15:58:20 UTC - in response to Message 13189. I always do a nice suspend and wait until it clears from memory ... You might want to try using the hybernate shutdown on your portable. ... Putting it in standby mode may work for a Linux system, but we're talking about Windoz here. I depend on the daily reboot to clear the accumulated cruft. Also, it doesn't seem to do well with going to sleep on a docking station with wired LAN and waking up all alone with wireless LAN, different keyboard, mouse, etc., and vise-versa. (XP Pro w/SP2+) dag dag --Finding aliens is cool, but understanding the structure of proteins is useful. ID: 13245 · Rating: 0 · rate: /

BennyRop Send message Joined: 17 Dec 05 Posts: 555 Credit: 140,800 RAC: 0	Message 13365 - Posted: 9 Apr 2006, 21:30:38 UTC Would it be possible to run sample batches of new WUs through Ralph, and to post a notice here that a test run on Ralph is happening - i.e. it's own moderator-only thread (so those that shut off Ralph when they see no work can turn it back on?) But these failures bring up a science related question or two. When looking at last month's WUs that ended up failing in less than a minute on 2Ghz machines, almost all of the ones reported failed 3 times and were discarded. At least one of them got picked up after two Windows client failures and finished fine on a Linux client. Was this because mostly because the Win clients so far outnumber the Linux clients, or more of the fact that the Linux clients were actually producing results for 2+ hours, while the Win clients were failing almost instantly and grabbing more WUs to fail? What's different with the clients that allows Mac and Linux clients to handle these WUs that almost instantly fail on Windows clients? What's different with the WUs themselves? I assume that biology doesn't allow amino acid/protein chains #87 and #88 in a 109 AA protein chain to be encrypted with "Bill Gates Suxx0rs" :) It'd be funny if it could be proven, but not likely. grin Are these larger or smaller than the WUs we've been working on - or have some unique visual feature? i.e. it looks like two plates held together in the middle with a few atoms that look like a rod - a yo-yo. (Yes, I know that's probably not even possible.) You've mentioned that for some WUs that we've turned in 10,000 models for, that we haven't come close to the native structure. (You mentioned approaches that will allow us to go through the process twice.. and hopefully get close to those structures without having to run through 100,000 models/decoys or more..) I see at C562_EColi (the last result on the home page) that it lists the size as 106 AA; which is how we got used to having protein's size described at DF. At what size does the Rosetta client fail to get close enough to the native structure with 10,000 models/decoys? Or is it just that the longer the protein chain is, the more likely it has complex structures? ID: 13365 · Rating: 0 · rate: /

R/B Send message Joined: 8 Dec 05 Posts: 195 Credit: 28,095 RAC: 0	Message 13449 - Posted: 11 Apr 2006, 12:51:57 UTC I read a post over in setiathome about Rosetta being mentioned in the newsletter of 'Livescience'. Good job, guys. Founder of BOINC GROUP - Objectivists - Philosophically minded rational data crunchers. ID: 13449 · Rating: 0 · rate: /

dgnuff Send message Joined: 1 Nov 05 Posts: 350 Credit: 24,773,605 RAC: 0	Message 13580 - Posted: 12 Apr 2006, 21:20:04 UTC Mostly to David Baker, but if you're looking in, the last couple of posts you've made in your journal cover 95% of the reason I'm here. There's somthing I really like about being in right at startup, and knowing that the crunching I'm doing is helping to actually fine tune the tools that someday might find a cure for HIV, or Malaria. Keeping that in mind, in your last note you talk about sidechain sampling. Is it possible to explain in layman's terms what this is? So far, you've been able to do a great job of explaining the search methodologies, talking about explorerers on a planet, or explaining how the HIV search will go: trying to design a custom protein that will allow our immune system to generate antibodies effective against the "static" portions of the virus. Etc. Any chance of an explanation of that sort for sidechains? Enquiring minds want to know. :) ID: 13580 · Rating: 0 · rate: /

David Baker Volunteer moderator Project administrator Project developer Project scientist Send message Joined: 17 Sep 05 Posts: 705 Credit: 559,847 RAC: 0	Message 13618 - Posted: 13 Apr 2006, 5:08:06 UTC - in response to Message 13580. Mostly to David Baker, but if you're looking in, the last couple of posts you've made in your journal cover 95% of the reason I'm here. There's somthing I really like about being in right at startup, and knowing that the crunching I'm doing is helping to actually fine tune the tools that someday might find a cure for HIV, or Malaria. Keeping that in mind, in your last note you talk about sidechain sampling. Is it possible to explain in layman's terms what this is? So far, you've been able to do a great job of explaining the search methodologies, talking about explorerers on a planet, or explaining how the HIV search will go: trying to design a custom protein that will allow our immune system to generate antibodies effective against the "static" portions of the virus. Etc. Any chance of an explanation of that sort for sidechains? Enquiring minds want to know. :) Sure (and thanks for the words of encouragement!). sorry about the jargon. proteins are chains of amino acids. there are 20 types of amino acids. they have a common backbone, but differ in the "sidechains". when they are spliced together, the resulting protein has a linear "backbone" with "sidechains" coming off, one for each amino acid. these sidechains can adopt a number of different possible conformations (shapes). the improvement is to try out many different possibiities for each of these shapes at each step in our sampling processs. as I said in the journal, this helps the search, but before we can deploy this on rosetta@home we need to track down and fix the windows specific problem. ID: 13618 · Rating: 0 · rate: /

Grutte Pier [Wa Oars]~MAB The Frisian Send message Joined: 6 Nov 05 Posts: 87 Credit: 497,588 RAC: 0	Message 13635 - Posted: 13 Apr 2006, 12:37:16 UTC - in response to Message 13619. Last modified: 13 Apr 2006, 12:39:14 UTC Good news today: second, David Kim has awarded credits to those who lost valuable time during the problems last weekend. Suppose it will be handled just as fast as those credits for time exceeding WU's and in that case : just don't bother as I will not bother to report them anymore. Already written those off. Counting the days "17". ID: 13635 · Rating: -2 · rate: /

Cureseekers~Kristof Send message Joined: 5 Nov 05 Posts: 80 Credit: 689,603 RAC: 0	Message 14470 - Posted: 23 Apr 2006, 11:54:46 UTC I'm curious to see the source code. When I read in the e-mails the names of the functions [get_the_hell_out()] I guess the complete source code must be an equivalent :p Member of Dutch Power Cows ID: 14470 · Rating: 0 · rate: /

Feet1st Send message Joined: 30 Dec 05 Posts: 1755 Credit: 4,690,520 RAC: 0	Message 14481 - Posted: 23 Apr 2006, 14:51:39 UTC Could the watchdog message be revised? Rather than: Rosetta score stayed the same too long. Watchdog is killing the run! How about: Rosetta score stayed the same too long. Watchdog is ending the run! or, keeping inline with the programmer humor going here, perhaps: Watchdog is barking. Postman ending delivery. Add this signature to your EMail: Running Microsoft's "System Idle Process" will never help cure cancer, AIDS nor Alzheimer's. But running Rosetta@home just might! https://boinc.bakerlab.org/rosetta/ ID: 14481 · Rating: 0 · rate: /

Rhiju Volunteer moderator Send message Joined: 8 Jan 06 Posts: 223 Credit: 3,546 RAC: 0	Message 14638 - Posted: 26 Apr 2006, 7:51:50 UTC - in response to Message 14481. How embarrassing... both the watchdog "killing" and get_the_hell_out() are my silly recent contributions. Its not representative of the rest of the code -- I didn't expect these error messages and functions to get broadcast to such a wide audience! I assure you that the rest of the code is totally dry and scientific. We'll change "killing" to "ending" in the release after the next (I read the message too late). Could the watchdog message be revised? Rather than: Rosetta score stayed the same too long. Watchdog is killing the run! How about: Rosetta score stayed the same too long. Watchdog is ending the run! or, keeping inline with the programmer humor going here, perhaps: Watchdog is barking. Postman ending delivery. ID: 14638 · Rating: 0 · rate: /

BennyRop Send message Joined: 17 Dec 05 Posts: 555 Credit: 140,800 RAC: 0	Message 14639 - Posted: 26 Apr 2006, 8:20:47 UTC I had a friend that worked in tech support at Microsoft - who had a noose dangling from the ceiling and inside the noose was a picture frame. It was in reference to "My windows is hung." Now I can picture Rhiu's workspace with a 12 inch blade dangling from the ceiling and stained in catchup.. :) In reference to "terminating a process with extreme prejudice".. grin Here's hoping the code helps eliminate most of the remaining bugs that are frustrating the users.. so we can run through CASP with as few problems as possible. ID: 14639 · Rating: 0 · rate: /

Cureseekers~Kristof Send message Joined: 5 Nov 05 Posts: 80 Credit: 689,603 RAC: 0	Message 14640 - Posted: 26 Apr 2006, 8:22:31 UTC Why embarrassing? I like the name of the function ;) Member of Dutch Power Cows ID: 14640 · Rating: 0 · rate: /

tralala Send message Joined: 8 Apr 06 Posts: 376 Credit: 581,806 RAC: 0	Message 14736 - Posted: 27 Apr 2006, 8:16:31 UTC Just a comment to Davids last entry. I would strongly suggest to delay the release of the new app till Monday and use the Weekend for thorough testing. Failing WUs are a real burden for the project since they are sent out again for several weeks. We all hope that this won't happen with the new watchdog but you never know. ID: 14736 · Rating: 0 · rate: /

Cureseekers~Kristof Send message Joined: 5 Nov 05 Posts: 80 Credit: 689,603 RAC: 0	Message 14738 - Posted: 27 Apr 2006, 9:39:31 UTC I agree with tralala! If there happens anything now, the project will lose more power than with 2 days delay of a new version... Member of Dutch Power Cows ID: 14738 · Rating: 0 · rate: /

rbpeake Send message Joined: 25 Sep 05 Posts: 168 Credit: 247,828 RAC: 0	Message 14756 - Posted: 27 Apr 2006, 14:41:44 UTC Imho, a weekend release is OK if the project team has a strong feeling of confidence in the new release, and importantly, if someone on the Project Team can commit to monitoring the release very carefully over the entire weekend. In this way if problems do arise, someone on the Team can jump in immediately and make fixes before things snowball and turn into big problems. Thanks! Regards, Bob P. ID: 14756 · Rating: 0 · rate: /

Team_Elteor_Borislavj~Intelligence Send message Joined: 7 Dec 05 Posts: 14 Credit: 56,027 RAC: 0	Message 14842 - Posted: 28 Apr 2006, 9:05:09 UTC And he's glad to see to number of pc's rising, i think it will drop a bit at the end of the Stampede ID: 14842 · Rating: 0 · rate: /

hugothehermit Send message Joined: 26 Sep 05 Posts: 238 Credit: 314,893 RAC: 0	Message 15392 - Posted: 3 May 2006, 6:22:51 UTC Last modified: 3 May 2006, 6:27:50 UTC If you know the amino acid sequances, and you know the structures of mutiple proteins then why don't you combine the two? Like this: amino acid n is usually folded x fasion. If you see my point, you could make a database of all the known structures for that sequence, then weight the probability of an amio acid structure, so as to have a most probable to least probable structure for each of the amino acids sequances, test each to find the energy levels for the overall protein that you are trying to predict. Of course this wouldn't find a structure that to date is unknown but it maybe a shortcut for the others. You may do something like this already. It was just a thought that struck me so I thought I'd bounce it off the R@H team. Edit: clarity ID: 15392 · Rating: 0 · rate: /

Ethan Volunteer moderator Send message Joined: 22 Aug 05 Posts: 286 Credit: 9,304,700 RAC: 0	Message 15393 - Posted: 3 May 2006, 6:43:25 UTC - in response to Message 15392. Last modified: 3 May 2006, 6:46:54 UTC I believe this is what they try to do. . but consider the following. . if you have a 20 aa (amino acid) sequence. . you might know how the first and second interact by themselves. . but what happens when 6 and 7 in the chain loop around and interact with the start? Now, an assumption you made at the beggining (they fold like x) is not true since something external is acting opon them. I think you're right in that it's worth having a database of these types of things. . and I seem to remember the lab folks mentioning that is what took up some of our disk space. . . but: I could be wrong :) -E ID: 15393 · Rating: 0 · rate: /

Bin Qian Send message Joined: 13 Jul 05 Posts: 33 Credit: 36,897 RAC: 0	Message 15459 - Posted: 3 May 2006, 23:32:25 UTC - in response to Message 15392. Great thoughts! You are definitely on the right track of a general protocol for structure prediction. The approach you described - the comparative modeling approach - is actually being routinely used by biologists nowadays. One aspect of the Rosetta@Home project is to develope methods to improve the template-based structure models. Some of the work units you are currently running are doing extractly this type of work! You can read a little more about these work units here. If you know the amino acid sequances, and you know the structures of mutiple proteins then why don't you combine the two? Like this: amino acid n is usually folded x fasion. If you see my point, you could make a database of all the known structures for that sequence, then weight the probability of an amio acid structure, so as to have a most probable to least probable structure for each of the amino acids sequances, test each to find the energy levels for the overall protein that you are trying to predict. Of course this wouldn't find a structure that to date is unknown but it maybe a shortcut for the others. You may do something like this already. It was just a thought that struck me so I thought I'd bounce it off the R@H team. Edit: clarity ID: 15459 · Rating: 0 · rate: /

hugothehermit Send message Joined: 26 Sep 05 Posts: 238 Credit: 314,893 RAC: 0	Message 15486 - Posted: 4 May 2006, 7:21:05 UTC Last modified: 4 May 2006, 7:27:04 UTC Thanks for the info Dr(?) Bin Qian I hadn't even thought of using the families of the protein, because I didn't know they existed :), very clever I must say. That begs the question: Is it possible to group all of the unknown proteins into families? I had a look on wikipedia X-Ray Crystallography (under Biological structures) and found that ~100 proteins have been solved by electron microscope, so is it possible to get another Rosetta shortcut by getting a rough estimate of what the protein shape or family is this way or perhaps a even early data from x-ray crystallography, or it's chemical reactions etc... ? As I understand it, any type of reduction in the search space is a very good thing. That's enough for me, I'm using your time that is better spent elsewhere. Edit to make more sense ID: 15486 · Rating: 0 · rate: /