Message boards : Rosetta@home Science : Comments/questions on Rosetta@home journal
Previous · 1 . . . 3 · 4 · 5 · 6 · 7 · 8 · 9 . . . 10 · Next
Author | Message |
---|---|
rbpeake Send message Joined: 25 Sep 05 Posts: 168 Credit: 247,828 RAC: 0 |
Imho, a weekend release is OK if the project team has a strong feeling of confidence in the new release, and importantly, if someone on the Project Team can commit to monitoring the release very carefully over the entire weekend. In this way if problems do arise, someone on the Team can jump in immediately and make fixes before things snowball and turn into big problems. Thanks! Regards, Bob P. |
Team_Elteor_Borislavj~Intelligence Send message Joined: 7 Dec 05 Posts: 14 Credit: 56,027 RAC: 0 |
And he's glad to see to number of pc's rising, i think it will drop a bit at the end of the Stampede |
hugothehermit Send message Joined: 26 Sep 05 Posts: 238 Credit: 314,893 RAC: 0 |
If you know the amino acid sequances, and you know the structures of mutiple proteins then why don't you combine the two? Like this: amino acid n is usually folded x fasion. If you see my point, you could make a database of all the known structures for that sequence, then weight the probability of an amio acid structure, so as to have a most probable to least probable structure for each of the amino acids sequances, test each to find the energy levels for the overall protein that you are trying to predict. Of course this wouldn't find a structure that to date is unknown but it maybe a shortcut for the others. You may do something like this already. It was just a thought that struck me so I thought I'd bounce it off the R@H team. Edit: clarity |
Ethan Volunteer moderator Send message Joined: 22 Aug 05 Posts: 286 Credit: 9,304,700 RAC: 0 |
I believe this is what they try to do. . but consider the following. . if you have a 20 aa (amino acid) sequence. . you might know how the first and second interact by themselves. . but what happens when 6 and 7 in the chain loop around and interact with the start? Now, an assumption you made at the beggining (they fold like x) is not true since something external is acting opon them. I think you're right in that it's worth having a database of these types of things. . and I seem to remember the lab folks mentioning that is what took up some of our disk space. . . but: I could be wrong :) -E |
Bin Qian Send message Joined: 13 Jul 05 Posts: 33 Credit: 36,897 RAC: 0 |
Great thoughts! You are definitely on the right track of a general protocol for structure prediction. The approach you described - the comparative modeling approach - is actually being routinely used by biologists nowadays. One aspect of the Rosetta@Home project is to develope methods to improve the template-based structure models. Some of the work units you are currently running are doing extractly this type of work! You can read a little more about these work units here. If you know the amino acid sequances, and you know the structures of mutiple proteins then why don't you combine the two? |
hugothehermit Send message Joined: 26 Sep 05 Posts: 238 Credit: 314,893 RAC: 0 |
Thanks for the info Dr(?) Bin Qian I hadn't even thought of using the families of the protein, because I didn't know they existed :), very clever I must say. That begs the question: Is it possible to group all of the unknown proteins into families? I had a look on wikipedia X-Ray Crystallography (under Biological structures) and found that ~100 proteins have been solved by electron microscope, so is it possible to get another Rosetta shortcut by getting a rough estimate of what the protein shape or family is this way or perhaps a even early data from x-ray crystallography, or it's chemical reactions etc... ? As I understand it, any type of reduction in the search space is a very good thing. That's enough for me, I'm using your time that is better spent elsewhere. Edit to make more sense |
eberndl Send message Joined: 17 Sep 05 Posts: 47 Credit: 3,062,163 RAC: 2,108 |
Hugo, From what I understand, the whole point is for Rosetta to be able to predict the structure WITHOUT using the X-ray data. As it stands, that data (or data from protein NMR) is what provides the images in the "Native" box of the screen saver. Although being able to start with a good approximation is very important, it's also important to be able to start from scratch, because most proteins don't have homologues that have been solved yet. Questions? Try the Wiki! Take a look inside my brain |
tralala Send message Joined: 8 Apr 06 Posts: 376 Credit: 581,806 RAC: 0 |
Hugo, But as more proteins get solved the more homologues you will find. Somewhere they mentioned that they perhaps can use "cheap" experimental data such as visual microscopic data to have a rough estimation how the protein will look like. If this computationally aproach shall ever be useful they need to be able to predict the structure with limited computing power. Either by finding more folding "rules" or algorithms which will restrain the conformational space eihter by combining it with easy to obtain experimental data. |
TioSuper Send message Joined: 2 May 06 Posts: 17 Credit: 164 RAC: 0 |
How embarrassing... both the watchdog "killing" and get_the_hell_out() are my silly recent contributions. Its not representative of the rest of the code -- I didn't expect these error messages and functions to get broadcast to such a wide audience! I assure you that the rest of the code is totally dry and scientific. We'll change "killing" to "ending" in the release after the next (I read the message too late). Dry and scientific usually is boring. Anyhow, a quirky and sometimes macabre sense of humour is a sign of genius. :) Keep the good work and please don't let your humorous side be destroyed by science. ( Or science as dry and unhumorous people think it should be) |
Feet1st Send message Joined: 30 Dec 05 Posts: 1755 Credit: 4,690,520 RAC: 0 |
From what I understand, the whole point is for Rosetta to be able to predict the structure WITHOUT using the X-ray data. Hugo made reference to ...early data from x-ray crystallography, or it's chemical reactions etc. I read an article which I wanted to link, but still haven't found it. It explained more about the X-ray process. The nutshell I got out of it is that there really is no "early" or "preliminary data". You either have done the complete X-ray study, or you haven't. Once you get the thing to crystilize (which is the hard part I guess, can take months and is considered more of an art than a science), then you take it to a one-of-a-kind huge X-ray device with superpowers and supercomputers, and you scan it from all angles all in an afternoon. The huge expense is that this is no simple hospital X-Ray machine. And it still takes hours of scans, I think the article said they scan every 2 degrees of rotation, to get the 3D image. So, even if you had the BILLION proteins all crystilized, it would STILL take WAY too long (billions of hours) to X-Ray all of them. As for chemical reactions... I believe that is how they know the sequence that the amino acids comprising the protein are in. But, ya, perhaps they could do some other chemistry on it to infer some broader info. to get clues as to how they are arranged. Add this signature to your EMail: Running Microsoft's "System Idle Process" will never help cure cancer, AIDS nor Alzheimer's. But running Rosetta@home just might! https://boinc.bakerlab.org/rosetta/ |
eberndl Send message Joined: 17 Sep 05 Posts: 47 Credit: 3,062,163 RAC: 2,108 |
So, even if you had the BILLION proteins all crystilized, it would STILL take WAY too long (billions of hours) to X-Ray all of them. Feet1st, you're right it does take hours to Xray each crystal, but it can take weeks or MONTHS to figure out how to crystallize a protein for the first time. And once you have the xrays... you have to run that through a massive computer program (another couple hours), and what that will give is an electron density map, which looks like this: But all those solid yellow lines have to be added BY HAND to the density map. Once this first estimate of the protein is obtained, they do a theoretical Xray on their model protein, and look at the differences between the actual (from the crystal) and experimental (from the model) scatter graphs, and then they go through and try to reduce the differences as best they can. again, they have to do this by hand. As for chemical reactions... I believe that is how they know the sequence that the amino acids comprising the protein are in. But, ya, perhaps they could do some other chemistry on it to infer some broader info. to get clues as to how they are arranged. There are 2 main ways to figure out the sequence of a protein: Get the cDNA and convert it to protein, or take the protein and do tandem Mass Spectroscopy on it, which can give you the actual protein sequence (except that they can't tell leucine and isoleucine apart due to their identical sizes). Questions? Try the Wiki! Take a look inside my brain |
Feet1st Send message Joined: 30 Dec 05 Posts: 1755 Credit: 4,690,520 RAC: 0 |
Always good stuff from eberndl. I always learn more from your posts. Thanks. If anyone else remembers the article, it talked about an entreprenure that's working on methods of doing the x-ray crystallography FASTER. And how each protein can cost $100,000s of USD. Anyway around it, existing approaches will take many thousands of years at the current pace, and after that, you still don't really know how what you need to know to DESIGN a protein that will target HIV or Alzheimer's. You'd have to experimentally create a bunch of proteins, then do the crystallography and then see if they happen to be a shape that will dock with your target protein. This is why the TOP7 is such a unique thing. An artificial protein. They actually planned it out, and BUILT a protein, and (I think) it took the shape they had predicted (or very close). So, if the prediction mechanism were perfect... then you could use the mass spectroscopy techniques to get the sequence of amino acids, calculate the shape, model another protein that will "dock" with the target, and then produce that new modelled protein. (Hence the REST of the R@H logo about protein design and docking). This is also what CASP is all about. You don't KNOW what it really looks like, how close can you predict based solely on the sequence of amino acids. Some predict by hand, some predict via distributed computing ;) ...net result, when you discover a new virus (SARS, bird flu, HIV...), you could readily create a protein treatment that targets it. And if you've got an existing database of the other billion proteins in the body, then you could create a treatment with minimal sideeffects (i.e. one that will NOT dock with other proteins in your body, just the diseased ones). Add this signature to your EMail: Running Microsoft's "System Idle Process" will never help cure cancer, AIDS nor Alzheimer's. But running Rosetta@home just might! https://boinc.bakerlab.org/rosetta/ |
Dimitris Hatzopoulos Send message Joined: 5 Jan 06 Posts: 336 Credit: 80,939 RAC: 0 |
In any event, if you are interested in finding more about our research, and can stand the ums, you can find it at: Just a quick note that even the "regular" (not high-bandwidth version) 320x240 movie available for download from the abovementioned URL is 127MBytes and a bit over 1hr long. I have just watched random 5-10min pieces of the video, but I found it quite understandable (having no background at all in the field). Absolutely worth watching. Best UFO Resources Wikipedia R@h How-To: Join Distributed Computing projects that benefit humanity |
BennyRop Send message Joined: 17 Dec 05 Posts: 555 Credit: 140,800 RAC: 0 |
I remember watching a video around 1980 in high school which was a presentation by the Kodiak Electrical Association about the proposed hydroelectric dam at Terror Lake. It discussed the effect on the bears that lived in the valley that would be turned into a larger lake behind the dam. They had the manager of the local electrical plant on the bank of Terror Lake, being eaten alive by mosquitos, and using a slightly distracting amount of "ums" in the conversation. A couple of years ago, I visited a museum in Pensacola, Florida, and a Floridian that had visited Terror Lake some time in the past had killed a pair of Kodiak bears, had the taxidermist mount them, and sent to Pensacola where his family donated them to the museum. The point being is that I remember when I saw that video.. By halfway through, the "ums" have really calmed down - so if you want to improve the presentation that you'll be making, see if you can do it in ... 5 minute segments. On the first pass, use your hand to hold your lips together on one side of your mouth, and talk with a rediculous accent. Relax, and then do it again - correctly. (Perhaps there's better advice from those that deal with videotaping.. have attended ToastMaster presentations, etc.) As a benefit to the project, you might be able to use the rediculous accent version of the video to bribe teams like The Knights Who Say Neee! into doubling their production for a few months, for the right to be the first team to get to see the silly version of the presentation first. :) As others have said, the material in the whole 1 hour presentation seemed really easy to understand, especially with description in layman's english about what most of the technobabble was. (I was informed by a client that she didn't understand a single thing I was telling her about why her computer died..) |
rbpeake Send message Joined: 25 Sep 05 Posts: 168 Credit: 247,828 RAC: 0 |
In any event, if you are interested in finding more about our research, and can stand the ums, you can find it at: I agree, and believe that you are your own worst critic (all of us are), and my focus on your presentation was on what you had to say. I found it fascinating, and a means to get to know you better. So David, I would suggest featuring the link on the project main page. Very worthwhile! Regards, Bob P. |
Hoelder1in Send message Joined: 30 Sep 05 Posts: 169 Credit: 3,915,947 RAC: 0 |
http://norfolk.cs.washington.edu/htbin-post/unrestricted/colloq/details.cgi?id=449 Indeed ! There is one specific question I wanted to ask, though: right at the end of the lecture there was a statement which seemed to say that the work about cutting DNA at specific sites (as described on the "Disease Related Research" page) was already under way. I would definitely be interested to hear how far this work has proceeded (e.g., in terms of developing this for therapeutic purposes) and what is planed for the future. Thanks, -H. Team betterhumans.com - discuss and celebrate the future - hoelder1in.org |
Dimitris Hatzopoulos Send message Joined: 5 Jan 06 Posts: 336 Credit: 80,939 RAC: 0 |
A question I have (maybe it's answered somewhere on the site and I missed it) is about the current focus. Are there still ongoing changes to the way Rosetta energy function is computed, or is that part basically finished and nowadays all efforts are towards better / more efficient methods to sample the conformational space? To rephrase it, would the recent difficulties with 1tul be due the fact that even 1 million structures still won't get one of our "explorers" near the energy minimum? Or was "1tul" one rare case where the energy function didn't work so well? It would be nice to also plot the native structure's Rosetta energy function value on those charts, as you did in the past in turquoise colour. Best UFO Resources Wikipedia R@h How-To: Join Distributed Computing projects that benefit humanity |
David Baker Volunteer moderator Project administrator Project developer Project scientist Send message Joined: 17 Sep 05 Posts: 705 Credit: 559,847 RAC: 0 |
A question I have (maybe it's answered somewhere on the site and I missed it) is about the current focus. for 1tul it is very clearly a sampling problem--the native structure has much lower energy but no explorers got anywhere close. we have a number of improved "jumping" methods for larger beta sheet containing proteins that we are continuing to develop. work in the group is currently directed at improving both the energy function and the sampling method, but it is clear that sampling is a far greater problem, particularly for larger more complex proteins. we should still be plotting the native protein energies as you point out. |
David Baker Volunteer moderator Project administrator Project developer Project scientist Send message Joined: 17 Sep 05 Posts: 705 Credit: 559,847 RAC: 0 |
http://norfolk.cs.washington.edu/htbin-post/unrestricted/colloq/details.cgi?id=449 We have worked out the science/technology needed to create new DNA cleaving enzymes (if you ever look at the scientific journal "Nature" you will see our paper on this in a few weeks). We are currently hard at work trying to create enzymes that cleave within genes that cause disease and within pathogens. Once we have these they can be tested as therapeutics, but this is still a step or two away. |
BennyRop Send message Joined: 17 Dec 05 Posts: 555 Credit: 140,800 RAC: 0 |
You might also want to think up a way to encourage a production increase in the smaller teams, and the vast horde of those who don't belong to a team at all. Perhaps look for increases in models/month; and weigh those that have added an extra machine or 5 a little higher than those that increase Rosetta's Boinc share from 10% to 100%. (i.e. grab a couple of both). Perhaps picking out one at random so a non teamed member that runs a second cpu for all of Casp7 has a chance of being picked. |
Message boards :
Rosetta@home Science :
Comments/questions on Rosetta@home journal
©2024 University of Washington
https://www.bakerlab.org