Comments/questions on Rosetta@home journal

Message boards : Rosetta@home Science : Comments/questions on Rosetta@home journal

To post messages, you must log in.

Previous · 1 . . . 3 · 4 · 5 · 6 · 7 · 8 · 9 . . . 10 · Next

AuthorMessage
Profile rbpeake

Send message
Joined: 25 Sep 05
Posts: 168
Credit: 247,828
RAC: 0
Message 14756 - Posted: 27 Apr 2006, 14:41:44 UTC

Imho, a weekend release is OK if the project team has a strong feeling of confidence in the new release, and importantly, if someone on the Project Team can commit to monitoring the release very carefully over the entire weekend.

In this way if problems do arise, someone on the Team can jump in immediately and make fixes before things snowball and turn into big problems.

Thanks!
Regards,
Bob P.
ID: 14756 · Rating: 0 · rate: Rate + / Rate - Report as offensive
Profile Team_Elteor_Borislavj~Intelligence

Send message
Joined: 7 Dec 05
Posts: 14
Credit: 56,027
RAC: 0
Message 14842 - Posted: 28 Apr 2006, 9:05:09 UTC

And he's glad to see to number of pc's rising, i think it will drop a bit at the end of the Stampede
ID: 14842 · Rating: 0 · rate: Rate + / Rate - Report as offensive
hugothehermit

Send message
Joined: 26 Sep 05
Posts: 238
Credit: 314,893
RAC: 0
Message 15392 - Posted: 3 May 2006, 6:22:51 UTC
Last modified: 3 May 2006, 6:27:50 UTC

If you know the amino acid sequances, and you know the structures of mutiple proteins then why don't you combine the two?

Like this: amino acid n is usually folded x fasion.

If you see my point, you could make a database of all the known structures for that sequence, then weight the probability of an amio acid structure, so as to have a most probable to least probable structure for each of the amino acids sequances, test each to find the energy levels for the overall protein that you are trying to predict.

Of course this wouldn't find a structure that to date is unknown but it maybe a shortcut for the others.

You may do something like this already. It was just a thought that struck me so I thought I'd bounce it off the R@H team.

Edit: clarity
ID: 15392 · Rating: 0 · rate: Rate + / Rate - Report as offensive
Ethan
Volunteer moderator

Send message
Joined: 22 Aug 05
Posts: 286
Credit: 9,304,700
RAC: 0
Message 15393 - Posted: 3 May 2006, 6:43:25 UTC - in response to Message 15392.  
Last modified: 3 May 2006, 6:46:54 UTC

I believe this is what they try to do. . but consider the following. . if you have a 20 aa (amino acid) sequence. . you might know how the first and second interact by themselves. . but what happens when 6 and 7 in the chain loop around and interact with the start? Now, an assumption you made at the beggining (they fold like x) is not true since something external is acting opon them.

I think you're right in that it's worth having a database of these types of things. . and I seem to remember the lab folks mentioning that is what took up some of our disk space. . . but:

I could be wrong :)

-E
ID: 15393 · Rating: 0 · rate: Rate + / Rate - Report as offensive
Bin Qian

Send message
Joined: 13 Jul 05
Posts: 33
Credit: 36,897
RAC: 0
Message 15459 - Posted: 3 May 2006, 23:32:25 UTC - in response to Message 15392.  

Great thoughts! You are definitely on the right track of a general protocol for structure prediction. The approach you described - the comparative modeling approach - is actually being routinely used by biologists nowadays.

One aspect of the Rosetta@Home project is to develope methods to improve the template-based structure models. Some of the work units you are currently running are doing extractly this type of work! You can read a little more about these work units here.

If you know the amino acid sequances, and you know the structures of mutiple proteins then why don't you combine the two?

Like this: amino acid n is usually folded x fasion.

If you see my point, you could make a database of all the known structures for that sequence, then weight the probability of an amio acid structure, so as to have a most probable to least probable structure for each of the amino acids sequances, test each to find the energy levels for the overall protein that you are trying to predict.

Of course this wouldn't find a structure that to date is unknown but it maybe a shortcut for the others.

You may do something like this already. It was just a thought that struck me so I thought I'd bounce it off the R@H team.

Edit: clarity


ID: 15459 · Rating: 0 · rate: Rate + / Rate - Report as offensive
hugothehermit

Send message
Joined: 26 Sep 05
Posts: 238
Credit: 314,893
RAC: 0
Message 15486 - Posted: 4 May 2006, 7:21:05 UTC
Last modified: 4 May 2006, 7:27:04 UTC

Thanks for the info Dr(?) Bin Qian I hadn't even thought of using the families of the protein, because I didn't know they existed :), very clever I must say.

That begs the question: Is it possible to group all of the unknown proteins into families?

I had a look on wikipedia X-Ray Crystallography (under Biological structures) and found that ~100 proteins have been solved by electron microscope, so is it possible to get another Rosetta shortcut by getting a rough estimate of what the protein shape or family is this way or perhaps a even early data from x-ray crystallography, or it's chemical reactions etc... ?

As I understand it, any type of reduction in the search space is a very good thing.

That's enough for me, I'm using your time that is better spent elsewhere.

Edit to make more sense
ID: 15486 · Rating: 0 · rate: Rate + / Rate - Report as offensive
eberndl
Avatar

Send message
Joined: 17 Sep 05
Posts: 47
Credit: 3,062,163
RAC: 2,108
Message 15532 - Posted: 4 May 2006, 20:39:12 UTC

Hugo,

From what I understand, the whole point is for Rosetta to be able to predict the structure WITHOUT using the X-ray data. As it stands, that data (or data from protein NMR) is what provides the images in the "Native" box of the screen saver. Although being able to start with a good approximation is very important, it's also important to be able to start from scratch, because most proteins don't have homologues that have been solved yet.


Questions? Try the Wiki!
Take a look inside my brain
ID: 15532 · Rating: 0 · rate: Rate + / Rate - Report as offensive
tralala

Send message
Joined: 8 Apr 06
Posts: 376
Credit: 581,806
RAC: 0
Message 15534 - Posted: 4 May 2006, 21:03:09 UTC - in response to Message 15532.  

Hugo,

From what I understand, the whole point is for Rosetta to be able to predict the structure WITHOUT using the X-ray data. As it stands, that data (or data from protein NMR) is what provides the images in the "Native" box of the screen saver. Although being able to start with a good approximation is very important, it's also important to be able to start from scratch, because most proteins don't have homologues that have been solved yet.


But as more proteins get solved the more homologues you will find. Somewhere they mentioned that they perhaps can use "cheap" experimental data such as visual microscopic data to have a rough estimation how the protein will look like. If this computationally aproach shall ever be useful they need to be able to predict the structure with limited computing power. Either by finding more folding "rules" or algorithms which will restrain the conformational space eihter by combining it with easy to obtain experimental data.
ID: 15534 · Rating: 0 · rate: Rate + / Rate - Report as offensive
TioSuper

Send message
Joined: 2 May 06
Posts: 17
Credit: 164
RAC: 0
Message 15535 - Posted: 4 May 2006, 21:07:56 UTC - in response to Message 14638.  

How embarrassing... both the watchdog "killing" and get_the_hell_out() are my silly recent contributions. Its not representative of the rest of the code -- I didn't expect these error messages and functions to get broadcast to such a wide audience! I assure you that the rest of the code is totally dry and scientific. We'll change "killing" to "ending" in the release after the next (I read the message too late).



Dry and scientific usually is boring. Anyhow, a quirky and sometimes macabre sense of humour is a sign of genius. :) Keep the good work and please don't let your humorous side be destroyed by science. ( Or science as dry and unhumorous people think it should be)

ID: 15535 · Rating: 0 · rate: Rate + / Rate - Report as offensive
Profile Feet1st
Avatar

Send message
Joined: 30 Dec 05
Posts: 1755
Credit: 4,690,520
RAC: 0
Message 15537 - Posted: 4 May 2006, 22:20:16 UTC - in response to Message 15532.  

From what I understand, the whole point is for Rosetta to be able to predict the structure WITHOUT using the X-ray data.

Hugo made reference to
...early data from x-ray crystallography, or it's chemical reactions etc.

I read an article which I wanted to link, but still haven't found it. It explained more about the X-ray process. The nutshell I got out of it is that there really is no "early" or "preliminary data". You either have done the complete X-ray study, or you haven't. Once you get the thing to crystilize (which is the hard part I guess, can take months and is considered more of an art than a science), then you take it to a one-of-a-kind huge X-ray device with superpowers and supercomputers, and you scan it from all angles all in an afternoon.

The huge expense is that this is no simple hospital X-Ray machine. And it still takes hours of scans, I think the article said they scan every 2 degrees of rotation, to get the 3D image.

So, even if you had the BILLION proteins all crystilized, it would STILL take WAY too long (billions of hours) to X-Ray all of them.

As for chemical reactions... I believe that is how they know the sequence that the amino acids comprising the protein are in. But, ya, perhaps they could do some other chemistry on it to infer some broader info. to get clues as to how they are arranged.
Add this signature to your EMail:
Running Microsoft's "System Idle Process" will never help cure cancer, AIDS nor Alzheimer's. But running Rosetta@home just might!
https://boinc.bakerlab.org/rosetta/
ID: 15537 · Rating: 0 · rate: Rate + / Rate - Report as offensive
eberndl
Avatar

Send message
Joined: 17 Sep 05
Posts: 47
Credit: 3,062,163
RAC: 2,108
Message 15593 - Posted: 5 May 2006, 21:15:19 UTC

So, even if you had the BILLION proteins all crystilized, it would STILL take WAY too long (billions of hours) to X-Ray all of them.


Feet1st, you're right it does take hours to Xray each crystal, but it can take weeks or MONTHS to figure out how to crystallize a protein for the first time. And once you have the xrays... you have to run that through a massive computer program (another couple hours), and what that will give is an electron density map, which looks like this:



But all those solid yellow lines have to be added BY HAND to the density map. Once this first estimate of the protein is obtained, they do a theoretical Xray on their model protein, and look at the differences between the actual (from the crystal) and experimental (from the model) scatter graphs, and then they go through and try to reduce the differences as best they can. again, they have to do this by hand.

As for chemical reactions... I believe that is how they know the sequence that the amino acids comprising the protein are in. But, ya, perhaps they could do some other chemistry on it to infer some broader info. to get clues as to how they are arranged.


There are 2 main ways to figure out the sequence of a protein: Get the cDNA and convert it to protein, or take the protein and do tandem Mass Spectroscopy on it, which can give you the actual protein sequence (except that they can't tell leucine and isoleucine apart due to their identical sizes).


Questions? Try the Wiki!
Take a look inside my brain
ID: 15593 · Rating: 0 · rate: Rate + / Rate - Report as offensive
Profile Feet1st
Avatar

Send message
Joined: 30 Dec 05
Posts: 1755
Credit: 4,690,520
RAC: 0
Message 15594 - Posted: 5 May 2006, 22:58:05 UTC
Last modified: 5 May 2006, 22:58:51 UTC

Always good stuff from eberndl. I always learn more from your posts. Thanks.

If anyone else remembers the article, it talked about an entreprenure that's working on methods of doing the x-ray crystallography FASTER. And how each protein can cost $100,000s of USD.

Anyway around it, existing approaches will take many thousands of years at the current pace, and after that, you still don't really know how what you need to know to DESIGN a protein that will target HIV or Alzheimer's. You'd have to experimentally create a bunch of proteins, then do the crystallography and then see if they happen to be a shape that will dock with your target protein.

This is why the TOP7 is such a unique thing. An artificial protein. They actually planned it out, and BUILT a protein, and (I think) it took the shape they had predicted (or very close). So, if the prediction mechanism were perfect... then you could use the mass spectroscopy techniques to get the sequence of amino acids, calculate the shape, model another protein that will "dock" with the target, and then produce that new modelled protein. (Hence the REST of the R@H logo about protein design and docking).

This is also what CASP is all about. You don't KNOW what it really looks like, how close can you predict based solely on the sequence of amino acids. Some predict by hand, some predict via distributed computing ;)

...net result, when you discover a new virus (SARS, bird flu, HIV...), you could readily create a protein treatment that targets it. And if you've got an existing database of the other billion proteins in the body, then you could create a treatment with minimal sideeffects (i.e. one that will NOT dock with other proteins in your body, just the diseased ones).
Add this signature to your EMail:
Running Microsoft's "System Idle Process" will never help cure cancer, AIDS nor Alzheimer's. But running Rosetta@home just might!
https://boinc.bakerlab.org/rosetta/
ID: 15594 · Rating: 0 · rate: Rate + / Rate - Report as offensive
Profile Dimitris Hatzopoulos

Send message
Joined: 5 Jan 06
Posts: 336
Credit: 80,939
RAC: 0
Message 15612 - Posted: 6 May 2006, 16:35:40 UTC
Last modified: 6 May 2006, 17:01:41 UTC

In any event, if you are interested in finding more about our research, and can stand the ums, you can find it at:

http://norfolk.cs.washington.edu/htbin-post/unrestricted/colloq/details.cgi?id=449


Just a quick note that even the "regular" (not high-bandwidth version) 320x240 movie available for download from the abovementioned URL is 127MBytes and a bit over 1hr long.

I have just watched random 5-10min pieces of the video, but I found it quite understandable (having no background at all in the field). Absolutely worth watching.
Best UFO Resources
Wikipedia R@h
How-To: Join Distributed Computing projects that benefit humanity
ID: 15612 · Rating: 0 · rate: Rate + / Rate - Report as offensive
BennyRop

Send message
Joined: 17 Dec 05
Posts: 555
Credit: 140,800
RAC: 0
Message 15618 - Posted: 6 May 2006, 18:47:34 UTC
Last modified: 6 May 2006, 18:53:52 UTC

I remember watching a video around 1980 in high school which was a presentation by the Kodiak Electrical Association about the proposed hydroelectric dam at Terror Lake. It discussed the effect on the bears that lived in the valley that would be turned into a larger lake behind the dam. They had the manager of the local electrical plant on the bank of Terror Lake, being eaten alive by mosquitos, and using a slightly distracting amount of "ums" in the conversation. A couple of years ago, I visited a museum in Pensacola, Florida, and a Floridian that had visited Terror Lake some time in the past had killed a pair of Kodiak bears, had the taxidermist mount them, and sent to Pensacola where his family donated them to the museum. The point being is that I remember when I saw that video..

By halfway through, the "ums" have really calmed down - so if you want to improve the presentation that you'll be making, see if you can do it in ... 5 minute segments. On the first pass, use your hand to hold your lips together on one side of your mouth, and talk with a rediculous accent. Relax, and then do it again - correctly. (Perhaps there's better advice from those that deal with videotaping.. have attended ToastMaster presentations, etc.) As a benefit to the project, you might be able to use the rediculous accent version of the video to bribe teams like The Knights Who Say Neee! into doubling their production for a few months, for the right to be the first team to get to see the silly version of the presentation first. :)

As others have said, the material in the whole 1 hour presentation seemed really easy to understand, especially with description in layman's english about what most of the technobabble was. (I was informed by a client that she didn't understand a single thing I was telling her about why her computer died..)
ID: 15618 · Rating: 1 · rate: Rate + / Rate - Report as offensive
Profile rbpeake

Send message
Joined: 25 Sep 05
Posts: 168
Credit: 247,828
RAC: 0
Message 15627 - Posted: 6 May 2006, 21:04:25 UTC - in response to Message 15612.  

In any event, if you are interested in finding more about our research, and can stand the ums, you can find it at:

http://norfolk.cs.washington.edu/htbin-post/unrestricted/colloq/details.cgi?id=449


Absolutely worth watching.

I agree, and believe that you are your own worst critic (all of us are), and my focus on your presentation was on what you had to say. I found it fascinating, and a means to get to know you better. So David, I would suggest featuring the link on the project main page. Very worthwhile!

Regards,
Bob P.
ID: 15627 · Rating: 0 · rate: Rate + / Rate - Report as offensive
Profile Hoelder1in
Avatar

Send message
Joined: 30 Sep 05
Posts: 169
Credit: 3,915,947
RAC: 0
Message 15642 - Posted: 7 May 2006, 8:40:39 UTC - in response to Message 15627.  
Last modified: 7 May 2006, 8:42:06 UTC

http://norfolk.cs.washington.edu/htbin-post/unrestricted/colloq/details.cgi?id=449

Absolutely worth watching.

I agree, and believe that you are your own worst critic (all of us are), and my focus on your presentation was on what you had to say. I found it fascinating, and a means to get to know you better. So David, I would suggest featuring the link on the project main page. Very worthwhile!

Indeed !

There is one specific question I wanted to ask, though: right at the end of the lecture there was a statement which seemed to say that the work about cutting DNA at specific sites (as described on the "Disease Related Research" page) was already under way. I would definitely be interested to hear how far this work has proceeded (e.g., in terms of developing this for therapeutic purposes) and what is planed for the future. Thanks, -H.
Team betterhumans.com - discuss and celebrate the future - hoelder1in.org
ID: 15642 · Rating: 0 · rate: Rate + / Rate - Report as offensive
Profile Dimitris Hatzopoulos

Send message
Joined: 5 Jan 06
Posts: 336
Credit: 80,939
RAC: 0
Message 15648 - Posted: 7 May 2006, 15:18:56 UTC
Last modified: 7 May 2006, 15:19:40 UTC

A question I have (maybe it's answered somewhere on the site and I missed it) is about the current focus.

Are there still ongoing changes to the way Rosetta energy function is computed, or is that part basically finished and nowadays all efforts are towards better / more efficient methods to sample the conformational space?

To rephrase it, would the recent difficulties with 1tul be due the fact that even 1 million structures still won't get one of our "explorers" near the energy minimum? Or was "1tul" one rare case where the energy function didn't work so well?



It would be nice to also plot the native structure's Rosetta energy function value on those charts, as you did in the past in turquoise colour.



Best UFO Resources
Wikipedia R@h
How-To: Join Distributed Computing projects that benefit humanity
ID: 15648 · Rating: 0 · rate: Rate + / Rate - Report as offensive
David Baker
Volunteer moderator
Project administrator
Project developer
Project scientist

Send message
Joined: 17 Sep 05
Posts: 705
Credit: 559,847
RAC: 0
Message 15655 - Posted: 7 May 2006, 16:54:19 UTC - in response to Message 15648.  

A question I have (maybe it's answered somewhere on the site and I missed it) is about the current focus.

Are there still ongoing changes to the way Rosetta energy function is computed, or is that part basically finished and nowadays all efforts are towards better / more efficient methods to sample the conformational space?

To rephrase it, would the recent difficulties with 1tul be due the fact that even 1 million structures still won't get one of our "explorers" near the energy minimum? Or was "1tul" one rare case where the energy function didn't work so well?



It would be nice to also plot the native structure's Rosetta energy function value on those charts, as you did in the past in turquoise colour.





for 1tul it is very clearly a sampling problem--the native structure has much lower energy but no explorers got anywhere close. we have a number of improved "jumping" methods for larger beta sheet containing proteins that we are continuing to develop. work in the group is currently directed at improving both the energy function and the sampling method, but it is clear that sampling is a far greater problem, particularly for larger more complex proteins.
we should still be plotting the native protein energies as you point out.

ID: 15655 · Rating: 0 · rate: Rate + / Rate - Report as offensive
David Baker
Volunteer moderator
Project administrator
Project developer
Project scientist

Send message
Joined: 17 Sep 05
Posts: 705
Credit: 559,847
RAC: 0
Message 15656 - Posted: 7 May 2006, 16:58:05 UTC - in response to Message 15642.  

http://norfolk.cs.washington.edu/htbin-post/unrestricted/colloq/details.cgi?id=449

Absolutely worth watching.

I agree, and believe that you are your own worst critic (all of us are), and my focus on your presentation was on what you had to say. I found it fascinating, and a means to get to know you better. So David, I would suggest featuring the link on the project main page. Very worthwhile!

Indeed !

There is one specific question I wanted to ask, though: right at the end of the lecture there was a statement which seemed to say that the work about cutting DNA at specific sites (as described on the "Disease Related Research" page) was already under way. I would definitely be interested to hear how far this work has proceeded (e.g., in terms of developing this for therapeutic purposes) and what is planed for the future. Thanks, -H.


We have worked out the science/technology needed to create new DNA cleaving enzymes (if you ever look at the scientific journal "Nature" you will see our paper on this in a few weeks). We are currently hard at work trying to create enzymes that cleave within genes that cause disease and within pathogens. Once we have these they can be tested as therapeutics, but this is still a step or two away.
ID: 15656 · Rating: 0 · rate: Rate + / Rate - Report as offensive
BennyRop

Send message
Joined: 17 Dec 05
Posts: 555
Credit: 140,800
RAC: 0
Message 15708 - Posted: 9 May 2006, 12:03:29 UTC

You might also want to think up a way to encourage a production increase in the smaller teams, and the vast horde of those who don't belong to a team at all. Perhaps look for increases in models/month; and weigh those that have added an extra machine or 5 a little higher than those that increase Rosetta's Boinc share from 10% to 100%. (i.e. grab a couple of both). Perhaps picking out one at random so a non teamed member that runs a second cpu for all of Casp7 has a chance of being picked.




ID: 15708 · Rating: 0 · rate: Rate + / Rate - Report as offensive
Previous · 1 . . . 3 · 4 · 5 · 6 · 7 · 8 · 9 . . . 10 · Next

Message boards : Rosetta@home Science : Comments/questions on Rosetta@home journal



©2024 University of Washington
https://www.bakerlab.org