Message boards : Rosetta@home Science : AlphaFold reveals the structure of protein universe
Author | Message |
---|---|
[VENETO] boboviz Joined: 1 Dec 05 Posts: 1997 Credit: 9,747,451 RAC: 9,335 |
200 million proteins. "In partnership with EMBL’s European Bioinformatics Institute (EMBL-EBI), we’re now releasing predicted structures for nearly all catalogued proteins known to science, which will expand the AlphaFold DB by over 200x - from nearly 1 million structures to over 200 million structures - with the potential to dramatically increase our understanding of biology." |
Jim1348 Joined: 19 Jan 06 Posts: 881 Credit: 52,257,545 RAC: 0 |
Remarkable stuff that will change the world. However, it does not appear that we can contribute to this work directly, since it is mainly done in-house. But they do mention mining: "Structural search tools like Foldseek and Dali are allowing users to very quickly search for entries similar to a given protein. This could be a first step toward mining large sequence datasets for practically useful proteins, such as those that break down plastic, and it could provide clues about protein function. The update of the database to include over 200 million predicted structures will further amplify this impact." I really have only a very limited idea of how that works, but the LODA project seeks to discover new mining algorithms: https://boinc.loda-lang.org/loda/ I have done a little of it, and think I will do more. |
Stevie G Joined: 15 Dec 18 Posts: 107 Credit: 865,910 RAC: 1,988 |
"....mining large sequence datasets for practically useful proteins, such as those that break down plastic..." We need to reduce microplastics, but don't want to produce Ice Nine. :>)) S. Gaber |
[VENETO] boboviz Joined: 1 Dec 05 Posts: 1997 Credit: 9,747,451 RAC: 9,335 |
"Remarkable stuff that will change the world. However, it does not appear that we can contribute to this work directly, since it is mainly done in-house." See my post about FoldSeek. P.S. A BOINC project with FoldSeek would be great! |
Jim1348 Joined: 19 Jan 06 Posts: 881 Credit: 52,257,545 RAC: 0 |
"See my post about FoldSeek" Yes, it always depends on the size of the data that they have to shuffle around, and the latency. We are ready when they are. |
[VENETO] boboviz Joined: 1 Dec 05 Posts: 1997 Credit: 9,747,451 RAC: 9,335 |
"Yes, it always depends on the size of the data that they have to shuffle around, and the latency." The FoldSeek database is 700 GB to download and 950 GB extracted. There is a lot of work to do :-P |
©2024 University of Washington
https://www.bakerlab.org