Problems and Technical Issues with Rosetta@home

Message boards : Number crunching : Problems and Technical Issues with Rosetta@home

To post messages, you must log in.

Previous · 1 . . . 161 · 162 · 163 · 164 · 165 · 166 · 167 . . . 309 · Next

AuthorMessage
Falconet

Send message
Joined: 9 Mar 09
Posts: 354
Credit: 1,276,393
RAC: 2,018
Message 104352 - Posted: 20 Jan 2022, 23:46:03 UTC - in response to Message 104351.  

Ah ha! Well sorry for the misspelling. Quick typing and no thinking after work.
4th dimension does exist...but anyway...again after work..not thinking. Missed the pun.

Now...if you go here: https://robetta.bakerlab.org/queue.php?id=&target=&username=&seq=&page=2 and look at the active tasks at random, you will see that the majority are queued for RoseTTAFold which is the AI.

You can read about it here:
https://www.ipd.uw.edu/2021/07/rosettafold-accurate-protein-structure-prediction-accessible-to-all/

Now bed...



Since these Robetta jobs aren't run at Rosetta@home, it doesn't make sense that they would get queued at Rosetta@home.
They seem to be directly sent from Robetta to the Baker Lab cluster and to the HHMI's Janelia Research Campus.

My bet is the 2.6 million tasks on the Rosetta@home queue are all Pythons.
ID: 104352 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Mr P Hucker
Avatar

Send message
Joined: 12 Aug 06
Posts: 1600
Credit: 12,116,986
RAC: 9,863
Message 104353 - Posted: 21 Jan 2022, 0:27:08 UTC - in response to Message 104351.  

Now...if you go here: https://robetta.bakerlab.org/queue.php?id=&target=&username=&seq=&page=2 and look at the active tasks at random, you will see that the majority are queued for RoseTTAFold which is the AI.
The first thing I spotted when looking at the "user input" page was the word KILL somewhere in the sequence. Either I'm a homicidal maniac, or it's because I'm playing fallout 4 on a Ryzen 3900XT that should be folding.
ID: 104353 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Mr P Hucker
Avatar

Send message
Joined: 12 Aug 06
Posts: 1600
Credit: 12,116,986
RAC: 9,863
Message 104354 - Posted: 21 Jan 2022, 1:23:20 UTC - in response to Message 104352.  
Last modified: 21 Jan 2022, 1:23:33 UTC

https://robetta.bakerlab.org/queue.php?id=&target=&username=&seq=&page=2
Since these Robetta jobs aren't run at Rosetta@home, it doesn't make sense that they would get queued at Rosetta@home.
They seem to be directly sent from Robetta to the Baker Lab cluster and to the HHMI's Janelia Research Campus.

My bet is the 2.6 million tasks on the Rosetta@home queue are all Pythons.
So how come there are so many usernames, it certainly looks like Boinc.
ID: 104354 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Profile Greg_BE
Avatar

Send message
Joined: 30 May 06
Posts: 5691
Credit: 5,859,226
RAC: 0
Message 104355 - Posted: 21 Jan 2022, 7:18:41 UTC - in response to Message 104354.  
Last modified: 21 Jan 2022, 7:34:58 UTC

https://robetta.bakerlab.org/queue.php?id=&target=&username=&seq=&page=2
Since these Robetta jobs aren't run at Rosetta@home, it doesn't make sense that they would get queued at Rosetta@home.
They seem to be directly sent from Robetta to the Baker Lab cluster and to the HHMI's Janelia Research Campus.

My bet is the 2.6 million tasks on the Rosetta@home queue are all Pythons.
So how come there are so many usernames, it certainly looks like Boinc.



That's not users related to us. That's the person who submitted the protein.
Again...if its Robetta, its not us. That's all there is to it.


I'll have to dig around some more.
I have always known from other sources that Robetta server supplied Rosetta servers.
That Robetta was where everything is stored.
That may not be the case, but it's something that will take some digging around.
The group is not that transparent on how their setup works.
ID: 104355 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Falconet

Send message
Joined: 9 Mar 09
Posts: 354
Credit: 1,276,393
RAC: 2,018
Message 104357 - Posted: 21 Jan 2022, 10:18:38 UTC - in response to Message 104355.  

Robetta, as far as I can tell, is separate from Rosetta@home and is used mostly by researchers outside of the Baker Lab/IPD. It's an interface for users who wish to get computing power for their jobs.
Jobs that require the use of Rosetta 4.20 that are submitted to Robetta get sent to Rosetta@home but the rest goes to the other servers that they set up when they launched RoseTTAFold.
ID: 104357 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
aperience

Send message
Joined: 26 May 21
Posts: 1
Credit: 178,981
RAC: 53
Message 104360 - Posted: 21 Jan 2022, 14:06:18 UTC - in response to Message 104355.  

https://robetta.bakerlab.org/queue.php?id=&target=&username=&seq=&page=2
Since these Robetta jobs aren't run at Rosetta@home, it doesn't make sense that they would get queued at Rosetta@home.
They seem to be directly sent from Robetta to the Baker Lab cluster and to the HHMI's Janelia Research Campus.

My bet is the 2.6 million tasks on the Rosetta@home queue are all Pythons.
So how come there are so many usernames, it certainly looks like Boinc.



That's not users related to us. That's the person who submitted the protein.
Again...if its Robetta, its not us. That's all there is to it.


I'll have to dig around some more.
I have always known from other sources that Robetta server supplied Rosetta servers.
That Robetta was where everything is stored.
That may not be the case, but it's something that will take some digging around.
The group is not that transparent on how their setup works.


Hi, regular Boinc user here, first time posting.

I don't know why, but I've never had issues with python projects tasks. I haven't done anything special apart from downloading VirtualBox alongside Boinc. I assume many people did, unless there's some controversy around VirtualBox that I'm unaware of.
I've seen that I can't have more than 2 python tasks at a time, whatever the reason is, but it's not a limitation for me, as I mostly run tasks for Numberfields@home.

I'm just posting to tell you that users like me exist, although we may be rare (I assume I'm not the only one). I'm not saying that there aren't users related to the running team among the ~1000 users running python tasks right now, I can just tell that I'm not one of them and that it's possible to run python tasks through Boinc and VB. I guess I'm just lucky it works on my laptop.
I can give you more info if you want but I haven't really followed the discussion and I'm not an expert with computers, so try to give instructions if you want technical details.
ID: 104360 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Jim1348

Send message
Joined: 19 Jan 06
Posts: 881
Credit: 52,257,545
RAC: 0
Message 104361 - Posted: 21 Jan 2022, 15:19:49 UTC
Last modified: 21 Jan 2022, 15:59:59 UTC

I have posted this many times before. They should make it a sticky if there were any moderator around to do it.

If you are running VirtualBox 6.1.x, you will get the "Vm job unmanageable" problem with the pythons. That is true whether you are running Windows or Linux.
The difference is that it can be fixed with Windows. You go back to VirtualBox 5.2.44
https://www.virtualbox.org/wiki/Download_Old_Builds_5_2

Unfortunately, that does not work on Linux, at least Ubuntu. Firstly, Ubuntu 20.04.3 works only with VBox 6.1.x.
Secondly, even going back to Ubuntu 18.04.6, which allows you to install VBox 5.2.44, still has the problem.

They need to fix it at the project end, by compiling a new Vbox wrapper. They did it on LHC, and it works there. (It has to do with the COM interface, in case you are interested.)

NB: If you reboot frequently, you may not see the problem. It usually occurs after the pythons have been running 12 hours or so, but I have seen it even after a reboot on Ubuntu.
ID: 104361 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Mr P Hucker
Avatar

Send message
Joined: 12 Aug 06
Posts: 1600
Credit: 12,116,986
RAC: 9,863
Message 104363 - Posted: 21 Jan 2022, 18:12:33 UTC - in response to Message 104361.  

That's only one of the problems, I can't get python to work on 6 of 7 Windows PCs no matter which version of VB I use. One of them fails validation at the server end, the rest just sit with almost 0 CPU time.

Pythons for 12 hours? They average 2 hours here.

As for only being able to run a couple at a time, you need a lot of RAM. I can run 5 but it won't do 6 in 16GB.
ID: 104363 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Jim1348

Send message
Joined: 19 Jan 06
Posts: 881
Credit: 52,257,545
RAC: 0
Message 104364 - Posted: 21 Jan 2022, 18:31:15 UTC - in response to Message 104363.  

Pythons for 12 hours? They average 2 hours here.

As for only being able to run a couple at a time, you need a lot of RAM. I can run 5 but it won't do 6 in 16GB.

No, not an individual run for 12 hours. After running a series of them continually.
I didn't say anything about running only two. I usually run at least eight, and am presently running twenty on a Ryzen 3900X with 80 GB of memory.
ID: 104364 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Mr P Hucker
Avatar

Send message
Joined: 12 Aug 06
Posts: 1600
Credit: 12,116,986
RAC: 9,863
Message 104365 - Posted: 21 Jan 2022, 18:33:59 UTC - in response to Message 104364.  
Last modified: 21 Jan 2022, 18:34:14 UTC

Pythons for 12 hours? They average 2 hours here.

As for only being able to run a couple at a time, you need a lot of RAM. I can run 5 but it won't do 6 in 16GB.

No, not an individual run for 12 hours. After running a series of them continually.
I didn't say anything about running only two. I usually run at least eight, and am presently running twenty on a Ryzen 3900X with 80 GB of memory.
The two was the post before yours. Sorry, I was too lazy to quote who I was talking to in which bit. I'm giving a dodgy machine another go on the old VB, but I think I've tried this. It just doesn't use the CPU at all.
ID: 104365 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Jim1348

Send message
Joined: 19 Jan 06
Posts: 881
Credit: 52,257,545
RAC: 0
Message 104366 - Posted: 21 Jan 2022, 18:36:18 UTC - in response to Message 104365.  

I'm giving a dodgy machine another go on the old VB, but I think I've tried this. It just doesn't use the CPU at all.

I wish I had a fix, but you have tried a lot. Good luck.
ID: 104366 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Profile Greg_BE
Avatar

Send message
Joined: 30 May 06
Posts: 5691
Credit: 5,859,226
RAC: 0
Message 104368 - Posted: 21 Jan 2022, 19:09:29 UTC - in response to Message 104357.  

Robetta, as far as I can tell, is separate from Rosetta@home and is used mostly by researchers outside of the Baker Lab/IPD. It's an interface for users who wish to get computing power for their jobs.
Jobs that require the use of Rosetta 4.20 that are submitted to Robetta get sent to Rosetta@home but the rest goes to the other servers that they set up when they launched RoseTTAFold.



Ok..so then where do they get the million something tasks in queue?
But yet there appears to be only a few thousand released?
There has always been a million something in queue, even back when it was just 4.2 alone.
So something doesn't add up.
And that you can't see what is next in line....but yet you can see Robetta?
Plus someone kept quoting Robetta information some time ago as if that was where RAH gets its work.
ID: 104368 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Profile Greg_BE
Avatar

Send message
Joined: 30 May 06
Posts: 5691
Credit: 5,859,226
RAC: 0
Message 104369 - Posted: 21 Jan 2022, 19:12:32 UTC - in response to Message 104364.  

Pythons for 12 hours? They average 2 hours here.

As for only being able to run a couple at a time, you need a lot of RAM. I can run 5 but it won't do 6 in 16GB.

No, not an individual run for 12 hours. After running a series of them continually.
I didn't say anything about running only two. I usually run at least eight, and am presently running twenty on a Ryzen 3900X with 80 GB of memory.



Holy cow! 80 gigs?!?! That's more than my budget can afford!
It hurt enough to put in 32 on top of the 16 I put in about 4 years ago.

I just can't see investing much more for being a volunteer.
At best another 1080 or better, but that's it.
ID: 104369 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Profile Greg_BE
Avatar

Send message
Joined: 30 May 06
Posts: 5691
Credit: 5,859,226
RAC: 0
Message 104370 - Posted: 21 Jan 2022, 19:15:32 UTC - in response to Message 104361.  

I have posted this many times before. They should make it a sticky if there were any moderator around to do it.

If you are running VirtualBox 6.1.x, you will get the "Vm job unmanageable" problem with the pythons. That is true whether you are running Windows or Linux.
The difference is that it can be fixed with Windows. You go back to VirtualBox 5.2.44
https://www.virtualbox.org/wiki/Download_Old_Builds_5_2

Unfortunately, that does not work on Linux, at least Ubuntu. Firstly, Ubuntu 20.04.3 works only with VBox 6.1.x.
Secondly, even going back to Ubuntu 18.04.6, which allows you to install VBox 5.2.44, still has the problem.

They need to fix it at the project end, by compiling a new Vbox wrapper. They did it on LHC, and it works there. (It has to do with the COM interface, in case you are interested.)

NB: If you reboot frequently, you may not see the problem. It usually occurs after the pythons have been running 12 hours or so, but I have seen it even after a reboot on Ubuntu.



Jim, I went back to 6.1 and I do not have problems.
I can run all my projects there.
Going back to 5.2 is a good place to start for trouble shooting Python VM problms, but this can affect other projects.
ID: 104370 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Mr P Hucker
Avatar

Send message
Joined: 12 Aug 06
Posts: 1600
Credit: 12,116,986
RAC: 9,863
Message 104371 - Posted: 21 Jan 2022, 19:28:57 UTC - in response to Message 104369.  

Pythons for 12 hours? They average 2 hours here.

As for only being able to run a couple at a time, you need a lot of RAM. I can run 5 but it won't do 6 in 16GB.

No, not an individual run for 12 hours. After running a series of them continually.
I didn't say anything about running only two. I usually run at least eight, and am presently running twenty on a Ryzen 3900X with 80 GB of memory.



Holy cow! 80 gigs?!?! That's more than my budget can afford!
It hurt enough to put in 32 on top of the 16 I put in about 4 years ago.

I just can't see investing much more for being a volunteer.
At best another 1080 or better, but that's it.
I have a Ryzen with 64GB, but it's my main computer. Less than that is pitiful by today's standards. It will take 128GB.
I have two Boinc only machines with 36GB in them. I upped them just enough to run LHC.
ID: 104371 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Jim1348

Send message
Joined: 19 Jan 06
Posts: 881
Credit: 52,257,545
RAC: 0
Message 104372 - Posted: 21 Jan 2022, 21:03:34 UTC - in response to Message 104371.  

I have a Ryzen with 64GB, but it's my main computer. Less than that is pitiful by today's standards. It will take 128GB.
I have two Boinc only machines with 36GB in them. I upped them just enough to run LHC.

Most projects don't take nearly as much as the pythons or LHC of course.

I like memory, but beyond 64 GB you have stability problems, since you have to use all four slots.
Sometimes it works, but you often have to juggle memory around. You may have to spend more than you anticipated.
Two slots is a lot safer.
ID: 104372 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Falconet

Send message
Joined: 9 Mar 09
Posts: 354
Credit: 1,276,393
RAC: 2,018
Message 104373 - Posted: 21 Jan 2022, 21:10:55 UTC - in response to Message 104368.  

Robetta, as far as I can tell, is separate from Rosetta@home and is used mostly by researchers outside of the Baker Lab/IPD. It's an interface for users who wish to get computing power for their jobs.
Jobs that require the use of Rosetta 4.20 that are submitted to Robetta get sent to Rosetta@home but the rest goes to the other servers that they set up when they launched RoseTTAFold.



Ok..so then where do they get the million something tasks in queue?
But yet there appears to be only a few thousand released?
There has always been a million something in queue, even back when it was just 4.2 alone.
So something doesn't add up.
And that you can't see what is next in line....but yet you can see Robetta?
Plus someone kept quoting Robetta information some time ago as if that was where RAH gets its work.


The queue we see in the Rosetta@home page represents the jobs that the IPD directly submits to run at Rosetta@home + whatever gets submitted at Rosetta to run on Rosetta@home, the rb_11111_11111 jobs.

If I could see the Rosetta@home queue, it would likely be close to 100% Rosetta Python jobs. The Pythons are refilled from the queue up to a max of 5,000 ready to send. I don't know why (server resources constraints?) but it's not like Rosetta@home can do a lot of these at any given time so no point in increasing that value, So yeah, those 2.6 million jobs on the queue are Pythons. Sid Celery posted something a few months ago that he received from Admin or someone like that who said that the Python job that had been submitted by one of the IPD researchers was "huge".

I think one of the features on Robetta is to make sure that everything is open - that is, any researcher can see what is being worked on both present and past and maybe avoid duplication of work. I recall during the pandemic that they asked Robetta users to make sure their jobs were visible to others so that everyone could benefit.
(I have a suspicion that researchers can hide their jobs - often times, I try to search for the jobs I'm running on my computers using the ID number but Robetta doesn't return any results).

I don't know who said that but my impression has always been that work that comes from Robetta is labelled rb and everything else that isn't labelled rb is directly submitted to Rosetta@home.
ID: 104373 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Mr P Hucker
Avatar

Send message
Joined: 12 Aug 06
Posts: 1600
Credit: 12,116,986
RAC: 9,863
Message 104375 - Posted: 21 Jan 2022, 22:08:20 UTC - in response to Message 104372.  
Last modified: 21 Jan 2022, 22:08:46 UTC

Most projects don't take nearly as much as the pythons or LHC of course.

I like memory, but beyond 64 GB you have stability problems, since you have to use all four slots.
Sometimes it works, but you often have to juggle memory around. You may have to spend more than you anticipated.
Two slots is a lot safer.
Everything works better with more memory, if you're not using it you get a massive disk cache. Using all four slots does not cause stability problems. Always test your new memory with memtest before use, even quality stuff has duds.
ID: 104375 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Mr P Hucker
Avatar

Send message
Joined: 12 Aug 06
Posts: 1600
Credit: 12,116,986
RAC: 9,863
Message 104376 - Posted: 21 Jan 2022, 22:11:10 UTC - in response to Message 104373.  

Sid Celery posted something a few months ago that he received from Admin or someone like that who said that the Python job that had been submitted by one of the IPD researchers was "huge".
It's not that big, it's only a few million tasks. I've seen the queue at 15 million. But maybe that was several projects at once.
ID: 104376 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Profile Grant (SSSF)

Send message
Joined: 28 Mar 20
Posts: 1725
Credit: 18,380,064
RAC: 20,136
Message 104377 - Posted: 21 Jan 2022, 22:16:09 UTC - in response to Message 104368.  

Ok..so then where do they get the million something tasks in queue?
But yet there appears to be only a few thousand released?
There has always been a million something in queue, even back when it was just 4.2 alone.
So something doesn't add up.

Back when there were Rosetta 4.20 Tasks, all those millions were Rosetta 4.30 Tasks and if you checked the Total queued jobs number it would gradually run down to zero (or jump up again as new work was released).

Now most of the work is Python, and that's what that number shows. Extremely occasionally it jumps up again when that extremely rare batch of Rosetta 4.20 work is released.,
However most of the time it sits around the 2-2.7 million mark, this is because the amount of Python work being done is being done at roughly the same rate as new Python work is released.


The Unset value in the Application task list is the amount of work that's ready to go for that particular application (i think the ratio is 6:1 Rosetta 4.20:Python). The Tasks ready to send value under the Computing Status is the Rosetta 4.20 & Python Tasks by application values combined.
The Total queued jobs value is both the Rosetta 4.20 work, and the Python work and both the Unset 4.20 & Python work all combined. It is the total of all types of work at all stages of yet to be processed.
Grant
Darwin NT
ID: 104377 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Previous · 1 . . . 161 · 162 · 163 · 164 · 165 · 166 · 167 . . . 309 · Next

Message boards : Number crunching : Problems and Technical Issues with Rosetta@home



©2024 University of Washington
https://www.bakerlab.org