How to determine what future work is coming this way from Robetta

Message boards : Number crunching : How to determine what future work is coming this way from Robetta

To post messages, you must log in.

AuthorMessage
Profile Greg_BE
Avatar

Send message
Joined: 30 May 06
Posts: 5664
Credit: 5,711,666
RAC: 1,686
Message 103809 - Posted: 11 Dec 2021, 9:46:27 UTC

We keep seeing the "millions" of jobs in queue, but yet when you look at Robetta, What of those jobs have we seen already via the "completed" work and what tags can we look at to tell what is future work?

Is python stuff included in there? If so under what target name?
ID: 103809 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Profile Greg_BE
Avatar

Send message
Joined: 30 May 06
Posts: 5664
Credit: 5,711,666
RAC: 1,686
Message 103810 - Posted: 11 Dec 2021, 12:46:21 UTC - in response to Message 103809.  

We keep seeing the "millions" of jobs in queue, but yet when you look at Robetta, What of those jobs have we seen already via the "completed" work and what tags can we look at to tell what is future work?

Is python stuff included in there? If so under what target name?



I found stuff with RoseTTAFold which is the neural network.
So not all of the millions are for us.
It appears the only way to find out what is for us and for the neural network is to look at each job ID
ID: 103810 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Falconet

Send message
Joined: 9 Mar 09
Posts: 350
Credit: 1,017,068
RAC: 183
Message 103812 - Posted: 11 Dec 2021, 14:37:04 UTC - in response to Message 103810.  
Last modified: 11 Dec 2021, 14:39:50 UTC

RoseTTAFold jobs don't get queued at Rosetta@home since they aren't meant for Rosetta@home. I think that if they did, we'd know just from looking at the queue - with all the RoseTTAFold jobs being submitted to Robetta, I doubt we'd have 100,000 left on the queue like we did some weeks ago or so.
The Pythons come directly from the Rosetta@home/IPD team so they don't come from Robetta, which is for scientists around the world to use.

Yes, that is probably the only way to find out if there is work coming to us.
Then again, sometimes I receive Robetta work which I can't find using the ID at the Robetta page.

The 4.20 tasks should last a few days I guess.
ID: 103812 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Profile Greg_BE
Avatar

Send message
Joined: 30 May 06
Posts: 5664
Credit: 5,711,666
RAC: 1,686
Message 103813 - Posted: 11 Dec 2021, 19:56:58 UTC - in response to Message 103812.  

RoseTTAFold jobs don't get queued at Rosetta@home since they aren't meant for Rosetta@home. I think that if they did, we'd know just from looking at the queue - with all the RoseTTAFold jobs being submitted to Robetta, I doubt we'd have 100,000 left on the queue like we did some weeks ago or so.
The Pythons come directly from the Rosetta@home/IPD team so they don't come from Robetta, which is for scientists around the world to use.

Yes, that is probably the only way to find out if there is work coming to us.
Then again, sometimes I receive Robetta work which I can't find using the ID at the Robetta page.

The 4.20 tasks should last a few days I guess.



Which just goes to show I was right a long time back. We are getting leftovers and things that can't be done on the AI.
Which is pretty much following TAC's way of using BOINC.
Oh well, i crunch what comes. RAH is not my highest priority any more.
ID: 103813 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Profile Greg_BE
Avatar

Send message
Joined: 30 May 06
Posts: 5664
Credit: 5,711,666
RAC: 1,686
Message 103814 - Posted: 11 Dec 2021, 20:27:28 UTC

4.2 her2 tasks are something that is important to me.
Deals with breast cancer.
My wife had a hormonal tumor.
Not sure if her deals with that kind of tumor, but her grandmother had breast cancer as well.
So nice to see something related to that.

I am finally after all these years beginning to understand all that gibberish in the task names.
ID: 103814 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Falconet

Send message
Joined: 9 Mar 09
Posts: 350
Credit: 1,017,068
RAC: 183
Message 103815 - Posted: 12 Dec 2021, 14:08:40 UTC - in response to Message 103814.  

4.2 her2 tasks are something that is important to me.
Deals with breast cancer.
My wife had a hormonal tumor.
Not sure if her deals with that kind of tumor, but her grandmother had breast cancer as well.
So nice to see something related to that.

I am finally after all these years beginning to understand all that gibberish in the task names.


I am sorry to hear that.
And yes, looking at the task names, there is usually a good chance that we can figure out what it is about just by using a search engine.
ID: 103815 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Profile Greg_BE
Avatar

Send message
Joined: 30 May 06
Posts: 5664
Credit: 5,711,666
RAC: 1,686
Message 103818 - Posted: 12 Dec 2021, 20:13:16 UTC - in response to Message 103815.  

4.2 her2 tasks are something that is important to me.
Deals with breast cancer.
My wife had a hormonal tumor.
Not sure if her deals with that kind of tumor, but her grandmother had breast cancer as well.
So nice to see something related to that.

I am finally after all these years beginning to understand all that gibberish in the task names.


I am sorry to hear that.
And yes, looking at the task names, there is usually a good chance that we can figure out what it is about just by using a search engine.



At first I figured it was some sort of random secret code and never bothered with it. Finally decided after all these years to look at it after gaining some insight based on Robetta protein strings.
ID: 103818 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Mad_Max

Send message
Joined: 31 Dec 09
Posts: 207
Credit: 23,419,190
RAC: 12,095
Message 103889 - Posted: 24 Dec 2021, 1:44:28 UTC
Last modified: 24 Dec 2021, 1:52:23 UTC

I think there are still a plenty of work for regular Rosetta in queue.
And work shortage is just another example of misconfiguration and poor maintenance of the project tech staf (admins/programmers).

Looks like work generator daemons configured to maintain about 5000 tasks in ready to sent state (taking it from the large work queue displayed on the home page when needed), but this target set as combined number.
Ignoring the fact that the project now has two fundamentally different types of tasks that are not mixed with each other.

There are two separate work generators. One creates regular Rosetta WUs only and second is creating Python vBOX WUs only.
You can see them on server status page: https://boinc.bakerlab.org/rosetta/server_status.php
rah_make_work_rosetta - regular WUs generator
rah_make_work_rosetta_python_projects - Python WUs generator

Pythons WUs processed by volunteer computer at much lower pace (due to much higher system requirements, absence of vBox on some of BOINC client installation, some users do NOT want vBOX tasks and disabled it, etc)
This cause current work cache (Tasks ready to send) after some time end up occupied by vBox python WUs only and no any tasks for regular Rosetta available to download.
But regular WUs genenator does not kick in because target work cache size is already filled (by Python tasks) and produce new work only after some Pyton tasks distributed to clients "freeing up space" for regular tasks.

So now regular rosetta performance and work supply is limited by performance of much slower Python work queue.
ID: 103889 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Mad_Max

Send message
Joined: 31 Dec 09
Posts: 207
Credit: 23,419,190
RAC: 12,095
Message 103890 - Posted: 24 Dec 2021, 2:25:40 UTC
Last modified: 24 Dec 2021, 2:48:04 UTC

P.S.

I saw messages in other topic about that project is simply run out of all regular (4.20) Rosetta WUs and all few millions WUs in main work queue is Python tasks only.
But I see that is not the point. New regular WUs continue to be created and distributed every day by server, only sometimes at a greatly reduced pace (you need to be a lucky one to get some, but it still possible).
And after monitoring the server status, it looks like this pace is limited by the factors that I described in the previous post.
ID: 103890 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Profile Grant (SSSF)

Send message
Joined: 28 Mar 20
Posts: 1498
Credit: 14,729,488
RAC: 16,392
Message 103892 - Posted: 24 Dec 2021, 6:58:30 UTC - in response to Message 103890.  
Last modified: 24 Dec 2021, 7:00:53 UTC

P.S.

I saw messages in other topic about that project is simply run out of all regular (4.20) Rosetta WUs and all few millions WUs in main work queue is Python tasks only.
But I see that is not the point. New regular WUs continue to be created and distributed every day by server, only sometimes at a greatly reduced pace (you need to be a lucky one to get some, but it still possible).
And after monitoring the server status, it looks like this pace is limited by the factors that I described in the previous post.
No (and that applies to all of your previous post as well).
Some Rosetta 4.20 work is occasionally submitted- those are the rb Tasks.
For the usual Rosetta 4.30 Tasks, the project needs to load up those Tasks. Since that hasn't happened, there is only the occasional resend or rb Task available for Rosetta 4.20 work.
All the current Total queued jobs are Python Tasks, as is shown on the Server status page.
Grant
Darwin NT
ID: 103892 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Bryn Mawr

Send message
Joined: 26 Dec 18
Posts: 376
Credit: 10,760,051
RAC: 6,738
Message 103899 - Posted: 24 Dec 2021, 20:59:32 UTC

Looks like work generator daemons configured to maintain about 5000 tasks in ready to sent state (taking it from the large work queue displayed on the home page when needed), but this target set as combined number.
Ignoring the fact that the project now has two fundamentally different types of tasks that are not mixed with each other.


There are two separate limits in play, 5,000 for the Python tasks and 33,000 for the 4.20 tasks. When there are 4.20 tasks available they are fed into the queue independently of the Python tasks.
ID: 103899 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Profile dcdc

Send message
Joined: 3 Nov 05
Posts: 1829
Credit: 116,058,833
RAC: 65,235
Message 103900 - Posted: 24 Dec 2021, 23:36:28 UTC - in response to Message 103899.  

Looks like work generator daemons configured to maintain about 5000 tasks in ready to sent state (taking it from the large work queue displayed on the home page when needed), but this target set as combined number.
Ignoring the fact that the project now has two fundamentally different types of tasks that are not mixed with each other.


There are two separate limits in play, 5,000 for the Python tasks and 33,000 for the 4.20 tasks. When there are 4.20 tasks available they are fed into the queue independently of the Python tasks.


My understanding is that that queue gets drained quite quickly, especially after a bit of a task-drought, and the buffer only gets refilled once every 6(?) hours. Maybe the project need to reduce that number down to 1 or 2 hours... Anyone have any experience of the server setting options and limitations?
ID: 103900 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Profile Grant (SSSF)

Send message
Joined: 28 Mar 20
Posts: 1498
Credit: 14,729,488
RAC: 16,392
Message 103901 - Posted: 25 Dec 2021, 0:23:54 UTC - in response to Message 103900.  

My understanding is that that queue gets drained quite quickly, especially after a bit of a task-drought, and the buffer only gets refilled once every 6(?) hours.
Nope, it gets filled as it empties.
However the period between updates of the various stats is not only different for different values, but it's also very lengthy- several hours in some cases.

There can be times where it might show some Tasks still available, but you're not able to get any because there aren't actually any to be had. And there can be periods where it shows no Tasks available, even though you're able to get work, it's just that the Server status page hasn't yet updated to show that new work as been loaded up.
Grant
Darwin NT
ID: 103901 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Profile dcdc

Send message
Joined: 3 Nov 05
Posts: 1829
Credit: 116,058,833
RAC: 65,235
Message 103902 - Posted: 25 Dec 2021, 9:09:39 UTC - in response to Message 103901.  

My understanding is that that queue gets drained quite quickly, especially after a bit of a task-drought, and the buffer only gets refilled once every 6(?) hours.
Nope, it gets filled as it empties.
However the period between updates of the various stats is not only different for different values, but it's also very lengthy- several hours in some cases.

There can be times where it might show some Tasks still available, but you're not able to get any because there aren't actually any to be had. And there can be periods where it shows no Tasks available, even though you're able to get work, it's just that the Server status page hasn't yet updated to show that new work as been loaded up.


Understood - thanks for that.
ID: 103902 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote

Message boards : Number crunching : How to determine what future work is coming this way from Robetta



©2024 University of Washington
https://www.bakerlab.org