Problems and Technical Issues with Rosetta@home

Message boards : Number crunching : Problems and Technical Issues with Rosetta@home

To post messages, you must log in.

Previous · 1 . . . 132 · 133 · 134 · 135 · 136 · 137 · 138 . . . 309 · Next

AuthorMessage
Profile Grant (SSSF)

Send message
Joined: 28 Mar 20
Posts: 1725
Credit: 18,391,361
RAC: 19,589
Message 103317 - Posted: 15 Nov 2021, 22:01:03 UTC - in response to Message 103314.  

The new Rosetta 4.20 work seems to be a small batch. Queued tasks increased to 2.85 million from around 2.28 million. Won't last long.
And not as long as you might think- the average processing time for the current Rosetta 4.20 Tasks is roughly 2 hours, a quarter of the default Target CPU run time.
So we're chewing through the work around 4 times faster than usual.
Grant
Darwin NT
ID: 103317 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Sid Celery

Send message
Joined: 11 Feb 08
Posts: 2141
Credit: 41,526,036
RAC: 10,392
Message 103320 - Posted: 15 Nov 2021, 23:14:13 UTC - in response to Message 103311.  

I shut down R@H on all my computers and have switched to covid research on World Community Grid. Absolutely no problems keeping all threads busy doing constructive work. Much better environment

I run WCG Open Pandemics too, but as I've said before, it really is useless, seemingly checking things that no-one wants to know and having produced nothing of any interest to anyone.
Covid is my top priority too, but I'm glad when other tasks come down.
It's good to keep busy, but in that case, doing nothing - or anything else - may actually be the better option from all I can glean

If you don't like OPN, perhaps you could run the Mapping Cancer Markers instead.

I run all sub-projects on WCG.
While Rosetta was out of work I still have a remaining 108 WCG tasks to complete, only 13 of which are OPN thankfully
ID: 103320 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Sid Celery

Send message
Joined: 11 Feb 08
Posts: 2141
Credit: 41,526,036
RAC: 10,392
Message 103321 - Posted: 15 Nov 2021, 23:21:58 UTC - in response to Message 103312.  

I shut down R@H on all my computers and have switched to covid research on World Community Grid. Absolutely no problems keeping all threads busy doing constructive work. Much better environment

I run WCG Open Pandemics too, but as I've said before, it really is useless, seemingly checking things that no-one wants to know and having produced nothing of any interest to anyone.
Covid is my top priority too, but I'm glad when other tasks come down.
It's good to keep busy, but in that case, doing nothing - or anything else - may actually be the better option from all I can glean


You want COVID join SiDock. 100% covid.

Question: Then what is Open checking with the COVID 19.7.21?
https://www.worldcommunitygrid.org/about_us/article.s?articleId=715

Nothing of any use or interest to anyone from what I can make out.
I thought this was solved by everyone else in the world in the first few weeks.
No-one's trying to bind to the spike any more, are they? They've been aiming at close binding to the RBD of the spike for the last 18 months, which they've also pretty much achieved.
Maybe they're explaining it very badly or I'm understanding it very badly (entirely possible)
ID: 103321 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Sid Celery

Send message
Joined: 11 Feb 08
Posts: 2141
Credit: 41,526,036
RAC: 10,392
Message 103324 - Posted: 15 Nov 2021, 23:41:21 UTC - in response to Message 103317.  

The new Rosetta 4.20 work seems to be a small batch. Queued tasks increased to 2.85 million from around 2.28 million. Won't last long.
And not as long as you might think- the average processing time for the current Rosetta 4.20 Tasks is roughly 2 hours, a quarter of the default Target CPU run time.
So we're chewing through the work around 4 times faster than usual.

Yes. I took a quick look and all the completed tasks I can see all end after 100 decoys exactly.
Pretty sure this is a simple mistake - it happened once before a few years ago and they just increased the limit of decoys so the tasks could make at least the default runtime.
I've just pointed it out.

I've also made the exaggerated (and not entirely true) claim that discussion of Vbox tasks here has degenerated into asking how it can be uninstalled at the earliest possible moment.
While knowing that some people are running it successfully, istm this is a distinction without much of a difference.
I haven't gone into any details as I'm ignorant of any, but I leave it to them how they go about improving matters, as whatever's happening atm is far from acceptable to almost everyone.

It'll be a bad thing if the decoy limit causes Rosetta 4.20 tasks to run out so quickly all we have left to do is moan a lot more about Vbox/Python tasks
ID: 103324 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Profile robertmiles

Send message
Joined: 16 Jun 08
Posts: 1234
Credit: 14,338,560
RAC: 2,014
Message 103326 - Posted: 16 Nov 2021, 0:13:59 UTC - in response to Message 103314.  

[snip]
Currently, all of my computers are fully occupied doing Covid or cancer research on WCG, SI-Dock or TN-Grid or climate research on CPDN. When Rosetta has work again it will take it’s fair share of the cores available - it was the same when TN-Grid stopped supplying work and when SI-Dock stopped supplying work and when I cocked up my settings and couldn’t get CPDN work.

I used to do TN-Grid, until a computer failure lost the information for doing so.

Could you mention their website, so I can connect my new computer to the project?



Tn-Grid - https://gene.disi.unitn.it/test/index.php


The new Rosetta 4.20 work seems to be a small batch. Queued tasks increased to 2.85 million from around 2.28 million. Won't last long.

Thanks - my new computer is now connected.
ID: 103326 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Profile Grant (SSSF)

Send message
Joined: 28 Mar 20
Posts: 1725
Credit: 18,391,361
RAC: 19,589
Message 103329 - Posted: 16 Nov 2021, 0:50:43 UTC - in response to Message 103324.  

I've also made the exaggerated (and not entirely true) claim that discussion of Vbox tasks here has degenerated into asking how it can be uninstalled at the earliest possible moment.
In many cases people need it for other projects, so them leaving Rosetta is generally their chosen option.
If the project gave the option (as most other projects do with multiple applications) to select what applications to run & not run, it would be an easy fix. But since that option isn't available, leaving Rosetta is the easiest fix.
Grant
Darwin NT
ID: 103329 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Sid Celery

Send message
Joined: 11 Feb 08
Posts: 2141
Credit: 41,526,036
RAC: 10,392
Message 103331 - Posted: 16 Nov 2021, 4:05:24 UTC - in response to Message 103329.  

I've also made the exaggerated (and not entirely true) claim that discussion of Vbox tasks here has degenerated into asking how it can be uninstalled at the earliest possible moment.
In many cases people need it for other projects, so them leaving Rosetta is generally their chosen option.
If the project gave the option (as most other projects do with multiple applications) to select what applications to run & not run, it would be an easy fix. But since that option isn't available, leaving Rosetta is the easiest fix.

I didn't mean that, but only because I assumed that people who were successfully running Vbox64 tasks elsewhere were the few who are running it ok here too.
Is that not right? (I'm struggling to follow which people have which problems).
I was really talking about me and those others with similar problems to me.
It was only really sarcasm from me and tbh I shouldn't have written it.

Anyway, bygones. I'll let them look at what they have to do to get <something> both less resource-intensive and running more successfully before I try it out again - if ever.
ID: 103331 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Profile Grant (SSSF)

Send message
Joined: 28 Mar 20
Posts: 1725
Credit: 18,391,361
RAC: 19,589
Message 103332 - Posted: 16 Nov 2021, 4:44:43 UTC - in response to Message 103331.  

I've also made the exaggerated (and not entirely true) claim that discussion of Vbox tasks here has degenerated into asking how it can be uninstalled at the earliest possible moment.
In many cases people need it for other projects, so them leaving Rosetta is generally their chosen option.
If the project gave the option (as most other projects do with multiple applications) to select what applications to run & not run, it would be an easy fix. But since that option isn't available, leaving Rosetta is the easiest fix.

I didn't mean that, but only because I assumed that people who were successfully running Vbox64 tasks elsewhere were the few who are running it ok here too.
Is that not right? (I'm struggling to follow which people have which problems).
From the comments i've seen, some of people that are having issues with Python work here aren't having issues running work on other projects that require the use of VirtuaBox.
The two main issues i've noticed- The huge requirements (7.5GB of RAM per Task) for Python work, and Tasks that never end (some that appear to get to the point of being initialised, then no processing actually gets done, while there are others that get close to finishing, then get no further even after a couple of days), resulting other Tasks not being done until the problem ones are aborted by the user (if they happen to notice that there is a problem).
Grant
Darwin NT
ID: 103332 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
gemini8
Avatar

Send message
Joined: 25 Feb 12
Posts: 5
Credit: 3,097,384
RAC: 1,034
Message 103347 - Posted: 16 Nov 2021, 15:34:48 UTC

Status: Zeitüberschreitung - keine Antwort.
I see, that's what you call it if you send work that runs for more than three days on a Ryzen 7 3700X.
Haven't been monitoring the machine before today.
This morning the task still wanted to go for 28 seconds, now it's down to half of that.
Unfortunately I don't think it will finish without a real error, and the server already calls it an error anyway.
So, time for canceling.
- - - - - - - - - - - - - - -
Greetings, Jens
ID: 103347 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
gemini8
Avatar

Send message
Joined: 25 Feb 12
Posts: 5
Credit: 3,097,384
RAC: 1,034
Message 103348 - Posted: 16 Nov 2021, 15:47:03 UTC - in response to Message 103347.  

Wiped Rosetta from my Boinc directory.
Set Boinc to use less disk space.
Hopefully, there's not enough space left for that Python stuff.
- - - - - - - - - - - - - - -
Greetings, Jens
ID: 103348 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Jim1348

Send message
Joined: 19 Jan 06
Posts: 881
Credit: 52,257,545
RAC: 0
Message 103349 - Posted: 16 Nov 2021, 15:48:46 UTC - in response to Message 103347.  
Last modified: 16 Nov 2021, 16:01:34 UTC

I see a lot of people wasting a lot of time. That is what happens when forums don't have any input from the project scientists, or at least knowledgeable administrators to keep things straight.
https://boinc.bakerlab.org/rosetta/forum_thread.php?id=6893&postid=103232#103232

The comments just descend into trivia.
I am out, until someone can confirm that the pythons are working correctly, and I can select them.
I don't expect it for at least a couple of months at this rate, if ever. Lots of luck.

PS - The periodic unavailability of the regular Rosettas tells me that they don't have enough work to keep everyone supplied. They have more crunchers than they need. Good for them.

PPS - My Ryzen 3900X's work quite nicely on Folding, though they don't need nearly so much memory. But they are superbly good at informing you what they are doing, with a description of every work unit that is sent out.
ID: 103349 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Profile Greg_BE
Avatar

Send message
Joined: 30 May 06
Posts: 5691
Credit: 5,859,226
RAC: 0
Message 103358 - Posted: 16 Nov 2021, 23:08:06 UTC - in response to Message 103349.  

I see a lot of people wasting a lot of time. That is what happens when forums don't have any input from the project scientists, or at least knowledgeable administrators to keep things straight.
https://boinc.bakerlab.org/rosetta/forum_thread.php?id=6893&postid=103232#103232

The comments just descend into trivia.
I am out, until someone can confirm that the pythons are working correctly, and I can select them.
I don't expect it for at least a couple of months at this rate, if ever. Lots of luck.

PS - The periodic unavailability of the regular Rosettas tells me that they don't have enough work to keep everyone supplied. They have more crunchers than they need. Good for them.

PPS - My Ryzen 3900X's work quite nicely on Folding, though they don't need nearly so much memory. But they are superbly good at informing you what they are doing, with a description of every work unit that is sent out.

-----------------------------------------------------------------------------------------------
PS - The periodic unavailability of the regular Rosettas tells me that they don't have enough work to keep everyone supplied. They have more crunchers than they need. Good for them.

Because a lot of our work is now going to the AI Neural network.
I think that is where the majority of the work will go in the future.
This project will turn into the same as BOINC TAC out of Texas. They are a supercomputing project and things that don't fit their super computer get sent out to home systems.
ID: 103358 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Jim1348

Send message
Joined: 19 Jan 06
Posts: 881
Credit: 52,257,545
RAC: 0
Message 103359 - Posted: 16 Nov 2021, 23:57:43 UTC - in response to Message 103358.  

Because a lot of our work is now going to the AI Neural network.
I think that is where the majority of the work will go in the future.
This project will turn into the same as BOINC TAC out of Texas. They are a supercomputing project and things that don't fit their super computer get sent out to home systems.

I pointed that out about a year ago when MIP on WCG (which uses Rosetta) went in-house. They didn't need the crunchers any more.

And that is probably true for the regular Rosettas. That is why I am disappointed that the pythons don't work, and they have made no attempt that they have told us about to fix it.
If you have a lot of memory, you should be able to do them. Maybe it is just a side project for them? I don't know how important it is.
ID: 103359 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Profile Greg_BE
Avatar

Send message
Joined: 30 May 06
Posts: 5691
Credit: 5,859,226
RAC: 0
Message 103362 - Posted: 17 Nov 2021, 7:24:17 UTC - in response to Message 103359.  

Because a lot of our work is now going to the AI Neural network.
I think that is where the majority of the work will go in the future.
This project will turn into the same as BOINC TAC out of Texas. They are a supercomputing project and things that don't fit their super computer get sent out to home systems.

I pointed that out about a year ago when MIP on WCG (which uses Rosetta) went in-house. They didn't need the crunchers any more.

And that is probably true for the regular Rosettas. That is why I am disappointed that the pythons don't work, and they have made no attempt that they have told us about to fix it.
If you have a lot of memory, you should be able to do them. Maybe it is just a side project for them? I don't know how important it is.


Admin doesn't care either. He hasn't responded to either one of my messages about Python and the dysfunctional web set up. It seems to be a take it as it is or leave it attitude now. Which is a shame. This project started out really caring. Now its a thing to dump on.
ID: 103362 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Profile Grant (SSSF)

Send message
Joined: 28 Mar 20
Posts: 1725
Credit: 18,391,361
RAC: 19,589
Message 103364 - Posted: 17 Nov 2021, 7:58:28 UTC
Last modified: 17 Nov 2021, 7:59:47 UTC

I'm getting "No VBox installed" messages again along with no new work when reporting completed work, and the Ready to send number has taken a dive, so i'm guessing we've already finished off that last batch of Rosetta 4.20 work.
Grant
Darwin NT
ID: 103364 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Sid Celery

Send message
Joined: 11 Feb 08
Posts: 2141
Credit: 41,526,036
RAC: 10,392
Message 103365 - Posted: 17 Nov 2021, 8:07:07 UTC - in response to Message 103364.  

I'm getting "No VBox installed" messages again along with no new work when reporting completed work, and the Ready to send number has taken a dive, so i'm guessing we've already finished off that last batch of Rosetta 4.20 work.

Confirmed here... <sigh>
ID: 103365 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
gemini8
Avatar

Send message
Joined: 25 Feb 12
Posts: 5
Credit: 3,097,384
RAC: 1,034
Message 103366 - Posted: 17 Nov 2021, 11:22:21 UTC - in response to Message 103349.  
Last modified: 17 Nov 2021, 11:24:14 UTC

I am out, until someone can confirm that the pythons are working correctly, and I can select them.

Part one of that is no problem.
I returned at least two proper results before the one I was complaining about.
Unfortunately I can't show them off anymore, as the db seems to have taken them out by now.
vbox can be a hassle on every project, and I encountered similar problems elsewhere.
Dealing with projects that have nothing except vbox apps is one thing, but Rosetta isn't one of those.
So the possibility to unselect those monsters should obviously and absolutely be existing!
- - - - - - - - - - - - - - -
Greetings, Jens
ID: 103366 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
mikey
Avatar

Send message
Joined: 5 Jan 06
Posts: 1895
Credit: 9,214,418
RAC: 1,328
Message 103367 - Posted: 17 Nov 2021, 12:27:35 UTC - in response to Message 103366.  

I am out, until someone can confirm that the pythons are working correctly, and I can select them.


Part one of that is no problem.
I returned at least two proper results before the one I was complaining about.
Unfortunately I can't show them off anymore, as the db seems to have taken them out by now.
vbox can be a hassle on every project, and I encountered similar problems elsewhere.
Dealing with projects that have nothing except vbox apps is one thing, but Rosetta isn't one of those.
So the possibility to unselect those monsters should obviously and absolutely be existing!


I can confirm that the Python tasks work just fine on my Windows 11 beta testing laptop, I am running ver 6.1.26 of VBox and ver 7.16.20 of Boinc:

1450357706 1293408677 4767328 17 Nov 2021, 0:09:28 UTC 20 Nov 2021, 0:09:28 UTC In progress --- --- --- rosetta python projects v1.03 (vbox64)
windows_x86_64
1450297317 1293408267 4767328 17 Nov 2021, 0:09:28 UTC 20 Nov 2021, 0:09:28 UTC In progress --- --- --- rosetta python projects v1.03 (vbox64)
windows_x86_64
1450357611 1293408634 4767328 17 Nov 2021, 0:08:51 UTC 17 Nov 2021, 12:13:59 UTC Cancelled by server 0.00 0.00 --- rosetta python projects v1.03 (vbox64)
windows_x86_64
1450286960 1293407997 4767328 17 Nov 2021, 0:08:03 UTC 20 Nov 2021, 0:08:03 UTC In progress --- --- --- rosetta python projects v1.03 (vbox64)
windows_x86_64
1450357628 1293408478 4767328 17 Nov 2021, 0:08:03 UTC 20 Nov 2021, 0:08:03 UTC In progress --- --- --- rosetta python projects v1.03 (vbox64)
windows_x86_64
1449564483 1293454887 4767328 15 Nov 2021, 23:25:48 UTC 17 Nov 2021, 12:12:47 UTC Completed and validated 22,488.82 20,606.27 185.74 rosetta python projects v1.03 (vbox64)
windows_x86_64
1449564347 1293454813 4767328 15 Nov 2021, 23:25:48 UTC 18 Nov 2021, 23:25:48 UTC In progress --- --- --- rosetta python projects v1.03 (vbox64)
windows_x86_64
1449486504 1293416591 4767328 14 Nov 2021, 3:21:18 UTC 17 Nov 2021, 5:45:55 UTC Completed and validated 34,804.85 31,782.95 441.35 rosetta python projects v1.03 (vbox64)
windows_x86_64
1449487643 1293416839 4767328 14 Nov 2021, 3:20:38 UTC 16 Nov 2021, 13:32:46 UTC Completed and validated 40,661.08 40,217.94 515.61 rosetta python projects v1.03 (vbox64)
windows_x86_64
1449487583 1293416824 4767328 14 Nov 2021, 3:20:38 UTC 16 Nov 2021, 19:56:35 UTC Completed and validated 22,768.22 20,693.41 186.15 rosetta python projects v1.03 (vbox64)
windows_x86_64
ID: 103367 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Jim1348

Send message
Joined: 19 Jan 06
Posts: 881
Credit: 52,257,545
RAC: 0
Message 103369 - Posted: 17 Nov 2021, 13:35:13 UTC - in response to Message 103366.  

I returned at least two proper results before the one I was complaining about.
Unfortunately I can't show them off anymore, as the db seems to have taken them out by now.

Most of my pythons work too. But the ones that fail hang up the machine for long periods of time.
That is not acceptable to me. I will leave to those willing to attend to it.
ID: 103369 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Profile [VENETO] boboviz

Send message
Joined: 1 Dec 05
Posts: 2002
Credit: 9,781,025
RAC: 4,962
Message 103370 - Posted: 17 Nov 2021, 13:44:32 UTC - in response to Message 103359.  
Last modified: 17 Nov 2021, 13:49:35 UTC

I pointed that out about a year ago when MIP on WCG (which uses Rosetta) went in-house. They didn't need the crunchers any more.


I don't think so.
In-house HPC needs some points:
- a lot of performative hardware
- a big and prepared IT team
- simulations as much as possible homogeneous

Rosetta@Home has not these points. WCG (when it was IBM) has.
When IPD/BakerLab needs great computational power that cannot split on Boinc, they always use external source: AWS, Azure, TACC, etc.

But, maybe i'm wrong.....
ID: 103370 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Previous · 1 . . . 132 · 133 · 134 · 135 · 136 · 137 · 138 . . . 309 · Next

Message boards : Number crunching : Problems and Technical Issues with Rosetta@home



©2024 University of Washington
https://www.bakerlab.org