1.03 (vbox64) is out for rosetta python projects

Message boards : Number crunching : 1.03 (vbox64) is out for rosetta python projects

To post messages, you must log in.

Previous · 1 · 2

AuthorMessage
Profile dcdc

Send message
Joined: 3 Nov 05
Posts: 1829
Credit: 115,879,649
RAC: 60,477
Message 102363 - Posted: 8 Aug 2021, 14:05:19 UTC
Last modified: 8 Aug 2021, 14:05:40 UTC

Oooh I got one!

https://ibb.co/T29Jn4y
ID: 102363 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Profile dcdc

Send message
Joined: 3 Nov 05
Posts: 1829
Credit: 115,879,649
RAC: 60,477
Message 102365 - Posted: 8 Aug 2021, 16:06:37 UTC

ID: 102365 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Kissagogo27

Send message
Joined: 31 Mar 20
Posts: 86
Credit: 2,652,894
RAC: 2,248
Message 102366 - Posted: 8 Aug 2021, 19:26:03 UTC


Peak working set size 57.25 MB
Peak swap size 55.54 MB
Peak disk usage 7,947.27 MB


not so small on disk ^^
ID: 102366 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Jim1348

Send message
Joined: 19 Jan 06
Posts: 881
Credit: 52,257,545
RAC: 0
Message 102368 - Posted: 8 Aug 2021, 19:51:28 UTC - in response to Message 102366.  

I finished one four days ago. BoincTasks reported the memory usage at almost 8 GB.

Rosetta@home 1.03 rosetta python projects (vbox64) afbr-PTAMBA-BEN3_pp-mNIP-TZR_pp_0_1397965_85_1 01:15:06 (01:14:23) 8/4/2021 6:40:46 PM 8/4/2021 6:42:50 PM 99.05 Reported: OK + Ryzen3900X-1 3286.57 MB 7629.39 MB

ID: 102368 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Lilkat

Send message
Joined: 3 Nov 07
Posts: 2
Credit: 37,546,103
RAC: 0
Message 102726 - Posted: 19 Sep 2021, 13:03:35 UTC

I'm thinking of doing another PC build to run the Rosetta Python units, specifically so it can be loaded with a lot of RAM. Is there a minimum CPU generation or OS that is required to run it? I have a pair of Ryzen 3950 units that run Linux, and are significantly constrained by 32GB of RAM, and they can get the Python work units. I also have an E3-1245v3 (Intel Haswell Xeon) with 32GB of RAM under Windows 7, but it only has picked up the normal Rosetta work units so far, even though it otherwise has the RAM for two work units.

My plan was to get a Haswell or Broadwell-based client set up under Linux with around 300-500GB of RAM, but I don't want to go through the trouble if it won't run the Python units.
ID: 102726 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Bryn Mawr

Send message
Joined: 26 Dec 18
Posts: 376
Credit: 10,736,206
RAC: 5,659
Message 102734 - Posted: 19 Sep 2021, 20:58:54 UTC - in response to Message 102107.  

I doubt we'll be out of normal Rosetta units for a while yet because I believe the Robetta server takes requests in from around the world and distributes them on r@h. I wouldn't expect all those users to start using trrosetta yet, but I might well be wrong. Would be good to hear from someone in the project.


The problem is the selection of tasks to queue for delivery to us users.

There may be millions of tasks waiting but there are only ever 29k tasks queued and that can (and have been) be skewed to be all python.
ID: 102734 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Bryn Mawr

Send message
Joined: 26 Dec 18
Posts: 376
Credit: 10,736,206
RAC: 5,659
Message 102735 - Posted: 19 Sep 2021, 21:01:21 UTC - in response to Message 102128.  

Hopefully they don't switch exclusively to vbox - or give users the ability to choose workunit types. There are doubtlessly countless machines that do not have virtual box installed and likely will not do so as they simply sit and crunch with users not checking the forums.

How much memory does each WU consume with these new python WUs? Do they finally use SSE/Avx? I don't think I have gotten one yet.
thanks


If my machine are anything to go be, if you don’t have box installed it won’t download python tasks.
ID: 102735 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Jim1348

Send message
Joined: 19 Jan 06
Posts: 881
Credit: 52,257,545
RAC: 0
Message 102736 - Posted: 19 Sep 2021, 21:50:01 UTC - in response to Message 102734.  

The problem is the selection of tasks to queue for delivery to us users.

There may be millions of tasks waiting but there are only ever 29k tasks queued and that can (and have been) be skewed to be all python.

Mixing them all up shows that they don't understand their cruncher base, nor listen to them (no great surprise there).
I would choose all pythons on one machine, and the regular ones on another. That way you can make the appropriate memory available.
As it is, it is overkill for some work units, and not enough for others.

I was hoping for better, but have some Plan B's in mind.
ID: 102736 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Profile Greg_BE
Avatar

Send message
Joined: 30 May 06
Posts: 5664
Credit: 5,711,666
RAC: 1,996
Message 102741 - Posted: 19 Sep 2021, 23:45:18 UTC
Last modified: 19 Sep 2021, 23:47:59 UTC

Memory usage is a rounded 8 GB or 7.63 GB

Current server load Unsent In Progress
Rosetta 10653 344270
rosetta python projects 14759 13148
ID: 102741 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Lilkat

Send message
Joined: 3 Nov 07
Posts: 2
Credit: 37,546,103
RAC: 0
Message 102744 - Posted: 20 Sep 2021, 0:02:44 UTC

After doing some homework, I think it would be more cost-effective to bump both of my Ryzen 3950x machines from 32GB to 128GB, and tune the <max_concurrent> setting to allocate the first 32GB of RAM for non Python work units, instead of building a Haswell or Broadwell-based machine specific to the task. For my x470D4U board, it will unfortunately mean that my ECC RAM speed will drop from 2666 MHz (1-2 sticks) to 2400 MHz (4 sticks), but it's not a terrible decrease. If Rosetta's future is (a lot) more RAM, then I'll meet them where they are going.
ID: 102744 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Profile Grant (SSSF)

Send message
Joined: 28 Mar 20
Posts: 1488
Credit: 14,677,518
RAC: 14,569
Message 102748 - Posted: 20 Sep 2021, 5:06:52 UTC - in response to Message 102726.  

I have a pair of Ryzen 3950 units that run Linux, and are significantly constrained by 32GB of RAM,
To avoid memory issues when running Rosetta Tasks, you need to allow roughly 1.3GB of RAM per Task being run. So you'd need 42GB RAM minimum to run Rosetta tasks on all cores & threads of that system.
Given that's a dual memory channel CPU, you'd be looking at 2*32GB DIMMs to make use of all cores & threads for Rosetta 4.20 work.
If running VirtualBox Tasks, even with the motherboard maxed out on RAM it wouldn't be possible to make use of all available cores/threads for VritualBox Tasks.
Grant
Darwin NT
ID: 102748 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Profile Greg_BE
Avatar

Send message
Joined: 30 May 06
Posts: 5664
Credit: 5,711,666
RAC: 1,996
Message 102755 - Posted: 20 Sep 2021, 11:16:07 UTC - in response to Message 102748.  

I have a pair of Ryzen 3950 units that run Linux, and are significantly constrained by 32GB of RAM,
To avoid memory issues when running Rosetta Tasks, you need to allow roughly 1.3GB of RAM per Task being run. So you'd need 42GB RAM minimum to run Rosetta tasks on all cores & threads of that system.
Given that's a dual memory channel CPU, you'd be looking at 2*32GB DIMMs to make use of all cores & threads for Rosetta 4.20 work.
If running VirtualBox Tasks, even with the motherboard maxed out on RAM it wouldn't be possible to make use of all available cores/threads for VritualBox Tasks.


We don't know how long Python will last or what is in the pipeline for Vbox work after Python is done, so I personally would not go through all the fuss of upgrading and reconfiguring my system just for Python. Besides I don't know of any command that lets you isolate Python and reject 4.20 work. The scheduler spits out both. So how many Python and how many 4.20 tasks will you get? That is unknown.
ID: 102755 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Jim1348

Send message
Joined: 19 Jan 06
Posts: 881
Credit: 52,257,545
RAC: 0
Message 102757 - Posted: 20 Sep 2021, 12:12:17 UTC - in response to Message 102755.  

At least they are a bit consistent, and are giving me four pythons with each update request. I can handle that nicely on this 64 GB machine.
Maybe they measure how much memory you have and send accordingly?

I have other machines with 48 GB memory, so they would work too if they can get the MD5 error fixed. We will see.
ID: 102757 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Jim1348

Send message
Joined: 19 Jan 06
Posts: 881
Credit: 52,257,545
RAC: 0
Message 102859 - Posted: 1 Oct 2021, 9:26:57 UTC - in response to Message 102757.  
Last modified: 1 Oct 2021, 9:37:31 UTC

And I am now running four more pythons on a Ryzen 3900X with 48 GB of memory (with 24 virtual cores), with two more in the buffer.
But there aren't any on two Ryzen 3950X with 48 GB of memory (with 32 virtual cores).

That may be just the luck of the draw, but I am beginning to think that they calculate how much memory you have per core.
ID: 102859 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Profile Grant (SSSF)

Send message
Joined: 28 Mar 20
Posts: 1488
Credit: 14,677,518
RAC: 14,569
Message 102860 - Posted: 1 Oct 2021, 10:29:41 UTC - in response to Message 102859.  

That may be just the luck of the draw, but I am beginning to think that they calculate how much memory you have per core.
If so, it would be nice if they did that with the Rosetta 4.20 Tasks as well.
Save a lot of grief with large RAM requirement Tasks on hosts with low amounts of RAM per core/thread.
Grant
Darwin NT
ID: 102860 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Jim1348

Send message
Joined: 19 Jan 06
Posts: 881
Credit: 52,257,545
RAC: 0
Message 102861 - Posted: 1 Oct 2021, 15:22:42 UTC - in response to Message 102860.  

If so, it would be nice if they did that with the Rosetta 4.20 Tasks as well.
Save a lot of grief with large RAM requirement Tasks on hosts with low amounts of RAM per core/thread.

Actually, the 4.20 Rosettas are more predictable now. They seem to run around 1 GB, more or less. I don't see the large variations that they had before, when you could go from 3 TB down to 300 MB.

Maybe that is due to the new way they are pre-processing them now with the AI stuff, if that is what they are doing. They don't tell us.
ID: 102861 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Jim1348

Send message
Joined: 19 Jan 06
Posts: 881
Credit: 52,257,545
RAC: 0
Message 102862 - Posted: 1 Oct 2021, 21:12:01 UTC - in response to Message 102859.  

But there aren't any on two Ryzen 3950X with 48 GB of memory (with 32 virtual cores).

I now have four pythons on one of the 3950X and five on the other, in the buffer.
So I have enough memory for them to download. Maybe they limit how many run at a time, or at least BOINC does I hope.
ID: 102862 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
mikey
Avatar

Send message
Joined: 5 Jan 06
Posts: 1894
Credit: 8,767,498
RAC: 6,467
Message 102952 - Posted: 14 Oct 2021, 12:08:38 UTC - in response to Message 102862.  

But there aren't any on two Ryzen 3950X with 48 GB of memory (with 32 virtual cores).

I now have four pythons on one of the 3950X and five on the other, in the buffer.
So I have enough memory for them to download. Maybe they limit how many run at a time, or at least BOINC does I hope.


I have an AMD Ryzen 7 4800 series cpu with 8/16 cores and 16gb of ram and Rosetta tried to run 8 Pyton units at one time, I have Boinc set to only use 50% of the cpu's. Of course everything slowed to a crawl so I made it so Rosetta only runs 1 task at a time and things are working but the 6 hour time estimate to run the tasks is not even close as I'm seeing closer to 24 hours to finish a task. BUT the Rosetta website says I'm only using 8k to 16k seconds to run a task so something is still not right.
ID: 102952 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Profile Greg_BE
Avatar

Send message
Joined: 30 May 06
Posts: 5664
Credit: 5,711,666
RAC: 1,996
Message 102959 - Posted: 14 Oct 2021, 19:20:52 UTC

Have you done the following:

1) checked you vbox environment for any remains of tasks completed or aborted? Occasionally you have to manually clean your vbox. ( which might explain my other thread as I just checked my vbox)

2) have you checked to make sure your version is up to date for the main program and have you also checked your extensions.

If all this is ok...I don't have answers other than what Mikey and the others go on about with memory.
I had something similar happen with ATLAS long ago when I started there (long run times and failures). But I would refer you to my discussion in the other thread so as to not clog this thread.

What machine are we talking about anyway? You have 5 computers active in the last 30 days, only 1 is running Rosetta and at the moment you are running just 4.2 tasks like me.

And what was the last Python you completed successfully?
ID: 102959 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Previous · 1 · 2

Message boards : Number crunching : 1.03 (vbox64) is out for rosetta python projects



©2024 University of Washington
https://www.bakerlab.org