Problems and Technical Issues with Rosetta@home

Message boards : Number crunching : Problems and Technical Issues with Rosetta@home

To post messages, you must log in.

Previous · 1 . . . 129 · 130 · 131 · 132 · 133 · 134 · 135 . . . 280 · Next

AuthorMessage
stratos412

Send message
Joined: 18 Mar 20
Posts: 12
Credit: 133,065
RAC: 480
Message 103222 - Posted: 13 Nov 2021, 7:40:35 UTC - in response to Message 103214.  
Last modified: 13 Nov 2021, 7:41:08 UTC

I am exerience the same problem as some of the other members.

"Rosetta@home | Message from server: VirtualBox jobs require hardware acceleration support. Your processor does not support the required instruction set."

VirtualBox is installed (version 6.1.12). Virtualization is enabled on BIOS and I can conifrm that is enabled on Win10 Task Manager too.
PC specs: https://boinc.bakerlab.org/rosetta/show_host_detail.php?hostid=5945561

Any ideas what is going wrong ?

It could mean that the CPU of your computer doesn't support virtualization, or you have it turned off in the BIOS settings.

You could mention what model of computer you are running, so someone can tell you how to reach the BIOS settings,



CPU Model: AMD Ryzen 5 3400G with Radeon Vega Graphics
MOTHERBOARD: ASRock A320M_HDV (AM4)

Virtualization is enabled on BIOS, I can confirm that. Check link

https://drive.google.com/file/d/1A1MjBVI291CL8Y5iTg2fzj-pDl-Z9Olc/view?usp=sharing
ID: 103222 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Profile Greg_BE
Avatar

Send message
Joined: 30 May 06
Posts: 5664
Credit: 5,711,666
RAC: 764
Message 103223 - Posted: 13 Nov 2021, 8:33:40 UTC - in response to Message 103221.  

Hi,

I am also now getting this message in the Event Log for Rosetta: "Message from server: VirtualBox jobs require hardware acceleration support. Your processor does not support the required instruction set." This apparently started in the last couple of days, maybe even today.

I have been running R@H for over 1.5 years, usually 24 hrs a day, with no problems, VirtualBox or otherwise. Could something have changed very recently with R@H to cause this? Does anyone have any idea (besides not have VB) what could suddenly cause this?

Thanks.

Doug



Did you go back into BIOS and tell it to allow virtualization?
When you update everything goes back to default.
ID: 103223 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Edgar_Berlin

Send message
Joined: 6 Apr 20
Posts: 2
Credit: 840,276
RAC: 0
Message 103225 - Posted: 13 Nov 2021, 9:14:48 UTC - in response to Message 103223.  

I am sorry, if it has been asked before, but here I found the only mention of "VirtualBox"..
I am running Boinc (now latest version 7.16.20, but it does not matter) Standalone on Windows 10.
Since some days I get the message:
"Rosetta@home: Notice from server
VirtualBox is not installed"
All my remaining Rosetta tasks finished and uploaded. However, I am not able to download new tasks. Response in the message log ist always:
"Notice from server: VirtualBox is not installed. Project requested delay..."

Is this a new technical requirement of Rosetta (to use the Virtual Box version) or is there any other change or problem in the infrastructure?

Thanks, Edgar
ID: 103225 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Profile Grant (SSSF)

Send message
Joined: 28 Mar 20
Posts: 1507
Credit: 14,957,731
RAC: 21,678
Message 103226 - Posted: 13 Nov 2021, 9:45:48 UTC - in response to Message 103225.  

Is this a new technical requirement of Rosetta (to use the Virtual Box version) or is there any other change or problem in the infrastructure?
We don't know whether it's just a case of all the last batch of Rosetta 4.20 work being done, and we're now waiting on more to be released, or if there is more but it's a server issue stopping it from being sent out.
Grant
Darwin NT
ID: 103226 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Edgar_Berlin

Send message
Joined: 6 Apr 20
Posts: 2
Credit: 840,276
RAC: 0
Message 103227 - Posted: 13 Nov 2021, 9:54:54 UTC - in response to Message 103226.  

Thank you for the response!
I just got a new task 16min ago and the "VirtualBox" message disappeared.
So message was a little bit misleading and all seems back to normal now.
ID: 103227 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Falconet

Send message
Joined: 9 Mar 09
Posts: 350
Credit: 1,049,304
RAC: 2,286
Message 103228 - Posted: 13 Nov 2021, 10:14:54 UTC - in response to Message 103227.  

Thank you for the response!
I just got a new task 16min ago and the "VirtualBox" message disappeared.
So message was a little bit misleading and all seems back to normal now.



VirtualBox is necessary for the Rosetta Python tasks but not the standard Rosetta 4.20 tasks.
ID: 103228 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Profile mmstick

Send message
Joined: 4 Dec 12
Posts: 8
Credit: 606,792
RAC: 0
Message 103229 - Posted: 13 Nov 2021, 13:20:52 UTC

I wish they'd use KVM/QEMU instead of Virtualbox for Linux. It's the much more efficient method of virtualization on Linux that doesn't require installing external DKMS modules since it's supported directly by the Linux kernel. That said, I don't see why we're even using virtualization when a sandboxed namespace does the job just as well. Anyway, call me when there's interest in seeking open source contributors to transition from Python to Rust.
ID: 103229 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Jim1348

Send message
Joined: 19 Jan 06
Posts: 881
Credit: 52,257,545
RAC: 0
Message 103230 - Posted: 13 Nov 2021, 13:33:05 UTC - in response to Message 103229.  

I wish they'd use KVM/QEMU instead of Virtualbox for Linux.

I am sure it would work better, since it can't work worse. It is getting practically impossible to run the pythons. The first problem is "Vm job unmanageable" suspensions, which occur on all of my machines no matter what steps I take (mainly limiting cores) to prevent it. You need to either wait a long time, or reboot to fix it.

But now the problem is that about half the pythons won't run at all. They get stuck at less than 1% CPU utilization, and I have to abort them.
I am moving away from interventionist projects on my machines, and the pythons are the next ones to go.
ID: 103230 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Profile mmstick

Send message
Joined: 4 Dec 12
Posts: 8
Credit: 606,792
RAC: 0
Message 103231 - Posted: 13 Nov 2021, 14:07:08 UTC - in response to Message 103230.  

I do constantly get the issue of having to abort Python units at 99.996% completion, even on my Ryzen 5700g desktop with 64 GB RAM, which seems to be good enough for running 8 python units simultaneously on each physical core. Have tried to limit the number of Python work units to 4 just in case so I can run 12 normal tasks in addition to that, but apparently using an app_config.xml to define max-concurrent work units causes BOINC to repeatedly ask for 12 work units every 30 seconds, so had to abort that attempt.
ID: 103231 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Jim1348

Send message
Joined: 19 Jan 06
Posts: 881
Credit: 52,257,545
RAC: 0
Message 103232 - Posted: 13 Nov 2021, 14:19:06 UTC - in response to Message 103231.  

I do constantly get the issue of having to abort Python units at 99.996% completion, even on my Ryzen 5700g desktop with 64 GB RAM, which seems to be good enough for running 8 python units simultaneously on each physical core.

It isn't a problem of memory, and you don't need to go to 99%.
If in the first five minutes they are less than 1% CPU utilization, you can abort them. I use BoincTasks to monitor that.
ID: 103232 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
doug

Send message
Joined: 28 Mar 20
Posts: 8
Credit: 1,360,190
RAC: 1,964
Message 103233 - Posted: 13 Nov 2021, 16:04:36 UTC - in response to Message 103223.  

Thanks for the reply.

I have not done that, nor have I ever had to do it in the past. I'm running Win10 with all the latest updates. In Task Manager, on the second (Performance) tab, at the bottom with all the CPU info, it says "Virtualization: Enabled". Does that address what you are asking about? If not, do you know where in Windows I can find the info you are asking for?

Thanks.

Doug




[/img]
ID: 103233 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Falconet

Send message
Joined: 9 Mar 09
Posts: 350
Credit: 1,049,304
RAC: 2,286
Message 103234 - Posted: 13 Nov 2021, 16:43:14 UTC - in response to Message 103233.  
Last modified: 13 Nov 2021, 16:44:38 UTC

Deleted.
ID: 103234 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Profile Greg_BE
Avatar

Send message
Joined: 30 May 06
Posts: 5664
Credit: 5,711,666
RAC: 764
Message 103236 - Posted: 13 Nov 2021, 19:39:03 UTC
Last modified: 13 Nov 2021, 19:39:49 UTC

Maybe try what I do for LHC ATLAS which is a very picky project and has a hard time running on single cores and such.

I have in the past wrote an app_config that forced it to run on just 4 cores and 1 task at a time.
Now I can set that in the web preferences of this project.

So maybe you can try that for Python. But being it falls under "Rosetta" it will apply to all tasks from RAH.
Another stupid thing from this project and you can not set this in the web preferences here either.
ID: 103236 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
.clair.

Send message
Joined: 2 Jan 07
Posts: 274
Credit: 26,399,595
RAC: 0
Message 103239 - Posted: 13 Nov 2021, 21:52:12 UTC
Last modified: 13 Nov 2021, 22:07:51 UTC

I tried to get RPP to run multithreaded with this app config :-

<app_config>
<app>
<name>rosetta_python_projects</name>
</app>
<app_version>
<app_name>rosetta_python_projects</app_name>
<plan_class>vbox64</plan_class>
<avg_ncpus>5</avg_ncpus>
</app_version>
</app_config>

but even though it shows on boinc manager as ` Running(5cpus) `
each RPP task runs 25 threads total, so unless the data they are crunching is very linier.
it don't actualy do it when looking at cpu graphs, any ideas as to what else could be in an app config to force it to use multi thread
or could it be hard coded in the VM not to??
or am I wasting my time trying :(

I changed it around from the one I use at cosmology@home

<app_config>
<app>
<name>camb_boinc2docker</name>
<max_concurrent>2</max_concurrent>
</app>
<app_version>
<app_name>camb_boinc2docker</app_name>
<plan_class>vbox64_mt</plan_class>
<avg_ncpus>7</avg_ncpus>
</app_version>
</app_config>
ID: 103239 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Profile robertmiles

Send message
Joined: 16 Jun 08
Posts: 1225
Credit: 13,908,055
RAC: 3,520
Message 103240 - Posted: 13 Nov 2021, 22:43:41 UTC - in response to Message 103239.  
Last modified: 13 Nov 2021, 22:46:18 UTC

I tried to get RPP to run multithreaded with this app config :-

[snip]

It's rare that you can make a program run multithreaded unless it's written to know how to do so.

Changing the app config file isn't enough if that's all you do.
ID: 103240 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Profile Greg_BE
Avatar

Send message
Joined: 30 May 06
Posts: 5664
Credit: 5,711,666
RAC: 764
Message 103241 - Posted: 14 Nov 2021, 0:37:20 UTC
Last modified: 14 Nov 2021, 0:38:58 UTC

<name>rosetta_python_projects</name>

That as far as I know is an internal naming of the type of task.
As far as I know all tasks fall under "rosetta"

I have not found a way to isloate python tasks.
ID: 103241 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
.clair.

Send message
Joined: 2 Jan 07
Posts: 274
Credit: 26,399,595
RAC: 0
Message 103242 - Posted: 14 Nov 2021, 0:54:20 UTC
Last modified: 14 Nov 2021, 1:46:48 UTC

I decided to have a go at it again, give the computer a full reboot [not out the door]
even if it is / was a case of knowing just enuf to make a big mess of it
I did get a some xml errors noted in event log,
I just keep bashing away at it till something happens :)
well
it did some thing . . . . .
I know it sounds like something from a Frankenstine video
because one of the `vboxheadless.exe` instances in win7 resource monitor is using 22% of cpu on 16 core cpu, [one cpu is only 6.25%]
could someone be mad enuf to try it @home and see what happens
only new tasks downloaded AFTER the app-config is in place will get the new settings config
ID: 103242 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Profile mmstick

Send message
Joined: 4 Dec 12
Posts: 8
Credit: 606,792
RAC: 0
Message 103243 - Posted: 14 Nov 2021, 1:11:05 UTC

Using an app_config to set the max-concurrent value will cause your system to endlessly request work until you've fully depleted the server of work units. I don't recommend doing so until this issue is fixed: https://github.com/BOINC/boinc/issues/4322
ID: 103243 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
.clair.

Send message
Joined: 2 Jan 07
Posts: 274
Credit: 26,399,595
RAC: 0
Message 103244 - Posted: 14 Nov 2021, 1:32:00 UTC - in response to Message 103243.  
Last modified: 14 Nov 2021, 1:33:02 UTC

Using an app_config to set the max-concurrent value will cause your system to endlessly request work until you've fully depleted the server of work units. I don't recommend doing so until this issue is fixed: https://github.com/BOINC/boinc/issues/4322

I have not run cosmo@home for several months , endless workfetch was stopped by them having a limit serverside on the number of workunits anyone was allowed to have
I have been reading the threads here on R@H with interest about that work fetch problem
ID: 103244 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Jim1348

Send message
Joined: 19 Jan 06
Posts: 881
Credit: 52,257,545
RAC: 0
Message 103245 - Posted: 14 Nov 2021, 2:54:04 UTC - in response to Message 103244.  

I have been reading the threads here on R@H with interest about that work fetch problem

I first ran into it several years ago on WCG. More recently, we had a discussion of it on LHC.
https://lhcathome.cern.ch/lhcathome/forum_thread.php?id=5720&postid=45308#45308

Also:
https://lhcathome.cern.ch/lhcathome/forum_thread.php?id=5726&postid=45384#45384

It has been reported to BOINC.
https://github.com/BOINC/boinc/issues/4322
ID: 103245 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Previous · 1 . . . 129 · 130 · 131 · 132 · 133 · 134 · 135 . . . 280 · Next

Message boards : Number crunching : Problems and Technical Issues with Rosetta@home



©2024 University of Washington
https://www.bakerlab.org