Problems and Technical Issues with Rosetta@home

Grant (SSSF)
Joined: 28 Mar 20
Posts: 1725
Credit: 18,421,610
RAC: 20,216
Message 102749 - Posted: 20 Sep 2021, 5:09:51 UTC - in response to Message 102728.  

How can I make it stop? I have been a long time supporter of RH, but will disconnect from the project if it keeps wasting my bandwidth with 2GB failed downloads.

Temporarily STOP REQUESTING NEW WORK FOR THE PROJECT by setting NO NEW WORK for Rosetta on your PC(s) in BOINC until the project fixes the problem.
That means you will get no work at all, even though there are just under 13 million Rosetta 4.20 Tasks available.
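For anyone who prefers the command line, the same switch can be flipped with `boinccmd`, whose `--project <URL> nomorework` and `allowmorework` operations stop and resume work fetch for a single project. A minimal Python sketch, assuming `boinccmd` is on the PATH and can reach the local client; the project URL below is an assumption, so use the one your own client reports (for example via `boinccmd --get_project_status`):

```python
import shutil
import subprocess

# Assumed master URL; use whatever your client reports for Rosetta@home.
ROSETTA_URL = "https://boinc.bakerlab.org/rosetta/"


def work_fetch_command(project_url: str, allow: bool) -> list[str]:
    """Build the boinccmd call that stops or resumes work fetch for one project."""
    op = "allowmorework" if allow else "nomorework"
    return ["boinccmd", "--project", project_url, op]


def set_work_fetch(project_url: str, allow: bool) -> bool:
    """Run the command; return False if boinccmd is missing or reports an error."""
    if shutil.which("boinccmd") is None:
        return False
    result = subprocess.run(work_fetch_command(project_url, allow))
    return result.returncode == 0


if __name__ == "__main__":
    # Print the command instead of running it, so nothing changes by accident.
    print(" ".join(work_fetch_command(ROSETTA_URL, allow=False)))
```

Tasks already downloaded keep running; only new requests stop. Calling `set_work_fetch(ROSETTA_URL, allow=True)` reverses it once the project has fixed the downloads.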
Grant
Darwin NT
ID: 102749 · Rating: 0
Grant (SSSF)
Joined: 28 Mar 20
Posts: 1725
Credit: 18,421,610
RAC: 20,216
Message 102750 - Posted: 20 Sep 2021, 5:16:20 UTC - in response to Message 102745.  

I received an odd message today:

Rosetta@home: Notice from server
VirtualBox is not installed
9/19/2021 6:01:50 PM

It is correct that I do not have VirtualBox installed, and I have not used VirtualBox to run BOINC programs.
That was sorted out yesterday.
It started on the 18th, and when it did, it stopped those without VirtualBox from getting any Rosetta work, even work that doesn't need VirtualBox. Once they fixed it, there were no more notices about a lack of VirtualBox in Event Logs, and work started flowing again, as can be seen in the graph below.
Note in particular the big dip on the 18th.


Grant
Darwin NT
ID: 102750 · Rating: 0
Admin
Project administrator

Joined: 1 Jul 05
Posts: 5144
Credit: 0
RAC: 0
Message 102761 - Posted: 20 Sep 2021, 17:58:33 UTC

There is a researcher in the group who has submitted a large batch of VM jobs that require VirtualBox. More information is provided at https://boinc.berkeley.edu/wiki/VirtualBox.

This is the first large batch of "rosetta_python_projects" VM jobs but we hope to have more in the future. Since this is our first attempt at using BOINC's VM option, we will undoubtedly run into issues as we scale up. Please be patient as we try to resolve these issues. Our standard rosetta application is still being used but the number of available jobs can fluctuate as it has before depending on the computing demand from our researchers.

The VM app feature gives researchers much more flexibility to run scientific software that otherwise would not be possible on R@h. For example, the "rosetta_python_projects" app allows researchers to run deep-learning-based structure prediction pipelines, including RoseTTAFold and AlphaFold2, and other Python-based software packages like AIMNet, which is being used in the current batch of VM jobs to minimize potential microcycle therapeutics.
ID: 102761 · Rating: 0
Jim1348

Joined: 19 Jan 06
Posts: 881
Credit: 52,257,545
RAC: 0
Message 102763 - Posted: 20 Sep 2021, 20:06:43 UTC - in response to Message 102761.  

This is the first large batch of "rosetta_python_projects" VM jobs but we hope to have more in the future.

Very good. I have several machines with lots of memory waiting to do them.

BUT: You need to make them selectable from the regular Rosettas.
A simple checkbox would do, but even better would be the ability to select the number downloaded (or running) at a time.
See LHC, or Cosmology, or WCG for examples.
ID: 102763 · Rating: 0
Ole Pettersen

Joined: 4 Dec 10
Posts: 7
Credit: 11,846,331
RAC: 3,306
Message 102765 - Posted: 21 Sep 2021, 0:15:46 UTC - in response to Message 102745.  

It is just an informational message, hoping that you will install VBox to crunch more tasks. Thanks.
ID: 102765 · Rating: 0
Sid Celery

Joined: 11 Feb 08
Posts: 2141
Credit: 41,539,024
RAC: 10,411
Message 102768 - Posted: 21 Sep 2021, 1:19:33 UTC - in response to Message 102761.  

There is a researcher in the group who has submitted a large batch of VM jobs that require VirtualBox. More information is provided at https://boinc.berkeley.edu/wiki/VirtualBox

Virtualization and BOINC
BOINC uses virtualization to allow scientists to develop applications on their preferred operating system (usually Linux) and then run them on volunteered Windows and Mac computers. We call these VM apps.

BOINC's virtualization support uses a system called VirtualBox, which is open-source software maintained and distributed by Oracle. For more information about VirtualBox, please see their web site at https://www.virtualbox.org/

To run VM apps, your computer must have VirtualBox installed. Starting with BOINC version 7.2.28, the recommended BOINC installer for Windows includes VirtualBox as well. You can also install VirtualBox separately.

We recommend the VirtualBox version that is included in the recommended BOINC Windows installer, since newer VirtualBox versions do not work correctly for all projects.

If you are running Windows, it is essential that you install a version of BOINC that matches your version of Windows - use 64-bit BOINC on 64-bit versions of Windows, 32-bit BOINC on 32-bit versions of Windows. Otherwise, BOINC will not be able to detect the presence of VirtualBox.

Oh! So it's that thing I've been ignoring for several years... #Doh

Download BOINC + VirtualBox
ID: 102768 · Rating: 0
Tomcat雄猫

Joined: 20 Dec 14
Posts: 180
Credit: 5,386,173
RAC: 0
Message 102781 - Posted: 21 Sep 2021, 9:54:11 UTC - in response to Message 102768.  
Last modified: 21 Sep 2021, 9:54:22 UTC

aaaf-IDC_pp-FPR_pp-mNHM_pp-NMBEN3_pp_0_1737815_2_1
<core_client_version>7.16.11</core_client_version>
<![CDATA[
<message>
WU download error: couldn't get input files:
<file_xfer_error>
  <file_name>AIMNet_vm_v2.vdi</file_name>
  <error_code>-119 (md5 checksum failed for file)</error_code>
  <error_message>MD5 check failed</error_message>
</file_xfer_error>
</message>
]]>


There goes another one. I have yet to encounter a Python task that doesn't error out. Is it something wrong with my setup, or is there a problem with these tasks? I've tried the version bundled with BOINC and the latest version, still 100% error rate.
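Error -119 means the MD5 that the client computed over the downloaded file did not match the checksum it expected. To check a large file such as the `.vdi` by hand, it can be hashed in chunks so it never has to fit in memory. A minimal sketch using only Python's standard library; the expected checksum has to come from the project or client side, and the helper names here are illustrative, not part of BOINC:

```python
import hashlib


def md5_of_file(path, chunk_size=1 << 20):
    """Return the hex MD5 digest of a file, reading it in 1 MiB chunks."""
    digest = hashlib.md5()
    with open(path, "rb") as handle:
        # iter() with a sentinel keeps reading until read() returns b"".
        for chunk in iter(lambda: handle.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def matches_expected(path, expected_md5):
    """Compare a file's MD5 with an expected value, ignoring case and whitespace."""
    return md5_of_file(path) == expected_md5.strip().lower()
```

If two downloads of the same file give different digests, the corruption is happening in transit or on disk; if they agree with each other but not with the expected value, the problem is more likely on the server side.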
ID: 102781 · Rating: 0
lazyacevw
Joined: 18 Mar 20
Posts: 12
Credit: 93,576,463
RAC: 0
Message 102783 - Posted: 21 Sep 2021, 12:05:08 UTC

I feel like I am missing out on some new information or something. Do these new "rosetta_python_projects" tasks run natively on Linux? VirtualBox is only needed on Windows and Mac, correct? I was under the impression that all Windows BOINC installations were already containers or VMs on Windows or Macs.
ID: 102783 · Rating: 0
Falconet

Joined: 9 Mar 09
Posts: 354
Credit: 1,276,393
RAC: 1,656
Message 102784 - Posted: 21 Sep 2021, 12:28:43 UTC - in response to Message 102783.  

I feel like I am missing out on some new information or something. Do these new "rosetta_python_projects" tasks run natively on Linux? VirtualBox is only needed on Windows and Mac, correct? I was under the impression that all Windows BOINC installations were already containers or VMs on Windows or Macs.



The applications page says:

Linux running on an AMD x86_64 or Intel EM64T CPU 1.03 (vbox64)

Same as Windows and MacOS. No mention of a native app.
ID: 102784 · Rating: 0
dcdc
Joined: 3 Nov 05
Posts: 1832
Credit: 119,860,059
RAC: 6,147
Message 102786 - Posted: 21 Sep 2021, 12:54:04 UTC

I'm pretty sure it uses VBox regardless of the host OS. I guess that way it can ship with the correct dependencies etc.
ID: 102786 · Rating: 0
lazyacevw
Joined: 18 Mar 20
Posts: 12
Credit: 93,576,463
RAC: 0
Message 102787 - Posted: 21 Sep 2021, 14:37:28 UTC - in response to Message 102784.  
Last modified: 21 Sep 2021, 14:41:50 UTC

Interesting. I took a look at the applications page https://boinc.bakerlab.org/rosetta/apps.php and the VirtualBox page https://boinc.berkeley.edu/wiki/VirtualBox but I don't see anything that would indicate a definitive answer. My Linux systems do not have VirtualBox installed. The apps page does indicate they are running on Linux systems but they don't indicate if VirtualBox is or needs to be installed. The same goes for Windows in the apps page.

It was my understanding that Rosetta tasks were already written for Linux environments and so it was just easier to containerize Linux on Windows than re-write everything for Windows or Macs.

I haven't seen anything in my logs about downloading AIMNet_vm_v2.vdi and when I check out my boinc directory, I see minirosetta and 4.20 executables but I don't see anything about vbox64 or python.

I have 671 tasks in progress and I am showing 0 rosetta python tasks. I was just curious if there is anything I need to do to keep my systems up to date so that they can maximize processing capability for Rosetta.
ID: 102787 · Rating: 0
Falconet

Joined: 9 Mar 09
Posts: 354
Credit: 1,276,393
RAC: 1,656
Message 102788 - Posted: 21 Sep 2021, 14:44:07 UTC - in response to Message 102787.  

If VirtualBox isn't installed, your system won't download any work units or executables related to the vbox app.

The "(vbox64)" in "1.03 (vbox64)" indicates that this version of the pyrosetta app needs VirtualBox.
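A quick way to see whether VirtualBox is visible on a machine is to look for its `VBoxManage` command-line tool and ask it for its version. This is only a rough sketch and not how the BOINC client itself detects VirtualBox; it assumes `VBoxManage` is on the PATH, which is often not the case on Windows even when VirtualBox is installed:

```python
import shutil
import subprocess


def vboxmanage_version():
    """Return the VBoxManage version string, or None if it cannot be found or run."""
    exe = shutil.which("VBoxManage")
    if exe is None:
        return None
    try:
        out = subprocess.run([exe, "--version"], capture_output=True,
                             text=True, timeout=10, check=True)
    except (OSError, subprocess.SubprocessError):
        # Covers a missing binary, a non-zero exit status, and a timeout.
        return None
    return out.stdout.strip() or None


if __name__ == "__main__":
    version = vboxmanage_version()
    print(version if version else "VBoxManage not found on PATH")
```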
ID: 102788 · Rating: 0
YAG

Joined: 13 Oct 19
Posts: 7
Credit: 13,015,426
RAC: 0
Message 102795 - Posted: 21 Sep 2021, 20:42:57 UTC

Hello,

Why might I be receiving the message "Communication with VM Hypervisor failed"? The VBox log shows this error:
«Failed to create the VirtualBox object!
NS_ERROR_SOCKET_FAIL (0xC1F30200) IPC Daemon socket error.»

I have BOINC 7.16.16 and VirtualBox 6.1.26 r145957 running on Debian GNU/Linux 11 (bullseye) [5.10.0-8-amd64|libc 2.31 (Debian GLIBC 2.31-13)].
The boinc user is added to the vboxusers, vboxguests and libvirt groups.

Any clue about what could be happening? I have no experience with VirtualBox, I installed it just as a Rosetta@home requirement.
ID: 102795 · Rating: 0
Ole Pettersen

Joined: 4 Dec 10
Posts: 7
Credit: 11,846,331
RAC: 3,306
Message 102798 - Posted: 22 Sep 2021, 2:53:51 UTC - in response to Message 102795.  

I got this when I tried to open Oracle VM VirtualBox:

ID: 102798 · Rating: 0
dcdc
Joined: 3 Nov 05
Posts: 1832
Credit: 119,860,059
RAC: 6,147
Message 102799 - Posted: 22 Sep 2021, 6:40:34 UTC
Last modified: 22 Sep 2021, 6:41:06 UTC

ID: 102799 · Rating: 0
Jim1348

Joined: 19 Jan 06
Posts: 881
Credit: 52,257,545
RAC: 0
Message 102806 - Posted: 22 Sep 2021, 13:31:46 UTC - in response to Message 102787.  
Last modified: 22 Sep 2021, 13:37:56 UTC

Interesting. I took a look at the applications page https://boinc.bakerlab.org/rosetta/apps.php and the VirtualBox page https://boinc.berkeley.edu/wiki/VirtualBox but I don't see anything that would indicate a definitive answer. My Linux systems do not have VirtualBox installed. The apps page does indicate they are running on Linux systems but they don't indicate if VirtualBox is or needs to be installed. The same goes for Windows in the apps page.

It was my understanding that Rosetta tasks were already written for Linux environments and so it was just easier to containerize Linux on Windows than re-write everything for Windows or Macs.

You need VirtualBox for Linux too. There are different versions of Linux, with different libraries. It can cause problems on any project.

Even LHC uses VirtualBox on Linux for Theory and ATLAS, though they also have "native" apps for them. But the native apps need a "container" such as Singularity, which is even more complicated to set up than just installing VirtualBox. It would still be nice to have native versions for Rosetta too, if they can manage it, since native apps run a little more efficiently and probably need less memory.
ID: 102806 · Rating: 0
Greg_BE
Joined: 30 May 06
Posts: 5691
Credit: 5,859,226
RAC: 0
Message 102809 - Posted: 22 Sep 2021, 16:27:01 UTC
Last modified: 22 Sep 2021, 16:41:27 UTC

DEK/ADMIN/dcdc

What the heck is going on with your scheduler?
Or is there a bug in BOINC?

I have now 998 (What the *!&*#?) tasks sitting in my queue.
I lost 11 python tasks because something clogged my system with over 600 4.20 tasks
I am going to abort 900 tasks because there is no way in hell I can plow through that many.
That's 332 days of crunching. That's just beyond belief!!!

Plus that clogs up my system, because your project is limited to a certain number of CPUs while I share the rest with several other projects, and all your tasks have 8-hour run times. If I drop that to 4, that would still be over 5 months of work. And all these tasks were due on the 24th?
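The arithmetic behind those figures, as a quick check: 998 tasks at 8 hours each, run one at a time, is 7,984 hours. The `cores` parameter is an added assumption to show how the estimate shrinks when several tasks run in parallel; it is not a number from this post.

```python
def days_of_work(tasks, hours_per_task, cores=1):
    """Wall-clock days needed to finish `tasks`, running `cores` of them at once."""
    return tasks * hours_per_task / cores / 24


print(round(days_of_work(998, 8), 1))           # 332.7 days at the 8-hour target
print(round(days_of_work(998, 4), 1))           # 166.3 days (about 5.5 months) at 4 hours
print(round(days_of_work(998, 8, cores=8), 1))  # 41.6 days if 8 cores ran Rosetta
```

Either way, it is far more than could be returned by a deadline a couple of days out.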

Explain this to me.....

BOINC bug or your system bug?

----
Sent back 880 or so tasks... geez... I'd better go find a limiting command
ID: 102809 · Rating: 0
Jim1348

Joined: 19 Jan 06
Posts: 881
Credit: 52,257,545
RAC: 0
Message 102811 - Posted: 22 Sep 2021, 16:41:08 UTC - in response to Message 102809.  

What the heck is going on with your scheduler?
Or is there a bug in BOINC?

I have now 998 (What the *!&*#?) tasks sitting in my queue.

I have not had that problem since getting rid of the "max concurrent" in the app_config.xml, as we discussed earlier.
ID: 102811 · Rating: 0
Greg_BE
Joined: 30 May 06
Posts: 5691
Credit: 5,859,226
RAC: 0
Message 102812 - Posted: 22 Sep 2021, 17:20:46 UTC - in response to Message 102811.  
Last modified: 22 Sep 2021, 17:37:14 UTC

What the heck is going on with your scheduler?
Or is there a bug in BOINC?

I have now 998 (What the *!&*#?) tasks sitting in my queue.

I have not had that problem since getting rid of the "max concurrent" in the app_config.xml, as we discussed earlier.



I have to have max concurrent in order to limit the number of CPUs RAH uses. Otherwise my idea of splitting up my system so that every project has its own group of cores is out the door, and I run into the problem of projects dominating my system: some get all the work for days on end and others don't.

If RAH would do like LHC and allow ME to pick how many cores to give it, then I would not have to do max concurrent.

I need to find a command to limit the number of files downloaded.

OR would project_max_concurrent be any better than max_concurrent?
How about ncpus and take out max_concurrent?
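For reference, `max_concurrent` and `project_max_concurrent` both live in an `app_config.xml` file in the project's directory. A minimal sketch that generates one with Python's standard library; the app name `rosetta` and the limit of 4 are assumptions for illustration (the real app names are listed in `client_state.xml`). As the experience in this thread suggests, these settings cap how many tasks run at once, not how many get downloaded.

```python
import xml.etree.ElementTree as ET


def build_app_config(app_name, max_concurrent, project_max_concurrent=None):
    """Return app_config.xml text limiting how many tasks of one app run at once."""
    root = ET.Element("app_config")
    app = ET.SubElement(root, "app")
    ET.SubElement(app, "name").text = app_name
    ET.SubElement(app, "max_concurrent").text = str(max_concurrent)
    if project_max_concurrent is not None:
        # Optional cap across all of the project's apps combined.
        ET.SubElement(root, "project_max_concurrent").text = str(project_max_concurrent)
    ET.indent(root)  # pretty-print; requires Python 3.9 or newer
    return ET.tostring(root, encoding="unicode")


if __name__ == "__main__":
    # "rosetta" and 4 are placeholder values, not confirmed settings.
    print(build_app_config("rosetta", 4))
```

After saving the output as `app_config.xml` in the project folder, the client picks it up on restart or when told to re-read its config files.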
ID: 102812 · Rating: 0
Jim1348

Joined: 19 Jan 06
Posts: 881
Credit: 52,257,545
RAC: 0
Message 102813 - Posted: 22 Sep 2021, 17:51:13 UTC - in response to Message 102812.  

OR would project_max_concurrent be any better than max_concurrent?
How about ncpus and take out max_concurrent?

Unfortunately, project_max_concurrent won't work either.
https://lhcathome.cern.ch/lhcathome/forum_thread.php?id=5720&postid=45323#45323

You could try ncpus, but I think that is mainly for multi-threaded apps.

What I do is just create separate BOINC instances for each type of work unit when I need to limit them.
Then you can just set "use at most % of the processors" to limit it to what you want.

It is a bit of a pain, but actually simple enough once you do it the first time.
This more or less gives you all you need.
https://www.overclock.net/threads/guide-setting-up-multiple-boinc-instances.1628924/

I think that on Windows, they neglect to tell you how to start it up automatically, though you can do it manually easily enough.
I used Task Scheduler to start the BOINC client automatically, but don't have the details since I usually use Ubuntu, where it is a bit simpler.
I think you have to start up BOINC Manager separately in the "Programs/Startup" folder, but that was on Win7 and I am now on Win10.
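In outline, a second instance is just another copy of the client started with its own data directory and its own GUI RPC port. A rough sketch of the command line, assuming the client options `--allow_multiple_clients`, `--dir` and `--gui_rpc_port` as described in the BOINC client documentation; the directory and port below are made-up examples, and the linked guide covers the remaining setup:

```python
import shutil
import subprocess


def second_client_command(data_dir, rpc_port, exe="boinc"):
    """Build the command line for an extra BOINC client instance."""
    return [
        exe,
        "--allow_multiple_clients",        # let it run alongside the main client
        "--dir", str(data_dir),            # separate data directory for this instance
        "--gui_rpc_port", str(rpc_port),   # separate control port (the default is 31416)
    ]


def start_second_client(data_dir, rpc_port, exe="boinc"):
    """Start the extra client in the background; return None if the binary is missing."""
    if shutil.which(exe) is None:
        return None
    return subprocess.Popen(second_client_command(data_dir, rpc_port, exe))


if __name__ == "__main__":
    # Hypothetical directory and port, printed rather than executed.
    print(" ".join(second_client_command("/var/lib/boinc-rosetta", 31418)))
```

The extra instance can then be controlled with `boinccmd --host localhost:31418 ...`, and the "use at most % of the processors" setting applies to that instance alone.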
ID: 102813 · Rating: 0
©2024 University of Washington
https://www.bakerlab.org