Rosetta running on ARM platforms

Message boards : Number crunching : Rosetta running on ARM platforms

To post messages, you must log in.

Previous · 1 · 2 · 3 · Next

AuthorMessage
lakotamm

Send message
Joined: 28 Jun 19
Posts: 22
Credit: 171,192
RAC: 0
Message 93428 - Posted: 4 Apr 2020, 20:49:18 UTC

Reading this thread I can see that my RPI3 will not be useful... Fair enough, I can have one device doing Einstein.

If you find a way to use it, I will be happy to set it up.
ID: 93428 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Profile p3d-cluster

Send message
Joined: 24 Oct 07
Posts: 3
Credit: 25,319,228
RAC: 0
Message 93434 - Posted: 4 Apr 2020, 21:49:16 UTC - in response to Message 35867.  
Last modified: 4 Apr 2020, 22:00:38 UTC

Let's start a conversation about ARM.


Yes!

I have several Odroid XU4, C2 and 4 x N2 crunching for BOINC.
The N2s with 4GB RAM I attached to Rosetta.

I had only purchased 8GB eMMC for these, which had never been an issue.
Now with Rosetta consuming >500MB for the project directory as well as 862MB per each slot, this wouldn't work for even one WU with the little space that was free.

I put the BOINC data directory on an additional SD card, a USB thumb drive as well as an NFS share served from a NAS. Each works well with 3-4 WUs in parallel so far.
Sometimes they hit the max. memory allowance (95% of 4GB), then only 3 WUs run.
The rest of the cores are crunching the way smaller Tn-Grid WUs, so there are no lost CPU cycles ;-)

I'll also try and use an Odroid C2 with 2GB, maybe it can run 1-2 WUs, lets see...

edit:
Thats not going to work on Odroid C2:
Sat 04 Apr 2020 23:59:02 CEST | Rosetta@home | Message from server: Rosetta for Portable Devices needs 1907.35 MB RAM but only 1554.63 MB is available for use.
ID: 93434 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
bkil
Avatar

Send message
Joined: 11 Jan 20
Posts: 97
Credit: 4,433,288
RAC: 0
Message 93436 - Posted: 4 Apr 2020, 22:16:39 UTC - in response to Message 93434.  

You may consider using a file system with transparent compression like btrfs (not sure if ZFS works in such a tight case).
ID: 93436 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Profile p3d-cluster

Send message
Joined: 24 Oct 07
Posts: 3
Credit: 25,319,228
RAC: 0
Message 93437 - Posted: 4 Apr 2020, 22:26:24 UTC

These Odroids have only a single / file system created during install, that automatically enlarges itself to maximum size of the underlying device during first boot.
Hard to squeeze in extra file systems :-)
I'm OK with the current setup, maybe I'll move all N2 BOINC data directories to the NAS NFS. Being lazy, I'm most likely only doing that once the SD or thumb drives fail :-D
ID: 93437 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
MarkJ

Send message
Joined: 28 Mar 20
Posts: 72
Credit: 25,010,478
RAC: 53
Message 93448 - Posted: 5 Apr 2020, 1:16:53 UTC - in response to Message 93234.  
Last modified: 5 Apr 2020, 1:25:52 UTC

Does this include Raspberry Pis? If so, do you know which models/OS?

Pi4B, preferably the 4GB version. You’ll have to put it into 64 bit mode (aarch64). You can use Raspbian or Ubuntu. Rosetta tasks can use up to 1.5GB of memory when running so you need to limit how many run at a time.

See MarksRpiCluster for how to get a Pi4 with Raspbian going with Rosetta.
BOINC blog
ID: 93448 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
bkil
Avatar

Send message
Joined: 11 Jan 20
Posts: 97
Credit: 4,433,288
RAC: 0
Message 93461 - Posted: 5 Apr 2020, 7:01:23 UTC - in response to Message 93437.  

Not sure what initramfs it has, but if it supports that file system you can convert the whole root from your PC.
ID: 93461 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
CallMeFoxie

Send message
Joined: 22 Mar 20
Posts: 8
Credit: 152,280
RAC: 0
Message 93950 - Posted: 9 Apr 2020, 8:49:01 UTC

Too bad it requires 1900MB -ish, as I have a cluster of 8x Pine64+ (quadcore Cortex, aarch64, 2GB RAM) but due to how the memory is laid out I have about 1850MB after booting up available :( cannot crunch even 1 - 2 tasks. And adding a small swap gets ignored unsurprisingly.
ID: 93950 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
PorkyPies

Send message
Joined: 6 Apr 20
Posts: 45
Credit: 1,650,779
RAC: 0
Message 94024 - Posted: 10 Apr 2020, 0:04:52 UTC - in response to Message 93950.  

Too bad it requires 1900MB -ish, as I have a cluster of 8x Pine64+ (quadcore Cortex, aarch64, 2GB RAM) but due to how the memory is laid out I have about 1850MB after booting up available :( cannot crunch even 1 - 2 tasks. And adding a small swap gets ignored unsurprisingly.

It doesn't need 1.9GB. The highest I've seen my Pi4's using has been 918MB. We have some people using the Pi4 2GB model so it shouldn't be an issue. You'll only be able to run one at a time but given you have 8 Pine64's that is 8 tasks you could be running.
MarksRpiCluster
ID: 94024 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
bkil
Avatar

Send message
Joined: 11 Jan 20
Posts: 97
Credit: 4,433,288
RAC: 0
Message 94051 - Posted: 10 Apr 2020, 10:11:42 UTC - in response to Message 93950.  
Last modified: 10 Apr 2020, 10:13:07 UTC

Not sure whether you could increase your available memory a bit by reducing video card allocation (not familiar with that platform).

I myself face the very same problem on a phone and a netbook. I've patched out the check from the boinc source code and now it works correctly... zram with deflate compression does wonders along with a little swap, but they don't use more than 0.5-1GB anyway.

They check whether you have 2000000000 bytes available, not sure how much thought went into determining this limit.
ID: 94051 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
CallMeFoxie

Send message
Joined: 22 Mar 20
Posts: 8
Credit: 152,280
RAC: 0
Message 94157 - Posted: 11 Apr 2020, 14:25:08 UTC - in response to Message 94024.  

Too bad it requires 1900MB -ish, as I have a cluster of 8x Pine64+ (quadcore Cortex, aarch64, 2GB RAM) but due to how the memory is laid out I have about 1850MB after booting up available :( cannot crunch even 1 - 2 tasks. And adding a small swap gets ignored unsurprisingly.

It doesn't need 1.9GB. The highest I've seen my Pi4's using has been 918MB. We have some people using the Pi4 2GB model so it shouldn't be an issue. You'll only be able to run one at a time but given you have 8 Pine64's that is 8 tasks you could be running.


yup the problem is that when it tries to download a task it checks for available memory. I might patch it out as others did.

No idea how to change GPU memory on this platform either, tried googling with no luck tbh. And on top of GPU memory there's another 165MB reserved by kernel for some peripherals and I have no idea which ones, I removed most of the drivers I didn't even need and nothing in the DTBs. :(

time to go into the source of BOINC!
ID: 94157 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
bkil
Avatar

Send message
Joined: 11 Jan 20
Posts: 97
Credit: 4,433,288
RAC: 0
Message 94160 - Posted: 11 Apr 2020, 14:55:12 UTC - in response to Message 94157.  
Last modified: 11 Apr 2020, 14:57:38 UTC

Watch out, as although I mostly see an RSS between 300-600MB, sometimes over a GB, I've now finished one that had 1.85GB RSS! Probably that wasn't all active set, but still something to keep in mind when sizing your RAM + zram + swap.

To patch it, basically this is what you need:
cat boinc-7.9.3+dfsg/debian/patches/allow_memory_overcommit 
Description: <short summary of the patch>
 TODO: Put a short summary on the line above and replace this paragraph
 with a longer explanation of this change. Complete the meta-information
 with other relevant fields (see below for details). To make it easier, the
 information below has been extracted from the changelog. Adjust it or drop
 it.
 .
 boinc (7.9.3+dfsg-5) unstable; urgency=medium
 .
   * Upload to unstable
   * Drop update-boinc-applinks, useless now?
     - lets re-evaluate the situation in case somebody complains (LP: #1765576)
Author: Gianfranco Costamagna <locutusofborg@debian.org>
Bug-Ubuntu: https://bugs.launchpad.net/bugs/1765576

---
The information above should follow the Patch Tagging Guidelines, please
checkout http://dep.debian.net/deps/dep3/ to learn about the format. Here
are templates for supplementary fields that you might want to add:

Origin: <vendor|upstream|other>, <url of original patch>
Bug: <url in upstream bugtracker>
Bug-Debian: https://bugs.debian.org/<bugnumber>
Bug-Ubuntu: https://launchpad.net/bugs/<bugnumber>
Forwarded: <no|not-needed|url proving that it has been forwarded>
Reviewed-By: <name and email of someone who approved the patch>
Last-Update: 2020-04-10

--- boinc-7.9.3+dfsg.orig/client/hostinfo_unix.cpp
+++ boinc-7.9.3+dfsg/client/hostinfo_unix.cpp
@@ -445,6 +445,7 @@ static void parse_meminfo_linux(HOST_INF
         }
     }
     fclose(f);
+    host.m_nbytes = 3000000000; // hack for Rosetta@home
 }
 
 // Unfortunately the format of /proc/cpuinfo is not standardized.


Here are some notes you may find helpful:
sudo vi /etc/apt/sources.list # deb-src ... universe
sudo apt update
sudo apt install -fy fakeroot
sudo apt build-dep -fy boinc-client

apt-get -b source boinc # maybe we can drop the -b here
cd boinc-7.9.3+dfsg
vi client/hostinfo_unix.cpp
dpkg-buildpackage -us -uc
sudo apt remove boinc-client boinc-manager libbionc7 boinc
dpkg -i ../*.deb # or whichever you need


A less intrusive solution would be to enable KSM and zram on the machine and create a qemu VM with an appropriate amount of memory, like 3GB. Of course this is only meaningful if the CPU has virtualization, but otherwise it would be very advantageous because this may potentially also provide deduplication between slots!

A third workaround without a need for recompilation would be to do a chroot and mock out /proc/meminfo there.
ID: 94160 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
CallMeFoxie

Send message
Joined: 22 Mar 20
Posts: 8
Credit: 152,280
RAC: 0
Message 94164 - Posted: 11 Apr 2020, 16:50:26 UTC

Thanks! Easier than me going through the sources :)

meanwhile the current tasks got crunched and new ones (rb_04_11_21299_20821) appeared, which require 400 - 600MB RAM. No need to patch out boinc client for those so for now I am fine :)
ID: 94164 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
PorkyPies

Send message
Joined: 6 Apr 20
Posts: 45
Credit: 1,650,779
RAC: 0
Message 94182 - Posted: 11 Apr 2020, 22:21:34 UTC
Last modified: 11 Apr 2020, 22:27:42 UTC

New 4.15 Rosetta app (for aarch64). After the download 3 started at once, the active LED on the Pi stayed on for a good 5 minutes indicating a huge amount of SD card activity. Memory is currently up to 736MB (each) but will probably go higher as they progress.
MarksRpiCluster
ID: 94182 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
PorkyPies

Send message
Joined: 6 Apr 20
Posts: 45
Credit: 1,650,779
RAC: 0
Message 94202 - Posted: 12 Apr 2020, 4:21:25 UTC - in response to Message 94182.  

New 4.15 Rosetta app (for aarch64). After the download 3 started at once, the active LED on the Pi stayed on for a good 5 minutes indicating a huge amount of SD card activity. Memory is currently up to 736MB (each) but will probably go higher as they progress.

It’s the data and other files that it has to unzip into the slot directory. The database I looked at contained approx 1GB of (uncompressed) files and the other was a bit smaller. That means when each task starts it will unzip approx 1.5GB into the slot directory. That’s isn’t going to do much for a SD card based system like the Pi. I think the Rosetta 4.15 shouldn’t be sent to the Portable Devices.
MarksRpiCluster
ID: 94202 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
CallMeFoxie

Send message
Joined: 22 Mar 20
Posts: 8
Credit: 152,280
RAC: 0
Message 94233 - Posted: 12 Apr 2020, 12:12:45 UTC
Last modified: 12 Apr 2020, 12:21:04 UTC

Today after the 4.15 update I got a sudden error on all the tasks :(

<stderr_out>
<![CDATA[
<message>
process exited with code 1 (0x1, -255)</message>
<stderr_txt>
command: ../../projects/boinc.bakerlab.org_rosetta/rosetta_4.15_aarch64-unknown-linux-gnu -silent_gz -mute all -s chainA_chainB_20_04_15_28_12.pdb -run:protocol jd2_scripting -jd2:dd_parser -parser:protocol local_docking_20_04_15_28_12.xml -out:nstruct 10000 -jd2:ntrials 100 -ex1 -ex2aro -beta -use_input_sc -in:file:native chainA_chainB_20_04_15_28_12.pdb -out:file:silent default.out -out:file:silent_struct_type protein -run:write_failures false -nstruct 10000 -cpu_run_time 28800 -watchdog -boinc:max_nstruct 600 -checkpoint_interval 120 -database minirosetta_database -in::file::zip minirosetta_database.zip -boinc::watchdog -run::rng mt19937 -constant_seed -jran 3988117

ERROR: Cannot determine file type. Current supported types are: PDB, CIF, SRLZ, MMTF
ERROR:: Exit from: src/core/import_pose/import_pose.cc line: 380
BOINC:: Error reading and gzipping output datafile: default.out
14:04:47 (453): called boinc_finish(1)

</stderr_txt>
]]>
</stderr_out>


I didn't have zip installed, I wonder if that was missing? Would be nice if there was some requirements page (or did I overlook it?)
ID: 94233 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
CallMeFoxie

Send message
Joined: 22 Mar 20
Posts: 8
Credit: 152,280
RAC: 0
Message 94240 - Posted: 12 Apr 2020, 13:30:57 UTC - in response to Message 94233.  

Today after the 4.15 update I got a sudden error on all the tasks :(
.....


it seems to have happened only with those tasks, robetta is still calculatong fine.
ID: 94240 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
dominik282

Send message
Joined: 25 Jun 17
Posts: 1
Credit: 199,675
RAC: 1,295
Message 94264 - Posted: 12 Apr 2020, 18:58:54 UTC

Hello everyone,
Does anyone know the following phenomenon and maybe a possible solution?
I have an Odroid XU4 with Android and as project WCG. When I add Rosetta@home as a second project, WUs will be downloaded just fine. But the moment the XU4 starts calculating on a Rosetta-WU, the calculation is stopped with the request to connect the XU4 to a power source. (Of course, this could not be displayed without power. ;-) ).

Rosetta cannot be removed afterwards. The phenomenon can also be reproduced with a newly formatted and setuped SD card.
If only WCG is running, the XU4 will calculate weeks or months just fine.

Does anyone have an idea?
Thank you very much ...
ID: 94264 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
oreggin

Send message
Joined: 26 Mar 20
Posts: 6
Credit: 193,059
RAC: 0
Message 95313 - Posted: 24 Apr 2020, 16:36:54 UTC

Its interesting. I have one RPi3B+ with 1Gigs of RAM and it crunched a lot of WUs but meanwhile R@H increased minimum RAM requirements from 850MB to ~1700MB and now it hasn't got WU-s.
I have also an OdroidC2 with 2Gis of RAM and it crunchign fine with 2 WUs in parallel around 50°C.
BTW my RPI4B+ (4Gig of RAM) crunching only one WU at a time because of stock heatshink and it runs around 55-60°C.
ID: 95313 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
lakotamm

Send message
Joined: 28 Jun 19
Posts: 22
Credit: 171,192
RAC: 0
Message 95318 - Posted: 24 Apr 2020, 19:02:15 UTC - in response to Message 95313.  

Its interesting. I have one RPi3B+ with 1Gigs of RAM and it crunched a lot of WUs but meanwhile R@H increased minimum RAM requirements from 850MB to ~1700MB and now it hasn't got WU-s.

My RPI 3B+ is still happily crunching.
I often get the message that 1,7GB is required, but in the end I always get WUs. But it already happened that new WUs did not come for 24h, so I have a 1 day buffer.

I have also an OdroidC2 with 2Gis of RAM and it crunching fine with 2 WUs in parallel around 50°C.

Is this due to RAM limitations?

BTW my RPI4B+ (4Gig of RAM) crunching only one WU at a time because of stock heatshink and it runs around 55-60°C.

Could it make more sense to downclock it and run more WUs in parallel? CPU cores usually become more efficient at lower frequencies.
ID: 95318 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Mod.Sense
Volunteer moderator

Send message
Joined: 22 Aug 06
Posts: 4018
Credit: 0
RAC: 0
Message 95333 - Posted: 25 Apr 2020, 4:27:34 UTC - in response to Message 94264.  

There are preferences about whether you allow computations while on battery power. Do you have the same preference settings on each BOINC project? Local settings override those from the projects too. Sometimes that is the simplest way to ensure you know which settings are taking precedence.
Rosetta Moderator: Mod.Sense
ID: 95333 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Previous · 1 · 2 · 3 · Next

Message boards : Number crunching : Rosetta running on ARM platforms



©2024 University of Washington
https://www.bakerlab.org