Message boards : Number crunching : Computers not found / hardly any tasks
Author | Message |
---|---|
fritz Joined: 17 Apr 20 Posts: 3 Credit: 1,626,120 RAC: 0 |
Hi, I'm new here and my company wants to dedicate a few hundred cores to this project. We have hundreds of RPI3B+ in stock and, until we ship them to our customers, we want to let them run Rosetta@home. We're using the https://foldforcovid.io/ image provided by balena.io. So far I've set up 13 RPIs, but on the "Your computers" page only 3 of them are visible: https://boinc.bakerlab.org/rosetta/hosts_user.php?sort=rpc_time&rev=0&show_all=1&userid=2143308 I assume the computer information can be found in the global_prefs_override.xml file. If I look into it, I see e.g. <host_cpid> and <external_cpid>. I assume these would be the computer ID? Interestingly, some devices share the very same IDs and some don't. Why is that? I just added the weak account project ID to all of them, so they are all gathered together into one account / team. Also, only 2 or so ever received any computation tasks, and just one was credited. I would be very glad if someone could find time to help me set this up correctly, so we can increase the fleet next week! Best regards Fritz |
Grant (SSSF) Joined: 28 Mar 20 Posts: 1677 Credit: 17,762,257 RAC: 22,840 |
This sounds very much like a problem someone had over at Seti with a whole bunch of Pis they wanted to use. It appeared they were all processing work, but only a few of them were showing up on their account. It doesn't look as though the issue was ever resolved, but if you check out the thread, something there may be of use. When you attach a system to BOINC, it should get its very own ID number. There are cases where, if there are communication issues between a system and the servers, a system can end up with a new ID number. But how you can have a bunch of systems and only a few of them show up, I've no idea. And I've no idea how some systems could get the same IDs. It also appears to be an issue with Pis only. Many others in the past have had setups of dozens (even hundreds) of systems on their account without this problem occurring. Good luck. Multiple computers setup? Grant Darwin NT |
Mod.Sense Volunteer moderator Joined: 22 Aug 06 Posts: 4018 Credit: 0 RAC: 0 |
Is the uniqueness of the host lost when you duplicate a working setup from one to another? Does the server actually think more than one of them are the same system? An audit of the active WUs ought to prove if something like that were the case. Rosetta Moderator: Mod.Sense |
Bryn Mawr Joined: 26 Dec 18 Posts: 390 Credit: 12,073,857 RAC: 4,165 |
"Is the uniqueness of the host lost when you duplicate a working setup from one to another? Does the server actually think more than one of them are the same system? An audit of the active WUs ought to prove if something like that were the case." In which case, would changing the system name separate them in the eyes of the server? |
fritz Joined: 17 Apr 20 Posts: 3 Credit: 1,626,120 RAC: 0 |
Hi, the balena concept allows the duplication of the base system, but every system generates its own unique hostname on first boot and can be maintained from a web UI dashboard. (Think of it as a Raspbian image: everyone downloads the same one, but once installed, each system is unique.) In short, I did not duplicate a working system; I just used the same base image to deploy to a number of RPIs. Thanks |
Grant (SSSF) Joined: 28 Mar 20 Posts: 1677 Credit: 17,762,257 RAC: 22,840 |
Hi, How did you install BOINC on those systems and then attach them to projects? Grant Darwin NT |
fritz Joined: 17 Apr 20 Posts: 3 Credit: 1,626,120 RAC: 0 |
Hi Grant, I've used the setup here: https://github.com/balenalabs/rosetta-at-home Simplified, it is a Docker container with a custom install script for the BOINC client. On the RPI (because of the 1GB RAM limit), it is then started as: boinc --allow_remote_gui_rpc --fetch_minimal_work Interestingly, yesterday evening more computers started to get work. Now I have 6 listed, of which 2 still have 0 credit, one still has a total credit of 25 (now the 3rd day without change), and the other 3 have total credits of 4, 135 and 242. Very strange distribution. Cheers Fritz |
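One way to follow up on the question of whether several devices present the same identity is to collect the XML file that holds the `<host_cpid>` and `<external_cpid>` tags from each device and compare the values. The following is only a minimal sketch under stated assumptions: the device names and ID values are made up, the `hostid` tag is an assumed extra field, and the helper names `extract_ids` and `find_shared` are hypothetical. It does not say which file the tags live in or which tag the server uses to tell hosts apart.

```python
import re
from collections import defaultdict

# Tags to look for. host_cpid and external_cpid are the ones mentioned in the
# thread; hostid is an assumption and may be absent from your files.
TAGS = ("host_cpid", "external_cpid", "hostid")


def extract_ids(xml_text):
    """Return {tag: first value found, or None} for each tag in TAGS."""
    ids = {}
    for tag in TAGS:
        match = re.search(rf"<{tag}>\s*([^<\s]+)\s*</{tag}>", xml_text)
        ids[tag] = match.group(1) if match else None
    return ids


def find_shared(devices, tag="host_cpid"):
    """Group device names by the value of `tag`; keep values seen more than once.

    `devices` maps a device name (hypothetical) to the XML text read from it.
    """
    groups = defaultdict(list)
    for name, xml_text in devices.items():
        value = extract_ids(xml_text)[tag]
        if value is not None:
            groups[value].append(name)
    return {value: sorted(names) for value, names in groups.items() if len(names) > 1}


if __name__ == "__main__":
    # Made-up sample data for illustration only.
    sample = {
        "pi-01": "<host_cpid>aaa111</host_cpid><external_cpid>u1</external_cpid>",
        "pi-02": "<host_cpid>aaa111</host_cpid><external_cpid>u1</external_cpid>",
        "pi-03": "<host_cpid>bbb222</host_cpid><external_cpid>u1</external_cpid>",
    }
    print(find_shared(sample, "host_cpid"))
```

With the sample data, `pi-01` and `pi-02` are reported as sharing one `host_cpid`. Comparing such a list with the hosts shown on the "Your computers" page is one possible form of the audit suggested earlier in the thread.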
©2024 University of Washington
https://www.bakerlab.org