Posts by Paul

21) Message boards : Number crunching : Compute Error - Ubuntu 18.04 (Message 89428)
Posted 21 Aug 2018 by Paul
Post:
All:

What is the fix for computation errors on Ubuntu 18.04? I have 2 servers that have yet to complete a 4.07 task since I upgraded to Ubuntu 18.04. I understand there is a problem with glibc 2.27, but there must be a fix other than creating virtual machines. My Ubuntu machines are all dedicated crunchers and I am getting hundreds of failed work units every day. I am glad they typically fail in the first 6 seconds, but it feels very inefficient to download and upload these work units.
22) Message boards : Number crunching : Error while computing - AMD Opteron (Message 88870)
Posted 12 May 2018 by Paul
Post:
I am running Ubuntu 16.04 LTS. I can try a project reset. How do I look at dmesg?

ldd version 2.23

64GB RAM
4 AMD Opteron 6176 Processors with 12 Cores each
250GB SSD

100% dedicated to Rosetta. Everything else runs fine including other Rosetta WUs

Problem started all at once. I did not reset the project as she is running 48 active WUs. Hate to waste all that progress.
23) Message boards : Number crunching : Error while computing - AMD Opteron (Message 88855)
Posted 11 May 2018 by Paul
Post:
All:

I have many work units failing with "Error while computing" after about 1 min of run time. All of my AMD Opteron cores are chewing through these WUs. All of them are rb_05_10_167_247__t000__0_C1_SAVE_ALL_OUT_IGNORE_THE_REST_600454_

I think I have 500 failed work units and growing.

examples:
https://boinc.bakerlab.org/workunit.php?wuid=898124245
https://boinc.bakerlab.org/workunit.php?wuid=898124145
https://boinc.bakerlab.org/workunit.php?wuid=898124156
24) Message boards : Number crunching : CPU App Performance (Message 88699)
Posted 14 Apr 2018 by Paul
Post:
I have Linux, Windows and Macs running the client. All of them work great. The Linux client might be a bit more efficient. I have never benchmarked two identical computers long term so I can't provide a true comparison. Hope you enjoy crunching for this project.
25) Message boards : Number crunching : Legacy CPU Performance (Message 87122)
Posted 24 Aug 2017 by Paul
Post:
I am really interested in the RAC per core for Rosetta on an AMD Opteron 6380. It has a large L2 cache and faster clock rate than the 6176 I already have. I can buy another server but I don't want to pay twice as much for the 6380 if it isn't twice as fast. The 6176 machines are very inexpensive.

Thx
26) Message boards : Number crunching : WUs estimated time way off to elapsed time (Message 87105)
Posted 19 Aug 2017 by Paul
Post:
When I look at the processes, they are getting 0% CPU so I think it is a bug in BOINC. Maybe it will be fixed soon.
27) Message boards : Number crunching : WUs estimated time way off to elapsed time (Message 87104)
Posted 19 Aug 2017 by Paul
Post:
It sounds like 7.2.42 is the fix I need. I am running on Linux so I know how to apt-get but I don't know how to ask for an older version. Is there a way to apt-get a specific version? How do I tell software updates that I don't want updates on that app?
28) Message boards : Number crunching : WUs estimated time way off to elapsed time (Message 87065)
Posted 13 Aug 2017 by Paul
Post:
I can't find any information on this bug and the problem is back. Can you tell me how to get this reported to the BOINC team?

In all the years I have run BOINC I have never seen this before. It is a dedicated cruncher so I have it set to 100% CPU utilization & 100% CPUs.
29) Message boards : Number crunching : Legacy CPU Performance (Message 87050)
Posted 12 Aug 2017 by Paul
Post:
It is interesting. I really like my multi-CPU systems and the fact that the older RAM is really inexpensive. A Threadripper with 32GB RAM is likely an expensive machine.

I can add a few more rack mount servers and then I am going to need to buy a 19" equipment rack & more power! The best part of the newer processors is the performance and the power savings.
30) Message boards : Number crunching : Legacy CPU Performance (Message 87047)
Posted 12 Aug 2017 by Paul
Post:
I am assuming 100% dedicated to R@H & dedicated cruncher 24 hours a day. I also assume at least 1GB RAM per thread.

Larry - great information. My first server has 48 cores & 64GB of RAM. I want to go with the Opteron 6300 series to get 64 cores but it looks like 64GB of RAM is plenty. The Xeon E7-4870 has 10 cores with hyperthreading for 20 threads per processor. I am wondering if 80 threads with 128GB of RAM is worth the extra cost. I would love to use newer processors but the machines get really expensive.

Maybe the new chips will push down the cost of some of these legacy processors.

I would also love to see work units use more RAM if that would make them run faster.
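
A rough sketch of the RAM-per-thread arithmetic behind the configurations above, in Python. The 1GB-per-thread floor is the assumption stated at the top of this post; the function name and labels are made up for illustration, and real work units may need more or less memory than this.

```python
def ram_per_thread_gb(total_ram_gb: float, threads: int) -> float:
    """Return GB of RAM per thread if every thread runs one task."""
    if threads <= 0:
        raise ValueError("threads must be positive")
    return total_ram_gb / threads

# Configurations mentioned in this post: (label, threads, RAM in GB)
configs = [
    ("first server, 48 cores", 48, 64),
    ("Opteron 6300 series, 64 cores", 64, 64),
    ("Xeon E7-4870, 80 threads", 80, 128),
]

MIN_GB_PER_THREAD = 1.0  # assumed rule of thumb, not a project requirement

for label, threads, ram_gb in configs:
    per_thread = ram_per_thread_gb(ram_gb, threads)
    verdict = "ok" if per_thread >= MIN_GB_PER_THREAD else "short"
    print(f"{label}: {per_thread:.2f} GB/thread ({verdict})")
```

By this measure all three configurations meet the 1GB floor, with the 64-core / 64GB option sitting exactly on it.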
31) Message boards : Number crunching : Legacy CPU Performance (Message 87042)
Posted 11 Aug 2017 by Paul
Post:
Electricity is about $0.10 / kWh here so it isn't a big deal. The Atom is 15W and the newer ones are even less.

I am looking at a server with 10 core Intel Xeon E7-4870 processors or AMD Opteron 6272 or Opteron 6378. It would be great if anyone has a good estimate as to the RAC per core with these processors. They are getting much more affordable on ebay so I want to know what to expect from each processor version. 500 per core on my Opteron 6176 really isn't enough.
32) Message boards : Number crunching : Legacy CPU Performance (Message 87038)
Posted 11 Aug 2017 by Paul
Post:
All:

I have a number of dedicated crunchers and I am always looking to expand my farm. I recently purchased a server with 4 AMD Opteron 6172 processors with 48 total cores. I also have an old Atom powered laptop and several Intel i5 and i7 devices. In a dedicated environment, I have some estimated performance levels. Can anyone add to this list?

RAC per Core:
Intel ATOM - 190
Intel i5 - 950
Intel i7 - 1250
AMD Opteron 6100 - 500

Maybe someone can explain how I can get this info from BOINCStats or one of the other statistic engines.
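
To show how the per-core numbers above turn into whole-machine estimates, here is a small Python sketch. It assumes RAC scales linearly with core count, which is only an approximation, and the function names are made up for illustration. The figures are my estimates from the list, not measured benchmarks.

```python
import math

# RAC-per-core estimates copied from the list above (dedicated crunchers).
RAC_PER_CORE = {
    "Intel Atom": 190,
    "Intel i5": 950,
    "Intel i7": 1250,
    "AMD Opteron 6100": 500,
}

def machine_rac(cpu: str, cores: int) -> int:
    """Estimated whole-machine RAC, assuming linear scaling with cores."""
    return RAC_PER_CORE[cpu] * cores

def cores_to_match(cpu: str, target_rac: float) -> int:
    """Smallest number of cores of `cpu` whose estimated RAC reaches target_rac."""
    return math.ceil(target_rac / RAC_PER_CORE[cpu])

opteron_server = machine_rac("AMD Opteron 6100", 48)
print(opteron_server)                               # 24000
print(cores_to_match("Intel i7", opteron_server))   # 20
```

So the 48-core Opteron box comes out at roughly 24,000 RAC, which about 20 i7 cores would match on these estimates.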
33) Message boards : Number crunching : WUs estimated time way off to elapsed time (Message 87010)
Posted 8 Aug 2017 by Paul
Post:
I think I got it fixed. I aborted several tasks that ran over but I also killed a few instances of minirosetta running but using 0% CPU. When I counted the minirosetta tasks I found 50 of them but I only have 48 cores. I killed the 2 tasks at 0%. So far everything has been back to normal for about 48 hours.

Wish I knew exactly what fixed it
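
For anyone who wants to repeat the count described above, here is a minimal Python sketch. It parses the text output of `ps -eo pid,pcpu,comm` and lists processes whose name contains "minirosetta", plus the ones reporting 0% CPU. The function names and the sample text are made up for illustration. Note that the %CPU figure from ps is averaged over the life of the process, so treat the result as a hint rather than proof that a task is stuck.

```python
import subprocess

def find_idle_tasks(ps_output: str, name: str = "minirosetta",
                    max_cpu: float = 0.0):
    """Return (matching_pids, idle_pids) parsed from `ps -eo pid,pcpu,comm` text."""
    matching, idle = [], []
    for line in ps_output.splitlines()[1:]:  # skip the header row
        parts = line.split(None, 2)
        if len(parts) < 3:
            continue
        pid, pcpu, comm = parts
        if name not in comm:
            continue
        try:
            cpu = float(pcpu)
        except ValueError:
            continue  # unexpected format, skip the row
        matching.append(int(pid))
        if cpu <= max_cpu:
            idle.append(int(pid))
    return matching, idle

def read_ps() -> str:
    """Run ps on the local machine and return its output (Linux)."""
    return subprocess.run(["ps", "-eo", "pid,pcpu,comm"],
                          capture_output=True, text=True, check=True).stdout

# Made-up sample output, for illustration only.
sample = """\
  PID %CPU COMMAND
 1201 99.3 minirosetta
 1202  0.0 minirosetta
 1300  0.0 sshd
 1203 98.7 minirosetta
"""

tasks, idle = find_idle_tasks(sample)
print(len(tasks), "tasks,", len(idle), "idle:", idle)
```

On a live machine, pass `read_ps()` instead of `sample` and compare the task count against the number of cores.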
34) Message boards : Number crunching : WUs estimated time way off to elapsed time (Message 86997)
Posted 6 Aug 2017 by Paul
Post:
All

I really need some help here. I have at least 5 cores that are idle because of this bug. I have never experienced this in the past. Is there a version of BOINC that does not have this bug? I hate to wait for 8 hours & abort tasks but I can restart BOINC every 8 hours.

Please help
35) Message boards : Number crunching : WUs estimated time way off to elapsed time (Message 86980)
Posted 5 Aug 2017 by Paul
Post:
Is BOINC Manager required?? I have BOINC Client set to launch at startup. If I close BOINC Manager and let the tasks continue to run will I avoid the bug? Can BOINC Client send & receive tasks without BOINC Manager??

Just looking for a workaround

Thx
36) Message boards : Number crunching : WUs estimated time way off to elapsed time (Message 86936)
Posted 1 Aug 2017 by Paul
Post:
I recently had to abort a number of very long-running tasks. Several jobs had run for over 24 hours and others were only at 3% after 12 hours.

These workunits are running on a server dedicated to Rosetta@Home running Linux.

Any ideas?

Task WorkUnit
931190869 840070369
930920244 839826801
930919923 839826521
930912635 839819841
930986536 839886162
37) Message boards : Number crunching : Larger Memory Models (Message 86823)
Posted 18 Jul 2017 by Paul
Post:
I looked at one of my machines this morning & it had at least 10 processes using more than 1GB of RAM each! I thought the project team just started throwing me larger processes. This is the first time I have seen my machine using more than 32GB of RAM. I love to see better resource utilization.
38) Message boards : Number crunching : Larger Memory Models (Message 86813)
Posted 14 Jul 2017 by Paul
Post:
Is there a way to request work units that require large memory models? I now have a few systems with more than 1GB RAM per core and would like to see that RAM put to good use. Currently the largest work unit I have is using about 650MB and most are using less than 400MB.
39) Questions and Answers : Unix/Linux : Rosetta@home on Raspberry Pi and other ARM devices (Message 78048)
Posted 19 Mar 2015 by Paul
Post:
Just to bump this thread.

Raspberry Pi 2 has a quad core processor. While it is still limited to 1GB of RAM, that limitation is likely to improve.

Is there any effort underway to support this tiny powerhouse of a platform?
40) Message boards : Number crunching : xbox 720/loop (Message 72355)
Posted 18 Feb 2012 by Paul
Post:
I still run Folding at Home on 3 PS3s. It is a great way to use the PS3 when it isn't playing a BluRay or some game.

R@H on XBox 720! How can we make it happen??





©2024 University of Washington
https://www.bakerlab.org