Posts by mmstick

1) Message boards : Number crunching : Not getting any python work (Message 103350)
Posted 16 Nov 2021 by Profile mmstick
Post:
With as many issues as the Python tasks have; with half of them begin unmanageable, or causing the system's OOM killer to assassinate them for using too much memory; not getting any should be considered a blessing. I've just opted to uninstall virtualbox on my Linux systems. There's simply no valid reason that BOINC projects should be using it on Linux.

We all know that virtualization is largely an inefficient waste of resources. That's especially true for VirtualBox compared to the Linux kernel's KVM/QEMU support. There are better solutions that exist today that would provide the same benefits -- virtual environments, namespaces, and containers -- without having to emulate an entire virtual machine. I'd rather wait for BOINC projects to start using these solutions.

You could argue about Python dependencies, but we live in an era where Python programmers have pip, virtualenv, and anaconda at their disposal. You could bundle your entire development environment into an OSTree or docker image, and execute them natively on a system using a bubblewrap chroot, or podman. Such that the software is being run in an isolated sandbox with no interference from the host OS. Root's not even required to achieve this.

Of course, I'd also argue that Python itself is not the best tool for distributed computing. 100 computers running a Python application will get the same computational output as 1 computer running a Rust application. As far as super simple scripting languages go, I'd give more of a pass to Julia because it at least leverages the most performant mathematics libraries while also performing JIT compilation of its scripts to something that's close to optimized machine code. WASM would also be an excellent target with its ability to compile on any platform architecture and optimize for the system's native CPU.
2) Message boards : Number crunching : Problems and Technical Issues with Rosetta@home (Message 103243)
Posted 14 Nov 2021 by Profile mmstick
Post:
Using an app_config to set the max-concurrent value will cause your system to endlessly request work until you've fully depleted the server of work units. I don't recommend doing so until this issue is fixed: https://github.com/BOINC/boinc/issues/4322
3) Message boards : Number crunching : Problems and Technical Issues with Rosetta@home (Message 103231)
Posted 13 Nov 2021 by Profile mmstick
Post:
I do constantly get the issue of having to abort Python units at 99.996% completion, even on my Ryzen 5700g desktop with 64 GB RAM, which seems to be good enough for running 8 python units simultaneously on each physical core. Have tried to limit the number of Python work units to 4 just in case so I can run 12 normal tasks in addition to that, but apparently using an app_config.xml to define max-concurrent work units causes BOINC to repeatedly ask for 12 work units every 30 seconds, so had to abort that attempt.
4) Message boards : Number crunching : Problems and Technical Issues with Rosetta@home (Message 103229)
Posted 13 Nov 2021 by Profile mmstick
Post:
I wish they'd use KVM/QEMU instead of Virtualbox for Linux. It's the much more efficient method of virtualization on Linux that doesn't require installing external DKMS modules since it's supported directly by the Linux kernel. That said, I don't see why we're even using virtualization when a sandboxed namespace does the job just as well. Anyway, call me when there's interest in seeking open source contributors to transition from Python to Rust.
5) Message boards : Number crunching : Problems and Technical Issues with Rosetta@home (Message 103200)
Posted 12 Nov 2021 by Profile mmstick
Post:
Anyone else not getting assigned tasks suddenly?

Requesting new tasks for CPU
Scheduler request completed: got 0 new tasks
No tasks sent


8-core Linux PC w/ 32 GB RAM
6) Message boards : Number crunching : Current issues with 7+ boinc client (Message 74771)
Posted 22 Dec 2012 by Profile mmstick
Post:
Yes, I was running POEM and WCG GPU projects and Rosetta@home at the same time. Not a single error.
7) Message boards : Number crunching : Current issues with 7+ boinc client (Message 74763)
Posted 20 Dec 2012 by Profile mmstick
Post:
I had no idea this was a problem. I've been crunching with my Radeon HD 7950 in World Community Grid and POEM@Home while doing Rosetta@home tasks and never had a single problem with invalidated or errored work units; Using BOINC v7 as well.


Umm not exactly:
549205051 499214705 9 Dec 2012 22:07:32 UTC 11 Dec 2012 7:06:39 UTC Over Client error Compute error 11,468.76 79.64 ---
549203504 499213284 9 Dec 2012 21:58:59 UTC 9 Dec 2012 23:27:03 UTC Over Validate error Done 580.03 --- ---

549209311 499218156 9 Dec 2012 22:45:08 UTC 14 Dec 2012 5:32:01 UTC Over Validate error Done 215.59 --- ---

And then a ton of units 'aborted by user'. I sent as far back as the stats I can see and you only had one valid unit that you credits for. You may have had nothing but success prior to what I can see, I have no idea, but you did have some problems too.

I still think the problem is based around the gpu and it's drivers, Chilean has two things that are contradictory there...his list says:
Thu 13 Dec 2012 07:17:00 EST | | No usable GPUs found
but further down he says "Yet my NVIDIA card (which is running GPUGRID)", so either they are not from the same pc or there IS a problem someplace!


Wrong, don't try to look at my stuff; I don't run this project on anything but an old laptop. I aborted all tasks about a week ago on my desktop after I switched completely to World Community Grid because it demands my entire CPU to keep my GPU fed (Note my RAC, a high end desktop CPU would not have a low RAC). The compute error was caused from restarting the client abruptly. All work units passed successfully until I switched.

I merely ran this project for one day on my high end desktop (FX-8120@4Ghz+HD7950 in POEM/WCG). Not a single task failed.

This has nothing to do with AMD GPUs as far as I am concerned, nor do I see why it would be involved with NVIDIA GPUs.
8) Message boards : Number crunching : Current issues with 7+ boinc client (Message 74752)
Posted 19 Dec 2012 by Profile mmstick
Post:
I had no idea this was a problem. I've been crunching with my Radeon HD 7950 in World Community Grid and POEM@Home while doing Rosetta@home tasks and never had a single problem with invalidated or errored work units; Using BOINC v7 as well.






©2024 University of Washington
https://www.bakerlab.org