Posts by loris

1) Questions and Answers : Unix/Linux : Jobs seem to complete OK but have status 'abandoned' (Message 96656)
Posted 20 May 2020 by loris
Post:
Thanks for the info regarding my computers being hidden.

As far as installing on a cluster is concerned, I realize that is not what BOINC was designed for. However, since every node essentially behaves as an individual computer, I thought it wouldn't be too hard to get it to work. I'll try running a number of jobs serially and see how that goes.
2) Questions and Answers : Unix/Linux : Jobs seem to complete OK but have status 'abandoned' (Message 96646)
Posted 20 May 2020 by loris
Post:
In what way are my computers hidden?

Regarding the cluster, the software is installed (via NFS) on all nodes of the cluster. The problem, I think, is more to do with the way I start the jobs via the scheduling system. Possibly it is to do with the fact that the scheduler could try to start multiple jobs on one node. Perhaps max_ncpus_pct then applies to all the jobs, so all but one get terminated.
3) Questions and Answers : Unix/Linux : Jobs seem to complete OK but have status 'abandoned' (Message 96645)
Posted 20 May 2020 by loris
Post:
@loris, if you are submitting tasks to the Robetta server, these message boards are not the place to look for help.


Where is the correct place? I thought this forum was for questions relating to "Installing and running BOINC on Unix and Linux".
4) Questions and Answers : Unix/Linux : Jobs seem to complete OK but have status 'abandoned' (Message 96623)
Posted 19 May 2020 by loris
Post:
I am not sure I understand your question but I am trying to set things up so that each job I submit to the cluster just fetches a single r@h task.

Currently I am starting single jobs by hand with a separation of a couple of minutes, but each job seems to cause the previous job to be abandoned.
5) Questions and Answers : Unix/Linux : Jobs seem to complete OK but have status 'abandoned' (Message 96202)
Posted 7 May 2020 by loris
Post:
Yes, I mean staggering. This does seem to be necessary although I still got

07-May-2020 08:57:57 [Rosetta@home] Not sending work - last request too recent: 0 sec

for one of four jobs started one minute apart.

What version are you referring to? I have client version 7.16.5.
6) Questions and Answers : Unix/Linux : Jobs seem to complete OK but have status 'abandoned' (Message 96154)
Posted 6 May 2020 by loris
Post:
I changed the URL to https, and a single job was subsequently completed and validated. However, of an array of 10 jobs started at the same time, 6 complete almost immediately with "exiting because no more results", but I think that is a different problem. I have already added some random delay to prevent too many requests for tasks happening at the same time, but perhaps this delay needs to be longer.
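
For illustration, here is a minimal sketch of what such a staggered start could look like in a batch script. It is not the script actually used. The helper name stagger_delay, the TASK_INDEX variable, and the spacing and jitter values are all assumptions made up for this example. Only the boinc command line is taken from the thread.

```shell
#!/bin/bash
# Hypothetical helper: print a start delay in seconds for array task number $1.
# Each task waits index * spacing seconds plus a random jitter of 0..jitter seconds.
# The defaults (120 s spacing, 30 s jitter) are illustrative, not project guidance.
stagger_delay() {
    local index=$1
    local spacing=${2:-120}
    local jitter=${3:-30}
    echo $(( index * spacing + RANDOM % (jitter + 1) ))
}

# TASK_INDEX is an assumed variable; substitute whatever index the
# resource manager exposes to array jobs.
delay=$(stagger_delay "${TASK_INDEX:-0}")
echo "would wait ${delay}s before starting the client"

# In a real batch script the wait and the client start would follow:
#   sleep "$delay"
#   boinc --no_gui_rpc --fetch_minimal_work --exit_when_idle --attach_project "${URL}" "${AUTH}"
```

Spacing the jobs by index rather than by a purely random delay guarantees a minimum gap between requests, while the jitter keeps jobs from lining up on exact multiples of the spacing.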
7) Questions and Answers : Unix/Linux : Jobs seem to complete OK but have status 'abandoned' (Message 96009)
Posted 4 May 2020 by loris
Post:
Hi,

I am running jobs on a cluster via a resource manager. The batch script I use starts BOINC in the following manner:

boinc --no_gui_rpc --fetch_minimal_work --exit_when_idle --attach_project ${URL} ${AUTH}

The jobs seem to complete OK and do consume CPU time on the cluster, and there are no errors in the client log. However, the status shown on the R@H website often seems to be 'abandoned'.

Is the way I am calling BOINC incorrect?






©2025 University of Washington
https://www.bakerlab.org