1)
Questions and Answers :
Unix/Linux :
Jobs seem to complete OK but have status 'abandoned'
(Message 96656)
Posted 20 May 2020 by loris Post: Thanks for the info regarding my computers being hidden. As far as installing on a cluster is concerned, I realize that is not what BOINC was designed for. However, since every node essentially behaves as an individual computer, I thought it wouldn't be too hard to get it to work. I'll try running a number of jobs serially and see how that goes.
2)
(Message 96646)
Posted 20 May 2020 by loris Post: In what way are my computers hidden? Regarding the cluster, the software is installed (via NFS) on all nodes of the cluster. The problem, I think, has more to do with the way I start the jobs via the scheduling system. Possibly it is because the scheduler could try to start multiple jobs on one node. Perhaps max_ncpus_pct then applies to all the jobs, so all but one get terminated.
3)
(Message 96645)
Posted 20 May 2020 by loris Post: "@loris, if you are submitting tasks to the Robetta server, these message boards are not the place to look for help." Where is the correct place? I thought this forum was for questions relating to "Installing and running BOINC on Unix and Linux".
4)
(Message 96623)
Posted 19 May 2020 by loris Post: I am not sure I understand your question, but I am trying to set things up so that each job I submit to the cluster fetches just a single R@H task. Currently I am starting single jobs by hand, a couple of minutes apart, but each job seems to cause the previous job to be abandoned.
5)
(Message 96202)
Posted 7 May 2020 by loris Post: Yes, I mean staggering. This does seem to be necessary, although for one of four jobs started one minute apart I still got:
    07-May-2020 08:57:57 [Rosetta@home] Not sending work - last request too recent: 0 sec
What version are you referring to? I have client version 7.16.5.
6)
(Message 96154)
Posted 6 May 2020 by loris Post: I changed the URL to https and a single job was subsequently completed and validated. However, of an array of 10 jobs started at the same time, 6 completed almost immediately with "exiting because no more results", but I think that is a different problem. I have already added some random delay to prevent too many requests for tasks happening at the same time, but perhaps this delay needs to be longer.
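For illustration only, the kind of start delay described in this post could be computed along the following lines. This is a minimal sketch, not the script actually used: the function name stagger_seconds, the JOB_INDEX variable, and the spacing and jitter values are all hypothetical placeholders, and JOB_INDEX stands in for whatever array index the resource manager provides.

```shell
#!/usr/bin/env bash
# Sketch: compute a per-job start delay so that array jobs do not all
# contact the project server at the same moment.
# All names and values here are illustrative assumptions.

# stagger_seconds INDEX SPACING [JITTER]
# Prints INDEX * SPACING plus a random offset in [0, JITTER).
stagger_seconds() {
    local index=$1 spacing=$2 jitter=${3:-0}
    local offset=0
    if [ "$jitter" -gt 0 ]; then
        # RANDOM is a bash built-in yielding an integer in 0..32767
        offset=$(( RANDOM % jitter ))
    fi
    echo $(( index * spacing + offset ))
}

# Hypothetical use in a batch script, before starting the client:
#   sleep "$(stagger_seconds "${JOB_INDEX:-0}" 120 30)"
#   boinc --no_gui_rpc --fetch_minimal_work --exit_when_idle --attach_project "${URL}" "${AUTH}"
```

Spacing jobs by index gives each one a distinct slot, while the jitter only spreads requests within that slot. Whether any particular spacing is long enough for the server's request limit is not something this sketch can settle.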
7)
(Message 96009)
Posted 4 May 2020 by loris Post: Hi, I am running jobs on a cluster via a resource manager. The batch script I use starts BOINC in the following manner:
    boinc --no_gui_rpc --fetch_minimal_work --exit_when_idle --attach_project ${URL} ${AUTH}
The jobs seem to complete OK and do consume CPU time on the cluster, and there are no errors in the client log. However, the status shown on the R@H website often seems to be 'abandoned'. Is the way I am calling BOINC incorrect?