Posts by Steve

1) Message boards : Number crunching : Problems and Technical Issues with Rosetta@home (Message 79410)
Posted 13 Jan 2016 by Steve
Post:
Hi All,

Been running Rosetta for a while and now encountering serious issues with near-endless or endless loops.
Normal running time is 6 hours on a task. And half of the WU's seem to adhere to that, however the other half is showing some weird behaviour :
1. Running forever without any estimated time left, going on for 20+ hours
as an example : nkid_1_3_2016_final3_0716_00058_0043.pdb343_TG_dez_fold_SAVE_ALL_OUT_322141_663_0
nkid_1_3_2016_final3_0692_00366_0042.pdb342_TG_dez_fold_SAVE_ALL_OUT_322134_678_0

2. Running forever, but with an estimated time left which keeps creeping up.
don't have examples here, I aborted them after 25+ hours of running.

This appears on a laptop. On my desktop, it seems to work well. Although I have other issues there with the scheduling of Rosetta.

Could you please investigate ?

Many thanks in advance !

Kind Regards,

B.E.

Hi B.E.

This looks very similar to my experience (see my recent posts and replies). May I ask: do you suspend/hibernate your laptop or do you shut down and then reboot?

My experience has been that some Rosetta tasks go 'waiting to run' or run with no remaining estimate on my system if I don't reboot for several days and they then don't complete for many hours; other Rosetta tasks complete with no issue.

If you are using suspend/hibernate (or just shutting the lid on the laptop) try rebooting it occasionally and see if that helps.

Best wishes
Steve
2) Message boards : Number crunching : Problems and Technical Issues with Rosetta@home (Message 79409)
Posted 13 Jan 2016 by Steve
Post:
A few things to consider:

Where did you set your preferences? Changes made in the BOINC Manager will override any web-based settings.

Double check the wording. In my version of BOINC Manager a box must be checked to keep tasks running while the computer is in use while you must select the “no” radio button to achieve the same thing using web-based prefs.

What I'm puzzled about is that BOINC is starting new tasks when older ones still are Waiting to Run...

This can happen if there isn’t enough memory to continue running a particular task. BOINC will set that one aside and try another. Rosetta tasks are among the most memory hungry tasks you will encounter in the BOINC world. So how much memory per core do you have and, more importantly, how much is BOINC allowed to use?

Could computer (not BOINC) sleep/hibernation settings be coming into play?

Thanks Snags - useful input. I have used local settings and the option window confirms that it's using those (it has a button to use prefs from the web but I haven't clicked that)

PC is a quad core with 12GB RAM, but it's running several large java-based services so memory typically runs around 80-90% used but with very little swapping. However as I'm not using the largest of those services most days I've now stopped that (releasing around 4GB) and will only run it when I need to access it. Rosetta tasks are usually under 200MB each in task manager so that should now mean there's plenty of memory available.

Making previously suggested changes seems to have improved things somewhat (only one overdue task waiting this morning) so I'll see if the latest change does any better.

I saw you had 12Gb RAM so didn't expect RAM to be an issue, but now I read this it is likely to have been a factor. My 8 concurrent tasks typically contribute 1.5GB out of 6.5Gb RAM in use, but I have 16Gb RAM total to utilise.


Well, a week or so on after those option changes and things are much better although not entirely clean - I have 3 tasks that were deadlined 10th Jan, and sat 'waiting to run' for several days while BOINC chose to start running other tasks rather than resume these waiting tasks [I still wonder if this is correct behaviour].

Task Manager typically shows 75% memory in use (9GB out of 12GB) so there's no longer any memory shortage and I have unchecked all the Suspend options in BOINC except CPU above 70% (which it pretty much never is).

I'm going to reboot now as this PC has been up since before Christmas, so I'll see if that causes any cleanup. *edit* After a reboot BOINC has started running my two oldest 'waiting' tasks (even though we are past the deadline of 10th Jan). It seems as though a reboot every couple of days is probably the answer.

cheers
Steve
3) Message boards : Number crunching : Problems and Technical Issues with Rosetta@home (Message 79361)
Posted 5 Jan 2016 by Steve
Post:
A few things to consider:

Where did you set your preferences? Changes made in the BOINC Manager will override any web-based settings.

Double check the wording. In my version of BOINC Manager a box must be checked to keep tasks running while the computer is in use while you must select the “no” radio button to achieve the same thing using web-based prefs.

What I'm puzzled about is that BOINC is starting new tasks when older ones still are Waiting to Run...

This can happen if there isn’t enough memory to continue running a particular task. BOINC will set that one aside and try another. Rosetta tasks are among the most memory hungry tasks you will encounter in the BOINC world. So how much memory per core do you have and, more importantly, how much is BOINC allowed to use?

Could computer (not BOINC) sleep/hibernation settings be coming into play?

Best,
Snags

Thanks Snags - useful input. I have used local settings and the option window confirms that it's using those (it has a button to use prefs from the web but I haven't clicked that)

PC is a quad core with 12GB RAM, but it's running several large java-based services so memory typically runs around 80-90% used but with very little swapping. However as I'm not using the largest of those services most days I've now stopped that (releasing around 4GB) and will only run it when I need to access it. Rosetta tasks are usually under 200MB each in task manager so that should now mean there's plenty of memory available.

Making previously suggested changes seems to have improved things somewhat (only one overdue task waiting this morning) so I'll see if the latest change does any better.

Best,
Steve
4) Message boards : Number crunching : Problems and Technical Issues with Rosetta@home (Message 79348)
Posted 2 Jan 2016 by Steve
Post:
I'm finding that although some tasks complete OK, many more go "waiting to run" and seem to stay that way. I've aborted those that are clearly long past their deadline date but the others just sit there with varying % done and elapsed times. Is this normal? I'd have expected long-past-deadline tasks to be dropped and cleaned up by BOINC (but maybe that takes longer than a week?)Or is there something weird about my PC?

Any advice would be welcome.

Thanks in advance.
Steve

Waiting to run only applies when other projects are prioritised ahead of Rosetta, but I notice your only other project is Malaria which has been out of tasks for some while, so I'm wondering if you have "suspend when computer is in use" checked in OptionsComputing Preferences. This should be unchecked.


Thanks for the suggestion, I've not got "suspend when comouter is in use" checked but I did have "suspend GPU ... when in use" checked so I've cleared that and also allowed tasks to stay in memory when suspended so I'll see if that helps.

I've also removed the dormant Malaria Control project (which I deactivated because it hogged the system) so BOINC only has one project to work on.

Will see how that goes.

Thanks for your response

Steve
5) Message boards : Number crunching : Problems and Technical Issues with Rosetta@home (Message 79347)
Posted 2 Jan 2016 by Steve
Post:
Hi,

I'm finding that although some tasks complete OK, many more go "waiting to run" ...


Waiting to run is not a flaw in a task. It simply means that the BOINC Manager has decided to run something else first. Sounds like perhaps you have several projects running and the BOINC Manager is still getting used to the mix and may have download too much work.

As to the deadlines you mentioned, yes the BOINC Manager attempts to run tasks that are in risk of missing their deadlines first. And once the deadline has passed, you may as well "abort" the task. But once things settle in, this should not happen.

Does your machine run BOINC on a fairly regular schedule? How many hours per day?


Understood. I should have said "seem to stay stuck on Waiting to Run with only a small percentage of work completed".

I have only Rosetta running (I did run MalariaControl for a while but found it swamped BOINC such that no other tasks would start, so I set it to run no new tasks and now only Rosetta is getting work to do)

This PC runs very little other work - it's my retired desktop machine, now acting as a baby fileserver and occasional test machine in my home office, hence I decided in November to run some BOINC work on it. It is set to run BOINC tasks 24 hours a day and I left it running over Xmas and New Year and today found about a dozen unfinished Waiting to Run tasks that were past their deadlines which I've now aborted.

What I'm puzzled about is that BOINC is starting new tasks when older ones still are Waiting to Run, but I'm going to try some compute preference changes as suggested in another reply and see if that works better.

Thanks for the response
Steve
6) Message boards : Number crunching : Problems and Technical Issues with Rosetta@home (Message 79314)
Posted 27 Dec 2015 by Steve
Post:
Hi,

I'm finding that although some tasks complete OK, many more go "waiting to run" and seem to stay that way. I've aborted those that are clearly long past their deadline date but the others just sit there with varying % done and elapsed times. Is this normal? I'd have expected long-past-deadline tasks to be dropped and cleaned up by BOINC (but maybe that takes longer than a week?)Or is there something weird about my PC?

Any advice would be welcome.

Thanks in advance.
Steve
7) Message boards : Number crunching : Problems and Technical Issues with Rosetta@home (Message 79244)
Posted 18 Dec 2015 by Steve
Post:
Thanks for all the responses, nice to know this thread is monitored :-)
OK, I'll be more patient and let them run - maybe reboot the PC every couple of days, which will restart BOINC manager and client.
Season's best wishes
Steve
8) Message boards : Number crunching : Problems and Technical Issues with Rosetta@home (Message 79236)
Posted 17 Dec 2015 by Steve
Post:
Hi, I'm experiencing the following:

Most Rosetta tasks complete in around 6 to 8 hours on my PC - but frequently I find one or two tasks stop showing an estimated remaining time [just --- in the column]and then they continue to run indefinitely. I let one run for over 36 hours, still not completed, and 'stuck' on about 4% done, but the % varies (never more than 20% though).

I accept that estimating is inexact - more of a guess sometimes? - but I've taken to aborting these apparent zombie tasks that are just eating CPU and seemingly not producing results after 12 hours or so.

Am I being too pessimistic? should I just let these run and run in the hope they will someday finish? or are these bad tasks?

If you need more details please let me know.

Thanks
Steve






©2025 University of Washington
https://www.bakerlab.org