No progress & 1.04%?

Message boards : Number crunching : No progress & 1.04%?


Charlie

Joined: 25 Mar 06
Posts: 53
Credit: 424,472
RAC: 0
Message 13741 - Posted: 14 Apr 2006, 17:58:17 UTC

I had two WUs running for the last 3 hours and 5 hours respectively, which only showed 2.6% and 1.4% completion. These 2 WUs also showed 9-plus hours to supposedly complete. Is this a bug with the WUs or my machine? I have aborted them as they are eating up CPU time I can use on other WUs.


Thanks Again Charlie
Feet1st

Joined: 30 Dec 05
Posts: 1755
Credit: 4,690,520
RAC: 0
Message 13751 - Posted: 14 Apr 2006, 19:00:37 UTC
Last modified: 14 Apr 2006, 19:17:18 UTC

You haven't really given enough detail to be certain. But it looks like you are a new user, and your WUs are generally running in the 4-hour timeframe. The WU crunch time is configurable in the Rosetta preferences, so it's hard to guess how people have it set, or whether they've changed it after the work visible in their work history.

Some of those really long and complex protein work units can take longer than you are accustomed to. The percent complete is not an absolute number, especially with a short WU run time.

The topic is covered in the FAQs here. In particular the point that every WU must process at least 1 model, regardless of how long that takes. So, if this happens again, you should check the graphic and see if you're still on model 1. Model 1 is always going to show one point something percent, because it doesn't really know ahead of time what your total number of models is going to be. If it completes model 1 and it's exceeded your target runtime, then it will stop there and report the results. So, you can zoom from 1.xx to 100% at the end of processing model 1.

And so, some of the recent WUs have been for VERY long proteins, and it's taking longer than it used to just to complete that first model.

There are some threads in the crunching forums where you can read more about stuck WUs, and report any you run across.

Hope this helps.

Keep crunchin' Rosetta!

[edit] I forgot to mention, there were four series of WUs that they've asked people to go ahead and abort. Perhaps yours were in one of these series?
Add this signature to your EMail:
Running Microsoft's "System Idle Process" will never help cure cancer, AIDS nor Alzheimer's. But running Rosetta@home just might!
https://boinc.bakerlab.org/rosetta/
Charlie

Joined: 25 Mar 06
Posts: 53
Credit: 424,472
RAC: 0
Message 13763 - Posted: 14 Apr 2006, 19:54:15 UTC

Thanks, that helps a lot. All my settings are still set at the same times. Though looking at them, the ones I thought might be stuck all had the word "large" in the title. But all the info you gave me helps a lot to understand what's going on. Thanks again.


Charlie
Tom Leser

Joined: 17 Mar 06
Posts: 2
Credit: 80,126
RAC: 0
Message 14008 - Posted: 18 Apr 2006, 4:38:01 UTC

Is anyone else having this issue? It gets stuck on 1.04% and then Slooooooooowly progresses from here? In about an 8-hour time period it went up to 1.0422%.

This has been going on for me, on 3 computers, since the weekend, and it's happened to other users I know too.

I've tried everything from resetting project, to uninstalling, deleting the directory, and reinstalling. Nothing has worked yet.
Dimitris Hatzopoulos

Joined: 5 Jan 06
Posts: 336
Credit: 80,939
RAC: 0
Message 14011 - Posted: 18 Apr 2006, 5:02:40 UTC
Last modified: 18 Apr 2006, 5:03:01 UTC

Rosetta@home has been sending out some pretty big work-units, which take 2-4 hours (on P4-type CPUs) to complete even 1 Model (notice the Model/Step numbers in the graphics).

As long as it's running, let it complete.

Make sure you either have leave-in-mem-when-preempted=yes, or increase the time between switches (between BOINC projects) to e.g. 4hr.

Quoting a previous conversation:

"The "leave in memory" is no longer an issue, but Rosetta "check-points" at the end of each model (notice the Model/Step info in the screensaver). AMD_is_logical explained it very well.

Most WUs are small proteins, which take only ~10min per model.

Some recent ones are very big and take 2-4hr per model (on Pentium4!). So to finish such a WU, Rosetta needs to run on your PC for 2-4hrs, WITHOUT being unloaded from memory. If a PC unloads Rosetta every hour to run another project, it will never finish, as it'll start from scratch every time.

The surest way to run Big WUs would be to set "leave in memory when preempted"=YES."
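
To make the quoted point concrete, here is a rough Python sketch of that reasoning. It is not Rosetta or BOINC code: the function name and parameters are made up for illustration, and it assumes work is saved only at the end of a model, so being unloaded mid-model throws the partial work away.

```python
from typing import Optional


def slices_to_finish_model(model_minutes: float,
                           switch_minutes: float,
                           leave_in_memory: bool,
                           max_slices: int = 1000) -> Optional[int]:
    """Count the CPU time slices needed to finish one model.

    Returns None if the model is still unfinished after max_slices.
    Assumption (hypothetical rule): progress inside a model survives a
    project switch only when the application stays in memory.
    """
    done = 0.0
    for slice_number in range(1, max_slices + 1):
        done += switch_minutes            # CPU time granted in this slice
        if done >= model_minutes:
            return slice_number           # model finished, results saved
        if not leave_in_memory:
            done = 0.0                    # unloaded at the switch: start over
    return None


# A 3-hour model with a 1-hour switch interval:
print(slices_to_finish_model(180, 60, leave_in_memory=False))  # never finishes
print(slices_to_finish_model(180, 60, leave_in_memory=True))   # finishes in 3 slices
# Same model with a 4-hour switch interval, even without leave-in-memory:
print(slices_to_finish_model(180, 240, leave_in_memory=False))
```

Under these assumptions, either setting on its own is enough: keep the application in memory, or make the switch interval longer than the longest model.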

"I think the "odd" %-progress behaviour you're seeing might be because the WUs running on your PC can be very different. It can be apples and oranges. A model on a HBLR* WU might take 10min and a *_largescale_large_fullatom_relax_* might take 3hr.

Rosetta is very different than most other BOINC projects, which have more or less constant size WUs.

WU %-progress might not increase linearly with time, as AMD_is_logical / Snake_Doc said. Especially if you're using very short WU runtime.

The *_largescale_large_fullatom_relax_* WUs are very big and sometimes "Steps" remains at 0. Usually just one "Model" will fit in the 7200 seconds (2hr) timeframe, in which case the %-progress indicator may stay at e.g. 1.5% for 1-3 hours while computing the first model. When that model finishes, Rosetta realises that it can't run a second model per your WU-runtime settings (7200 sec might already have been exceeded), so it jumps to 100% and finishes.

I use 8-hr WU-runtimes (instead of 2hr default) and a big WU taking 2hr per Model might jump 0% -> 25% -> 50% -> 75% -> 100% in BOINC progress"
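
The jumps described above can be reproduced with a small Python sketch. This is only an illustration and not how Rosetta actually computes progress: `progress_steps` is a hypothetical helper, and it assumes progress is reported only when a model finishes, as elapsed time over target runtime, with at least one model always run.

```python
from typing import List


def progress_steps(model_seconds: int, target_seconds: int) -> List[int]:
    """Percent complete after each finished model (hypothetical rule)."""
    if model_seconds <= 0 or target_seconds <= 0:
        raise ValueError("times must be positive")
    steps: List[int] = []
    elapsed = 0
    while True:
        elapsed += model_seconds           # at least one model always runs
        if elapsed >= target_seconds:      # target reached: no further model
            steps.append(100)
            return steps
        steps.append(round(100 * elapsed / target_seconds))


# 2hr per model with an 8hr WU runtime: 25 -> 50 -> 75 -> 100
print(progress_steps(2 * 3600, 8 * 3600))
# 3hr per model with a 7200-second WU runtime: a single jump to 100
print(progress_steps(3 * 3600, 7200))
```

The sketch leaves out the small 1.xx% value shown while the first model is still running, since the thread does not say how that number is derived.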

Best UFO Resources
Wikipedia R@h
How-To: Join Distributed Computing projects that benefit humanity
Moderator9
Volunteer moderator

Joined: 22 Jan 06
Posts: 1014
Credit: 0
RAC: 0
Message 14029 - Posted: 18 Apr 2006, 12:05:50 UTC
Last modified: 18 Apr 2006, 18:51:10 UTC

This thread was moved from the science forum and merged with another on the same subject.
Moderator9
ROSETTA@home FAQ
Moderator Contact
Tom Leser

Joined: 17 Mar 06
Posts: 2
Credit: 80,126
RAC: 0
Message 14030 - Posted: 18 Apr 2006, 12:18:35 UTC
Last modified: 18 Apr 2006, 12:23:13 UTC

I created a thread yesterday, but now it's HIDDEN for some reason "for administrative purposes".

Since the weekend my Rosetta units won't go past 1.04%. I'll let it run all day long and it'll be at 1.0422%. I know others that had this problem too, and it's happened on my different PCs.

Whoops, just noticed my topic was moved to the number crunching forum.
Hoelder1in

Joined: 30 Sep 05
Posts: 169
Credit: 3,915,947
RAC: 0
Message 14035 - Posted: 18 Apr 2006, 13:24:29 UTC - in response to Message 14008.  

Is anyone else having this issue? It gets stuck on 1.04% and then Slooooooooowly progresses from here? In about an 8-hour time period it went up to 1.0422%.

This has been going on for me, on 3 computers, since the weekend, and it's happened to other users I know too.

I've tried everything from resetting project, to uninstalling, deleting the directory, and reinstalling. Nothing has worked yet.

If your stuck WUs are some of the ones listed in Rhiju's post (here) then they should be aborted (I had one of these myself a couple of hours ago).
Team betterhumans.com - discuss and celebrate the future - hoelder1in.org




©2024 University of Washington
https://www.bakerlab.org