Message boards : Number crunching : Problems with Rosetta version 5.80
Author | Message |
---|---|
Ingemar Send message Joined: 28 Feb 06 Posts: 20 Credit: 1,680 RAC: 0 |
Please report problems with this version. Thanks! |
Jmarks Send message Joined: 16 Jul 07 Posts: 132 Credit: 98,025 RAC: 0 |
|
DJStarfox Send message Joined: 19 Jul 07 Posts: 145 Credit: 1,250,162 RAC: 0 |
Please report problems with this version. Thanks!
5.80 needs a lot more memory than previous betas. BOINC says "waiting for memory" on a 512 MB Linux system with 2 CPUs. This did not happen on previous versions of Rosetta. Is this a permanent change? One task runs, but the other (the second set of threads below) is waiting for memory.
%CPU %MEM    VSZ    RSS STAT START  TIME COMMAND
 100 43.5 356264 224188 RN   10:39 87:56 rosetta_beta_5.80_i686-pc-linux-gnu
 0.0 43.5 356264 224188 SN   10:39  0:00 rosetta_beta_5.80_i686-pc-linux-gnu
 0.0 43.5 356264 224188 SN   10:39  0:00 rosetta_beta_5.80_i686-pc-linux-gnu
 0.0 43.5 356264 224188 SN   10:39  0:00 rosetta_beta_5.80_i686-pc-linux-gnu
 0.1 37.7 320764 194128 SN   10:39  0:06 rosetta_beta_5.80_i686-pc-linux-gnu
 0.0 37.7 320764 194128 SN   10:39  0:00 rosetta_beta_5.80_i686-pc-linux-gnu
 0.0 37.7 320764 194128 SN   10:39  0:00 rosetta_beta_5.80_i686-pc-linux-gnu
 0.0 37.7 320764 194128 SN   10:39  0:00 rosetta_beta_5.80_i686-pc-linux-gnu |
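For anyone who wants to check the numbers in the ps output above, here is a minimal Python sketch that adds up the resident memory (RSS) of the two tasks. It assumes that rows with identical VSZ and RSS values are threads of the same task sharing one address space, so each task is counted once. The helper name `resident_kib_per_task` is made up for illustration and is not part of BOINC or Rosetta.

```python
# Rows copied from the ps output in the post above (header row dropped).
PS_ROWS = """\
100 43.5 356264 224188 RN 10:39 87:56 rosetta_beta_5.80_i686-pc-linux-gnu
0.0 43.5 356264 224188 SN 10:39 0:00 rosetta_beta_5.80_i686-pc-linux-gnu
0.0 43.5 356264 224188 SN 10:39 0:00 rosetta_beta_5.80_i686-pc-linux-gnu
0.0 43.5 356264 224188 SN 10:39 0:00 rosetta_beta_5.80_i686-pc-linux-gnu
0.1 37.7 320764 194128 SN 10:39 0:06 rosetta_beta_5.80_i686-pc-linux-gnu
0.0 37.7 320764 194128 SN 10:39 0:00 rosetta_beta_5.80_i686-pc-linux-gnu
0.0 37.7 320764 194128 SN 10:39 0:00 rosetta_beta_5.80_i686-pc-linux-gnu
0.0 37.7 320764 194128 SN 10:39 0:00 rosetta_beta_5.80_i686-pc-linux-gnu
"""


def resident_kib_per_task(rows):
    """Return one RSS value (KiB) per task.

    Assumption: rows that share the same VSZ and RSS are threads of a
    single task, so each distinct (VSZ, RSS) pair is counted only once.
    """
    seen = []
    for line in rows.strip().splitlines():
        fields = line.split()
        vsz, rss = int(fields[2]), int(fields[3])
        if (vsz, rss) not in seen:
            seen.append((vsz, rss))
    return [rss for _, rss in seen]


tasks = resident_kib_per_task(PS_ROWS)
total_kib = sum(tasks)
print(tasks, total_kib, round(total_kib / 1024, 1))
```

Under that assumption the two tasks together hold roughly 408 MiB resident, which matches the 43.5% + 37.7% in the %MEM column and leaves very little headroom on a 512 MB machine.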
Wits End Send message Joined: 16 Apr 07 Posts: 4 Credit: 29,477 RAC: 0 |
|
David Emigh Send message Joined: 13 Mar 06 Posts: 158 Credit: 417,178 RAC: 0 |
|
Rayburner Send message Joined: 4 Oct 05 Posts: 32 Credit: 16,518,823 RAC: 0 |
Hi! Two validate errors lately. Is there a special reason for that? https://boinc.bakerlab.org/rosetta/result.php?resultid=105570644 https://boinc.bakerlab.org/rosetta/result.php?resultid=104716132 |
Mod.Sense Volunteer moderator Send message Joined: 22 Aug 06 Posts: 4018 Credit: 0 RAC: 0 |
I moved Rayburner's post here. One of those was 5.78; the other was 5.80. Rosetta Moderator: Mod.Sense |
anders n Send message Joined: 19 Sep 05 Posts: 403 Credit: 537,991 RAC: 0 |
|
Mark Henderson Send message Joined: 24 May 06 Posts: 9 Credit: 643,001 RAC: 0 |
I had a compute error today on 5.80 and a watchdog termination on another yesterday using 5.78 on my AMD X2 4800. I have run Rosetta for a long time, and these are the first 2 errors I remember. |
The_Bad_Penguin Send message Joined: 5 Jun 06 Posts: 2751 Credit: 4,271,025 RAC: 0 |
Here we go again... (1he8__BOINC_CAPRI14_DOCK_FIXBACKBONE_POSE_LOOPS-1he8_-plexinmonomer__2083_1421_0) I did look at the screen at about 3 hours... I think it said model 1, step 513, and the percentage indicator was at 95.9x% - 96.xx% and increasing. Nothing was visibly moving in any of the graphic representations. Watchdog shut down... ~60+ credits requested for ~4 hours on a single core of a Core2Quad, 20 credits granted... |
Mod.Sense Volunteer moderator Send message Joined: 22 Aug 06 Posts: 4018 Credit: 0 RAC: 0 |
~60+ credits requested for ~4 hours on a single core of a Core2Quad, 20 credits granted... OK, great! I'm glad you were able to catch one. Assuming that others behave the same way (a bit of a stretch with only a single one observed, but it's all we have to go by)... the fact that it is still on model one is the reason why the task fails and only 20 credits are granted. If you had completed several models, then (at least, that is the design to my knowledge) these completed results would be reported back and credit issued for them. So one of my outstanding questions was, "Is the partial reporting of tasks that run for a while and then fail working properly?" And, based on your observation, it sounds like it is working as well as I would have expected. But the long-running single models are basically exhibiting a worst-case scenario where extensive time is spent and only 20 credits are issued. Wow, your task shows the score was stuck for 1,800 seconds. I take it Rhiju has increased the timeout for the watchdog. Rosetta Moderator: Mod.Sense |
Ingemar Send message Joined: 28 Feb 06 Posts: 20 Credit: 1,680 RAC: 0 |
It appears that some of the CAPRI docking runs get stuck and get terminated by the watchdog. The watchdog seems to do its job; the problem seems to be the simulations. This is the first time we have done large-scale tests on some new simulation modes, and we will have to analyze why some runs get stuck or crash. CAPRI (Critical Assessment of PRediction of Interactions) is a competition where we try to predict the structure of protein-protein complexes. We have a deadline for submission of our models to this competition coming up soon, and that's why you see so many Capri-something jobs. They will soon be out of the queue. And yes, we did increase the watchdog timeout. |
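To make the watchdog behaviour discussed here concrete, the following is a rough Python sketch of a "score stuck" check. It is not Rosetta's actual watchdog code. The class name `ScoreWatchdog`, the sampling scheme, and the sample values are assumptions for illustration; only the 1,800-second figure comes from this thread.

```python
class ScoreWatchdog:
    """Flags a run as stuck when its score has not changed for timeout_s seconds."""

    def __init__(self, timeout_s=1800.0):
        self.timeout_s = timeout_s
        self.last_score = None   # most recent distinct score seen
        self.last_change = None  # time (seconds) at which that score first appeared

    def observe(self, score, now):
        """Record a score sample taken at time `now`; return True if the run looks stuck."""
        if self.last_score is None or score != self.last_score:
            # The score moved (or this is the first sample): restart the timer.
            self.last_score = score
            self.last_change = now
            return False
        return (now - self.last_change) >= self.timeout_s


# Hypothetical samples as (time in seconds, score) pairs.
dog = ScoreWatchdog(timeout_s=1800.0)
samples = [(0, -10.0), (600, -12.5), (1200, -12.5), (2400, -12.5)]
verdicts = [dog.observe(score, now) for now, score in samples]
print(verdicts)
```

In this sketch the timer restarts every time the score changes, so a slow but progressing model is never flagged; only a score that sits unchanged for the full timeout triggers termination.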
The_Bad_Penguin Send message Joined: 5 Jun 06 Posts: 2751 Credit: 4,271,025 RAC: 0 |
Again, I'm in it for the science, not the credits. So, if the info I am able to provide is helpful, great. Hope it helps for this round (or the next) of the competition (good luck!)... |
Paul Send message Joined: 29 Oct 05 Posts: 193 Credit: 66,598,787 RAC: 9,712 |
I continue to get computation errors running Rosetta 5.80. I had very few of these errors over the last few months, and recently I have received many of them. What can I do to correct this condition? thx PRaney Thx! Paul |
The_Bad_Penguin Send message Joined: 5 Jun 06 Posts: 2751 Credit: 4,271,025 RAC: 0 |
Seems to be a bunch of:
Unhandled Exception Detected...
- Unhandled Exception Record -
Reason: Out Of Memory (C++ Exception) (0xe06d7363) at address 0x7C812A5B
Engaging BOINC Windows Runtime Debugger...
I continue to get computation errors running Rosetta 5.80 |
The_Bad_Penguin Send message Joined: 5 Jun 06 Posts: 2751 Credit: 4,271,025 RAC: 0 |
Jim's post refers to this invalid result |
Michael B Send message Joined: 13 Feb 06 Posts: 19 Credit: 306,566 RAC: 0 |
One of my BOINC Managers won't let me attach to rosetta...keeps saying project is offline. |
Rayburner Send message Joined: 4 Oct 05 Posts: 32 Credit: 16,518,823 RAC: 0 |
I got 0 credits for this wu: too many results: https://boinc.bakerlab.org/rosetta/workunit.php?wuid=94605647 |
Paul Send message Joined: 29 Oct 05 Posts: 193 Credit: 66,598,787 RAC: 9,712 |
Just noticed each WU is consuming about 248 MB of RAM. With 2 GB of RAM, this was not a problem until the Q6600 went into the system. 4 WUs are consuming 1/2 of the system memory. What changed in 5.80 to cause the massive memory consumption and all of the computation errors? Can you do anything to pull in the memory requirements? Did the previous versions hold memory requirements at about 128 MB per WU? Thx! Paul |
Jmarks Send message Joined: 16 Jul 07 Posts: 132 Credit: 98,025 RAC: 0 |
Just noticed each WU is consuming about 248MB of RAM. With 2 GB of RAM, this was not a problem until the Q6600 went into the system. 4 WUs are consuming 1/2 of the system memory. Go into Your Account > Edit General preferences > Disk and memory usage, find "Use at most 50% of memory when computer is in use", and lower this to what you want. P.S. This post is not about 5.80; you should start a separate thread in 'Number Crunching'. Jmarks |
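As a quick sanity check on the figures above, here is a small Python sketch of the arithmetic: how many 248 MB work units fit under a given "use at most X% of memory" setting. This is plain arithmetic only. The function name `tasks_that_fit` is made up, and the real BOINC client's scheduling is more involved than this.

```python
def tasks_that_fit(total_mb, limit_fraction, per_task_mb):
    """Number of tasks whose combined memory stays within the allowed share of RAM."""
    allowed_mb = int(total_mb * limit_fraction)
    return allowed_mb // per_task_mb


total_mb = 2048      # 2 GB of RAM, as in Paul's post
per_task_mb = 248    # observed memory use per work unit

share_used = 4 * per_task_mb / total_mb
print(round(share_used * 100, 1))                   # percent of RAM used by 4 WUs
print(tasks_that_fit(total_mb, 0.50, per_task_mb))  # tasks allowed at a 50% limit
print(tasks_that_fit(total_mb, 0.25, per_task_mb))  # tasks allowed at a 25% limit
```

With these numbers, four work units use a little under half of the 2 GB, so a 50% limit still lets all four cores run; lowering the limit to 25% would leave room for only two.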
©2024 University of Washington
https://www.bakerlab.org