Problems with Rosetta version 5.51

Message boards : Number crunching : Problems with Rosetta version 5.51

Rhiju
Volunteer moderator
Joined: 8 Jan 06
Posts: 223
Credit: 3,546
RAC: 0
Message 37725 - Posted: 12 Mar 2007, 20:10:32 UTC

Thanks for keeping us posted -- let us know especially how the new graphics are going on your computers! As usual, you can expect occasional download errors on the first day (Monday, March 12) due to high network traffic, but hopefully everything will be running smoothly by Tuesday.


ID: 37725 · Rating: 0
Feet1st
Joined: 30 Dec 05
Posts: 1755
Credit: 4,690,520
RAC: 0
Message 37728 - Posted: 12 Mar 2007, 21:45:04 UTC

RNA task, Validate error:
https://boinc.bakerlab.org/rosetta/result.php?resultid=67116185

Viewing the graphic of that one, it seemed to show "initializing", with the strobing periods, long after it appeared to be done initializing.
Add this signature to your EMail:
Running Microsoft's "System Idle Process" will never help cure cancer, AIDS nor Alzheimer's. But running Rosetta@home just might!
https://boinc.bakerlab.org/rosetta/
ID: 37728 · Rating: 0
Rhiju
Volunteer moderator
Joined: 8 Jan 06
Posts: 223
Credit: 3,546
RAC: 0
Message 37741 - Posted: 13 Mar 2007, 1:40:08 UTC - in response to Message 37728.  

Hmm, I'll look into it.



ID: 37741 · Rating: 0
Feet1st
Joined: 30 Dec 05
Posts: 1755
Credit: 4,690,520
RAC: 0
Message 37742 - Posted: 13 Mar 2007, 3:14:17 UTC
Last modified: 13 Mar 2007, 3:18:01 UTC

The NEXT RNA task I started also showed "Stage Initializing..." throughout the run. And the protein was MUCH larger. That one that failed looked like it only had about 7 amino acids.

I noticed it reports 30 nstructs, but zero decoys, which is the same as the one that someone reported on Ralph with a runtime of < 2 seconds. Mine only ran for about 10 minutes.

I see now my second RNA task completed:
https://boinc.bakerlab.org/rosetta/result.php?resultid=67118019
also 30 nstructs and zero decoys. This one took an hour (24hr preference) and was very complex.

Were those WUs preprogrammed to end after 30 nstructs rather than at the runtime preference?

While we're here... what's the difference between "nstructs" and "decoys" and "attempts" and "models"?? Why are all four terms used?


ID: 37742 · Rating: 0
Feet1st
Joined: 30 Dec 05
Posts: 1755
Credit: 4,690,520
RAC: 0
Message 37743 - Posted: 13 Mar 2007, 3:27:10 UTC

Following my first RNA task, another host has now crunched it ok... with exactly 30 nstructs.

...but that host is throwing numerous errors on other tasks, and it appears to have been stable previously.
https://boinc.bakerlab.org/rosetta/results.php?hostid=433817

Often with exit -107, and others with these messages every 10 minutes:
[03/12/07 13:27:15] TRACE [2964]: Retrieved the required window station

[03/12/07 13:27:15] TRACE [2964]: Retrieved the required desktop

for example: https://boinc.bakerlab.org/rosetta/result.php?resultid=67068374
ID: 37743 · Rating: 0
Mod.Sense
Volunteer moderator
Joined: 22 Aug 06
Posts: 4018
Credit: 0
RAC: 0
Message 37758 - Posted: 13 Mar 2007, 13:43:32 UTC
Last modified: 13 Mar 2007, 15:17:48 UTC

Another example of a host with a history of running well, which is now receiving many -107 errors on a variety of task types:
https://boinc.bakerlab.org/rosetta/results.php?hostid=269763

[edit] sorry, that host's errors are with v5.48 [/edit]
Rosetta Moderator: Mod.Sense
ID: 37758 · Rating: 0
Mod.Sense
Volunteer moderator
Joined: 22 Aug 06
Posts: 4018
Credit: 0
RAC: 0
Message 37761 - Posted: 13 Mar 2007, 14:29:12 UTC
Last modified: 13 Mar 2007, 14:29:38 UTC

Here were some 5.51 errors that I had been looking for when I posted before.

Unhandled Exception Detected...

- Unhandled Exception Record -
Reason: Access Violation (0xc0000005) at address 0x009AE6DE read attempt to address 0x00000002

https://boinc.bakerlab.org/rosetta/result.php?resultid=67170038

and an access violation:
https://boinc.bakerlab.org/rosetta/result.php?resultid=67171894

from this host
ID: 37761 · Rating: 0
Iconner
Joined: 30 Apr 06
Posts: 3
Credit: 148,980
RAC: 0
Message 37764 - Posted: 13 Mar 2007, 16:31:38 UTC
Last modified: 13 Mar 2007, 16:34:15 UTC

Two workunits didn't really work for me. They had 30 nstructs, but zero decoys:

https://boinc.bakerlab.org/rosetta/result.php?resultid=67231441
https://boinc.bakerlab.org/rosetta/result.php?resultid=67240566

One took about 30 minutes, the other an hour.

Third one running now. Seems to be running well. Graphics are working.
ID: 37764 · Rating: 0
k6
Joined: 18 Oct 06
Posts: 5
Credit: 1,545,536
RAC: 0
Message 37765 - Posted: 13 Mar 2007, 16:45:29 UTC

The graphics are working well; it's a nice step toward making Rosetta more attractive. One other thing: the percent bar in the BOINC manager advances only when one of the models is done. While a model is being computed, the percent bar doesn't change. This probably depends on the type of WU I'm currently computing. So Rosetta is running well, but the percent bar in BOINC stays at the same value for approx. 2.5 h. I'm using the latest BOINC manager.

(Sorry for my bad English.)
ID: 37765 · Rating: 0
P . P . L .
Joined: 20 Aug 06
Posts: 581
Credit: 4,865,274
RAC: 0
Message 37768 - Posted: 13 Mar 2007, 21:33:08 UTC

I've just returned this work unit; it's one with RNA in the name.

It finished early. Any ideas?

https://boinc.bakerlab.org/rosetta/workunit.php?wuid=59971795

cpu_run_time_pref: 36000
======================================================
DONE :: 1 starting structures built 30 (nstruct) times
This process generated 0 decoys from 0 attempts

Server state: Over · Outcome: Success · Client state: Done · CPU time (sec): 4,439.00 · Claimed credit: 8.85 · Granted credit: 13.20

ID: 37768 · Rating: 0
Thomas Leibold
Joined: 30 Jul 06
Posts: 55
Credit: 19,627,164
RAC: 0
Message 37784 - Posted: 14 Mar 2007, 6:19:46 UTC

All of the RNA workunits processed with Rosetta 5.51 show the same message:

DONE :: 1 starting structures built 30 (nstruct) times
This process generated 0 decoys from 0 attempts

Also, all of them run for much less time than the 8 hours I have in my preferences (the range is about 10 minutes to 2 hours). Most of them show a status (outcome/client state) of "Success Done", but there are also some with "Validate Error Done". Those with validation errors also fail the second time, when they are assigned to someone else.

All of those results are from Rosetta. I never got any 5.51 workunits from Ralph (RNA or otherwise).
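
For anyone who wants to check a batch of their own results for this pattern, here is a minimal Python sketch (not project code) that pulls the reported counts out of the two summary lines quoted above. The function name `parse_summary` and the regular expression are illustrative assumptions based only on the wording shown in this thread; other Rosetta versions may print the summary differently. It only reads the printed numbers and says nothing about whether the task actually produced useful output.

```python
import re

# Pattern for the two summary lines quoted in this thread (an assumption:
# other application versions may word them differently).
SUMMARY_RE = re.compile(
    r"DONE :: (\d+) starting structures built (\d+) \(nstruct\) times\s+"
    r"This process generated (\d+) decoys from (\d+) attempts"
)

def parse_summary(stderr_text):
    """Return the reported counts as a dict, or None if no summary is found."""
    match = SUMMARY_RE.search(stderr_text)
    if match is None:
        return None
    starts, nstruct, decoys, attempts = (int(g) for g in match.groups())
    return {
        "starting_structures": starts,
        "nstruct": nstruct,
        "decoys": decoys,
        "attempts": attempts,
    }

sample = """DONE :: 1 starting structures built 30 (nstruct) times
This process generated 0 decoys from 0 attempts"""
print(parse_summary(sample))
```

Pasting the stderr text of a result page into `sample` should be enough to try it.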

Team Helix
ID: 37784 · Rating: 0
Gen_X_Accord
Joined: 5 Jun 06
Posts: 154
Credit: 279,018
RAC: 0
Message 37785 - Posted: 14 Mar 2007, 7:32:54 UTC

I have had some work units end early and produce no decoys. I don't know what the problem is; I figure it is with the work unit itself.

And I like the new graphics, even if I don't use the screen saver.
ID: 37785 · Rating: 0
Viromancy
Joined: 23 Sep 06
Posts: 8
Credit: 125,713
RAC: 0
Message 37786 - Posted: 14 Mar 2007, 7:37:20 UTC
Last modified: 14 Mar 2007, 7:40:45 UTC

I've also had multiple validation errors on RNA WUs in 5.51:

resultid=67219319
resultid=67284982
resultid=67410504
resultid=67412952

A couple of other RNA WUs have run without errors though:

resultid=67253746
resultid=67397280


ID: 37786 · Rating: 0
Michael.L
Joined: 12 Nov 06
Posts: 67
Credit: 31,295
RAC: 0
Message 37789 - Posted: 14 Mar 2007, 10:24:26 UTC

Result ID 67245211 and Result ID 67357866: both RNA WUs. Both ran for a short time.
Both produced no decoys.
ID: 37789 · Rating: 0
k6
Joined: 18 Oct 06
Posts: 5
Credit: 1,545,536
RAC: 0
Message 37792 - Posted: 14 Mar 2007, 11:10:21 UTC

https://boinc.bakerlab.org/rosetta/result.php?resultid=67170941
https://boinc.bakerlab.org/rosetta/result.php?resultid=67150947

Both are RNA tasks, both ran for a short time, no decoys.

ID: 37792 · Rating: 0
B-Roy
Joined: 26 Sep 05
Posts: 26
Credit: 46,121
RAC: 38
Message 37793 - Posted: 14 Mar 2007, 11:38:08 UTC

After trying to open the graphics window, my first WU (Workunit 60192061) finished with:

<core_client_version>5.8.15</core_client_version>
<![CDATA[
<message>
- exit code 1073807364 (0x40010004)
</message>
<stderr_txt>
# random seed: 1432142
No heartbeat from core client for 31 sec - exiting

</stderr_txt>
]]>

The 2nd is running right now and the graphics worked.
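
As a side note on reading that exit code: the decimal and hexadecimal values in the `<message>` block are the same number, which a couple of lines of Python can confirm. This is only a conversion sketch; the variable names are made up for illustration. In Microsoft's published status-code tables the value 0x40010004 is, as far as I can tell, listed as DBG_TERMINATE_PROCESS, but treat that reading as an assumption rather than a diagnosis of this particular failure.

```python
exit_code = 1073807364  # decimal value from the <message> block above

# Format as an 8-digit hexadecimal number, the form Windows status codes
# are usually written in.
as_hex = f"0x{exit_code:08x}"
print(as_hex)

# Going the other way: parse the hexadecimal string back to an integer.
from_hex = int("0x40010004", 16)
print(from_hex == exit_code)
```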
ID: 37793 · Rating: 0
David Ball
Joined: 25 Nov 05
Posts: 25
Credit: 1,439,333
RAC: 0
Message 37802 - Posted: 14 Mar 2007, 16:37:12 UTC

https://boinc.bakerlab.org/rosetta/result.php?resultid=67366016

1esy__BOINC_RNA_ABINITIO-1esy_-_1609_5735_0

CPU time 2009.7416

# random seed: 1591680
# cpu_run_time_pref: 86400
======================================================
DONE :: 1 starting structures built 30 (nstruct) times
This process generated 0 decoys from 0 attempts
======================================================

This has been a very reliable cruncher that's set for 24 hour execution preference.

-- David
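
To put those two numbers side by side, here is a small arithmetic sketch in Python. It assumes, as the other posts in this thread do, that both the "CPU time" figure and `cpu_run_time_pref` are in seconds; the variable names are illustrative only.

```python
cpu_time_s = 2009.7416     # "CPU time" reported for the result above
run_time_pref_s = 86400    # "cpu_run_time_pref" from the stderr output

pref_hours = run_time_pref_s / 3600            # seconds to hours
cpu_minutes = cpu_time_s / 60                  # seconds to minutes
fraction_used = cpu_time_s / run_time_pref_s   # share of the preferred runtime

print(f"preference: {pref_hours:.0f} h")
print(f"CPU time:   {cpu_minutes:.1f} min")
print(f"fraction:   {fraction_used:.1%}")
```

So the task stopped after roughly half an hour of a 24 hour preference, which matches the short runtimes others report for these RNA work units.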
ID: 37802 · Rating: 0
AMD_is_logical
Joined: 20 Dec 05
Posts: 299
Credit: 31,460,681
RAC: 0
Message 37803 - Posted: 14 Mar 2007, 16:44:33 UTC

I've had about 50 of those WUs that end after a short time with 30 nstruct and zero decoys. Two of these had a "Validate error":

https://boinc.bakerlab.org/rosetta/result.php?resultid=67444108
https://boinc.bakerlab.org/rosetta/result.php?resultid=67174628

These were from different machines, and both gave a validate error when crunched by someone else.
ID: 37803 · Rating: 0
ThorLite
Joined: 27 Apr 06
Posts: 1
Credit: 7,110,850
RAC: 0
Message 37807 - Posted: 14 Mar 2007, 17:25:14 UTC

This is the most unstable version on my quad core. The system was rock solid before; after 5.51, crashes and blue screens...
ID: 37807 · Rating: 0
Rhiju
Volunteer moderator
Joined: 8 Jan 06
Posts: 223
Credit: 3,546
RAC: 0
Message 37811 - Posted: 14 Mar 2007, 19:11:36 UTC - in response to Message 37802.  

Hi all:

ThorLite, AMD_is_logical, and others: the "0 decoys" is misleading; many of the workunits that you report are producing tons of decoys and getting credit. A small fraction aren't getting credit, though, and I'm tracking those down. I think I know the overall fix, and am working on it over on ralph.
I do want to say that the results are streaming in beautifully, and the data is pretty awesome.

I'm not entirely sure about the quad core problem, thor -- if you attach your project to ralph part-time, I'll be doing an update soon. I think I know one possible issue with the graphics, and I'll have the fix on the next update. We obviously don't want to lose your machine!

Thanks to everybody for posting and crunching!




ID: 37811 · Rating: 0



©2024 University of Washington
https://www.bakerlab.org