Rosetta Mini with new score terms bug thread

Message boards : Number crunching : Rosetta Mini with new score terms bug thread

To post messages, you must log in.

1 · 2 · Next

AuthorMessage
Profile robertmiles

Send message
Joined: 16 Jun 08
Posts: 1224
Credit: 13,844,503
RAC: 1,768
Message 56567 - Posted: 31 Oct 2008, 20:44:59 UTC

Seems like there's a new version of minirosetta, not with a version number in the regular series:

Rosetta Mini with new score terms

Nobody else has created a bug thread for it, so I decided to start one.

I've had three workunits of this type so far; all three reached a compute error in less than 1 second, both for me and for the other person running them.

https://boinc.bakerlab.org/rosetta/workunit.php?wuid=185620050

https://boinc.bakerlab.org/rosetta/workunit.php?wuid=185900046

https://boinc.bakerlab.org/rosetta/workunit.php?wuid=185953422
ID: 56567 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Profile (_KoDAk_)

Send message
Joined: 18 Jul 06
Posts: 109
Credit: 1,859,263
RAC: 0
Message 56570 - Posted: 31 Oct 2008, 21:22:36 UTC

1st WU https://boinc.bakerlab.org/rosetta/result.php?resultid=203330759
is OK but long only 167 min
ID: 56570 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Odd Braathun

Send message
Joined: 2 Sep 08
Posts: 9
Credit: 16,125
RAC: 0
Message 56573 - Posted: 31 Oct 2008, 21:32:59 UTC

Personally I have had 17 of these WU's that errored out just after a few secs.
I reported the first 3 under the thread v.102. Then I gave up. I am not getting
any new tasks.
ID: 56573 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Yin Gang

Send message
Joined: 17 Sep 05
Posts: 13
Credit: 63,992
RAC: 0
Message 56580 - Posted: 1 Nov 2008, 1:22:23 UTC

Just finished one WU of this kind:

https://boinc.bakerlab.org/rosetta/result.php?resultid=203821501


<stderr_txt>

ERROR: in::file::zip minirosetta_database.zip does not exist!
ERROR:: Exit from: ....srcappspublicboincminirosetta.cc line: 74
called boinc_finish

</stderr_txt>



Welcome To Team China!
ID: 56580 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Aegis Maelstrom

Send message
Joined: 29 Oct 08
Posts: 61
Credit: 2,137,555
RAC: 0
Message 56590 - Posted: 1 Nov 2008, 7:30:33 UTC
Last modified: 1 Nov 2008, 7:48:30 UTC

I've already highlighted the problem in the Minirosetta v1.39 bug thread - like several other people. See my previous post for details.

The problem seems to be with - at least - Win XP - and the latest software - both BOINC and Rosetta.

EDIT: as I thought it looks like both Intel and AMD problem - see NetwtonianRefractor's AMD machine stats.

This issue seems to be widespread, however some machines report they did the job. Thus, while writing "it just works" or "it just does not", could you provide further information about your OSes, machines etc.? =)

I guess it would be easier to debug then (although the crash description suggests it should be trivial - there is no minirosetta_database zip file, just minirosetta_database_rev25321 one.

I hope the maintainer fixes the software soon.

Best regards for all!
ID: 56590 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Profile robertmiles

Send message
Joined: 16 Jun 08
Posts: 1224
Credit: 13,844,503
RAC: 1,768
Message 56593 - Posted: 1 Nov 2008, 10:04:32 UTC - in response to Message 56567.  
Last modified: 1 Nov 2008, 10:16:46 UTC

I've had 9 workunits of Rosetta Mini with new score terms so far. All gave a compute error in less than 1 second of CPU time. For the ones already run by someone else, they got the same result. Could I have the option of not getting any more workunits of this type, until I see enough reports that the problem has been fixed?

Also, these workunits do not put their version number on the Workunit details page, so it's hard to report the version number after the workunit has been reported and another workunit has been requested. Could it be added?

I run these workunits under 32-bit Vista SP1, with an AMD CPU and BOINC 5.10.45. Some of the other BOINC projects I participate in aren't ready to switch to a later version of BOINC.
ID: 56593 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Profile Rabinovitch
Avatar

Send message
Joined: 28 Apr 07
Posts: 28
Credit: 5,439,728
RAC: 0
Message 56597 - Posted: 1 Nov 2008, 12:56:26 UTC

Computation error after more than 15 hours of WU processing.

11/01/08 14:53:04|rosetta@home|Restarting task 1tul__BOINC_CASP8_ABRELAX_SPLIT_SPLIT_IGNORE_THE_REST-S25-9-S3-3--1tul_-_4662_1454_1 using minirosetta_split_terms version 102
11/01/08 14:54:35|rosetta@home|Restarting task 1tul__BOINC_CASP8_ABRELAX_SPLIT_SPLIT_IGNORE_THE_REST-S25-9-S3-3--1tul_-_4662_1467_1 using minirosetta_split_terms version 102
11/01/08 15:11:42|rosetta@home|Computation for task 1tul__BOINC_CASP8_ABRELAX_SPLIT_SPLIT_IGNORE_THE_REST-S25-9-S3-3--1tul_-_4662_1454_1 finished
11/01/08 15:11:42|rosetta@home|Output file 1tul__BOINC_CASP8_ABRELAX_SPLIT_SPLIT_IGNORE_THE_REST-S25-9-S3-3--1tul_-_4662_1454_1_0 for task 1tul__BOINC_CASP8_ABRELAX_SPLIT_SPLIT_IGNORE_THE_REST-S25-9-S3-3--1tul_-_4662_1454_1 absent

Many-many symbols are here: https://boinc.bakerlab.org/rosetta/result.php?resultid=203594723
ID: 56597 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Profile Purple Rabbit
Avatar

Send message
Joined: 24 Sep 05
Posts: 28
Credit: 3,895,043
RAC: 1,719
Message 56601 - Posted: 1 Nov 2008, 16:06:12 UTC
Last modified: 1 Nov 2008, 16:14:04 UTC

I've had 100% failure for this app (25 of them so far). I've started aborting them as they appear. This is with BOINC 5.10.45 on both Linux and Windows (XP and Vista), but mainly Windows. I've got one task under Linux that's 4 hours into its 6 hour run. I'll wait to see how that goes.

There's something wrong here (as if I'm telling you something new)!
ID: 56601 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Profile robertmiles

Send message
Joined: 16 Jun 08
Posts: 1224
Credit: 13,844,503
RAC: 1,768
Message 56603 - Posted: 1 Nov 2008, 16:57:24 UTC - in response to Message 56597.  

Computation error after more than 15 hours of WU processing.

11/01/08 14:53:04|rosetta@home|Restarting task 1tul__BOINC_CASP8_ABRELAX_SPLIT_SPLIT_IGNORE_THE_REST-S25-9-S3-3--1tul_-_4662_1454_1 using minirosetta_split_terms version 102


Yours are doing better than mine in some ways - so far, none of the Rosetta Mini with new score terms workunits has used even a whole CPU second on my machine before encountering an error.

I got to look at results for some of the other users for the same workunits I got of this kind by following links under the Workunit Details windows.

All of then had the same missing minirosetta_database zip file error message practically as their first error message. Some used XP and some used Vista. Some used an Intel CPU and some used an AMD CPU; all of these CPUs had more than one core. The users preferred an assortment of languages. There were an assortment of BOINC versions, from 5.8.* to 6.2.*. I'm not sure how much of what I didn't see was due to the way my machine was selected as eligible for the same workunits as other machines.
ID: 56603 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Profile rochester new york
Avatar

Send message
Joined: 2 Jul 06
Posts: 2842
Credit: 2,020,043
RAC: 0
Message 56604 - Posted: 1 Nov 2008, 17:16:46 UTC

ID: 56604 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Profile (_KoDAk_)

Send message
Joined: 18 Jul 06
Posts: 109
Credit: 1,859,263
RAC: 0
Message 56608 - Posted: 1 Nov 2008, 21:20:25 UTC

OK
https://boinc.bakerlab.org/rosetta/workunit.php?wuid=186021731
https://boinc.bakerlab.org/rosetta/workunit.php?wuid=186021706
https://boinc.bakerlab.org/rosetta/workunit.php?wuid=185977687


ID: 56608 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Profile rochester new york
Avatar

Send message
Joined: 2 Jul 06
Posts: 2842
Credit: 2,020,043
RAC: 0
Message 56609 - Posted: 1 Nov 2008, 21:28:57 UTC - in response to Message 56604.  
Last modified: 1 Nov 2008, 21:29:30 UTC

ID: 56609 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Profile Rabinovitch
Avatar

Send message
Joined: 28 Apr 07
Posts: 28
Credit: 5,439,728
RAC: 0
Message 56610 - Posted: 1 Nov 2008, 22:03:07 UTC

Once again:

11/01/08 23:06:48|rosetta@home|Computation for task 1tul__BOINC_CASP8_ABRELAX_SPLIT_SPLIT_IGNORE_THE_REST-S25-9-S3-3--1tul_-_4662_1467_1 finished
11/01/08 23:06:48|rosetta@home|Output file 1tul__BOINC_CASP8_ABRELAX_SPLIT_SPLIT_IGNORE_THE_REST-S25-9-S3-3--1tul_-_4662_1467_1_0 for task 1tul__BOINC_CASP8_ABRELAX_SPLIT_SPLIT_IGNORE_THE_REST-S25-9-S3-3--1tul_-_4662_1467_1 absent

Now after 19 hours of working!

https://boinc.bakerlab.org/rosetta/result.php?resultid=203601208
ID: 56610 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
R.L. Casey

Send message
Joined: 7 Jun 06
Posts: 91
Credit: 2,728,885
RAC: 0
Message 56613 - Posted: 1 Nov 2008, 23:18:17 UTC - in response to Message 56610.  

Once again:

11/01/08 23:06:48|rosetta@home|Computation for task 1tul__BOINC_CASP8_ABRELAX_SPLIT_SPLIT_IGNORE_THE_REST-S25-9-S3-3--1tul_-_4662_1467_1 finished
11/01/08 23:06:48|rosetta@home|Output file 1tul__BOINC_CASP8_ABRELAX_SPLIT_SPLIT_IGNORE_THE_REST-S25-9-S3-3--1tul_-_4662_1467_1_0 for task 1tul__BOINC_CASP8_ABRELAX_SPLIT_SPLIT_IGNORE_THE_REST-S25-9-S3-3--1tul_-_4662_1467_1 absent

Now after 19 hours of working!

https://boinc.bakerlab.org/rosetta/result.php?resultid=203601208

Rabinovitch,
Looking at the result you iedntified, I noticed that the BOINC version you are using is Version 6.3.19. According to the BOINC site, Version 6.3.19 is *not* a production version, and is to be used "only for testing." It would be a good idea to use a production version such as Version 6.2.19. Then, an error, if it appears, will be viewed as more legitimate.
Thanks for crunching Rosetta!
ID: 56613 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Profile Purple Rabbit
Avatar

Send message
Joined: 24 Sep 05
Posts: 28
Credit: 3,895,043
RAC: 1,719
Message 56615 - Posted: 2 Nov 2008, 0:48:35 UTC
Last modified: 2 Nov 2008, 1:42:07 UTC

OK, I'm one for 26. The Linux one amazingly completed correctly on Tomato (host 282106).

This looks like a Windows problem. My other Linux failures may be due to a memory defect...sigh. Onion (host 333136) has been having memory problems so I can't blame Rosetta. The Windows machines are dropping like flies on this app.
ID: 56615 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Path7

Send message
Joined: 25 Aug 07
Posts: 128
Credit: 61,751
RAC: 0
Message 56617 - Posted: 2 Nov 2008, 1:48:02 UTC

Hello all,

Running Windows XP home SP3 & Windows Vista home SP1 both Boinc 5.10.45 together 7 times an error running the Mini w n s t 1.02 after less than a second runtime:

stderr out <core_client_version>5.10.45</core_client_version>
<![CDATA[
<message>
Onjuiste functie. (0x1) - exit code 1 (0x1)
</message>
<stderr_txt>

ERROR: in::file::zip minirosetta_database.zip does not exist!
ERROR:: Exit from: ....srcappspublicboincminirosetta.cc line: 74
called boinc_finish

</stderr_txt>
]]>

I hope the techs will find a solution to those errors.

Have a nice day,
Path7.

ID: 56617 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
googloo
Avatar

Send message
Joined: 15 Sep 06
Posts: 133
Credit: 21,684,939
RAC: 5,253
Message 56618 - Posted: 2 Nov 2008, 2:46:31 UTC

I have set Rosetta to "won't get new tasks" until this version is adjusted to work on XP. I am running BOINC 6.2.19. I haven't counted the number of tasks that failed within a minute, but it is more than twenty.
ID: 56618 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Profile Rabinovitch
Avatar

Send message
Joined: 28 Apr 07
Posts: 28
Credit: 5,439,728
RAC: 0
Message 56619 - Posted: 2 Nov 2008, 4:26:07 UTC - in response to Message 56613.  


Rabinovitch,
Looking at the result you iedntified, I noticed that the BOINC version you are using is Version 6.3.19. According to the BOINC site, Version 6.3.19 is *not* a production version, and is to be used "only for testing." It would be a good idea to use a production version such as Version 6.2.19. Then, an error, if it appears, will be viewed as more legitimate.
Thanks for crunching Rosetta!


Well, I suppose that most of us are not using a develompenst versions of BOINS Manager. But they meet a bugs anyway...

By the way, I'm using it only for crunching PS3GRID WUs.
ID: 56619 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Profile Rabinovitch
Avatar

Send message
Joined: 28 Apr 07
Posts: 28
Credit: 5,439,728
RAC: 0
Message 56624 - Posted: 2 Nov 2008, 10:23:24 UTC

Once again:

02.11.2008 15:42:03|rosetta@home|Starting 2chf__BOINC_CASP8_ABRELAX_SPLIT_SPLIT_IGNORE_THE_REST-S25-9-S3-3--2chf_-_4662_4628_1
02.11.2008 15:42:11|rosetta@home|Starting task 2chf__BOINC_CASP8_ABRELAX_SPLIT_SPLIT_IGNORE_THE_REST-S25-9-S3-3--2chf_-_4662_4628_1 using minirosetta_split_terms version 102
02.11.2008 15:46:21|rosetta@home|Computation for task 2chf__BOINC_CASP8_ABRELAX_SPLIT_SPLIT_IGNORE_THE_REST-S25-9-S3-3--2chf_-_4662_4628_1 finished
02.11.2008 15:46:21|rosetta@home|Output file 2chf__BOINC_CASP8_ABRELAX_SPLIT_SPLIT_IGNORE_THE_REST-S25-9-S3-3--2chf_-_4662_4628_1_0 for task 2chf__BOINC_CASP8_ABRELAX_SPLIT_SPLIT_IGNORE_THE_REST-S25-9-S3-3--2chf_-_4662_4628_1 absent

BOINC manager version:

02.11.2008 10:16:24||Starting BOINC client version 6.3.19 for windows_x86_64
ID: 56624 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Profile (_KoDAk_)

Send message
Joined: 18 Jul 06
Posts: 109
Credit: 1,859,263
RAC: 0
Message 56630 - Posted: 2 Nov 2008, 13:45:14 UTC

OK
https://boinc.bakerlab.org/rosetta/workunit.php?wuid=186104285

ID: 56630 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
1 · 2 · Next

Message boards : Number crunching : Rosetta Mini with new score terms bug thread



©2024 University of Washington
https://www.bakerlab.org