Problems with Rosetta version 5.81

Message boards : Number crunching : Problems with Rosetta version 5.81

bentjeans
Joined: 30 Oct 07
Posts: 1
Credit: 5,313
RAC: 0
Message 48539 - Posted: 10 Nov 2007, 23:01:36 UTC
Last modified: 10 Nov 2007, 23:02:47 UTC

My host seems to have run the 2257's the past day or so without incident.
https://boinc.bakerlab.org/rosetta/show_host_detail.php?hostid=656908
ID: 48539 · Rating: 0
P . P . L .
Joined: 20 Aug 06
Posts: 581
Credit: 4,865,274
RAC: 0
Message 48544 - Posted: 11 Nov 2007, 0:46:07 UTC

This one ran well over my run time, even after I reduced it from 8 hrs to 6 hrs for faster turnaround. It had run past the 8 hr mark.

2fqm__BOINC_SYMM_FOLD_AND_DOCK_RELAX-2fqm_-crystal_foldanddock__2257_17546_0

https://boinc.bakerlab.org/rosetta/workunit.php?wuid=107955094

pete.
ID: 48544 · Rating: 0
Jmarks
Joined: 16 Jul 07
Posts: 132
Credit: 98,025
RAC: 0
Message 48552 - Posted: 11 Nov 2007, 13:15:28 UTC
Last modified: 11 Nov 2007, 13:17:21 UTC

Client error 2reb__TREEJUMP_ABRELAX_TOR_EQ_-1_PROB_.1_SAVE_ALL_OUT-2reb_-_BARCODE__2244_6092_1
119063108
Jmarks
ID: 48552 · Rating: 0
rochester new york
Joined: 2 Jul 06
Posts: 2839
Credit: 2,020,043
RAC: 0
Message 48562 - Posted: 11 Nov 2007, 16:36:52 UTC

Could someone look at why I have all these errors?
ID: 48562 · Rating: 0
Jmarks
Joined: 16 Jul 07
Posts: 132
Credit: 98,025
RAC: 0
Message 48563 - Posted: 11 Nov 2007, 17:57:22 UTC - in response to Message 48562.  

Could someone look at why I have all these errors?


Try restarting your PC.
Jmarks
ID: 48563 · Rating: 0
rochester new york
Joined: 2 Jul 06
Posts: 2839
Credit: 2,020,043
RAC: 0
Message 48574 - Posted: 11 Nov 2007, 22:39:01 UTC

OK, thanks, I'll try it.
ID: 48574 · Rating: 0
MattDavis
Joined: 22 Sep 05
Posts: 206
Credit: 1,377,748
RAC: 0
Message 48575 - Posted: 11 Nov 2007, 22:43:41 UTC
Last modified: 11 Nov 2007, 23:08:08 UTC

This is just a heads up that tons of MFR_SYMM_FOLD_AND_DOCK_RELAX units are erroring out AFTER crunching for a full runtime, which is annoying. It also errored out for people who got the unit after mine errored out.

Just a heads up!

edit: This isn't just one computer of mine but many different computers, not to mention that other people errored when they got the same unit. So it's not just a faulty computer on my end!
ID: 48575 · Rating: 0
Matthias Lehmkuhl
Joined: 20 Nov 05
Posts: 10
Credit: 1,694,422
RAC: 20
Message 48576 - Posted: 11 Nov 2007, 22:58:47 UTC

I used Show Graphics in BOINC Manager on application version 5.81 and closed the window after a short time.
This caused the result to fail with the following error message:
<core_client_version>5.10.28</core_client_version>
<![CDATA[
<message>
- exit code 1073807364 (0x40010004)
</message>
<stderr_txt>
# cpu_run_time_pref: 10800
# random seed: 3802685

</stderr_txt>
]]>

resultid=119553269
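
For anyone comparing this with other reports, the decimal exit code and the hex value in the message are the same number. The snippet below is only a small sketch that does the conversion. The `KNOWN_CODES` table is a hypothetical helper, and the name in it is my assumption: as far as I know 0x40010004 matches the Windows status `DBG_TERMINATE_PROCESS`, but the post does not say which OS the host runs.

```python
# Decimal exit code as reported in the <message> block above.
exit_code = 1073807364

# Render it as an 8-digit hexadecimal value for comparison with the log.
hex_code = f"0x{exit_code:08X}"

# Assumption: hypothetical lookup table; the name comes from the Windows
# NTSTATUS list and only applies if the host is a Windows machine.
KNOWN_CODES = {0x40010004: "DBG_TERMINATE_PROCESS"}

print(hex_code, KNOWN_CODES.get(exit_code, "unknown"))
```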

Matthias
ID: 48576 · Rating: 0
mhhall
Joined: 28 Mar 06
Posts: 7
Credit: 10,181,677
RAC: 0
Message 48578 - Posted: 12 Nov 2007, 0:45:20 UTC

1) I note that the "explain" item does not document what a "Compute Error" is....

2) Work unit 106606679
1n0u__TREEJUMP_ABRELAX_NOTOR-1n0u_-_BARCODE__2241_1083
Appears to have failed on two different machines.

Seems like 2nd time that this has happened to me recently... other job was WU 106970621:
2reb__TREEJUMP_ABRELAX_TOR_EQ_-5_PROB_.5_SAVE_ALL_OUT-2reb_-_BARCODE__2243_7638_0

This looks like a programming issue on both counts, in the same routine (ERROR:: Exit from: .pose.cc line: 769).
Or there is an issue with the software or processes running on my machine.
ID: 48578 · Rating: 0
Trey
Joined: 3 Oct 06
Posts: 11
Credit: 110,142
RAC: 0
Message 48581 - Posted: 12 Nov 2007, 4:36:07 UTC

I got a Client/Compute error for result 119600589. The workunit it is for, MFR_SYMM_FOLD_AND_DOCK_RELAX_GB1_mutant_2286_166, apparently failed previously for someone else, but with a Validate error instead.
ID: 48581 · Rating: 0
Marky-UK
Joined: 1 Nov 05
Posts: 73
Credit: 1,689,495
RAC: 0
Message 48583 - Posted: 12 Nov 2007, 7:14:02 UTC - in response to Message 48575.  
Last modified: 12 Nov 2007, 7:29:07 UTC

This is just a heads up that tons of MFR_SYMM_FOLD_AND_DOCK_RELAX units are erroring out AFTER crunching for a full runtime, which is annoying. It also errored out for people who got the unit after mine errored out.

Just a heads up!

edit: This isn't just one computer of mine but many different computers, not to mention that other people errored when they got the same unit. So it's not just a faulty computer on my end!

I'd agree with that - I haven't had a single MFR_SYMM_FOLD_AND_DOCK_RELAX WU that's worked; they all fail with a -161 error after completion.
ID: 48583 · Rating: 0
Stevea
Joined: 19 Dec 05
Posts: 50
Credit: 738,655
RAC: 0
Message 48584 - Posted: 12 Nov 2007, 7:42:32 UTC
Last modified: 12 Nov 2007, 7:50:58 UTC

Here is one that errored out after 2 runs, neither received credit:

MFR_SYMM_FOLD_AND_DOCK_RELAX_GB1_mutant_2286_18566

https://boinc.bakerlab.org/rosetta/workunit.php?wuid=108650563

And for what it's worth, this is still one of the lowest granting projects, for time computed....

Might be time for a credit adjustment, I only crunch this now when other projects are down. Crunching this project for 2-5 times less credit than others is just unacceptable for a credit counter like me, trying to keep up with the other members of my team. Sorry but I cannot afford to buy 3-5 quads to crunch whatever I want, I have to pick and choose now.

And to all you cross project parity people that IMO screwed this project up....

Have you figured out this is impossible........ just MO

Will not be responding to the naysayers..JMO
BETA = Bahhh

Way too many errors, killing both the credit & RAC.

And I still think the (New and Improved) credit system is not ready for prime time...
ID: 48584 · Rating: 0
Geoff Roynon
Joined: 4 Nov 05
Posts: 5
Credit: 960,686
RAC: 0
Message 48586 - Posted: 12 Nov 2007, 9:27:20 UTC

I'm still having intermittent problems with some work-units never completing. The latest is WU "2p64_BOINC_SYMM_FOLD_AND_DOCK_RELAX_2p64_-crystal_foldanddock_2257_389...".
It has been sitting on CPU Time: 03:05:31 Progress: 94.885% for the last hour.
I will abort it now.

Geoff
PPC Mac G5 dual 1.8GHz, 3GB Ram, Mac OSX 10.5
BOINC Manager 5.10.28
Rosetta Beta 5.81
ID: 48586 · Rating: 0
Trey
Joined: 3 Oct 06
Posts: 11
Credit: 110,142
RAC: 0
Message 48587 - Posted: 12 Nov 2007, 12:16:25 UTC
Last modified: 12 Nov 2007, 12:51:47 UTC

Yep, I've had several MFR_SYMM_FOLD_AND_DOCK_RELAX errors. For example:


While this may be helpful to other project participants, I'm honestly not sure why we need to list these here for the project... I would think the project could simply query the database for this sort of information. If not, let me know and I'll try to help with that.

cheers,
/trey

ID: 48587 · Rating: 0
transient
Joined: 30 Sep 06
Posts: 376
Credit: 10,836,395
RAC: 0
Message 48591 - Posted: 12 Nov 2007, 17:22:42 UTC - in response to Message 48587.  

Yep, I've had several MFR_SYMM_FOLD_AND_DOCK_RELAX errors. For example:


While this may be helpful to other project participants, I'm honestly not sure why we need to list these here for the project... I would think the project could simply query the database for this sort of information. If not, let me know and I'll try to help with that.

cheers,
/trey



I've got one like that. Everything seems all right with it except it ended with a -161 error code, which is a file transfer error apparently.

https://boinc.bakerlab.org/rosetta/result.php?resultid=119510837

stderr out

<core_client_version>5.10.28</core_client_version>
<![CDATA[
<stderr_txt>
# cpu_run_time_pref: 21600
# random seed: 3520433
# cpu_run_time_pref: 21600
======================================================
DONE :: 1 starting structures 21396.9 cpu seconds
This process generated 44 decoys from 44 attempts
======================================================


BOINC :: Watchdog shutting down...
BOINC :: BOINC support services shutting down...

</stderr_txt>
<message>
<file_xfer_error>
<file_name>MFR_SYMM_FOLD_AND_DOCK_RELAX_GB1_mutant_2286_6858_0_0</file_name>
<error_code>-161</error_code>
</file_xfer_error>

</message>
]]>

Validate state Invalid
Claimed credit 116.539106403729
Granted credit 0
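
If anyone wants to check their own results the same way, here is a rough sketch that pulls the relevant fields out of a stderr dump like the one above. The function name `summarize_stderr` and the returned keys are made up for illustration, and it assumes all four fields are present in the text. It only reads the log; it does not explain what -161 means.

```python
import re

# Trimmed copy of the stderr text posted above.
STDERR = """\
# cpu_run_time_pref: 21600
# random seed: 3520433
DONE :: 1 starting structures 21396.9 cpu seconds
This process generated 44 decoys from 44 attempts
<file_xfer_error>
<file_name>MFR_SYMM_FOLD_AND_DOCK_RELAX_GB1_mutant_2286_6858_0_0</file_name>
<error_code>-161</error_code>
</file_xfer_error>
"""


def summarize_stderr(text):
    """Hypothetical helper: extract run time, decoy count and transfer error."""
    pref = int(re.search(r"cpu_run_time_pref:\s*(\d+)", text).group(1))
    cpu_seconds = float(re.search(r"([\d.]+) cpu seconds", text).group(1))
    decoys = int(re.search(r"generated (\d+) decoys", text).group(1))
    xfer = re.search(
        r"<file_xfer_error>.*?<file_name>(.*?)</file_name>"
        r".*?<error_code>(-?\d+)</error_code>",
        text,
        re.S,  # let '.' span the line breaks inside the XML block
    )
    return {
        "run_time_pref": pref,
        "cpu_seconds": cpu_seconds,
        "fraction_of_target": cpu_seconds / pref,
        "decoys": decoys,
        "file_name": xfer.group(1),
        "error_code": int(xfer.group(2)),
    }


info = summarize_stderr(STDERR)
print(info)
```

For this result the CPU time is close to the full run time preference, which fits the reports that these units fail only after the crunching is done.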


ID: 48591 · Rating: 0
Stevea
Joined: 19 Dec 05
Posts: 50
Credit: 738,655
RAC: 0
Message 48599 - Posted: 13 Nov 2007, 1:01:58 UTC

Detaching now: way too many errors for 0 credit. Get your act together.

Every rig now has errors...

https://boinc.bakerlab.org/rosetta/workunit.php?wuid=108626751

https://boinc.bakerlab.org/rosetta/workunit.php?wuid=108637515

https://boinc.bakerlab.org/rosetta/workunit.php?wuid=108635707

https://boinc.bakerlab.org/rosetta/workunit.php?wuid=108622177

https://boinc.bakerlab.org/rosetta/workunit.php?wuid=108641769

https://boinc.bakerlab.org/rosetta/workunit.php?wuid=108632329

C'mon....

ID: 48599 · Rating: 0
David Emigh
Joined: 13 Mar 06
Posts: 158
Credit: 417,178
RAC: 0
Message 48603 - Posted: 13 Nov 2007, 4:48:17 UTC - in response to Message 48575.  
Last modified: 13 Nov 2007, 4:49:52 UTC

This is just a heads up that tons of MFR_SYMM_FOLD_AND_DOCK_RELAX units are erroring out AFTER crunching for a full runtime {...}


Add me to the list of those bit by the MFR_SYMM_FOLD etc. bug.

WU 108617269

Rosie, Rosie, she's our gal,
If she can't do it, no one shall!
ID: 48603 · Rating: 0
rochester new york
Joined: 2 Jul 06
Posts: 2839
Credit: 2,020,043
RAC: 0
Message 48614 - Posted: 13 Nov 2007, 16:14:57 UTC

I'm getting bummed out with all these errors :(
ID: 48614 · Rating: 0
BarryAZ
Joined: 27 Dec 05
Posts: 153
Credit: 30,837,810
RAC: 0
Message 48615 - Posted: 13 Nov 2007, 16:23:26 UTC - in response to Message 48603.  

I looked for this thread after seeing a batch of computation errors on different workstations -- useful information (albeit all user to user and no Rosetta admin input here). Just went around and aborted all my MFR_SYMM_FOLD workunits (I found about 10 to 15 of them locally). Of course the downside is until there is admin awareness of the problems with this set of work units, they will simply get recycled. If someone can get the attention of the admin folks at Rosetta, perhaps they can purge the outgoing database so that these bad boys don't simply recycle in the user universe.


This is just a heads up that tons of MFR_SYMM_FOLD_AND_DOCK_RELAX units are erroring out AFTER crunching for a full runtime {...}


Add me to the list of those bit by the MFR_SYMM_FOLD etc. bug.

WU 108617269


ID: 48615 · Rating: 0
Jmarks
Joined: 16 Jul 07
Posts: 132
Credit: 98,025
RAC: 0
Message 48616 - Posted: 13 Nov 2007, 18:04:59 UTC

Validation error.
MFR_SYMM_FOLD_AND_DOCK_RELAX_GB1_mutant_2286_7049_1119565586
Jmarks
ID: 48616 · Rating: 0
©2022 University of Washington
https://www.bakerlab.org