Problems with Rosetta version 5.82

Message boards : Number crunching : Problems with Rosetta version 5.82

To post messages, you must log in.

1 · 2 · 3 · 4 · Next

AuthorMessage
Mod.Sense
Volunteer moderator

Send message
Joined: 22 Aug 06
Posts: 4018
Credit: 0
RAC: 0
Message 26199 - Posted: 6 Sep 2006, 19:30:37 UTC
Last modified: 4 Dec 2007, 16:40:45 UTC

Please post any problem reports with Rosetta version 5.82 here. I'll be moving prior posts that clearly identify v5.82 to this thread later on today.
Rosetta Moderator: Mod.Sense
ID: 26199 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Profile Greg_BE
Avatar

Send message
Joined: 30 May 06
Posts: 4876
Credit: 4,562,816
RAC: 3,333
Message 49263 - Posted: 1 Dec 2007, 13:19:52 UTC

since there is no 5.82 thread i will post here:

from boinc mangager

12/1/2007 1:01:06 AM|rosetta@home|Task CNTRL_01ABRELAX_SAVE_ALL_OUT_-1ubi_-_filters_1782_2342426_0 exited with zero status but no 'finished' file
12/1/2007 1:01:06 AM|rosetta@home|If this happens repeatedly you may need to reset the project.
12/1/2007 1:01:11 AM|rosetta@home|Restarting task CNTRL_01ABRELAX_SAVE_ALL_OUT_-1ubi_-_filters_1782_2342426_0 using rosetta version 582
12/1/2007 4:41:44 AM|rosetta@home|Computation for task CNTRL_01ABRELAX_SAVE_ALL_OUT_-1ubi_-_filters_1782_2342426_0 finished


what does this mean?
ID: 49263 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Luuklag

Send message
Joined: 13 Sep 07
Posts: 262
Credit: 4,171
RAC: 0
Message 49289 - Posted: 1 Dec 2007, 20:09:21 UTC - in response to Message 49263.  

since there is no 5.82 thread i will post here:

from boinc mangager

12/1/2007 1:01:06 AM|rosetta@home|Task CNTRL_01ABRELAX_SAVE_ALL_OUT_-1ubi_-_filters_1782_2342426_0 exited with zero status but no 'finished' file
12/1/2007 1:01:06 AM|rosetta@home|If this happens repeatedly you may need to reset the project.
12/1/2007 1:01:11 AM|rosetta@home|Restarting task CNTRL_01ABRELAX_SAVE_ALL_OUT_-1ubi_-_filters_1782_2342426_0 using rosetta version 582
12/1/2007 4:41:44 AM|rosetta@home|Computation for task CNTRL_01ABRELAX_SAVE_ALL_OUT_-1ubi_-_filters_1782_2342426_0 finished


what does this mean?


well it kinda failed, but then the project decided to make it again, and then it succeeded
ID: 49289 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Profile Greg_BE
Avatar

Send message
Joined: 30 May 06
Posts: 4876
Credit: 4,562,816
RAC: 3,333
Message 49294 - Posted: 1 Dec 2007, 20:50:40 UTC - in response to Message 49289.  

since there is no 5.82 thread i will post here:

from boinc mangager

12/1/2007 1:01:06 AM|rosetta@home|Task CNTRL_01ABRELAX_SAVE_ALL_OUT_-1ubi_-_filters_1782_2342426_0 exited with zero status but no 'finished' file
12/1/2007 1:01:06 AM|rosetta@home|If this happens repeatedly you may need to reset the project.
12/1/2007 1:01:11 AM|rosetta@home|Restarting task CNTRL_01ABRELAX_SAVE_ALL_OUT_-1ubi_-_filters_1782_2342426_0 using rosetta version 582
12/1/2007 4:41:44 AM|rosetta@home|Computation for task CNTRL_01ABRELAX_SAVE_ALL_OUT_-1ubi_-_filters_1782_2342426_0 finished


what does this mean?


well it kinda failed, but then the project decided to make it again, and then it succeeded


oh my blind eyes sometimes, i should have saw the restart, but the exit 0 and no finished file? thats got me puzzled
ID: 49294 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Holmis

Send message
Joined: 15 Nov 07
Posts: 6
Credit: 975,490
RAC: 0
Message 49303 - Posted: 2 Dec 2007, 0:04:26 UTC - in response to Message 49294.  

oh my blind eyes sometimes, i should have saw the restart, but the exit 0 and no finished file? thats got me puzzled


It happens to me sometimes, it's usually when I "overload" my computer. To many things to do at once. It's quite harmless and you probably do not need to reset as suggested.

This is one of those things that has been occurring with Bonic for a long time. Don't think anyone really knows why...

/Holmis
ID: 49303 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
ramostol

Send message
Joined: 6 Feb 07
Posts: 64
Credit: 584,052
RAC: 0
Message 49317 - Posted: 2 Dec 2007, 11:49:36 UTC

This is still (I believe) a general issue, but since the wu in question is a 5.82 file:

From local message file:

02-Dec-2007 12:14:08 [rosetta@home] Sending scheduler request: To fetch work. Requesting 63171 seconds of work, reporting 5 completed tasks
02-Dec-2007 12:14:13 [rosetta@home] Scheduler request succeeded: got 3 new tasks
02-Dec-2007 12:14:15 [rosetta@home] Started download of vf_1rnbA.fasta.gz
02-Dec-2007 12:14:15 [rosetta@home] Started download of vf_1rnbA.psipred_ss2.gz
02-Dec-2007 12:15:37 [rosetta@home] Task CNTRL_01ABRELAX_SAVE_ALL_OUT_-1ubi_-_filters_1782_2739310_0 exited with zero status but no 'finished' file
02-Dec-2007 12:15:37 [rosetta@home] If this happens repeatedly you may need to reset the project.
02-Dec-2007 12:15:37 [---] Project communication failed: attempting access to reference site
02-Dec-2007 12:15:37 [rosetta@home] Temporarily failed download of vf_1rnbA.fasta.gz: can't resolve hostname
02-Dec-2007 12:15:37 [rosetta@home] Temporarily failed download of vf_1rnbA.psipred_ss2.gz: can't resolve hostname
02-Dec-2007 12:15:37 [rosetta@home] Started download of boinc_vf_aa1rnbA03_05.200_v1_3.gz
02-Dec-2007 12:15:38 [rosetta@home] Started download of boinc_vf_aa1rnbA09_05.200_v1_3.gz
02-Dec-2007 12:15:38 [rosetta@home] Restarting task CNTRL_01ABRELAX_SAVE_ALL_OUT_-1ubi_-_filters_1782_2739310_0 using rosetta version 582
02-Dec-2007 12:15:39 [---] Access to reference site succeeded - project servers may be temporarily down.
02-Dec-2007 12:15:45 [rosetta@home] Finished download of boinc_vf_aa1rnbA03_05.200_v1_3.gz
02-Dec-2007 12:15:45 [rosetta@home] Started download of boinc_vf_aa1rnbA15_05.200_v1_3.gz
[etc etc]

The computing of Rosetta tasks is now as before sensitive to unstable internet connections. Whether this is a Boinc of Rosetta problem I don't know, but I feel that such consequences of internet failures are unneccessary and should be addressed.
ID: 49317 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Profile Greg_BE
Avatar

Send message
Joined: 30 May 06
Posts: 4876
Credit: 4,562,816
RAC: 3,333
Message 49321 - Posted: 2 Dec 2007, 14:25:01 UTC - in response to Message 49317.  

This is still (I believe) a general issue, but since the wu in question is a 5.82 file:

From local message file:

02-Dec-2007 12:14:08 [rosetta@home] Sending scheduler request: To fetch work. Requesting 63171 seconds of work, reporting 5 completed tasks
02-Dec-2007 12:14:13 [rosetta@home] Scheduler request succeeded: got 3 new tasks
02-Dec-2007 12:14:15 [rosetta@home] Started download of vf_1rnbA.fasta.gz
02-Dec-2007 12:14:15 [rosetta@home] Started download of vf_1rnbA.psipred_ss2.gz
02-Dec-2007 12:15:37 [rosetta@home] Task CNTRL_01ABRELAX_SAVE_ALL_OUT_-1ubi_-_filters_1782_2739310_0 exited with zero status but no 'finished' file
02-Dec-2007 12:15:37 [rosetta@home] If this happens repeatedly you may need to reset the project.
02-Dec-2007 12:15:37 [---] Project communication failed: attempting access to reference site
02-Dec-2007 12:15:37 [rosetta@home] Temporarily failed download of vf_1rnbA.fasta.gz: can't resolve hostname
02-Dec-2007 12:15:37 [rosetta@home] Temporarily failed download of vf_1rnbA.psipred_ss2.gz: can't resolve hostname
02-Dec-2007 12:15:37 [rosetta@home] Started download of boinc_vf_aa1rnbA03_05.200_v1_3.gz
02-Dec-2007 12:15:38 [rosetta@home] Started download of boinc_vf_aa1rnbA09_05.200_v1_3.gz
02-Dec-2007 12:15:38 [rosetta@home] Restarting task CNTRL_01ABRELAX_SAVE_ALL_OUT_-1ubi_-_filters_1782_2739310_0 using rosetta version 582
02-Dec-2007 12:15:39 [---] Access to reference site succeeded - project servers may be temporarily down.
02-Dec-2007 12:15:45 [rosetta@home] Finished download of boinc_vf_aa1rnbA03_05.200_v1_3.gz
02-Dec-2007 12:15:45 [rosetta@home] Started download of boinc_vf_aa1rnbA15_05.200_v1_3.gz
[etc etc]

The computing of Rosetta tasks is now as before sensitive to unstable internet connections. Whether this is a Boinc of Rosetta problem I don't know, but I feel that such consequences of internet failures are unneccessary and should be addressed.


your having the same error as I had with the same group of work unit but a different fragment.


12/1/2007 1:01:06 AM|rosetta@home|Task CNTRL_01ABRELAX_SAVE_ALL_OUT_-1ubi_-_filters_1782_2342426_0 exited with zero status but no 'finished' file
12/1/2007 1:01:06 AM|rosetta@home|If this happens repeatedly you may need to reset the project.
12/1/2007 1:01:11 AM|rosetta@home|Restarting task CNTRL_01ABRELAX_SAVE_ALL_OUT_-1ubi_-_filters_1782_2342426_0 using rosetta version 582
12/1/2007 4:41:44 AM|rosetta@home|Computation for task CNTRL_01ABRELAX_SAVE_ALL_OUT_-1ubi_-_filters_1782_2342426_0 finished
ID: 49321 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Mod.Sense
Volunteer moderator

Send message
Joined: 22 Aug 06
Posts: 4018
Credit: 0
RAC: 0
Message 49326 - Posted: 2 Dec 2007, 16:08:16 UTC

ramostol is actually having two issues. One with network connectivity, and the other with tasks completing with no finish line. And both happened to occur at the same time in the messages shown.

I had download problems with the same two files last night.
boinc_vf_aa1rnbA03_05.200_v1_3.gz &
boinc_vf_aa1rnbA09_05.200_v1_3.gz

The BOINC system is very resiliant. When your client requests files, there is a list of 3 servers that it goes through to get it. If the first fails it tries the second. Then the third, and if the client still doesn't have the file, it runs the list again (because the list is actually of 6 URLs). If you make it to the bottom of the list without getting the file, I'm not clear on whether that's when the client gives up and chalks up a permanent error, or not.

Anyway, I wanted you to know that the internet problem that occured there was for work you were downloading at the time. And the task failure there, with no finish line, was an unrelated event that occured as a task was completing.
Rosetta Moderator: Mod.Sense
ID: 49326 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
M.L.

Send message
Joined: 21 Nov 06
Posts: 182
Credit: 180,462
RAC: 0
Message 49403 - Posted: 4 Dec 2007, 21:35:42 UTC
Last modified: 4 Dec 2007, 21:37:29 UTC

created 24 Nov 2007 12:43:55 UTC
name CNTRL_01ABRELAX_SAVE_ALL_OUT_-1ubi_-_filters_1782_2298621
canonical result 122751258
granted credit 44.73
minimum quorum 1
initial replication 1
max # of error/total/success tasks 1, 2, 1
Task ID
click for details Computer Sent Time reported
or deadline
explain Server state
explain Outcome
explain Client state
explain CPU time (sec) claimed credit granted credit
122751258 580603 24 Nov 2007 13:29:27 UTC 4 Dec 2007 21:12:32 UTC Over Success Done 10,567.80 44.73 45.23
124543662 510574 4 Dec 2007 13:32:59 UTC 14 Dec 2007 13:32:59 UTC In Progress Unknown New --- --- ---

Have aborted this task on 4 Dec {for pc 510574} on 5.82
ID: 49403 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Profile Greg_BE
Avatar

Send message
Joined: 30 May 06
Posts: 4876
Credit: 4,562,816
RAC: 3,333
Message 49564 - Posted: 10 Dec 2007, 11:22:36 UTC

i had a perfect run with this task CNTRL_01ABRELAX_SAVE_ALL_OUT_-1ubi_-_filters_1782_3193642_0, but I took a 8 point hit in granted credit vs claimed credit. That just sucks peanuts beyond belief! That is the worst credit grant differential I have had in a long time!
ID: 49564 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Astro
Avatar

Send message
Joined: 2 Oct 05
Posts: 987
Credit: 500,253
RAC: 0
Message 49567 - Posted: 10 Dec 2007, 13:08:24 UTC - in response to Message 49564.  
Last modified: 10 Dec 2007, 13:37:02 UTC

i had a perfect run with this task CNTRL_01ABRELAX_SAVE_ALL_OUT_-1ubi_-_filters_1782_3193642_0, but I took a 8 point hit in granted credit vs claimed credit. That just sucks peanuts beyond belief! That is the worst credit grant differential I have had in a long time!

Hi Greg, I added your host to my files and ran it. Below is what all your work that is still available looks like. Notice that your Granted Credit/hour for that task is similar to what you get for other jobs.



I then grabbed the same type jobs from work done by your machine, my AMD64 2800, and also that of the number one machine at rosetta belonging to MSO. I sorted them by TaskID, then deleted many of MSO's work so this would fit on one screen. With this you can compare how your machine does against a similar machine and also against the best machine. Notice everyone that I've looked at for that Job received 1.81 credit/decoy. I've been collecting samples on many hosts, and everyone I have shows that same amount/decoy. I.E everyone got paid the same for the same work done. MSO just does it much faster. Your data is Black, Mine is Red, and that of MSO is blue. Looks like ours are yeilding the same amount.



I do see where some decoys within the same job type seem to take longer/shorter than others, and that I can't explain. However, it happens to everyone I see, so in that, it's seems fair.
ID: 49567 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
transient
Avatar

Send message
Joined: 30 Sep 06
Posts: 376
Credit: 10,836,395
RAC: 2,883
Message 49573 - Posted: 10 Dec 2007, 17:26:15 UTC - in response to Message 49564.  

i had a perfect run with this task CNTRL_01ABRELAX_SAVE_ALL_OUT_-1ubi_-_filters_1782_3193642_0, but I took a 8 point hit in granted credit vs claimed credit. That just sucks peanuts beyond belief! That is the worst credit grant differential I have had in a long time!


Yup, sorta sucks. But to be completely fair, you'd do good to remember this unit:

CNTRL_01ABRELAX_SAVE_ALL_OUT_-1ubi_-_filters_1782_2640142_0

You claimed 40.66 credits there and got 48.87 credits
ID: 49573 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Profile Greg_BE
Avatar

Send message
Joined: 30 May 06
Posts: 4876
Credit: 4,562,816
RAC: 3,333
Message 49576 - Posted: 10 Dec 2007, 18:31:53 UTC - in response to Message 49573.  

ok so rosie got me back..lol
i had a perfect run with this task CNTRL_01ABRELAX_SAVE_ALL_OUT_-1ubi_-_filters_1782_3193642_0, but I took a 8 point hit in granted credit vs claimed credit. That just sucks peanuts beyond belief! That is the worst credit grant differential I have had in a long time!


Yup, sorta sucks. But to be completely fair, you'd do good to remember this unit:

CNTRL_01ABRELAX_SAVE_ALL_OUT_-1ubi_-_filters_1782_2640142_0

You claimed 40.66 credits there and got 48.87 credits


ID: 49576 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
transient
Avatar

Send message
Joined: 30 Sep 06
Posts: 376
Credit: 10,836,395
RAC: 2,883
Message 49584 - Posted: 10 Dec 2007, 21:59:51 UTC

Yeah, you win some, you lose some. :)

It's what I do when I see a task where I've lost big, I try to find some where I 'lucked out'. I'm usually successful in that.
ID: 49584 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
ramostol

Send message
Joined: 6 Feb 07
Posts: 64
Credit: 584,052
RAC: 0
Message 49635 - Posted: 12 Dec 2007, 9:22:29 UTC - in response to Message 49326.  

ramostol is actually having two issues. One with network connectivity, and the other with tasks completing with no finish line. And both happened to occur at the same time in the messages shown.

[...]
Anyway, I wanted you to know that the internet problem that occured there was for work you were downloading at the time. And the task failure there, with no finish line, was an unrelated event that occured as a task was completing.


Sorry for being late in responding. But I don't feel too certain that the case is this easy.

7-8 months ago I had not infrequently task failures with no finishing line, due to various unexplained causes. However, the last months this error has appeared very seldom, and practically all incidences (one unsolved exception) can be linked to the quality of the network connection. Mind you, this is not necessarily a fault caused by the Rosetta server, but seems most often to be connected to faulty wireless connection (Rosetta knowing the internet connection is activated but being unable to locate a server because the connection is fluctuating) influencing the task computing. I am no computer expert, but now I begin to feel that in my case the facts speak for themselves.
ID: 49635 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
M.L.

Send message
Joined: 21 Nov 06
Posts: 182
Credit: 180,462
RAC: 0
Message 51216 - Posted: 7 Feb 2008, 11:27:44 UTC
Last modified: 7 Feb 2008, 11:30:01 UTC

According to the BOINC MANAGER I have a 5.82 task waiting to run---
Task ID 138895008
Name CNTRL_01ABRELAX_SAVE_ALL_OUT_-1ubi_-_filters_1782_3253661_0
Workunit 126479703
Created 6 Feb 2008 16:40:54 UTC
Sent 6 Feb 2008 16:41:27 UTC
Received ---
Server state In Progress
Outcome Unknown
Client state New
Exit status 0 (0x0)
Computer ID 735230
Report deadline 16 Feb 2008 16:41:27 UTC
CPU time 0
stderr out

Validate state Initial
Claimed credit 0
Granted credit 0
application version ---

Is this correct? Not seen 5.82 for a long long time.
ID: 51216 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Mod.Sense
Volunteer moderator

Send message
Joined: 22 Aug 06
Posts: 4018
Credit: 0
RAC: 0
Message 51220 - Posted: 7 Feb 2008, 14:26:59 UTC

At any time, you can click "applications" on the Rosetta home page and see all of the application names and version that may be sending work.

Yes, I haven't seen any for a while either, but 5.82 is still active.
Rosetta Moderator: Mod.Sense
ID: 51220 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Profile Greg_BE
Avatar

Send message
Joined: 30 May 06
Posts: 4876
Credit: 4,562,816
RAC: 3,333
Message 51268 - Posted: 9 Feb 2008, 13:32:16 UTC - in response to Message 51216.  

I have one as well,

CNTRL_01ABRELAX_SAVE_ALL_OUT_-1ubi_-_filters_1782_3260163_0

thought 5.82 was done doing any work.
guess not.

According to the BOINC MANAGER I have a 5.82 task waiting to run---
Task ID 138895008
Name CNTRL_01ABRELAX_SAVE_ALL_OUT_-1ubi_-_filters_1782_3253661_0
Workunit 126479703
Created 6 Feb 2008 16:40:54 UTC
Sent 6 Feb 2008 16:41:27 UTC
Received ---
Server state In Progress
Outcome Unknown
Client state New
Exit status 0 (0x0)
Computer ID 735230
Report deadline 16 Feb 2008 16:41:27 UTC
CPU time 0
stderr out

Validate state Initial
Claimed credit 0
Granted credit 0
application version ---

Is this correct? Not seen 5.82 for a long long time.


ID: 51268 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Thomas Leibold

Send message
Joined: 30 Jul 06
Posts: 55
Credit: 19,627,164
RAC: 0
Message 51295 - Posted: 10 Feb 2008, 6:06:45 UTC

Yesterday I had one of my systems (quad-core opteron) run all three types of applications at the same time: Rosetta 5.82, Rosetta Beta 5.93 and Mini Rosetta 1.07. Looks like they get along alright, because they all completed successfully.
Team Helix
ID: 51295 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
P . P . L .

Send message
Joined: 20 Aug 06
Posts: 581
Credit: 4,865,274
RAC: 0
Message 51532 - Posted: 21 Feb 2008, 2:46:03 UTC

This task has errored twice now, someone might want to have a look at it.

https://boinc.bakerlab.org/rosetta/workunit.php?wuid=129474110

2/21/2008 1:33:06 PM|rosetta@home|Starting task CNTRL_01ABRELAX_SAVE_ALL_OUT_-1ubi_-_filters_1782_3281563_1 using rosetta version 582

2/21/2008 1:33:34 PM|rosetta@home|Computation for task CNTRL_01ABRELAX_SAVE_ALL_OUT_-1ubi_-_filters_1782_3281563_1 finished

2/21/2008 1:33:34 PM|rosetta@home|Output file CNTRL_01ABRELAX_SAVE_ALL_OUT_-1ubi_-_filters_1782_3281563_1_0 for task CNTRL_01ABRELAX_SAVE_ALL_OUT_-1ubi_-_filters_1782_3281563_1 absent

pete.



ID: 51532 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
1 · 2 · 3 · 4 · Next

Message boards : Number crunching : Problems with Rosetta version 5.82



©2021 University of Washington
https://www.bakerlab.org