Problems and Technical Issues with Rosetta@home

Message boards : Number crunching : Problems and Technical Issues with Rosetta@home

To post messages, you must log in.

Previous · 1 . . . 12 · 13 · 14 · 15 · 16 · 17 · 18 . . . 55 · Next

AuthorMessage
Dave Mickey

Send message
Joined: 29 Dec 07
Posts: 33
Credit: 4,136,957
RAC: 0
Message 72005 - Posted: 8 Jan 2012, 13:26:49 UTC


I think that you are asking for validate errors with this method because it is quite possible that the server that a work unit is assigned to is the only one that can validate it.
.....
To do other things risks corrupting files which potentially effects your whole boat of tasks.

Aborting the transfer will be throwing away the work you've done, and the credit you've earned for that work.

...having said that, the suggestion below to hit an alternate upload server should be processed normally if you are comfortable achieving the redirection via the hosts file, etc.




I looked for, and found my 13 holdouts, all reported at just about 1:00 UTC, and they all are "Over, Success, Done, and granted credit". So maybe I dodged a bullet, but I would guess that a robust parallel system like boinc would not have fragile path such as work having to go back into one and only one IP address. But I can't claim any expertise, just luck, I guess. But, from ModSenses comment, there is no "etc", it was just mod the host file. Period.

typical result record:


473457597 431993511 28 Dec 2011 17:41:51 UTC 8 Jan 2012 0:57:37 UTC Over Success Done 27,876.58 198.47 154.33


YMMV, I guess, but no sign of trouble here.

Dave
ID: 72005 · Rating: 0 · rate: Rate + / Rate - Report as offensive
Profile XJR-Maniac
Avatar

Send message
Joined: 24 Jan 06
Posts: 12
Credit: 271,462
RAC: 0
Message 72009 - Posted: 8 Jan 2012, 15:41:28 UTC

OK, after all this answers I decided to edit the "client_state.xml" to redirect the upload to "srv5" and it worked fine. WU uploaded, valdited and credit granted. No errors. It was only one WU so no big deal with the mod. After the upload the entry is deleted and so I don't have to do anything afterwards.

ID: 72009 · Rating: 0 · rate: Rate + / Rate - Report as offensive
Mod.Sense
Volunteer moderator

Send message
Joined: 22 Aug 06
Posts: 4018
Credit: 0
RAC: 0
Message 72010 - Posted: 8 Jan 2012, 15:49:57 UTC
Last modified: 8 Jan 2012, 15:56:54 UTC

I think that you are asking for validate errors with this method because it is quite possible that the server that a work unit is assigned to is the only one that can validate it.

=====
Not true. I had no validation problems, and received credit for all of the uploaded work units.


Right, it would be possible for a project to set up their servers in such a way that you'd earn yourself a validation error. I do not believe R@h's server configuration will have any problem with the specific of which upload server is used, since they all feed the same underlaying database and data store.

All I meant about the "etc." is that there are numerous ways to redirect the host name and achieve the same result. The other thing is to put it all back when you are done, although the potentially riskier approach described with modifying the client state file avoids that. I say "riskier" because there are certainly very simple ways to corrupt that file and end up trashing all of the tasks you have. Not riskier for your computer, just for your earned credits and current task list.
Rosetta Moderator: Mod.Sense
ID: 72010 · Rating: 0 · rate: Rate + / Rate - Report as offensive
Rick A. Sponholz
Avatar

Send message
Joined: 6 Sep 10
Posts: 14
Credit: 7,823,937
RAC: 0
Message 72012 - Posted: 8 Jan 2012, 17:37:26 UTC - in response to Message 72010.  

I also have completed work units failing to upload on two different computers(although some work units are uploading wihout any problems). Will the Rosetta Team be resolving the problem discribed above, or do I need to manually do something? Please help:-s
Rick

I think that you are asking for validate errors with this method because it is quite possible that the server that a work unit is assigned to is the only one that can validate it.

=====
Not true. I had no validation problems, and received credit for all of the uploaded work units.


Right, it would be possible for a project to set up their servers in such a way that you'd earn yourself a validation error. I do not believe R@h's server configuration will have any problem with the specific of which upload server is used, since they all feed the same underlaying database and data store.

All I meant about the "etc." is that there are numerous ways to redirect the host name and achieve the same result. The other thing is to put it all back when you are done, although the potentially riskier approach described with modifying the client state file avoids that. I say "riskier" because there are certainly very simple ways to corrupt that file and end up trashing all of the tasks you have. Not riskier for your computer, just for your earned credits and current task list.


ID: 72012 · Rating: 0 · rate: Rate + / Rate - Report as offensive
Profile Terry D Stout

Send message
Joined: 8 Jun 06
Posts: 2
Credit: 2,669,965
RAC: 0
Message 72013 - Posted: 8 Jan 2012, 18:01:22 UTC

I too have three work units that won't upload. When rosetta came back on line all but these three uploaded and more work downloaded.
ID: 72013 · Rating: 0 · rate: Rate + / Rate - Report as offensive
Rick A. Sponholz
Avatar

Send message
Joined: 6 Sep 10
Posts: 14
Credit: 7,823,937
RAC: 0
Message 72017 - Posted: 8 Jan 2012, 19:11:38 UTC

I also have completed work units failing to upload on two different computers(although some work units are uploading wihout any problems). Below are the Message Lines from the Event Log. Will the Rosetta Team be resolving the problem discribed above, or do I need to manually do something? Please help:-s
Rick

1/8/2012 12:27:09 PM | rosetta@home | Started upload of ab_11_29__optpps_T6211_optpps_03_09_35686_130941_1_0
1/8/2012 12:27:10 PM | rosetta@home | Temporarily failed upload of ab_11_29__optpps_T6211_optpps_03_09_35686_130941_1_0: can't resolve hostname
1/8/2012 12:27:10 PM | rosetta@home | Backing off 4 hr 41 min 22 sec on upload of ab_11_29__optpps_T6211_optpps_03_09_35686_130941_1_0
1/8/2012 12:27:14 PM | | Project communication failed: attempting access to reference site
1/8/2012 12:27:16 PM | | Internet access OK - project servers may be temporarily down.

ID: 72017 · Rating: 0 · rate: Rate + / Rate - Report as offensive
Mod.Sense
Volunteer moderator

Send message
Joined: 22 Aug 06
Posts: 4018
Credit: 0
RAC: 0
Message 72018 - Posted: 8 Jan 2012, 19:39:42 UTC

I also have completed work units failing to upload on two different computers(although some work units are uploading wihout any problems). Below are the Message Lines from the Event Log. Will the Rosetta Team be resolving the problem discribed above, or do I need to manually do something? Please help:-s


Please review my original comment here
Rosetta Moderator: Mod.Sense
ID: 72018 · Rating: 0 · rate: Rate + / Rate - Report as offensive
Viking69
Avatar

Send message
Joined: 3 Oct 05
Posts: 20
Credit: 6,872,023
RAC: 1,874
Message 72019 - Posted: 8 Jan 2012, 19:46:31 UTC - in response to Message 71991.  

This fixed my upload issue on the one file I had on a Win 7 PC. I had to modify file security permissions to SAVE the "hosts" file with this MOD.
C:WindowsSystem32driversetc

Apparently, yes, there is. put this line in your "hosts" file - somewhere under windows, just "hosts", with no extension:

128.95.160.145 srv6.bakerlab.org

just by itself. Then, requests to srv6 will go to srv4. All mine are gone now......


Dave


Hi all you enthusiastic crunchers.....
ID: 72019 · Rating: 0 · rate: Rate + / Rate - Report as offensive
Rick A. Sponholz
Avatar

Send message
Joined: 6 Sep 10
Posts: 14
Credit: 7,823,937
RAC: 0
Message 72021 - Posted: 8 Jan 2012, 20:28:21 UTC - in response to Message 72018.  

I also have completed work units failing to upload on two different computers(although some work units are uploading wihout any problems). Below are the Message Lines from the Event Log. Will the Rosetta Team be resolving the problem discribed above, or do I need to manually do something? Please help:-s


Please review my original comment here

I had read your post, but didn't understand it to mean the problem WOULD self correct. Rick
ID: 72021 · Rating: 0 · rate: Rate + / Rate - Report as offensive
Profile skgiven
Avatar

Send message
Joined: 7 Jun 06
Posts: 9
Credit: 642,742
RAC: 0
Message 72022 - Posted: 8 Jan 2012, 23:55:55 UTC

The host entry worked well for me too.
If Washington added a temporary routing entry it would prevent the need for people to do this!
ID: 72022 · Rating: 0 · rate: Rate + / Rate - Report as offensive
Chris Skull

Send message
Joined: 21 Dec 10
Posts: 8
Credit: 105,938
RAC: 0
Message 72043 - Posted: 9 Jan 2012, 15:42:17 UTC - in response to Message 71810.  

Chris, GPU is NOT supported by Rosetta.
This is strictly a CPU program.
If you want GPU work then have a look at world community grid or a program similar to Rosetta is Poem@home but I am not sure what is going on with their GPU tasks.

As far as CPU work goes, there is over 4,000 jobs waiting in queue to be processed by crunchers.


Sorry Greg,
you don't read my problem.... i reported 3 cpl. tasks but they marked as deleted by user...
ID: 72043 · Rating: 0 · rate: Rate + / Rate - Report as offensive
Profile The_Saint_(LDS)

Send message
Joined: 12 Aug 10
Posts: 6
Credit: 10,076,132
RAC: 0
Message 72045 - Posted: 9 Jan 2012, 16:55:41 UTC

Dave (or anyone else),

Any idea how to do this on a non-windows system...my ubuntu systems are totally full and unable to send anything back or get anything new because of this issue.
ID: 72045 · Rating: 0 · rate: Rate + / Rate - Report as offensive
Profile dcdc

Send message
Joined: 3 Nov 05
Posts: 1832
Credit: 119,860,059
RAC: 7,494
Message 72046 - Posted: 9 Jan 2012, 18:06:36 UTC - in response to Message 72045.  

Dave (or anyone else),

Any idea how to do this on a non-windows system...my ubuntu systems are totally full and unable to send anything back or get anything new because of this issue.

Not sure if this is the equivalent file or is only for LAN hosts: http://linuxservertutorials.blogspot.com/2008/11/ubuntu-hosts-file.html

Alternatively, you could try as BarryP has recently posted in the other thread and change the upload_url from srv6 to srv4 in client_state.xml: https://boinc.bakerlab.org/rosetta/forum_thread.php?id=5880&nowrap=true#72044. Not sure if you should exit BOINC before making the change - I would.
ID: 72046 · Rating: 0 · rate: Rate + / Rate - Report as offensive
Ian

Send message
Joined: 20 Jun 06
Posts: 1
Credit: 2,433
RAC: 0
Message 72047 - Posted: 9 Jan 2012, 18:13:50 UTC

Having an issue with a WU not wanting to upload after completion.

Ive tried update in the project tab and Retry Now in the transfers.

Any ideas on how I can force it through?
ID: 72047 · Rating: 0 · rate: Rate + / Rate - Report as offensive
Profile [PUGLIA] kidkidkid3
Avatar

Send message
Joined: 14 Sep 10
Posts: 11
Credit: 2,348,063
RAC: 0
Message 72048 - Posted: 9 Jan 2012, 18:20:23 UTC

Sorry for this thread ... i'll leave Rosetta if nobody tell us when this problem will be correct ... it's incredible ... all the people have thousand of WU pending ... and all is quite.
ID: 72048 · Rating: 0 · rate: Rate + / Rate - Report as offensive
Starz

Send message
Joined: 7 Nov 11
Posts: 1
Credit: 159,476
RAC: 0
Message 72052 - Posted: 9 Jan 2012, 19:48:34 UTC - in response to Message 72048.  

Sorry for this thread ... i'll leave Rosetta if nobody tell us when this problem will be correct ... it's incredible ... all the people have thousand of WU pending ... and all is quite.


I left SETI@home for the same thing... I was # 14 on team USA but the servers kept screwing up and no one was doing anything. At least tell the user base what is going on so we dont assume. When we assume the worst and get tired of waiting, we leave.
ID: 72052 · Rating: 0 · rate: Rate + / Rate - Report as offensive
Tyrant

Send message
Joined: 12 Dec 10
Posts: 1
Credit: 90,544
RAC: 0
Message 72053 - Posted: 9 Jan 2012, 20:18:07 UTC - in response to Message 72019.  

This fixed my upload issue on the one file I had on a Win 7 PC. I had to modify file security permissions to SAVE the "hosts" file with this MOD.
C:WindowsSystem32driversetc

Apparently, yes, there is. put this line in your "hosts" file - somewhere under windows, just "hosts", with no extension:

128.95.160.145 srv6.bakerlab.org

just by itself. Then, requests to srv6 will go to srv4. All mine are gone now......


Dave




Thanks, works perfectly in Windows 7

Greetings from Spain

ID: 72053 · Rating: 0 · rate: Rate + / Rate - Report as offensive
CBSX01

Send message
Joined: 17 Dec 07
Posts: 11
Credit: 5,387,356
RAC: 0
Message 72055 - Posted: 9 Jan 2012, 23:29:13 UTC

I read in a prev post not to abort tasks. Been trying to upload all day even after updating host file. I just aborted 7 stalled uploads that were all at < 5% requested an update and just got a boatload of new tasks immediately.

If yer stuck, you might have to bite the bullet and abort the tasks.

Good luck...
ID: 72055 · Rating: 0 · rate: Rate + / Rate - Report as offensive
Dave Mickey

Send message
Joined: 29 Dec 07
Posts: 33
Credit: 4,136,957
RAC: 0
Message 72056 - Posted: 10 Jan 2012, 1:14:57 UTC

Update from here:

It's still doing it - I just had to fake out hosts once more for a unit that couldn't resolve host name, and that unit was created less than 24 hours ago. So you might want to keep that mod in hosts, or maybe leave it the but comment it out (by putting the # character as the first character on the line:

# i am a comment
# and so is the next line
# 128.95.160.145 srv6.bakerlab.org

Oh, and it so happens I have some ubuntu. In my ubu systems hosts is found as

/etc/hosts

and I make the presumption that this would be standard on most any unix flavored host, but am not sure. Look there first.

Dave

Greetings, Tyrant - my wife loves Fernando Alonso - I would love all the $$ he makes......

@CBSX01 - was your error message "can't resolve hostname"? Did you try tweaking "hosts"?
just curious if yours was this, or something else.....

ID: 72056 · Rating: 0 · rate: Rate + / Rate - Report as offensive
Richard de Lhorbe

Send message
Joined: 17 Aug 09
Posts: 5
Credit: 3,013,955
RAC: 0
Message 72057 - Posted: 10 Jan 2012, 1:21:14 UTC
Last modified: 10 Jan 2012, 1:29:17 UTC

Anybody know what the equivalent file to "hosts" is on a Mac ? I have searched for a file called "hosts" but can't find anything .... I also suspect it may be hidden from my view somehow. I have lots of uploads that won't go through either, but strangely, some on the computer work (maybe a third) and the other two-thirds do not (same computer).

Thanks, Richard

Never mind, the backlog suddenly uploaded just now. The solution nearly always happens just after you ask the question .....
ID: 72057 · Rating: 0 · rate: Rate + / Rate - Report as offensive
Previous · 1 . . . 12 · 13 · 14 · 15 · 16 · 17 · 18 . . . 55 · Next

Message boards : Number crunching : Problems and Technical Issues with Rosetta@home



©2024 University of Washington
https://www.bakerlab.org