Anyone else having trouble uploading results?

Message boards : Number crunching : Anyone else having trouble uploading results?


Sid Celery
Joined: 11 Feb 08
Posts: 2145
Credit: 41,560,787
RAC: 9,320
Message 69106 - Posted: 10 Jan 2011, 0:19:27 UTC - in response to Message 69101.  

It is only older WUs that cannot be uploaded; new ones (the one I received today) give no problems.

Returning to an unattended machine after a few days, I found I had also lost my internet connection a couple of days ago and ran out of my stock of work for 5-7 hours. There's a lot of it about...

After restocking from my back-up project I found I could upload 17 completed 8-hour Rosetta tasks in 2 batches, but 14 others struggled and are still waiting (I won't rush them, to keep the heat off the servers in their current state). Point is, some are struggling through.

Nothing requested and nothing came down though (probably because I'm full with back-up tasks tbh).

shimp
Joined: 4 May 06
Posts: 7
Credit: 329,810
RAC: 0
Message 69107 - Posted: 10 Jan 2011, 1:11:14 UTC

1/9/2011 6:40:17 PM rosetta@home Project file upload handler is missing

1/9/2011 6:40:17 PM rosetta@home [error] Error reported by file upload server: [mem_prub_run05_centroid_round03_A_subrun_007594_SAVE_ALL_OUT_IGNORE_THE_REST_22824_29_0_0] locked by file_upload_handler PID=-1

1/9/2011 6:40:17 PM rosetta@home Backing off 2 hr 9 min 18 sec on upload of mem_prub_run05_centroid_round03_A_subrun_007598_SAVE_ALL_OUT_IGNORE_THE_REST_22824_29_0_0

shimp
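
For anyone who wants to tally these messages rather than eyeball them, here is a minimal Python sketch that counts "upload handler is missing" lines and converts each "Backing off" duration to seconds. It assumes the messages have been copied out of the BOINC client as plain text in the wording quoted in this thread; the function names and the regular expression are illustrative and not part of BOINC.

```python
import re

# Sample lines copied from messages quoted in this thread.
LOG = """\
1/9/2011 6:40:17 PM rosetta@home Project file upload handler is missing
1/9/2011 6:40:17 PM rosetta@home Backing off 2 hr 9 min 18 sec on upload of mem_prub_run05_centroid_round03_A_subrun_007598_SAVE_ALL_OUT_IGNORE_THE_REST_22824_29_0_0
12/01/2011 16:24:45 rosetta@home Backing off 1 hr 36 min 48 sec on upload of abrelax_helixfrag_1vkk_SAVE_ALL_OUT_22843_2845_0_0
"""

# Assumed message shape: "Backing off [H hr ][M min ][S sec ]on upload of <name>".
BACKOFF = re.compile(
    r"Backing off (?:(\d+) hr )?(?:(\d+) min )?(?:(\d+) sec )?on upload of (\S+)"
)


def parse_backoffs(text):
    """Return {result_name: backoff_in_seconds} for every 'Backing off' line."""
    backoffs = {}
    for match in BACKOFF.finditer(text):
        hours, minutes, seconds, name = match.groups()
        backoffs[name] = (
            int(hours or 0) * 3600 + int(minutes or 0) * 60 + int(seconds or 0)
        )
    return backoffs


def count_missing_handler(text):
    """Count lines reporting that the upload handler is missing."""
    return sum("upload handler is missing" in line for line in text.splitlines())


if __name__ == "__main__":
    print(count_missing_handler(LOG))
    for name, secs in parse_backoffs(LOG).items():
        print(secs, name)
```

This only summarizes what the client already reports; it does not change the retry schedule or contact the server.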

Chilean
Joined: 16 Oct 05
Posts: 711
Credit: 26,694,507
RAC: 0
Message 69110 - Posted: 10 Jan 2011, 6:34:02 UTC

Joined POEM@Home in the meantime ('tis my backup project). I was able to upload a few WUs, but I still have 24 WUs waiting to be uploaded (2hrs of CPU each). Haven't been able to download a single WU since the crash.

Shadowfax
Joined: 24 May 07
Posts: 1
Credit: 4,045,649
RAC: 0
Message 69150 - Posted: 10 Jan 2011, 22:57:57 UTC

I don't seem to be able to upload *any* of my Rosetta units, even the ones that downloaded and were processed OK.

For the time being, I've suspended Rosetta and joined malariacontrol.net. Wonder, though, if I should just tell BOINC no new tasks for it...

Bernd Schnitker
Joined: 2 Jan 09
Posts: 10
Credit: 62,009
RAC: 0
Message 69155 - Posted: 11 Jan 2011, 0:50:02 UTC

Is there an ETA on when the "upload handler is missing" error will be fixed? I know they had a really bad crash last week.

JohnMacPherson
Joined: 18 Sep 09
Posts: 2
Credit: 88,521
RAC: 0
Message 69201 - Posted: 11 Jan 2011, 23:47:02 UTC

I have had 9 WUs waiting to upload for several days now. I've read most of this thread but fail to see when or if the system will be fixed.


11/01/2011 6:37:36 PM | rosetta@home | Project file upload handler is missing

This is the message I'm getting. Any ideas whether I can fix it on my end?

Carla59
Joined: 10 Aug 08
Posts: 1
Credit: 4,466,456
RAC: 0
Message 69202 - Posted: 12 Jan 2011, 0:27:30 UTC

Me too, Me too.

I have a couple of dozen with the same error message.

CTAPbIi
Joined: 25 Aug 09
Posts: 2
Credit: 201,366
RAC: 0
Message 69205 - Posted: 12 Jan 2011, 2:42:00 UTC

Count me in, please. BOINC says something about the upload handler (I've never seen that before).

Greg_BE
Joined: 30 May 06
Posts: 5691
Credit: 5,859,226
RAC: 0
Message 69213 - Posted: 12 Jan 2011, 10:08:06 UTC

Try reading the front page when there are problems:
Jan 7, 2011
Well, our luck ran out. The SAN controller that has been causing so much trouble in the last few months finally tipped over in a rather destructive fashion, corrupting the binary tree on which the filesystem is based. We're trying to rebuild the thing but the sheer number of files in the filesystem (> 10M files) makes this process very, very slow. We're bringing the project up from a recent backup (12/09/10) but the backup wasn't a perfect replica of the environment, so we're having to scramble to get all the parts working together again. We only need a few more weeks and then our new, next-generation SAN will be ready to be put into place... I just thought the old one would last a few more weeks. I apologize for the hassle and appreciate your patience as we get things online again... KEL 01/07/11

CTAPbIi
Joined: 25 Aug 09
Posts: 2
Credit: 201,366
RAC: 0
Message 69217 - Posted: 12 Jan 2011, 11:05:49 UTC - in response to Message 69213.  
Last modified: 12 Jan 2011, 11:06:08 UTC

Try reading the front page when there are problems:

I read that. But why does the server status page (https://boinc.bakerlab.org/rosetta/rah_status.php) show everything up and running? That's what is confusing me.

mikey
Joined: 5 Jan 06
Posts: 1895
Credit: 9,217,610
RAC: 1,154
Message 69218 - Posted: 12 Jan 2011, 11:57:59 UTC - in response to Message 69217.  

Try reading the front page when there are problems:


I read that. But why does the server status page (https://boinc.bakerlab.org/rosetta/rah_status.php) show everything up and running? That's what is confusing me.


The Server Status page is updated by a script that may not be running right now, so it could be showing stats from several days ago.

TJ
Joined: 29 Mar 09
Posts: 127
Credit: 4,799,890
RAC: 0
Message 69225 - Posted: 12 Jan 2011, 13:51:52 UTC

Yes, I have the same: 4 still unable to upload, retrying from time to time.

Whenever I want to run Rosetta for a day or two, there is an issue: all WUs result in errors, or there is no new work, or results do not upload.
Once the WUs I have are finished, I will switch that PC to Einstein@home again.

I'll try Rosetta again next month.
Greetings,
TJ.

Greg_BE
Joined: 30 May 06
Posts: 5691
Credit: 5,859,226
RAC: 0
Message 69226 - Posted: 12 Jan 2011, 13:52:43 UTC - in response to Message 69218.  

I saw something by MOD in another thread somewhere that made it sound like this section of the server is not monitored by the status script. It almost sounded like a subsystem. **MOD - if you're reading this, can you repost what you wrote over in that other thread that I do not remember???**


Try reading the front page when there are problems:


I read that. But why does the server status page (https://boinc.bakerlab.org/rosetta/rah_status.php) show everything up and running? That's what is confusing me.


The Server Status page is updated by a script that may not be running right now, so it could be showing stats from several days ago.


JohnMacPherson
Joined: 18 Sep 09
Posts: 2
Credit: 88,521
RAC: 0
Message 69227 - Posted: 12 Jan 2011, 13:59:07 UTC

Try reading the front page when there are problems:
Jan 7, 2011
Well, our luck ran out. The SAN controller that has been causing so much trouble in the last few months finally tipped over in a rather destructive fashion, corrupting the binary tree on which the filesystem is based. We're trying to rebuild the thing but the sheer number of files in the filesystem (> 10M files) makes this process very, very slow. We're bringing the project up from a recent backup (12/09/10) but the backup wasn't a perfect replica of the environment, so we're having to scramble to get all the parts working together again. We only need a few more weeks and then our new, next-generation SAN will be ready to be put into place... I just thought the old one would last a few more weeks. I apologize for the hassle and appreciate your patience as we get things online again... KEL 01/07/11

Guy, this might do for a database engineer, but some of us have a life and are just attempting to help the projects out. Perhaps they could have said something like "Things will be screwed up for the next few weeks due to computer problems. It will be fixed in February," and then given a detailed explanation.

Remember the old KISS principle; it does apply here.

Saenger
Joined: 19 Sep 05
Posts: 271
Credit: 824,883
RAC: 0
Message 69241 - Posted: 12 Jan 2011, 16:36:43 UTC

I still can't upload an old result, although a newer one has successfully uploaded.
The old one still gets the "upload handler not present" message; the newer one didn't.

Is it possible that you somehow changed something in the upload handler URL, so that it would not be found? In my client_state.xml the right path is given as this:
<url>http://srv5.bakerlab.org/rosetta_cgi/file_upload_handler</url>
I can't find the srv5 on the server status page, should I change the number to "4"?
Greetings from Sänger
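
To see which upload handler hosts pending results point at without opening client_state.xml by hand, a read-only Python sketch along these lines may help. It assumes the handler address sits in a `<url>` element as shown above; the surrounding `<file_info>` layout, the result names, and the download URL in the sample are hypothetical, and real files differ between client versions. It makes no attempt to edit the file, and whether changing the host would help is not something this thread answers.

```python
import xml.etree.ElementTree as ET
from collections import Counter
from urllib.parse import urlparse

# Hypothetical, heavily trimmed fragment. Only the srv5 <url> value comes from
# this thread; element names other than <url> and all file names are assumed.
SAMPLE = """\
<client_state>
  <file_info>
    <name>old_result_a_0_0</name>
    <url>http://srv5.bakerlab.org/rosetta_cgi/file_upload_handler</url>
  </file_info>
  <file_info>
    <name>old_result_b_0_0</name>
    <url>http://srv5.bakerlab.org/rosetta_cgi/file_upload_handler</url>
  </file_info>
  <file_info>
    <name>some_input_file</name>
    <url>http://example.invalid/download/some_input_file</url>
  </file_info>
</client_state>
"""


def upload_hosts(xml_text):
    """Count upload-handler URLs per host name (read-only)."""
    root = ET.fromstring(xml_text)
    hosts = Counter()
    for elem in root.iter("url"):
        url = (elem.text or "").strip()
        # Keep only URLs that point at an upload handler, not downloads.
        if url.endswith("file_upload_handler"):
            hosts[urlparse(url).hostname] += 1
    return hosts


if __name__ == "__main__":
    for host, count in upload_hosts(SAMPLE).items():
        print(host, count)
```

To run it against a real file, read the file's text and pass it to `upload_hosts`; working on a copy taken while the client is stopped avoids reading a half-written file.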

[AF>france>pas-de-calais]symaski62
Joined: 19 Sep 05
Posts: 47
Credit: 33,871
RAC: 0
Message 69242 - Posted: 12 Jan 2011, 16:45:25 UTC - in response to Message 69241.  

I still can't upload an old result, although a newer one has successfully uploaded.
The old one still gets the "upload handler not present" message; the newer one didn't.

Is it possible that you somehow changed something in the upload handler URL, so that it would not be found? In my client_state.xml the right path is given as this:
<url>http://srv5.bakerlab.org/rosetta_cgi/file_upload_handler</url>
I can't find the srv5 on the server status page, should I change the number to "4"?


ok, problem servers :)

12/01/2011 16:24:45 rosetta@home Project file upload handler is missing
12/01/2011 16:24:45 rosetta@home Backing off 1 hr 36 min 48 sec on upload of abrelax_helixfrag_1vkk_SAVE_ALL_OUT_22843_2845_0_0


Mod.Sense
Volunteer moderator
Joined: 22 Aug 06
Posts: 4018
Credit: 0
RAC: 0
Message 69243 - Posted: 12 Jan 2011, 16:46:52 UTC - in response to Message 69226.  

I saw something by MOD in another thread somewhere that made it sound like this section of the server is not monitored by the status script. It almost sounded like a subsystem. **MOD - if you're reading this, can you repost what you wrote over in that other thread that I do not remember???**


I may have just pointed out that "upload server", or anything similar, does NOT appear in the list of server statuses. I'm not sure why, and I'm not in a position to address the problem personally.
Rosetta Moderator: Mod.Sense

Saenger
Joined: 19 Sep 05
Posts: 271
Credit: 824,883
RAC: 0
Message 69244 - Posted: 12 Jan 2011, 16:54:05 UTC - in response to Message 69243.  
Last modified: 12 Jan 2011, 16:54:35 UTC

I saw something by MOD in another thread somewhere that made it sound like this section of the server is not monitored by the status script. It almost sounded like a subsystem. **MOD - if you're reading this, can you repost what you wrote over in that other thread that I do not remember???**


I may have just pointed out that "upload server", or anything similar, does NOT appear in the list of server statuses. I'm not sure why, and I'm not in a position to address the problem personally.

Yes, an entity called "Upload Server" doesn't appear on that page, but it could just be something not shown on srv4, which I guess is the name of a physical machine or partition.

As you can see, more than one "server" is located on srv3.
If they have lost one machine entirely and put the stuff on another one, they may have forgotten that some old results still refer to srv5. Shit happens, especially in busy times.

shimp
Joined: 4 May 06
Posts: 7
Credit: 329,810
RAC: 0
Message 69263 - Posted: 12 Jan 2011, 23:07:21 UTC

I have about 50 that can't upload.
Not receiving any new ones.
shimp

Darmok
Joined: 4 Sep 09
Posts: 6
Credit: 231,572
RAC: 0
Message 69267 - Posted: 12 Jan 2011, 23:55:09 UTC - in response to Message 69263.  

I have about 50 that can't upload.
Not receiving any new ones.


I beat you, not that I want to. I have about 60 completed models waiting for upload in both the transfers and the tasks tabs.



©2024 University of Washington
https://www.bakerlab.org