Posts by TPCBF

21) Message boards : Number crunching : Problems with uploading the results (Message 75128)
Posted 18 Feb 2013 by TPCBF
Post:
It might help if you actually read other threads in the forum as well. NOBODY can upload/download or communicate in any way with the project servers for more than a day now...



.. And I suggest to inform users at least on main page about such issues ..
it would definitely reduce number of such posts
As mentioned by others as well, (lack of) communication is the biggest problem with the Rosetta project...

Mr. Baker seems to like basking in the limelight, but doesn't give a **** when it comes to how the project is maintained... :(

Ralf
22) Message boards : Number crunching : Problems with uploading the results (Message 75113)
Posted 18 Feb 2013 by TPCBF
Post:
I have finished several tasks and there are no errors, but it couldn't upload, and the time left shown in 'status' just delayed for many times, besides, I found that the 'TeraFLOPS estimate' in the frontpage is 7.068, and it's far below several days before, I remember three days before, that was over 100, so I guess there maybe some errors with the server and I'm calling for help.
Thanks!
It might help if you actually read other threads in the forum as well. NOBODY can upload/download or communicate in any way with the project servers for more than a day now...


23) Message boards : Number crunching : Problems and Technical Issues with Rosetta@home (Message 75101)
Posted 17 Feb 2013 by TPCBF
Post:
Oh well. It's Sunday so I expect we will have to wait another day for the comms to be sorted out.
Don't hold your breath, Monday is a holiday here in the US of A, we likely have to wait longer for anyone to fix whatever is broken.
Not that the project admins are more responsive during a work week though... :-(

Ralf
24) Message boards : Number crunching : 3.43 is causing pop-ups (Message 74470)
Posted 19 Nov 2012 by TPCBF
Post:
Why not abort these tasks as has been reported as the fix on the front page and all over these forums?
Because that isn't really a "solution". You simply forget that not all machines crunching for Rosetta@Home are easily accessible. While I could do just that, aborting the batch of faulty WUs on one of my own hosts, it created a mess on two remote systems on which I previously had permission to run it. Not any longer, as those users felt interrupted and I had to remove BOINC/R@H from those systems once I got on-site...

Sorry, if for whatever reason such faulty WUs make it out "into the wild", there needs to be a way to have a server side abort on those, not requiring user interaction. WCG can do this just fine...

Ralf
25) Message boards : Number crunching : Does using the Screensaver affect preformace of Rosetta@home? (Message 73577)
Posted 6 Aug 2012 by TPCBF
Post:
Yes but Remote Desktop is $80
Just use VNC... ;-)

Ralf
26) Message boards : Number crunching : I installed new version now I have lost all my statistics and not getting any new work (Message 73514)
Posted 22 Jul 2012 by TPCBF
Post:


Wed 18 Jul 19:56:15 2012 | | max memory usage when idle: 3686.40MB
Wed 18 Jul 19:56:15 2012 | | max disk usage: 0.00GB
That doesn't look quite kosher to me... :?

Ralf
27) Message boards : Number crunching : low credit (Message 73190)
Posted 2 Jun 2012 by TPCBF
Post:
...I will be ABORTING...


Thus assuring that you receive zero credit for the effort.
Well, I was tempted to do this on a couple of those WUs as well, as the difference between zero and something like 2.4 or 3.5 (for a WU that ran a day or two) doesn't make much of a difference...

Ralf
28) Message boards : Number crunching : Almost out of work again? Jeesh... (Message 71785)
Posted 11 Dec 2011 by TPCBF
Post:
As of now there are only 18,000 WU's left. My Boinc manager is already giving the message: "Message
From Server: No Work Sent."

We will probably run out of WU's in a few hours and shut-down for the rest of the weekend.
What do you expect? After all, it's weekend again. And almost time for the yearly week long blowup...

Ralf
29) Message boards : Number crunching : Problems and Technical Issues with Rosetta@home (Message 71663)
Posted 28 Nov 2011 by TPCBF
Post:
pfft, just more lack of communication and action by R@H team.
total work units queued goose egg 0.
in the news section any info? no - why should there be this is not news.
news here on the boards? no - same reason, this is not news.
You're right, no news, just the same old same old...

I gave up on taking Rosetta@Home seriously earlier this year. I honestly doubt that this project's attitude will ever change...

Ralf
30) Message boards : Number crunching : MiniRosetta 3.17 Problems. (Message 71559)
Posted 31 Oct 2011 by TPCBF
Post:
RALPH has separate executables for minirosetta (current version of Rosetta@Home) and minirosetta_beta (next version of Rosetta@Home). At the moment, the two applications are identical, despite their different version numbers.

minirosetta => 3.18
minirosetta_beta => 3.17

During the update process, the two versions will diverge. The idea behind this is to always have a running version of the software currently deployed on Rosetta@Home available for test.
And are you sure that everyone's on the same page here? :?

Ralf
31) Message boards : Number crunching : MiniRosetta 3.17 Problems. (Message 71556)
Posted 31 Oct 2011 by TPCBF
Post:
Why isn't ralph being used to catch these errors? All workunits I've received from ralph recently have been using app version 3.14.
Yeah, what RALPH@Home has been doing recently is a bit odd. Several times, I got swamped with sets of 20 WUs at a time, and a mix of applications labeled both as "Rosetta Mini Beta 3.17" (currently 2 awaiting their turn) and as "Rosetta Mini 3.14" (another 20 WUs piled up to eventually be processed).

Ralf
32) Message boards : Number crunching : MiniRosetta 3.17 Problems. (Message 71554)
Posted 31 Oct 2011 by TPCBF
Post:
The offending jobs have been removed.

Rosetta is a large and diverse project. Unlike more focused efforts such as SETI@Home, the breadth of compute tasks being performed on Rosetta@Home is incredible. While offering enormous flexibility, this greatly complicates testing and validation. Unfortunately, some bad jobs slipped in this time. In many cases, Rosetta@Home users such as myself find out about failing jobs when you do, and we're just as frustrated when such jobs are distributed.

Thank you for your continued support.
But why the *snap* can no sysadmin post some proper info about this in a timely fashion?

It's just a matter of simple communication, doesn't even cost much time. :-(

Ralf

33) Message boards : Number crunching : compute errors (Message 71523)
Posted 28 Oct 2011 by TPCBF
Post:
Got the same kind of compute errors now too, claiming problems with the .out file, which I think is a red herring. This and a number of validate errors started to show up since the update to 3.17...

Ralf
34) Message boards : Number crunching : Validator down... :-( (Message 71511)
Posted 27 Oct 2011 by TPCBF
Post:
Well, never a dull moment...

Does anyone know what the issue is here or is this (just) another "it's weekend and no sysadmin is around" kind of typical R@H thing again? :-(

Ralf


It's slow but it isn't a problem.
Now, it is only slow, but that was not the case when I wrote my original message days ago, that you are (mis)quoting here.

Ralf
35) Message boards : Number crunching : Validator down... :-( (Message 71505)
Posted 27 Oct 2011 by TPCBF
Post:
Well, WU's are being validated, but at a snail's pace right now. Usually, they would not sit more than 5-10 minutes as pending, now I always have about a dozen that will sit for about a day before being validated...

Ralf
36) Message boards : Number crunching : Validator down... :-( (Message 71497)
Posted 26 Oct 2011 by TPCBF
Post:
The validator is probably working through a backlog of results after their downtime, hence the pending status of tasks.
That backlog would have existed earlier today as well and all the previously pending WUs had cleared out but now they just keep piling up again. If it is "just" a backlog, at least one or two of those should be validated once in a while. But none of them has in more than 12h...

The silence of the sysadmins is really deafening... :-(

Ralf
37) Message boards : Number crunching : Validator down... :-( (Message 71494)
Posted 25 Oct 2011 by TPCBF
Post:
Everything normal here as of now,
Uploads, downloads, validation, no pending`s,
Though from what you are saying here something else is not.
There's certainly something not right, not only with Rosetta@Home but with RALPH@Home as well.
On R@H, WU's are uploaded and reported but then just sit as "pending". This was working at some point yesterday.
And on RALPH@Home, you can not upload any finished WUs due to a "can not attach to shared memory" error on the server(s).

Don't know how much resources Rosetta@Home and RALPH@Home are sharing, but it looks to me as if whatever they fixed yesterday isn't in fact working properly...

Ralf
38) Message boards : Number crunching : Validator down... :-( (Message 71492)
Posted 25 Oct 2011 by TPCBF
Post:
Pending WUs are definitely piling up again, RALPH@Home WU's can not be reported due to a server error as well and on the server status page, everything shows running, which it certainly is not... :-(

Ralf
39) Message boards : Number crunching : Validator down... :-( (Message 71490)
Posted 25 Oct 2011 by TPCBF
Post:
Either the validator is malfunctioning and this is not registering on the Server Status page, or else it is badly behind. I have over 120 work units waiting to validate -- some are several days old.
Yeah, something's still up, all WU's that were pending over the weekend went through but now since this morning/last night, WU's keep getting stuck as pending again here as well...

Ralf
40) Message boards : Number crunching : Validator down... :-( (Message 71485)
Posted 25 Oct 2011 by TPCBF
Post:
Well, after two days being down, it took someone apparently less than two hours to fix the problem.
All servers show status running and all but 3 WUs that were stuck as pending have been validated.

Still wonder why the response from the R@H team has to be so abysmal compared to other scientific projects... :-(

Ralf




