SERVER PROBLEMS.

Message boards : Number crunching : SERVER PROBLEMS.

To post messages, you must log in.

Previous · 1 · 2 · 3 · 4 · 5 · 6 · 7 . . . 12 · Next

AuthorMessage
Profile Greg_BE
Avatar

Send message
Joined: 30 May 06
Posts: 5662
Credit: 5,701,869
RAC: 2,154
Message 58390 - Posted: 2 Jan 2009, 22:13:21 UTC

as of last update 22.01 utc servers are still down.
ID: 58390 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Profile Greg_BE
Avatar

Send message
Joined: 30 May 06
Posts: 5662
Credit: 5,701,869
RAC: 2,154
Message 58396 - Posted: 3 Jan 2009, 1:14:46 UTC

As of 3 Jan 2009 1:09:06 UTC (updated every 10 minutes)

servers down again, 354 tasks ready to go out.
main page says there are just 7 tasks ready to go.

i will be out of work pretty soon. boosted my run time to 8 hrs.

any news?
ID: 58396 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
P . P . L .

Send message
Joined: 20 Aug 06
Posts: 581
Credit: 4,865,274
RAC: 0
Message 58399 - Posted: 3 Jan 2009, 1:41:56 UTC

There was some work over 1.000 tasks, one work spliter was running.

Now both are offline again, such is life. I'm out of work for Rosetta.

pete.

ID: 58399 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
BarryAZ

Send message
Joined: 27 Dec 05
Posts: 153
Credit: 30,843,285
RAC: 39
Message 58419 - Posted: 3 Jan 2009, 15:45:12 UTC - in response to Message 58399.  

Looks like the admins are off celebrating the long weekend -- we might well be out of work until sometime on Monday.



There was some work over 1.000 tasks, one work spliter was running.

Now both are offline again, such is life. I'm out of work for Rosetta.

pete.


ID: 58419 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Evan

Send message
Joined: 23 Dec 05
Posts: 268
Credit: 402,585
RAC: 0
Message 58421 - Posted: 3 Jan 2009, 15:48:35 UTC

There are 24,000 in the queue now. I filled up my quota.
ID: 58421 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
P . P . L .

Send message
Joined: 20 Aug 06
Posts: 581
Credit: 4,865,274
RAC: 0
Message 58433 - Posted: 3 Jan 2009, 20:28:30 UTC

Morning all.

It is in Oz anyway!

Looks like both spliters are down again, no work until someone gives them a kick again.

pete.

ID: 58433 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Mod.Sense
Volunteer moderator

Send message
Joined: 22 Aug 06
Posts: 4018
Credit: 0
RAC: 0
Message 58435 - Posted: 3 Jan 2009, 21:58:38 UTC

It is not just a matter of starting a job. The reason they are inactive is because the queue of tasks is apparently empty. "so fill the queue"... yes, I'm sure they will. Until then, there isn't any work unless tasks pass deadlines, or report failures.
Rosetta Moderator: Mod.Sense
ID: 58435 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Profile Greg_BE
Avatar

Send message
Joined: 30 May 06
Posts: 5662
Credit: 5,701,869
RAC: 2,154
Message 58443 - Posted: 4 Jan 2009, 1:13:12 UTC - in response to Message 58435.  

It is not just a matter of starting a job. The reason they are inactive is because the queue of tasks is apparently empty. "so fill the queue"... yes, I'm sure they will. Until then, there isn't any work unless tasks pass deadlines, or report failures.



When will there be new work?
ID: 58443 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Profile DeMus
Avatar

Send message
Joined: 2 Jan 09
Posts: 10
Credit: 55,278
RAC: 0
Message 58446 - Posted: 4 Jan 2009, 5:07:05 UTC - in response to Message 58443.  

It is not just a matter of starting a job. The reason they are inactive is because the queue of tasks is apparently empty. "so fill the queue"... yes, I'm sure they will. Until then, there isn't any work unless tasks pass deadlines, or report failures.



When will there be new work?



2 days ago I started Rosetta@Home, merely to have my computer doing something. Seti@Home ran dry because of a server problem which took like forever to be fixed (I'm sure the guys at Berkely did all they could do but sometimes it just takes a while). Now I am almost out of work at Rosetta as well.
Does this happen often? I mean do I have to set my workbuffer at a large amount of days to be able to keep doing work, without running dry? What are your experiences?

ID: 58446 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
AMD_is_logical

Send message
Joined: 20 Dec 05
Posts: 299
Credit: 31,460,681
RAC: 0
Message 58448 - Posted: 4 Jan 2009, 6:51:08 UTC - in response to Message 58446.  

Does this happen often? I mean do I have to set my workbuffer at a large amount of days to be able to keep doing work, without running dry? What are your experiences?


My experience has been that brief outages are common, but extended outages are rare. In the past, a two day queue would ride out nearly all of them.

I'm currently using a shorter queue than that because I believe that earlier results are more useful. The Rosetta team can use these preliminary results to tweak the next batch of WUs.

ID: 58448 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Aegis Maelstrom

Send message
Joined: 29 Oct 08
Posts: 61
Credit: 2,137,555
RAC: 0
Message 58450 - Posted: 4 Jan 2009, 8:55:11 UTC

I shall not state anything spectacular saying that these outages are annoying - especially after problems with Work Units themselves.

It is a double pity as my BOINC@Poland team chose Rosetta@Home as their project of the month for January - just to see problems with fetching data and some WUs.

I'm afraid that what supposed to be an advertisement and encouragement for some crunchers to help and stay with Rosetta could bring a contrary effect.
ID: 58450 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
mikey
Avatar

Send message
Joined: 5 Jan 06
Posts: 1894
Credit: 8,767,285
RAC: 12,464
Message 58459 - Posted: 4 Jan 2009, 11:35:27 UTC - in response to Message 58450.  
Last modified: 4 Jan 2009, 11:36:28 UTC

I shall not state anything spectacular saying that these outages are annoying - especially after problems with Work Units themselves.

It is a double pity as my BOINC@Poland team chose Rosetta@Home as their project of the month for January - just to see problems with fetching data and some WUs.

I'm afraid that what supposed to be an advertisement and encouragement for some crunchers to help and stay with Rosetta could bring a contrary effect.


With today only being the 4th and with the Holidays almost over I would guess that your Team will have plenty of time to more than make up for a few bad days. I would expect everything to be back in tip top shape sometime tomorrow, the 5th. Congratulations on selecting Rosetta for your Teams Project of the Month!! I wish you all well.
ID: 58459 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Profile Greg_BE
Avatar

Send message
Joined: 30 May 06
Posts: 5662
Credit: 5,701,869
RAC: 2,154
Message 58471 - Posted: 4 Jan 2009, 14:44:59 UTC

out of work on RAH for the first time since I joined this project.
now running einstein 100% until something else comes in.
this is the lowest level RAH has ever gone to.
hope that the rest of 2009 will be better.
ID: 58471 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Sid Celery

Send message
Joined: 11 Feb 08
Posts: 1982
Credit: 38,451,347
RAC: 14,322
Message 58481 - Posted: 4 Jan 2009, 18:09:44 UTC - in response to Message 58435.  

It is not just a matter of starting a job. The reason they are inactive is because the queue of tasks is apparently empty. "so fill the queue"... yes, I'm sure they will. Until then, there isn't any work unless tasks pass deadlines, or report failures.

Not sure if I'm misunderstanding this. Do you mean that there's no input to the rah_make_work* servers or just that they aren't running to output anything?
ID: 58481 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Sid Celery

Send message
Joined: 11 Feb 08
Posts: 1982
Credit: 38,451,347
RAC: 14,322
Message 58482 - Posted: 4 Jan 2009, 18:14:32 UTC

Also, the server status page is showing:

Program___ Host Status
transitioner boinc Not running

What does this do? Anything we should concern ourselves with?
ID: 58482 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
idonthaveanick

Send message
Joined: 11 May 06
Posts: 1
Credit: 9,297
RAC: 0
Message 58483 - Posted: 4 Jan 2009, 18:17:15 UTC

Im not getting work... No changes have been made to preferences.

Boinc requests XXXXX seconds of work but just gets 0 tasks...

???
ID: 58483 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Profile Greg_BE
Avatar

Send message
Joined: 30 May 06
Posts: 5662
Credit: 5,701,869
RAC: 2,154
Message 58485 - Posted: 4 Jan 2009, 18:20:11 UTC - in response to Message 58482.  

guys, the simplest version of this whole thing is that the system is 'broken'.
there is NO work
there is nothing WE can do, but WAIT.
when the team comes in on monday morning pacific time,they will get on this.

i am out of work and so are allot of other people, so we just sit and wait.

Also, the server status page is showing:

Program___ Host Status
transitioner boinc Not running

What does this do? Anything we should concern ourselves with?

ID: 58485 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Mod.Sense
Volunteer moderator

Send message
Joined: 22 Aug 06
Posts: 4018
Credit: 0
RAC: 0
Message 58492 - Posted: 4 Jan 2009, 18:39:42 UTC

If you review all of the posts on the message boards for the past 3 days, you will find two themes. One is that people are upset because they got work, and it encountered errors or ran much too long. The other is they are unable to get work.

When pockets of tasks are hit that error out quickly, the queue of tasks can become depleted rather quickly.

Do you mean that there's no input to the rah_make_work* servers


That puts it well Sid, yes. I just didn't feel most people would understand a statement like that. Also, depending on what the Project Team has in progress, they may need to test new tasks on Ralph first as well, before releasing new work here.

All the tasks you are seeing come out are just the WUs that are being reissued due to errors by the first party, or expiring deadlines. This is why there are so few.
Rosetta Moderator: Mod.Sense
ID: 58492 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Sid Celery

Send message
Joined: 11 Feb 08
Posts: 1982
Credit: 38,451,347
RAC: 14,322
Message 58497 - Posted: 4 Jan 2009, 19:00:44 UTC - in response to Message 58492.  

Do you mean that there's no input to the rah_make_work* servers

That puts it well Sid, yes. I just didn't feel most people would understand a statement like that. Also, depending on what the Project Team has in progress, they may need to test new tasks on Ralph first as well, before releasing new work here.

All the tasks you are seeing come out are just the WUs that are being reissued due to errors by the first party, or expiring deadlines. This is why there are so few.

Amazed that an untechnical person like me got that right. But I'm a process engineer so I guess I appreciate the process even if I'm unqualified in the detail.

So this takes the problem to a whole other level. Best to just wait.

I'm just reminded of a remark made by a manager of mine a few years ago:
"There's 24 hours in a day and it's not essential to sleep or take holidays in the short-term..."

He wasn't popular either...
ID: 58497 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Profile Greg_BE
Avatar

Send message
Joined: 30 May 06
Posts: 5662
Credit: 5,701,869
RAC: 2,154
Message 58501 - Posted: 4 Jan 2009, 21:03:13 UTC - in response to Message 58497.  

Do you mean that there's no input to the rah_make_work* servers

That puts it well Sid, yes. I just didn't feel most people would understand a statement like that. Also, depending on what the Project Team has in progress, they may need to test new tasks on Ralph first as well, before releasing new work here.

All the tasks you are seeing come out are just the WUs that are being reissued due to errors by the first party, or expiring deadlines. This is why there are so few.

Amazed that an untechnical person like me got that right. But I'm a process engineer so I guess I appreciate the process even if I'm unqualified in the detail.

So this takes the problem to a whole other level. Best to just wait.

I'm just reminded of a remark made by a manager of mine a few years ago:
"There's 24 hours in a day and it's not essential to sleep or take holidays in the short-term..."

He wasn't popular either...



interesting quote for sure....
ID: 58501 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Previous · 1 · 2 · 3 · 4 · 5 · 6 · 7 . . . 12 · Next

Message boards : Number crunching : SERVER PROBLEMS.



©2024 University of Washington
https://www.bakerlab.org