SERVER PROBLEMS - 2.

Jochen
Joined: 6 Jun 06
Posts: 133
Credit: 3,847,433
RAC: 0
Message 67343 - Posted: 25 Aug 2010, 18:53:09 UTC - in response to Message 67336.  

I have 30 GB of my HD reserved for R@H, but it never uses more than approx. 350 MB. So I think there's no way to increase the task pool.

With your computers hidden, one needs to guess...
What is your cache size? What is your preferred running time?

@ModSense: It's too late to try to educate me. ;)
I guess I will just increase my cache again. A small cache was simply comfortable, since I quite frequently reinstall the OS due to hardware changes: I just let BOINC run out of work overnight...

cu Joe



Mod.Sense
Volunteer moderator
Project administrator
Joined: 22 Aug 06
Posts: 3550
Credit: 0
RAC: 0
Message 67344 - Posted: 25 Aug 2010, 18:57:13 UTC - in response to Message 67336.  

I have 30 GB of my HD reserved for R@H, but it never uses more than approx. 350 MB. So I think there's no way to increase the task pool.


The amount of work you keep on-hand is configured in the network preferences of the BOINC Manager (or via the project website, and then you can update to the project and use the same settings on several machines).

Also, the amount of free space on your hard drive is not really relevant. What is important is the amount that BOINC is allowed to use. This is configured in the disk and memory tab of the preferences.
Rosetta Moderator: Mod.Sense
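
To illustrate the point that what matters is the amount BOINC is allowed to use, here is a rough Python sketch. It assumes the disk preferences consist of three limits (an absolute cap in GB, a minimum amount of free space to leave, and a percentage of the total disk) and that the tightest one wins. The function and parameter names are invented for this example and are not BOINC's own; only the 30 GB and 350 MB figures come from the post above.

```python
def disk_allowance_gb(total_gb, free_gb, boinc_used_gb,
                      max_used_gb, min_free_gb, max_used_pct):
    """Return how many GB BOINC may occupy under three assumed limits.

    All names here are hypothetical; this is not BOINC's actual code.
    """
    by_absolute_cap = max_used_gb
    by_percentage = total_gb * max_used_pct / 100.0
    # Space BOINC could hold while still leaving min_free_gb free on the disk.
    by_min_free = boinc_used_gb + free_gb - min_free_gb
    # The most restrictive limit decides; never report a negative allowance.
    return max(0.0, min(by_absolute_cap, by_percentage, by_min_free))


# 30 GB cap and ~0.35 GB in use are from the thread; the rest is made up.
allowance = disk_allowance_gb(total_gb=500, free_gb=200, boinc_used_gb=0.35,
                              max_used_gb=30, min_free_gb=1, max_used_pct=50)
print(f"BOINC may use up to {allowance} GB")
```

Under these assumptions the 30 GB cap is the binding limit, so the 200 GB of free space never comes into play. How much work is actually fetched is a separate matter, governed by the cache setting.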

goraxan
Joined: 18 Jul 10
Posts: 6
Credit: 1,143,926
RAC: 0
Message 67347 - Posted: 25 Aug 2010, 21:21:39 UTC

Ok, thx. Now I've found where the cache properties are :)

P . P . L .
Joined: 20 Aug 06
Posts: 581
Credit: 4,865,274
RAC: 0
Message 67404 - Posted: 27 Aug 2010, 21:52:50 UTC

It seems most servers are down; hopefully they can finally fix the problems.

Data-driven web pages boinc.bakerlab.org Running
Scheduler srv4.bakerlab.org Running
rah_make_work1 srv1 Not running
rah_make_work2 srv3 Not running
feeder srv4 Not running
file_deleter srv1 Not running
rah_validator_beta bk2 Not running
rah_validator_mini bk1 Not running
rah_assimilatorbeta1 bk1 Not running
rah_assimilatorbeta2 bk1 Not running
rah_assimilatorbeta3 bk2 Not running
rah_assimilatorbeta4 bk2 Not running
rah_assimilator_mini1 bk1 Not running
rah_assimilator_mini2 bk1 Not running
rah_assimilator_mini3 bk2 Not running
rah_assimilator_mini4 bk2 Not running
rah_assimilator_mini5 bk1 Not running
rah_assimilator_mini6 bk1 Not running
rah_assimilator_mini7 bk2 Not running
rah_assimilator_mini8 bk2 Not running
transitioner boinc Not running
db_purge srv1 Not running

Speedy
Joined: 25 Sep 05
Posts: 159
Credit: 598,637
RAC: 0
Message 67405 - Posted: 27 Aug 2010, 22:08:37 UTC

As of 27 Aug 2010 22:04:51 UTC

All servers are running.

Ready to send 3,280
In progress 258,996
Have a crunching good day!!

goraxan
Joined: 18 Jul 10
Posts: 6
Credit: 1,143,926
RAC: 0
Message 67413 - Posted: 28 Aug 2010, 8:37:31 UTC

Everything running but still problems :(

Ready to send 10
In progress 358,716

Evan
Joined: 10 Aug 08
Posts: 5
Credit: 39,050
RAC: 0
Message 67444 - Posted: 29 Aug 2010, 21:20:29 UTC



Still having some troubles.

CrazySpy
Joined: 29 Aug 10
Posts: 1
Credit: 11,961
RAC: 0
Message 67477 - Posted: 30 Aug 2010, 23:50:58 UTC - in response to Message 67444.  



Still having some troubles.


Hi, unfortunately I'm having the same problem.

I have two projects in my BOINC Manager.

Seti and Rosetta.

I'm able to process Seti tasks without problem but I can't get any work from Rosetta... I'm only getting "Communication deferred" in the Status message on the Projects tab.

Could anyone help me? I'm really interested in helping the Rosetta project.

Polian
Joined: 21 Sep 05
Posts: 152
Credit: 10,141,266
RAC: 0
Message 67481 - Posted: 31 Aug 2010, 3:00:07 UTC - in response to Message 67477.  

Could anyone help me? I'm really interested in helping the Rosetta project.


You're not doing anything wrong. It appears the servers are working, although they are churning out workunits slowly, and work is not always available at the moment.

However, without official word from the project, this is just speculation. When your client gets "lucky" you'll receive some work. My PCs have been crunching *most* of the time.

Bill Hepburn
Joined: 18 Sep 05
Posts: 13
Credit: 12,478,085
RAC: 477
Message 67482 - Posted: 31 Aug 2010, 4:07:27 UTC - in response to Message 67477.  


Hi, unfortunately I'm having the same problem.

I have two projects in my BOINC Manager.

Seti and Rosetta.

I'm able to process Seti tasks without problem but I can't get any work from Rosetta... I'm only getting "Communication deferred" in the Status message on the Projects tab.

Could anyone help me? I'm really interested in helping the Rosetta project.


Remember that Seti goes into its weekly three-day outage starting Tuesday morning California time (UTC-7). So, if they don't get Rosetta fixed, you may run out of work. Over the years, Rosetta has been one of the most reliable BOINC projects, so I remain optimistic.


P . P . L .
Joined: 20 Aug 06
Posts: 581
Credit: 4,865,274
RAC: 0
Message 67483 - Posted: 31 Aug 2010, 4:07:29 UTC
Last modified: 31 Aug 2010, 4:07:48 UTC

Down again.

Data-driven web pages boinc.bakerlab.org Running
Scheduler srv4.bakerlab.org Running
rah_make_work1 srv1 Not running
rah_make_work2 srv3 Not running
feeder srv4 Not running
file_deleter srv1 Not running
rah_validator_beta bk2 Not running
rah_validator_mini bk1 Not running
rah_assimilatorbeta1 bk1 Not running
rah_assimilatorbeta2 bk1 Not running
rah_assimilatorbeta3 bk2 Not running
rah_assimilatorbeta4 bk2 Not running
rah_assimilator_mini1 bk1 Not running
rah_assimilator_mini2 bk1 Not running
rah_assimilator_mini3 bk2 Not running
rah_assimilator_mini4 bk2 Not running
rah_assimilator_mini5 bk1 Not running
rah_assimilator_mini6 bk1 Not running
rah_assimilator_mini7 bk2 Not running
rah_assimilator_mini8 bk2 Not running
transitioner boinc Not running
db_purge srv1 Not running

Running: Program is operating normally

Not Running: Program failed or ran out of work
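
For anyone who wants to keep an eye on a listing like the one above without reading it by hand, a small Python sketch follows. It assumes each row has the form "name host Running" or "name host Not running", as in the pasted text; the function name and the sample are made up for illustration and have nothing to do with the project's own tools.

```python
def parse_status(text):
    """Map each daemon name to (host, is_running) for rows like 'feeder srv4 Not running'."""
    status = {}
    for line in text.splitlines():
        line = line.strip()
        # Check "Not running" first; the legend lines end with neither suffix.
        if line.endswith("Not running"):
            running, rest = False, line[: -len("Not running")]
        elif line.endswith("Running"):
            running, rest = True, line[: -len("Running")]
        else:
            continue  # skip blanks and anything that is not a status row
        parts = rest.split()
        if len(parts) < 2:
            continue  # need at least a name and a host
        status[" ".join(parts[:-1])] = (parts[-1], running)
    return status


# A few rows copied from the listing above.
SAMPLE = """\
Data-driven web pages boinc.bakerlab.org Running
Scheduler srv4.bakerlab.org Running
feeder srv4 Not running
transitioner boinc Not running
"""

status = parse_status(SAMPLE)
down = sorted(name for name, (_, ok) in status.items() if not ok)
print(f"{len(down)} of {len(status)} not running: {', '.join(down)}")
```

Keep in mind that this only reports what the status page says; it cannot tell whether the page itself is up to date.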

mikey
Joined: 5 Jan 06
Posts: 1452
Credit: 5,791,113
RAC: 1,166
Message 67485 - Posted: 31 Aug 2010, 9:34:34 UTC - in response to Message 67481.  

Could anyone help me? I'm really interested in helping the Rosetta project.


You're not doing anything wrong. It appears the servers are working, although they are churning out workunits slowly, and work is not always available at the moment.

However, without official word from the project, this is just speculation. When your client gets "lucky" you'll receive some work. My PCs have been crunching *most* of the time.


Over in another thread, this came from a person who knows what is going on:
"Even after the cause of the go-slow is identified and fixed it may be a week before we are back up to normal operations. The servers normally make work available in the tens of thousands of tasks per hour; right now idle crunchers are probably requesting at least 100,000 tasks. It will take time to clear that backlog even when running at full capacity."

The thread is here: Message boards : Number crunching : no work units
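
The reasoning in that quote can be put into a back-of-the-envelope calculation. In the Python sketch below, only the 100,000-task backlog comes from the quote; the hourly supply and demand figures are invented placeholders for "tens of thousands of tasks per hour", so the result is an illustration and not an estimate of the real recovery time.

```python
def hours_to_clear(backlog_tasks, supply_per_hour, demand_per_hour):
    """Hours until the backlog is gone, or None if supply never outpaces demand."""
    surplus = supply_per_hour - demand_per_hour
    if surplus <= 0:
        return None  # the backlog would never shrink
    return backlog_tasks / surplus


backlog = 100_000        # from the quoted post
supply = 20_000          # hypothetical: tasks the servers can create per hour
normal_demand = 19_000   # hypothetical: tasks requested per hour in normal operation

hours = hours_to_clear(backlog, supply, normal_demand)
print(f"About {hours:.0f} hours ({hours / 24:.1f} days) to clear the backlog")
```

The point is that only the surplus of supply over everyday demand eats into the backlog, which is why recovery can take days even at full capacity, and why extra server capacity would shorten it sharply.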

Murasaki
Joined: 20 Apr 06
Posts: 303
Credit: 499,539
RAC: 123
Message 67494 - Posted: 31 Aug 2010, 18:17:58 UTC - in response to Message 67485.  
Last modified: 31 Aug 2010, 18:23:10 UTC

Over in another thread, this came from a person who knows what is going on:


I think a bit of clarification is needed here. The comment was by a person who made some educated guesses about what is going on based on previous experiences.

I have no more knowledge than any other volunteer who has trawled these forums over the past few years. I can speculate based on similar situations in the past, but we are still awaiting official comments from the project team. They may actually surprise me and upgrade the capacity so the backlog is cleared much sooner.

Murasaki
Joined: 20 Apr 06
Posts: 303
Credit: 499,539
RAC: 123
Message 67570 - Posted: 3 Sep 2010, 18:23:31 UTC - in response to Message 67494.  

They may actually surprise me and upgrade the capacity so the backlog is cleared much sooner.


As the project has jumped from 32 Teraflops to 108 Teraflops in less than 48 hours, my prediction of a week to recover looks abysmal in hindsight. I doubt I will be able to get employment as either a fortune teller or a weatherman.

P . P . L .
Joined: 20 Aug 06
Posts: 581
Credit: 4,865,274
RAC: 0
Message 67882 - Posted: 30 Sep 2010, 22:46:06 UTC
Last modified: 30 Sep 2010, 22:47:25 UTC

Hi.

Well that was a big one, glad to see the servers coming back up.

Greg_BE
Joined: 30 May 06
Posts: 4871
Credit: 3,741,243
RAC: 2,255
Message 67884 - Posted: 30 Sep 2010, 23:17:28 UTC

server status says everything is up and running, but I get

10/1/2010 1:16:20 AM rosetta@home Sending scheduler request: To report completed tasks.
10/1/2010 1:16:20 AM rosetta@home Reporting 14 completed tasks, not requesting new tasks
10/1/2010 1:16:22 AM Project communication failed: attempting access to reference site
10/1/2010 1:16:22 AM rosetta@home Scheduler request failed: Couldn't connect to server
10/1/2010 1:16:23 AM Internet access OK - project servers may be temporarily down.


so I guess they are in overload mode again

jesse1919
Joined: 1 Jul 10
Posts: 8
Credit: 2,680,869
RAC: 0
Message 67893 - Posted: 1 Oct 2010, 5:45:40 UTC

I haven't been able to upload or download all day. Servers are all green but now
"TeraFLOPS estimate: 0.642"
That's not good. Hope they figure it out.


Greg_BE
Joined: 30 May 06
Posts: 4871
Credit: 3,741,243
RAC: 2,255
Message 67902 - Posted: 1 Oct 2010, 7:41:05 UTC - in response to Message 67893.  

When they go down for a while, everyone's program keeps trying to send information to the server. When the servers come back online, they get pounded by everyone's program trying to reach the server at once. So it says it can't reach the server, which is true, since it is overloaded. Just wait for the server to get unburied and your program will update as soon as it can get through.
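
A common way for clients to avoid pounding a recovering server all at once is to retry with randomized, exponentially growing delays. The Python sketch below shows that general technique ("full jitter" backoff) under made-up parameters; it is not the BOINC client's actual retry code, and the function name, base delay, and cap are assumptions for illustration.

```python
import random


def backoff_delays(attempts, base=60.0, cap=4 * 3600.0, rng=None):
    """Return one randomized retry delay (in seconds) per failed attempt."""
    rng = rng or random.Random()
    delays = []
    for attempt in range(attempts):
        # The ceiling doubles after each failure, up to a fixed cap.
        ceiling = min(cap, base * 2 ** attempt)
        # Picking a random point below the ceiling spreads clients out in time.
        delays.append(rng.uniform(0, ceiling))
    return delays


for n, delay in enumerate(backoff_delays(6, rng=random.Random(42)), start=1):
    print(f"retry {n}: wait {delay / 60:.1f} min")
```

Because every client draws its own random delay, the retries arrive spread out instead of in one wave, which gives an overloaded server room to work through the queue.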


I haven't been able to upload or download all day. Servers are all green but now
"TeraFLOPS estimate: 0.642"
That's not good. Hope they figure it out.



Chris Holvenstot
Joined: 2 May 10
Posts: 220
Credit: 9,106,918
RAC: 0
Message 67907 - Posted: 1 Oct 2010, 10:52:51 UTC

Greg -

I hate to be contrary (you believe that?) but I don't think that this is a simple case of servers being overloaded at this point - if it were you would see a few jobs squeaking through now and then.

Which does not seem to be the case - things appear to be locked up tighter than Fort Knox. I have not been able to report a completed task, finish an upload, or get a new unit of work since the servers started coming back up yesterday.

And judging from a few of the "stats" pages my closest competitors are in the same boat.

Further, the project's TeraFLOPS estimate has remained static at 0.642 during this time frame, another indication that nothing is getting through.

So me thinks the good folks out in Washington are still working on getting all of their systems and the network which connects them back up and functional - yes, I know that the "Server Status Board" is all "green" but what the heck, seeing is believing, right?

Have patience my friend, I'm sure that getting the project up and functional again is their top priority.

CH

mikey
Joined: 5 Jan 06
Posts: 1452
Credit: 5,791,113
RAC: 1,166
Message 67908 - Posted: 1 Oct 2010, 12:45:04 UTC - in response to Message 67907.  

Greg -

I hate to be contrary (you believe that?) but I don't think that this is a simple case of servers being overloaded at this point - if it were you would see a few jobs squeaking through now and then.

Which does not seem to be the case - things appear to be locked up tighter than Fort Knox. I have not been able to report a completed task, finish an upload, or get a new unit of work since the servers started coming back up yesterday.

And judging from a few of the "stats" pages my closest competitors are in the same boat.

Further, the project's TeraFLOPS estimate has remained static at 0.642 during this time frame, another indication that nothing is getting through.

So me thinks the good folks out in Washington are still working on getting all of their systems and the network which connects them back up and functional - yes, I know that the "Server Status Board" is all "green" but what the heck, seeing is believing, right?

Have patience my friend, I'm sure that getting the project up and functional again is their top priority.

CH


We talked about the 'server status' page on another project: it is only as good as the data given to it, so if the data is bad or nonexistent, the 'server status' page will be inaccurate.

©2020 University of Washington
http://www.bakerlab.org