Problems with web site

Message boards : Number crunching : Problems with web site

To post messages, you must log in.

Previous · 1 . . . 6 · 7 · 8 · 9 · 10 · 11 · 12 . . . 19 · Next

AuthorMessage
Profile MM Sihombing
Avatar

Send message
Joined: 22 May 06
Posts: 15
Credit: 1,424,082
RAC: 0
Message 58820 - Posted: 14 Jan 2009, 23:23:15 UTC

User of the day has been stuck for several days I think.
ID: 58820 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Profile Paul D. Buck

Send message
Joined: 17 Sep 05
Posts: 815
Credit: 1,812,737
RAC: 0
Message 58821 - Posted: 14 Jan 2009, 23:37:44 UTC - in response to Message 58818.  

And the error i had today is probably a harder one to isolate because it might not be easily reprodusable. Your server or my connection has to be in a special state so the boinc client thinks it\'s ok until it\'s to late (files already deleted from my client). It\'s not easy to get a two face commit to work properly or what it\'s called today.

Some wasted cputime but you can\'t get them all.


It is called Two Phase COmmit and actually it is not hard to make work at all ... it is just that the BOINC Developers probably did not think that it was really important to ensure that a proper two phase commit is needed.

By their lights, it isn\'t ... the actual science is, and has been loaded, so, the data that they care about has been moved to the server. The trivia of proper accounting of credit and things like that are not that important to them ...

As to the last statement ... well ... in relational databases, the two phase commit protocol is core to all activities ... including MySQL and SQL Server which are used by projects for BOINC ...
ID: 58821 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Kurre

Send message
Joined: 12 Apr 06
Posts: 9
Credit: 69,240
RAC: 0
Message 58837 - Posted: 15 Jan 2009, 14:50:07 UTC - in response to Message 58821.  

It is called Two Phase COmmit and actually it is not hard to make work at all ... it is just that the BOINC Developers probably did not think that it was really important to ensure that a proper two phase commit is needed.

By their lights, it isn\'t ... the actual science is, and has been loaded, so, the data that they care about has been moved to the server. The trivia of proper accounting of credit and things like that are not that important to them ...

As to the last statement ... well ... in relational databases, the two phase commit protocol is core to all activities ... including MySQL and SQL Server which are used by projects for BOINC ...[/quote]

My spelling is what it is :-) and yes you have a point there it\'s science.... And im ok with that as long as they get the results but i don\'t think they got them eather.

ID: 58837 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Profile Paul D. Buck

Send message
Joined: 17 Sep 05
Posts: 815
Credit: 1,812,737
RAC: 0
Message 58864 - Posted: 17 Jan 2009, 7:00:53 UTC - in response to Message 58837.  

My spelling is what it is :-) and yes you have a point there it\'s science.... And im ok with that as long as they get the results but i don\'t think they got them eather.


Um, I did not criticize your spelling or language ... or, I don\'t think I did ... if you got that impression I am sorry that I gave you that impression.

I was just trying to, with my teacher\'s hat on, explain the proper term and give the reference for those that might not understand what you were saying.

Anyway, I did not mean to cause offense ...
ID: 58864 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Profile Venturini Dario[VENETO]

Send message
Joined: 25 May 07
Posts: 22
Credit: 245,028
RAC: 0
Message 58866 - Posted: 17 Jan 2009, 9:00:58 UTC

It\'s not actually a problem with the site, but I just wanted to point out that the date of the news is wrong... it keeps saying \"2008\" :asd:
ID: 58866 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Kurre

Send message
Joined: 12 Apr 06
Posts: 9
Credit: 69,240
RAC: 0
Message 58871 - Posted: 17 Jan 2009, 13:58:14 UTC - in response to Message 58864.  

Anyway, I did not mean to cause offense ...[/quote]

It\'s ok Paul no hard fealings at all. Just tried to make a joke there but this isn\'t my native language so it\'s not always easy to get it right.
And i agree that it\'s easy to use the built in 2 phase comm. in the databases but it\'s a bit harder to get that done in a stateless app that i think boinc is especially if you have to build that support yourself. Havent built any apps that use http witch i suppose this app is using so i might be wrong.
But this discuss is probably a bit out of thopic here so i think i\'ll keep a low profile here after this


ID: 58871 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Aegis Maelstrom

Send message
Joined: 29 Oct 08
Posts: 61
Credit: 2,076,012
RAC: 4,309
Message 58874 - Posted: 17 Jan 2009, 16:18:25 UTC
Last modified: 17 Jan 2009, 16:31:45 UTC

I can\'t connect to the website.

The server status seems to be O.K., however none of my 2 computers - in different locations, with different IPs - can connect to the server and get a new work unit.
According to the Rosetta server my 2nd laptop managed to upload data and report the success, however it can\'t get any new WUs.

On the first laptop I get as following:

2009-01-17 17:13:37|rosetta@home|Sending scheduler request: Requested by user. Requesting 0 seconds of work, reporting 1 completed tasks
2009-01-17 17:13:59||Project communication failed: attempting access to reference site
2009-01-17 17:14:01||Internet access OK - project servers may be temporarily down.
2009-01-17 17:14:02|rosetta@home|Scheduler request failed: Couldn\'t connect to server


I have no problem with the Internet access - I can even write this bug report down. :D

The same issue was reported by my teammate a couple hours ago.

Does anyone have the same problem?


EDIT: an interesting issue - my BOINC Client repeated its attempt, however the second time it requested... 0 seconds of work! See:

2009-01-17 17:13:37|rosetta@home|Sending scheduler request: Requested by user. Requesting 0 seconds of work, reporting 1 completed tasks
2009-01-17 17:13:59||Project communication failed: attempting access to reference site
2009-01-17 17:14:01||Internet access OK - project servers may be temporarily down.
2009-01-17 17:14:02|rosetta@home|Scheduler request failed: Couldn\'t connect to server


Finally, after a couple of attempts, my client managed to report a completed WU, however it still requested 0 seconds of work so I got no new WUs.
2009-01-17 17:19:12|rosetta@home|Sending scheduler request: Requested by user. Requesting 0 seconds of work, reporting 1 completed tasks
2009-01-17 17:20:48|rosetta@home|Scheduler request succeeded: got 0 new tasks


In this way, I still get no new WUs.
I don\'t know where the problem is, but looking at my 2nd laptop I guess it won\'t get any new Rosetta WU unless the BOINC is at least restarted.
The strange thing is that the 2nd laptop still communicates with the server (the \"last seen\" date in my computers window is getting updated) but it doesn\'t take new WUs although it should.
ID: 58874 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Path7

Send message
Joined: 25 Aug 07
Posts: 128
Credit: 61,751
RAC: 0
Message 58875 - Posted: 17 Jan 2009, 16:46:57 UTC - in response to Message 58874.  
Last modified: 17 Jan 2009, 16:59:50 UTC

.........I can\'t connect to the website.......
.....................................................
Does anyone have the same problem?...................

Hello Aegis Maelstrom,

However the Server Status Page is looking OK.,
from (a quick link) to BOINC Online Schedulers I\'m getting unreachable.

Something odd going on.

Edit:
Some minutes later I\'m getting online from (the quick link) to BOINC Online Schedulers.
And than again unreachable.
Perhaps the servers are on a heavy load?

Have a nice day,
Path7.
ID: 58875 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Mod.Sense
Volunteer moderator
Project administrator

Send message
Joined: 22 Aug 06
Posts: 3550
Credit: 0
RAC: 0
Message 58880 - Posted: 17 Jan 2009, 18:47:47 UTC

Aegis, you don\'t need to do anything. BOINC will request work when it is ready for more. Your machine thinks it already has enough work on-board for other projects, and is trying to assure you meet the deadlines for that other work.

You can review this thread for more info. on how BOINC client decides to request work.
Rosetta Moderator: Mod.Sense
ID: 58880 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
ComfortablyNumb

Send message
Joined: 6 Jul 07
Posts: 8
Credit: 658,196
RAC: 0
Message 58883 - Posted: 17 Jan 2009, 19:18:15 UTC - in response to Message 58875.  
Last modified: 17 Jan 2009, 19:18:54 UTC

.........I can\'t connect to the website.......
.....................................................
Does anyone have the same problem?...................

Hello Aegis Maelstrom,

However the Server Status Page is looking OK.,
from (a quick link) to BOINC Online Schedulers I\'m getting unreachable.

Something odd going on.

Edit:
Some minutes later I\'m getting online from (the quick link) to BOINC Online Schedulers.
And than again unreachable.
Perhaps the servers are on a heavy load?

Have a nice day,
Path7.
I\'m getting the same thing.

1/17/2009 1:57:52 PM|rosetta@home|Sending scheduler request: To fetch work. Requesting 43202 seconds of work, reporting 0 completed tasks
1/17/2009 1:58:57 PM|rosetta@home|Scheduler request succeeded: got 0 new tasks
1/17/2009 2:03:02 PM|rosetta@home|Sending scheduler request: To fetch work. Requesting 43202 seconds of work, reporting 0 completed tasks
1/17/2009 2:03:24 PM||Project communication failed: attempting access to reference site
1/17/2009 2:03:25 PM||Access to reference site succeeded - project servers may be temporarily down.


I was able to report all completed wu\'s. I tried to reset the project, didn\'t work.
:shrug:
ID: 58883 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Mod.Sense
Volunteer moderator
Project administrator

Send message
Joined: 22 Aug 06
Posts: 3550
Credit: 0
RAC: 0
Message 58884 - Posted: 17 Jan 2009, 19:22:28 UTC

No, resetting the project would not resolve a problem that says the server is temporarily unavailable.

And if the servers are under heavy load, it will in fact worsen the problem, because you will have to download all of the files that make Rosetta work again.
Rosetta Moderator: Mod.Sense
ID: 58884 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Profile Greg_BE
Avatar

Send message
Joined: 30 May 06
Posts: 4871
Credit: 3,740,253
RAC: 2,321
Message 58885 - Posted: 17 Jan 2009, 19:35:01 UTC

who updates the RSS home page feeds?
it would be nice that when servers go down (comm problems or whatever) that someone would put a note out on the homepage saying that comm is down or whatever the problem is. it is annoying to have to guess what is happening or wonder if anyone is aware of the problem.
ID: 58885 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Mod.Sense
Volunteer moderator
Project administrator

Send message
Joined: 22 Aug 06
Posts: 3550
Credit: 0
RAC: 0
Message 58886 - Posted: 17 Jan 2009, 20:07:53 UTC

I do not have the ability to create RSS feeds :( Nor access to server info. to define, correct or detect a problem.
Rosetta Moderator: Mod.Sense
ID: 58886 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Profile Greg_BE
Avatar

Send message
Joined: 30 May 06
Posts: 4871
Credit: 3,740,253
RAC: 2,321
Message 58887 - Posted: 17 Jan 2009, 20:19:03 UTC - in response to Message 58886.  

I do not have the ability to create RSS feeds :( Nor access to server info. to define, correct or detect a problem.


now thats a shame and a black spot on the project.
you know as well as i do, that allot of us have been requesting more updated information/news to be posted by someone about outages, problems and if a solution is going to be worked on.

it was a damn shame that it took so long for any news from the last outage to be posted and then no word on what happened on the other one. it seems like you are the only one from the \"team\" or project that tries to tell us whats going on. to bad you couldn\'t get some sort of way to access the statuses or alert someone that there is a problem and then tell us what is going on if they won\'t.

so \'well done\' to you for at least wanting to get us some info. even though that is probably a bit outside your normal role as moderator.

now back to the Obama whistle stop tour video streams....
ID: 58887 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Mod.Sense
Volunteer moderator
Project administrator

Send message
Joined: 22 Aug 06
Posts: 3550
Credit: 0
RAC: 0
Message 58889 - Posted: 17 Jan 2009, 20:38:47 UTC
Last modified: 17 Jan 2009, 20:50:57 UTC

Thanks, I try to post what I can. But at this point, I can\'t presume the extent of the problem you have reported so far. So, can\'t really offer any advice other then to wait for BOINC\'s normal retries, and to be assured that handling of such problems is designed in to BOINC. It will recover on it\'s own with no intervention on your part.

BTW, it\'s not really a black spot on the project. It is just how BOINC is set up, the RSS feeds are administered by the same control panel that contols all of the other project aspects. And moderators do not need that level of access.
Rosetta Moderator: Mod.Sense
ID: 58889 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Profile Greg_BE
Avatar

Send message
Joined: 30 May 06
Posts: 4871
Credit: 3,740,253
RAC: 2,321
Message 58891 - Posted: 17 Jan 2009, 21:19:46 UTC - in response to Message 58889.  

Thanks, I try to post what I can. But at this point, I can\'t presume the extent of the problem you have reported so far. So, can\'t really offer any advice other then to wait for BOINC\'s normal retries, and to be assured that handling of such problems is designed in to BOINC. It will recover on it\'s own with no intervention on your part.

BTW, it\'s not really a black spot on the project. It is just how BOINC is set up, the RSS feeds are administered by the same control panel that contols all of the other project aspects. And moderators do not need that level of access.



ok, but what i am trying to point out is that in the aprox. 12 hrs that this problem has been going on, no one has reported anything on the homepage.
this is kind of disappointing.

on the other hand it seems that it has resolved itself somewhat. i was able to upload 2 tasks but there was a bit of a bubble in reporting them which resolved itself on the next comm. cycle.
ID: 58891 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Profile Michael J. R. O'Brien

Send message
Joined: 20 Dec 08
Posts: 2
Credit: 94,415
RAC: 0
Message 58893 - Posted: 17 Jan 2009, 22:03:38 UTC

I\'m having a similar problem. It has been about 18 hours since I have not been able to get any new WU\'s. I was getting temporarily down messages from the server, but those stopped. My computer is requesting 4 days worth of work and I\'m running only 3 of 4 WU\'s because the servers aren\'t giving me any more work. I updated preferences to have more work in my queue and despite boinc seeing that update and requesting more work, I\'m still getting none.
ID: 58893 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Profile Greg_BE
Avatar

Send message
Joined: 30 May 06
Posts: 4871
Credit: 3,740,253
RAC: 2,321
Message 58894 - Posted: 17 Jan 2009, 22:45:13 UTC

it is now 12 hours since the start of this problem and it looks like it is sporadic in nature. at around 10.50pm (utc +1) boinc tried to call home and got a troubled signal. then it retried 6 minutes later and got through but was not able to get any work despite requesting 56,000+ seconds of work.

and again no communications from the team.
ID: 58894 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Profile Michael J. R. O'Brien

Send message
Joined: 20 Dec 08
Posts: 2
Credit: 94,415
RAC: 0
Message 58896 - Posted: 17 Jan 2009, 23:55:07 UTC

Problem resolved. I just pulled down 4 more WU\'s by manually updating.
ID: 58896 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Profile Greg_BE
Avatar

Send message
Joined: 30 May 06
Posts: 4871
Credit: 3,740,253
RAC: 2,321
Message 58898 - Posted: 18 Jan 2009, 0:24:54 UTC

i don\'t get it. it is requesting 73 thousand plus seconds of work but downloading none. i guess i have enough work in my queue? if so why is it requesting so much work?
ID: 58898 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Previous · 1 . . . 6 · 7 · 8 · 9 · 10 · 11 · 12 . . . 19 · Next

Message boards : Number crunching : Problems with web site



©2020 University of Washington
http://www.bakerlab.org