prob connecting?

Message boards : Number crunching : prob connecting?

[B^S] Leprichon

Joined: 3 Jun 07
Posts: 3
Credit: 25,216
RAC: 0
Message 45839 - Posted: 9 Sep 2007, 16:14:57 UTC

Hi, I'm having a problem connecting to Rosetta with BOINC (it only shows the www address). I checked, and all the info is the same everywhere; all the Rosetta servers are up and appear to be sending out work. I just wondered if anyone else was having this problem?

Ian_D

Joined: 21 Sep 05
Posts: 55
Credit: 4,216,173
RAC: 0
Message 45843 - Posted: 9 Sep 2007, 16:29:07 UTC

Don't think anyone is getting work; uploading is OK now, but reporting and downloading still don't work after the SAN hardware/firmware debacle.

HTH



BarryAZ

Joined: 27 Dec 05
Posts: 153
Credit: 30,843,285
RAC: 0
Message 45846 - Posted: 9 Sep 2007, 16:34:36 UTC - in response to Message 45843.  

Right -- the sort of thing one encounters when a full rebuild is performed -- discovery of 'We need to do this, and oh, yes, we need to do that'. Given the severe nature of the failure, I suspect there is a bit of this 'discovery' still going on for the admins here.





Greg_BE

Joined: 30 May 06
Posts: 5691
Credit: 5,859,226
RAC: 0
Message 45849 - Posted: 9 Sep 2007, 16:52:43 UTC - in response to Message 45846.  

Uploading is still offline; BOINC Manager is trying to connect every hour but still comes back with errors.




Ian_D

Joined: 21 Sep 05
Posts: 55
Credit: 4,216,173
RAC: 0
Message 45852 - Posted: 9 Sep 2007, 16:56:56 UTC
Last modified: 9 Sep 2007, 16:59:25 UTC

Well, mine's uploaded everything it had to the server and is now waiting for the "internal error: shared memory" problem to be resolved so it can report and, no doubt, download.



Greg_BE

Joined: 30 May 06
Posts: 5691
Credit: 5,859,226
RAC: 0
Message 45853 - Posted: 9 Sep 2007, 17:01:21 UTC - in response to Message 45852.  
Last modified: 9 Sep 2007, 17:01:30 UTC

That's the error that is holding back my uploading and reporting.



Bill Hepburn

Joined: 18 Sep 05
Posts: 14
Credit: 14,873,472
RAC: 2,392
Message 45857 - Posted: 9 Sep 2007, 17:22:23 UTC

Remember that after an extended outage, almost every active host (almost 400,000 of them according to BOINCStats) will be asking for work, and reporting work, all at once ("exponential backoff" notwithstanding). You are competing for server resources with those other computers. Rosetta seems to have a relatively robust server farm and a very responsive admin team, but don't be surprised to see all sorts of strange error messages and delays for a day or so.
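
For anyone wondering what exponential backoff means in practice, here is a rough sketch in Python. It is not the BOINC client's actual code; the function name, the 60-second base, and the 4-hour cap are all made-up values for illustration. The idea is that each consecutive failure doubles the ceiling on the wait, and a random factor spreads hosts out so they don't all retry at the same instant.

```python
import random

def backoff_delay(failures, base=60.0, cap=4 * 3600.0, rng=random):
    """Return a randomized retry delay in seconds.

    `failures` is the number of consecutive failed attempts so far.
    `base` and `cap` are hypothetical values, not BOINC's real settings.
    """
    # Double the ceiling for every failure, but never exceed the cap.
    ceiling = min(cap, base * (2 ** failures))
    # Pick a random point in the upper half of the window so that
    # hosts that failed together do not all come back together.
    return rng.uniform(ceiling / 2, ceiling)

if __name__ == "__main__":
    for n in range(8):
        print(n, round(backoff_delay(n)))
```

Even with this kind of spreading, a few hundred thousand hosts that have all been backing off for days can still arrive in a fairly tight cluster once the servers answer again.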



Greg_BE

Joined: 30 May 06
Posts: 5691
Credit: 5,859,226
RAC: 0
Message 45862 - Posted: 9 Sep 2007, 18:27:43 UTC

9/9/2007 8:26:05 PM|rosetta@home|Sending scheduler request: Requested by user
9/9/2007 8:26:05 PM|rosetta@home|Requesting 439761 seconds of new work, and reporting 32 completed tasks
9/9/2007 8:26:10 PM|rosetta@home|Scheduler RPC succeeded
9/9/2007 8:26:10 PM|rosetta@home|Message from server: Project encountered internal error: shared memory
9/9/2007 8:26:10 PM|rosetta@home|Deferring communication for 1 hr 0 min 0 sec
9/9/2007 8:26:10 PM|rosetta@home|Reason: project is down

Looks to be more than just a lot of computers if it is reporting back that the project is down.

David Emigh

Joined: 13 Mar 06
Posts: 158
Credit: 417,178
RAC: 0
Message 45865 - Posted: 9 Sep 2007, 18:59:37 UTC

This is the error message I'm getting now...

9/9/2007 1:53:07 PM|rosetta@home|Sending scheduler request: Project initialization
9/9/2007 1:53:07 PM|rosetta@home|Requesting 1 seconds of new work
9/9/2007 1:53:12 PM|rosetta@home|Scheduler RPC succeeded
9/9/2007 1:53:12 PM|rosetta@home|Message from server: Project encountered internal error: shared memory
9/9/2007 1:53:12 PM|rosetta@home|Deferring communication for 1 hr 0 min 0 sec
9/9/2007 1:53:12 PM|rosetta@home|Reason: project is down

Rosie, Rosie, she's our gal,
If she can't do it, no one shall!

Greg_BE

Joined: 30 May 06
Posts: 5691
Credit: 5,859,226
RAC: 0
Message 45874 - Posted: 9 Sep 2007, 21:03:25 UTC

Check this out: you can upload the most recent finished task, but if you try to update the whole project, the server won't let you.

single task:
9/9/2007 10:05:16 PM|rosetta@home|Computation for task 1gidA_BOINC_MG_CHAINBREAK5_OLDLRSCORE_RNA_ABINITIO_RNA_CONTACT_RNA_LONG_RANGE_CONTACT_RNA_SASA-1gidA-_2064_27878_0 finished
9/9/2007 10:05:16 PM|rosetta@home|Starting 1gidA_BOINC_MG_CHAINBREAK5_OLDLRSCORE_RNA_ABINITIO_RNA_CONTACT_RNA_LONG_RANGE_CONTACT_RNA_SASA-1gidA-_2064_42845_0
9/9/2007 10:05:16 PM|rosetta@home|Starting task 1gidA_BOINC_MG_CHAINBREAK5_OLDLRSCORE_RNA_ABINITIO_RNA_CONTACT_RNA_LONG_RANGE_CONTACT_RNA_SASA-1gidA-_2064_42845_0 using rosetta_beta version 578
9/9/2007 10:05:19 PM|rosetta@home|[file_xfer] Started upload of file 1gidA_BOINC_MG_CHAINBREAK5_OLDLRSCORE_RNA_ABINITIO_RNA_CONTACT_RNA_LONG_RANGE_CONTACT_RNA_SASA-1gidA-_2064_27878_0_0
9/9/2007 10:05:21 PM|rosetta@home|[file_xfer] Finished upload of file 1gidA_BOINC_MG_CHAINBREAK5_OLDLRSCORE_RNA_ABINITIO_RNA_CONTACT_RNA_LONG_RANGE_CONTACT_RNA_SASA-1gidA-_2064_27878_0_0
9/9/2007 10:05:21 PM|rosetta@home|[file_xfer] Throughput 13777 bytes/sec

attempted project update:
9/9/2007 10:59:30 PM|rosetta@home|Sending scheduler request: Requested by user
9/9/2007 10:59:30 PM|rosetta@home|Requesting 457313 seconds of new work, and reporting 33 completed tasks
9/9/2007 10:59:35 PM|rosetta@home|Scheduler RPC succeeded
9/9/2007 10:59:35 PM|rosetta@home|Message from server: Project encountered internal error: shared memory
9/9/2007 10:59:35 PM|rosetta@home|Deferring communication for 1 hr 0 min 0 sec
9/9/2007 10:59:35 PM|rosetta@home|Reason: project is down
9/9/2007 10:59:35 PM|rosetta@home|Deferring communication for 3 hr 41 min 39 sec
9/9/2007 10:59:35 PM|rosetta@home|Reason: project is down
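
If you want to pull the deferral times out of a pasted log like the one above, something along these lines should work. It is only a sketch: it assumes the `timestamp|project|message` layout seen in these pasted lines, and `parse_line`, `deferral_seconds`, and `LogEntry` are names invented here, not part of BOINC.

```python
import re
from typing import NamedTuple, Optional

class LogEntry(NamedTuple):
    timestamp: str  # kept as text; the date format depends on the poster's locale
    project: str
    message: str

# Matches messages such as "Deferring communication for 3 hr 41 min 39 sec".
_DEFER = re.compile(r"Deferring communication for (\d+) hr (\d+) min (\d+) sec")

def parse_line(line: str) -> LogEntry:
    """Split one pasted log line into its three pipe-separated fields."""
    timestamp, project, message = line.strip().split("|", 2)
    return LogEntry(timestamp, project, message)

def deferral_seconds(entry: LogEntry) -> Optional[int]:
    """Return the deferral length in seconds, or None for other messages."""
    match = _DEFER.search(entry.message)
    if match is None:
        return None
    hours, minutes, seconds = (int(g) for g in match.groups())
    return hours * 3600 + minutes * 60 + seconds

if __name__ == "__main__":
    line = "9/9/2007 10:59:35 PM|rosetta@home|Deferring communication for 3 hr 41 min 39 sec"
    print(deferral_seconds(parse_line(line)))
```

Running it over the two deferral lines in the log above shows the wait going from 1 hr to 3 hr 41 min 39 sec within the same request.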


Ian_D

Joined: 21 Sep 05
Posts: 55
Credit: 4,216,173
RAC: 0
Message 45875 - Posted: 9 Sep 2007, 21:05:04 UTC
Last modified: 9 Sep 2007, 21:05:50 UTC

Really! I thought I'd said that earlier in this thread???

Here



Rob Jacob

Joined: 11 Aug 07
Posts: 9
Credit: 702,294
RAC: 0
Message 45880 - Posted: 9 Sep 2007, 23:23:00 UTC

Well, my work units uploaded, but I haven't gotten any new work yet.

Henry Huff

Joined: 31 May 06
Posts: 6
Credit: 2,298,502
RAC: 0
Message 45884 - Posted: 10 Sep 2007, 0:55:18 UTC
Last modified: 10 Sep 2007, 1:05:41 UTC

As of about 10 min ago, everything reported and downloaded on 2 of my dial-up machines and my cable machine. Things are working fine here.

[B^S] Leprichon

Joined: 3 Jun 07
Posts: 3
Credit: 25,216
RAC: 0
Message 45915 - Posted: 10 Sep 2007, 12:13:25 UTC

Yes, I just checked (8:10am EST) and Rosetta seems to be back up and running fine! Thanks, Rosetta admin team!



©2024 University of Washington
https://www.bakerlab.org