Message boards : Number crunching : prob connecting?
Author | Message |
---|---|
[B^S] Leprichon Joined: 3 Jun 07 Posts: 3 Credit: 25,216 RAC: 0 |
Hi, I'm having a problem connecting to Rosetta with BOINC (it only shows the www address). I checked and all the info is the same everywhere, all the Rosetta servers are up, and it appears to be sending out work. I just wondered if anyone else was having this problem? |
Ian_D Joined: 21 Sep 05 Posts: 55 Credit: 4,216,173 RAC: 0 |
I don't think anyone is getting work. Uploading is OK now, but reporting and downloading still don't work after the SAN hardware/firmware debacle. HTH |
BarryAZ Joined: 27 Dec 05 Posts: 153 Credit: 30,843,285 RAC: 0 |
Right -- the sort of thing one encounters when a full rebuild is performed -- discovery of 'We need to do this, and oh, yes, we need to do that'. Given the severe nature of the failure, I suspect there is a bit of this 'discovery' still going on for the admins here. |
Greg_BE Joined: 30 May 06 Posts: 5691 Credit: 5,859,226 RAC: 0 |
Uploading is still offline. BOINC Manager is trying to connect every hour, but still comes back with errors. |
Ian_D Joined: 21 Sep 05 Posts: 55 Credit: 4,216,173 RAC: 0 |
Well, mine's uploaded everything it had to the server and is now waiting for the "internal error: shared memory" problem to be resolved so it can report and, no doubt, download. |
Greg_BE Joined: 30 May 06 Posts: 5691 Credit: 5,859,226 RAC: 0 |
That's the error that is holding back my uploading and reporting. |
Bill Hepburn Joined: 18 Sep 05 Posts: 14 Credit: 14,873,472 RAC: 2,392 |
Remember that after an extended outage, almost every active host (almost 400,000 of them according to BOINCStats) will be asking for work, and reporting work, all at once ("exponential backoff" notwithstanding). You are competing for program resources with those other computers. Rosetta seems to have a relatively robust server farm and a very responsive admin team, but don't be surprised to see all sorts of strange error messages and delays for a day or so. |
Greg_BE Joined: 30 May 06 Posts: 5691 Credit: 5,859,226 RAC: 0 |
9/9/2007 8:26:05 PM|rosetta@home|Sending scheduler request: Requested by user<br>9/9/2007 8:26:05 PM|rosetta@home|Requesting 439761 seconds of new work, and reporting 32 completed tasks<br>9/9/2007 8:26:10 PM|rosetta@home|Scheduler RPC succeeded<br>9/9/2007 8:26:10 PM|rosetta@home|Message from server: Project encountered internal error: shared memory<br>9/9/2007 8:26:10 PM|rosetta@home|Deferring communication for 1 hr 0 min 0 sec<br>9/9/2007 8:26:10 PM|rosetta@home|Reason: project is down<br>Looks to be more than just a lot of computers if it is reporting back "project is down". |
David Emigh Joined: 13 Mar 06 Posts: 158 Credit: 417,178 RAC: 0 |
This is the error message I'm getting now...<br>9/9/2007 1:53:07 PM|rosetta@home|Sending scheduler request: Project initialization<br>9/9/2007 1:53:07 PM|rosetta@home|Requesting 1 seconds of new work<br>9/9/2007 1:53:12 PM|rosetta@home|Scheduler RPC succeeded<br>9/9/2007 1:53:12 PM|rosetta@home|Message from server: Project encountered internal error: shared memory<br>9/9/2007 1:53:12 PM|rosetta@home|Deferring communication for 1 hr 0 min 0 sec<br>9/9/2007 1:53:12 PM|rosetta@home|Reason: project is down<br>Rosie, Rosie, she's our gal, If she can't do it, no one shall! |
Greg_BE Joined: 30 May 06 Posts: 5691 Credit: 5,859,226 RAC: 0 |
Check this out: you can upload the most recent task that is finished, but if you try to update your whole project the server won't let you.<br>Single task:<br>9/9/2007 10:05:16 PM|rosetta@home|Computation for task 1gidA_BOINC_MG_CHAINBREAK5_OLDLRSCORE_RNA_ABINITIO_RNA_CONTACT_RNA_LONG_RANGE_CONTACT_RNA_SASA-1gidA-_2064_27878_0 finished<br>9/9/2007 10:05:16 PM|rosetta@home|Starting 1gidA_BOINC_MG_CHAINBREAK5_OLDLRSCORE_RNA_ABINITIO_RNA_CONTACT_RNA_LONG_RANGE_CONTACT_RNA_SASA-1gidA-_2064_42845_0<br>9/9/2007 10:05:16 PM|rosetta@home|Starting task 1gidA_BOINC_MG_CHAINBREAK5_OLDLRSCORE_RNA_ABINITIO_RNA_CONTACT_RNA_LONG_RANGE_CONTACT_RNA_SASA-1gidA-_2064_42845_0 using rosetta_beta version 578<br>9/9/2007 10:05:19 PM|rosetta@home|[file_xfer] Started upload of file 1gidA_BOINC_MG_CHAINBREAK5_OLDLRSCORE_RNA_ABINITIO_RNA_CONTACT_RNA_LONG_RANGE_CONTACT_RNA_SASA-1gidA-_2064_27878_0_0<br>9/9/2007 10:05:21 PM|rosetta@home|[file_xfer] Finished upload of file 1gidA_BOINC_MG_CHAINBREAK5_OLDLRSCORE_RNA_ABINITIO_RNA_CONTACT_RNA_LONG_RANGE_CONTACT_RNA_SASA-1gidA-_2064_27878_0_0<br>9/9/2007 10:05:21 PM|rosetta@home|[file_xfer] Throughput 13777 bytes/sec<br>Attempted project update:<br>9/9/2007 10:59:30 PM|rosetta@home|Sending scheduler request: Requested by user<br>9/9/2007 10:59:30 PM|rosetta@home|Requesting 457313 seconds of new work, and reporting 33 completed tasks<br>9/9/2007 10:59:35 PM|rosetta@home|Scheduler RPC succeeded<br>9/9/2007 10:59:35 PM|rosetta@home|Message from server: Project encountered internal error: shared memory<br>9/9/2007 10:59:35 PM|rosetta@home|Deferring communication for 1 hr 0 min 0 sec<br>9/9/2007 10:59:35 PM|rosetta@home|Reason: project is down<br>9/9/2007 10:59:35 PM|rosetta@home|Deferring communication for 3 hr 41 min 39 sec<br>9/9/2007 10:59:35 PM|rosetta@home|Reason: project is down |
Rob Jacob Joined: 11 Aug 07 Posts: 9 Credit: 702,294 RAC: 0 |
Well, my work units uploaded, but I haven't gotten any new work yet. |
Henry Huff Joined: 31 May 06 Posts: 6 Credit: 2,298,502 RAC: 0 |
As of about 10 minutes ago, everything reported and downloaded on 2 of my dial-up machines and my cable machine. Things are working fine here. |
[B^S] Leprichon Joined: 3 Jun 07 Posts: 3 Credit: 25,216 RAC: 0 |
Yes, I just checked (8:10am EST) and Rosetta seems to be back up and running fine! Thanks, Rosetta admin team! |
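
For readers unfamiliar with the retry behaviour Bill Hepburn mentions (usually called exponential backoff), here is a minimal Python sketch of the general idea. It is not BOINC's actual client logic: the function names `backoff_delay` and `schedule`, the 60-second base, and the 4-hour cap are all assumptions chosen for illustration. Each failed attempt doubles the upper bound on the wait, and a random delay is drawn below that bound so that hosts coming back after an outage do not all retry at the same moment.

```python
import random


def backoff_delay(attempt, base=60.0, cap=4 * 3600.0, rng=random):
    """Return a wait in seconds before retry number `attempt` (0-based).

    `base` and `cap` are illustrative values, not BOINC settings.
    """
    # The ceiling doubles with every failed attempt, up to `cap`.
    ceiling = min(cap, base * (2 ** attempt))
    # Draw uniformly below the ceiling so many hosts spread out in time.
    return rng.uniform(0.0, ceiling)


def schedule(attempts, seed=None):
    """Return the delays for `attempts` consecutive failures."""
    rng = random.Random(seed)
    return [backoff_delay(n, rng=rng) for n in range(attempts)]


if __name__ == "__main__":
    for n, delay in enumerate(schedule(8, seed=1)):
        print(f"attempt {n}: wait up to {min(4 * 3600.0, 60.0 * 2 ** n):.0f} s, drew {delay:.0f} s")
```

Even with randomised delays like these, several hundred thousand hosts returning after a long outage can still keep a server busy for some time, which is consistent with the delays reported in the thread.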
©2024 University of Washington
https://www.bakerlab.org