New "shared memory" problem

Message boards : Number crunching : New "shared memory" problem

googloo
Joined: 15 Sep 06
Posts: 133
Credit: 21,662,970
RAC: 4,382
Message 46139 - Posted: 13 Sep 2007, 19:33:32 UTC

9/13/2007 3:32:26 PM|rosetta@home|Sending scheduler request: Requested by user
9/13/2007 3:32:26 PM|rosetta@home|Reporting 2 tasks
9/13/2007 3:32:31 PM|rosetta@home|Scheduler RPC succeeded
9/13/2007 3:32:31 PM|rosetta@home|Message from server: Project encountered internal error: shared memory
9/13/2007 3:32:31 PM|rosetta@home|Deferring communication for 1 hr 0 min 0 sec
9/13/2007 3:32:31 PM|rosetta@home|Reason: project is down

Brad Meide
Joined: 26 Nov 05
Posts: 3
Credit: 147,909
RAC: 0
Message 46140 - Posted: 13 Sep 2007, 19:40:31 UTC - in response to Message 46139.  

I'm seeing the same message when sending completed work:

9/13/2007 2:32:32 PM|rosetta@home|Sending scheduler request: To report completed tasks
9/13/2007 2:32:32 PM|rosetta@home|Reporting 1 tasks
9/13/2007 2:32:38 PM|rosetta@home|Scheduler RPC succeeded
9/13/2007 2:32:38 PM|rosetta@home|Message from server: Project encountered internal error: shared memory
9/13/2007 2:32:38 PM|rosetta@home|Deferring communication for 1 hr 0 min 0 sec
9/13/2007 2:32:38 PM|rosetta@home|Reason: project is down

The Grinch

Joined: 29 Mar 07
Posts: 3
Credit: 3,622,517
RAC: 0
Message 46528 - Posted: 18 Sep 2007, 15:55:14 UTC

Same problem now:

18.09.2007 17:51:54|rosetta@home|Sending scheduler request: Requested by user
18.09.2007 17:51:54|rosetta@home|Reporting 1 tasks
18.09.2007 17:51:59|rosetta@home|Scheduler RPC succeeded
18.09.2007 17:51:59|rosetta@home|Message from server: Project encountered internal error: shared memory
18.09.2007 17:51:59|rosetta@home|Deferring communication for 1 hr 0 min 0 sec
18.09.2007 17:51:59|rosetta@home|Reason: project is down

What's up?
BarryAZ

Joined: 27 Dec 05
Posts: 153
Credit: 30,843,285
RAC: 52
Message 46530 - Posted: 18 Sep 2007, 16:05:01 UTC - in response to Message 46528.  

Possibly two different issues. One would be a server configuration issue (similar to a problem that showed up toward the end of the last outage). The second might be the new 5.80 application, which apparently included a bunch of bad work units; the 5.80 beta may simply need to be pulled back.




David E K
Volunteer moderator
Project administrator
Project developer
Project scientist

Joined: 1 Jul 05
Posts: 1018
Credit: 4,334,829
RAC: 0
Message 46546 - Posted: 18 Sep 2007, 19:23:24 UTC

Are people seeing the same "shared memory" errors now?
Alex Huxley
Joined: 15 Aug 06
Posts: 8
Credit: 6,034
RAC: 0
Message 46547 - Posted: 18 Sep 2007, 19:31:04 UTC - in response to Message 46546.  



Luckily I haven't had that problem as far as I am aware!

Greg_BE
Joined: 30 May 06
Posts: 5659
Credit: 5,691,837
RAC: 1,806
Message 46553 - Posted: 18 Sep 2007, 20:33:15 UTC - in response to Message 46547.  



I just got started on 5.80 WUs and had a CAPRI work unit that died, but not due to shared memory. Maybe this next batch will do it.
BarryAZ

Joined: 27 Dec 05
Posts: 153
Credit: 30,843,285
RAC: 52
Message 46554 - Posted: 18 Sep 2007, 20:46:47 UTC - in response to Message 46546.  



Those seem to have been cleared, though I see the server is about to go offline for 3 hours or so.


David E K
Volunteer moderator
Project administrator
Project developer
Project scientist

Joined: 1 Jul 05
Posts: 1018
Credit: 4,334,829
RAC: 0
Message 46557 - Posted: 18 Sep 2007, 20:51:00 UTC

Sounds like the shared memory issue has been fixed. The project is going to go offline soon so we can make a backup of the database, optimize the tables, and back up some project files. Hopefully the throughput will return to its normal level soon after.



©2024 University of Washington
https://www.bakerlab.org