Welcome Back!

Message boards : Number crunching : Welcome Back!

To post messages, you must log in.

Previous · 1 · 2 · 3

AuthorMessage
r2d2

Send message
Joined: 6 Jul 07
Posts: 1
Credit: 302,516
RAC: 0
Message 45840 - Posted: 9 Sep 2007, 16:20:45 UTC - in response to Message 45838.  

9/9/2007 11:17:56 AM|rosetta@home|Sending scheduler request: Requested by user
9/9/2007 11:17:56 AM|rosetta@home|Requesting 347328 seconds of new work, and reporting 26 completed tasks
9/9/2007 11:18:01 AM|rosetta@home|Scheduler RPC succeeded
9/9/2007 11:18:01 AM|rosetta@home|Message from server: Project encountered internal error: shared memory
9/9/2007 11:18:01 AM|rosetta@home|Deferring communication for 1 hr 0 min 0 sec
9/9/2007 11:18:01 AM|rosetta@home|Reason: project is down
ID: 45840 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Profile Ian_D

Send message
Joined: 21 Sep 05
Posts: 55
Credit: 4,216,173
RAC: 0
Message 45841 - Posted: 9 Sep 2007, 16:23:22 UTC
Last modified: 9 Sep 2007, 16:26:06 UTC

Internal Memory problem from 2006

Anything to do with this ?


ID: 45841 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
BarryAZ

Send message
Joined: 27 Dec 05
Posts: 153
Credit: 30,156,731
RAC: 0
Message 45847 - Posted: 9 Sep 2007, 16:35:55 UTC - in response to Message 45841.  

Nice catch, a rebuild might well have left some things in a rather \'untidy\' state.



Internal Memory problem from 2006

Anything to do with this ?


ID: 45847 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Profile Ian_D

Send message
Joined: 21 Sep 05
Posts: 55
Credit: 4,216,173
RAC: 0
Message 45848 - Posted: 9 Sep 2007, 16:43:18 UTC

ID: 45848 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
larry1186

Send message
Joined: 18 Apr 06
Posts: 7
Credit: 329,257
RAC: 0
Message 45896 - Posted: 10 Sep 2007, 4:41:38 UTC - in response to Message 45782.  

I\'m getting \"Can\'t open log file\" and a \"Project is down\" so I assume we aren\'t out of the woods just yet...


Well, I am happy to say that I have WUs that uploaded (even ones that were stuck halfway through an upload for days), reported, and new ones downloaded. I was expecting some sort of delay with the usual feeding frenzy after an outage, but none was to be found whatsoever.

Thanks to the Rosetta crew for giving up your weekend to get things back on track.

Here\'s to you!
Don't get distracted by shiny objects.
ID: 45896 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Profile (_KoDAk_)

Send message
Joined: 18 Jul 06
Posts: 109
Credit: 1,859,263
RAC: 0
Message 45897 - Posted: 10 Sep 2007, 5:04:47 UTC
Last modified: 10 Sep 2007, 5:06:02 UTC

hurrah!, hurray!
To day is real
Welcome Back!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!
all work\'s fine.

ID: 45897 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Profile Sir Cracked of the Mind

Send message
Joined: 5 Apr 07
Posts: 2
Credit: 248,559
RAC: 552
Message 45904 - Posted: 10 Sep 2007, 7:34:26 UTC

Well done for completing the job over the weekend, we all apreciate your efforts greatly.


IJ.
Thanet
ID: 45904 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Profile MM Sihombing
Avatar

Send message
Joined: 22 May 06
Posts: 15
Credit: 1,424,082
RAC: 0
Message 45905 - Posted: 10 Sep 2007, 8:11:50 UTC

9/10/2007 2:53:48 PM|rosetta@home|[file_xfer] Started download of file 1ig5A.fasta
9/10/2007 2:53:49 PM|rosetta@home|[file_xfer] Temporarily failed download of 1ig5A.fasta: file not found
9/10/2007 2:53:50 PM|rosetta@home|[file_xfer] Started download of file 1ig5A.fasta
9/10/2007 2:53:55 PM|rosetta@home|[file_xfer] Temporarily failed download of 1ig5A.fasta: file not found
9/10/2007 2:53:56 PM|rosetta@home|[file_xfer] Started download of file 1ig5A.fasta
9/10/2007 2:53:58 PM|rosetta@home|[file_xfer] Temporarily failed download of 1ig5A.fasta: file not found
9/10/2007 2:53:59 PM|rosetta@home|[file_xfer] Started download of file 1ig5A.fasta
9/10/2007 2:54:00 PM|rosetta@home|[file_xfer] Temporarily failed download of 1ig5A.fasta: file not found
9/10/2007 2:54:01 PM|rosetta@home|[file_xfer] Started download of file 1ig5A.fasta
9/10/2007 2:54:02 PM|rosetta@home|[file_xfer] Temporarily failed download of 1ig5A.fasta: file not found
9/10/2007 2:54:03 PM|rosetta@home|[file_xfer] Started download of file 1ig5A.fasta
9/10/2007 2:54:04 PM|rosetta@home|[file_xfer] Temporarily failed download of 1ig5A.fasta: file not found
9/10/2007 2:54:04 PM|rosetta@home|Backing off 52 min 37 sec on download of file 1ig5A.fasta

ID: 45905 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Profile Jmarks
Avatar

Send message
Joined: 16 Jul 07
Posts: 132
Credit: 98,025
RAC: 0
Message 45914 - Posted: 10 Sep 2007, 11:16:35 UTC - in response to Message 45905.  

9/10/2007 2:53:48 PM|rosetta@home|[file_xfer] Started download of file 1ig5A.fasta
9/10/2007 2:53:49 PM|rosetta@home|[file_xfer] Temporarily failed download of 1ig5A.fasta: file not found
9/10/2007 2:53:50 PM|rosetta@home|[file_xfer] Started download of file 1ig5A.fasta
9/10/2007 2:53:55 PM|rosetta@home|[file_xfer] Temporarily failed download of 1ig5A.fasta: file not found
9/10/2007 2:53:56 PM|rosetta@home|[file_xfer] Started download of file 1ig5A.fasta
9/10/2007 2:53:58 PM|rosetta@home|[file_xfer] Temporarily failed download of 1ig5A.fasta: file not found
9/10/2007 2:53:59 PM|rosetta@home|[file_xfer] Started download of file 1ig5A.fasta
9/10/2007 2:54:00 PM|rosetta@home|[file_xfer] Temporarily failed download of 1ig5A.fasta: file not found
9/10/2007 2:54:01 PM|rosetta@home|[file_xfer] Started download of file 1ig5A.fasta
9/10/2007 2:54:02 PM|rosetta@home|[file_xfer] Temporarily failed download of 1ig5A.fasta: file not found
9/10/2007 2:54:03 PM|rosetta@home|[file_xfer] Started download of file 1ig5A.fasta
9/10/2007 2:54:04 PM|rosetta@home|[file_xfer] Temporarily failed download of 1ig5A.fasta: file not found
9/10/2007 2:54:04 PM|rosetta@home|Backing off 52 min 37 sec on download of file 1ig5A.fasta


I saw this same error on another post and after they aborted the wu they started getting new wus again.

Message 45895 - Posted 10 Sep 2007 4:13:55 UTC
Last modified: 10 Sep 2007 4:14:28 UTC
I\'ve been getting this off a WU for the last few hours. I aborted it, and more work from R@H came in fine. Judging by the message, I assume it was deleted/removed on the R@H server.

Hope this helps.
Jmarks
ID: 45914 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Profile David E K
Volunteer moderator
Project administrator
Project developer
Project scientist

Send message
Joined: 1 Jul 05
Posts: 1016
Credit: 3,979,141
RAC: 51
Message 45966 - Posted: 10 Sep 2007, 23:59:27 UTC

There were a couple remaining permissions issues and a bunch of missing input files that should be available for download. I copied over the missing files from the ralph project that thankfully is not on the SAN. However, the ralph database server is on the SAN and was effected momentarily. There are still a lot of missing R@h files that may or may not be restored. New work units should be safe though.
ID: 45966 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
BarryAZ

Send message
Joined: 27 Dec 05
Posts: 153
Credit: 30,156,731
RAC: 0
Message 45971 - Posted: 11 Sep 2007, 1:15:31 UTC - in response to Message 45966.  

Excellent -- and thanks for your efforts over the weekend!!


There were a couple remaining permissions issues and a bunch of missing input files that should be available for download. I copied over the missing files from the ralph project that thankfully is not on the SAN. However, the ralph database server is on the SAN and was effected momentarily. There are still a lot of missing R@h files that may or may not be restored. New work units should be safe though.


ID: 45971 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Profile Cureseekers~Kristof

Send message
Joined: 5 Nov 05
Posts: 80
Credit: 689,603
RAC: 0
Message 45984 - Posted: 11 Sep 2007, 7:07:50 UTC

In our team I have had the question of some users, if the credits for the jobs that were returned too late due to the downtime will be rewarded?

I know that the science is the most important, that (when seen over a very long period) is will have no effect, but still credits can be important for some users and for the organisation credits doesn\'t cost anything. So ....
Member of Dutch Power Cows
ID: 45984 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Profile David E K
Volunteer moderator
Project administrator
Project developer
Project scientist

Send message
Joined: 1 Jul 05
Posts: 1016
Credit: 3,979,141
RAC: 51
Message 46077 - Posted: 12 Sep 2007, 17:31:33 UTC

ID: 46077 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Mod.Sense
Volunteer moderator
Project administrator

Send message
Joined: 22 Aug 06
Posts: 3550
Credit: 0
RAC: 0
Message 46104 - Posted: 13 Sep 2007, 0:23:02 UTC - in response to Message 45984.  

In our team I have had the question of some users, if the credits for the jobs that were returned too late due to the downtime will be rewarded?

I know that the science is the most important, that (when seen over a very long period) is will have no effect, but still credits can be important for some users and for the organisation credits doesn\'t cost anything. So ....


Quoting David E K (just what BakerLab needed, another David!)

Valid results returned past the deadline have been granted the claimed credit. The maximum value possible is 300 so if you claimed over 300 you get 300.

Rosetta Moderator: Mod.Sense
ID: 46104 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Iconner

Send message
Joined: 30 Apr 06
Posts: 3
Credit: 148,980
RAC: 0
Message 46105 - Posted: 13 Sep 2007, 0:41:48 UTC

The Predictor of the day has not been updated since the server crash.
ID: 46105 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Previous · 1 · 2 · 3

Message boards : Number crunching : Welcome Back!



©2020 University of Washington
http://www.bakerlab.org