Message boards : Number crunching : Welcome Back!
Previous · 1 · 2 · 3
Author | Message |
---|---|
r2d2 Send message Joined: 6 Jul 07 Posts: 1 Credit: 302,516 RAC: 0 |
9/9/2007 11:17:56 AM|rosetta@home|Sending scheduler request: Requested by user 9/9/2007 11:17:56 AM|rosetta@home|Requesting 347328 seconds of new work, and reporting 26 completed tasks 9/9/2007 11:18:01 AM|rosetta@home|Scheduler RPC succeeded 9/9/2007 11:18:01 AM|rosetta@home|Message from server: Project encountered internal error: shared memory 9/9/2007 11:18:01 AM|rosetta@home|Deferring communication for 1 hr 0 min 0 sec 9/9/2007 11:18:01 AM|rosetta@home|Reason: project is down |
Ian_D Send message Joined: 21 Sep 05 Posts: 55 Credit: 4,216,173 RAC: 0 |
|
BarryAZ Send message Joined: 27 Dec 05 Posts: 153 Credit: 30,843,285 RAC: 0 |
Nice catch, a rebuild might well have left some things in a rather 'untidy' state. Internal Memory problem from 2006 |
Ian_D Send message Joined: 21 Sep 05 Posts: 55 Credit: 4,216,173 RAC: 0 |
Cheers chap It could also be something like Project encountered internal error: shared memory - Maleria Control forum |
larry1186 Send message Joined: 18 Apr 06 Posts: 7 Credit: 329,257 RAC: 0 |
I'm getting "Can't open log file" and a "Project is down" so I assume we aren't out of the woods just yet... Well, I am happy to say that I have WUs that uploaded (even ones that were stuck halfway through an upload for days), reported, and new ones downloaded. I was expecting some sort of delay with the usual feeding frenzy after an outage, but none was to be found whatsoever. Thanks to the Rosetta crew for giving up your weekend to get things back on track. Here's to you! Don't get distracted by shiny objects. |
(_KoDAk_) Send message Joined: 18 Jul 06 Posts: 109 Credit: 1,859,263 RAC: 0 |
hurrah!, hurray! To day is real Welcome Back!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!! all work's fine. |
Sir Cracked of the Mind Send message Joined: 5 Apr 07 Posts: 2 Credit: 682,661 RAC: 0 |
Well done for completing the job over the weekend, we all apreciate your efforts greatly. IJ. Thanet |
MM Sihombing Send message Joined: 22 May 06 Posts: 15 Credit: 1,424,082 RAC: 0 |
9/10/2007 2:53:48 PM|rosetta@home|[file_xfer] Started download of file 1ig5A.fasta 9/10/2007 2:53:49 PM|rosetta@home|[file_xfer] Temporarily failed download of 1ig5A.fasta: file not found 9/10/2007 2:53:50 PM|rosetta@home|[file_xfer] Started download of file 1ig5A.fasta 9/10/2007 2:53:55 PM|rosetta@home|[file_xfer] Temporarily failed download of 1ig5A.fasta: file not found 9/10/2007 2:53:56 PM|rosetta@home|[file_xfer] Started download of file 1ig5A.fasta 9/10/2007 2:53:58 PM|rosetta@home|[file_xfer] Temporarily failed download of 1ig5A.fasta: file not found 9/10/2007 2:53:59 PM|rosetta@home|[file_xfer] Started download of file 1ig5A.fasta 9/10/2007 2:54:00 PM|rosetta@home|[file_xfer] Temporarily failed download of 1ig5A.fasta: file not found 9/10/2007 2:54:01 PM|rosetta@home|[file_xfer] Started download of file 1ig5A.fasta 9/10/2007 2:54:02 PM|rosetta@home|[file_xfer] Temporarily failed download of 1ig5A.fasta: file not found 9/10/2007 2:54:03 PM|rosetta@home|[file_xfer] Started download of file 1ig5A.fasta 9/10/2007 2:54:04 PM|rosetta@home|[file_xfer] Temporarily failed download of 1ig5A.fasta: file not found 9/10/2007 2:54:04 PM|rosetta@home|Backing off 52 min 37 sec on download of file 1ig5A.fasta |
Jmarks Send message Joined: 16 Jul 07 Posts: 132 Credit: 98,025 RAC: 0 |
9/10/2007 2:53:48 PM|rosetta@home|[file_xfer] Started download of file 1ig5A.fasta I saw this same error on another post and after they aborted the wu they started getting new wus again. Message 45895 - Posted 10 Sep 2007 4:13:55 UTC Last modified: 10 Sep 2007 4:14:28 UTC I've been getting this off a WU for the last few hours. I aborted it, and more work from R@H came in fine. Judging by the message, I assume it was deleted/removed on the R@H server. Hope this helps. Jmarks |
David E K Volunteer moderator Project administrator Project developer Project scientist Send message Joined: 1 Jul 05 Posts: 1480 Credit: 4,334,829 RAC: 0 |
There were a couple remaining permissions issues and a bunch of missing input files that should be available for download. I copied over the missing files from the ralph project that thankfully is not on the SAN. However, the ralph database server is on the SAN and was effected momentarily. There are still a lot of missing R@h files that may or may not be restored. New work units should be safe though. |
BarryAZ Send message Joined: 27 Dec 05 Posts: 153 Credit: 30,843,285 RAC: 0 |
Excellent -- and thanks for your efforts over the weekend!! There were a couple remaining permissions issues and a bunch of missing input files that should be available for download. I copied over the missing files from the ralph project that thankfully is not on the SAN. However, the ralph database server is on the SAN and was effected momentarily. There are still a lot of missing R@h files that may or may not be restored. New work units should be safe though. |
Cureseekers~Kristof Send message Joined: 5 Nov 05 Posts: 80 Credit: 689,603 RAC: 0 |
In our team I have had the question of some users, if the credits for the jobs that were returned too late due to the downtime will be rewarded? I know that the science is the most important, that (when seen over a very long period) is will have no effect, but still credits can be important for some users and for the organisation credits doesn't cost anything. So .... Member of Dutch Power Cows |
David E K Volunteer moderator Project administrator Project developer Project scientist Send message Joined: 1 Jul 05 Posts: 1480 Credit: 4,334,829 RAC: 0 |
see this post. |
Mod.Sense Volunteer moderator Send message Joined: 22 Aug 06 Posts: 4018 Credit: 0 RAC: 0 |
In our team I have had the question of some users, if the credits for the jobs that were returned too late due to the downtime will be rewarded? Quoting David E K (just what BakerLab needed, another David!) Valid results returned past the deadline have been granted the claimed credit. The maximum value possible is 300 so if you claimed over 300 you get 300. Rosetta Moderator: Mod.Sense |
Iconner Send message Joined: 30 Apr 06 Posts: 3 Credit: 148,980 RAC: 0 |
The Predictor of the day has not been updated since the server crash. |
Message boards :
Number crunching :
Welcome Back!
©2025 University of Washington
https://www.bakerlab.org