Mod.Sense

Message boards : Number crunching : Mod.Sense

To post messages, you must log in.

AuthorMessage
Profile Chris Holvenstot
Avatar

Send message
Joined: 2 May 10
Posts: 220
Credit: 9,106,918
RAC: 0
Message 70085 - Posted: 21 Apr 2011, 22:45:16 UTC

Could you please PM me when the system admins / devs decide to address all of the bad tasks which have been flowing into the system of late - such as validate errors after 10 minutes, compute errors behind the cs_frags.9mers.gz file (all with matching wingman results)

It has been over two weeks since the user community started reporting these problems and there has been no response from the project yet.

If they don't care, why should I?

Let me know when they have addressed the problem, until then I guess I'll just shut down the systems and save on my electric bill.

Thanks

ID: 70085 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
mikey
Avatar

Send message
Joined: 5 Jan 06
Posts: 1894
Credit: 8,777,997
RAC: 2,671
Message 70086 - Posted: 22 Apr 2011, 13:32:14 UTC - in response to Message 70085.  

Could you please PM me when the system admins / devs decide to address all of the bad tasks which have been flowing into the system of late - such as validate errors after 10 minutes, compute errors behind the cs_frags.9mers.gz file (all with matching wingman results)

It has been over two weeks since the user community started reporting these problems and there has been no response from the project yet.

If they don't care, why should I?

Let me know when they have addressed the problem, until then I guess I'll just shut down the systems and save on my electric bill.

Thanks


Hey Chris don't just stop crunching! There are PLENTY of other projects that can use your help!! Here is a link to alot of the different Distributed Computing projects, the Boinc ones are noted:
http://www.distributedcomputing.info/projects.html
ID: 70086 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Profile Greg_BE
Avatar

Send message
Joined: 30 May 06
Posts: 5664
Credit: 5,711,666
RAC: 764
Message 70089 - Posted: 22 Apr 2011, 21:28:04 UTC - in response to Message 70086.  

Could you please PM me when the system admins / devs decide to address all of the bad tasks which have been flowing into the system of late - such as validate errors after 10 minutes, compute errors behind the cs_frags.9mers.gz file (all with matching wingman results)

It has been over two weeks since the user community started reporting these problems and there has been no response from the project yet.

If they don't care, why should I?

Let me know when they have addressed the problem, until then I guess I'll just shut down the systems and save on my electric bill.

Thanks


Hey Chris don't just stop crunching! There are PLENTY of other projects that can use your help!! Here is a link to alot of the different Distributed Computing projects, the Boinc ones are noted:
http://www.distributedcomputing.info/projects.html


Mikey,Chris,

I am debating about pulling out of this project and going to look for something else. Already got Einstein (rarely has problems) and Milkyway (also problem free), so debating another space project or go looking for something else health related. Don't want to leave just yet after all these years, but 26 bad tasks in a week?!?!?! And no one pays attention?!?!? Kind of got the feeling they are to busy and understaffed to pay attention to posts here.

Btw, where is mod? Haven't seen anything by him in awhile.
ID: 70089 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Profile Chris Holvenstot
Avatar

Send message
Joined: 2 May 10
Posts: 220
Credit: 9,106,918
RAC: 0
Message 70094 - Posted: 23 Apr 2011, 5:09:58 UTC

Greg - Mod.Sense is still hanging around - and from my perspective he is a man of honor and dedication (gender assumed) who puts out a real effort.

I would not want to be in his shoes - been there, done that. it is difficult to try and tap dance and communicate status when you are getting no support from the back-end organization.

At the other end of the totem pole there is Dr Dave - and while his posts have been limited over the past few months I have the impression that he still makes an attempt to keep us up to date on the scientific direction of the project.

Dr Dave is a scientist and not an IT specialist - and as such can not really be expected to be down in the nuts and bolts of the system. I see the problem as being rooted squarely in the ranks of the

I see the problem as being at the feet of the developers and the sysadmins.

I am especially frustrated by the error opening cs_frags.9mers.gz - that problem is so consistent and frequent that it sold have been easy to nail - heck, anyone who has played the software game for any length of time knows that often the biggest challenge is being able to recreate the problem.

Before I withdrew form the project I spent a couple of hours trying to locate the local job queue where jobs are stored while they were in the ready to start status - the plan was to write a script to scan and purge the problem tasks before they started execution.

Unfortunately, nothing jumped out at me.
ID: 70094 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Profile Chris Holvenstot
Avatar

Send message
Joined: 2 May 10
Posts: 220
Credit: 9,106,918
RAC: 0
Message 70095 - Posted: 23 Apr 2011, 5:26:47 UTC

Greg / Mike -

After withdrawing from the project I looked at other BOINC projects but nothing caught my eye.

1. I would prefer medical research. I am not sure I see the value to society in finding new prime numbers or cracking an encryption system - but that's a personal preference.

2. I wanted a CPU project - my systems run "headless" over a VNC connection and their graphics adapters are small integrated ATI chips. I already have problems with the heat generated by 12 systems.

3, I wanted a project with an active, dynamic community. Due to a degenerative neuromuscular disease I am becoming more and more of a shut in - and I enjoy the technical banter and team spirit so much more than something like slashdot.

4. I wanted a project whose "news" section was kept up to date - unfortunately many of the projects are even worse than Rosetta in this area.

5. I fairly steady supply or work units.

6. I would have preferred that the project be US based - my only reason for this is that it is so much easier to be confident that the results are "in the public" domain - I may be fooling myself but I don't know the rules and regulations dealing with this when you select a project based overseas.

When I find something I will likely jump in - if not I will go out and purchase some monitors and donate the systems to some charity.

However, I will say that you guys have been a pleasure to crunch with over the past year and will be missed.

ID: 70095 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
P . P . L .

Send message
Joined: 20 Aug 06
Posts: 581
Credit: 4,865,274
RAC: 0
Message 70097 - Posted: 23 Apr 2011, 7:44:46 UTC
Last modified: 23 Apr 2011, 7:48:04 UTC

Hi Chris.

Since your looking for medical projects have a look here, many good

sub-projects run from the one place, and they rarely have problems.

W.C.G.

ps/ I've been there a while now very happy. ;)
ID: 70097 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Profile rochester new york
Avatar

Send message
Joined: 2 Jul 06
Posts: 2842
Credit: 2,020,043
RAC: 0
Message 70098 - Posted: 23 Apr 2011, 7:48:08 UTC - in response to Message 70095.  

Greg / Mike -

After withdrawing from the project I looked at other BOINC projects but nothing caught my eye.

1. I would prefer medical research. I am not sure I see the value to society in finding new prime numbers or cracking an encryption system - but that's a personal preference.

2. I wanted a CPU project - my systems run "headless" over a VNC connection and their graphics adapters are small integrated ATI chips. I already have problems with the heat generated by 12 systems.

3, I wanted a project with an active, dynamic community. Due to a degenerative neuromuscular disease I am becoming more and more of a shut in - and I enjoy the technical banter and team spirit so much more than something like slashdot.

4. I wanted a project whose "news" section was kept up to date - unfortunately many of the projects are even worse than Rosetta in this area.

5. I fairly steady supply or work units.

6. I would have preferred that the project be US based - my only reason for this is that it is so much easier to be confident that the results are "in the public" domain - I may be fooling myself but I don't know the rules and regulations dealing with this when you select a project based overseas.

When I find something I will likely jump in - if not I will go out and purchase some monitors and donate the systems to some charity.

However, I will say that you guys have been a pleasure to crunch with over the past year and will be missed.



no matter how it works out thanks for your contributions
ID: 70098 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Hank Barta

Send message
Joined: 6 Feb 11
Posts: 14
Credit: 3,943,460
RAC: 0
Message 70100 - Posted: 23 Apr 2011, 14:03:37 UTC

Hi Chris,
The lack of attention to this issue is highly irritating and I don't blame you at all for your decision to stop processing Rosetta. Here's what keeps me in:

1) I'm on the Overclockers team. We're presently #1 on RAC and trying to achieve #15 in total credits. And I'm helping.

2) The bad units are just a fraction of the total. As near as I can tell most of the WUs produce good results. The bad ones just add a bit to my bandwidth usage and a few seconds of compute time. Their actual impact is negligible for me. (I can see where it could be different for others.)

I hope the lack of attention to these details does not reflect the overall tenor of the project. It would be disappointing to know that all of the CPU cycles we've been donating are being squandered because the science end is handled similarly to these IT issues.

As others have mentioned, there are other DC projects that you could work on. I've tasked a video card with looking for messages for aliens and my PS3 is Folding@Home when I'm not watching movies or playing games.

At times looking for alien communications seems silly compared to something like F@H or Rosetta which are both more likely to produce tangible results. However if we ever do hear from ET, the results could be (hopefully not literally) earth shattering.
ID: 70100 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
mikey
Avatar

Send message
Joined: 5 Jan 06
Posts: 1894
Credit: 8,777,997
RAC: 2,671
Message 70101 - Posted: 23 Apr 2011, 14:36:56 UTC - in response to Message 70095.  

Greg / Mike -

After withdrawing from the project I looked at other BOINC projects but nothing caught my eye.

1. I would prefer medical research. I am not sure I see the value to society in finding new prime numbers or cracking an encryption system - but that's a personal preference.

2. I wanted a CPU project - my systems run "headless" over a VNC connection and their graphics adapters are small integrated ATI chips. I already have problems with the heat generated by 12 systems.

3, I wanted a project with an active, dynamic community. Due to a degenerative neuromuscular disease I am becoming more and more of a shut in - and I enjoy the technical banter and team spirit so much more than something like slashdot.

4. I wanted a project whose "news" section was kept up to date - unfortunately many of the projects are even worse than Rosetta in this area.

5. I fairly steady supply or work units.

6. I would have preferred that the project be US based - my only reason for this is that it is so much easier to be confident that the results are "in the public" domain - I may be fooling myself but I don't know the rules and regulations dealing with this when you select a project based overseas.

When I find something I will likely jump in - if not I will go out and purchase some monitors and donate the systems to some charity.

However, I will say that you guys have been a pleasure to crunch with over the past year and will be missed.



Malariacontrol.net is NOT US based but IS making a difference in the area of medicine. As stated so does WCG, there are over half a dozen projects to chose from and the project is supported by IBM. Do you remember the Jeopardy show where the two biggest winning guys came into compete against the computer? They donated A TON of money to WCG as the computer won! WCG can be found here:
http://www.worldcommunitygrid.org/ While Malariacontrol can be found here:
[url[http://malariacontrol.net/[/url]

I can see why you, as a former programmer for Nasa, a simple problem that is not being addressed is a problem for you, and I TOTALLY agree! A long time ago I tried running Rosetta on some Windows pc's and just could not, it just would not download for me. I ended up getting all of my Rosie points from using Linux machines instead. It can be VERY frustrating when a problem just will not be fixed by a project that has such potential!
ID: 70101 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Mod.Sense
Volunteer moderator

Send message
Joined: 22 Aug 06
Posts: 4018
Credit: 0
RAC: 0
Message 70102 - Posted: 23 Apr 2011, 15:34:43 UTC

I've been away since you posted this, thanks for some of the compliments I caught on the quick scroll of the thread, but let me get caught up here and then see if we can address the issues you've brought up.
Rosetta Moderator: Mod.Sense
ID: 70102 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Mod.Sense
Volunteer moderator

Send message
Joined: 22 Aug 06
Posts: 4018
Credit: 0
RAC: 0
Message 70103 - Posted: 23 Apr 2011, 16:24:25 UTC
Last modified: 23 Apr 2011, 16:25:25 UTC

OK, that didn't take as long to catch up as I'd thought it might.

Yes, since the failing tasks fail almost immediately, unless your network time or bandwidth is limited, you should still be able to spend 99.5% of your electricity doing productive crunching on the other tasks. I don't mean to minimize the problem, it certainly makes it difficult to collect a week's work at a time, etc. But unless I'm mistaken, you should still be enjoying #5 fairly steady supply or work units, in spite of the problem.

If you do some day donate those headless machines... try looking for monitors that people are hanging on to trying to defer disposal costs, rather then getting new ones.

Alternate projects, I like WCG. Subprojects include AIDS, MS and cancer. I like to hope for clean energy too. IBM has been backing WCG (and aiding the researchers whose projects you see there) from the start, but also donated the Jeopardy win proceeds (or was it half of them?) to WCG (and another charity?)

I sent an EMail and James assures me he will address this missing file issue and apologizes for the trouble it's been causing everyone.
Rosetta Moderator: Mod.Sense
ID: 70103 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Profile Greg_BE
Avatar

Send message
Joined: 30 May 06
Posts: 5664
Credit: 5,711,666
RAC: 764
Message 70104 - Posted: 24 Apr 2011, 12:33:14 UTC

Thanks for emailing the responsible person.

Again the lack of communication by members of the team makes us crazy at times.
The lost work dropped my stats pretty bad. I don't call exact numbers, but it sent the line on a pretty severe downward plunge. Now that I got rid of those faulty tasks, it is starting to climb again.

But as was mentioned some time ago, the lack of communication is going to drive people away. This project had a lot more information flow back when I started. It has died off to almost nothing now.

Anyway..I will stick around for a bit and see if anything improves, or maybe reduce resource share and pick up some other project.
ID: 70104 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Profile rochester new york
Avatar

Send message
Joined: 2 Jul 06
Posts: 2842
Credit: 2,020,043
RAC: 0
Message 70107 - Posted: 24 Apr 2011, 17:41:44 UTC
Last modified: 24 Apr 2011, 18:03:04 UTC

A live chat board somewhere here on the Rosetta site would be nice especially if someone on the project team could be on it once every 2 or 3 weeks at certain times
ID: 70107 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote

Message boards : Number crunching : Mod.Sense



©2024 University of Washington
https://www.bakerlab.org