Posts by Marky-UK

21) Message boards : Number crunching : RSS feeds broken again (Message 37904)
Posted 17 Mar 2007 by Marky-UK
Post:
The 'latest' three news items in the RSS feed are all dated 31 Dec 1969.


PS. Can you fix RALPH's RSS feed too. I posted on their forum but no one has done anything. RALPH's RSS feed has invalid dates and missing guid elements.
22) Message boards : Number crunching : Predictor of the day (Message 37673)
Posted 11 Mar 2007 by Marky-UK
Post:
Think there may be a slight glitch in the scripts - the user Charles has predicted the lowest energy structure for the same workunit 3 days in a row now.
23) Message boards : Number crunching : Predictor of the day (Message 37500)
Posted 5 Mar 2007 by Marky-UK
Post:
Thanks for getting POTD going again.

There's an error in the POTD RSS feed, caused by the '&' in the current POTD's team name.

http://validator.w3.org/feed/check.cgi?url=http%3A%2F%2Fboinc.bakerlab.org%2Frosetta%2Frah_rss_potd.php


Thanks for the heads-up. I just fixed the offending invalid XML, and the W3 validator appears to be happy.

Cool, thanks. I only spotted it because RALPH's RSS feed is badly broken (I posted about it in RALPH's Feedback forum) and I was checking to see if Rosetta's RSS feeds had the same problem.
24) Message boards : Number crunching : Predictor of the day (Message 37463)
Posted 5 Mar 2007 by Marky-UK
Post:
Thanks for getting POTD going again.

There's an error in the POTD RSS feed, caused by the '&' in the current POTD's team name.

http://validator.w3.org/feed/check.cgi?url=http%3A%2F%2Fboinc.bakerlab.org%2Frosetta%2Frah_rss_potd.php
25) Message boards : Number crunching : controlling a linux client (Message 36998)
Posted 20 Feb 2007 by Marky-UK
Post:
VNC has been implemnented by several different groups, some commercial, some free, and at least one is available in a paid for ('enterprise') version as well as a free version

Which version of VNC were you using at the windows end?

I was using the free UltraVNC client, but I'd expect the free RealVNC client to work too (I can try that later if someone wants me to check).
26) Message boards : Number crunching : controlling a linux client (Message 36987)
Posted 20 Feb 2007 by Marky-UK
Post:
I have a Ubuntu 6.10 system, and as soon as I enabled Remote Desktop (or whatever it was called) on it and set a password, I could VNC to it fine. I didn't have to install any other software on the Ubuntu system to achieve this.
27) Message boards : Number crunching : Problems with Rosetta version 5.46 (Message 36948)
Posted 18 Feb 2007 by Marky-UK
Post:
Nope, it was running at the standard speed. Just for the heck of it though, I've now underclocked it 6% to see how it goes.


Has underclocking helped?

Looks like it - I don't think the host has failed on a WU since. I'll keep an eye on it though.
28) Message boards : Number crunching : Problems with Rosetta version 5.46 (Message 36898)
Posted 16 Feb 2007 by Marky-UK
Post:
Is your machine overclocked?

Nope, it was running at the standard speed. Just for the heck of it though, I've now underclocked it 6% to see how it goes.
29) Message boards : Number crunching : Rosetta accessing the Internet (Message 36876)
Posted 16 Feb 2007 by Marky-UK
Post:
I'd go with the debug information request too. One of my hosts is failing when running Rosetta WUs lately (see the 5.46 problem thread), and the Rosetta application accesses the internet right before a WU fails. I seem to remember something said months ago about applications only downloading symbol information when they need them to save bandwidth (possibly back when Rom was helping fix the Rosetta application).
30) Message boards : Number crunching : Problems with Rosetta version 5.46 (Message 36866)
Posted 16 Feb 2007 by Marky-UK
Post:
Have you tried to reset the project to see if it helps? Those workunits themself seem to be fine and if this happens all the time on a single host, my guess is that some files become corrupted. Another possibility is hardware problem though this can be ruled out if it does not have problem of running other programs.

Have tried that now, and that host is still failing - and on almost every WU now.

The same host now also fails to run the new Human Proteome Folding WUs from WGC, and that's Rosetta too. But every other project, including the others from WGC, run fine, so does any other bit of software I run on it.
31) Message boards : Number crunching : Problems with Rosetta version 5.46 (Message 36783)
Posted 14 Feb 2007 by Marky-UK
Post:
And MOST of the WUs on this Win/XP machine are now failing.

Grrr, so they are. I've set that host to "no new work" on Rosetta for now until the cause is found.
32) Message boards : Number crunching : Problems with Rosetta version 5.46 (Message 36758)
Posted 13 Feb 2007 by Marky-UK
Post:
What could be causing these compute errors? It's only happening on one of my hosts in the last few weeks.

http://boinc.bakerlab.org/rosetta/result.php?resultid=62506015
http://boinc.bakerlab.org/rosetta/result.php?resultid=62470017
http://boinc.bakerlab.org/rosetta/result.php?resultid=62378522
http://boinc.bakerlab.org/rosetta/result.php?resultid=62351637
http://boinc.bakerlab.org/rosetta/result.php?resultid=61390501

That host has been fine running Rosetta for ages.
33) Message boards : Number crunching : Workunit error - check skipped? (Message 36122)
Posted 4 Feb 2007 by Marky-UK
Post:
We set the max #s low because we like to keep the lifespan of work units to a minimum without having to decrease the delay bound (since user's have requested a longer delay bound). It does seem odd to us that the scheduler may send more results than the max # of total results though. It may help to start using the reliable_time scheduler option which attempts to send old results to reliable hosts after we update the server this week. Maybe with this option, we could increase the max #s.

I can understand wanting to avoid WUs erroring out many times, but what about results that get returned just 1 hour after the deadline? The current settings will mean a second result will already have been sent out, but it won't get any credit because the late result gets it.
34) Message boards : Number crunching : Workunit error - check skipped? (Message 36086)
Posted 4 Feb 2007 by Marky-UK
Post:
The first result errored, which should have killed the entire WU, but two more results were sent out after that. When you returned your result (assuming it was valid), you didn't get any credit because the max # of error results had been hit.

The time period is close to when the validator server failed so maybe that's why the extra two results were sent out.


I still think the settings for max # of error/total/success results is set too low on all WUs. Is a project admin going to respond to my points in this post?
35) Message boards : Number crunching : Workunit error - check skipped? (Message 35916)
Posted 1 Feb 2007 by Marky-UK
Post:
not the same problem, but look at this:
http://boinc.bakerlab.org/rosetta/workunit.php?wuid=51477081

again the same computer, I hope it isn't on my side though :(

I've got loads of WUs I've returned today that are all stuck in Pending state. The credit for my team hasn't increased at all since before 0800 UTC.
36) Message boards : Number crunching : Workunit error - check skipped? (Message 35913)
Posted 1 Feb 2007 by Marky-UK
Post:
I seem to have the same problem. It seems that I was the only one that did the workunit.

Workunit:
http://boinc.bakerlab.org/rosetta/workunit.php?wuid=50393160

The "max # of total results" (clicky) is set to 2 which was probably exceeded by the first two results that didn't get returned. I guess "No reply" results don't count as an error, which would have stopped the 3rd result being sent out - if that's the case, I'd think that the "max # of total results" is too low at 2.

The WU in Tom Philippart's post (WU 50835993) went funny because the second result was not returned by its deadline, so a 3rd copy was sent out. Before the 3rd was returned, the second result came back late and passed validation, the "max # of success results" was hit so the 3rd was rejected.

IMHO, if results returned after their deadline are accepted, the "max # of success results" must be higher than 1. And "max # of total results" probably needs to be higher than 2.
37) Message boards : Number crunching : Ralph is now giving out 5.44 application wus (Message 35675)
Posted 28 Jan 2007 by Marky-UK
Post:
Marky, I believe you are talking about the problem where the BOINC manager seems to lose contact with localhost? ...and all the tabs go blank? It seems this is a BOINC issue, and some of the later betas supposedly have a fix for that. So... no a new Rosetta version won't be expected to fix it, but "coming soon" from BOINC changes.

No it's not that, it's the problem where rosetta 5.43 is still running, BOINC thinks it's running, but rosetta isn't using any CPU time at all. There's a thread on here about it here. The solution mentioned in that thread is that the application needs to be compiled using the latest API.

The BOINC crashing problem is something else.
38) Message boards : Number crunching : Ralph is now giving out 5.44 application wus (Message 35480)
Posted 25 Jan 2007 by Marky-UK
Post:
Is 5.44 going to fix the problem of 5.43 constantly stalling on some hosts?
39) Message boards : Number crunching : RAC dropping, BOINC dropping comms (Message 35062)
Posted 19 Jan 2007 by Marky-UK
Post:
The finger points at the libCURL library at the moment. This was updated to the latest version in 5.8.3, but it's too early to say if this is the fix yet - I've had it running on some hosts for a couple of days so far without problems though.
40) Message boards : Number crunching : Negative Credits ? (Message 32582)
Posted 13 Dec 2006 by Marky-UK
Post:
It's not a bug in BOINCstats, it happens when the user's CPID keeps changing because they haven't got it to sync up between all their projects.

The user "Xanthochroid" has two CPIDs, the other one is here. The negative scores on one ID match the positive scores on the other.


Previous 20 · Next 20



©2024 University of Washington
https://www.bakerlab.org