Posts by Ananas

21) Message boards : Number crunching : Couldn't resolve host name (Message 76592)
Posted 6 Apr 2014 by Ananas
Post:
Scheduler request failed: Couldn't resolve host name

I have seen this message several times lately, the second attempt usually works. Other projects are not affected.
22) Message boards : Number crunching : System Idle Process. (Message 76587)
Posted 5 Apr 2014 by Ananas
Post:
Here is one that did this again. After a core client restart it continued where it had stopped.
23) Message boards : Number crunching : System Idle Process. (Message 76578)
Posted 2 Apr 2014 by Ananas
Post:
It was getting a small CPU time figure though, 2% or 3% rather than the 25% you would normally see. I'll enable Rosetta again and keep a watch on it.

I had the same lately, BoincView showed a Rosetta result running, but with CPU efficiency 0% and when I checked the machine, I saw 25% idle in taskmanager (1 core out of 4) and no Rosetta activity.

Restarting the core client brought the result back to life and it finished without further problems.

p.s.: Pausing the result will help if "keep in memory" is disabled - but it's usually better to keep results in memory.
24) Message boards : Number crunching : Untypical ghost WU, probably name too long (Message 76549)
Posted 24 Mar 2014 by Ananas
Post:
It is not (or not only?) the name length, in Link's report can be seen that it complains about the two dots. Two consecutive dots might be seen as a security risk (but from my understanding that would only be the case next to a slash).

p.s.: Here we have it (client_types.cpp) :

        if (match_tag(buf, "</file_info>")) {
            if (!strlen(name)) return ERR_BAD_FILENAME;
            if (strstr(name, "..")) return ERR_BAD_FILENAME; <======
            if (strstr(name, "%")) return ERR_BAD_FILENAME;
            return 0;
        }

The core client doesn't like that.

A too long filename is a risk too though, especially since BOINC switched the default data directory on Windows to reside in the profile area. Drive letter + path + filename are limited to 256.
25) Message boards : Number crunching : Untypical ghost WU, probably name too long (Message 76540)
Posted 22 Mar 2014 by Ananas
Post:
A_31_AGBBA_B_18_AGBBA_6h_3l_i4_blue_1.82_10.22_93.74_66.67_0.00.Loop.Res.SurMod.inQQ..REsiSemi.ciL__fold_SAVE_ALL_OUT_149002_483

128 characters for the WU name, 2 x "no reply" already and I didn't receive it either - I guess you violated a max. filename limit there.

p.s.: my BOINC path is rather short ( D:BOINCprojectsboinc.bakerlab.org_rosetta for Rosetta), so it should not be a problem typical for my installation. The BOINC database allows 254 characters so that's not the limit either. Maybe some string in the BOINC sources that transports the result name or the base path on server side is too long for such a filename - or Apache doesn't handle that.


p.p.s.: I just saw a similar error report here, the problems are probably caused by the same thing
26) Message boards : Number crunching : Problems and Technical Issues with Rosetta@home (Message 76053)
Posted 19 Sep 2013 by Ananas
Post:
endo_ae__ results cause (and suffer from) BOINC heartbeat problems and they do not checkpoint properly on one of my boxes, my guess is that they have very high RAM requirements (my internet PC with only 2GB RAM, having Firefox nearly always running, one Rosetta task plus 3 projects with very low RAM requirements). They should probably be limited to boxes with more than 3GB physical RAM.

Unfortunately I could not catch/spy on one just before it crashed, so the RAM thing is only a guess. After the crash the RAM history is lost with the PID so I cannot check the maximum usage. Other result types seem not to be affected.
27) Message boards : Number crunching : Windows 2000 (Message 75345)
Posted 10 Apr 2013 by Ananas
Post:
Depending on the version of VC++ there might still be a setting to create binaries compatible with W2k (but it is not the default setting!) - but this option has been removed from the latest compiler versions.

Afaik. Yoyo lost the Win2k compatibility too not long ago for the wrapper of Yoyo@home and/or RNAWorld, when he switched to a newer VC++ version.

Not an issue for projects using Gnu compilers, just MS compilers are affected.

I wonder if there is a hack for making VC++ 10.x programs compatible - just by adding dummy functions for the missing ones. As long as they are unused, this would not matter at all - they would just be needed to satisfy the reference.
28) Message boards : Number crunching : Crappy result configuration (Message 75321)
Posted 5 Apr 2013 by Ananas
Post:
As this is just a configuration issue, not a single line of code to be written, it wouldn't take much time to fix it.

Oh, and I doubt that it is so very rare, I rather think that many people don't notice it as the client doesn't report the error.


The problems with the extra long running results giving extra low credits is much harder to fix but as it makes many people unhappy, they should work on it.
29) Message boards : Number crunching : Crappy result configuration (Message 75317)
Posted 4 Apr 2013 by Ananas
Post:
max # of error/total/success tasks 1, 2, 1

One wingman couldn't start, the other one aborted his result - gives me :

Validate error

Of course, counting aborted results as error results is most likely a bug implemented by a well known BOINC despot - and "couldn't start" is not really a workunit error either - so the best would be to set those limits much higher - but wait ... why did it even deliver the result a second and even a third time if the first one already had an error? One more ancient BOINC bug (oh well, some will call it "feature")

Do you even care about your volunteer crunchers? Then fix this bug and those in the old threads that get a refresh from new bug reports nearly everyday :-(
30) Message boards : Number crunching : TLUM15 task still running after 40hrs (Message 75295)
Posted 29 Mar 2013 by Ananas
Post:
I wonder if it's coincidence ... this result needed 2 restarts :

rb_03_18_37122_70435_t000__r5n_IGNORE_THE_REST_14_11_76809_16_1

it got stuck at decoy #25 and again at decoy #50 - desired runtime was reached before it started to work on decoy #75 but I bet it would have gotten stuck there once more.
31) Message boards : Number crunching : Client errors (Message 75037)
Posted 4 Feb 2013 by Ananas
Post:

I JUST dumped Rosetta off of one of my pc's as it REFUSED to send new work to my pc, Boinc kept saying "Not requesting tasks" even though I had NO cpu tasks on this 6 core pc! I am now crunching for Poem on that pc and it got 50 or more cpu units with NO problem!! There are WAAAY too many fish in the sea to waste time on one project that is being a PITA!!!!


Sounds more like a core client problem (bugs? in BOINC? say it's not so) than a scheduler issue.


I don't know I even reset the project and every other pc is working just fine on Rosetta, that one just didn't! It's okay Poem loves my time!

This definitely is a core client problem. It usually occurs when the cache is either really stuffed (high values for the first two options in "Network usage") or the other project has collected a really high "long term debit".
Unfortunately the BOINC GUI has no feature to reset those debits, the command line thingie can do it though, e.g. :

boinccmd.exe --host <YourComputerName> --set_debts http://boinc.fzk.de/poem/ 0 0

or

boinccmd.exe --host <YourComputerName> --set_debts http://boinc.bakerlab.org/rosetta/ 0 100000


The second value is the one to modify the long term debits but the command needs the one for short term as well, that's why you have to put both 0's
32) Message boards : Number crunching : Client errors (Message 75036)
Posted 4 Feb 2013 by Ananas
Post:
There might be something with the project options (not properly initialized?).

If the results fail again, you could try to set the value "Target CPU run time" on the project options

p.s.: got this idea comparing the output of valid vs. invalid result, the startup output looks different in some "options" output lines.
33) Message boards : Number crunching : Only 20 credits for 25,000 seconds (Message 75035)
Posted 4 Feb 2013 by Ananas
Post:
same here :-(

Not only bench*ignore... are affected, other series have the problem too :

rb_02_02_36194_68641__t000__0_C1_SAVE_ALL_OUT_IGNORE_THE_REST_73531_121_0

Common part is always Stream information inconsistent. in stderr and only one decoy in the result, plus this warning :

WARNING! cannot get file size for default.out.gz: could not open file. so it probably has actually generated nothing at all.

Unfortunately all those facts occur after the time has already been wasted so they cannot be used to abort the task before the calculation starts.

There is one thing that might help though :

OK:
Watchdog active.
Starting work on structure: _00001 <= *** difference ***
# cpu_run_time_pref: 28800


damaged:

Watchdog active.
# cpu_run_time_pref: 28800


so Starting work is missing completely quite close to the start already
34) Message boards : Number crunching : Only 20 credits for 25,000 seconds (Message 75034)
Posted 4 Feb 2013 by Ananas
Post:
...
35) Message boards : Number crunching : Only 20 credits for 25,000 seconds (Message 75030)
Posted 2 Feb 2013 by Ananas
Post:
Problem not solved, they are still on delivery. I'll abort all "bench*IGNORE_THE_REST" tasks before they start I guess.
36) Message boards : Number crunching : Team Points STOLEN!!!!! Over 6 Million Points! (Message 54540)
Posted 16 Jul 2008 by Ananas
Post:
The original founder of team 1601 is userid=45029

Here's the XML from 09_2007 :

<team>
 <id>1601</id>
 <type>2</type>
 <name>XM Satellite Radio</name>
 <userid>45029</userid>
 <total_credit>1453419.973322</total_credit>
 <expavg_credit>28212.372774</expavg_credit>
 <expavg_time>1172926323.971930</expavg_time>
  <founder_name>Test</founder_name>
 <create_time>1135978302</create_time>
 <url>www.xmradio.com</url>
<name_html>&lt;br>&lt;br>&lt;a href="http://www.xmradio.com" target="_blank">&lt;img src="http://www.multi-search-engine.com/xm_logo.gif" width="200" height="90" border="1" alt="XM Satellite Radio">&lt;/a>&lt;br>&lt;br></name_html>
<description>For clients and friends of America's #1 Satellite Radio Service - Over 150 Digital Channels, Featuring 100% Commercial-Free Music and the Best News, Sports, and Talk!</description>
 <country>United States</country>
</team>


The <create_time> equals : 12 / 30 / 05 @ 4:31pm and matches the founder's join date.

The same founder has already been in the XML from 06/2006

The wayback machine stores some binaries too - they are usually a bit damaged (probably a CR/LF issue) but most of the .gz files has been expandable.

Unfortunately the Wayback history ends there, that's the latest XML excerpt I could find.
37) Message boards : Number crunching : Team Points STOLEN!!!!! Over 6 Million Points! (Message 54483)
Posted 13 Jul 2008 by Ananas
Post:
Real name of Rosetta team 1601 is XM Satellite Radio, archive.org has a BOINCstats team list snapshot timestamped 20070203184935
38) Message boards : Number crunching : Team Points STOLEN!!!!! Over 6 Million Points! (Message 54463)
Posted 12 Jul 2008 by Ananas
Post:
It's team "Bevo" at SIMAP (team ID = 466) - and it has been returned to the righteous owner, the original members have been added back to the team, the original name has been restored.

MIZ did join team 466 of course, his account has not been manipulated in any way.
39) Message boards : Number crunching : Team Points STOLEN!!!!! Over 6 Million Points! (Message 54447)
Posted 12 Jul 2008 by Ananas
Post:
Found one more that had been missing in my list above :

At Leiden Classical he stole Czech Boinc Team
40) Message boards : Number crunching : Team Points STOLEN!!!!! Over 6 Million Points! (Message 54423)
Posted 10 Jul 2008 by Ananas
Post:
...

At Einstein : Western Michigan University Physics Club


Sorry, I couldn't find out all original team names, the web looses some things over the time. Maybe BOINCstats Willy could help.


Previous 20 · Next 20



©2022 University of Washington
https://www.bakerlab.org