Posts by Grutte Pier [Wa Oars]~MAB The Frisian

21) Message boards : Number crunching : Result was reported too late to validate ???????????? (Message 11847)
Posted 10 Mar 2006 by Grutte Pier [Wa Oars]~MAB The Frisian
Post:
http://boinc.bakerlab.org/rosetta/result.php?resultid=11593259

Would like to know how something like this is possible, while de computer is running 24/7 and connected all the time.
Do I have to check the queues myself everytime or what ?

The rest of the jobs in the queue are either already sent every day or can be send in on 16 march and so on. http://boinc.bakerlab.org/rosetta/show_host_detail.php?hostid=88145

Furthermore I'm using a program such as boincmanager but with the possibility to auto-abort "jobs approaching/passed deadline".

So I'm rather curious to know the reason for this ?


I'm getting a bit tired of all these ????? errors.
Hope our Stampede goes to another project otherwise one can expect a lot more problems
22) Message boards : Number crunching : Report stuck & aborted WU here please (Message 11812)
Posted 9 Mar 2006 by Grutte Pier [Wa Oars]~MAB The Frisian
Post:
http://boinc.bakerlab.org/rosetta/result.php?resultid=12412131

Couldn't find an explanation in Wiki but I may have overlooked it.
23) Message boards : Number crunching : Running Boinc as a Service (Message 11775)
Posted 8 Mar 2006 by Grutte Pier [Wa Oars]~MAB The Frisian
Post:
http://www.tomaxwell.com/boincsvcwinxp.htm
24) Message boards : Number crunching : Report Maximum CPU Time Exceeded WU HERE (Message 11527)
Posted 1 Mar 2006 by Grutte Pier [Wa Oars]~MAB The Frisian
Post:
Already any progress in granting credits for MCTE WU's ??
Or did I miss it somewhere ?



As was reported before, it will be AT LEAST mid-March before the project team can deal with the credit granting process for this class of WU failures, and maybe longer. They did say they would grant the credit in due course, but they are focused on fixing run time errors at this time. The cause of the Max time errors has been isolated and fixed so people should not see any more of them. But the credit granting process takes time.


Thanks. I'm not visiting these forums on a regularly base, so I must've missed.
25) Message boards : Number crunching : Report Maximum CPU Time Exceeded WU HERE (Message 11518)
Posted 1 Mar 2006 by Grutte Pier [Wa Oars]~MAB The Frisian
Post:
Already any progress in granting credits for MCTE WU's ??
Or did I miss it somewhere ?
26) Message boards : Number crunching : 27.70 credit granted from 10 (ten) hours work (Message 11517)
Posted 1 Mar 2006 by Grutte Pier [Wa Oars]~MAB The Frisian
Post:
http://boinc.bakerlab.org/rosetta/workunit.php?wuid=9429375

using AMD Athlon(tm) 64 Processor 3700+

What gives ?

You got the same amount of credits as a PIV 3GHz and it seems yours gave some kind of error ???
http://boinc.bakerlab.org/rosetta/result.php?resultid=12062151

<core_client_version>5.3.12.tx37</core_client_version>
<stderr_txt>
# random seed: 1632301
# cpu_run_time_pref: 36000
# DONE :: 1 starting structures built 101 (nstruct) times
# This process generated 101 decoys from 102 attempts

</stderr_txt>

Perhaps you can find some info here http://boinc-doc.net/boinc-wiki/index.php?title=Main_Page
27) Message boards : Number crunching : 600,000 second/165 Hour/7 day WU!!! (Message 10968)
Posted 19 Feb 2006 by Grutte Pier [Wa Oars]~MAB The Frisian
Post:
28) Message boards : Number crunching : 600,000 second/165 Hour/7 day WU!!! (Message 10963)
Posted 19 Feb 2006 by Grutte Pier [Wa Oars]~MAB The Frisian
Post:
It looks like a MCTE problem so I assume you could report it here http://boinc.bakerlab.org/rosetta/forum_thread.php?id=1008

Perhaps time for D.B. to report something about this problem ?
Credits or not ????
29) Message boards : Number crunching : Report stuck & aborted WU here please (Message 10914)
Posted 18 Feb 2006 by Grutte Pier [Wa Oars]~MAB The Frisian
Post:
Another possible cause is when the CPDN controlling process hadsm3_* is killed, leaving the worker process hadsm3um_* running. The Science Application (a.k.a. "worker") can only be killed using task manager or by a reboot.

I'm not using graphics at all running R@H.
And the other thing doesn't ring a bell to me.

Not running the Boinc screensaver? Hmm then it seems likely that some part of Rosetta isn't being killed when switching and causing the error. I wonder if this is part of the problems Ralph is looking to find? I don't know much about Rosettas' processes/app.

sorry

tony

No switching, only running R@H 24/7.
30) Message boards : Number crunching : Report stuck & aborted WU here please (Message 10903)
Posted 18 Feb 2006 by Grutte Pier [Wa Oars]~MAB The Frisian
Post:
31) Message boards : Number crunching : Report stuck & aborted WU here please (Message 10826)
Posted 16 Feb 2006 by Grutte Pier [Wa Oars]~MAB The Frisian
Post:
http://boinc.bakerlab.org/rosetta/result.php?resultid=10041959

??????????????????????
32) Message boards : Number crunching : Serverproblems ? (Message 10748)
Posted 14 Feb 2006 by Grutte Pier [Wa Oars]~MAB The Frisian
Post:
If so, it's 'getting annoying and therefor time to fix it.
During the time your computer is trying to connect, you can put on the kettle for a cup of tea/coffee, drink it, take nap and you'll still be in time.

Or are we having this problem in the Netherlands only ?
33) Message boards : Number crunching : Report stuck & aborted WU here please (Message 10493)
Posted 6 Feb 2006 by Grutte Pier [Wa Oars]~MAB The Frisian
Post:
These show "Unhandled Exception" :
http://boinc.bakerlab.org/rosetta/result.php?resultid=9272934
http://boinc.bakerlab.org/rosetta/result.php?resultid=8910039
http://boinc.bakerlab.org/rosetta/result.php?resultid=7076489

and these show "pending" which is something new ?
http://boinc.bakerlab.org/rosetta/result.php?resultid=7739133
http://boinc.bakerlab.org/rosetta/result.php?resultid=7738709
34) Message boards : Number crunching : Help us solve the 1% bug! (Message 9158)
Posted 16 Jan 2006 by Grutte Pier [Wa Oars]~MAB The Frisian
Post:
For these, unless you've uncovered a different problem, you can stop BOINC and restart it and the workunit will start over and should complete normally.

In this case (4 hour wasted) six of these a day and you'de better chose another project to prefent wasting idle time.
It's easier to use a program that will give an alarm or abort right away if the WU is still at 1% after 15 minutes and than abort instead of taking the trouble to follow the instructions.
35) Message boards : Number crunching : Help us solve the 1% bug! (Message 9152)
Posted 16 Jan 2006 by Grutte Pier [Wa Oars]~MAB The Frisian
Post:
http://boinc.bakerlab.org/rosetta/result.php?resultid=6477569

Aborted it after running for another hour.
36) Message boards : Number crunching : Help us solve the 1% bug! (Message 9146)
Posted 16 Jan 2006 by Grutte Pier [Wa Oars]~MAB The Frisian
Post:
A NO_SIM-ANNEAL_NO_BARCODE_2reb_243_286_0 is running for more than 2:25:xx now and still at 1 %.
What is the wright thing to do ?

EDIT : Just found the instructions below, so wil check that.

EDIT @ : Followed the instructions but no graphics (W2K), time is running and still at 1% at the moment.
When can I expect the 1% change ?

37) Message boards : Number crunching : Report stuck & aborted WU here please (Message 9043)
Posted 14 Jan 2006 by Grutte Pier [Wa Oars]~MAB The Frisian
Post:
http://boinc.bakerlab.org/rosetta/result.php?resultid=6869420

Never had it before.
Just a computer which couldn't flush (installed R@H last week) because of ZA and I've changed that so it could flush.
Now all of the WU's are like the above.
And couldn't upload any WU's this afternoon.
Reason can be ????


Update :
Just checked and the downloading seems to have been OK now.
Don't know about the other problem though.
38) Message boards : Number crunching : Report stuck & aborted WU here please (Message 9038)
Posted 14 Jan 2006 by Grutte Pier [Wa Oars]~MAB The Frisian
Post:
http://boinc.bakerlab.org/rosetta/result.php?resultid=6869420

Never had it before.
Just a computer which couldn't flush (installed R@H last week) because of ZA and I've changed that so it could flush.
Now all of the WU's are like the above.
And couldn't upload any WU's this afternoon.
Reason can be ????
39) Message boards : Number crunching : Credits Granted (Message 9033)
Posted 14 Jan 2006 by Grutte Pier [Wa Oars]~MAB The Frisian
Post:


Well about those 20 second WU's just cuirous how much credit do you think should be granted... most my jobs that run 5000 seconds get about 14 credits. this may be low or high for the avarage .. but anyway asuming 5000 seconds =14 credits then 1 credit is about 357 seconds (~6 min) so um.. i hate saying it but it probaly isnt worth the effort to worry about loosing a 20 second job (unless you pay for your UL/DL bandwith...

yes i love to see credits for work done and credits if an error occours beyond your control... but personaly every time i reboot one of my computers it looses back to the last benchmark (probaly miniuts or more of work + 3-6 min to reboot) so i expect to loose credits now and then

sorry for the soapbox lecture


I do not care about these 20 seconds WU's I didnt get credits granted for.
It's just that were a lot of these WU's that got credits and I found it a bit strang I didn't get anything so I thought what was wrong with the ones I had uploaded. Just curiousity.
Too much time gets wasted on these 0.00xxxx credits but if you've had a lot of these it might count.
40) Message boards : Number crunching : Credits Granted (Message 8942)
Posted 13 Jan 2006 by Grutte Pier [Wa Oars]~MAB The Frisian
Post:
I know/understand ......................the reasons to chose this project.
I'm glad this is cleared up.


While I _totally_ agree that clear and frequent communication is a major "good thing" in general, ................... even _if_ it can be solved, and THEN tell us either "okay, we'll do that" - or "we'd like to, but we can't, and here's why".


I can only agree with that.
But to me, and I assume to others as well, it (the "Technical News" statement) looked like that was all they were going to do (rather incomplete) and that made me rather grumpy.
As I stated before, it's not about the credits I've lost, they can devide them between the last on the list as far as I'm concerned, but the lack of information etc. on that moment.


Previous 20 · Next 20



©2024 University of Washington
https://www.bakerlab.org