Posts by mhhall

1) Message boards : Number crunching : minirosetta 2.14 (Message 66709)
Posted 29 Jun 2010 by mhhall
Post:
Hi folks,
My system is currently executing WU 317305089.

BOINC is showing following properties that would seem to
indicate process is stuck and not checkpointing properly.

CPU Time at last checkpoing: 13:18:13
CPU Time : 15:20:20

Fraction done: 98.925%

Would hate to kill a job so close to comletion,
but I've got to wonder if this is really going to
complete.
2) Message boards : Number crunching : Problems with Rosetta version 5.93 (Message 51007)
Posted 26 Jan 2008 by mhhall
Post:
Please post problems and/or bugs with rosetta 5.93. Thanks for your
support!

My slower computer (ID #187636 -- older Linspire Linux box) is set to accept
jobs of approx 14 hours. I have a job on machine at this time which say it
is 99.67% completed with 50:16:19 of CPU time. For time being, I've suspended
the job. Name starts "2h4o_BOINC_TWIST_RINGS_TWIST_ANGLE_SYMM_FOLD_AND_DOCK
(Work unit 123162090).

Don't know if this is a Rosetta issue or a problem w/ this specific job.
I know that I have another of same name in my queue (135883853).

Just wondering if someone else has seen similar issue/problem.

Hope this helps!!
3) Message boards : Number crunching : Problems with Rosetta version 5.81 (Message 48681)
Posted 15 Nov 2007 by mhhall
Post:
1) I note that the "explain" item does not document what a "Compute Error" is....

2) Work unit 106606679
1n0u__TREEJUMP_ABRELAX_NOTOR-1n0u_-_BARCODE__2241_1083
Appears to have failed on two different machines.

Seems like 2nd time that this has happened to me recently... other job was WU 106970621:
2reb__TREEJUMP_ABRELAX_TOR_EQ_-5_PROB_.5_SAVE_ALL_OUT-2reb_-_BARCODE__2243_7638_0

This looks like a programming issue on both counts
in same routine (ERROR:: Exit from: .pose.cc line: 769)
Or, there is a issue with software running
processes on my machine.


Seems that my machine has generated another "Compute Error" on work unit id
108644649 --- This time without the previously noted error. Since my previous
post, I have upgraded the BOINC Manager on my machine to 5.10.28. Was worried that my problem above might be due to out of date BOINC Manager.

4) Message boards : Number crunching : Problems with Rosetta version 5.81 (Message 48578)
Posted 12 Nov 2007 by mhhall
Post:
1) I note that the "explain" item does not document what a "Compute Error" is....

2) Work unit 106606679
1n0u__TREEJUMP_ABRELAX_NOTOR-1n0u_-_BARCODE__2241_1083
Appears to have failed on two different machines.

Seems like 2nd time that this has happened to me recently... other job was WU 106970621:
2reb__TREEJUMP_ABRELAX_TOR_EQ_-5_PROB_.5_SAVE_ALL_OUT-2reb_-_BARCODE__2243_7638_0

This looks like a programming issue on both counts
in same routine (ERROR:: Exit from: .pose.cc line: 769)
Or, there is a issue with software running
processes on my machine.
5) Message boards : Number crunching : Failed download (Message 39435)
Posted 15 Apr 2007 by mhhall
Post:
I presume that the download server is currently hung.
My dual processor system finished two jobs, had one
remaining to be reported (which appears to have been
accepted), but all downloads to my system are showing
as downloading or "download pending".
6) Message boards : Number crunching : Comparing Claimed vs. Granted Credit. (Message 38192)
Posted 23 Mar 2007 by mhhall
Post:
Sorry briefer version. Lost 1st posting due to
user error. :-(

I have two computers
1 - 187636 running Linux
2 - 430400 running MS XP Pro (two processors)

Pretty routinely, computer one shows a granted credit
that is 180-190% of "claimed credit". Computer 2
just as routinely shows a granted credit that is only
80-90% of "claimed credit".

I am a little confused by this and was wondering if
anyone could explain.

Mike
7) Message boards : Number crunching : Report stuck & aborted 5.01 WU here please - III (Message 14817)
Posted 28 Apr 2006 by mhhall
Post:
[snip]

Please do not abort you current 5.01 Work Units if they are runing well. The science from them is still important to the project. Many of you have asked if credit will be awarded of these failed Work units, and the answer is yes.
[snip]

My current work unit has been running since Monday eve. and shows progress
at 5.90% and CPU time of 148:47:21. Does this meet the criteria of
"running well".

I don't mind letting this process continue to run.....
I'm just worried that its not getting finish anyway....

Mike Hall / Engineering Solutions, Inc.






©2024 University of Washington
https://www.bakerlab.org