1)
Message boards :
Number crunching :
minirosetta 2.14
(Message 66709)
Posted 29 Jun 2010 by mhhall Post: Hi folks, My system is currently executing WU 317305089. BOINC is showing the following properties, which would seem to indicate the process is stuck and not checkpointing properly:
CPU time at last checkpoint: 13:18:13
CPU time: 15:20:20
Fraction done: 98.925%
I would hate to kill a job so close to completion, but I have to wonder if this is really going to complete.
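For anyone checking a similar work unit, the time elapsed since the last checkpoint is just the difference between the two CPU times quoted above. A minimal Python sketch follows; the helper names are made up for illustration and are not part of BOINC.

```python
def hms_to_seconds(text):
    """Convert an 'H:MM:SS' string (hours may exceed 24) to seconds."""
    hours, minutes, seconds = (int(part) for part in text.split(":"))
    return hours * 3600 + minutes * 60 + seconds


def seconds_to_hms(total):
    """Format a number of seconds back into 'H:MM:SS'."""
    hours, rem = divmod(total, 3600)
    minutes, seconds = divmod(rem, 60)
    return f"{hours}:{minutes:02d}:{seconds:02d}"


# Values copied from the task properties in the post above.
cpu_at_checkpoint = hms_to_seconds("13:18:13")
cpu_now = hms_to_seconds("15:20:20")

gap = cpu_now - cpu_at_checkpoint
print(seconds_to_hms(gap))  # 2:02:07
```

So the task had accumulated about two hours of CPU time without writing a checkpoint, which is the symptom the post describes.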
2)
Message boards :
Number crunching :
Problems with Rosetta version 5.93
(Message 51007)
Posted 26 Jan 2008 by mhhall Post: Please post problems and/or bugs with rosetta 5.93. Thanks for your My slower computer (ID #187636 -- older Linspire Linux box) is set to accept jobs of approx. 14 hours. I have a job on that machine at this time which says it is 99.67% completed with 50:16:19 of CPU time. For the time being, I've suspended the job. Its name starts with "2h4o_BOINC_TWIST_RINGS_TWIST_ANGLE_SYMM_FOLD_AND_DOCK" (Work unit 123162090). I don't know if this is a Rosetta issue or a problem with this specific job. I know that I have another of the same name in my queue (135883853). Just wondering if someone else has seen a similar issue/problem. Hope this helps!!
3)
Message boards :
Number crunching :
Problems with Rosetta version 5.81
(Message 48681)
Posted 15 Nov 2007 by mhhall Post: 1) I note that the "explain" item does not document what a "Compute Error" is.... Seems that my machine has generated another "Compute Error" on work unit id 108644649 --- this time without the previously noted error. Since my previous post, I have upgraded the BOINC Manager on my machine to 5.10.28. I was worried that my problem above might be due to an out-of-date BOINC Manager.
4)
Message boards :
Number crunching :
Problems with Rosetta version 5.81
(Message 48578)
Posted 12 Nov 2007 by mhhall Post: 1) I note that the "explain" item does not document what a "Compute Error" is....
2) Work unit 106606679 (1n0u__TREEJUMP_ABRELAX_NOTOR-1n0u_-_BARCODE__2241_1083) appears to have failed on two different machines. This seems like the 2nd time that this has happened to me recently; the other job was WU 106970621 (2reb__TREEJUMP_ABRELAX_TOR_EQ_-5_PROB_.5_SAVE_ALL_OUT-2reb_-_BARCODE__2243_7638_0). This looks like a programming issue on both counts in the same routine (ERROR:: Exit from: .pose.cc line: 769). Or, there is an issue with the software running processes on my machine.
5)
Message boards :
Number crunching :
Failed download
(Message 39435)
Posted 15 Apr 2007 by mhhall Post: I presume that the download server is currently hung. My dual processor system finished two jobs, had one remaining to be reported (which appears to have been accepted), but all downloads to my system are showing as downloading or "download pending".
6)
Message boards :
Number crunching :
Comparing Claimed vs. Granted Credit.
(Message 38192)
Posted 23 Mar 2007 by mhhall Post: Sorry, this is a briefer version; I lost my first posting due to user error. :-( I have two computers:
1 - 187636 running Linux
2 - 430400 running MS XP Pro (two processors)
Pretty routinely, computer 1 shows a granted credit that is 180-190% of "claimed credit". Computer 2 just as routinely shows a granted credit that is only 80-90% of "claimed credit". I am a little confused by this and was wondering if anyone could explain. Mike
7)
Message boards :
Number crunching :
Report stuck & aborted 5.01 WU here please - III
(Message 14817)
Posted 28 Apr 2006 by mhhall Post: [snip] Please do not abort your current 5.01 Work Units if they are running well. The science from them is still important to the project. Many of you have asked if credit will be awarded for these failed Work Units, and the answer is yes. [snip]
My current work unit has been running since Monday evening and shows progress at 5.90% and CPU time of 148:47:21. Does this meet the criteria of "running well"? I don't mind letting this process continue to run..... I'm just worried that it's not going to finish anyway.... Mike Hall / Engineering Solutions, Inc.
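One rough way to judge whether a unit like this is "running well" is to extrapolate from the figures quoted above. The Python sketch below assumes progress is linear in CPU time, which is only an assumption (the reported fraction done is not guaranteed to advance evenly), and the helper name is hypothetical.

```python
def hms_to_seconds(text):
    """Convert an 'H:MM:SS' string (hours may exceed 24) to seconds."""
    hours, minutes, seconds = (int(part) for part in text.split(":"))
    return hours * 3600 + minutes * 60 + seconds


# Figures copied from the post above.
cpu_seconds = hms_to_seconds("148:47:21")
fraction_done = 5.90 / 100

# Naive linear extrapolation: total = elapsed / fraction completed.
projected_total_hours = cpu_seconds / fraction_done / 3600
projected_remaining_hours = projected_total_hours - cpu_seconds / 3600

print(f"projected total: {projected_total_hours:.0f} h")
print(f"projected remaining: {projected_remaining_hours:.0f} h")
```

Under that assumption the unit would need on the order of 2,500 CPU hours in total, so the worry that it will not finish looks reasonable.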
©2024 University of Washington
https://www.bakerlab.org