Posts by BrnmccO1

1) Message boards : Number crunching : Problems with Minirosetta v1.54 (Message 59532)
Posted 12 Feb 2009 by BrnmccO1
Post:
Very good so far, zero error results on all machines for a long time. This 1.54 is much better than the prev versions, much more stable etc. Keep up the good work stamping out the bugs.

Its been a long time since I've reviewed the results on all my crunchers and found no compute errors. If things keep going the way they are, we might break 100 Tflops yet!
2) Message boards : Number crunching : BOINC on the XBOX360 (Message 55794)
Posted 15 Sep 2008 by BrnmccO1
Post:
I believe a Petaflop is 1000TFlops...
ExaFlop would be 1000 PetaFlops :S ??



Yes, I believe thats correct :)
3) Message boards : Number crunching : BOINC on the XBOX360 (Message 55755)
Posted 14 Sep 2008 by BrnmccO1
Post:
i attended the east coast boinc meeting at u delaware, and david anderson would seem to agree with the idea that gpu's will drive a majority of the future performance increase. he's already talking about exaflops possibly within the next 3-5 years.



I'd be happy with just a few PetaFlops, ExaFlops... wow, isn't that like a 1000 TeraFlops? At 70-75 T-Flops, we've got a couple orders of magnitude yet to go here :D
4) Message boards : Number crunching : Minirosetta v1.34 bug thread (Message 55753)
Posted 14 Sep 2008 by BrnmccO1
Post:
First unhandled exception error, access violation again:

192043183

Unhandled Exception Detected...

- Unhandled Exception Record -
Reason: Access Violation (0xc0000005) at address 0x007D3863 read attempt to address 0x00000008

Got one of these too, a day ago or so: WU

ERROR: unrecognized aa HOH
ERROR:: Exit from: ....srccoreiopdbfile_data.cc line: 468
called boinc_finish
5) Message boards : Number crunching : Minirosetta v1.34 bug thread (Message 55725)
Posted 12 Sep 2008 by BrnmccO1
Post:
Still getting the "needs psipred_ss2 to run filters" messages just like in 1.32...

191557749

What gives with the filters?
6) Message boards : Number crunching : Minirosetta v1.32 bug thread (Message 55724)
Posted 12 Sep 2008 by BrnmccO1
Post:
Well, on this comp I've had quite a few 1.32 errors and some 1.28 errors as well. Like other people it run's 5.98's 100%.

191481586 is a typical example of the usual "Unhandled Exception Error" that bombs out the WU.

Hopefully 1.34 will be better! In any case, I for one won't be missing 1.32 RIP.
7) Message boards : Number crunching : Servers running, but no work available?? (Message 55096)
Posted 16 Aug 2008 by BrnmccO1
Post:
All the servers appear to be running normally, but my boxes are rapidly running out of WU's...

Server Status as of 16 Aug 2008 16:47:42 UTC
[ Scheduler running ] Queued: 0
In progress: 334,148

Whats up?

8/16/2008 10:40:39 AM|rosetta@home|Sending scheduler request: To fetch work. Requesting 1 seconds of work, reporting 1 completed tasks
8/16/2008 10:40:44 AM|rosetta@home|Scheduler request succeeded: got 0 new tasks
8/16/2008 10:44:49 AM|rosetta@home|Sending scheduler request: To fetch work. Requesting 348 seconds of work, reporting 0 completed tasks
8/16/2008 10:44:54 AM|rosetta@home|Scheduler request succeeded: got 0 new tasks
8/16/2008 10:48:59 AM|rosetta@home|Sending scheduler request: To fetch work. Requesting 688 seconds of work, reporting 0 completed tasks
8/16/2008 10:49:04 AM|rosetta@home|Scheduler request succeeded: got 0 new tasks
8/16/2008 10:53:09 AM|rosetta@home|Sending scheduler request: To fetch work. Requesting 1028 seconds of work, reporting 0 completed tasks
8/16/2008 10:53:14 AM|rosetta@home|Scheduler request succeeded: got 0 new tasks
8/16/2008 10:57:19 AM|rosetta@home|Sending scheduler request: To fetch work. Requesting 1368 seconds of work, reporting 0 completed tasks
8/16/2008 10:57:24 AM|rosetta@home|Scheduler request succeeded: got 0 new tasks
8/16/2008 11:01:29 AM|rosetta@home|Sending scheduler request: To fetch work. Requesting 1707 seconds of work, reporting 0 completed tasks
8/16/2008 11:01:34 AM|rosetta@home|Scheduler request succeeded: got 0 new tasks
8/16/2008 11:05:39 AM|rosetta@home|Sending scheduler request: To fetch work. Requesting 2046 seconds of work, reporting 0 completed tasks
8/16/2008 11:05:44 AM|rosetta@home|Scheduler request succeeded: got 0 new tasks
8/16/2008 11:13:09 AM|rosetta@home|Sending scheduler request: To fetch work. Requesting 2647 seconds of work, reporting 0 completed tasks
8/16/2008 11:13:14 AM|rosetta@home|Scheduler request succeeded: got 0 new tasks
8) Message boards : Number crunching : Minirosetta v1.32 bug thread (Message 55090)
Posted 15 Aug 2008 by BrnmccO1
Post:
Another one bites the dust: 184906376

What causes this access violation?? And again, it only happens with the Mini's, not Beta.
9) Message boards : Number crunching : Minirosetta v1.32 bug thread (Message 55085)
Posted 14 Aug 2008 by BrnmccO1
Post:
Here's another Rosetta Mini unhandled exception error. I got quite a lot of these from 1.28, and this is the only one I have got from ver 1.32 so far. It's always on just the one computer, and always from the Rosetta Mini's. I've never gotten one of these errors ever from 5.82 or 5.98 on any system. Here's a link to the failed WU:

184522226
10) Message boards : Number crunching : teraFLOPS estimate? (Message 55014)
Posted 10 Aug 2008 by BrnmccO1
Post:
74 now, gogogo! Lol, ;p
11) Message boards : Number crunching : Minirosetta v1.32 bug thread (Message 55013)
Posted 10 Aug 2008 by BrnmccO1
Post:
Hey David, thanks for clearing that up for us, I was about to say "Is there some files missing from the Database?"

Anyhow, just out of curiosity, I checked the Projects folder on all my machines, and it appears that the Mini-Rosetta rev 1.28 executable is gone but the old Database, "minirosetta_database_rev23035" is still there. The new 23513 is there as well. Will there be any more use for the 23035 that went with 1.28? Or is it safe to delete it?

Thanks, keep up the good work guys!

P.S. When is/does CASP8 wrap up?
12) Message boards : Number crunching : Problems with Rosetta version 5.98 (Message 54190)
Posted 5 Jul 2008 by BrnmccO1
Post:
this FRA_t453_CASP8_HYBRID_MANUAL_1_IGNORE_THE_RESTt451_1_axmin1_0001_4165_78 WU seemed to crunch correctly, then it bombed out with:

<message><file_xfer_error>
<file_name>FRA_t453_CASP8_HYBRID_MANUAL_1_IGNORE_THE_RESTt451_1_axmin1_0001_4165_78_1_0</file_name>
<error_code>-161</error_code>
<error_message></error_message>
</file_xfer_error>

This WU did the same for the other cruncher as well.


I got the same -161 Output file missing error from one of my t453's as well.
13) Message boards : Number crunching : Problems with Rosetta version 5.98 (Message 54188)
Posted 5 Jul 2008 by BrnmccO1
Post:
159639723

Compute error after full run, also failed on someone else's host as well. Output file missing.

<core_client_version>5.10.45</core_client_version>
<![CDATA[
<stderr_txt>
# cpu_run_time_pref: 10800
# random seed: 2136320
======================================================
DONE :: 1 starting structures 10289.6 cpu seconds
This process generated 1 decoys from 1 attempts
0 starting pdbs were skipped
======================================================


BOINC :: Watchdog shutting down...
BOINC :: BOINC support services shutting down...

</stderr_txt>
<message>
<file_xfer_error>
<file_name>FRA_t453_CASP8_HYBRID_MANUAL_1_IGNORE_THE_RESTt451_1_axmin1_0001_4165_2909_0_0</file_name>
<error_code>-161</error_code>
</file_xfer_error>

</message>
]]>

14) Message boards : Number crunching : Problems with Rosetta version 5.98 (Message 54089)
Posted 30 Jun 2008 by BrnmccO1
Post:
Bizzare problem with this WU; had an 'unhandled exception error' after about approx 50 mins CPU run time, with a lenthy Std_Out: 157316144

<core_client_version>5.10.45</core_client_version>
<![CDATA[
<message>
- exit code -1073741819 (0xc0000005)
</message>
<stderr_txt>
# cpu_run_time_pref: 10800
# random seed: 2747207


Unhandled Exception Detected...

- Unhandled Exception Record -
Reason: Access Violation (0xc0000005) at address 0x00B3C947 read attempt to address 0x000000A4

Engaging BOINC Windows Runtime Debugger...


Otherwise no other errors so far with 5.98 on both of my hosts (knocks on wood ;p)
15) Message boards : Number crunching : Minirosetta v1.28 bug thread (Message 54024)
Posted 27 Jun 2008 by BrnmccO1
Post:
2 more Mini128 bugs/crashes.

158211853
157910486

One failed on another computer, one completed sucessfully, both are CASP8's
16) Message boards : Number crunching : Problems with version 5.96 (Message 53978)
Posted 25 Jun 2008 by BrnmccO1
Post:
Have also had a rash of compute errors last two weeks. Mostly the aforementioned t405's and a few t434's as well.

Here's a list of the failed WU's:
http://boinc.bakerlab.org/rosetta/workunit.php?wuid=158023723
http://boinc.bakerlab.org/rosetta/workunit.php?wuid=155316236
http://boinc.bakerlab.org/rosetta/workunit.php?wuid=155266537
http://boinc.bakerlab.org/rosetta/workunit.php?wuid=156920807 <-- had to manually abort, was 'stuck'
http://boinc.bakerlab.org/rosetta/workunit.php?wuid=156498712

http://boinc.bakerlab.org/rosetta/workunit.php?wuid=158046502
http://boinc.bakerlab.org/rosetta/workunit.php?wuid=157548537 <-- Mini Rosetta, was sucessful on someone elses computer tho.
http://boinc.bakerlab.org/rosetta/workunit.php?wuid=155266298 <-- T409
http://boinc.bakerlab.org/rosetta/workunit.php?wuid=156219608 <-- t405 had to manually abort

Other than the recent troubles, things have been pretty good the past year for me, so I'll keep plugging away!


Cheers,
17) Message boards : Number crunching : teraFLOPS estimate? (Message 53977)
Posted 25 Jun 2008 by BrnmccO1
Post:
Back up to about 57 t-flops now! :))

Drat those t405's.... keep up the good work, in a couple days it will be one year to the day I put my boxes on Rosetta. (Not going to mention the prev project I was on, that was an "Epic Fail", lol)

- Happy Camper (well, cruncher)






©2024 University of Washington
https://www.bakerlab.org