Posts by Astro

21) Message boards : Number crunching : Problems with Rosetta version 5.93 (Message 50984)
Posted 25 Jan 2008 by Astro
Post:
yes, the "not selected" is the default of 3 hours.

also, that's set on the "web" so your client must "call home" in order to see and apply the change. This happens when it gets/reports work. But, if you want it to change in the middle of a run, you must do a "project update". You can manually update the projects from the "projects" tab on the manager. Highlight the project name in the right hand box by clicking on it. Then click the "update" button to the left.
22) Message boards : Number crunching : Problems with Rosetta version 5.93 (Message 50981)
Posted 25 Jan 2008 by Astro
Post:
THat's kind of my point. It was ended by the watchdog at 4X his/her 3 hour run pref at 12 hour + a bit. The Task ID shows NO decoy info at all. Was any scientifically worthwhile work performed? Or is it just credit for time served?? This is going to be very typical of all participants except those with a "cpu run time pref" exceeding 8-12 hours(depending on processor, etc).

and now that I think about my one wu and my 6 hour run time. It looks like I should bump that up to the next step above 6hrs or suffer the same fate as everyone else. It's at 99.005 percent after 16:38:00, so is holding to the .1%/hour.

[edit] moved up to 8 hour pref
23) Message boards : Number crunching : Problems with Rosetta version 5.93 (Message 50976)
Posted 25 Jan 2008 by Astro
Post:
These 6+hr per model tasks are basicaly the worst case for the time estimate calculations.

I do think somethings wrong.

For the last 3 hours, it's progressed .1%/hour. If that were true for the full length of the wu, and given I'm 15 hours into it, then I should only be at 1.5% complete. At some point the "% comp" had to have progressed faster, and then at some point went into slow motion mode. I'm aware of how the "to comp" works and have NO issue with that. Also, If I'd had a 1hr, 2 hr, 3 hr, and shortly a 4 hour run time preference, then this one would have been ended by the watchdog. Ofcourse, I'm assuming it'll finish at all. If the .1%/hour holds, then a 6 hour pref would have been ended by the watchdog (have to wait and see total run time before I can say that definitively).

I guess, If these are really that long, then admin should change the % comp mechanism, and say something about having some "unusually LARGE" wus in the system ATM. Otherwise, you're going to get alot of questions and who knows how many users will "abort" just because they don't know it might be "normal".

Heck, I feel that I'm doing them a favor even running it as my gut feeling (without admin acknowledgement that this is normal) is I'm going to get nada for a days work.
24) Message boards : Number crunching : Problems with Rosetta version 5.93 (Message 50974)
Posted 25 Jan 2008 by Astro
Post:
I can't edit after 60 min.

After 3 hours the cpu time seems right, % comp has progressed up to 98.846, and "to comp" has gone up one second to 00:09:54. Hmmm, at .1%/hour there's just 11 more hours to go making it 25 hours/decoy...gotta be a record. I'll not post again until it's nearly over. (yes...I know 98.848% seems like it'd be nearly over...LOL)
25) Message boards : Number crunching : Problems with Rosetta version 5.93 (Message 50965)
Posted 25 Jan 2008 by Astro
Post:
Here's a snapshot of what others might be describing about the 2h4o wus. This is on my wifes laptop which was set to 1 hour run time pref, but I changed it at somepoint last nite to 6 hours(note: I changed it before I knew about this one, her laptop is WAAAAY out in the dining room, which never sees meals on the table, so I'm seldom there). Either way, we're way past that. It's longest recorded decoy (out of 912 recorded) was a 1gida which lasted 16627 seconds (4.61 hrs). I suspended the other projects to see what happens with it.



[edit] after 1 hour run time the cpu time has progressed one hour, and the "% complete has progressed from 98.558 to 98.664, but the "to comp" has remained unchanged.
26) Message boards : Number crunching : Estimated Crunching Time for a Twist Ring Wu ? (Message 50944)
Posted 24 Jan 2008 by Astro
Post:
Hi, I record all my work. To date I have 7343 recorded wus across my hosts. To give you some idea, I put together my "longest three" decoys per host/OS. The time shown is the time it took to do ONE decoy. I removed many which were of the same job type, so it would show 3 different job types. Ofcourse, the number of decoys done depends on "run time", but here's the list.



Hope this also helps.

tony
27) Message boards : Number crunching : Quick ? re: Participant Ranking. (Message 50905)
Posted 22 Jan 2008 by Astro
Post:
Is there a quick way to find out where we fall in the Rosetta @ Home project as far as credit goes? I see all the links for BOINC stats but they all appear to be for BOINC combined projects... I could sort the R @ H users and start paging through them 20 per page at a time but being a newbie, that might take a while...

Boincstats has your individual project stats that look like this. If you click on "participants" (above) it'll take you to "your account", look for a bunch of links somewhere near the bottom, and click on "boincstats". That'll take you to the same page.

There's also the rosetta specific "user stats" at boincstats.

Is this what you were looking for? If not, try some of the other links under "your account".

hope this helps
tony

hope that was quick enough....7 min, 24 sec. including links...LOL
28) Message boards : Number crunching : Switch from W.XP to Linux. (Message 50878)
Posted 21 Jan 2008 by Astro
Post:
dual boot is acheived by a "boot loader". I use GRUB (grand unified bootloader). If you install linux to the hard drive(the one with windows already on it) it'll install the bootloader for you, so when you power on your puter. It goes through bios, then flash screen (if enabled), then to the bootloader where you get to select which OS to use. You can configure it to load either one by default (autoloads the default after timeout)
29) Message boards : Number crunching : Switch from W.XP to Linux. (Message 50877)
Posted 21 Jan 2008 by Astro
Post:
Thanks to you both, seems Ubuntu is good for me so ordering a CD. {takes 2 or 3 months to arrive!}. Would be good to read a *how to*.
Must say I did not realise that a dual boot is possible.

2-3months??? do you have a dvd burner and the internet? Or just a CD and the internet, you could have linux installed by tomorrow.

I went to mandriva and they want you to get the 2008 version, but here's a picture drive "how to" on replacing windows the_perfect_desktop_mandriva_2007_spring_free. and you can download 2008 from Mandriva either as ftp or using a torrent. My version of Spring 2007 Free is 64 bit, haven't seen it listed on the mandriva page if 2008 is also 64 bit, so you might wanna dig around.

The "how to" I spoke of is only for mandriva installation of boinc either with the "free" (free of software which costs money), or ONE which runs from disc. If you don't have an ISO burner program, I can link you to a free one.
30) Message boards : Number crunching : Switch from W.XP to Linux. (Message 50873)
Posted 21 Jan 2008 by Astro
Post:
I don't think I've ever see it proven that one version of linux is "faster", atleast not until you get into special cut down versions(that I've never played with). Being a Win only person(after DOS that is) I tried and found Mandriva (KDE) reasonably simple to use. I have it (2007 Mandriva Spring Free X86-64) and windows running dual boot on all my systems, but wouldn't hesitate to build a system with just linux. Using dual boot gets you the ability to gently switch over. What I mean here is that when the learning linux gets you ready to pull out your last hair, then switch back to ole comfortable windows. I've found that the learning curve is definitely in existence, and best done a chunk at a time.

That said, I've heard good things about other linux Distros, but am still in that learning curve myself, so I don't wanna jump ship just yet. If you just wanna play with linux, but not install. Look to "mandriva ONE". It'll burn to a DVD/CD and you just reboot your puter with the CD inside. Install Boinc, Run some work, and when you wanna go back to windows, simply reboot (removing the cd/dvd) and Linux is GONE. It doesn't install anything, But also if you lose power or reboot, you lose any unfinished works as well.

If you're interested, I'll find you some links where I have posted a sort of "how to" (especially for the cd/dvd ONE version use).

If you've read my linux vs windows thread here, then you know Linux is 20% faster than windows(atleast on my AMD64, and X2's)
31) Message boards : Number crunching : Problems with web site (Message 50832)
Posted 20 Jan 2008 by Astro
Post:
me neither,.....soooooo.... good news....you're not crazy.
32) Message boards : Number crunching : Problems with Rosetta version 5.93 (Message 50817)
Posted 19 Jan 2008 by Astro
Post:
Here's a "scoreboard" update. It shows all the errors for all my systems as it pertains to 5.93, and thier percentages. Any error is annoying, but from my perspective, there's not a large percentage of them.

33) Message boards : Number crunching : Problems with Rosetta version 5.93 (Message 50777)
Posted 17 Jan 2008 by Astro
Post:
That stuck WU which restarted is resultid=133551161 which ended itself on this go around. Was Valid and creditted (but not for the first wasted 2 hours spent on it, plus however long it was stuck for).

The says:

Graphics are disabled due to configuration...
# cpu_run_time_pref: 10800
# random seed: 3171268
**********************************************************************
Rosetta score is stuck or going too long. Watchdog is ending the run!
Stuck at score -113.019 for 900 seconds
**********************************************************************
GZIP SILENT FILE: ./xx1zpy.out
*** glibc detected *** corrupted double-linked list: 0x092683c0 ***
SIGABRT: abort called
Stack trace (18 frames):
[0x8da3037]
[0x8d9de2c]
[0xffffe500]
[0x8e0e444]
[0x8e2330f]
[0x8e27d01]
[0x8e28176]
[0x8e28653]
[0x8df90a1]
[0x8dfaac9]
[0x83c4cc5]
[0x8e0e98f]
[0x8d9fab7]
[0x8d9ff27]
[0x8d2023d]
[0x8d20f35]
[0x8d9a0c5]
[0x8e3aa1a]

Exiting...
No heartbeat from core client for 31 sec - exiting
FILE_LOCK::unlock(): close failed.: Bad file descriptor
Graphics are disabled due to configuration...
# cpu_run_time_pref: 10800
# random seed: 3171268
**********************************************************************
Rosetta score is stuck or going too long. Watchdog is ending the run!
Stuck at score -89.0742 for 900 seconds
**********************************************************************
GZIP SILENT FILE: ./xx1zpy.out
SIGSEGV: segmentation violation
Stack trace (22 frames):
[0x8da3037]
[0x8d9de2c]
[0xffffe500]
[0x89a1824]
[0x804c828]
[0x8a8ae99]
[0x8a8babf]
[0x8d0c170]
[0x8c12abe]
[0x8c14e33]
[0x804c7c2]
[0x8a835ed]
[0x8a8586f]
[0x89363de]
[0x89380e3]
[0x893ba27]
[0x898ad7a]
[0x85e96d6]
[0x87289d2]
[0x8728af2]
[0x8e07384]
[0x8048111]

Exiting...

</stderr_txt>
]]>

Hope something in all this ends with a fix at some point.
34) Message boards : Number crunching : Problems with Rosetta version 5.93 (Message 50774)
Posted 17 Jan 2008 by Astro
Post:
got another stuck one. See details in this post, except this time it restarted at 10 minutes instead of uploading immediately. Looks like I'm in "Babysitter mode" until this one finishes.
35) Message boards : Number crunching : freezed processing time and no CPU usage. (Message 50772)
Posted 17 Jan 2008 by Astro
Post:
Hmmm, I too currently am experiencing this. On my AMD64 X2 5200 (using Linux), I have WU 1zpy_BOINC_TWIST_RINGS_TWIST_ANGLE_SYMM_FOLD_AND_DOCK_RELAX-1zpy_-native__2477_314663_0 that is currently showing CPU Time01:58:28, 100.000% complete, --- to completion, BUT also the status says "Running". Gkrellm shows one Boincsimap unit using 100% of my CPU and the other thread shows 0%. This is identical in behaviour to the one I reported in the 5.93 problems thread. Guess I have to exit the manager, Killall -9 boinc, and restart to free my machine back up and get it producing.
<<<<<time passes between postingssss>>>>>>>
Hmm, I did just that and the when I restarted that wu reverted to show 00:10:25 CPU Time, 6.465% complete, and 02:56:23 remaining. Oh great, guess I have to babysit this one.

Good luck Pedro.

tony

here's a link to the other time this happened to me
36) Message boards : Number crunching : Problems with Rosetta version 5.93 (Message 50687)
Posted 14 Jan 2008 by Astro
Post:
oops. linked to the wrong work unit for the stuck one. It was really, resultid=133258619 which showed this.

<core_client_version>5.10.21</core_client_version>
<![CDATA[
<stderr_txt>
Graphics are disabled due to configuration...
# cpu_run_time_pref: 10800
# random seed: 3630287
**********************************************************************
Rosetta score is stuck or going too long. Watchdog is ending the run!
Stuck at score -84.1725 for 900 seconds
**********************************************************************
GZIP SILENT FILE: ./xx1zpy.out
SIGSEGV: segmentation violation
Stack trace (21 frames):
[0x8da3037]
[0x8d9de2c]
[0xffffe500]
[0x8e2a1b9]
[0x8df8727]
[0x8dfaba1]
[0x8cb4a2c]
[0x8c1179b]
[0x8c14e33]
[0x804c7c2]
[0x8a835ed]
[0x8a8586f]
[0x89363de]
[0x893822e]
[0x893ba27]
[0x898ad7a]
[0x85e96d6]
[0x87289d2]
[0x8728af2]
[0x8e07384]
[0x8048111]

Exiting...
No heartbeat from core client for 31 sec - exiting
FILE_LOCK::unlock(): close failed.: Bad file descriptor
Graphics are disabled due to configuration...
# cpu_run_time_pref: 10800
**********************************************************************
Rosetta score is stuck or going too long. Watchdog is ending the run!
Stuck at score -82.6613 for 900 seconds
**********************************************************************
GZIP SILENT FILE: ./xx1zpy.out
SIGSEGV: segmentation violation
Stack trace (22 frames):
[0x8da3037]
[0x8d9de2c]
[0xffffe500]
[0x89a1824]
[0x804c828]
[0x8a8ae99]
[0x8a8babf]
[0x8d0c170]
[0x8c12abe]
[0x8c14e33]
[0x804c7c2]
[0x8a835ed]
[0x8a8586f]
[0x89363de]
[0x893822e]
[0x893ba27]
[0x898ad7a]
[0x85e96d6]
[0x87289d2]
[0x8728af2]
[0x8e07384]
[0x8048111]

Exiting...
SIGSEGV: segmentation violation
SIGABRT: abort called
[insert] about 200 more of the "abort called", but I snipped it for brevity
SIGABRT: abort called

</stderr_txt>
]]>
37) Message boards : Number crunching : Problems with Rosetta version 5.93 (Message 50684)
Posted 14 Jan 2008 by Astro
Post:
too late to edit.

The second one which was stuck, remained stuck after the work scheduler got back around to it. I ended up exiting the mangager, opening Konsole, and killing Boinc. I then restarted and opened the manager. The result showed "ready to report", so it must have uploaded before the manager displayed it.

Anyway, It was considered "Valid" and was granted credit like this never even happened. It's resultid=133326615
which shows:

<core_client_version>5.10.21</core_client_version>
<![CDATA[
<stderr_txt>
Graphics are disabled due to configuration...
# cpu_run_time_pref: 10800
# random seed: 3623102
======================================================
DONE :: 1 starting structures 9911.7 cpu seconds
This process generated 6 decoys from 6 attempts
======================================================


BOINC :: Watchdog shutting down...
BOINC :: BOINC support services shutting down...

</stderr_txt>
]]>

Which seems completely uneventful to me, but I know it stuck. Leaving my host only using one core for who knows how long.
38) Message boards : Number crunching : Problems with Rosetta version 5.93 (Message 50681)
Posted 14 Jan 2008 by Astro
Post:
I finally got a computation error, and strangely enough, I woke to find one wus stuck at 100% and gkrellm showed 0% cpu use for that core. I have suspended and resumed that wu and now wait for it to run again. The "stuck one" is 1zpy__BOINC_DEFAULT_SYMM_FOLD_AND_DOCK-1zpy_native_2_2519_22709_0. The one which has already reported as a computation error is resultid=133308819 1zpy__BOINC_TWIST_RINGS_TWIST_ANGLE_SYMM_FOLD_AND_DOCK_RELAX-1zpy_-native__2477_294683_0 and shows:

<core_client_version>5.10.21</core_client_version>
<![CDATA[
<message>
process exited with code 193 (0xc1, -63)
</message>
<stderr_txt>
Graphics are disabled due to configuration...
# cpu_run_time_pref: 10800
# random seed: 3191248
**********************************************************************
Rosetta score is stuck or going too long. Watchdog is ending the run!
Stuck at score -66.1132 for 900 seconds
**********************************************************************
GZIP SILENT FILE: ./xx1zpy.out
SIGSEGV: segmentation violation
Stack trace (22 frames):
[0x8da3037]
[0x8d9de2c]
[0xffffe500]
[0x89a1824]
[0x804c828]
[0x8a8ae99]
[0x8a8babf]
[0x8d0c170]
[0x8c12abe]
[0x8c14e33]
[0x804c7c2]
[0x8a835ed]
[0x8a8586f]
[0x89363de]
[0x89380e3]
[0x893ba27]
[0x898ad7a]
[0x85e96d6]
[0x87289d2]
[0x8728af2]
[0x8e07384]
[0x8048111]

Exiting...

so, it looks like I'm going to have two computation errors for my AMD64 X2 5200 under Linux
39) Message boards : Number crunching : new to rosetta using q6600 (Message 50631)
Posted 13 Jan 2008 by Astro
Post:
by default boinc will only use 50% of available ram when actively using the puter and 90% when you're inactive. There are two "independent" place you can choose to set your preferences. You can either do so on the Boinc manager itself setting your "local" prefs,,,OR you can set them up under "Participants" your account here at the website. I have them set to 100% for both setting (active 100/inactive 100) on all my hosts. If you choose "local prefs" then it will "over ride" the web based settings.

welcome aboard

tony
40) Message boards : Number crunching : Problems with Rosetta version 5.93 (Message 50621)
Posted 12 Jan 2008 by Astro
Post:
well i guess its the type of WU i ran, 1 type but only finished 1 sucessfully out of 7 of them or so. so i guess its in the type of WU.


I took the liberty of running your host with my "Rosetta-Pal". Then I copied and color coded all the work from yours combined with all the work from my "windows" hosts. Then I sorted by WU name and weeded out work not of the same "Job type", so we'd be comparing apples with apples. You had windows xp, I had winxp. You had AMD64, I had AMD64. Etc, Etc.

Anyway, I found 4 instances were we did the same "job type" and you can see them below. I see that of the first job type, you had many computation errors, but your host also did one of them successfully.

Your hosts are "Blue" when you had a error, and "Green" when you successfully completed one. Mine are a various colors so I added descriptions to the first column. My host can be discerned from the previous chart with the exception of my wife's laptop "M3700" which is a "Mobile AMD64 3700" using win xp(can't put linux on that one....lol).

So, from what I see, it's probably NOT the job type/wus, or at least my hosts aren't having trouble with them.

I wonder what else it could be??



[edit] on the second set of WUs I noticed a very early return date on the your wu I saw, so I rechecked, and that computation error was with 5.90, whereas my hosts were using 5.93. Also, that one was not a computation error, but Invalid.

Also, Look at the 'good' wu you returned (green text), It's the very next consecutive "task ID" and "Work unit ID" number from the previous one, which failed, so your own host managed to do one type that it had previous failed to do.[/edit]


Previous 20 · Next 20



©2024 University of Washington
https://www.bakerlab.org