Message boards : Number crunching : Problems with Rosetta version 5.78
Author | Message |
---|---|
Beezlebub Joined: 18 Oct 05 Posts: 40 Credit: 260,375 RAC: 0 |
I also have 8 WU's showing: Result ID 104616934 Name 1he8__BOINC_CAPRI14_DOCK_FIXBACKBONE-1he8_-nosillyloop_plexinmonomer__2067_8410_0 Workunit 94936831 Created 10 Sep 2007 10:42:43 UTC Sent 10 Sep 2007 10:43:28 UTC Received 10 Sep 2007 22:03:20 UTC Server state Over Outcome Success Client state Done Exit status 0 (0x0) Computer ID 341092 Report deadline 20 Sep 2007 10:43:28 UTC CPU time 16852.023625 stderr out <core_client_version>5.10.13</core_client_version> <![CDATA[ <stderr_txt> # cpu_run_time_pref: 28800 # random seed: 1272171 ********************************************************************** Rosetta score is stuck or going too long. Watchdog is ending the run! Stuck at score -173.421 for 900 seconds ********************************************************************** GZIP SILENT FILE: .xx1he8.out </stderr_txt> ]]> Validate state Valid Claimed credit 48.3512209341757 Granted credit 20 application version 5.78 e6600 quad @ 2.5ghz 2418 floating point 5227 integer e6750 dual @ 3.71ghz 3598 floating point 7918 integer |
P . P . L . Joined: 20 Aug 06 Posts: 581 Credit: 4,865,274 RAC: 0 |
I've had the same problem. It was a 1he8_**** W.U. https://boinc.bakerlab.org/rosetta/workunit.php?wuid=94800922 <core_client_version>5.10.13</core_client_version> <![CDATA[ <stderr_txt> # cpu_run_time_pref: 28800 # random seed: 1278145 ********************************************************************** Rosetta score is stuck or going too long. Watchdog is ending the run! Stuck at score -202.375 for 900 seconds ********************************************************************** GZIP SILENT FILE: .xx1he8.out </stderr_txt> |
BitSpit Joined: 5 Nov 05 Posts: 33 Credit: 4,147,344 RAC: 0 |
More that got stuck and were killed by the watchdog. Still only Windows that's hanging. https://boinc.bakerlab.org/rosetta/result.php?resultid=104511812 https://boinc.bakerlab.org/rosetta/result.php?resultid=104511810 https://boinc.bakerlab.org/rosetta/result.php?resultid=104493978 https://boinc.bakerlab.org/rosetta/result.php?resultid=104493977 |
Mod.Sense Volunteer moderator Joined: 22 Aug 06 Posts: 4018 Credit: 0 RAC: 0 |
The 20 credits sounds like the nightly credit granting script for failed WUs. I realize they probably show as "success", but they didn't end normally. Some details here. Rosetta Moderator: Mod.Sense |
M.L. Joined: 21 Nov 06 Posts: 182 Credit: 180,462 RAC: 0 |
Result ID 104435631 Name 1g4u__BOINC_CAPRI14_DOCK_FIXBACKBONE-1g4u_-nosillyloop_plexinmonomer__2067_760_0 Workunit 94767401 Created 10 Sep 2007 0:01:42 UTC Sent 10 Sep 2007 0:01:53 UTC Received 11 Sep 2007 16:12:47 UTC Server state Over Outcome Success Client state Done Exit status 0 (0x0) Computer ID 510574 Report deadline 20 Sep 2007 0:01:53 UTC CPU time 13657.84375 stderr out <core_client_version>5.10.20</core_client_version> <![CDATA[ <stderr_txt> # cpu_run_time_pref: 21600 # random seed: 1279871 ********************************************************************** Rosetta score is stuck or going too long. Watchdog is ending the run! Stuck at score -223.806 for 900 seconds ********************************************************************** GZIP SILENT FILE: .xx1g4u.out </stderr_txt> ]]> Validate state Valid Claimed credit 55.7602549562382 Granted credit 20 application version 5.78 |
googloo Joined: 15 Sep 06 Posts: 133 Credit: 22,854,895 RAC: 2,476 |
Mod.Sense wrote: The 20 credits sounds like the nightly credit granting script for failed WUs. I realize they probably show as "success", but they didn't end normally. Some details here. (end of quote) Had that problem with these results: here and here. |
uNiUs Joined: 12 Apr 06 Posts: 3 Credit: 29,739,803 RAC: 24 |
Same problem (columns: result ID, workunit ID, sent, received, server state, outcome, client state, CPU time (sec), claimed credit, granted credit): 104582448 94905137 10 Sep 2007 7:53:36 UTC 11 Sep 2007 20:16:51 UTC Over Success Done 86,268.45 530.82 514.59; 104582610 94905257 10 Sep 2007 7:57:48 UTC 11 Sep 2007 20:16:51 UTC Over Success Done 86,175.33 530.25 530.43; 104589787 94911730 10 Sep 2007 8:28:09 UTC 11 Sep 2007 20:16:51 UTC Over Success Done 21,587.09 132.83 20.00 |
Greg_BE Joined: 30 May 06 Posts: 5691 Credit: 5,859,226 RAC: 0 |
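similar problem but with a new twist (columns: result ID, workunit ID, sent, received, server state, outcome, client state, CPU time (sec), claimed credit, granted credit): 104452274 94782474 10 Sep 2007 0:41:23 UTC 10 Sep 2007 17:38:48 UTC Over Success Done 21,550.19 61.05 20.00; 104452273 94782473 10 Sep 2007 0:41:23 UTC 10 Sep 2007 19:59:44 UTC Over Success Done 7,042.41 19.95 20.00 <-- weird |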
hugothehermit Joined: 26 Sep 05 Posts: 238 Credit: 314,893 RAC: 0 |
I aborted this WU. Nothing was wrong with it as far as I know; I just couldn't finish it in time, so I didn't start it. |
Jim Joined: 15 Oct 06 Posts: 22 Credit: 5,410,546 RAC: 0 |
I'm the second person to get this WU: 94462214. It seems to be missing a file, PROF2.pdb; the download will not finish and just gives an error message, "file not found". |
Ricky@SETI.USA Joined: 13 Dec 05 Posts: 20 Credit: 97,355 RAC: 0 |
9/12/2007 05:19:59||Suspending network activity - user request 9/12/2007 07:04:30|rosetta@home|[error] rosetta_beta not responding to screensaver, requesting exit 9/12/2007 07:25:19|rosetta@home|[error] rosetta_beta not responding to screensaver, killing it 9/12/2007 07:25:24|rosetta@home|Restarting task 1g4u__BOINC_MINIMIZE2_SCORE12_CAPRI14_DOCK_FIXBACKBONE-1g4u_-rxplxn_0472plexinmonomer__2074_62_0 using rosetta_beta version 578 9/12/2007 10:26:29|rosetta@home|Computation for task 1g4u__BOINC_MINIMIZE2_SCORE12_CAPRI14_DOCK_FIXBACKBONE-1g4u_-rxplxn_0472plexinmonomer__2074_62_0 finished 9/12/2007 11:28:53||Resuming network activity Never seen this error before! "Life is like an Ice Cream cone, just when you think you got it licked, it drips all over you!" |
Greg_BE Joined: 30 May 06 Posts: 5691 Credit: 5,859,226 RAC: 0 |
https://boinc.bakerlab.org/rosetta/result.php?resultid=104452274 ********************************************************************** Rosetta score is stuck or going too long. Watchdog is ending the run! Stuck at score -164.509 for 900 seconds |
The_Bad_Penguin Joined: 5 Jun 06 Posts: 2751 Credit: 4,271,025 RAC: 0 |
1he8__BOINC_MINIMIZE2_SCORE12_CAPRI14_DOCK_FIXBACKBONE-1he8_-rxplxn_1030plexinmonomer__2074_1759_0 CPU time 14594.392753 stderr out <core_client_version>5.10.13</core_client_version> <![CDATA[ <stderr_txt> # cpu_run_time_pref: 10800 # random seed: 3919342 ********************************************************************** Rosetta score is stuck or going too long. Watchdog is ending the run! Stuck at score -467.27 for 900 seconds ********************************************************************** GZIP SILENT FILE: .xx1he8.out </stderr_txt> ]]> Validate state Valid Claimed credit 62.7858592995201 Granted credit 20 application version 5.78 |
Rhiju Volunteer moderator Joined: 8 Jan 06 Posts: 223 Credit: 3,546 RAC: 0 |
Thanks to everyone for posting. I think I know how to fix this (the watchdog problem)! I have removed these jobs from the queue for now, and when they are sent out again, we should see fewer premature exits... |
The_Bad_Penguin Joined: 5 Jun 06 Posts: 2751 Credit: 4,271,025 RAC: 0 |
|
Rhiju Volunteer moderator Joined: 8 Jan 06 Posts: 223 Credit: 3,546 RAC: 0 |
One more question -- did you happen to notice if the screen looked totally stuck before the crash? (Probably too much to ask.) Ok, don't want to beat a dead horse, but just noticed |
The_Bad_Penguin Joined: 5 Jun 06 Posts: 2751 Credit: 4,271,025 RAC: 0 |
sorry, didn't notice. i have the quad-core running on its own as a (more or less) dedicated cruncher, and am using the A64 3800+ for i-net / e-mail / ms office / etc. so, really don't look at Rosie running, just check my results page every so often to make sure i'm seeing about what i expect to see... greg_be ??? One more question -- did you happen to notice if the screen looked totally stuck before the crash? |
Jmarks Joined: 16 Jul 07 Posts: 132 Credit: 98,025 RAC: 0 |
I am having the same problems. I usually crunch 830 credits but now 50% of my 5.78 WUs are bad. I do not use the PC for anything else or for any other project, so there is no monitor to see if the PC is acting strange while this is happening. 104430347 94762391 9 Sep 104430349 94762393 9 Sep 104430354 94762398 9 Sep 104430355 94762399 9 Sep 104430357 94762401 9 Sep 104430364 94762408 9 Sep 104430359 94762403 9 Sep 104430358 94762402 9 Sep 104430366 94762410 9 Sep 104430373 94762416 9 Sep 104430372 94762415 9 Sep 104430376 94762419 9 Sep Jmarks |
Mod.Sense Volunteer moderator Joined: 22 Aug 06 Posts: 4018 Credit: 0 RAC: 0 |
Jmarks, sorry for all the failed WUs. Rhiju has pulled those WUs and is working on a fix that will improve things there. Otherwise, about all you can do is cut your runtime preference. Theory being that if your normal credit per task is close to 20, then a failure granted 20 will not be such an impact. Rosetta Moderator: Mod.Sense |
Greg_BE Joined: 30 May 06 Posts: 5691 Credit: 5,859,226 RAC: 0 |
sorry, didn't notice. |
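
For anyone checking a batch of pasted result pages for this failure, here is a minimal sketch that looks for the watchdog message quoted throughout the thread ("Stuck at score ... for 900 seconds") and, when the "Claimed credit" and "Granted credit" fields are present, works out how much credit was lost. The names `WatchdogReport` and `parse_result_text` are made up for this illustration and are not part of BOINC or Rosetta. The patterns are based only on the text pasted in the posts above, so they may need adjusting for other output.

```python
import re
from typing import NamedTuple, Optional


class WatchdogReport(NamedTuple):
    """Hypothetical summary of one pasted result page (not a BOINC type)."""
    stuck_score: float
    stuck_seconds: int
    claimed_credit: Optional[float]
    granted_credit: Optional[float]

    @property
    def credit_shortfall(self) -> Optional[float]:
        # Claimed minus granted, or None if either field was not pasted.
        if self.claimed_credit is None or self.granted_credit is None:
            return None
        return self.claimed_credit - self.granted_credit


# Patterns assumed from the stderr and result text quoted in this thread.
_STUCK = re.compile(r"Stuck at score (-?\d+(?:\.\d+)?) for (\d+) seconds")
_CLAIMED = re.compile(r"Claimed credit (\d+(?:\.\d+)?)")
_GRANTED = re.compile(r"Granted credit (\d+(?:\.\d+)?)")


def parse_result_text(text: str) -> Optional[WatchdogReport]:
    """Return a report if the watchdog message is present, else None."""
    stuck = _STUCK.search(text)
    if stuck is None:
        return None
    claimed = _CLAIMED.search(text)
    granted = _GRANTED.search(text)
    return WatchdogReport(
        stuck_score=float(stuck.group(1)),
        stuck_seconds=int(stuck.group(2)),
        claimed_credit=float(claimed.group(1)) if claimed else None,
        granted_credit=float(granted.group(1)) if granted else None,
    )


if __name__ == "__main__":
    # Excerpt of the first result quoted in the thread.
    sample = (
        "Watchdog is ending the run! Stuck at score -173.421 for 900 seconds "
        "Validate state Valid Claimed credit 48.3512209341757 "
        "Granted credit 20 application version 5.78"
    )
    report = parse_result_text(sample)
    print(report)
    if report is not None:
        print("credit shortfall:", report.credit_shortfall)
```

The shortfall is the number Mod.Sense's suggestion is aimed at: with a shorter runtime preference the claimed credit per task is closer to 20, so the gap to a flat 20 granted is smaller.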
©2025 University of Washington
https://www.bakerlab.org