Problems with Rosetta version 5.78

Message boards : Number crunching : Problems with Rosetta version 5.78

Beezlebub
Joined: 18 Oct 05
Posts: 40
Credit: 260,375
RAC: 0
Message 45970 - Posted: 11 Sep 2007, 1:07:10 UTC

I also have 8 WU's showing:

Result ID 104616934
Name 1he8__BOINC_CAPRI14_DOCK_FIXBACKBONE-1he8_-nosillyloop_plexinmonomer__2067_8410_0
Workunit 94936831
Created 10 Sep 2007 10:42:43 UTC
Sent 10 Sep 2007 10:43:28 UTC
Received 10 Sep 2007 22:03:20 UTC
Server state Over
Outcome Success
Client state Done
Exit status 0 (0x0)
Computer ID 341092
Report deadline 20 Sep 2007 10:43:28 UTC
CPU time 16852.023625
stderr out

<core_client_version>5.10.13</core_client_version>
<![CDATA[
<stderr_txt>
# cpu_run_time_pref: 28800
# random seed: 1272171
**********************************************************************
Rosetta score is stuck or going too long. Watchdog is ending the run!
Stuck at score -173.421 for 900 seconds
**********************************************************************
GZIP SILENT FILE: .xx1he8.out

</stderr_txt>
]]>

Validate state Valid
Claimed credit 48.3512209341757
Granted credit 20
application version 5.78
e6600 quad @ 2.5ghz
2418 floating point
5227 integer

e6750 dual @ 3.71ghz
3598 floating point
7918 integer


P . P . L .

Joined: 20 Aug 06
Posts: 581
Credit: 4,865,274
RAC: 0
Message 45976 - Posted: 11 Sep 2007, 3:39:12 UTC

I've had the same problem.

It was a 1he8_**** W.U.

https://boinc.bakerlab.org/rosetta/workunit.php?wuid=94800922

<core_client_version>5.10.13</core_client_version>
<![CDATA[
<stderr_txt>
# cpu_run_time_pref: 28800
# random seed: 1278145
**********************************************************************
Rosetta score is stuck or going too long. Watchdog is ending the run!
Stuck at score -202.375 for 900 seconds
**********************************************************************
GZIP SILENT FILE: .xx1he8.out

</stderr_txt>

BitSpit
Joined: 5 Nov 05
Posts: 33
Credit: 4,147,344
RAC: 0
Message 45992 - Posted: 11 Sep 2007, 11:09:42 UTC

Mod.Sense
Volunteer moderator

Joined: 22 Aug 06
Posts: 4018
Credit: 0
RAC: 0
Message 45996 - Posted: 11 Sep 2007, 12:22:58 UTC

The 20 credits sounds like the nightly credit granting script for failed WUs. I realize they probably show as "success", but they didn't end normally. Some details here.
Rosetta Moderator: Mod.Sense
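
For anyone who wants to check their own results for this, here is a rough sketch of how the watchdog exits could be picked out of the stderr text, since the Outcome field still says "Success". The function name `watchdog_exit` and the regular expression are my own assumptions, based only on the message wording pasted in this thread; this is not a project tool.

```python
import re

# Pattern for the watchdog line as it appears in the stderr output quoted in this thread.
# (Assumption: the wording "Stuck at score X for N seconds" is stable across results.)
WATCHDOG_RE = re.compile(r"Stuck at score (-?\d+(?:\.\d+)?) for (\d+) seconds")


def watchdog_exit(stderr_text):
    """Return (score, seconds) if the watchdog ended the run, else None."""
    match = WATCHDOG_RE.search(stderr_text)
    if match is None:
        return None
    return float(match.group(1)), int(match.group(2))


# Sample text copied from the first result posted in this thread.
sample = """# cpu_run_time_pref: 28800
# random seed: 1272171
Rosetta score is stuck or going too long. Watchdog is ending the run!
Stuck at score -173.421 for 900 seconds
"""

print(watchdog_exit(sample))
```

A result whose stderr matches this pattern would be one that "didn't end normally" in the sense described above, whatever the Outcome column shows.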
M.L.

Joined: 21 Nov 06
Posts: 182
Credit: 180,462
RAC: 0
Message 46001 - Posted: 11 Sep 2007, 16:19:12 UTC

Result ID 104435631
Name 1g4u__BOINC_CAPRI14_DOCK_FIXBACKBONE-1g4u_-nosillyloop_plexinmonomer__2067_760_0
Workunit 94767401
Created 10 Sep 2007 0:01:42 UTC
Sent 10 Sep 2007 0:01:53 UTC
Received 11 Sep 2007 16:12:47 UTC
Server state Over
Outcome Success
Client state Done
Exit status 0 (0x0)
Computer ID 510574
Report deadline 20 Sep 2007 0:01:53 UTC
CPU time 13657.84375
stderr out

<core_client_version>5.10.20</core_client_version>
<![CDATA[
<stderr_txt>
# cpu_run_time_pref: 21600
# random seed: 1279871
**********************************************************************
Rosetta score is stuck or going too long. Watchdog is ending the run!
Stuck at score -223.806 for 900 seconds
**********************************************************************
GZIP SILENT FILE: .xx1g4u.out

</stderr_txt>
]]>


Validate state Valid
Claimed credit 55.7602549562382
Granted credit 20
application version 5.78





googloo
Joined: 15 Sep 06
Posts: 133
Credit: 21,718,842
RAC: 5,931
Message 46005 - Posted: 11 Sep 2007, 17:21:05 UTC - in response to Message 45996.  

The 20 credits sounds like the nightly credit granting script for failed WUs. I realize they probably show as "success", but they didn't end normally. Some details here.


Had that problem with these results: here and here.
uNiUs
Joined: 12 Apr 06
Posts: 3
Credit: 21,115,928
RAC: 855
Message 46028 - Posted: 11 Sep 2007, 20:49:13 UTC
Last modified: 11 Sep 2007, 20:54:51 UTC

Same problem:

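Result ID | Workunit | Sent | Received | Server state | Outcome | Client state | CPU time (sec) | Claimed credit | Granted credit
104582448 | 94905137 | 10 Sep 2007 7:53:36 UTC | 11 Sep 2007 20:16:51 UTC | Over | Success | Done | 86,268.45 | 530.82 | 514.59
104582610 | 94905257 | 10 Sep 2007 7:57:48 UTC | 11 Sep 2007 20:16:51 UTC | Over | Success | Done | 86,175.33 | 530.25 | 530.43
104589787 | 94911730 | 10 Sep 2007 8:28:09 UTC | 11 Sep 2007 20:16:51 UTC | Over | Success | Done | 21,587.09 | 132.83 | 20.00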
Greg_BE
Joined: 30 May 06
Posts: 5664
Credit: 5,710,284
RAC: 2,004
Message 46031 - Posted: 11 Sep 2007, 21:27:21 UTC

similar problem but with a new twist

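Result ID | Workunit | Sent | Received | Server state | Outcome | Client state | CPU time (sec) | Claimed credit | Granted credit
104452274 | 94782474 | 10 Sep 2007 0:41:23 UTC | 10 Sep 2007 17:38:48 UTC | Over | Success | Done | 21,550.19 | 61.05 | 20.00
104452273 | 94782473 | 10 Sep 2007 0:41:23 UTC | 10 Sep 2007 19:59:44 UTC | Over | Success | Done | 7,042.41 | 19.95 | 20.00 <-- weird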
hugothehermit

Joined: 26 Sep 05
Posts: 238
Credit: 314,893
RAC: 0
Message 46043 - Posted: 12 Sep 2007, 4:54:03 UTC

I aborted this WU. Nothing was wrong with it as far as I know; I just couldn't finish it in time, so I didn't start it.
Jim
Joined: 15 Oct 06
Posts: 22
Credit: 5,410,546
RAC: 0
Message 46044 - Posted: 12 Sep 2007, 4:55:19 UTC - in response to Message 45712.  
Last modified: 12 Sep 2007, 4:56:05 UTC

I'm the second person to get this WU: 94462214
It seems to be missing a file, PROF2.pdb, and will not finish the download.
I just get an error message: "file not found".

Ricky@SETI.USA
Joined: 13 Dec 05
Posts: 20
Credit: 97,355
RAC: 0
Message 46069 - Posted: 12 Sep 2007, 15:50:03 UTC

9/12/2007 05:19:59||Suspending network activity - user request
9/12/2007 07:04:30|rosetta@home|[error] rosetta_beta not responding to screensaver, requesting exit
9/12/2007 07:25:19|rosetta@home|[error] rosetta_beta not responding to screensaver, killing it
9/12/2007 07:25:24|rosetta@home|Restarting task 1g4u__BOINC_MINIMIZE2_SCORE12_CAPRI14_DOCK_FIXBACKBONE-1g4u_-rxplxn_0472plexinmonomer__2074_62_0 using rosetta_beta version 578
9/12/2007 10:26:29|rosetta@home|Computation for task 1g4u__BOINC_MINIMIZE2_SCORE12_CAPRI14_DOCK_FIXBACKBONE-1g4u_-rxplxn_0472plexinmonomer__2074_62_0 finished
9/12/2007 11:28:53||Resuming network activity

Never seen this error before!

"Life is like an Ice Cream cone, just when you think you got it licked, it drips all over you!"

Greg_BE
Joined: 30 May 06
Posts: 5664
Credit: 5,710,284
RAC: 2,004
Message 46089 - Posted: 12 Sep 2007, 21:06:26 UTC

https://boinc.bakerlab.org/rosetta/result.php?resultid=104452274
**********************************************************************
Rosetta score is stuck or going too long. Watchdog is ending the run!
Stuck at score -164.509 for 900 seconds
The_Bad_Penguin
Joined: 5 Jun 06
Posts: 2751
Credit: 4,271,025
RAC: 0
Message 46095 - Posted: 12 Sep 2007, 22:18:34 UTC
Last modified: 12 Sep 2007, 22:21:17 UTC

1he8__BOINC_MINIMIZE2_SCORE12_CAPRI14_DOCK_FIXBACKBONE-1he8_-rxplxn_1030plexinmonomer__2074_1759_0

CPU time 14594.392753

stderr out

<core_client_version>5.10.13</core_client_version>
<![CDATA[
<stderr_txt>
# cpu_run_time_pref: 10800
# random seed: 3919342
**********************************************************************
Rosetta score is stuck or going too long. Watchdog is ending the run!
Stuck at score -467.27 for 900 seconds
**********************************************************************
GZIP SILENT FILE: .xx1he8.out

</stderr_txt>
]]>

Validate state Valid
Claimed credit 62.7858592995201
Granted credit 20
application version 5.78
Rhiju
Volunteer moderator

Joined: 8 Jan 06
Posts: 223
Credit: 3,546
RAC: 0
Message 46098 - Posted: 12 Sep 2007, 22:30:31 UTC
Last modified: 12 Sep 2007, 22:31:57 UTC

Thanks to everyone for posting. I think I know how to fix this (the watchdog problem)! I have removed these jobs from the queue for now, and when they are sent out again, we should see fewer premature exits...
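
For readers wondering what a "stuck score" check looks like in general, here is a minimal sketch. It is not Rosetta's actual watchdog code. The class name `StuckScoreWatchdog`, the `tolerance` parameter, and the idea of feeding it timestamps by hand are all assumptions made for illustration; only the 900 second limit comes from the stderr messages posted in this thread.

```python
class StuckScoreWatchdog:
    """Hypothetical watchdog: flags a run whose score has not changed for too long."""

    def __init__(self, limit_seconds=900.0, tolerance=0.0):
        self.limit_seconds = limit_seconds  # 900 s matches the messages in this thread
        self.tolerance = tolerance          # assumed: smallest change that counts as progress
        self.last_score = None
        self.last_change = None

    def update(self, score, now):
        """Record a score sample taken at time `now` (seconds); return True if stuck."""
        if self.last_score is None or abs(score - self.last_score) > self.tolerance:
            # The score moved (or this is the first sample), so restart the timer.
            self.last_score = score
            self.last_change = now
            return False
        # The score is unchanged; report stuck once the limit has elapsed.
        return (now - self.last_change) >= self.limit_seconds


wd = StuckScoreWatchdog()
print(wd.update(-173.421, 0))    # False: first sample
print(wd.update(-173.421, 600))  # False: unchanged, but only 600 s so far
print(wd.update(-173.421, 900))  # True: unchanged for 900 s
```

A check of this kind cannot tell a hung run from a legitimate step that simply takes longer than the limit without updating the score. Whether that is what happened with these work units is not stated in this thread.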

The_Bad_Penguin
Joined: 5 Jun 06
Posts: 2751
Credit: 4,271,025
RAC: 0
Message 46099 - Posted: 12 Sep 2007, 22:31:31 UTC - in response to Message 46098.  
Last modified: 12 Sep 2007, 22:34:03 UTC

Ok, don't want to beat a dead horse, but just noticed

this also...
Rhiju
Volunteer moderator

Joined: 8 Jan 06
Posts: 223
Credit: 3,546
RAC: 0
Message 46100 - Posted: 12 Sep 2007, 22:43:47 UTC - in response to Message 46099.  

One more question -- did you happen to notice if the screen looked totally stuck before the crash?
(Probably too much to ask.)

Ok, don't want to beat a dead horse, but just noticed

this also...


The_Bad_Penguin
Joined: 5 Jun 06
Posts: 2751
Credit: 4,271,025
RAC: 0
Message 46101 - Posted: 12 Sep 2007, 22:48:56 UTC - in response to Message 46100.  
Last modified: 12 Sep 2007, 22:50:26 UTC

sorry, didn't notice.

i have the quad-core running on its own as a (more or less) dedicated cruncher, and am using the A64 3800+ for i-net / e-mail / ms office / etc.

so, really don't look at Rosie running, just check my results page every so often to make sure i'm seeing about what i expect to see...

greg_be ???

One more question -- did you happen to notice if the screen looked totally stuck before the crash?
(Probably too much to ask.)

Jmarks
Joined: 16 Jul 07
Posts: 132
Credit: 98,025
RAC: 0
Message 46117 - Posted: 13 Sep 2007, 11:53:39 UTC
Last modified: 13 Sep 2007, 11:54:30 UTC

I am having the same problems. I usually crunch 830 credits, but now 50% of my 5.78 WUs are bad. I do not use the PC for anything else or any other project, so there is no monitor to see whether the PC is acting strangely while this is happening.
104430347 94762391 9 Sep
104430349 94762393 9 Sep
104430354 94762398 9 Sep
104430355 94762399 9 Sep
104430357 94762401 9 Sep
104430364 94762408 9 Sep
104430359 94762403 9 Sep
104430358 94762402 9 Sep
104430366 94762410 9 Sep
104430373 94762416 9 Sep
104430372 94762415 9 Sep
104430376 94762419 9 Sep
Jmarks
Mod.Sense
Volunteer moderator

Joined: 22 Aug 06
Posts: 4018
Credit: 0
RAC: 0
Message 46120 - Posted: 13 Sep 2007, 13:26:24 UTC
Last modified: 13 Sep 2007, 13:27:03 UTC

Jmarks, sorry for all the failed WUs. Rhiju has pulled those WUs and is working on a fix that will improve things there. Otherwise, about all you can do is cut your runtime preference. The theory is that if your normal credit per task is close to 20, then a failure granted 20 will not have such an impact.
Rosetta Moderator: Mod.Sense
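
To put numbers on that, here is a small worked example using claimed-credit values quoted earlier in this thread. The helper `credit_lost` is a made-up name, and treating the loss as simply "claimed minus the flat 20" is a simplifying assumption, not how the project accounts for credit.

```python
FAILED_GRANT = 20.0  # flat credit granted to the watchdog-ended results in this thread


def credit_lost(claimed):
    """Credit given up when a result is granted the flat 20 instead of its claim."""
    return max(0.0, claimed - FAILED_GRANT)


# Claimed credit values copied from results posted in this thread.
claimed_values = [48.3512209341757, 55.7602549562382, 62.7858592995201, 19.95]

for claimed in claimed_values:
    print(f"claimed {claimed:.2f} -> lost {credit_lost(claimed):.2f}")
```

The last value (19.95 claimed, 20 granted) is the case where the task was already worth about 20, so the flat grant costs nothing. The longer-running tasks give up roughly 28 to 43 credits each.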
Greg_BE
Joined: 30 May 06
Posts: 5664
Credit: 5,710,284
RAC: 2,004
Message 46143 - Posted: 13 Sep 2007, 19:55:47 UTC - in response to Message 46101.  

sorry, didn't notice.

i have the quad-core running on its own as a (more or less) dedicated cruncher, and am using the A64 3800+ for i-net / e-mail / ms office / etc.

so, really don't look at Rosie running, just check my results page every so often to make sure i'm seeing about what i expect to see...

greg_be ???
** so i see you're moving up..congrats penguin**
One more question -- did you happen to notice if the screen looked totally stuck before the crash?
(Probably too much to ask.)




©2024 University of Washington
https://www.bakerlab.org