Problems with minirosetta version 1.+

Message boards : Number crunching : Problems with minirosetta version 1.+

To post messages, you must log in.

Previous · 1 · 2 · 3 · 4 · 5 · 6 · 7 · Next

AuthorMessage
kkupsch

Send message
Joined: 25 Nov 05
Posts: 9
Credit: 5,167,204
RAC: 0
Message 51424 - Posted: 16 Feb 2008, 17:59:17 UTC

I don't like the Mini WU's because suddenly the credit I recieve for work is at least 5 to 10 times smaller than before. I've cancelled all mini jobs starting with "score13_hb_envtest62". My guess is many PCs "testing" the Mini WU's are probably alot faster than old 2.4G Dual CPU Xeon Machine... so comparing similar tasks my PC won't get a very good score... but still the sudden decrease in perceived productivity on my PC caused me to think there is something wrong with the WU's or there is something wrong with the way credit is granted with respect to those WU's.

Example:
Computer ID: 494454
Task ID: 139520471
Name: score13_hb_envtest62_A_1scjB_2833_1905_0
Workunit: 127066346
Claimed credit 98.9446294668645
Granted credit 5.54573244250427
application version 1.07

ID: 51424 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Paul

Send message
Joined: 29 Oct 05
Posts: 193
Credit: 65,736,681
RAC: 460
Message 51426 - Posted: 16 Feb 2008, 19:27:44 UTC - in response to Message 51424.  

I don't like the Mini WU's because suddenly the credit I recieve for work is at least 5 to 10 times smaller than before. I've cancelled all mini jobs starting with "score13_hb_envtest62". My guess is many PCs "testing" the Mini WU's are probably alot faster than old 2.4G Dual CPU Xeon Machine... so comparing similar tasks my PC won't get a very good score... but still the sudden decrease in perceived productivity on my PC caused me to think there is something wrong with the WU's or there is something wrong with the way credit is granted with respect to those WU's.

Example:
Computer ID: 494454
Task ID: 139520471
Name: score13_hb_envtest62_A_1scjB_2833_1905_0
Workunit: 127066346
Claimed credit 98.9446294668645
Granted credit 5.54573244250427
application version 1.07


I noticed a similar decrease in credit and it killed my RAC. Usually, credit granted is very close to credit claimed. With the Mini apps, the credit granted is ALWAYS lower than the credit claimed.

It is likely due to the fact that the mini application is not yet fully optimized and will improve over time. It would be great to see mini applications optimized for processor type as well.



Thx!

Paul

ID: 51426 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Chris

Send message
Joined: 11 Jul 07
Posts: 1
Credit: 20,312
RAC: 0
Message 51434 - Posted: 16 Feb 2008, 23:25:25 UTC

My machine is running a WU on Mini 1.07.

On normal Rosetta, I think WUs were usually taking 6 to 10 hours to complete. (Pentium III 550MHz)

Mini 1.07 has taken over 50 hours to go from 99.5% complete to 99.7% complete. During this whole time, it has declared that it has 10 minutes to go.

Should I abort this WU? Is there any useful information that can be gleaned before I do?
ID: 51434 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Profile David E K
Volunteer moderator
Project administrator
Project developer
Project scientist

Send message
Joined: 1 Jul 05
Posts: 1018
Credit: 4,334,829
RAC: 0
Message 51436 - Posted: 17 Feb 2008, 5:08:26 UTC - in response to Message 51434.  

My machine is running a WU on Mini 1.07.

On normal Rosetta, I think WUs were usually taking 6 to 10 hours to complete. (Pentium III 550MHz)

Mini 1.07 has taken over 50 hours to go from 99.5% complete to 99.7% complete. During this whole time, it has declared that it has 10 minutes to go.

Should I abort this WU? Is there any useful information that can be gleaned before I do?


That's too long. Can you archive and compress the slot directory that it is running in and send it to dekim at u.washington.edu? then abort.
ID: 51436 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Ganesh

Send message
Joined: 24 Jan 08
Posts: 2
Credit: 42
RAC: 0
Message 51447 - Posted: 17 Feb 2008, 15:58:35 UTC

Hi

My system found a virus when downloading data from your site. Here is a
copy of the report issued by the anti-virus NOD-32:

NQENR  €T ô  âYÛ Ð ÞµÁuqÈÕN  http://s r v 3 . b a k e r l a b . o r g / r o s e t t a / d o w n l o a d / m i n i r o s e t t a _ 1 . 0 7 _ w i n d o w s _ i n t e l x 8 6 . e x e p r o b a b l y a v a r i a n t o f W i n 3 2 / S t a t i k a p p l i c a t i o n

Kindly look into this and let me have a feedback..

Thanks

Ganesh
ID: 51447 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Ganesh

Send message
Joined: 24 Jan 08
Posts: 2
Credit: 42
RAC: 0
Message 51448 - Posted: 17 Feb 2008, 15:58:40 UTC

Hi

My system found a virus when downloading data from your site. Here is a
copy of the report issued by the anti-virus NOD-32:

NQENR  €T ô  âYÛ Ð ÞµÁuqÈÕN  http://s r v 3 . b a k e r l a b . o r g / r o s e t t a / d o w n l o a d / m i n i r o s e t t a _ 1 . 0 7 _ w i n d o w s _ i n t e l x 8 6 . e x e p r o b a b l y a v a r i a n t o f W i n 3 2 / S t a t i k a p p l i c a t i o n

Kindly look into this and let me have a feedback..

Thanks

Ganesh
ID: 51448 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Profile dcdc

Send message
Joined: 3 Nov 05
Posts: 1829
Credit: 115,241,118
RAC: 46,153
Message 51449 - Posted: 17 Feb 2008, 16:25:56 UTC - in response to Message 51447.  

Hi

My system found a virus when downloading data from your site. Here is a
copy of the report issued by the anti-virus NOD-32:

NQENR  €T ô  âYÛ Ð ÞµÁuqÈÕN  http://s r v 3 . b a k e r l a b . o r g / r o s e t t a / d o w n l o a d / m i n i r o s e t t a _ 1 . 0 7 _ w i n d o w s _ i n t e l x 8 6 . e x e p r o b a b l y a v a r i a n t o f W i n 3 2 / S t a t i k a p p l i c a t i o n

Kindly look into this and let me have a feedback..

Thanks

Ganesh

Hi Ganesh

No need to worry - it's not a virus - it's a false-positive (NOD mis-identifies it):
NOD32 3 says Virus in file!
ID: 51449 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Mike Gelvin
Avatar

Send message
Joined: 7 Oct 05
Posts: 65
Credit: 10,612,039
RAC: 0
Message 51452 - Posted: 17 Feb 2008, 18:24:36 UTC

Sadly I have to stop running Rosetta. I am getting way too much grief from MiniRosetta vs NOD32. Ill keep running Ralph on one system and wait until this gets sorted out, and then I shall return. I'm disappointed in the response that has allowed this application to migrate and/or continue on Rosetta when this issue was identified on Ralph and not addressed. Reminds me of Predictor@Home.
ID: 51452 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Profile dcdc

Send message
Joined: 3 Nov 05
Posts: 1829
Credit: 115,241,118
RAC: 46,153
Message 51454 - Posted: 17 Feb 2008, 19:09:09 UTC - in response to Message 51452.  
Last modified: 17 Feb 2008, 19:10:53 UTC

Sadly I have to stop running Rosetta. I am getting way too much grief from MiniRosetta vs NOD32. Ill keep running Ralph on one system and wait until this gets sorted out, and then I shall return. I'm disappointed in the response that has allowed this application to migrate and/or continue on Rosetta when this issue was identified on Ralph and not addressed. Reminds me of Predictor@Home.

I agree - there are too many computers on Rosetta that shouldn't be having errors. Maybe Ralph should be a lot bigger using the more closely monitored PCs that have more knowledgeable users and Rosetta should be for mass production jobs where the results are being tested rather than the methodology... A 1% problem on a project of this size with this much throughput per computer is still a pretty big problem!
ID: 51454 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Pepo
Avatar

Send message
Joined: 28 Sep 05
Posts: 115
Credit: 101,358
RAC: 0
Message 51457 - Posted: 17 Feb 2008, 19:59:07 UTC - in response to Message 51415.  

I have a Rosetta mini on my computer at the moment and unless I abort it then it will probably stay there.
It ran for 4:15:09 (h:m:s) with my preference set to 6 hours, and says it has completed 100% but is still sitting there "Waiting to Run" in Boinc Manager.

It hapens occasionally, that application reaches the end of its timeslot. If the app now checkpoints shortly before (or just at) reaching 100%, it is immediately preempted by Boinc client and will silently wait for its next turn.

I am unable to upload it as B/M thinks it has not finished yet.
Should I just abort it?

No, no need to. Suspend client's network communication, then suspend all other projects and the application should be immediately started and finished soon.

I've just got one too (on Ralph) ;-)

After suspending other tasks, it finished immediately :-)

Peter
ID: 51457 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Todd

Send message
Joined: 1 Jul 07
Posts: 2
Credit: 65,209
RAC: 0
Message 51461 - Posted: 17 Feb 2008, 23:00:35 UTC - in response to Message 51289.  

hope this helps

log from NOD32:
2/9/2008 5:58:05 PM
HTTP filter
file
http://srv4.bakerlab.org/rosetta/download/minirosetta_1.07_windows_intelx86.exe
probably a variant of Win32/Statik application
connection terminated - quarantined
<pc name><user>
Threat was detected upon access to web by the application: C:Program FilesBOINCboinc.exe.

Boinc ver. 5.10.30
NOD32 ver. 3.0.551.0
Virus sig. 2861 (20080209)
WinXP SP2

ID: 51461 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Todd

Send message
Joined: 1 Jul 07
Posts: 2
Credit: 65,209
RAC: 0
Message 51463 - Posted: 17 Feb 2008, 23:04:48 UTC - in response to Message 51289.  

Hi,
I just wanted to add that nod 32 is giving the same error here. I am running Windows vista 64 bit edition. Here is a copy of the log from NOD 32.

2/14/2008 8:03:38 PM HTTP filter file http://srv1.bakerlab.org/rosetta/download/minirosetta_1.07_windows_x86_64.exe probably a variant of Win32/Statik application connection terminated - quarantined Bird-LandTodd admin Threat was detected upon access to web by the application: C:Program FilesBOINCboinc.exe.


Thanks,
Todd







hope this helps

log from NOD32:
2/9/2008 5:58:05 PM
HTTP filter
file
http://srv4.bakerlab.org/rosetta/download/minirosetta_1.07_windows_intelx86.exe
probably a variant of Win32/Statik application
connection terminated - quarantined
<pc name><user>
Threat was detected upon access to web by the application: C:Program FilesBOINCboinc.exe.

Boinc ver. 5.10.30
NOD32 ver. 3.0.551.0
Virus sig. 2861 (20080209)
WinXP SP2

ID: 51463 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Stan Wells

Send message
Joined: 24 Jan 06
Posts: 11
Credit: 1,567,537
RAC: 1,232
Message 51464 - Posted: 17 Feb 2008, 23:52:00 UTC

I have three dead work units from minirosetta on my Linux box (Ubuntu 7.10) running BOINC v 5.10.8. all three stopped (waiting to run) at approximately 59 min 44 seconds into the run (13.61 to 14.78% complete). since they download worked with every other run being a minirosetta it is completing one, running for about an hour on the minirosetta - does not say in the messages that it is stopping for a reason, just shows that it is starting the next work unit. stan

ID: 51464 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Stan Wells

Send message
Joined: 24 Jan 06
Posts: 11
Credit: 1,567,537
RAC: 1,232
Message 51465 - Posted: 18 Feb 2008, 0:05:50 UTC

Forgot to mention that I run an AMD 4000 dual core, 64 bit, 64 bit Ubuntu, with 1 gB of ram. I tried the start / stop / suspend project / suspend task / stop network activity, etc. When it started back up it went to the regular Rosetta work unit that it was working on in the first place - the other three are still "waiting to run" with just under an hour on the clock.
ID: 51465 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Profile Morgan the Gold
Avatar

Send message
Joined: 17 Sep 05
Posts: 3
Credit: 776,707
RAC: 0
Message 51466 - Posted: 18 Feb 2008, 0:07:18 UTC

:-( I run RAlph at a high rescorce share on a number of machines where its not critical if they hang or crash, Rosetta I liked to put on Peoples machines because its stable and has a nice screensaver, I wish I was visiting RAlph forum to because of a hung testbed, instead of seeing I'm part of a 3% failure rate :-(

ID: 51466 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Stan Wells

Send message
Joined: 24 Jan 06
Posts: 11
Credit: 1,567,537
RAC: 1,232
Message 51467 - Posted: 18 Feb 2008, 0:30:03 UTC

Sorry, shuda checked all this before I originally posted. I check previous good results - so far only one on the linux box has completed out of 4. these just started showing up. I have one on my Windows XP box that just finished and uploaded without problem after 2 hrs 35 minutes. this is a 64 bit system running a Windows XP home 32 bit edition - BOINC is 5.4.11. stan
ID: 51467 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Profile Conan
Avatar

Send message
Joined: 11 Oct 05
Posts: 150
Credit: 3,818,279
RAC: 980
Message 51470 - Posted: 18 Feb 2008, 5:49:54 UTC

I have noticed on both Ralph and Rosetta that ALL "mini" type Work Units only get about one third (1/3) of the credit I usually get on either project.

The other types of work units are ok just the "mini/score" ones and only with Windows.
All is OK on Linux for the points granted on the same WU type.
ID: 51470 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Profile marina

Send message
Joined: 8 Nov 07
Posts: 3
Credit: 6,376
RAC: 0
Message 51478 - Posted: 18 Feb 2008, 13:19:05 UTC

Hi, during the work Antivirus Nod32 intercected mini_rosetta like a virus.
In the message i find:
18/02/2008 14.06.58|rosetta@home|Started download of minirosetta_1.07_windows_intelx86.exe
18/02/2008 14.06.58|rosetta@home|Started download of minirosetta_database_rev20412.zip
18/02/2008 14.08.21|rosetta@home|Task CFR_034_V9_2842_8604_0 exited with a DLL initialization error.
18/02/2008 14.08.21|rosetta@home|If this happens repeatedly you may need to reboot your computer.
18/02/2008 14.08.21|rosetta@home|Restarting task CFR_034_V9_2842_8604_0 using rosetta_beta version 593
18/02/2008 14.08.31|rosetta@home|Finished download of minirosetta_1.07_windows_intelx86.exe
ID: 51478 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Profile Greg_BE
Avatar

Send message
Joined: 30 May 06
Posts: 5659
Credit: 5,691,837
RAC: 1,806
Message 51481 - Posted: 18 Feb 2008, 13:49:42 UTC
Last modified: 18 Feb 2008, 13:51:25 UTC

read this message from DEKwhere he says norton is making it a false positive.
ID: 51481 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Profile Greg_BE
Avatar

Send message
Joined: 30 May 06
Posts: 5659
Credit: 5,691,837
RAC: 1,806
Message 51482 - Posted: 18 Feb 2008, 13:50:05 UTC
Last modified: 18 Feb 2008, 13:50:39 UTC

ooops sorry
ID: 51482 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Previous · 1 · 2 · 3 · 4 · 5 · 6 · 7 · Next

Message boards : Number crunching : Problems with minirosetta version 1.+



©2024 University of Washington
https://www.bakerlab.org