Problems with Rosetta version 5.43

Message boards : Number crunching : Problems with Rosetta version 5.43

To post messages, you must log in.

Previous · 1 · 2 · 3 · 4 · 5 . . . 8 · Next

AuthorMessage
Mod.Sense
Volunteer moderator

Send message
Joined: 22 Aug 06
Posts: 4018
Credit: 0
RAC: 0
Message 32759 - Posted: 16 Dec 2006, 17:05:32 UTC

I moved daniels' post here, as he's got Linux problems with 5.43 as well as having seen them on 5.41.

process exited with code 131 (0x83)
process got signal 11

Rosetta Moderator: Mod.Sense
ID: 32759 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
googloo
Avatar

Send message
Joined: 15 Sep 06
Posts: 133
Credit: 22,813,645
RAC: 3,531
Message 32763 - Posted: 16 Dec 2006, 19:50:05 UTC

12/16/2006 2:39:16 PM|rosetta@home|Unrecoverable error for result 1opd__BOINC_POSE_ABRELAX_NEWRELAXFLAGS_frags83__1449_313_0 ( - exit code -1073741783 (0xc0000029))

ID: 32763 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
P . P . L .

Send message
Joined: 20 Aug 06
Posts: 581
Credit: 4,865,274
RAC: 0
Message 32765 - Posted: 16 Dec 2006, 20:44:59 UTC

Is there something about FRA_ W.U.'s or is it they don't like me, the last

three i've had have all errored.

https://boinc.bakerlab.org/rosetta/result.php?resultid=52186178

12/16/2006 23:26:21|rosetta@home|Unrecoverable error for result FRA_t103_test_LARS_constraints_hom002_9_IGNORE_THE_RESTS_00001_0015478_0.pdb_1427_729_1 ( - exit code -1073741819 (0xc0000005))

ID: 32765 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Philip

Send message
Joined: 23 Oct 06
Posts: 6
Credit: 89,430
RAC: 0
Message 32776 - Posted: 17 Dec 2006, 1:14:29 UTC - in response to Message 32733.  

Philip,

The integrated Intel (945) has been listed before when graphics problems happen.

Is that using the trackpad, nipple or a mouse ?

USB mouse of Logitech pedigree.


First try the newer graphics drivers, I think there are a lot of improvments in them Intel 945GM series drivers for WinXP

Thanks for the link - I installed the newer drivers. But I'm far too scared to try to test this again. At least on a job that's been running for several hours already. Maybe if I see a new job firing up I'll do some target practice on it.


P.S. Have you check to see if your battery is one of the recall (exploding Sony) ones. That laptop is on the risk list.

Oh dear. I might have a potential IED sitting on my lap?! *shudder*

Nah, it's got a Toshiba label. Hopefully that's authentic. But it does have a warning label that includes the word "explosion"... maybe time to add a few items to the christmas shopping list: Plasma screen, a very long VGA cable, remote keyboard and one of them nice blast proof bins.
ID: 32776 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Chu

Send message
Joined: 23 Feb 06
Posts: 120
Credit: 112,439
RAC: 0
Message 32788 - Posted: 17 Dec 2006, 5:27:10 UTC - in response to Message 32765.  

Googloo and Peter, the error code indicates most likely a graphic-related problem. 5.43 has some improvement on this issue, but does not solve the problem totally. We are still working on searching for the real problem right now, meanwhile, please try to leave the screensaver or graphic off. Sorry for the inconvenience.
Is there something about FRA_ W.U.'s or is it they don't like me, the last

three i've had have all errored.

https://boinc.bakerlab.org/rosetta/result.php?resultid=52186178

12/16/2006 23:26:21|rosetta@home|Unrecoverable error for result FRA_t103_test_LARS_constraints_hom002_9_IGNORE_THE_RESTS_00001_0015478_0.pdb_1427_729_1 ( - exit code -1073741819 (0xc0000005))


ID: 32788 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
charlls

Send message
Joined: 15 Dec 06
Posts: 2
Credit: 112,435
RAC: 0
Message 32791 - Posted: 17 Dec 2006, 6:07:34 UTC


I have this problem also with a intel mac running 10.4.8. The problem is surely not limited to the 945 intel video chipset since i have ati video (mbp). I just don't open the graphics and the simulation goes fine
ID: 32791 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Profile hedera
Avatar

Send message
Joined: 15 Jul 06
Posts: 76
Credit: 5,263,150
RAC: 144
Message 32793 - Posted: 17 Dec 2006, 6:57:40 UTC
Last modified: 17 Dec 2006, 7:04:57 UTC

Help! I seem to have been dropped from rosetta@home! I have no tasks, and I can't connect to the project. Is it down? Did I do something? I can ping Rosetta@home but I can't connect.
--hedera

Never be afraid to try something new. Remember that amateurs built the ark. Professionals built the Titanic.

ID: 32793 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Profile hedera
Avatar

Send message
Joined: 15 Jul 06
Posts: 76
Credit: 5,263,150
RAC: 144
Message 32795 - Posted: 17 Dec 2006, 7:11:08 UTC

Now this is odd. I couldn't connect to Rosetta, as I just said. So I brought up the Windows task manager, and saw that boincmanager.exe was still running (boinc.exe was not). I killed the process, then went to the start menu and started boincmanager.exe again. Lo and behold, there was rosetta@home again, running just fine. I'm running 5.43 now. Since there was zero activity in BOINC manager, and also zero logs, before I restarted, I have no error messages, and the current logs look perfectly normal.
--hedera

Never be afraid to try something new. Remember that amateurs built the ark. Professionals built the Titanic.

ID: 32795 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
googloo
Avatar

Send message
Joined: 15 Sep 06
Posts: 133
Credit: 22,813,645
RAC: 3,531
Message 32796 - Posted: 17 Dec 2006, 7:43:51 UTC

Thanks, Chu. I figured that out eventually (smacking my palm against my forehead). I have quit using BOINC as my screensaver until you get this fixed. That is, I have changed to the blank screensaver in Windows XP. The unrecoverable errors seem to have ceased for now.
ID: 32796 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
P . P . L .

Send message
Joined: 20 Aug 06
Posts: 581
Credit: 4,865,274
RAC: 0
Message 32797 - Posted: 17 Dec 2006, 8:06:23 UTC
Last modified: 17 Dec 2006, 8:07:03 UTC

Thanks Chu.

But i don't use the screensaver never have and didn't touch graphics button, I was not

there when it happened. As it only ran for around 6 min don't no but as i said

that makes last 3 of the FRA W.U.s that have crashed.

ID: 32797 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
FluffyChicken
Avatar

Send message
Joined: 1 Nov 05
Posts: 1260
Credit: 369,635
RAC: 0
Message 32800 - Posted: 17 Dec 2006, 8:59:03 UTC - in response to Message 32776.  


P.S. Have you check to see if your battery is one of the recall (exploding Sony) ones. That laptop is on the risk list.

Oh dear. I might have a potential IED sitting on my lap?! *shudder*

Nah, it's got a Toshiba label. Hopefully that's authentic. But it does have a warning label that includes the word "explosion"... maybe time to add a few items to the christmas shopping list: Plasma screen, a very long VGA cable, remote keyboard and one of them nice blast proof bins.[/quote]

It say Toshiba on it since it is in a Toshiba laptop, the actual manufacturer of the cells could still be Sony.

If you havn't checked (with the checker at Toshiba's site) do so, you may get a brand new fresh battery :-)




To others, this does not just happen with Intel Integrated graphics
ATI was the first manufacturer brought up, Intel was also another common one. Though Nvidia has seen this happen.

It may not be totally dependent on the graphics type.

It could also be BOINC (there are fixes to screensaver/graphics start/stopping/stalling in the 5.8.0 client) OR just something simple in the code that more often than not appears when graphics are initiated.

DirectX December refresh has just been released by the way.

P.S. I only asked about the mouse as i remember a boinc news post where some problems with older Intellipoint drivers...
Team mauisun.org
ID: 32800 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Philip

Send message
Joined: 23 Oct 06
Posts: 6
Credit: 89,430
RAC: 0
Message 32810 - Posted: 17 Dec 2006, 16:05:47 UTC - in response to Message 32738.  

My granted credit is really low with this new version and its associated work units. It's killing my RAC.

Hmm. I haven't noticed the average granted credit going down significantly, but it has become MUCH more variable post 5.43.

My claimed credit has always averaged around 73.2 +/- 1. Granted credit for pre-5.43 WU's averaged 87.4, with std dev 4.77. My granted credit for 5.43 WU's is admittedly somewhat lower at 84.3, but the std dev is a whopping 14.4! Whereas >95% of pre-5.43 jobs had GC's between 77 and 95, I'm now seeing figures all over the map from 63 to 118.

If I understand the new credit scheme correctly, you might expect variability to be high when you're doing new types of WU's that don't have a long history of granted credits. So I might just be getting a lot of new WU's. But it does seem a bit coincidental. Can anyone explain?
ID: 32810 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Profile Feet1st
Avatar

Send message
Joined: 30 Dec 05
Posts: 1755
Credit: 4,690,520
RAC: 0
Message 32815 - Posted: 17 Dec 2006, 18:30:54 UTC

hedera It sounds like you had the same problem that I've been seeing and reported here. BOINC loses contact and doesn't display anything. You must have at least seen a project list though to try and update, so your symptoms are a little different then mine.
Add this signature to your EMail:
Running Microsoft's "System Idle Process" will never help cure cancer, AIDS nor Alzheimer's. But running Rosetta@home just might!
https://boinc.bakerlab.org/rosetta/
ID: 32815 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Profile Feet1st
Avatar

Send message
Joined: 30 Dec 05
Posts: 1755
Credit: 4,690,520
RAC: 0
Message 32816 - Posted: 17 Dec 2006, 18:39:28 UTC

Phillip, it really depends upon the timeframe you are looking at. There were some really LONG running models out there with some of the new work on docking and fibrils. Those tasks had huge varience in credit per model. And part of the reason was that the models could take soooo looong. And part of the reason was that the new science code they built in to Rosetta had a sort of quick kill switch. It would work a given model about half way and decide if it was reasonable to work it to completion or if we should cut out losses here and start on a new one. The later release, I don't see it mentioned in the release notes, improved the runtime on these tasks. I recall seeing about post saying 3x improvement. This minimized the varience somewhat.

Bottom line, I believe you are correct, we're now in to types of WUs with CAPRI and docking and fibrils which have a higher varience in credit per model. Or, should I say, a higher varience in how long it takes to crunch a given model to earn the credit?
Add this signature to your EMail:
Running Microsoft's "System Idle Process" will never help cure cancer, AIDS nor Alzheimer's. But running Rosetta@home just might!
https://boinc.bakerlab.org/rosetta/
ID: 32816 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Profile hedera
Avatar

Send message
Joined: 15 Jul 06
Posts: 76
Credit: 5,263,150
RAC: 144
Message 32819 - Posted: 17 Dec 2006, 19:27:50 UTC

Still having graphics problems with 5.43, I'm afraid. I started everything up this morning and put task graphics on, not in the screen saver, just a window. Came back a couple of hours later, the graphics window is gone, and I had this error:

12/17/2006 9:45:17 AM|rosetta@home|Unrecoverable error for result 5croA_BOINC_POSE_ABRELAX_NEWRELAXFLAGS_frags83__1449_434_0 ( - exit code -1073741819 (0xc0000005))

This was about 35 minutes after the task had picked up again on startup. My Windows event log had no errors at all.

Feet1st, my earlier error didn't quite match yours as I showed nothing whatever in the BOINC Manager, not even a project to update. Every tab in the Manager was blank, and when I tried to connect to Rosetta with my password, using the Advanced dropdown, a just got "unable to connect".
--hedera

Never be afraid to try something new. Remember that amateurs built the ark. Professionals built the Titanic.

ID: 32819 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Helix

Send message
Joined: 7 Apr 06
Posts: 1
Credit: 502,727
RAC: 0
Message 32823 - Posted: 17 Dec 2006, 20:02:36 UTC

I was watching the graphics screen, but walked away and when I came back I noticed this error in my messsages log. This is the first error I have ever received running Rosette.

12/17/2006 2:48:49 PM|rosetta@home|Unrecoverable error for result 1tig__BOINC_POSE_ABRELAX_VARY_SC_BOND_ANGLES_NEWRELAXFLAGS_frags83__1451_898_0 ( - exit code 1073807364 (0x40010004))

ID: 32823 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Profile Feet1st
Avatar

Send message
Joined: 30 Dec 05
Posts: 1755
Credit: 4,690,520
RAC: 0
Message 32824 - Posted: 17 Dec 2006, 20:20:59 UTC - in response to Message 32819.  

Feet1st, my earlier error didn't quite match yours as I showed nothing whatever in the BOINC Manager, not even a project to update. Every tab in the Manager was blank...


Actually that is EXACTLY what happens to me (and others). I see, since everything was blank you tried to attach to the project again. Seems to be a BOINC problem. We're discussing it more in the thread I linked, as it does not seem to relate to a specific Rosetta version.

Add this signature to your EMail:
Running Microsoft's "System Idle Process" will never help cure cancer, AIDS nor Alzheimer's. But running Rosetta@home just might!
https://boinc.bakerlab.org/rosetta/
ID: 32824 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Profile Jim

Send message
Joined: 15 Oct 06
Posts: 22
Credit: 5,410,546
RAC: 0
Message 32826 - Posted: 17 Dec 2006, 21:30:56 UTC - in response to Message 32759.  

I moved daniels' post here, as he's got Linux problems with 5.43 as well as having seen them on 5.41.

process exited with code 131 (0x83)
process got signal 11


ID: 32826 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Profile Jim

Send message
Joined: 15 Oct 06
Posts: 22
Credit: 5,410,546
RAC: 0
Message 32827 - Posted: 17 Dec 2006, 21:42:59 UTC - in response to Message 32759.  

I moved daniels' post here, as he's got Linux problems with 5.43 as well as having seen them on 5.41.

process exited with code 131 (0x83)
process got signal 11


I am seeing the same thing on both of my Linux machines.
I can make the WU finish by shutting down the computer and restarting it.
I tried to suspend the project and let WCG run for a while then resume the
Rosetta WU but that didn't work.
The Rosetta projects get to about 58.71% done then just set there for hours doing nothing.
I did see this before "once" that I remember with 5.41 but 5.43 has become a brick
wall. Both machines have 256 meg of RAM. One has Linspire Linux and the other has Mandriva 2007.

https://boinc.bakerlab.org/rosetta/result.php?resultid=51996472
https://boinc.bakerlab.org/rosetta/result.php?resultid=51738512

Jim
ID: 32827 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Angel

Send message
Joined: 14 Dec 06
Posts: 1
Credit: 3,865
RAC: 0
Message 32862 - Posted: 18 Dec 2006, 18:50:04 UTC

Hi,
I can't get new projects since yesterday.

All I get in the reports is :

18.12.2006 19:37:53|rosetta@home|Sending scheduler request to https://boinc.bakerlab.org/rosetta_cgi/cgi
18.12.2006 19:37:53|rosetta@home|Reason: Requested by user
18.12.2006 19:37:53|rosetta@home|(not requesting new work or reporting completed tasks)
18.12.2006 19:37:58|rosetta@home|Scheduler request succeeded

Can anyone tell me what's wrong? I didn't change anything any my other projects run just fine.

Thanx.

ID: 32862 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Previous · 1 · 2 · 3 · 4 · 5 . . . 8 · Next

Message boards : Number crunching : Problems with Rosetta version 5.43



©2024 University of Washington
https://www.bakerlab.org