Problems and Technical Issues with Rosetta@home

Message boards : Number crunching : Problems and Technical Issues with Rosetta@home

To post messages, you must log in.

Previous · 1 . . . 205 · 206 · 207 · 208 · 209 · 210 · 211 . . . 309 · Next

AuthorMessage
kotenok2000
Avatar

Send message
Joined: 22 Feb 11
Posts: 272
Credit: 507,897
RAC: 334
Message 106057 - Posted: 26 Apr 2022, 11:38:41 UTC - in response to Message 106056.  

They run up to 2% and freeze. With wall of errors in stderr.txt
ID: 106057 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Mr P Hucker
Avatar

Send message
Joined: 12 Aug 06
Posts: 1600
Credit: 12,116,986
RAC: 9,863
Message 106058 - Posted: 26 Apr 2022, 16:24:59 UTC - in response to Message 106041.  

That's strange. How vbox processes tasks is above my head.
Computermeze or whatever his name is knows more about that kind of stuff.
Not on speaking terms ROFL! And I thought it was a woman.

Have you asked in Cosmo forum at all if anyone knows why 6 does not work?
Maybe post in Github and see what the experts say.
I think the program needs to be written to work in 6. From what the admin at Kryptos said, it's easier to program in 5. Kinda like a driver for Windows 7 might not work in 10.
ID: 106058 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Mr P Hucker
Avatar

Send message
Joined: 12 Aug 06
Posts: 1600
Credit: 12,116,986
RAC: 9,863
Message 106059 - Posted: 26 Apr 2022, 16:26:25 UTC - in response to Message 106043.  

ID: 106059 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
kotenok2000
Avatar

Send message
Joined: 22 Feb 11
Posts: 272
Credit: 507,897
RAC: 334
Message 106060 - Posted: 26 Apr 2022, 16:27:39 UTC - in response to Message 106059.  

tacc is empty too.
ID: 106060 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Mr P Hucker
Avatar

Send message
Joined: 12 Aug 06
Posts: 1600
Credit: 12,116,986
RAC: 9,863
Message 106061 - Posted: 26 Apr 2022, 16:28:08 UTC - in response to Message 106045.  
Last modified: 26 Apr 2022, 16:29:23 UTC

I can't run Quchem.
It goes to 60% and becomes unstable.
VmJob unmanagable message.
2022-04-25 20:38:52 (2024): VM state change detected. (old = 'running', new = 'paused')
2022-04-25 21:05:27 (2024): VM state change detected. (old = 'paused', new = 'running')
2022-04-25 21:06:03 (2024): Creating new snapshot for VM.
2022-04-25 21:06:11 (2024): Deleting stale snapshot.
2022-04-25 21:06:12 (2024): Checkpoint completed.
2022-04-25 21:10:16 (2024): VM state change detected. (old = 'running', new = 'paused')
2022-04-25 21:14:52 (2024): VM state change detected. (old = 'paused', new = 'running')
2022-04-25 21:16:38 (2024): VM state change detected. (old = 'running', new = 'paused')
2022-04-25 21:32:41 (2024): ERROR: Vboxwrapper lost communication with VirtualBox, rescheduling task for a later time.

And GPU gives me the exit child error on their new pythons.
Working here on a variety of Windows 11 machines, using VB 5. Oldest has 8GB of DDR2! I can run one QuChem per 2GB of RAM (almost). Something up with the QuChem tasks, I'm getting tasks ending in _9, so I checked and there are loads of (Linux) hosts churning through several thousand and failing them in 1 second. Missing libraries? I asked over there, but the forum is quiet.
ID: 106061 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Mr P Hucker
Avatar

Send message
Joined: 12 Aug 06
Posts: 1600
Credit: 12,116,986
RAC: 9,863
Message 106062 - Posted: 26 Apr 2022, 16:36:14 UTC - in response to Message 106046.  

Maybe there isn't enough ram for python gpu?
WHAT? GPU? On QuChem? Where? I want.
ID: 106062 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
kotenok2000
Avatar

Send message
Joined: 22 Feb 11
Posts: 272
Credit: 507,897
RAC: 334
Message 106063 - Posted: 26 Apr 2022, 16:37:06 UTC - in response to Message 106062.  

gpugrid
ID: 106063 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Mr P Hucker
Avatar

Send message
Joined: 12 Aug 06
Posts: 1600
Credit: 12,116,986
RAC: 9,863
Message 106064 - Posted: 26 Apr 2022, 16:37:52 UTC - in response to Message 106047.  

Oops, yes Treesize. I can see how useful it might be, but I keep a pretty tight ship at the best of times, so no need for it here.
I'd used Boinctasks before, but prior to installing VirtualBox, and I didn't have the kind of problems that BoincTasks would solve back then, so it just seemed an unnecessary duplication.
Nothing against it - just not enough going for it with my limited uses. Until now.
I have 7 computers, controlling all those individually would be ridiculous. AFAIK you have 4 active machines. That would be enough for me to use Boinctasks.
ID: 106064 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Mr P Hucker
Avatar

Send message
Joined: 12 Aug 06
Posts: 1600
Credit: 12,116,986
RAC: 9,863
Message 106065 - Posted: 26 Apr 2022, 16:38:51 UTC - in response to Message 106048.  

Maybe there isn't enough ram for python gpu?

48 gigs? not enough?
I got a ACEMD 3 and running ATLAS and Prime Grid and only using 32% of my total RAM
I don't think its RAM.

The STDERR goes on about memory leaks in its setup, but this is fresh RAM (well half new and half less new but not ancient)
You could run memtest (I always do for any new/used RAM I obtain), but I think a memory leak is a programming error, not a hardware fault.
ID: 106065 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Mr P Hucker
Avatar

Send message
Joined: 12 Aug 06
Posts: 1600
Credit: 12,116,986
RAC: 9,863
Message 106066 - Posted: 26 Apr 2022, 16:40:07 UTC - in response to Message 106056.  

I have only 12 GB RAM on thjis Windows 11 PC and can run both rosetta python and QuChem, but not at the same time. QuChem runs also on my Linux Virtual Machine with 8 GB RAM. I could not run rosetta python on it. Now I am running Rosetta 4.20 on the Windows 11 PC.
Tullio
I find VB on the main machine I'm trying to use gives a very sluggish Windows 11 interface. I run it on the 6 Boinc only machines and do native tasks on this one.
ID: 106066 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
kotenok2000
Avatar

Send message
Joined: 22 Feb 11
Posts: 272
Credit: 507,897
RAC: 334
Message 106067 - Posted: 26 Apr 2022, 16:41:02 UTC - in response to Message 106066.  

It happens on windows 10 too.
ID: 106067 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Mr P Hucker
Avatar

Send message
Joined: 12 Aug 06
Posts: 1600
Credit: 12,116,986
RAC: 9,863
Message 106068 - Posted: 26 Apr 2022, 16:41:05 UTC - in response to Message 106063.  
Last modified: 26 Apr 2022, 16:41:23 UTC

gpugrid
GPUGrid needs Nvidias. I don't own Nvidias, I find AMD gives more bang for the buck. I've got my SP GPUs on Folding@Home and my DP GPUs on Milkyway.
ID: 106068 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Mr P Hucker
Avatar

Send message
Joined: 12 Aug 06
Posts: 1600
Credit: 12,116,986
RAC: 9,863
Message 106069 - Posted: 26 Apr 2022, 16:42:18 UTC - in response to Message 106067.  
Last modified: 26 Apr 2022, 16:43:05 UTC

It happens on windows 10 too.
I was blaming my AMD CPU, but my Intels do the same, it's just I don't usually interact with them directly. Somebody somewhere said the CPU has to do quite a lot of work to switch a core between VB and native Windows. But even if I just run some cores on VB and leave loads free for Windows, it's still sluggish.
ID: 106069 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
kotenok2000
Avatar

Send message
Joined: 22 Feb 11
Posts: 272
Credit: 507,897
RAC: 334
Message 106070 - Posted: 26 Apr 2022, 16:44:29 UTC - in response to Message 106069.  

Are they real cores or hyperthreaded ones?
ID: 106070 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Mr P Hucker
Avatar

Send message
Joined: 12 Aug 06
Posts: 1600
Credit: 12,116,986
RAC: 9,863
Message 106071 - Posted: 26 Apr 2022, 17:01:25 UTC - in response to Message 106070.  
Last modified: 26 Apr 2022, 17:02:15 UTC

Are they real cores or hyperthreaded ones?
I have 12 cores, 24 if with HT. Even if I run one 8 core Atlas from LHC, the interface sticks sometimes. It's not evenly slow, it's a 3 second pause every so often. Maybe windows doesn't share the cores around evenly so they get swapped? Someone did tell me that once, not sure if Windows 11 improved.
ID: 106071 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
kotenok2000
Avatar

Send message
Joined: 22 Feb 11
Posts: 272
Credit: 507,897
RAC: 334
Message 106072 - Posted: 26 Apr 2022, 17:02:13 UTC - in response to Message 106071.  

And beeping in the speakers?
ID: 106072 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Mr P Hucker
Avatar

Send message
Joined: 12 Aug 06
Posts: 1600
Credit: 12,116,986
RAC: 9,863
Message 106073 - Posted: 26 Apr 2022, 17:03:03 UTC - in response to Message 106072.  

And beeping in the speakers?
I got that a few times.....
ID: 106073 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Mr P Hucker
Avatar

Send message
Joined: 12 Aug 06
Posts: 1600
Credit: 12,116,986
RAC: 9,863
Message 106074 - Posted: 26 Apr 2022, 17:04:09 UTC - in response to Message 106070.  

Are they real cores or hyperthreaded ones?
OT friendly message:

I see you are in Russia. How are things doing over there? In my opinion the West is punishing the civilians and not the leader. We're also punishing ourselves with these stupid sanctions, everything costs more everywhere.
ID: 106074 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Profile Greg_BE
Avatar

Send message
Joined: 30 May 06
Posts: 5691
Credit: 5,859,226
RAC: 0
Message 106075 - Posted: 26 Apr 2022, 17:44:32 UTC - in response to Message 106060.  

tacc is empty too.



tacc is leftovers and very rarely has anything.
I tried them and left them after getting about 20 tasks and then a long dry spell.
They are a waste of time for the serious cruncher.
ID: 106075 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Profile Greg_BE
Avatar

Send message
Joined: 30 May 06
Posts: 5691
Credit: 5,859,226
RAC: 0
Message 106076 - Posted: 26 Apr 2022, 17:46:22 UTC - in response to Message 106065.  
Last modified: 26 Apr 2022, 17:50:38 UTC

Maybe there isn't enough ram for python gpu?

48 gigs? not enough?
I got a ACEMD 3 and running ATLAS and Prime Grid and only using 32% of my total RAM
I don't think its RAM.

The STDERR goes on about memory leaks in its setup, but this is fresh RAM (well half new and half less new but not ancient)
You could run memtest (I always do for any new/used RAM I obtain), but I think a memory leak is a programming error, not a hardware fault.


I figured as much as well since it is in the starting sequence.
I've never had a memory leak error for real since I have been on BOINC.

But it still doesn't tell me what the 195 (0xc3) EXIT_CHILD_FAILED error is caused by.
Run through this log and see if you can identify what the problem is: https://www.gpugrid.net/result.php?resultid=32886434
ID: 106076 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Previous · 1 . . . 205 · 206 · 207 · 208 · 209 · 210 · 211 . . . 309 · Next

Message boards : Number crunching : Problems and Technical Issues with Rosetta@home



©2024 University of Washington
https://www.bakerlab.org