WU Problem Linux

Message boards : Number crunching : WU Problem Linux

To post messages, you must log in.

AuthorMessage
Profile Chilean
Avatar

Send message
Joined: 16 Oct 05
Posts: 711
Credit: 26,694,507
RAC: 0
Message 44671 - Posted: 3 Aug 2007, 23:48:26 UTC

On my LInux box... once ina while i get workunits that are supposedly "running" but really arent.
The CPU meters go down to 0%... although BOINC says its "running". And the WU makes no progress. SO I have to suspend it and let the next WU to work.
Anybody know the reason??

Thanks!! ^_^
ID: 44671 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Profile KSMarksPsych
Avatar

Send message
Joined: 15 Oct 05
Posts: 199
Credit: 22,337
RAC: 0
Message 44675 - Posted: 4 Aug 2007, 10:06:22 UTC

What version of BOINC?

Even the latest alpha is woefully behind the times as compared to Windows and Mac.

:(

I think I just ran into the exact same issue last night with CPDN beta.
Kathryn :o)
The BOINC FAQ Service
The Unofficial BOINC Wiki
The Trac System
More BOINC information than you can shake a stick of RAM at.
ID: 44675 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
zombie67 [MM]
Avatar

Send message
Joined: 11 Feb 06
Posts: 316
Credit: 6,589,590
RAC: 317
Message 44679 - Posted: 4 Aug 2007, 14:31:15 UTC - in response to Message 44671.  

On my LInux box... once ina while i get workunits that are supposedly "running" but really arent.
The CPU meters go down to 0%... although BOINC says its "running". And the WU makes no progress. SO I have to suspend it and let the next WU to work.
Anybody know the reason??

Thanks!! ^_^

This happened to me all the time back when my machines ran linux. It required constant monitoring to make sure they weren't sitting idle. That, combined with lower credit claims, pushed me back to windows. This was about a year ago.
Reno, NV
Team: SETI.USA
ID: 44679 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
FluffyChicken
Avatar

Send message
Joined: 1 Nov 05
Posts: 1260
Credit: 369,635
RAC: 0
Message 44680 - Posted: 4 Aug 2007, 15:26:42 UTC
Last modified: 4 Aug 2007, 15:27:28 UTC

While the credit claims is not a problem in Rosetta and many other projects now.

I've had my fair share of problems under linux.

I've use 5.4.x and 5.8.x both in Ubuntu 7.04 (edubuntu actually) I have played around before that but cannot remember exact problems.

Now things to look out for are.
- SpeedStep/Cool&Quiet techniques kicking in, Often mines sat at 600MHz (Pentium-M) when the computer is idle and boinc is running. I have to override it and force it to full speed.

- Memory, which maybe what's happening to you. Check your setting for memory percentage to use. If Rosetta starts to go over that amount it will just stop, drop to zero processor use and well do as you say. Happens in Windows too btw. Check in BoincManager and the message to see what s happening.

Other things like it just quiting for no reason, but that's not happened with 5.8.x version.


P.S. If you unhide your computers, we can look at the unit's themselves and you computer stats.


Team mauisun.org
ID: 44680 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Profile Chilean
Avatar

Send message
Joined: 16 Oct 05
Posts: 711
Credit: 26,694,507
RAC: 0
Message 44688 - Posted: 4 Aug 2007, 21:04:42 UTC - in response to Message 44680.  

While the credit claims is not a problem in Rosetta and many other projects now.

I've had my fair share of problems under linux.

I've use 5.4.x and 5.8.x both in Ubuntu 7.04 (edubuntu actually) I have played around before that but cannot remember exact problems.

Now things to look out for are.
- SpeedStep/Cool&Quiet techniques kicking in, Often mines sat at 600MHz (Pentium-M) when the computer is idle and boinc is running. I have to override it and force it to full speed.

- Memory, which maybe what's happening to you. Check your setting for memory percentage to use. If Rosetta starts to go over that amount it will just stop, drop to zero processor use and well do as you say. Happens in Windows too btw. Check in BoincManager and the message to see what s happening.

Other things like it just quiting for no reason, but that's not happened with 5.8.x version.


P.S. If you unhide your computers, we can look at the unit's themselves and you computer stats.


But of course. Now that I remmeber... this happened on a windows machine. Only that i saw a message saying "waiting for memory".
SO I bumped the memory form 50% to 80%.
Thanks for the solutions!!
ID: 44688 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
DJStarfox

Send message
Joined: 19 Jul 07
Posts: 145
Credit: 1,239,073
RAC: 373
Message 44702 - Posted: 5 Aug 2007, 13:53:53 UTC - in response to Message 44671.  

On my LInux box... once ina while i get workunits that are supposedly "running" but really arent.
The CPU meters go down to 0%... although BOINC says its "running". And the WU makes no progress. SO I have to suspend it and let the next WU to work.
Anybody know the reason??

Thanks!! ^_^


I have the same problem, but it's not due to memory. The program just sits in a QUIT_PENDING state and will not exit. Then when BOINC goes to resume the task, it just hangs. I have to kill the pid, and then BOINC will restart it properly. This has been a problem since day 1 and it does not matter whether leave-apps-in-memory setting is yes or no.
ID: 44702 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
mray

Send message
Joined: 3 Dec 06
Posts: 4
Credit: 31,477,070
RAC: 15,056
Message 44704 - Posted: 5 Aug 2007, 15:11:30 UTC

I was having this problem on a Linux Mepis box. I updated to the latest version of BOINC and the problem seems to be reduced or perhaps even fixed. I haven't caught it sleeping on the job in a while at least. I think I'll check my memory allocation though, I have seen that "Waiting for memory" error even though the machines should have plenty of meemory for the task.

ID: 44704 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
DJStarfox

Send message
Joined: 19 Jul 07
Posts: 145
Credit: 1,239,073
RAC: 373
Message 44705 - Posted: 5 Aug 2007, 15:16:07 UTC - in response to Message 44704.  

I was having this problem on a Linux Mepis box. I updated to the latest version of BOINC and the problem seems to be reduced or perhaps even fixed. I haven't caught it sleeping on the job in a while at least. I think I'll check my memory allocation though, I have seen that "Waiting for memory" error even though the machines should have plenty of meemory for the task.


I'm running BOINC version 5.8.16 for i686. I haven't upgraded to 5.10.* because they are still working out serious bugs last time I checked.
ID: 44705 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
computerguy09

Send message
Joined: 17 Sep 05
Posts: 2
Credit: 5,293,068
RAC: 0
Message 44771 - Posted: 7 Aug 2007, 1:33:38 UTC - in response to Message 44705.  

I was having this problem on a Linux Mepis box. I updated to the latest version of BOINC and the problem seems to be reduced or perhaps even fixed. I haven't caught it sleeping on the job in a while at least. I think I'll check my memory allocation though, I have seen that "Waiting for memory" error even though the machines should have plenty of meemory for the task.


I'm running BOINC version 5.8.16 for i686. I haven't upgraded to 5.10.* because they are still working out serious bugs last time I checked.


I saw this problem on my Ubuntu box (dual core AMD) while running 5.8.x, and early versions of 5.10.x. I've been running 5.10.8 for over 2 weeks now with no "hanging" processes. Used to have to restart BOINC every day or so on this (and other) boxes, but haven't seen it since going to 5.10.8 of BOINC.

Mark
ID: 44771 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
DJStarfox

Send message
Joined: 19 Jul 07
Posts: 145
Credit: 1,239,073
RAC: 373
Message 44792 - Posted: 7 Aug 2007, 20:05:03 UTC - in response to Message 44771.  

Hmm...well, let me download 5.10.8 or higher and check it out for myself. It's still considered beta, so I'd rather finish my current climate model first.

Seems fishy that only project Rosetta would not exit cleanly; all other projects perform this fine (Climate, SETI, Einstein).
ID: 44792 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote

Message boards : Number crunching : WU Problem Linux



©2024 University of Washington
https://www.bakerlab.org