Problems with Rosetta version 5.54

Message boards : Number crunching : Problems with Rosetta version 5.54

Rhiju
Volunteer moderator

Joined: 8 Jan 06
Posts: 223
Credit: 3,546
RAC: 0
Message 38555 - Posted: 29 Mar 2007, 0:59:38 UTC - in response to Message 38546.  

OK, I have a couple of ideas for what could be going wrong with the Macs -- the problem is that I can't test them locally, because all of our Macs are working beautifully. Can anyone out there with the Mac problem attach their BOINC to RALPH, and then post here? Then I can keep track of your host's results on RALPH with the current ralph app (5.55, likely still problematic) and with the next couple of updates (expect one tonight!).

Some Macs with problem results, just from following posts on the boards:
Cheryl PowerMac Darwin 8.9.0
Sauria PowerMac Darwin 8.8.0
Zifnab PowerMac [AltiVec] Darwin 8.8.0
Zifnab(2) PowerMac [AltiVec] Darwin 8.8.0
ninicool PowerMac [AltiVec] Darwin 8.8.0
Sergio Intel Darwin 8.9.1
David Riese 15 Macs, some steady, some error prone.
kctipton PowerMac [AltiVec] Darwin 7.9.0
Werner Grunow iMac Darwin 8.9.1
sauria PowerMac Darwin 8.8.0
Ian PowerMac [AltiVec] Darwin 7.9.0 - Some good, some bad results.

Some that are running great:
William Staman PowerMac [AltiVec] Darwin 8.9.0
GPV PowerMac [AltiVec] Darwin 8.9.0 - 1 err
GPV(2) Intel Darwin 8.9.1
GPV(3) Intel Darwin 8.9.1 - 1 err
anders n Intel Core 2 Darwin 8.9.1 - 1 wu stuck, but completed normally
Geoff Roynon PowerMac [AltiVec] Darwin 8.8.0 - results reported fine, but posted about CPU time being consumed with no progress.
Geoff Roynon(2) PowerMac [AltiVec] Darwin 8.8.0
morrisian PowerMac [AltiVec] Darwin 8.9.0
Stephen PowerMac Darwin 8.9.0
64Studebaker MacBookPro Darwin 8.9.1
64Studebaker(2) PowerMac Darwin 8.8.0
Snake Doctor PowerBook Darwin 8.9.0
Snake Doctor(2) PowerMac Darwin 8.9.0 - 2 errs
paddy MacBookPro Darwin 8.8.2 - 1 validate err
TLAF MacPro Darwin 8.8.1 - 24hr RT Pref
RedSled Intel Darwin 8.9.1
ChillyWilly5280 PowerMac [AltiVec] Darwin 8.9.0
ChillyWilly5280(2) PowerBook [AltiVec] Darwin 8.9.0


ID: 38555 · Rating: 0
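As a rough aid for reading the two host lists above, here is a minimal Python sketch that tallies them by Darwin version. It is only an illustration: the host lines are copied from the lists with the trailing notes dropped, the repeated sauria entry is assumed to be the same host and is counted once, and the names `tally`, `PROBLEM_HOSTS` and `OK_HOSTS` are made up for this example. With so few hosts, the counts should not be read as evidence for any particular cause.

```python
import re
from collections import Counter

# Host lines copied from the two lists above, trailing notes dropped.
PROBLEM_HOSTS = """
Cheryl PowerMac Darwin 8.9.0
Sauria PowerMac Darwin 8.8.0
Zifnab PowerMac [AltiVec] Darwin 8.8.0
Zifnab(2) PowerMac [AltiVec] Darwin 8.8.0
ninicool PowerMac [AltiVec] Darwin 8.8.0
Sergio Intel Darwin 8.9.1
David Riese 15 Macs, some steady, some error prone.
kctipton PowerMac [AltiVec] Darwin 7.9.0
Werner Grunow iMac Darwin 8.9.1
sauria PowerMac Darwin 8.8.0
Ian PowerMac [AltiVec] Darwin 7.9.0
"""

OK_HOSTS = """
William Staman PowerMac [AltiVec] Darwin 8.9.0
GPV PowerMac [AltiVec] Darwin 8.9.0
GPV(2) Intel Darwin 8.9.1
GPV(3) Intel Darwin 8.9.1
anders n Intel Core 2 Darwin 8.9.1
Geoff Roynon PowerMac [AltiVec] Darwin 8.8.0
Geoff Roynon(2) PowerMac [AltiVec] Darwin 8.8.0
morrisian PowerMac [AltiVec] Darwin 8.9.0
Stephen PowerMac Darwin 8.9.0
64Studebaker MacBookPro Darwin 8.9.1
64Studebaker(2) PowerMac Darwin 8.8.0
Snake Doctor PowerBook Darwin 8.9.0
Snake Doctor(2) PowerMac Darwin 8.9.0
paddy MacBookPro Darwin 8.8.2
TLAF MacPro Darwin 8.8.1
RedSled Intel Darwin 8.9.1
ChillyWilly5280 PowerMac [AltiVec] Darwin 8.9.0
ChillyWilly5280(2) PowerBook [AltiVec] Darwin 8.9.0
"""

DARWIN_RE = re.compile(r"Darwin (\d+\.\d+\.\d+)")


def tally(block):
    """Count hosts per Darwin version.

    Blank lines, case-insensitive duplicate lines, and lines without a
    Darwin version (e.g. the "15 Macs" entry) are skipped.
    """
    seen = set()
    counts = Counter()
    for line in block.splitlines():
        key = line.strip().lower()
        if not key or key in seen:
            continue
        seen.add(key)
        match = DARWIN_RE.search(line)
        if match:
            counts[match.group(1)] += 1
    return counts


problem = tally(PROBLEM_HOSTS)
ok = tally(OK_HOSTS)
for version in sorted(set(problem) | set(ok)):
    print(f"Darwin {version}: {problem[version]} problem, {ok[version]} OK")
```

Both lists contain Darwin 8.8.0, 8.9.0 and 8.9.1 hosts, so the OS version alone does not separate the failing machines from the working ones in this sample.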
Hurlburt

Joined: 17 Jan 07
Posts: 3
Credit: 10,559,822
RAC: 88
Message 38558 - Posted: 29 Mar 2007, 1:12:18 UTC - in response to Message 38557.  

All Mac users! Some fraction of Macs don't appear to be working -- although our own Macs (we test on three machines locally, one PowerPC, two Intel Macs) are running fine. We're looking into it! It would help us a lot if you post a link to your bad results.

Looks like 5.54 is hosed for MACs.

(that's a technical term)




I was running Rosetta on a dual G5 Mac, an Intel MacBook, and a Mac Pro. All work units started correctly and then hit the 1% problem. The work units indicated that they were running, and some actually did run for hours, but after a period of time the Mac Activity Monitor would show that Rosetta was not using any CPU time. All of the Macs are running OS X (10.4.9), and two of the three are running BOINC 5.8.15. Only one work unit completed successfully; I aborted the others when they did not appear to be running. (Sorry about the empty post above.)
ID: 38558 · Rating: 0
Zifnab
Joined: 25 Mar 07
Posts: 8
Credit: 6,369
RAC: 0
Message 38559 - Posted: 29 Mar 2007, 1:31:34 UTC - in response to Message 38555.  

Hey Rhiju, I just attached one of my Macs (should be this one) to Ralph. It gave me a "no work to process" error, but I'm assuming that's expected? It and the other one are both identical machines, except for the hard drive, as far as I know, but if you want me to attach the other one, I can.
ID: 38559 · Rating: 0
Rhiju
Volunteer moderator
Joined: 8 Jan 06
Posts: 223
Credit: 3,546
RAC: 0
Message 38560 - Posted: 29 Mar 2007, 2:16:45 UTC - in response to Message 38559.  

Zifnab, perfect! I'm sending out more work now. It would probably suffice to have just one machine on ralph -- can you just keep it at 100% ralph for the next three days or so? Thanks a bunch!

Hey Rhiju, I just attached one of my Macs (should be this one) to Ralph. It gave me a "no work to process" error, but I'm assuming that's expected? It and the other one are both identical machines, except for the hard drive, as far as I know, but if you want me to attach the other one, I can.


ID: 38560 · Rating: 0
Rhiju
Volunteer moderator
Joined: 8 Jan 06
Posts: 223
Credit: 3,546
RAC: 0
Message 38561 - Posted: 29 Mar 2007, 2:17:35 UTC

One more question for Zifnab and other Mac users who are having issues with 5.54: are you also having problems with other boinc apps like SETI@home or, say, Einstein@home?
ID: 38561 · Rating: 0
Slywy
Joined: 4 Feb 07
Posts: 3
Credit: 159,287
RAC: 23
Message 38562 - Posted: 29 Mar 2007, 2:22:37 UTC - in response to Message 38561.  

One more question for Zifnab and other Mac users who are having issues with 5.54: are you also having problems with other boinc apps like SETI@home or, say, Einstein@home?


No, Einstein and SETI are working great for me.

I just attached to RALPH to see if that would help in the effort to pin down the issue.
ID: 38562 · Rating: 0
Hurlburt
Joined: 17 Jan 07
Posts: 3
Credit: 10,559,822
RAC: 88
Message 38563 - Posted: 29 Mar 2007, 2:28:44 UTC - in response to Message 38562.  

One more question for Zifnab and other Mac users who are having issues with 5.54: are you also having problems with other boinc apps like SETI@home or, say, Einstein@home?


No, Einstein and SETI are working great for me.

I just attached to RALPH to see if that would help in the effort to pin down the issue.



Einstein and SETI are working fine.
ID: 38563 · Rating: 0
dumas777
Joined: 19 Nov 05
Posts: 39
Credit: 2,762,081
RAC: 0
Message 38572 - Posted: 29 Mar 2007, 5:03:34 UTC - in response to Message 38563.  

One more question for Zifnab and other Mac users who are having issues with 5.54: are you also having problems with other boinc apps like SETI@home or, say, Einstein@home?


No, Einstein and SETI are working great for me.

I just attached to RALPH to see if that would help in the effort to pin down the issue.



Einstein and SETI are working fine.


Einstein and WCG are running fine on my Mac Pro as well, but with Rosetta@home I had a work unit stuck at 15% as well as problems downloading a WU. I have never had a problem with Rosetta on Windows, but I have seen stuck work units from time to time on my Linux box and Mac. It's a good thing I like Dr. Baker and this project, because I would have detached long ago from any other project that gave me this much trouble. I understand the complexity of the app, but these bugs are getting in the way of science being done, not only for this project but for every other BOINC project running on these boxes as well.
ID: 38572 · Rating: 0
WalknPickn
Joined: 24 Mar 07
Posts: 1
Credit: 1,037,254
RAC: 0
Message 38573 - Posted: 29 Mar 2007, 5:16:21 UTC

Is there a way to get the software to report via something other than a direct connection to the Internet on port 2468 (I think that's the port number)? My work PC can only go to the Internet via a proxy server and the firewall is blocking direct access on port 2468. Ideally, I would like to see it use the proxy server for all communication.
ID: 38573 · Rating: 0
[AF>Le_Pommier] ninicool
Joined: 28 Feb 06
Posts: 3
Credit: 45,691
RAC: 0
Message 38575 - Posted: 29 Mar 2007, 5:41:31 UTC - in response to Message 38561.  
Last modified: 29 Mar 2007, 6:08:53 UTC

One more question for Zifnab and other Mac users who are having issues with 5.54: are you also having problems with other boinc apps like SETI@home or, say, Einstein@home?


I had some problems with Ralph (application version 5.54). (http://ralph.bakerlab.org/results.php?userid=864)

At the moment, WCG is running fine on my Mac G5.
SETI and Einstein are not running on my Mac.

Edit 1
Rhiju: I requested new work from Ralph and received a new unit, "Rosetta Beta 5.56". I hope the result will be good. We will see.

Edit 2
I just requested some new work from Predictor, but I have some problems there too:

Jeu 29 mar 07:46:41 2007||Restarting dtasser_hr_on_1g_3956_8 - message timeout
Jeu 29 mar 07:46:41 2007|Predictor @ Home|Restarting task dtasser_hr_on_1g_3956_8 using dtasser version 14
Jeu 29 mar 07:46:42 2007||[error] Process 8269 not found
Jeu 29 mar 07:50:17 2007||Restarting dtasser_hr_on_1g_3956_8 - message timeout
Jeu 29 mar 07:50:18 2007||[error] Process 8307 not found
Jeu 29 mar 07:54:28 2007||Restarting dtasser_hr_on_1g_3956_8 - message timeout
Jeu 29 mar 07:54:29 2007||[error] Process 8332 not found
ID: 38575 · Rating: 0
Zifnab
Joined: 25 Mar 07
Posts: 8
Credit: 6,369
RAC: 0
Message 38577 - Posted: 29 Mar 2007, 6:00:47 UTC - in response to Message 38560.  
Last modified: 29 Mar 2007, 6:02:37 UTC

Zifnab, perfect! I'm sending out more work now. It would probably suffice to have just one machine on ralph -- can you just keep it at 100% ralph for the next three days or so? Thanks a bunch!


Rhiju, we usually bring the labs down at night for various reasons, but tomorrow morning I'll make the changes to keep that machine up 24x7. Just don't be worried if you don't get any data back until after 7:30am EST, when the labs come back up. I didn't even think about that until now, and at 2am it's too late to get in there. Ralph's the only project attached to that machine right now, too, so we should be good to go.

I haven't tried any other projects but Rosetta, so I can't give you any feedback on whether they work. Do you want me to try attaching one of them to the other machine I have with BOINC on it right now?
ID: 38577 · Rating: 0
The Ox
Joined: 24 Sep 05
Posts: 5
Credit: 1,356,735
RAC: 0
Message 38578 - Posted: 29 Mar 2007, 6:08:27 UTC - in response to Message 38561.  
Last modified: 29 Mar 2007, 6:08:58 UTC

One more question for Zifnab and other Mac users who are having issues with 5.54: are you also having problems with other boinc apps like SETI@home or, say, Einstein@home?


Rhiju,

I have not been having any issues with SIMAP, WCG, Einstein, RCN, or Seti. There were some issues with a Predictor WU, but from what I understand about their new applications that is actually to be expected - and I haven't crunched any new WUs for Predictor for a few days now.

Regards,
Clint

ID: 38578 · Rating: 0
Shinichi Sawada
Joined: 26 Dec 06
Posts: 2
Credit: 585,543
RAC: 0
Message 38584 - Posted: 29 Mar 2007, 7:50:58 UTC

Hello.
My Mac Mini G4 had the problem, and I've just attached it to RALPH. Should I suspend SETI? (SETI is currently giving me no WUs.)
SETI runs fine. However, it sometimes gave an '[error] Process XXXX not found' message without causing any trouble crunching WUs.
ID: 38584 · Rating: 0
ramostol
Joined: 6 Feb 07
Posts: 64
Credit: 584,052
RAC: 0
Message 38591 - Posted: 29 Mar 2007, 9:15:36 UTC

Sorry, I just attached to Ralph and received a WU, but the WU on my iBook G4 (10.3.9) hangs after 14-17 sec. of CPU usage:

Tor 29 Mar 10:59:31 2007|ralph@home|Starting 1zih__BOINC_SMOOTH_INCREASE_CYCLES10_RNA_ABINITIO-1zih_-_1877_32_0
Tor 29 Mar 10:59:32 2007|ralph@home|Starting task 1zih__BOINC_SMOOTH_INCREASE_CYCLES10_RNA_ABINITIO-1zih_-_1877_32_0 using rosetta_beta version 556
Tor 29 Mar 11:03:08 2007||Restarting 1zih__BOINC_SMOOTH_INCREASE_CYCLES10_RNA_ABINITIO-1zih_-_1877_32_0 - message timeout
Tor 29 Mar 11:03:09 2007||[error] Process 2505 not found
Tor 29 Mar 11:06:49 2007||Restarting 1zih__BOINC_SMOOTH_INCREASE_CYCLES10_RNA_ABINITIO-1zih_-_1877_32_0 - message timeout
Tor 29 Mar 11:06:50 2007||[error] Process 2533 not found [[etc.]]

-- R. A. Mostol
ID: 38591 · Rating: 0
anders n
Joined: 19 Sep 05
Posts: 403
Credit: 537,991
RAC: 0
Message 38593 - Posted: 29 Mar 2007, 9:31:27 UTC - in response to Message 38561.  

One more question for Zifnab and other Mac users who are having issues with 5.54: are you also having problems with other boinc apps like SETI@home or, say, Einstein@home?


Einstein is working ok.

Anders n
ID: 38593 · Rating: 0
ramostol
Joined: 6 Feb 07
Posts: 64
Credit: 584,052
RAC: 0
Message 38601 - Posted: 29 Mar 2007, 10:18:28 UTC - in response to Message 38591.  

Sorry, I just attached to Ralph and received a WU, but the WU on my iBook G4 (10.3.9) hangs after 14-17 sec. of CPU usage:

......

Tor 29 Mar 11:06:49 2007||Restarting 1zih__BOINC_SMOOTH_INCREASE_CYCLES10_RNA_ABINITIO-1zih_-_1877_32_0 - message timeout
Tor 29 Mar 11:06:50 2007||[error] Process 2533 not found [[etc.]]

-- R. A. Mostol


It finished this way:

Result


2007|ralph@home|Restarting task 1zih__BOINC_SMOOTH_INCREASE_CYCLES10_RNA_ABINITIO-1zih_-_1877_32_0 using rosetta_beta version 556
Tor 29 Mar 11:14:11 2007||[error] Process 2582 not found
.....
Tor 29 Mar 11:17:50 2007||Restarting 1zih__BOINC_SMOOTH_INCREASE_CYCLES10_RNA_ABINITIO-1zih_-_1877_32_0 - message timeout
Tor 29 Mar 11:17:50 2007|ralph@home|Restarting task 1zih__BOINC_SMOOTH_INCREASE_CYCLES10_RNA_ABINITIO-1zih_-_1877_32_0 using rosetta_beta version 556
Tor 29 Mar 11:17:51 2007||[error] Process 2606 not found
Tor 29 Mar 11:18:00 2007|ralph@home|Computation for task 1zih__BOINC_SMOOTH_INCREASE_CYCLES10_RNA_ABINITIO-1zih_-_1877_32_0 finished
Tor 29 Mar 11:18:00 2007|ralph@home|Output file 1zih__BOINC_SMOOTH_INCREASE_CYCLES10_RNA_ABINITIO-1zih_-_1877_32_0_0 for task 1zih__BOINC_SMOOTH_INCREASE_CYCLES10_RNA_ABINITIO-1zih_-_1877_32_0 absent

-- R. A. Mostol
ID: 38601 · Rating: 0
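The BOINC message excerpts posted in this thread share a pattern: a "Restarting <task> - message timeout" line followed a second later by "[error] Process <pid> not found". The following is a minimal Python sketch for counting that pattern in a saved message log. It assumes the line layout is exactly as pasted above; the regular expressions and the function name `summarize` are illustrative and not part of BOINC.

```python
import re
from collections import Counter

# Matches lines such as:
#   Tor 29 Mar 11:03:08 2007||Restarting <task> - message timeout
RESTART_RE = re.compile(r"\|\|Restarting (\S+) - message timeout")
# Matches lines such as:
#   Tor 29 Mar 11:03:09 2007||[error] Process 2505 not found
NOT_FOUND_RE = re.compile(r"\[error\] Process (\d+) not found")


def summarize(log_lines):
    """Count timeout restarts per task and collect the PIDs reported missing."""
    restarts = Counter()
    missing_pids = []
    for line in log_lines:
        match = RESTART_RE.search(line)
        if match:
            restarts[match.group(1)] += 1
            continue
        match = NOT_FOUND_RE.search(line)
        if match:
            missing_pids.append(int(match.group(1)))
    return restarts, missing_pids


TASK = "1zih__BOINC_SMOOTH_INCREASE_CYCLES10_RNA_ABINITIO-1zih_-_1877_32_0"

# Sample lines taken from the log excerpts quoted in this thread.
sample = [
    f"Tor 29 Mar 11:03:08 2007||Restarting {TASK} - message timeout",
    "Tor 29 Mar 11:03:09 2007||[error] Process 2505 not found",
    f"Tor 29 Mar 11:06:49 2007||Restarting {TASK} - message timeout",
    "Tor 29 Mar 11:06:50 2007||[error] Process 2533 not found",
    # "Restarting task ... using ..." lines are project messages, not timeouts,
    # so they are deliberately not counted.
    f"Tor 29 Mar 11:17:50 2007|ralph@home|Restarting task {TASK} using rosetta_beta version 556",
]

restarts, missing_pids = summarize(sample)
for task, count in restarts.items():
    print(f"{task}: {count} timeout restart(s)")
print("PIDs not found:", missing_pids)
```

To run it against a real log, pass an open file instead of `sample`, for example `summarize(open("messages.txt"))`, where the file name is hypothetical.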
GPV67
Joined: 17 Sep 05
Posts: 2
Credit: 2,219,828
RAC: 0
Message 38660 - Posted: 29 Mar 2007, 17:18:29 UTC - in response to Message 38561.  

One more question for Zifnab and other Mac users who are having issues with 5.54: are you also having problems with other boinc apps like SETI@home or, say, Einstein@home?

Hi,
no issues.
Seti, WCG and Einstein are running well with BOINC 5.8.15 on both Macs (PPC and Intel).
ID: 38660 · Rating: 0
Rhiju
Volunteer moderator
Joined: 8 Jan 06
Posts: 223
Credit: 3,546
RAC: 0
Message 38674 - Posted: 29 Mar 2007, 18:34:54 UTC - in response to Message 38561.  

Thanks to everyone for responding. And we definitely now have good Mac representation on RALPH! Unfortunately, my first idea for a fix didn't work (fixed a stack overflow in ralph 5.56), so now I'm going to try to recompile with the very latest BOINC libraries for ralph 5.57.

It's also good to know that other apps (SETI@home, CPDN, Einstein@home) are working fine, though I was certainly intrigued by ssawada's comment about "Process not found" warnings with those other applications.

We'll see how 5.57 goes...

One more question for Zifnab and other Mac users who are having issues with 5.54: are you also having problems with other boinc apps like SETI@home or, say, Einstein@home?


ID: 38674 · Rating: 0
AMD_is_logical
Joined: 20 Dec 05
Posts: 299
Credit: 31,460,681
RAC: 0
Message 38691 - Posted: 29 Mar 2007, 20:28:40 UTC

The result https://boinc.bakerlab.org/rosetta/result.php?resultid=69921784 hung on one of my Linux nodes. It wasn't using any CPU time. It looks like the watchdog decided to end it, but this caused it to hang rather than exit.

I halted and restarted BOINC, and the WU restarted and continued crunching as if nothing was wrong.
ID: 38691 · Rating: 0
Sergio
Joined: 28 Jan 07
Posts: 2
Credit: 1,707
RAC: 0
Message 38723 - Posted: 30 Mar 2007, 7:38:44 UTC

ralph@home 5.57 has been running happily the whole night:
http://ralph.bakerlab.org/results.php?userid=2857

So, well, 5.57 seems to have solved the problem! Well done! :-)
I hope to see it soon on Rosetta.

Cheers, and thanks, Sergio
ID: 38723 · Rating: 0

©2024 University of Washington
https://www.bakerlab.org