Waiting for a team answer to this question

Message boards : Number crunching : Waiting for a team answer to this question

To post messages, you must log in.

Previous · 1 · 2

AuthorMessage
Profile Chilean
Avatar

Send message
Joined: 16 Oct 05
Posts: 711
Credit: 26,694,507
RAC: 0
Message 73986 - Posted: 9 Oct 2012, 4:25:25 UTC - in response to Message 73819.  
Last modified: 9 Oct 2012, 4:26:04 UTC

don't understand how Version 7 would foul up just some specific tasks.


I don't understand that too, but since according to other threads v7 is known to be responsible for some errors on rosetta, downgrade to v6 would be the first thing I'd try (unless I'd need to stay with v7 for some other reason).


I did 2 things and am now able to crunch for Rosie again...1st I downgraded to the version 6 series of Boinc, 2nd I STOPPED all gpu crunching on those machines that ALSO crunch for Rosie. I have had 1 or maybe even 2 funky units since then but ALL the rest are finishing just fine! Knocking on wood here!!!!!!!!!!!!!!


I just had to downgrade to 6.X due to the endless errors from the "validator". I am also running GPUGRID on this machine... hopefully I don't have to suspend GPUGRID to crunch for rossie when it should be completely compatible (I've always wanted to crunch for GPUGRID). I'm currently waiting for the servers to send me new WUs to crunch.

Computer in question: https://boinc.bakerlab.org/rosetta/show_host_detail.php?hostid=1569699
ID: 73986 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Profile Chilean
Avatar

Send message
Joined: 16 Oct 05
Posts: 711
Credit: 26,694,507
RAC: 0
Message 73987 - Posted: 9 Oct 2012, 5:31:59 UTC

Alright, so downgrading didn't work. I uninstalled BOINC completely, deleted all the program data... etc, then installed 6.12.34 x64, THEN attached ONLY Rosetta. Let's see if the problem persists. If so, I'll try with the x32, then if the problem keeps happening, I'll switch to POEM. At least there I can run POEM happily under the CPU and let my GPU run GPUGRID.
ID: 73987 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
mikey
Avatar

Send message
Joined: 5 Jan 06
Posts: 1895
Credit: 9,217,610
RAC: 1,154
Message 73990 - Posted: 9 Oct 2012, 11:23:36 UTC - in response to Message 73987.  

Alright, so downgrading didn't work. I uninstalled BOINC completely, deleted all the program data... etc, then installed 6.12.34 x64, THEN attached ONLY Rosetta. Let's see if the problem persists. If so, I'll try with the x32, then if the problem keeps happening, I'll switch to POEM. At least there I can run POEM happily under the CPU and let my GPU run GPUGRID.


I had to both downgrade AND stop crunching with my gpu's to crunch for Rosie, that won't work for me so I am now crunching elsewhere.
ID: 73990 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Profile Chilean
Avatar

Send message
Joined: 16 Oct 05
Posts: 711
Credit: 26,694,507
RAC: 0
Message 73991 - Posted: 9 Oct 2012, 17:56:36 UTC
Last modified: 9 Oct 2012, 17:58:02 UTC

Problem still persists: https://boinc.bakerlab.org/rosetta/results.php?hostid=1569699

Will try the x32 version as a last resort. This is unbelievable.

Edit: PC is brand new, and passes all CPU stress tests (using Linx).
ID: 73991 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Profile Chilean
Avatar

Send message
Joined: 16 Oct 05
Posts: 711
Credit: 26,694,507
RAC: 0
Message 73992 - Posted: 9 Oct 2012, 18:46:09 UTC

Check this out:

https://boinc.bakerlab.org/rosetta/workunit.php?wuid=487991930

The "failed" WU is from my machine using version 7.x, the other is from another cruncher ALSO running version 7.x, yet his WUs is validated as valid.
ID: 73992 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Profile Chilean
Avatar

Send message
Joined: 16 Oct 05
Posts: 711
Credit: 26,694,507
RAC: 0
Message 73993 - Posted: 9 Oct 2012, 18:55:44 UTC

Another "failed" WU:

<core_client_version>6.12.34</core_client_version>
<![CDATA[
<stderr_txt>
[2012-10- 9 15: 9:36:] :: BOINC:: Initializing ... ok.
[2012-10- 9 15: 9:36:] :: BOINC :: boinc_init()
BOINC:: Setting up shared resources ... ok.
BOINC:: Setting up semaphores ... ok.
BOINC:: Updating status ... ok.
BOINC:: Registering timer callback... ok.
BOINC:: Worker initialized successfully.
Registering options..
Registered extra options.
Initializing broker options ...
Registered extra options.
Initializing core...
Initializing options.... ok
Options::initialize()
Options::adding_options()
Options::initialize() Check specs.
Options::initialize() End reached
Loaded options.... ok
Processed options.... ok
Initializing random generators... ok
Initialization complete.
Initializing options.... ok
Options::initialize()
Options::adding_options()
Options::initialize() Check specs.
Options::initialize() End reached
Loaded options.... ok
Processed options.... ok
Initializing random generators... ok
Initialization complete.
Setting WU description ...
Unpacking zip data: ../../projects/boinc.bakerlab.org_rosetta/minirosetta_database_rev50262.zip
Setting database description ...
Setting up checkpointing ...
Setting up graphics native ...
Setting up folding (abrelax) ...
Beginning folding (abrelax) ...
BOINC:: Worker startup.
Starting watchdog...
Watchdog active.
Starting work on structure: _00001
# cpu_run_time_pref: 3600
======================================================
DONE :: 1 starting structures 2450.64 cpu seconds
This process generated 1 decoys from 1 attempts
======================================================
BOINC :: WS_max 2.36925e+008

BOINC :: Watchdog shutting down...
BOINC :: BOINC support services shutting down cleanly ...
called boinc_finish

</stderr_txt>
]]>

Using BOINC version 6.12.34. Leaving Rosetta until they fix this.
ID: 73993 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Profile Chilean
Avatar

Send message
Joined: 16 Oct 05
Posts: 711
Credit: 26,694,507
RAC: 0
Message 73994 - Posted: 9 Oct 2012, 21:17:55 UTC

Updated BIOS, still same error:

https://boinc.bakerlab.org/rosetta/result.php?resultid=536660539

Changing to WCG since if has similar goals as Rosetta, though I would've loved to crunch with this machine for Rosetta.
ID: 73994 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
mikey
Avatar

Send message
Joined: 5 Jan 06
Posts: 1895
Credit: 9,217,610
RAC: 1,154
Message 73995 - Posted: 10 Oct 2012, 11:55:00 UTC - in response to Message 73994.  

Updated BIOS, still same error:

https://boinc.bakerlab.org/rosetta/result.php?resultid=536660539

Changing to WCG since if has similar goals as Rosetta, though I would've loved to crunch with this machine for Rosetta.


WELCOME to WCG!!! I too left Rosetta, I am still here checking to see if they have fixed it yet, and I now crunch for WCG with my cpu's. You can use ANY version of Boinc at WCG NOT just the 'recommended' one. And important plus for me is that thye don't care if you use your gpu for crunching or not, in fact WCG itself is currently beta testing a gpu app themselves!
ID: 73995 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Previous · 1 · 2

Message boards : Number crunching : Waiting for a team answer to this question



©2024 University of Washington
https://www.bakerlab.org