Miscellaneous Work Unit Errors - II

Shoikan

Joined: 4 Apr 06
Posts: 14
Credit: 180,211
RAC: 0
Message 14032 - Posted: 18 Apr 2006, 12:33:53 UTC
Last modified: 18 Apr 2006, 12:34:35 UTC


TychoNJ

Joined: 29 Mar 06
Posts: 1
Credit: 17,174
RAC: 0
Message 14038 - Posted: 18 Apr 2006, 14:06:24 UTC

First time Rosetta poster.
One of my PCs hasn't finished a Rosetta WU in over a week. Every WU fails:

https://boinc.bakerlab.org/rosetta/results.php?hostid=192132

It's an Athlon XP 1700 running Windows XP SP2. It's a bit low on RAM (256 MB), but it ran Rosetta fine for several weeks before this started happening.

I have two other PCs running Rosetta, an Athlon64 (1GB RAM) with minimal failures, and a Celeron-D (512MB RAM) with almost zero failures.

Is my Athlon XP just getting troublesome work units or is it the PC? Should I just detach that one?
Divide Overflow

Joined: 17 Sep 05
Posts: 82
Credit: 921,382
RAC: 0
Message 14093 - Posted: 19 Apr 2006, 7:14:24 UTC

I just had one of these "Maximum CPU time exceeded" WUs on my Linux box: https://boinc.bakerlab.org/rosetta/result.php?resultid=17037408

That's a lot of wasted CPU time. :(
Mikkie

Joined: 1 Apr 06
Posts: 9
Credit: 5,700
RAC: 0
Message 14095 - Posted: 19 Apr 2006, 8:05:56 UTC
Last modified: 19 Apr 2006, 8:13:38 UTC

Ladies and Gentlemen, for the good cause [helping cancer research] I teamed up for Rosetta on 2006-04-01, but now, after 3 weeks, it's time to say goodbye. The last error did the trick.

2006-04-18 22:15:59 [rosetta@home] Unrecoverable error for result VP_PRODUCTION_core_vp26__442_161_0 ( - exit code -1073741811 (0xc000000d))
https://boinc.bakerlab.org/rosetta/result.php?resultid=173357

For the moment it's too much of a waste of time. I'll be back when the BOINC/Rosetta combination is a stable one.
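
For readers puzzling over the exit code in the log line above: BOINC prints the same Windows exit code twice, once as a signed 32-bit integer and once in hex. A minimal Python sketch of that conversion follows. The lookup table is illustrative and deliberately incomplete, and the helper names are made up for this example. The two status names are standard Windows NTSTATUS values, but nothing here says which call inside Rosetta failed or why.

```python
# Illustrative, incomplete table of Windows NTSTATUS codes.
KNOWN_NTSTATUS = {
    0xC0000005: "STATUS_ACCESS_VIOLATION",
    0xC000000D: "STATUS_INVALID_PARAMETER",
}


def exit_code_to_hex(exit_code: int) -> str:
    """Return the unsigned 32-bit hex form of a signed process exit code."""
    return f"0x{exit_code & 0xFFFFFFFF:08x}"


def describe(exit_code: int) -> str:
    """Format an exit code with its hex form and a status name if known."""
    unsigned = exit_code & 0xFFFFFFFF  # reinterpret as unsigned 32-bit
    name = KNOWN_NTSTATUS.get(unsigned, "unknown")
    return f"{exit_code} = {exit_code_to_hex(exit_code)} ({name})"


if __name__ == "__main__":
    print(describe(-1073741811))
```

The masking with 0xFFFFFFFF is the whole trick: it maps the negative number shown in the client log onto the hex value shown in parentheses next to it.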
Moderator9
Volunteer moderator

Joined: 22 Jan 06
Posts: 1014
Credit: 0
RAC: 0
Message 14103 - Posted: 19 Apr 2006, 12:01:41 UTC - in response to Message 14038.  

First time Rosetta poster.
One of my PCs hasn't finished a Rosetta WU in over a week. Every WU fails:

https://boinc.bakerlab.org/rosetta/results.php?hostid=192132

It's an Athlon XP 1700 running Windows XP SP2. It's a bit low on RAM (256 MB), but it ran Rosetta fine for several weeks before this started happening.

I have two other PCs running Rosetta, an Athlon64 (1GB RAM) with minimal failures, and a Celeron-D (512MB RAM) with almost zero failures.

Is my Athlon XP just getting troublesome work units or is it the PC? Should I just detach that one?


Attach it to RALPH and see if the problem goes away. If it doesn't, they will be able to use the information to diagnose the problem.

Moderator9
Dimitris Hatzopoulos

Joined: 5 Jan 06
Posts: 336
Credit: 80,939
RAC: 0
Message 14151 - Posted: 20 Apr 2006, 0:39:15 UTC

Not a bug, but you should compress (.gz) the various .alltopologycodes.bar files, if possible.

The one I see right now in front of me, 1tul__alltopologycodes.bar, is 4.3 MB uncompressed and compresses to 25 KB with RAR!
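
A rough sketch of why a file like this can shrink so much, assuming its contents are highly repetitive (the 4.3 MB to 25 KB figure suggests they are). The sample data below is synthetic, not the real file contents, and it uses Python's gzip module rather than RAR, so the exact ratio will differ from the one reported here.

```python
import gzip


def gzip_ratio(data: bytes) -> float:
    """Return compressed size divided by original size for gzip."""
    if not data:
        raise ValueError("data must not be empty")
    return len(gzip.compress(data)) / len(data)


# Synthetic stand-in for a repetitive text file (an assumption, not real data).
SAMPLE = b"0 1 0 1 1 0\n" * 100_000

if __name__ == "__main__":
    ratio = gzip_ratio(SAMPLE)
    print(f"{len(SAMPLE)} bytes compress to {ratio:.2%} of the original size")
```

For dialup users the saving applies to every download of the file, which is why serving it as .gz, like the .fasta.gz and .pdb.gz files, would help.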
Best UFO Resources
Wikipedia R@h
How-To: Join Distributed Computing projects that benefit humanity
Rhiju
Volunteer moderator

Joined: 8 Jan 06
Posts: 223
Credit: 3,546
RAC: 0
Message 14152 - Posted: 20 Apr 2006, 1:22:29 UTC - in response to Message 14151.  

Dimitri, thanks for pointing this out! A silly oversight on our part.
Not a bug, but you should compress (.gz) the various .alltopologycodes.bar files, if possible.

The one I see right now in front of me, 1tul__alltopologycodes.bar, is 4.3 MB uncompressed and compresses to 25 KB with RAR!


Dimitris Hatzopoulos

Joined: 5 Jan 06
Posts: 336
Credit: 80,939
RAC: 0
Message 14279 - Posted: 21 Apr 2006, 17:05:54 UTC - in response to Message 14152.  
Last modified: 21 Apr 2006, 17:08:17 UTC

Not a bug, but you should compress (.gz) the various .alltopologycodes.bar files, if possible.

The one I see right now in front of me, 1tul__alltopologycodes.bar, is 4.3 MB uncompressed and compresses to 25 KB with RAR!


Just a quick reminder that it still seems to be an issue (personally, I don't mind, but for folks with dialup...):

$ ll ~boinc/BOINC/projects/boinc.bakerlab.org_rosetta/
total 27384
-rw-r--r--  1 boinc boinc 4422600 2006-04-21 08:00 1tul__alltopologycodes.bar
-rw-r--r--  1 boinc boinc     630 2006-04-21 08:00 1tul_.bar
-rw-r--r--  1 boinc boinc   58176 2006-04-21 07:57 1tul_EnergyCorrection.txt
-rw-r--r--  1 boinc boinc     123 2006-04-21 08:01 1tul_.fasta.gz
-rw-r--r--  1 boinc boinc   16858 2006-04-21 08:00 1tul.pdb.gz
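
A check like the listing above can also be scripted. The following is only a sketch: the function name large_uncompressed and the 1 MB default threshold are hypothetical choices, and it treats any file without a .gz suffix as uncompressed, which is a naming heuristic rather than a real content check.

```python
from pathlib import Path


def large_uncompressed(directory, min_bytes=1_000_000):
    """List (name, size) for files without a .gz suffix of at least min_bytes.

    Results are sorted largest first. Subdirectories are not scanned.
    """
    hits = []
    for path in Path(directory).iterdir():
        if path.is_file() and path.suffix != ".gz":
            size = path.stat().st_size
            if size >= min_bytes:
                hits.append((path.name, size))
    return sorted(hits, key=lambda item: item[1], reverse=True)


if __name__ == "__main__":
    # Scans the current directory; point it at a BOINC project folder instead.
    for name, size in large_uncompressed("."):
        print(f"{size:>12} {name}")
```

Run against the project directory shown above, a check of this kind would be expected to flag the 4422600-byte .bar file and skip the .gz files.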


R/B

Joined: 8 Dec 05
Posts: 195
Credit: 28,095
RAC: 0
Message 14526 - Posted: 24 Apr 2006, 8:09:16 UTC

computer ID 92884

4 Apr 2006 12:25:21 UTC
name DOUBLE_SS_WEIGHT_1cei__419_179

Result ID: 16115749
WU ID: 13176948

Is this a ghost unit? This particular machine has been running almost 24/7 and never times out.

Don't know if this is important or not but thought I'd put it here anyway.





Founder of BOINC GROUP - Objectivists - Philosophically minded rational data crunchers.

