Problems with Minirosetta v1.54

Message boards : Number crunching : Problems with Minirosetta v1.54

To post messages, you must log in.

Previous · 1 . . . 10 · 11 · 12 · 13 · 14 · 15 · Next

AuthorMessage
Snags

Send message
Joined: 22 Feb 07
Posts: 198
Credit: 2,798,190
RAC: 702
Message 60599 - Posted: 10 Apr 2009, 22:27:45 UTC

I have a 10 preferred runtime on my MacBook Pro. I spotted lb_all_multi_threshold.2.0_hb_t317__IGNORE_THE_REST_1I9SA_12_10355_4_0
still running at 10 hours and 20 minutes so I opened the graphics window to check on it. It was on model 33, step 1920, stage unk. Checking on it later it had run another cpu hour but failed to make any progress so I shut down BOINC completely and restarted. it now showed 5 hours and 20 minutes cpu time consumed, all the rest the same. Within a few seconds it returned to step 0 and apparently restarted model 33 over from the beginning. I didn't catch exactly when it reached step 1920 but it would have been about 3-4 cpu minutes after the restart. It didn't get stuck this time but continued on its merry way. It also moved out of the unk stage by the time I glanced at it 4+ minutes after restart. It has now finished successfully and validated with 58 models completed in 10 (non-stuck)hours.

Hope this helps.

Snags
ID: 60599 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
jswolf19

Send message
Joined: 3 Apr 09
Posts: 3
Credit: 1,040,577
RAC: 0
Message 60605 - Posted: 11 Apr 2009, 14:08:57 UTC

I was looking through the RALPH minirosetta v1.54 bug thread and found an issue about setting day-of-week overrides (http://ralph.bakerlab.org/forum_thread.php?id=432&nowrap=true#4590). I had some set on network usage that when I cleared and restarted BOINC (which I upgraded to 6.6.20) I started registering progress on a minirosetta task (as well as having some stderr progress past

Initializing options.... ok

This appears to have been the cause of my problem.
ID: 60605 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
TomaszPawel

Send message
Joined: 28 Apr 07
Posts: 54
Credit: 2,791,145
RAC: 0
Message 60620 - Posted: 14 Apr 2009, 10:59:53 UTC

ID: 60620 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Profile Gavin Shaw
Avatar

Send message
Joined: 1 Feb 07
Posts: 10
Credit: 506,456
RAC: 0
Message 60638 - Posted: 14 Apr 2009, 23:09:36 UTC

While not exactly a bug, this morning I had a rather large upload file...

Task 243404526 had a 6.8MB file to upload. The task only run for about 50 minutes and my preference is set to 4 hours. It did 99 decoys from 99 attempts.

Thought admin might want to know...

Never surrender and never give up. In the darkest hour there is always hope.

ID: 60638 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Mod.Sense
Volunteer moderator

Send message
Joined: 22 Aug 06
Posts: 4018
Credit: 0
RAC: 0
Message 60639 - Posted: 15 Apr 2009, 0:20:20 UTC

Wow, good thing the watchdog only lets 99 models run. Just imagine how large it would have been with a 4 hour run!
Rosetta Moderator: Mod.Sense
ID: 60639 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
svincent

Send message
Joined: 30 Dec 05
Posts: 219
Credit: 11,805,838
RAC: 0
Message 60650 - Posted: 15 Apr 2009, 15:50:53 UTC

This task failed on Mac with an error in pairtermderiv that's been reported previously.

243548575

Hbond tripped.

ERROR: dis==0 in pairtermderiv!
ERROR:: Exit from: src/core/scoring/methods/PairEnergy.cc line: 338
BOINC:: Error reading and gzipping output datafile: default.out
called boinc_finish

</stderr_txt>


ID: 60650 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Mod.Sense
Volunteer moderator

Send message
Joined: 22 Aug 06
Posts: 4018
Credit: 0
RAC: 0
Message 60651 - Posted: 15 Apr 2009, 16:41:26 UTC
Last modified: 15 Apr 2009, 16:41:53 UTC

further notes to svincent's failed task

ERROR: dis==0 in pairtermderiv!
ERROR:: Exit from: src/core/scoring/methods/PairEnergy.cc line: 338
BOINC:: Error reading and gzipping output datafile: default.out

...and only 198 seconds of runtime.
Rosetta Moderator: Mod.Sense
ID: 60651 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Mod.Sense
Volunteer moderator

Send message
Joined: 22 Aug 06
Posts: 4018
Credit: 0
RAC: 0
Message 60653 - Posted: 15 Apr 2009, 18:27:09 UTC
Last modified: 15 Apr 2009, 18:27:52 UTC


!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!
Alert about problem WUs.

Problem task names all begin with "res_careful_". For details on which proteins are known to have problems and should be aborted, and which will run OK and should be run normally, please see the link above.
!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!

Rosetta Moderator: Mod.Sense
ID: 60653 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
l_mckeon

Send message
Joined: 5 Jun 07
Posts: 44
Credit: 180,717
RAC: 0
Message 60657 - Posted: 15 Apr 2009, 21:27:57 UTC

The following two tasks had shorter run times than usual (about 1:30 hrs and 1:50 hrs from memory) and their uploads totalled around 16MB.

rest3d85_ip40_1t4w.patchdock.6.pdb_0002_fa_dock.xml_score12_pert38_DOCK_10797_354_0_0
rest3d85_ip40_1t4w.patchdock.6.pdb_0002_fa_dock.xml_score12_pert38_DOCK_10797_354_0_0

ID: 60657 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Mod.Sense
Volunteer moderator

Send message
Joined: 22 Aug 06
Posts: 4018
Credit: 0
RAC: 0
Message 60659 - Posted: 15 Apr 2009, 22:45:20 UTC

l_mckeon, yes, those tasks hit the 99 model limit before reaching your normal runtime preference.
Rosetta Moderator: Mod.Sense
ID: 60659 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Profile Gavin Shaw
Avatar

Send message
Joined: 1 Feb 07
Posts: 10
Credit: 506,456
RAC: 0
Message 60660 - Posted: 15 Apr 2009, 23:29:56 UTC

Had another big one overnight.

Task 243710356 was another 6.8MB upload, again with 99 decoys.

Of course I have now seen a post about some problem with units, but it didn't help as the unit had already crunched :)

Never surrender and never give up. In the darkest hour there is always hope.

ID: 60660 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Sid Celery

Send message
Joined: 11 Feb 08
Posts: 1966
Credit: 38,188,338
RAC: 11,005
Message 60661 - Posted: 16 Apr 2009, 0:14:14 UTC

Another Validation error with this job:

crys__BOINC_ABRELAX_R120G_CRYSTALLIN_SAVE_ALL_OUT_IGNORE_THE_REST-S25-9-S3-3--crys_-_9344_11912_2

No errors reported within the Task Details of any of them.

Previous ones reported here and here.
ID: 60661 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Profile Paul D. Buck

Send message
Joined: 17 Sep 05
Posts: 815
Credit: 1,812,737
RAC: 0
Message 60662 - Posted: 16 Apr 2009, 6:48:51 UTC

Looks like I might have gotten one of the problems:

ERROR: [ERROR] Unable to open constraints file: resample_outward0.05_ub0.1_lb0.02_median.t364_.cst
ERROR:: Exit from: ....srccorescoringconstraintsConstraintIO.cc line: 330
BOINC:: Error reading and gzipping output datafile: default.out

task 243804881
ID: 60662 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Speedy
Avatar

Send message
Joined: 25 Sep 05
Posts: 163
Credit: 800,690
RAC: 173
Message 60678 - Posted: 17 Apr 2009, 3:21:20 UTC

This task https://boinc.bakerlab.org/rosetta/result.php?resultid=243902658 made 99 decoys & the upload was about 7.14MB is this normal for these tasks?
Have a crunching good day!!
ID: 60678 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Profile Greg_BE
Avatar

Send message
Joined: 30 May 06
Posts: 5658
Credit: 5,670,291
RAC: 2,328
Message 60680 - Posted: 17 Apr 2009, 7:27:34 UTC - in response to Message 60678.  

This task https://boinc.bakerlab.org/rosetta/result.php?resultid=243902658 made 99 decoys & the upload was about 7.14MB is this normal for these tasks?


there is a limiter built into the program. it stops the crunching at 99 decoys.
this is normal.
ID: 60680 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Speedy
Avatar

Send message
Joined: 25 Sep 05
Posts: 163
Credit: 800,690
RAC: 173
Message 60685 - Posted: 17 Apr 2009, 9:12:28 UTC - in response to Message 60680.  

This task https://boinc.bakerlab.org/rosetta/result.php?resultid=243902658 made 99 decoys & the upload was about 7.14MB is this normal for these tasks?


there is a limiter built into the program. it stops the crunching at 99 decoys.
this is normal.

I'm aware of this thanks. greg be I think you misunderstood the question. I was referring to the the upload size of the work unit. Is the normal upload size 7.14MB for this type https://boinc.bakerlab.org/rosetta/result.php?resultid=243902658 of work unit?
Have a crunching good day!!
ID: 60685 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Mod.Sense
Volunteer moderator

Send message
Joined: 22 Aug 06
Posts: 4018
Credit: 0
RAC: 0
Message 60691 - Posted: 17 Apr 2009, 13:01:57 UTC

The more models completed, the larger the upload will be. The resulting increase in upload size is part of why mini put on the 99 model limit per task. So, it is normal, but will probably be reviewed and perhaps changed to run longer models in some way.
Rosetta Moderator: Mod.Sense
ID: 60691 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Profile Paul D. Buck

Send message
Joined: 17 Sep 05
Posts: 815
Credit: 1,812,737
RAC: 0
Message 60703 - Posted: 17 Apr 2009, 21:28:26 UTC

ID: 60703 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Profile Greg_BE
Avatar

Send message
Joined: 30 May 06
Posts: 5658
Credit: 5,670,291
RAC: 2,328
Message 60726 - Posted: 19 Apr 2009, 8:34:54 UTC
Last modified: 19 Apr 2009, 8:41:14 UTC

This weeks problems:

2 validate errors (non res_careful) and 3 res_careful errors (all listed as troubled from the res_careful thread)

rest3d85_ip40_2jkf.patchdock.25.pdb_0001_fa_dock.xml_score12_pert38_DOCK_10797_583_0
rest3d85_ip40_2v1l.patchdock.10.pdb_0001_fa_dock.xml_score12_pert38_DOCK_10797_499_0

no error message, just validate error.
ID: 60726 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Speedy
Avatar

Send message
Joined: 25 Sep 05
Posts: 163
Credit: 800,690
RAC: 173
Message 60728 - Posted: 19 Apr 2009, 10:18:57 UTC - in response to Message 60691.  

The more models completed, the larger the upload will be. The resulting increase in upload size is part of why mini put on the 99 model limit per task. So, it is normal, but will probably be reviewed and perhaps changed to run longer models in some way.

Thank you for explaining.

Have a crunching good day!!
ID: 60728 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Previous · 1 . . . 10 · 11 · 12 · 13 · 14 · 15 · Next

Message boards : Number crunching : Problems with Minirosetta v1.54



©2024 University of Washington
https://www.bakerlab.org