Message boards : Number crunching : Problems with Minirosetta v1.54
Author | Message |
---|---|
Snags Joined: 22 Feb 07 Posts: 198 Credit: 2,888,320 RAC: 0 |
I have a 10-hour preferred runtime on my MacBook Pro. I spotted lb_all_multi_threshold.2.0_hb_t317__IGNORE_THE_REST_1I9SA_12_10355_4_0 still running at 10 hours and 20 minutes, so I opened the graphics window to check on it. It was on model 33, step 1920, stage unk. When I checked on it later it had run another CPU hour but failed to make any progress, so I shut down BOINC completely and restarted. It now showed 5 hours and 20 minutes of CPU time consumed, all the rest the same. Within a few seconds it returned to step 0 and apparently restarted model 33 from the beginning. I didn't catch exactly when it reached step 1920, but it would have been about 3-4 CPU minutes after the restart. It didn't get stuck this time but continued on its merry way. It had also moved out of the unk stage by the time I glanced at it 4+ minutes after the restart. It has now finished successfully and validated with 58 models completed in 10 (non-stuck) hours. Hope this helps. Snags |
jswolf19 Joined: 3 Apr 09 Posts: 3 Credit: 1,040,577 RAC: 0 |
I was looking through the RALPH minirosetta v1.54 bug thread and found an issue about setting day-of-week overrides (http://ralph.bakerlab.org/forum_thread.php?id=432&nowrap=true#4590). I had some set on network usage. When I cleared them and restarted BOINC (which I upgraded to 6.6.20), I started registering progress on a minirosetta task (as well as seeing stderr output progress past "Initializing options.... ok"). This appears to have been the cause of my problem. |
TomaszPawel Joined: 28 Apr 07 Posts: 54 Credit: 2,791,145 RAC: 0 |
|
Gavin Shaw Joined: 1 Feb 07 Posts: 10 Credit: 506,456 RAC: 0 |
While not exactly a bug, this morning I had a rather large upload file... Task 243404526 had a 6.8MB file to upload. The task only ran for about 50 minutes and my preference is set to 4 hours. It did 99 decoys from 99 attempts. Thought admin might want to know... Never surrender and never give up. In the darkest hour there is always hope. |
Mod.Sense Volunteer moderator Joined: 22 Aug 06 Posts: 4018 Credit: 0 RAC: 0 |
Wow, good thing the watchdog only lets 99 models run. Just imagine how large it would have been with a 4 hour run! Rosetta Moderator: Mod.Sense |
svincent Joined: 30 Dec 05 Posts: 219 Credit: 12,120,035 RAC: 0 |
This task failed on Mac with an error in pairtermderiv that's been reported previously. 243548575 Hbond tripped. ERROR: dis==0 in pairtermderiv! ERROR:: Exit from: src/core/scoring/methods/PairEnergy.cc line: 338 BOINC:: Error reading and gzipping output datafile: default.out called boinc_finish </stderr_txt> |
Mod.Sense Volunteer moderator Joined: 22 Aug 06 Posts: 4018 Credit: 0 RAC: 0 |
Further notes on svincent's failed task: ERROR: dis==0 in pairtermderiv! ERROR:: Exit from: src/core/scoring/methods/PairEnergy.cc line: 338 BOINC:: Error reading and gzipping output datafile: default.out ...and only 198 seconds of runtime. Rosetta Moderator: Mod.Sense |
Mod.Sense Volunteer moderator Joined: 22 Aug 06 Posts: 4018 Credit: 0 RAC: 0 |
!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!! Alert about problem WUs. Problem task names all begin with "res_careful_". For details on which proteins are known to have problems and should be aborted, and which will run OK and should be run normally, please see the link above. !!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!! Rosetta Moderator: Mod.Sense |
l_mckeon Joined: 5 Jun 07 Posts: 44 Credit: 180,717 RAC: 0 |
The following two tasks had shorter run times than usual (about 1:30 hrs and 1:50 hrs from memory) and their uploads totalled around 16MB. rest3d85_ip40_1t4w.patchdock.6.pdb_0002_fa_dock.xml_score12_pert38_DOCK_10797_354_0_0 rest3d85_ip40_1t4w.patchdock.6.pdb_0002_fa_dock.xml_score12_pert38_DOCK_10797_354_0_0 |
Mod.Sense Volunteer moderator Joined: 22 Aug 06 Posts: 4018 Credit: 0 RAC: 0 |
l_mckeon, yes, those tasks hit the 99 model limit before reaching your normal runtime preference. Rosetta Moderator: Mod.Sense |
Gavin Shaw Joined: 1 Feb 07 Posts: 10 Credit: 506,456 RAC: 0 |
Had another big one overnight. Task 243710356 was another 6.8MB upload, again with 99 decoys. Of course I have now seen a post about some problem with units, but it didn't help as the unit had already crunched :) Never surrender and never give up. In the darkest hour there is always hope. |
Sid Celery Joined: 11 Feb 08 Posts: 2141 Credit: 41,534,988 RAC: 10,560 |
Another Validation error with this job: crys__BOINC_ABRELAX_R120G_CRYSTALLIN_SAVE_ALL_OUT_IGNORE_THE_REST-S25-9-S3-3--crys_-_9344_11912_2 No errors reported within the Task Details of any of them. Previous ones reported here and here. |
Paul D. Buck Joined: 17 Sep 05 Posts: 815 Credit: 1,812,737 RAC: 0 |
Looks like I might have gotten one of the problems: ERROR: [ERROR] Unable to open constraints file: resample_outward0.05_ub0.1_lb0.02_median.t364_.cst ERROR:: Exit from: ..\..\src\core\scoring\constraints\ConstraintIO.cc line: 330 BOINC:: Error reading and gzipping output datafile: default.out task 243804881 |
Speedy Joined: 25 Sep 05 Posts: 163 Credit: 808,337 RAC: 1 |
This task https://boinc.bakerlab.org/rosetta/result.php?resultid=243902658 made 99 decoys and the upload was about 7.14MB. Is this normal for these tasks? Have a crunching good day!! |
Greg_BE Joined: 30 May 06 Posts: 5691 Credit: 5,859,226 RAC: 0 |
This task https://boinc.bakerlab.org/rosetta/result.php?resultid=243902658 made 99 decoys and the upload was about 7.14MB. Is this normal for these tasks? There is a limiter built into the program. It stops the crunching at 99 decoys. This is normal. |
Speedy Joined: 25 Sep 05 Posts: 163 Credit: 808,337 RAC: 1 |
This task https://boinc.bakerlab.org/rosetta/result.php?resultid=243902658 made 99 decoys and the upload was about 7.14MB. Is this normal for these tasks? I'm aware of this, thanks. Greg_BE, I think you misunderstood the question. I was referring to the upload size of the work unit. Is 7.14MB the normal upload size for this type of work unit (https://boinc.bakerlab.org/rosetta/result.php?resultid=243902658)? Have a crunching good day!! |
Mod.Sense Volunteer moderator Joined: 22 Aug 06 Posts: 4018 Credit: 0 RAC: 0 |
The more models completed, the larger the upload will be. The resulting increase in upload size is part of why mini put on the 99 model limit per task. So, it is normal, but will probably be reviewed and perhaps changed to run longer models in some way. Rosetta Moderator: Mod.Sense |
Paul D. Buck Joined: 17 Sep 05 Posts: 815 Credit: 1,812,737 RAC: 0 |
|
Greg_BE Joined: 30 May 06 Posts: 5691 Credit: 5,859,226 RAC: 0 |
This week's problems: 2 validate errors (non res_careful) and 3 res_careful errors (all listed as troubled in the res_careful thread). rest3d85_ip40_2jkf.patchdock.25.pdb_0001_fa_dock.xml_score12_pert38_DOCK_10797_583_0 rest3d85_ip40_2v1l.patchdock.10.pdb_0001_fa_dock.xml_score12_pert38_DOCK_10797_499_0 No error message, just a validate error. |
Speedy Joined: 25 Sep 05 Posts: 163 Credit: 808,337 RAC: 1 |
The more models completed, the larger the upload will be. The resulting increase in upload size is part of why mini put on the 99 model limit per task. So, it is normal, but will probably be reviewed and perhaps changed to run longer models in some way. Thank you for explaining. Have a crunching good day!! |
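
Mod.Sense's point that upload size grows with the number of models can be checked roughly against the figures reported in this thread (6.8MB for 99 decoys in about 50 minutes, 7.14MB for 99 decoys). The sketch below is illustrative only: it assumes upload size scales linearly with model count and that models complete at a constant rate, neither of which the thread confirms. The function names are hypothetical and are not part of BOINC or Rosetta.

```python
from typing import Optional


def per_model_mb(upload_mb: float, models: int) -> float:
    """Average upload size per model, assuming linear scaling (an assumption)."""
    return upload_mb / models


def projected_upload_mb(
    upload_mb: float,
    models: int,
    runtime_min: float,
    target_min: float,
    model_cap: Optional[int] = None,
) -> float:
    """Project the upload size for a longer run.

    Assumes models finish at a constant rate; model_cap mimics the
    99 model limit described in the thread.
    """
    projected_models = (models / runtime_min) * target_min
    if model_cap is not None:
        projected_models = min(projected_models, model_cap)
    return per_model_mb(upload_mb, models) * projected_models


# Figures reported in the thread: 6.8MB, 99 decoys, about 50 minutes,
# with a 4 hour (240 minute) runtime preference.
uncapped = projected_upload_mb(6.8, 99, 50, 240)
capped = projected_upload_mb(6.8, 99, 50, 240, model_cap=99)
print(f"per model: {per_model_mb(6.8, 99):.3f} MB")
print(f"4 hour run without a cap: {uncapped:.2f} MB")
print(f"4 hour run with the 99 model cap: {capped:.2f} MB")
```

Under these assumptions each model adds roughly 0.07MB, and an uncapped 4 hour run of that task would have uploaded about 33MB, which is the growth the 99 model limit holds back.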
©2024 University of Washington
https://www.bakerlab.org