Posts by TCU Computer Science

21) Message boards : Number crunching : Report stuck & aborted 5.01 WU here please - III (Message 14545)
Posted 24 Apr 2006 by TCU Computer Science
Post:
Three more 5.01 WUs were aborted this morning:

63.0 hrs
http://boinc.bakerlab.org/rosetta/result.php?resultid=17864947
NO_TERM_STRAND_1ogw_388_841

62.8 hrs
http://boinc.bakerlab.org/rosetta/result.php?resultid=17849525
FACONTACTS_RECENTER_NOFILTERS_1eyvA_448_842

30.2 hrs
http://boinc.bakerlab.org/rosetta/result.php?resultid=17942565
FACONTACTS_RECENTER_NOFILTERS_5croA_448_703
22) Message boards : Number crunching : Report stuck & aborted 5.01 WU here please - III (Message 14515)
Posted 24 Apr 2006 by TCU Computer Science
Post:
These 5.01 WUs were aborted today:

11.8 hrs
http://boinc.bakerlab.org/rosetta/result.php?resultid=17929192
FACONTACTS_RECENTER_NOFILTERS_1ubi__448_846

34.7 hrs
http://boinc.bakerlab.org/rosetta/result.php?resultid=17754714
HBLR_1.0_1hz6_420_5519

44.2 hrs
http://boinc.bakerlab.org/rosetta/result.php?resultid=17786665
HBLR_1.0_1di2_ROT_TRIALS_TRIE_449_49

50.7 hrs
http://boinc.bakerlab.org/rosetta/result.php?resultid=17762275
HBLR_1.0_1hz6_420_7237

49.3 hrs
http://boinc.bakerlab.org/rosetta/result.php?resultid=17773010
FACONTACTS_RECENTER_NOFILTERS_1vls__448_927

27.5 hrs
http://boinc.bakerlab.org/rosetta/result.php?resultid=17797075
NO_TERM_STRAND_1ogw_423_3285
23) Message boards : Number crunching : Report stuck & aborted WU here please - II (Message 14012)
Posted 18 Apr 2006 by TCU Computer Science
Post:
Let me know what the result number is -- if it's not flagged to get credit, I can see why.


Here is one that ran for 88 hours. I aborted it on 28 March.
http://boinc.bakerlab.org/rosetta/result.php?resultid=14764800
24) Message boards : Number crunching : Report stuck & aborted WU here please - II (Message 13881)
Posted 16 Apr 2006 by TCU Computer Science
Post:
I aborted nine WUs today.

These four showed 20-50 hours of accumulated time:

17028012
TRUNCATE_TERMINI_FULLRELAX_1b3aA_433_678

17051917
TRUNCATE_TERMINI_FULLRELAX_1ptq__433_905

17050886
TRUNCATE_TERMINI_FULLRELAX_1enh__433_896

16238549
FA_RLXpt_hom006_1ptq__361_440


The following five showed little or no accumulated time but had been running for 4-11 days:

17016383
TRUNCATE_TERMINI_FULLRELAX_1ptq__433_569

16970141
TRUNCATE_TERMINI_FULLRELAX_2tif__433_104

16995174
TRUNCATE_TERMINI_FULLRELAX_2tif__433_369

16196147
FA_RLXpt_hom002_1ptq__361_379

16227211
FARELAX_NOFILTERS_1bm8__417_637
25) Message boards : Number crunching : Report stuck & aborted WU here please (Message 12772)
Posted 28 Mar 2006 by TCU Computer Science
Post:
The following were aborted today. All were stuck at 1.00% after running for 20+ hours.

ID=12326404 name = HB_BARCODE_30_1c8cA_351_32403
ID=12261321 name = HB_BARCODE_30_256bA_351_28680
ID=12034212 name = HB_BARCODE_30_1bk2__351_16205
ID=11076727 name = FA_RLXb3_hom001_1b3aA_359_347
ID=11972587 name = FA_RLXb3_hom010_2chf__362_384
ID=11761822 name = FA_RLXur_hom004_1urnA_362_308
26) Message boards : Number crunching : Report stuck & aborted WU here please (Message 10367)
Posted 2 Feb 2006 by TCU Computer Science
Post:
Two WUs stuck at 1% and aborted on 02 Feb:

NO_SIM_ANNEAL_BARCODE_30_2reb_286_4502_0

TERMINI_2reb_294_6931_0

The first one ran for 94 hours before I noticed it.
27) Message boards : Number crunching : Report stuck & aborted WU here please (Message 10163)
Posted 29 Jan 2006 by TCU Computer Science
Post:
I just noticed that on one of my computers
NEW_SOFT_CENTROID_PACKING_1di2_225_7586_0
has been running since 6 January.

boincmgr shows
CPU Time 01:10:08
Progress 20%
To completion 05:01:08

but the messages show about 120 one-hour slices spent in execution.

The "pausing" messages show that it is being left in memory.


This is a known R@H bug. To prevent the problem, do the following:

In your user preferences, set the time between application switches (or swaps) to something close to 2 hours (120 min). That is usually enough to keep things going, but if you want to be really certain, set the system so that it keeps the R@H application in memory during application swaps.

There is more about this in the FAQ sticky here.



Yes, I know about that. I have the preferences set to keep the app in memory and the "pausing" messages say that it is being kept in memory. So far, this is the only stuck WU that I have encountered.
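
For anyone who wants to see the two settings from the quoted advice written out, here is a rough sketch that builds them as a BOINC preferences fragment. The tag names `cpu_scheduling_period_minutes` and `leave_apps_in_memory` are the ones BOINC uses in its global preferences XML; the helper name `build_prefs` is made up for illustration. Whether your client version reads a local override file at all is an assumption, so the usual route is still the web preferences page.

```python
import xml.etree.ElementTree as ET


def build_prefs(switch_minutes: int = 120, keep_in_memory: bool = True) -> str:
    """Return a global_preferences XML fragment as a string.

    build_prefs is a hypothetical helper, not part of BOINC.
    switch_minutes: time between application switches, in minutes.
    keep_in_memory: leave suspended applications in memory when True.
    """
    root = ET.Element("global_preferences")
    ET.SubElement(root, "cpu_scheduling_period_minutes").text = str(switch_minutes)
    ET.SubElement(root, "leave_apps_in_memory").text = "1" if keep_in_memory else "0"
    return ET.tostring(root, encoding="unicode")


if __name__ == "__main__":
    # Prints the fragment with the 120-minute switch interval suggested above.
    print(build_prefs())
```

As noted above, these settings were already in place on the affected machine, so they may not be sufficient on their own.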
28) Message boards : Number crunching : Report stuck & aborted WU here please (Message 9736)
Posted 24 Jan 2006 by TCU Computer Science
Post:
I just noticed that on one of my computers
NEW_SOFT_CENTROID_PACKING_1di2_225_7586_0
has been running since 6 January.

boincmgr shows
CPU Time 01:10:08
Progress 20%
To completion 05:01:08

but the messages show about 120 one-hour slices spent in execution.

The "pausing" messages show that it is being left in memory.


©2024 University of Washington
https://www.bakerlab.org