1)
Message boards :
Number crunching :
invalid results; 24 hours wasted
(Message 89331)
Posted 22 Jul 2018 by ChristianVirtual Post: name rb_07_16_508_732__t000__0_C3_SAVE_ALL_OUT_IGNORE_THE_REST_682151_12503 application Rosetta created 17 Jul 2018, 13:17:32 UTC canonical result 1016016653 granted credit 238.10 minimum quorum 1 initial replication 1 max # of error/total/success tasks 1, 1, 1 errors Too many total results get other one; this time on "Too many results" ... what does that mean ? Server is handing out more and dump those who still contribute their CPU cycles ? |
2)
Message boards :
Number crunching :
invalid results; 24 hours wasted
(Message 88892)
Posted 13 May 2018 by ChristianVirtual Post: I think you are right; I had a i7-8700 and 3930 in the past days and they had less problems. and also agree, that other projects like WCG have much less issues with Ryzen too bad for Rosetta ... |
3)
Message boards :
Number crunching :
invalid results; 24 hours wasted
(Message 88879)
Posted 13 May 2018 by ChristianVirtual Post: It's really frustrating to spend 24 hours of CPU cycles to get a WU invalidated https://boinc.bakerlab.org/workunit.php?wuid=897665706 https://boinc.bakerlab.org/workunit.php?wuid=897666513 Enough RAM and storage; that should not be a limit. Ryzen 1700x, Ubuntu 17.10 <core_client_version>7.11.0</core_client_version> <![CDATA[ <stderr_txt> command: ../../projects/boinc.bakerlab.org_rosetta/rosetta_4.07_x86_64-pc-linux-gnu -run:protocol jd2_scripting @flags_rb_05_08_164_241__t000__1_C1_robetta -silent_gz -mute all -out:file:silent default.out -in:file:boinc_wu_zip input_rb_05_08_164_241__t000__1_C1_robetta.zip -nstruct 10000 -cpu_run_time 28800 -watchdog -boinc:max_nstruct 600 -checkpoint_interval 120 -database minirosetta_database -in::file::zip minirosetta_database.zip -boinc::watchdog -run::rng mt19937 -constant_seed -jran 3291164 Starting watchdog... Watchdog active. ====================================================== DONE :: 329 starting structures 86145 cpu seconds This process generated 329 decoys from 329 attempts ====================================================== BOINC :: WS_max 4.30068e+08 BOINC :: Watchdog shutting down... 12:01:32 (3632): called boinc_finish(0) </stderr_txt> ]]> what an one do ? |
4)
Message boards :
Number crunching :
Error while computing - AMD Opteron
(Message 88873)
Posted 12 May 2018 by ChristianVirtual Post: another strange one https://boinc.bakerlab.org/workunit.php?wuid=898555819 why the server cancelled those ? (sorry, might should have made a new thread) |
5)
Message boards :
Number crunching :
Error while computing - AMD Opteron
(Message 88872)
Posted 12 May 2018 by ChristianVirtual Post: I have also quite some trouble with WU, 24 hours and fail ... on Ryzen with Ubuntu like this https://boinc.bakerlab.org/result.php?resultid=996310562 "Too many total results" |
©2024 University of Washington
https://www.bakerlab.org