Message boards : Number crunching : Some of the erroneous WUs are listed as successful and even pass validation
Author | Message |
---|---|
Mad_Max Joined: 31 Dec 09 Posts: 209 Credit: 26,073,191 RAC: 16,693 |
I spotted some WUs that failed completely but have a "Success" status; the server even validates them and grants credit for these WUs despite critical errors. Name: pre_helical_bundles_round1_attempt1_SAVE_ALL_OUT_IGNORE_THE_REST_3cm5iv5y_1391022_1_0 while the log clearly shows fatal errors: <core_client_version>7.16.11</core_client_version> <![CDATA[ <stderr_txt> command: projects/boinc.bakerlab.org_rosetta/rosetta_4.20_windows_x86_64.exe -run:protocol jd2_scripting -parser:protocol pre_helix_boinc_v1.xml @helix_design.flags -in:file:silent pre_helical_bundles_round1_attempt1_SAVE_ALL_OUT_IGNORE_THE_REST_3cm5iv5y.silent -in:file:silent_struct_type binary -silent_gz -mute all -silent_read_through_errors true -out:file:silent_struct_type binary -out:file:silent default.out -in:file:boinc_wu_zip pre_helical_bundles_round1_attempt1_SAVE_ALL_OUT_IGNORE_THE_REST_3cm5iv5y.zip @pre_helical_bundles_round1_attempt1_SAVE_ALL_OUT_IGNORE_THE_REST_3cm5iv5y.flags -nstruct 10000 -cpu_run_time 28800 -boinc:max_nstruct 20000 -checkpoint_interval 120 -database minirosetta_database -in::file::zip minirosetta_database.zip -boinc::watchdog -boinc::cpu_run_timeout 36000 -run::rng mt19937 -constant_seed -jran 2578584 Using database: database_357d5d93529_n_methylminirosetta_database ERROR: [ERROR] Unable to open constraints file: bc1dd6b031238f177cab303f1b5a3aef_0001.MSAcst ERROR:: Exit from: ......srccorescoringconstraintsConstraintIO.cc line: 457 08:41:23 (4920): called boinc_finish(0) </stderr_txt> ]]> Links to tasks: https://boinc.bakerlab.org/rosetta/result.php?resultid=1376543173 https://boinc.bakerlab.org/rosetta/result.php?resultid=1375927242 https://boinc.bakerlab.org/rosetta/result.php?resultid=1375899817 |
Tomcat雄猫 Joined: 20 Dec 14 Posts: 180 Credit: 5,386,173 RAC: 0 |
"I spotted some WUs that failed completely but have a 'Success' status; the server even validates them and grants credit for these WUs despite critical errors." It appears to be a known issue with the pre_helical_bundles_round1_attempt1_SAVE_ALL_OUT_IGNORE_THE_REST tasks. |
Grant (SSSF) Joined: 28 Mar 20 Posts: 1684 Credit: 17,950,321 RAC: 23,118 |
"I spotted some WUs that failed completely but have a 'Success' status; the server even validates them and grants credit for these WUs despite critical errors." Even though it errored out before it was due to finish, the work done up to that point was Valid, so it Validated & was awarded Credit for the work done. Grant Darwin NT |
Mad_Max Joined: 31 Dec 09 Posts: 209 Credit: 26,073,191 RAC: 16,693 |
It did not do any useful work. Such WUs error out just a few minutes after starting, before the computation of the very first decoy/model has completed and any results have been saved. The logs suggest this happens because the R@H app cannot open/load all the input data needed for the computation (possibly a configuration error at the WU generation stage?). Yet these tasks still somehow slip through the server validator and are marked as Success/Valid. |
Sid Celery Joined: 11 Feb 08 Posts: 2125 Credit: 41,249,734 RAC: 9,368 |
"It did not do any useful work." This was first reported to admins in late May: I've finally got round to examining this issue involving "ERROR:: Exit from: ......srccorescoringconstraintsConstraintIO.cc line: 457". Their response is summarised here. They're already aware of these bad tasks. |
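
For anyone who wants to check their own results for the pattern described in this thread, here is a minimal Python sketch. It assumes that the two markers visible in the quoted stderr (an `ERROR:` line and `called boinc_finish(0)`) are enough to flag a suspicious task; the function name `looks_like_false_success` is hypothetical, and this is not how the project's validator works.

```python
import re

# Markers assumed from the stderr excerpt quoted in this thread.
ERROR_MARKER = re.compile(r"ERROR:")
FINISH_MARKER = re.compile(r"called boinc_finish\((-?\d+)\)")


def looks_like_false_success(stderr_text: str) -> bool:
    """Return True if the log contains an ERROR marker yet exits with status 0."""
    finish = FINISH_MARKER.search(stderr_text)
    if finish is None:
        # No exit status recorded, so nothing can be concluded.
        return False
    exited_cleanly = int(finish.group(1)) == 0
    has_error = ERROR_MARKER.search(stderr_text) is not None
    return exited_cleanly and has_error


# Excerpt copied from the log posted above.
sample = (
    "ERROR: [ERROR] Unable to open constraints file: "
    "bc1dd6b031238f177cab303f1b5a3aef_0001.MSAcst "
    "ERROR:: Exit from: ......srccorescoringconstraintsConstraintIO.cc line: 457 "
    "08:41:23 (4920): called boinc_finish(0)"
)

print(looks_like_false_success(sample))
```

Pasting the stderr text from a task's result page into `sample` should be enough to try it. A log that mentions errors but ends with a non-zero exit status is not flagged, since the server would normally treat that task as failed anyway.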
©2024 University of Washington
https://www.bakerlab.org