Problems and Technical Issues with Rosetta@home

Message boards : Number crunching : Problems and Technical Issues with Rosetta@home

To post messages, you must log in.

Previous · 1 . . . 121 · 122 · 123 · 124 · 125 · 126 · 127 . . . 309 · Next

AuthorMessage
fkmaster

Send message
Joined: 19 Jan 06
Posts: 2
Credit: 21,955,151
RAC: 2,881
Message 102589 - Posted: 11 Sep 2021, 7:09:55 UTC - in response to Message 102576.  
Last modified: 11 Sep 2021, 7:12:52 UTC

Thank you, I modified the cache settings and resource share on the i5-machine. For the last months Rosetta was the only project on this PC, yesterday I started yoyo@home because of the idle process.
It is 100-100 the resource share between Rosetta and yoyo, but no new tasks in Rosetta again.

I want to run Rosetta on i5 and Ryzen5 only.

Most of recent Rosetta tasks use a huge amount of memory, the 32 bit system can handle less than 4 GB. I wait for a while and reinstall the system with 64 bit and I will increase the RAM to 8 GB.

I hope it will help.
ID: 102589 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Tomcat雄猫

Send message
Joined: 20 Dec 14
Posts: 180
Credit: 5,386,173
RAC: 0
Message 102590 - Posted: 11 Sep 2021, 9:21:12 UTC - in response to Message 102588.  
Last modified: 11 Sep 2021, 9:24:15 UTC

There are no Android or Linux or Windows Tasks, they can all be processed by the appropriate application.
However many of the current Tasks need over 1GB of RAM. Depending on your BOINC memory settings & the amount of available RAM on the Android device it may not be possible for it to process them, so it won't get any even if Rosetta is owed processing time on the device.



Understood, thanks!

I should in theory have over 6 GB of RAM available, my RAM limit is at 100% (12GB). Let's see if I get any Rosetta tasks. I've gotten 2 Ralph tasks a few weeks back but so far it's been WCG for weeks.

It has to be noted that the resource share of Rosetta@home on my phone has been set to 100, just like WCG. I guess if a resource share of 200 for Rosetta and 100 for WCG results in BOINC seemingly favouring WCG tasks, equal resource share will just make it worse.
ID: 102590 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Profile Grant (SSSF)

Send message
Joined: 28 Mar 20
Posts: 1725
Credit: 18,380,064
RAC: 20,136
Message 102591 - Posted: 12 Sep 2021, 4:30:44 UTC
Last modified: 12 Sep 2021, 4:36:42 UTC

Just had 2 5nvx_graft_buwei_ Tasks, one ran for 10min the other 26min. They both Validated although they produced errors

<core_client_version>7.16.11</core_client_version>
<![CDATA[
<stderr_txt>
command: projects/boinc.bakerlab.org_rosetta/rosetta_4.20_windows_x86_64.exe -run:protocol jd2_scripting -parser:protocol pdblite_boinc_120_10--fuse--predictor_v11_boinc_fix--fuse--tslp_design_v1_boinc_fix_plus6.xml @5nvx_graft_buwei_flags -in:file:silent 5nvx_graft_buwei_xaf_SAVE_ALL_OUT_IGNORE_THE_REST_5ht6pv3o.silent -in:file:silent_struct_type binary -silent_gz -mute all -silent_read_through_errors true -out:file:silent_struct_type binary -out:file:silent default.out -in:file:boinc_wu_zip 5nvx_graft_buwei_xaf_SAVE_ALL_OUT_IGNORE_THE_REST_5ht6pv3o.zip @5nvx_graft_buwei_xaf_SAVE_ALL_OUT_IGNORE_THE_REST_5ht6pv3o.flags -nstruct 10000 -cpu_run_time 28800 -boinc:max_nstruct 20000 -checkpoint_interval 120 -database minirosetta_database -in::file::zip minirosetta_database.zip -boinc::watchdog -boinc::cpu_run_timeout 36000 -run::rng mt19937 -constant_seed -jran 2084657
Using database: database_357d5d93529_n_methylminirosetta_database

ERROR: [ERROR] Unable to open constraints file: m_ems_3hM_2942_000000222_0001_44_63_H_._HHH_b2_01435_0001_1_0001.MSAcst
ERROR:: Exit from: ......srccorescoringconstraintsConstraintIO.cc line: 457
13:57:22 (7176): called boinc_finish(0)

</stderr_txt>
]]>



<core_client_version>7.16.11</core_client_version>
<![CDATA[
<stderr_txt>
command: projects/boinc.bakerlab.org_rosetta/rosetta_4.20_windows_x86_64.exe -run:protocol jd2_scripting -parser:protocol pdblite_boinc_120_10--fuse--predictor_v11_boinc_fix--fuse--tslp_design_v1_boinc_fix_plus6.xml @5nvx_graft_buwei_flags -in:file:silent 5nvx_graft_buwei_xaf_SAVE_ALL_OUT_IGNORE_THE_REST_4do0cs3f.silent -in:file:silent_struct_type binary -silent_gz -mute all -silent_read_through_errors true -out:file:silent_struct_type binary -out:file:silent default.out -in:file:boinc_wu_zip 5nvx_graft_buwei_xaf_SAVE_ALL_OUT_IGNORE_THE_REST_4do0cs3f.zip @5nvx_graft_buwei_xaf_SAVE_ALL_OUT_IGNORE_THE_REST_4do0cs3f.flags -nstruct 10000 -cpu_run_time 28800 -boinc:max_nstruct 20000 -checkpoint_interval 120 -database minirosetta_database -in::file::zip minirosetta_database.zip -boinc::watchdog -boinc::cpu_run_timeout 36000 -run::rng mt19937 -constant_seed -jran 1332669
Using database: database_357d5d93529_n_methylminirosetta_database

ERROR: [ERROR] Unable to open constraints file: m_036bed11a46af16b1bcff1c055f4941a_0001_000000269_0001_15_31_H_._HHH_b1_04870_0001_1_0001.MSAcst
ERROR:: Exit from: ......srccorescoringconstraintsConstraintIO.cc line: 457
13:25:06 (6244): called boinc_finish(0)

</stderr_txt>
]]>



Edit- just got a 3rd one that lasted for 15min.

<core_client_version>7.16.11</core_client_version>
<![CDATA[
<stderr_txt>
command: projects/boinc.bakerlab.org_rosetta/rosetta_4.20_windows_x86_64.exe -run:protocol jd2_scripting -parser:protocol pdblite_boinc_120_10--fuse--predictor_v11_boinc_fix--fuse--tslp_design_v1_boinc_fix_plus6.xml @5nvx_graft_buwei_flags -in:file:silent 5nvx_graft_buwei_xaf_SAVE_ALL_OUT_IGNORE_THE_REST_8oi6gv3g.silent -in:file:silent_struct_type binary -silent_gz -mute all -silent_read_through_errors true -out:file:silent_struct_type binary -out:file:silent default.out -in:file:boinc_wu_zip 5nvx_graft_buwei_xaf_SAVE_ALL_OUT_IGNORE_THE_REST_8oi6gv3g.zip @5nvx_graft_buwei_xaf_SAVE_ALL_OUT_IGNORE_THE_REST_8oi6gv3g.flags -nstruct 10000 -cpu_run_time 28800 -boinc:max_nstruct 20000 -checkpoint_interval 120 -database minirosetta_database -in::file::zip minirosetta_database.zip -boinc::watchdog -boinc::cpu_run_timeout 36000 -run::rng mt19937 -constant_seed -jran 1500897
Using database: database_357d5d93529_n_methylminirosetta_database

ERROR: [ERROR] Unable to open constraints file: m_HHH_b2_00235_000000121_0001_23_40_H_._HHH_b2_05032_0002_1_0001.MSAcst
ERROR:: Exit from: ......srccorescoringconstraintsConstraintIO.cc line: 457
14:01:48 (1528): called boinc_finish(0)

</stderr_txt>
]]>

Grant
Darwin NT
ID: 102591 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Profile [VENETO] boboviz

Send message
Joined: 1 Dec 05
Posts: 2002
Credit: 9,780,807
RAC: 5,492
Message 102592 - Posted: 12 Sep 2021, 6:44:53 UTC - in response to Message 102591.  

Just had 2 5nvx_graft_buwei_ Tasks, one ran for 10min the other 26min. They both Validated although they produced errors


Same errors on Ralph
ID: 102592 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Profile Grant (SSSF)

Send message
Joined: 28 Mar 20
Posts: 1725
Credit: 18,380,064
RAC: 20,136
Message 102594 - Posted: 12 Sep 2021, 9:19:21 UTC

I've now got 4 of those 5nvx_graft_buwei_ Tasks that died after 2min or less, which are Invalid due to a Validate error, even though they are giving the same Stderr output error as the ones that run for (slightly) longer & Validate.
And 2 more of those short runs (but longer than 2min) producing errors that Validate.

I've got 2 5nvx_graft_buwei_ Tasks that are still running- 3hrs and 4hr 45min and counting. Will be interesting to see if they make it to 8 hours, and if there is an error in the Stderr output when they are done or not.
Grant
Darwin NT
ID: 102594 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Profile Grant (SSSF)

Send message
Joined: 28 Mar 20
Posts: 1725
Credit: 18,380,064
RAC: 20,136
Message 102595 - Posted: 12 Sep 2021, 19:55:04 UTC - in response to Message 102594.  
Last modified: 12 Sep 2021, 19:58:23 UTC

I've now got 4 of those 5nvx_graft_buwei_ Tasks that died after 2min or less, which are Invalid due to a Validate error, even though they are giving the same Stderr output error as the ones that run for (slightly) longer & Validate.
And 2 more of those short runs (but longer than 2min) producing errors that Validate.

I've got 2 5nvx_graft_buwei_ Tasks that are still running- 3hrs and 4hr 45min and counting. Will be interesting to see if they make it to 8 hours, and if there is an error in the Stderr output when they are done or not.
So far i have 3 _5nvx_graft_buwei_ Tasks that produced Decoys & Validated.
All the others (Valids & Invalids) just resulted in error messages.

Roughly a 82% failure rate.

No signs of issues with any of the other Tasks yet.
Grant
Darwin NT
ID: 102595 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Profile [VENETO] boboviz

Send message
Joined: 1 Dec 05
Posts: 2002
Credit: 9,780,807
RAC: 5,492
Message 102596 - Posted: 12 Sep 2021, 19:57:26 UTC - in response to Message 102594.  

I've got 2 5nvx_graft_buwei_ Tasks that are still running- 3hrs and 4hr 45min and counting. Will be interesting to see if they make it to 8 hours, and if there is an error in the Stderr output when they are done or not.


I cannot understand.
Correct wus have the same error, but it's validated

ERROR: [ERROR] Unable to open constraints file: m_HHH_b1_05679_000000205_0001_20_37_H_._HHH_b1_02798_0002_1_0001.MSAcst
ERROR:: Exit from: ......srccorescoringconstraintsConstraintIO.cc line: 457
21:42:17 (7880): called boinc_finish(0)

ID: 102596 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Profile Grant (SSSF)

Send message
Joined: 28 Mar 20
Posts: 1725
Credit: 18,380,064
RAC: 20,136
Message 102597 - Posted: 12 Sep 2021, 20:01:48 UTC - in response to Message 102596.  

I cannot understand.
Correct wus have the same error, but it's validated

[quote]ERROR: [ERROR] Unable to open constraints file: m_HHH_b1_05679_000000205_0001_20_37_H_._HHH_b1_02798_0002_1_0001.MSAcst
ERROR:: Exit from: ......srccorescoringconstraintsConstraintIO.cc line: 457
21:42:17 (7880): called boinc_finish(0)
Yep.
If it dies early, you get a Validation error. If it runs for more than a few minutes, but less than the full time, it Validates- even though it doesn't produce any decoys & it gives the same error as the ones that die in 2 min or less.
Grant
Darwin NT
ID: 102597 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Old man

Send message
Joined: 10 Nov 07
Posts: 25
Credit: 1,122,372
RAC: 0
Message 102604 - Posted: 14 Sep 2021, 16:57:33 UTC

Name 5nvx_graft_buwei_xab_SAVE_ALL_OUT_IGNORE_THE_REST_0oe9kw2f_1731803_2_0

Server state Over
Outcome Computation error
Client state Compute error
Exit status 1 (0x00000001) Unknown error code

Stderr output:
<core_client_version>7.16.11</core_client_version>
<![CDATA[
<message>
Funktio ei kelpaa.
(0x1) - exit code 1 (0x1)</message>
<stderr_txt>
command: projects/boinc.bakerlab.org_rosetta/rosetta_4.20_windows_x86_64.exe -run:protocol jd2_scripting -parser:protocol pdblite_boinc_120_10--fuse--predictor_v11_boinc_fix--fuse--tslp_design_v1_boinc_fix_plus6.xml @5nvx_graft_buwei_flags -in:file:silent 5nvx_graft_buwei_xab_SAVE_ALL_OUT_IGNORE_THE_REST_0oe9kw2f.silent -in:file:silent_struct_type binary -silent_gz -mute all -silent_read_through_errors true -out:file:silent_struct_type binary -out:file:silent default.out -in:file:boinc_wu_zip 5nvx_graft_buwei_xab_SAVE_ALL_OUT_IGNORE_THE_REST_0oe9kw2f.zip @5nvx_graft_buwei_xab_SAVE_ALL_OUT_IGNORE_THE_REST_0oe9kw2f.flags -nstruct 10000 -cpu_run_time 28800 -boinc:max_nstruct 20000 -checkpoint_interval 120 -database minirosetta_database -in::file::zip minirosetta_database.zip -boinc::watchdog -boinc::cpu_run_timeout 36000 -run::rng mt19937 -constant_seed -jran 3752359
Using database: database_357d5d93529_n_methylminirosetta_database

ERROR: [ERROR] Unable to open constraints file: m_ems_3hC_506_000000211_0001_43_58_H_._HHH_b2_02864_0001_1_0001.MSAcst
ERROR:: Exit from: ......srccorescoringconstraintsConstraintIO.cc line: 457
BOINC:: Error reading and gzipping output datafile: default.out
18:28:53 (10104): called boinc_finish(1)

</stderr_txt>
]]>

What to do?
ID: 102604 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
YAG

Send message
Joined: 13 Oct 19
Posts: 7
Credit: 13,015,426
RAC: 0
Message 102605 - Posted: 14 Sep 2021, 18:31:00 UTC

Good afternoon,

I have received four tasks of the application "rosetta python projects v1.03 (vbox64) x86_64-pc-linux-gnu". All of the tasks failed with the same error: "-186 (0xFFFFFF46) ERR_RESULT_DOWNLOAD". In the logs appears the following:
WU download error: couldn't get input files:
<file_xfer_error>
<file_name>AIMNet_vm_v2.vdi</file_name>
<error_code>-119 (md5 checksum failed for file)</error_code>
<error_message>MD5 check failed</error_message>


This is the computer who obtained the errors: https://boinc.bakerlab.org/rosetta/show_host_detail.php?hostid=5909874

Here are the tasks:
https://boinc.bakerlab.org/rosetta/result.php?resultid=1425336189
https://boinc.bakerlab.org/rosetta/result.php?resultid=1425334597
https://boinc.bakerlab.org/rosetta/result.php?resultid=1425332824
https://boinc.bakerlab.org/rosetta/result.php?resultid=1425333029


Regards,
YAG
ID: 102605 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Profile robertmiles

Send message
Joined: 16 Jun 08
Posts: 1233
Credit: 14,338,560
RAC: 2,014
Message 102606 - Posted: 14 Sep 2021, 18:56:00 UTC - in response to Message 102604.  

[snip]

ERROR: [ERROR] Unable to open constraints file: m_ems_3hC_506_000000211_0001_43_58_H_._HHH_b2_02864_0001_1_0001.MSAcst

What to do?

That's the important line of the error messages.

That usually means that one of the input files for the task was not downloaded correctly.

Lately, that has often been because the file was not in the correct place on the server. If so, you can't do much other than wait for a better task.

If anyone else who gets a task from the same workunit and has it fail the same way, you'll get a little bit of credit for trying to run the task.
ID: 102606 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Profile [VENETO] boboviz

Send message
Joined: 1 Dec 05
Posts: 2002
Credit: 9,780,807
RAC: 5,492
Message 102608 - Posted: 14 Sep 2021, 19:29:02 UTC - in response to Message 102605.  
Last modified: 14 Sep 2021, 19:30:13 UTC

Good afternoon,

I have received four tasks of the application "rosetta python projects v1.03 (vbox64) x86_64-pc-linux-gnu". All of the tasks failed with the same error: "-186 (0xFFFFFF46) ERR_RESULT_DOWNLOAD". In the logs appears the following:
WU download error: couldn't get input files:
<file_xfer_error>
<file_name>AIMNet_vm_v2.vdi</file_name>
<error_code>-119 (md5 checksum failed for file)</error_code>
<error_message>MD5 check failed</error_message>



Same error on Win 10
ID: 102608 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Sid Celery

Send message
Joined: 11 Feb 08
Posts: 2140
Credit: 41,518,559
RAC: 10,612
Message 102625 - Posted: 15 Sep 2021, 17:13:22 UTC

My Boinc is set up as follows (from 32Gb total RAM):

15/09/2021 14:43:09 | | max memory usage when active: 21241.68 MB
15/09/2021 14:43:09 | | max memory usage when idle: 27777.58 MB


I've got one task "Waiting for Memory" so I stopped processing all new tasks to find out at what point it will run.
I'm currently down to just one other task running (from 16 cores) and it's still not able to continue yet, so I looked further and in Properties the task is showing

Virtual memory size 98.98 GB
Working set size 27.05 GB


"Houston, we have a problem..."
ID: 102625 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Old man

Send message
Joined: 10 Nov 07
Posts: 25
Credit: 1,122,372
RAC: 0
Message 102626 - Posted: 15 Sep 2021, 17:47:17 UTC - in response to Message 102606.  

[snip]

ERROR: [ERROR] Unable to open constraints file: m_ems_3hC_506_000000211_0001_43_58_H_._HHH_b2_02864_0001_1_0001.MSAcst

What to do?

That's the important line of the error messages.

That usually means that one of the input files for the task was not downloaded correctly.

Lately, that has often been because the file was not in the correct place on the server. If so, you can't do much other than wait for a better task.

If anyone else who gets a task from the same workunit and has it fail the same way, you'll get a little bit of credit for trying to run the task.


Thank you! I dont worry about that task anymore.
ID: 102626 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Sid Celery

Send message
Joined: 11 Feb 08
Posts: 2140
Credit: 41,518,559
RAC: 10,612
Message 102627 - Posted: 15 Sep 2021, 19:07:59 UTC - in response to Message 102625.  

My Boinc is set up as follows (from 32Gb total RAM):

15/09/2021 14:43:09 | | max memory usage when active: 21241.68 MB
15/09/2021 14:43:09 | | max memory usage when idle: 27777.58 MB

I've got one task "Waiting for Memory" so I stopped processing all new tasks to find out at what point it will run.
I'm currently down to just one other task running (from 16 cores) and it's still not able to continue yet, so I looked further and in Properties the task is showing

Virtual memory size 98.98 GB
Working set size 27.05 GB

"Houston, we have a problem..."

This is the task, which I've now aborted
degrader_site_4yc9_0_plait_-2.5_bcov_40_5_SAVE_ALL_OUT_IGNORE_THE_REST_5uy0wa3y_1731760_5_0
Looking at the details it says
Peak working set size 1,523.41 MB
Peak swap size 1,520.17 MB
Peak disk usage 6.10 MB

which doesn't match up with what it showed while it was running.
Also strange that it ran 3h 26m before it started reporting this way
ID: 102627 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Tomcat雄猫

Send message
Joined: 20 Dec 14
Posts: 180
Credit: 5,386,173
RAC: 0
Message 102628 - Posted: 15 Sep 2021, 22:06:01 UTC - in response to Message 102627.  
Last modified: 15 Sep 2021, 22:09:22 UTC

I received three errors, two of which are python tasks.

The regular Rosetta 4.20 task that failed was our old friend "5nvx", most certainly a broken WU...

5nvx_graft_buwei_xaa_SAVE_ALL_OUT_IGNORE_THE_REST_1kg6qt9p_1731819_1_0
<core_client_version>7.16.11</core_client_version>
<![CDATA[
<message>
Incorrect function.
 (0x1) - exit code 1 (0x1)</message>
<stderr_txt>
command: projects/boinc.bakerlab.org_rosetta/rosetta_4.20_windows_x86_64.exe -run:protocol jd2_scripting -parser:protocol pdblite_boinc_120_10--fuse--predictor_v11_boinc_fix--fuse--tslp_design_v1_boinc_fix_plus6.xml @5nvx_graft_buwei_flags -in:file:silent 5nvx_graft_buwei_xaa_SAVE_ALL_OUT_IGNORE_THE_REST_1kg6qt9p.silent -in:file:silent_struct_type binary -silent_gz -mute all -silent_read_through_errors true -out:file:silent_struct_type binary -out:file:silent default.out -in:file:boinc_wu_zip 5nvx_graft_buwei_xaa_SAVE_ALL_OUT_IGNORE_THE_REST_1kg6qt9p.zip @5nvx_graft_buwei_xaa_SAVE_ALL_OUT_IGNORE_THE_REST_1kg6qt9p.flags -nstruct 10000 -cpu_run_time 28800 -boinc:max_nstruct 20000 -checkpoint_interval 120 -database minirosetta_database -in::file::zip minirosetta_database.zip -boinc::watchdog -boinc::cpu_run_timeout 36000 -run::rng mt19937 -constant_seed -jran 1392509
Using database: database_357d5d93529_n_methylminirosetta_database

ERROR: [ERROR] Unable to open constraints file: m_HHH_b2_05051_000000119_0001_1_20_H_._HHH_b2_01913_0001_1_0001.MSAcst
ERROR:: Exit from: ......srccorescoringconstraintsConstraintIO.cc line: 457
BOINC:: Error reading and gzipping output datafile: default.out
18:47:53 (15224): called boinc_finish(1)

</stderr_txt>
]]>


Seriously, how is this error still a thing?!

The Python tasks I'm not so sure about. Is it a configuration problem on my end or is it a known problem with these tasks?

aaaf-ACBC_pp-PTABU_pp-TIC_pp-NMBEN3_5_1732058_1_1
<core_client_version>7.16.11</core_client_version>
<![CDATA[
<message>
WU download error: couldn't get input files:
<file_xfer_error>
  <file_name>AIMNet_vm_v2.vdi</file_name>
  <error_code>-119 (md5 checksum failed for file)</error_code>
  <error_message>MD5 check failed</error_message>
</file_xfer_error>
</message>
]]>


aaaf-ACBC_pp-PTABU_pp-TIC_pp-NMBEN3_7_1732060_7_1

<core_client_version>7.16.11</core_client_version>
<![CDATA[
<message>
WU download error: couldn't get input files:
<file_xfer_error>
  <file_name>AIMNet_vm_v2.vdi</file_name>
  <error_code>-119 (md5 checksum failed for file)</error_code>
  <error_message>MD5 check failed</error_message>
</file_xfer_error>
</message>
]]>
ID: 102628 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
.clair.

Send message
Joined: 2 Jan 07
Posts: 274
Credit: 26,399,595
RAC: 0
Message 102629 - Posted: 16 Sep 2021, 0:55:59 UTC - in response to Message 102625.  

My Boinc is set up as follows (from 32Gb total RAM):
[snip]
Virtual memory size 98.98 GB
Working set size 27.05 GB


"Houston, we have a problem..."

WOW . and I thort the `Horns` tasks from a while ago where big :0
ID: 102629 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Tomcat雄猫

Send message
Joined: 20 Dec 14
Posts: 180
Credit: 5,386,173
RAC: 0
Message 102644 - Posted: 17 Sep 2021, 12:15:24 UTC - in response to Message 102629.  
Last modified: 17 Sep 2021, 12:30:41 UTC

7 more Rosetta Python projects v1.03 (vbox64) errors. It's been a solid 100% fail rate on my end. Download errors every single one of them.

Windows 10

aaaf-PTAMBA_pp-mPTAMBA-ACHC_pp-NMBEN3_pp_0_1733022_1_1
<core_client_version>7.16.11</core_client_version>
<![CDATA[
<message>
WU download error: couldn't get input files:
<file_xfer_error>
  <file_name>AIMNet_vm_v2.vdi</file_name>
  <error_code>-119 (md5 checksum failed for file)</error_code>
  <error_message>MD5 check failed</error_message>
</file_xfer_error>
</message>
]]>


aaaf-PTAMBA_pp-PTAMBA_pp-mNMABU_pp-NMBEN3_pp_0_1732919_6_0
<core_client_version>7.16.11</core_client_version>
<![CDATA[
<message>
WU download error: couldn't get input files:
<file_xfer_error>
  <file_name>AIMNet_vm_v2.vdi</file_name>
  <error_code>-119 (md5 checksum failed for file)</error_code>
  <error_message>MD5 check failed</error_message>
</file_xfer_error>
</message>
]]>


[url=]aaaf-PTAMBA_pp-SAR_pp-NHM_pp-NMBEN3_pp_5_1732891_9_0[/url]
<core_client_version>7.16.11</core_client_version>
<![CDATA[
<message>
WU download error: couldn't get input files:
<file_xfer_error>
  <file_name>AIMNet_vm_v2.vdi</file_name>
  <error_code>-119 (md5 checksum failed for file)</error_code>
  <error_message>MD5 check failed</error_message>
</file_xfer_error>
</message>
]]>


aaaf-SAR-ACPC_pp-NMASN_pp-NMBEN3_pp_0_1732771_1_0
I think you know the drill now.

aaaf-SAR-ACPC_pp-SAR_pp-NMBEN3_pp_0_1732766_2_0

aaaf-SAR-ACPC_pp-TIC_pp-NMBEN3_pp_0_1732765_2_0
ID: 102644 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Tomcat雄猫

Send message
Joined: 20 Dec 14
Posts: 180
Credit: 5,386,173
RAC: 0
Message 102645 - Posted: 17 Sep 2021, 12:20:22 UTC - in response to Message 102644.  
Last modified: 17 Sep 2021, 12:21:24 UTC

Btw, another error with our old friend "5nvx".

5nvx_graft_buwei_xaf_SAVE_ALL_OUT_IGNORE_THE_REST_0ii8kg8j_1731796_3_0
<core_client_version>7.16.11</core_client_version>
<![CDATA[
<stderr_txt>
command: projects/boinc.bakerlab.org_rosetta/rosetta_4.20_windows_x86_64.exe -run:protocol jd2_scripting -parser:protocol pdblite_boinc_120_10--fuse--predictor_v11_boinc_fix--fuse--tslp_design_v1_boinc_fix_plus6.xml @5nvx_graft_buwei_flags -in:file:silent 5nvx_graft_buwei_xaf_SAVE_ALL_OUT_IGNORE_THE_REST_0ii8kg8j.silent -in:file:silent_struct_type binary -silent_gz -mute all -silent_read_through_errors true -out:file:silent_struct_type binary -out:file:silent default.out -in:file:boinc_wu_zip 5nvx_graft_buwei_xaf_SAVE_ALL_OUT_IGNORE_THE_REST_0ii8kg8j.zip @5nvx_graft_buwei_xaf_SAVE_ALL_OUT_IGNORE_THE_REST_0ii8kg8j.flags -nstruct 10000 -cpu_run_time 28800 -boinc:max_nstruct 20000 -checkpoint_interval 120 -database minirosetta_database -in::file::zip minirosetta_database.zip -boinc::watchdog -boinc::cpu_run_timeout 36000 -run::rng mt19937 -constant_seed -jran 1821155
Using database: database_357d5d93529_n_methylminirosetta_database

ERROR: [ERROR] Unable to open constraints file: m_ems_3hM_2942_000000222_0001_44_63_H_._HHH_b1_01193_0002_1_0001.MSAcst
ERROR:: Exit from: ......srccorescoringconstraintsConstraintIO.cc line: 457
05:24:07 (11072): called boinc_finish(0)

</stderr_txt>
]]>


I really hope these are simply rerun of previous bad batches and not new broken WUs with what appears to be the same problem.
ID: 102645 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Profile Grant (SSSF)

Send message
Joined: 28 Mar 20
Posts: 1725
Credit: 18,380,064
RAC: 20,136
Message 102658 - Posted: 18 Sep 2021, 5:11:19 UTC
Last modified: 18 Sep 2021, 5:16:21 UTC

Something's gone wrong somewhere.

Every request for work now results in a message telling me i don't have Virtual Box installed, and then not giving me any new work.
I'm doing Rosetta, not Ralph. It's not needed for Rosetta 4.20 Windows Tasks. And i'm not interested in Virtual Box. I don't want it, i don't need it, so i'm not going to install it.

18/09/2021 11:30:36 | Rosetta@home | Reporting 1 completed tasks
18/09/2021 11:30:36 | Rosetta@home | Not requesting tasks: don't need (CPU: job cache full; NVIDIA GPU: )
18/09/2021 11:30:39 | Rosetta@home | Scheduler request completed
18/09/2021 11:30:39 | Rosetta@home | Project requested delay of 31 seconds
18/09/2021 13:11:54 | Rosetta@home | Sending scheduler request: To fetch work.
18/09/2021 13:11:54 | Rosetta@home | Requesting new tasks for CPU
18/09/2021 13:11:57 | Rosetta@home | Scheduler request completed: got 0 new tasks
18/09/2021 13:11:57 | Rosetta@home | No tasks sent
18/09/2021 13:11:57 | Rosetta@home | Message from server: VirtualBox is not installed
18/09/2021 13:11:57 | Rosetta@home | Project requested delay of 31 seconds
18/09/2021 13:24:37 | Rosetta@home | Sending scheduler request: To fetch work.
18/09/2021 13:24:37 | Rosetta@home | Requesting new tasks for CPU
18/09/2021 13:24:39 | Rosetta@home | Scheduler request completed: got 0 new tasks
18/09/2021 13:24:39 | Rosetta@home | No tasks sent
18/09/2021 13:24:39 | Rosetta@home | Message from server: VirtualBox is not installed
18/09/2021 13:24:39 | Rosetta@home | Project requested delay of 31 seconds
18/09/2021 13:34:19 | Rosetta@home | Computation for task degrader_site_1tfq_plait_-1.5_bcov_40_5_SAVE_ALL_OUT_IGNORE_THE_REST_8xe4ce8r_1731465_5_1 finished
18/09/2021 13:34:22 | Rosetta@home | Starting task degrader_site_3mup_plait_-1.5_bcov_35.hbnet_5_SAVE_ALL_OUT_IGNORE_THE_REST_3ik3li5e_1731615_6_0
18/09/2021 13:34:23 | Rosetta@home | Started upload of degrader_site_1tfq_plait_-1.5_bcov_40_5_SAVE_ALL_OUT_IGNORE_THE_REST_8xe4ce8r_1731465_5_1_r881097555_0
18/09/2021 13:34:30 | Rosetta@home | Finished upload of degrader_site_1tfq_plait_-1.5_bcov_40_5_SAVE_ALL_OUT_IGNORE_THE_REST_8xe4ce8r_1731465_5_1_r881097555_0
18/09/2021 13:34:33 | Rosetta@home | Sending scheduler request: To fetch work.
18/09/2021 13:34:33 | Rosetta@home | Reporting 1 completed tasks
18/09/2021 13:34:33 | Rosetta@home | Requesting new tasks for CPU
18/09/2021 13:34:35 | Rosetta@home | Scheduler request completed: got 0 new tasks
18/09/2021 13:34:35 | Rosetta@home | No tasks sent
18/09/2021 13:34:35 | Rosetta@home | Message from server: VirtualBox is not installed
18/09/2021 13:34:35 | Rosetta@home | Project requested delay of 31 seconds
18/09/2021 13:44:16 | Rosetta@home | Sending scheduler request: To fetch work.
18/09/2021 13:44:16 | Rosetta@home | Requesting new tasks for CPU
18/09/2021 13:44:18 | Rosetta@home | Scheduler request completed: got 0 new tasks
18/09/2021 13:44:18 | Rosetta@home | No tasks sent
18/09/2021 13:44:18 | Rosetta@home | Message from server: VirtualBox is not installed
18/09/2021 13:44:18 | Rosetta@home | Project requested delay of 31 seconds
18/09/2021 14:06:05 | Rosetta@home | Sending scheduler request: To fetch work.
18/09/2021 14:06:05 | Rosetta@home | Requesting new tasks for CPU
18/09/2021 14:06:09 | Rosetta@home | Scheduler request completed: got 0 new tasks
18/09/2021 14:06:09 | Rosetta@home | No tasks sent
18/09/2021 14:06:09 | Rosetta@home | Message from server: VirtualBox is not installed
18/09/2021 14:06:09 | Rosetta@home | Project requested delay of 31 seconds
18/09/2021 14:19:54 | Rosetta@home | Computation for task 5nvx_graft_buwei_xab_SAVE_ALL_OUT_IGNORE_THE_REST_4mp3xf5t_1731803_3_0 finished
18/09/2021 14:19:56 | Rosetta@home | Starting task 5nvx_graft_buwei_xaa_SAVE_ALL_OUT_IGNORE_THE_REST_3rv9kn3y_1731815_3_1
18/09/2021 14:19:57 | Rosetta@home | Started upload of 5nvx_graft_buwei_xab_SAVE_ALL_OUT_IGNORE_THE_REST_4mp3xf5t_1731803_3_0_r1462363868_0
18/09/2021 14:20:07 | Rosetta@home | Finished upload of 5nvx_graft_buwei_xab_SAVE_ALL_OUT_IGNORE_THE_REST_4mp3xf5t_1731803_3_0_r1462363868_0
18/09/2021 14:20:12 | Rosetta@home | Sending scheduler request: To fetch work.
18/09/2021 14:20:12 | Rosetta@home | Reporting 1 completed tasks
18/09/2021 14:20:12 | Rosetta@home | Requesting new tasks for CPU
18/09/2021 14:20:15 | Rosetta@home | Scheduler request completed: got 0 new tasks
18/09/2021 14:20:15 | Rosetta@home | No tasks sent
18/09/2021 14:20:15 | Rosetta@home | Message from server: VirtualBox is not installed
18/09/2021 14:20:15 | Rosetta@home | Project requested delay of 31 seconds
18/09/2021 14:22:58 | Rosetta@home | Computation for task 5nvx_graft_buwei_xaa_SAVE_ALL_OUT_IGNORE_THE_REST_3rv9kn3y_1731815_3_1 finished
18/09/2021 14:23:00 | Rosetta@home | Starting task degrader_site_3mup_jhr_bcov4_SAVE_ALL_OUT_IGNORE_THE_REST_6hs9yf4g_1730712_6_0
18/09/2021 14:23:01 | Rosetta@home | Started upload of 5nvx_graft_buwei_xaa_SAVE_ALL_OUT_IGNORE_THE_REST_3rv9kn3y_1731815_3_1_r578275439_0
18/09/2021 14:23:03 | Rosetta@home | Finished upload of 5nvx_graft_buwei_xaa_SAVE_ALL_OUT_IGNORE_THE_REST_3rv9kn3y_1731815_3_1_r578275439_0
18/09/2021 14:23:07 | Rosetta@home | Sending scheduler request: To fetch work.
18/09/2021 14:23:07 | Rosetta@home | Reporting 1 completed tasks
18/09/2021 14:23:07 | Rosetta@home | Requesting new tasks for CPU
18/09/2021 14:23:09 | Rosetta@home | Scheduler request completed: got 0 new tasks
18/09/2021 14:23:09 | Rosetta@home | No tasks sent
18/09/2021 14:23:09 | Rosetta@home | Message from server: VirtualBox is not installed
18/09/2021 14:23:09 | Rosetta@home | Project requested delay of 31 seconds
18/09/2021 14:36:51 | Rosetta@home | Sending scheduler request: To fetch work.
18/09/2021 14:36:51 | Rosetta@home | Requesting new tasks for CPU
18/09/2021 14:36:53 | Rosetta@home | Scheduler request completed: got 0 new tasks
18/09/2021 14:36:53 | Rosetta@home | No tasks sent
18/09/2021 14:36:53 | Rosetta@home | Message from server: VirtualBox is not installed
18/09/2021 14:36:53 | Rosetta@home | Project requested delay of 31 seconds



Whatever server configuration changes were made recently, need to be reverted- Work in progress is steadily falling as people can no longer get work.
Grant
Darwin NT
ID: 102658 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Previous · 1 . . . 121 · 122 · 123 · 124 · 125 · 126 · 127 . . . 309 · Next

Message boards : Number crunching : Problems and Technical Issues with Rosetta@home



©2024 University of Washington
https://www.bakerlab.org