Problems and Technical Issues with Rosetta@home

Message boards : Number crunching : Problems and Technical Issues with Rosetta@home

To post messages, you must log in.

Previous · 1 . . . 122 · 123 · 124 · 125 · 126 · 127 · 128 . . . 276 · Next

AuthorMessage
Old man

Send message
Joined: 10 Nov 07
Posts: 25
Credit: 1,122,372
RAC: 1
Message 102626 - Posted: 15 Sep 2021, 17:47:17 UTC - in response to Message 102606.  

[snip]

ERROR: [ERROR] Unable to open constraints file: m_ems_3hC_506_000000211_0001_43_58_H_._HHH_b2_02864_0001_1_0001.MSAcst

What to do?

That's the important line of the error messages.

That usually means that one of the input files for the task was not downloaded correctly.

Lately, that has often been because the file was not in the correct place on the server. If so, you can't do much other than wait for a better task.

If anyone else who gets a task from the same workunit and has it fail the same way, you'll get a little bit of credit for trying to run the task.


Thank you! I dont worry about that task anymore.
ID: 102626 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Sid Celery

Send message
Joined: 11 Feb 08
Posts: 1981
Credit: 38,436,901
RAC: 13,958
Message 102627 - Posted: 15 Sep 2021, 19:07:59 UTC - in response to Message 102625.  

My Boinc is set up as follows (from 32Gb total RAM):

15/09/2021 14:43:09 | | max memory usage when active: 21241.68 MB
15/09/2021 14:43:09 | | max memory usage when idle: 27777.58 MB

I've got one task "Waiting for Memory" so I stopped processing all new tasks to find out at what point it will run.
I'm currently down to just one other task running (from 16 cores) and it's still not able to continue yet, so I looked further and in Properties the task is showing

Virtual memory size 98.98 GB
Working set size 27.05 GB

"Houston, we have a problem..."

This is the task, which I've now aborted
degrader_site_4yc9_0_plait_-2.5_bcov_40_5_SAVE_ALL_OUT_IGNORE_THE_REST_5uy0wa3y_1731760_5_0
Looking at the details it says
Peak working set size 1,523.41 MB
Peak swap size 1,520.17 MB
Peak disk usage 6.10 MB

which doesn't match up with what it showed while it was running.
Also strange that it ran 3h 26m before it started reporting this way
ID: 102627 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Tomcat雄猫

Send message
Joined: 20 Dec 14
Posts: 180
Credit: 5,364,639
RAC: 0
Message 102628 - Posted: 15 Sep 2021, 22:06:01 UTC - in response to Message 102627.  
Last modified: 15 Sep 2021, 22:09:22 UTC

I received three errors, two of which are python tasks.

The regular Rosetta 4.20 task that failed was our old friend "5nvx", most certainly a broken WU...

5nvx_graft_buwei_xaa_SAVE_ALL_OUT_IGNORE_THE_REST_1kg6qt9p_1731819_1_0
<core_client_version>7.16.11</core_client_version>
<![CDATA[
<message>
Incorrect function.
 (0x1) - exit code 1 (0x1)</message>
<stderr_txt>
command: projects/boinc.bakerlab.org_rosetta/rosetta_4.20_windows_x86_64.exe -run:protocol jd2_scripting -parser:protocol pdblite_boinc_120_10--fuse--predictor_v11_boinc_fix--fuse--tslp_design_v1_boinc_fix_plus6.xml @5nvx_graft_buwei_flags -in:file:silent 5nvx_graft_buwei_xaa_SAVE_ALL_OUT_IGNORE_THE_REST_1kg6qt9p.silent -in:file:silent_struct_type binary -silent_gz -mute all -silent_read_through_errors true -out:file:silent_struct_type binary -out:file:silent default.out -in:file:boinc_wu_zip 5nvx_graft_buwei_xaa_SAVE_ALL_OUT_IGNORE_THE_REST_1kg6qt9p.zip @5nvx_graft_buwei_xaa_SAVE_ALL_OUT_IGNORE_THE_REST_1kg6qt9p.flags -nstruct 10000 -cpu_run_time 28800 -boinc:max_nstruct 20000 -checkpoint_interval 120 -database minirosetta_database -in::file::zip minirosetta_database.zip -boinc::watchdog -boinc::cpu_run_timeout 36000 -run::rng mt19937 -constant_seed -jran 1392509
Using database: database_357d5d93529_n_methylminirosetta_database

ERROR: [ERROR] Unable to open constraints file: m_HHH_b2_05051_000000119_0001_1_20_H_._HHH_b2_01913_0001_1_0001.MSAcst
ERROR:: Exit from: ......srccorescoringconstraintsConstraintIO.cc line: 457
BOINC:: Error reading and gzipping output datafile: default.out
18:47:53 (15224): called boinc_finish(1)

</stderr_txt>
]]>


Seriously, how is this error still a thing?!

The Python tasks I'm not so sure about. Is it a configuration problem on my end or is it a known problem with these tasks?

aaaf-ACBC_pp-PTABU_pp-TIC_pp-NMBEN3_5_1732058_1_1
<core_client_version>7.16.11</core_client_version>
<![CDATA[
<message>
WU download error: couldn't get input files:
<file_xfer_error>
  <file_name>AIMNet_vm_v2.vdi</file_name>
  <error_code>-119 (md5 checksum failed for file)</error_code>
  <error_message>MD5 check failed</error_message>
</file_xfer_error>
</message>
]]>


aaaf-ACBC_pp-PTABU_pp-TIC_pp-NMBEN3_7_1732060_7_1

<core_client_version>7.16.11</core_client_version>
<![CDATA[
<message>
WU download error: couldn't get input files:
<file_xfer_error>
  <file_name>AIMNet_vm_v2.vdi</file_name>
  <error_code>-119 (md5 checksum failed for file)</error_code>
  <error_message>MD5 check failed</error_message>
</file_xfer_error>
</message>
]]>
ID: 102628 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
.clair.

Send message
Joined: 2 Jan 07
Posts: 274
Credit: 26,399,595
RAC: 0
Message 102629 - Posted: 16 Sep 2021, 0:55:59 UTC - in response to Message 102625.  

My Boinc is set up as follows (from 32Gb total RAM):
[snip]
Virtual memory size 98.98 GB
Working set size 27.05 GB


"Houston, we have a problem..."

WOW . and I thort the `Horns` tasks from a while ago where big :0
ID: 102629 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Tomcat雄猫

Send message
Joined: 20 Dec 14
Posts: 180
Credit: 5,364,639
RAC: 0
Message 102644 - Posted: 17 Sep 2021, 12:15:24 UTC - in response to Message 102629.  
Last modified: 17 Sep 2021, 12:30:41 UTC

7 more Rosetta Python projects v1.03 (vbox64) errors. It's been a solid 100% fail rate on my end. Download errors every single one of them.

Windows 10

aaaf-PTAMBA_pp-mPTAMBA-ACHC_pp-NMBEN3_pp_0_1733022_1_1
<core_client_version>7.16.11</core_client_version>
<![CDATA[
<message>
WU download error: couldn't get input files:
<file_xfer_error>
  <file_name>AIMNet_vm_v2.vdi</file_name>
  <error_code>-119 (md5 checksum failed for file)</error_code>
  <error_message>MD5 check failed</error_message>
</file_xfer_error>
</message>
]]>


aaaf-PTAMBA_pp-PTAMBA_pp-mNMABU_pp-NMBEN3_pp_0_1732919_6_0
<core_client_version>7.16.11</core_client_version>
<![CDATA[
<message>
WU download error: couldn't get input files:
<file_xfer_error>
  <file_name>AIMNet_vm_v2.vdi</file_name>
  <error_code>-119 (md5 checksum failed for file)</error_code>
  <error_message>MD5 check failed</error_message>
</file_xfer_error>
</message>
]]>


[url=]aaaf-PTAMBA_pp-SAR_pp-NHM_pp-NMBEN3_pp_5_1732891_9_0[/url]
<core_client_version>7.16.11</core_client_version>
<![CDATA[
<message>
WU download error: couldn't get input files:
<file_xfer_error>
  <file_name>AIMNet_vm_v2.vdi</file_name>
  <error_code>-119 (md5 checksum failed for file)</error_code>
  <error_message>MD5 check failed</error_message>
</file_xfer_error>
</message>
]]>


aaaf-SAR-ACPC_pp-NMASN_pp-NMBEN3_pp_0_1732771_1_0
I think you know the drill now.

aaaf-SAR-ACPC_pp-SAR_pp-NMBEN3_pp_0_1732766_2_0

aaaf-SAR-ACPC_pp-TIC_pp-NMBEN3_pp_0_1732765_2_0
ID: 102644 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Tomcat雄猫

Send message
Joined: 20 Dec 14
Posts: 180
Credit: 5,364,639
RAC: 0
Message 102645 - Posted: 17 Sep 2021, 12:20:22 UTC - in response to Message 102644.  
Last modified: 17 Sep 2021, 12:21:24 UTC

Btw, another error with our old friend "5nvx".

5nvx_graft_buwei_xaf_SAVE_ALL_OUT_IGNORE_THE_REST_0ii8kg8j_1731796_3_0
<core_client_version>7.16.11</core_client_version>
<![CDATA[
<stderr_txt>
command: projects/boinc.bakerlab.org_rosetta/rosetta_4.20_windows_x86_64.exe -run:protocol jd2_scripting -parser:protocol pdblite_boinc_120_10--fuse--predictor_v11_boinc_fix--fuse--tslp_design_v1_boinc_fix_plus6.xml @5nvx_graft_buwei_flags -in:file:silent 5nvx_graft_buwei_xaf_SAVE_ALL_OUT_IGNORE_THE_REST_0ii8kg8j.silent -in:file:silent_struct_type binary -silent_gz -mute all -silent_read_through_errors true -out:file:silent_struct_type binary -out:file:silent default.out -in:file:boinc_wu_zip 5nvx_graft_buwei_xaf_SAVE_ALL_OUT_IGNORE_THE_REST_0ii8kg8j.zip @5nvx_graft_buwei_xaf_SAVE_ALL_OUT_IGNORE_THE_REST_0ii8kg8j.flags -nstruct 10000 -cpu_run_time 28800 -boinc:max_nstruct 20000 -checkpoint_interval 120 -database minirosetta_database -in::file::zip minirosetta_database.zip -boinc::watchdog -boinc::cpu_run_timeout 36000 -run::rng mt19937 -constant_seed -jran 1821155
Using database: database_357d5d93529_n_methylminirosetta_database

ERROR: [ERROR] Unable to open constraints file: m_ems_3hM_2942_000000222_0001_44_63_H_._HHH_b1_01193_0002_1_0001.MSAcst
ERROR:: Exit from: ......srccorescoringconstraintsConstraintIO.cc line: 457
05:24:07 (11072): called boinc_finish(0)

</stderr_txt>
]]>


I really hope these are simply rerun of previous bad batches and not new broken WUs with what appears to be the same problem.
ID: 102645 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Profile Grant (SSSF)

Send message
Joined: 28 Mar 20
Posts: 1481
Credit: 14,594,732
RAC: 14,965
Message 102658 - Posted: 18 Sep 2021, 5:11:19 UTC
Last modified: 18 Sep 2021, 5:16:21 UTC

Something's gone wrong somewhere.

Every request for work now results in a message telling me i don't have Virtual Box installed, and then not giving me any new work.
I'm doing Rosetta, not Ralph. It's not needed for Rosetta 4.20 Windows Tasks. And i'm not interested in Virtual Box. I don't want it, i don't need it, so i'm not going to install it.

18/09/2021 11:30:36 | Rosetta@home | Reporting 1 completed tasks
18/09/2021 11:30:36 | Rosetta@home | Not requesting tasks: don't need (CPU: job cache full; NVIDIA GPU: )
18/09/2021 11:30:39 | Rosetta@home | Scheduler request completed
18/09/2021 11:30:39 | Rosetta@home | Project requested delay of 31 seconds
18/09/2021 13:11:54 | Rosetta@home | Sending scheduler request: To fetch work.
18/09/2021 13:11:54 | Rosetta@home | Requesting new tasks for CPU
18/09/2021 13:11:57 | Rosetta@home | Scheduler request completed: got 0 new tasks
18/09/2021 13:11:57 | Rosetta@home | No tasks sent
18/09/2021 13:11:57 | Rosetta@home | Message from server: VirtualBox is not installed
18/09/2021 13:11:57 | Rosetta@home | Project requested delay of 31 seconds
18/09/2021 13:24:37 | Rosetta@home | Sending scheduler request: To fetch work.
18/09/2021 13:24:37 | Rosetta@home | Requesting new tasks for CPU
18/09/2021 13:24:39 | Rosetta@home | Scheduler request completed: got 0 new tasks
18/09/2021 13:24:39 | Rosetta@home | No tasks sent
18/09/2021 13:24:39 | Rosetta@home | Message from server: VirtualBox is not installed
18/09/2021 13:24:39 | Rosetta@home | Project requested delay of 31 seconds
18/09/2021 13:34:19 | Rosetta@home | Computation for task degrader_site_1tfq_plait_-1.5_bcov_40_5_SAVE_ALL_OUT_IGNORE_THE_REST_8xe4ce8r_1731465_5_1 finished
18/09/2021 13:34:22 | Rosetta@home | Starting task degrader_site_3mup_plait_-1.5_bcov_35.hbnet_5_SAVE_ALL_OUT_IGNORE_THE_REST_3ik3li5e_1731615_6_0
18/09/2021 13:34:23 | Rosetta@home | Started upload of degrader_site_1tfq_plait_-1.5_bcov_40_5_SAVE_ALL_OUT_IGNORE_THE_REST_8xe4ce8r_1731465_5_1_r881097555_0
18/09/2021 13:34:30 | Rosetta@home | Finished upload of degrader_site_1tfq_plait_-1.5_bcov_40_5_SAVE_ALL_OUT_IGNORE_THE_REST_8xe4ce8r_1731465_5_1_r881097555_0
18/09/2021 13:34:33 | Rosetta@home | Sending scheduler request: To fetch work.
18/09/2021 13:34:33 | Rosetta@home | Reporting 1 completed tasks
18/09/2021 13:34:33 | Rosetta@home | Requesting new tasks for CPU
18/09/2021 13:34:35 | Rosetta@home | Scheduler request completed: got 0 new tasks
18/09/2021 13:34:35 | Rosetta@home | No tasks sent
18/09/2021 13:34:35 | Rosetta@home | Message from server: VirtualBox is not installed
18/09/2021 13:34:35 | Rosetta@home | Project requested delay of 31 seconds
18/09/2021 13:44:16 | Rosetta@home | Sending scheduler request: To fetch work.
18/09/2021 13:44:16 | Rosetta@home | Requesting new tasks for CPU
18/09/2021 13:44:18 | Rosetta@home | Scheduler request completed: got 0 new tasks
18/09/2021 13:44:18 | Rosetta@home | No tasks sent
18/09/2021 13:44:18 | Rosetta@home | Message from server: VirtualBox is not installed
18/09/2021 13:44:18 | Rosetta@home | Project requested delay of 31 seconds
18/09/2021 14:06:05 | Rosetta@home | Sending scheduler request: To fetch work.
18/09/2021 14:06:05 | Rosetta@home | Requesting new tasks for CPU
18/09/2021 14:06:09 | Rosetta@home | Scheduler request completed: got 0 new tasks
18/09/2021 14:06:09 | Rosetta@home | No tasks sent
18/09/2021 14:06:09 | Rosetta@home | Message from server: VirtualBox is not installed
18/09/2021 14:06:09 | Rosetta@home | Project requested delay of 31 seconds
18/09/2021 14:19:54 | Rosetta@home | Computation for task 5nvx_graft_buwei_xab_SAVE_ALL_OUT_IGNORE_THE_REST_4mp3xf5t_1731803_3_0 finished
18/09/2021 14:19:56 | Rosetta@home | Starting task 5nvx_graft_buwei_xaa_SAVE_ALL_OUT_IGNORE_THE_REST_3rv9kn3y_1731815_3_1
18/09/2021 14:19:57 | Rosetta@home | Started upload of 5nvx_graft_buwei_xab_SAVE_ALL_OUT_IGNORE_THE_REST_4mp3xf5t_1731803_3_0_r1462363868_0
18/09/2021 14:20:07 | Rosetta@home | Finished upload of 5nvx_graft_buwei_xab_SAVE_ALL_OUT_IGNORE_THE_REST_4mp3xf5t_1731803_3_0_r1462363868_0
18/09/2021 14:20:12 | Rosetta@home | Sending scheduler request: To fetch work.
18/09/2021 14:20:12 | Rosetta@home | Reporting 1 completed tasks
18/09/2021 14:20:12 | Rosetta@home | Requesting new tasks for CPU
18/09/2021 14:20:15 | Rosetta@home | Scheduler request completed: got 0 new tasks
18/09/2021 14:20:15 | Rosetta@home | No tasks sent
18/09/2021 14:20:15 | Rosetta@home | Message from server: VirtualBox is not installed
18/09/2021 14:20:15 | Rosetta@home | Project requested delay of 31 seconds
18/09/2021 14:22:58 | Rosetta@home | Computation for task 5nvx_graft_buwei_xaa_SAVE_ALL_OUT_IGNORE_THE_REST_3rv9kn3y_1731815_3_1 finished
18/09/2021 14:23:00 | Rosetta@home | Starting task degrader_site_3mup_jhr_bcov4_SAVE_ALL_OUT_IGNORE_THE_REST_6hs9yf4g_1730712_6_0
18/09/2021 14:23:01 | Rosetta@home | Started upload of 5nvx_graft_buwei_xaa_SAVE_ALL_OUT_IGNORE_THE_REST_3rv9kn3y_1731815_3_1_r578275439_0
18/09/2021 14:23:03 | Rosetta@home | Finished upload of 5nvx_graft_buwei_xaa_SAVE_ALL_OUT_IGNORE_THE_REST_3rv9kn3y_1731815_3_1_r578275439_0
18/09/2021 14:23:07 | Rosetta@home | Sending scheduler request: To fetch work.
18/09/2021 14:23:07 | Rosetta@home | Reporting 1 completed tasks
18/09/2021 14:23:07 | Rosetta@home | Requesting new tasks for CPU
18/09/2021 14:23:09 | Rosetta@home | Scheduler request completed: got 0 new tasks
18/09/2021 14:23:09 | Rosetta@home | No tasks sent
18/09/2021 14:23:09 | Rosetta@home | Message from server: VirtualBox is not installed
18/09/2021 14:23:09 | Rosetta@home | Project requested delay of 31 seconds
18/09/2021 14:36:51 | Rosetta@home | Sending scheduler request: To fetch work.
18/09/2021 14:36:51 | Rosetta@home | Requesting new tasks for CPU
18/09/2021 14:36:53 | Rosetta@home | Scheduler request completed: got 0 new tasks
18/09/2021 14:36:53 | Rosetta@home | No tasks sent
18/09/2021 14:36:53 | Rosetta@home | Message from server: VirtualBox is not installed
18/09/2021 14:36:53 | Rosetta@home | Project requested delay of 31 seconds



Whatever server configuration changes were made recently, need to be reverted- Work in progress is steadily falling as people can no longer get work.
Grant
Darwin NT
ID: 102658 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Profile Grant (SSSF)

Send message
Joined: 28 Mar 20
Posts: 1481
Credit: 14,594,732
RAC: 14,965
Message 102667 - Posted: 18 Sep 2021, 6:05:58 UTC

Oh well, a few more hours & i'll be out of work again, even though this time there's still millions available.
Grant
Darwin NT
ID: 102667 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Profile Grant (SSSF)

Send message
Joined: 28 Mar 20
Posts: 1481
Credit: 14,594,732
RAC: 14,965
Message 102668 - Posted: 18 Sep 2021, 6:32:35 UTC - in response to Message 102667.  

Oh well, a few more hours & i'll be out of work again, even though this time there's still millions available.
I don't want to jinx things, but work appears to be flowing again.
Complains about the lack of VirtualBox messages keep occurring, but at least i can get work again.
Grant
Darwin NT
ID: 102668 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Profile Grant (SSSF)

Send message
Joined: 28 Mar 20
Posts: 1481
Credit: 14,594,732
RAC: 14,965
Message 102669 - Posted: 18 Sep 2021, 6:45:49 UTC - in response to Message 102668.  

Oh well, a few more hours & i'll be out of work again, even though this time there's still millions available.
I don't want to jinx things, but work appears to be flowing again.
Complains about the lack of VirtualBox messages keep occurring, but at least i can get work again.
I did jinx it.
Work has stopped flowing again.
Grant
Darwin NT
ID: 102669 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Bryn Mawr

Send message
Joined: 26 Dec 18
Posts: 374
Credit: 10,704,598
RAC: 5,643
Message 102673 - Posted: 18 Sep 2021, 8:35:38 UTC - in response to Message 102668.  

Oh well, a few more hours & i'll be out of work again, even though this time there's still millions available.
I don't want to jinx things, but work appears to be flowing again.
Complains about the lack of VirtualBox messages keep occurring, but at least i can get work again.


I’m with you, I will not be running virtualbox or the python tasks so if that screws up running normal Rosetta so be it.
ID: 102673 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Sputnik
Avatar

Send message
Joined: 7 Aug 17
Posts: 1
Credit: 8,891,837
RAC: 7,500
Message 102674 - Posted: 18 Sep 2021, 8:52:21 UTC - in response to Message 80621.  
Last modified: 18 Sep 2021, 9:03:55 UTC

Hej. I still get download errors for the rosetta python projects 1.03 files, because there are some checksum errors / MD5 errors in BOINC Manager 7.16.11 (x64) for the co-loaded 2GB file AIMNet_vm_v2.vdi

18.09.2021 10:42:42 | Rosetta@home | Finished download of AIMNet_vm_v2.vdi
18.09.2021 10:43:23 | Rosetta@home | [error] MD5 check failed for AIMNet_vm_v2.vdi
18.09.2021 10:43:23 | Rosetta@home | [error] expected d41d8cd98f00b204e9800998ecf8427e, got 61fef19456bb58ec941845ef08d8c5ef
18.09.2021 10:43:23 | Rosetta@home | [error] Checksum or signature error for AIMNet_vm_v2.vdi

How to stop the download of these incorrect co-loaded 2GB file/python projects in the rosetta portal (https://boinc.bakerlab.org/rosetta/prefs.php?subset=project)?
Or do i have to stop crunching for the whole rosetta project (e.g. rosetta 4.20)?

The Oracle VirtualBox 6.1.26 is installed - and is running fine - used for LHC Home projects.


THX Sputnik
ID: 102674 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
winny33

Send message
Joined: 15 Sep 09
Posts: 1
Credit: 611,268
RAC: 138
Message 102679 - Posted: 18 Sep 2021, 11:12:59 UTC
Last modified: 18 Sep 2021, 11:15:37 UTC

Hello Me I can't stop having this error in all my spots.


18/0918/09/2021 13:08:21 | Rosetta@home | [error] MD5 check failed for AIMNet_vm_v2.vdi
/2021 13:08:21 | Rosetta@home | [error] expected d41d8cd98f00b204e9800998ecf8427e, got 61fef19456bb58ec941845ef08d8c5ef
18/09/2021 13:08:21 | Rosetta@home | [error] Checksum or signature error for AIMNet_vm_v2.vdi


<core_client_version>7.16.11</core_client_version>
<![CDATA[
<message>
WU download error: couldn't get input files:
<file_xfer_error>
<file_name>AIMNet_vm_v2.vdi</file_name>
<error_code>-119 (md5 checksum failed for file)</error_code>
<error_message>MD5 check failed</error_message>
</file_xfer_error>
</message>
]]>
ID: 102679 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
StarCastle

Send message
Joined: 25 Apr 20
Posts: 7
Credit: 786,135
RAC: 928
Message 102680 - Posted: 18 Sep 2021, 11:49:41 UTC

Receiving a checksum error on the python downloads. This started happening mid last week.

Here is the event log of one of those downloads


09/18/2021 07:13:49 | Rosetta@home | Started download of AIMNet_vm_v2.vdi
09/18/2021 07:13:49 | Rosetta@home | Started download of aaaf-mNMABU_pp-mPTAMBA_pp-NHM_pp-NMBEN3_pp_0.gz
09/18/2021 07:13:51 | Rosetta@home | Finished download of aaaf-mNMABU_pp-mPTAMBA_pp-NHM_pp-NMBEN3_pp_0.gz
09/18/2021 07:13:51 | Rosetta@home | Started download of aaaf-mNMALA_pp-PTAMBA_pp-TIC_pp-NMBEN3_pp_0.gz
09/18/2021 07:13:52 | Rosetta@home | Finished download of aaaf-mNMALA_pp-PTAMBA_pp-TIC_pp-NMBEN3_pp_0.gz
09/18/2021 07:13:52 | Rosetta@home | Started download of aaaf-mNMABU_pp-PTAMBA_pp-PIP_pp-NMBEN3_pp_0.gz
09/18/2021 07:13:53 | Rosetta@home | Finished download of aaaf-mNMABU_pp-PTAMBA_pp-PIP_pp-NMBEN3_pp_0.gz
09/18/2021 07:13:53 | Rosetta@home | Started download of aaaf-mNMABU-ACPC-ABU_pp-NMBEN3_pp_0.gz
09/18/2021 07:13:54 | Rosetta@home | Finished download of aaaf-mNMABU-ACPC-ABU_pp-NMBEN3_pp_0.gz
09/18/2021 07:13:54 | Rosetta@home | Started download of aaaf-mNMABU-ACPC-SER_pp-NMBEN3_pp_0.gz
09/18/2021 07:13:56 | Rosetta@home | Finished download of aaaf-mNMABU-ACPC-SER_pp-NMBEN3_pp_0.gz
09/18/2021 07:13:56 | Rosetta@home | Started download of aaaf-mNMABU-ACPC-TBA_pp-NMBEN3_pp_0.gz
09/18/2021 07:13:57 | Rosetta@home | Finished download of aaaf-mNMABU-ACPC-TBA_pp-NMBEN3_pp_0.gz
09/18/2021 07:13:57 | Rosetta@home | Started download of aaaf-mNMASN-ACPC_pp-PIP_pp-NMBEN3_pp_0.gz
09/18/2021 07:13:58 | Rosetta@home | Finished download of aaaf-mNMASN-ACPC_pp-PIP_pp-NMBEN3_pp_0.gz
09/18/2021 07:13:58 | Rosetta@home | Started download of aaaf-mNMASN-ACPC_pp-NMLEU_pp-NMBEN3_pp_0.gz
09/18/2021 07:13:59 | Rosetta@home | Finished download of aaaf-mNMASN-ACPC_pp-NMLEU_pp-NMBEN3_pp_0.gz
09/18/2021 07:21:59 | Rosetta@home | Finished download of AIMNet_vm_v2.vdi
09/18/2021 07:23:12 | Rosetta@home | [error] MD5 check failed for AIMNet_vm_v2.vdi
09/18/2021 07:23:12 | Rosetta@home | [error] expected d41d8cd98f00b204e9800998ecf8427e, got 61fef19456bb58ec941845ef08d8c5ef
09/18/2021 07:23:12 | Rosetta@home | [error] Checksum or signature error for AIMNet_vm_v2.vdi
ID: 102680 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Jim1348

Send message
Joined: 19 Jan 06
Posts: 881
Credit: 52,257,545
RAC: 0
Message 102699 - Posted: 18 Sep 2021, 21:46:47 UTC

This is all very interesting, but the problems that I see here are all on Windows 10 machines. I will be able to start running them on Ubuntu 20.04.3 machines by Monday. If anyone is seeing problems on that, I would appreciate knowing about it.

My machines have enough memory to run four to six at a time. If there are any cache limitations on running that many, it would be interesting to know about that too. Good luck.
ID: 102699 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Profile Grant (SSSF)

Send message
Joined: 28 Mar 20
Posts: 1481
Credit: 14,594,732
RAC: 14,965
Message 102701 - Posted: 18 Sep 2021, 21:54:21 UTC

Event log no longer showing complaints about lack of VirtualBox, and system is now able to get work again.
Grant
Darwin NT
ID: 102701 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Profile Greg_BE
Avatar

Send message
Joined: 30 May 06
Posts: 5662
Credit: 5,700,734
RAC: 2,091
Message 102716 - Posted: 19 Sep 2021, 9:45:14 UTC - in response to Message 102699.  

This is all very interesting, but the problems that I see here are all on Windows 10 machines. I will be able to start running them on Ubuntu 20.04.3 machines by Monday. If anyone is seeing problems on that, I would appreciate knowing about it.

My machines have enough memory to run four to six at a time. If there are any cache limitations on running that many, it would be interesting to know about that too. Good luck.



But it is not a issue for Linux machines because your running the native environment of the task.
Us Windows users are emulating Linux via a Virtual Machine and that seems to be where the problem is at.
The Virtual Disk Image file Checksum code that we get does not seem to match the code the server wants and then we get a code -119 MD5 checksum error.

I've been fortunate not to get that.
There is a guy in Italy in another thread with a Windows machine that gets nothing but these errors, no matter what we have tried. And it only related to Python tasks which are VM tasks.
ID: 102716 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Jim1348

Send message
Joined: 19 Jan 06
Posts: 881
Credit: 52,257,545
RAC: 0
Message 102720 - Posted: 19 Sep 2021, 11:50:34 UTC - in response to Message 102716.  

But it is not a issue for Linux machines because your running the native environment of the task.
Us Windows users are emulating Linux via a Virtual Machine and that seems to be where the problem is at.
The Virtual Disk Image file Checksum code that we get does not seem to match the code the server wants and then we get a code -119 MD5 checksum error.

I have to run VirtualBox on my Ubuntu machines also to do the pythons.

But it does not always work quite the same way as on Windows. It is hopefully a small discrepancy, but we all need someone at Rosetta to look into it, and they never bother to even acknowledge problems. Maybe someone can get the attention of the Admin.
ID: 102720 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Nature Boy

Send message
Joined: 16 Jul 10
Posts: 1
Credit: 2,217,155
RAC: 0
Message 102727 - Posted: 19 Sep 2021, 14:21:11 UTC - in response to Message 102699.  

This is all very interesting, but the problems that I see here are all on Windows 10 machines. I will be able to start running them on Ubuntu 20.04.3 machines by Monday. If anyone is seeing problems on that, I would appreciate knowing about it.


Running Utuntu 20.04 and the file keeps trying to download and failing. Boinc Manager 7.16.6. The file is attempting to download, again, as I type.

The latest failure part of the log:

Sun 19 Sep 2021 02:09:53 AM EDT | Rosetta@home | Finished download of AIMNet_vm_v2.vdi
Sun 19 Sep 2021 02:10:55 AM EDT | Rosetta@home | [error] MD5 check failed for AIMNet_vm_v2.vdi
Sun 19 Sep 2021 02:10:55 AM EDT | Rosetta@home | [error] expected d41d8cd98f00b204e9800998ecf8427e, got 61fef19456bb58ec941845ef08d8c5ef
Sun 19 Sep 2021 02:10:55 AM EDT | Rosetta@home | [error] Checksum or signature error for AIMNet_vm_v2.vdi

First occurrence, in my log is:

Fri 17 Sep 2021 08:26:54 AM EDT | Rosetta@home | Started download of AIMNet_vm_v2.vdi
Fri 17 Sep 2021 08:26:54 AM EDT | Rosetta@home | Started download of AIMNet_minimization_python_project.py
Fri 17 Sep 2021 08:26:56 AM EDT | Rosetta@home | Finished download of AIMNet_minimization_python_project.py
Fri 17 Sep 2021 08:26:56 AM EDT | Rosetta@home | Started download of aaaf-PTAMBA-mTBG_pp-NMABU_pp-NMBEN3_pp_1.gz
Fri 17 Sep 2021 08:26:59 AM EDT | Rosetta@home | Finished download of aaaf-PTAMBA-mTBG_pp-NMABU_pp-NMBEN3_pp_1.gz
Fri 17 Sep 2021 09:24:18 AM EDT | Rosetta@home | Starting task degrader_site_1tfq_plait_-1.5_bcov_25.hbnet_5_SAVE_ALL_OUT_IGNORE_THE_REST_5bw7nj6v_1731293_5_0
Fri 17 Sep 2021 09:24:22 AM EDT | Rosetta@home | Starting task 5nvx_graft_buwei_xad_SAVE_ALL_OUT_IGNORE_THE_REST_9mp6yp0h_1731808_3_0
Fri 17 Sep 2021 10:19:13 AM EDT | Rosetta@home | Finished download of AIMNet_vm_v2.vdi
Fri 17 Sep 2021 10:20:12 AM EDT | Rosetta@home | [error] MD5 check failed for AIMNet_vm_v2.vdi
Fri 17 Sep 2021 10:20:12 AM EDT | Rosetta@home | [error] expected d41d8cd98f00b204e9800998ecf8427e, got 61fef19456bb58ec941845ef08d8c5ef
Fri 17 Sep 2021 10:20:12 AM EDT | Rosetta@home | [error] Checksum or signature error for AIMNet_vm_v2.vdi

How can I make it stop? I have been a long time supporter of RH, but will disconnect from the project if it keeps wasting my bandwidth with 2GB failed downloads.
ID: 102727 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Dr Who Fan
Avatar

Send message
Joined: 28 May 06
Posts: 62
Credit: 230,705
RAC: 110
Message 102728 - Posted: 19 Sep 2021, 15:58:59 UTC - in response to Message 102727.  

How can I make it stop? I have been a long time supporter of RH, but will disconnect from the project if it keeps wasting my bandwidth with 2GB failed downloads.

Temporarily STOP REQUESTING NEW WORK FOR THE PROJET by settling NO NEW WORK for Rosetta on your PC(S) in BOINC until the project fixes the problem.

ID: 102728 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Previous · 1 . . . 122 · 123 · 124 · 125 · 126 · 127 · 128 . . . 276 · Next

Message boards : Number crunching : Problems and Technical Issues with Rosetta@home



©2024 University of Washington
https://www.bakerlab.org