Problems and Technical Issues with Rosetta@home

Message boards : Number crunching : Problems and Technical Issues with Rosetta@home

To post messages, you must log in.

Previous · 1 . . . 47 · 48 · 49 · 50 · 51 · 52 · 53 . . . 309 · Next

AuthorMessage
Profile Grant (SSSF)

Send message
Joined: 28 Mar 20
Posts: 1725
Credit: 18,378,164
RAC: 20,578
Message 96003 - Posted: 4 May 2020, 10:37:59 UTC

Seems to be a problem with the web site/database when checking out Tasks.

When i go to my Account, i can click on View next to Tasks to see all of my Tasks.
But if go to click on Valid or Error etc all i get is
Already logged in
You are logged in as Grant (SSSF) . Log out

and the url is
https://boinc.bakerlab.org/rosetta/login_form.php?next_url=%2Fresults.php%3Fuserid%3D2125796%26offset%3D0%26show_names%3D0%26state%3D4%26appid%3D
For some reason it's pulling up the login form.

If on my account page i click on View for Computers on this account, then Tasks for each of the computers i can then see the Valids, Errors etc. However at the top right corner my name is replaced with "Sign Up" and next to it Log out is replaced with Login.
Grant
Darwin NT
ID: 96003 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Boiler Paul

Send message
Joined: 14 Apr 20
Posts: 4
Credit: 775,245
RAC: 0
Message 96008 - Posted: 4 May 2020, 12:27:53 UTC - in response to Message 96003.  

Seems to be a problem with the web site/database when checking out Tasks.

When i go to my Account, i can click on View next to Tasks to see all of my Tasks.
But if go to click on Valid or Error etc all i get is
Already logged in
You are logged in as Grant (SSSF) . Log out

and the url is
https://boinc.bakerlab.org/rosetta/login_form.php?next_url=%2Fresults.php%3Fuserid%3D2125796%26offset%3D0%26show_names%3D0%26state%3D4%26appid%3D
For some reason it's pulling up the login form.

If on my account page i click on View for Computers on this account, then Tasks for each of the computers i can then see the Valids, Errors etc. However at the top right corner my name is replaced with "Sign Up" and next to it Log out is replaced with Login.



happening to me too. I logged out and logged back in...no change. Even cleared out cookies and rebooted.....no change.
ID: 96008 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
James W

Send message
Joined: 25 Nov 12
Posts: 130
Credit: 1,766,254
RAC: 0
Message 96029 - Posted: 4 May 2020, 15:16:59 UTC - in response to Message 96003.  

Same situation with me since last night. All I can see is first screen of "All tasks for James W." If I click on any option, such as to go to next screen, see valid tasks, etc., will get the "already logged in" message like Grant mentioned. Apparently a web site issue.
ID: 96029 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Toni Guerrero

Send message
Joined: 1 Oct 08
Posts: 1
Credit: 163,278
RAC: 0
Message 96030 - Posted: 4 May 2020, 15:27:15 UTC

Hello everybody.

I get computation errors (exit code 11) in all Junior_HalfRoid_design5_COVID-19 tasks I'm crunching on Android 5.0.2, Boinc 7.4.53, Rosetta v4.20 arm-android-linux-gnu, CPU ARMv7 Processor rev 0 (v7l). Previously, when runnin rosetta v4.16 this same device whas crunching those tasks with no issues. Anyone has this same behaviour?

Thank you.
ID: 96030 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Admin
Project administrator

Send message
Joined: 1 Jul 05
Posts: 5144
Credit: 0
RAC: 0
Message 96040 - Posted: 4 May 2020, 16:09:16 UTC - in response to Message 96029.  

Sorry about the web site issues. I made some updates that obviously caused a bug. I'll work on a fix.
ID: 96040 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Profile David E K
Volunteer moderator
Project administrator
Project developer
Project scientist

Send message
Joined: 1 Jul 05
Posts: 1480
Credit: 4,334,829
RAC: 0
Message 96049 - Posted: 4 May 2020, 16:38:14 UTC - in response to Message 96030.  

Hello everybody.

I get computation errors (exit code 11) in all Junior_HalfRoid_design5_COVID-19 tasks I'm crunching on Android 5.0.2, Boinc 7.4.53, Rosetta v4.20 arm-android-linux-gnu, CPU ARMv7 Processor rev 0 (v7l). Previously, when runnin rosetta v4.16 this same device whas crunching those tasks with no issues. Anyone has this same behaviour?

Thank you.


Those jobs appear to have some issues. People have reported that if the job restarts, it can cause an error. We'll look into this but since it's somewhat rare we are continuing these jobs.
ID: 96049 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Profile Grant (SSSF)

Send message
Joined: 28 Mar 20
Posts: 1725
Credit: 18,378,164
RAC: 20,578
Message 96063 - Posted: 4 May 2020, 18:46:53 UTC - in response to Message 96040.  

Sorry about the web site issues. I made some updates that obviously caused a bug. I'll work on a fix.
Working again.
Thanks.
Grant
Darwin NT
ID: 96063 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
svincent

Send message
Joined: 30 Dec 05
Posts: 219
Credit: 12,120,035
RAC: 0
Message 96237 - Posted: 7 May 2020, 15:28:46 UTC

This task failed on Ubuntu 19.10

https://boinc.bakerlab.org/rosetta/result.php?resultid=1172834155

<core_client_version>7.16.3</core_client_version>
<![CDATA[
<message>
process exited with code 1 (0x1, -255)</message>
<stderr_txt>
command: ../../projects/boinc.bakerlab.org_rosetta/rosetta_4.20_x86_64-pc-linux-gnu @rb_05_07_20646_23758_ab_t000__robetta_FLAGS -in::file::fasta t000_.fasta -jumps:pairing_file t000_.fasta.bbcontacts.jumps -jumps:random_sheets 1 -constraints::cst_file t000_.fasta.CB.cst -constraints:cst_weight 5.0 -constraints::cst_fa_file t000_.fasta.MIN.cst -constraints:cst_fa_weight 5.0 -in:file:boinc_wu_zip rb_05_07_20646_23758_ab_t000__robetta.zip -frag3 rb_05_07_20646_23758_ab_t000__robetta.200.3mers.index.gz -fragA rb_05_07_20646_23758_ab_t000__robetta.200.11mers.index.gz -fragB rb_05_07_20646_23758_ab_t000__robetta.200.5mers.index.gz -nstruct 10000 -cpu_run_time 28800 -boinc:max_nstruct 20000 -checkpoint_interval 120 -mute all -database minirosetta_database -in::file::zip minirosetta_database.zip -boinc::watchdog -boinc::cpu_run_timeout 36000 -run::rng mt19937 -constant_seed -jran 2100616
Using database: database_357d5d93529_n_methyl/minirosetta_database

[ ERROR ]: Caught exception:


File: src/core/pack/dunbrack/SingleResidueDunbrackLibrary.hh:306
chi angle must be between -180 and 180: -nan
------------------------ Begin developer's backtrace -------------------------
BACKTRACE:
[0x3ce8b7f]
[0x62b4e53]
[0x408ae82]
ID: 96237 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Profile Folding Proteins

Send message
Joined: 27 Mar 20
Posts: 2
Credit: 349,986
RAC: 0
Message 96336 - Posted: 10 May 2020, 13:52:42 UTC
Last modified: 10 May 2020, 13:53:55 UTC

Hello,

since changing the URL in the BOINC client (7.16.5, Win10) with the HTTPS prefix, my WUs are not saved correctly upon exit of the BOINC manager.
I have the "Leave non-GPU tasks in memory while suspended" option checked in computing preferences.
The issue also coincides with the Roseta 4.20 release, so I am not exactly sure whether the problem comes from the URL change or it is something from how the server handles tasks now.
I had no problems of WU being saved and resumed after cleint, machine shutdowns before.

Not sure what exactly is needed but I can attach some log files and settings if requested.
ID: 96336 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Mod.Sense
Volunteer moderator

Send message
Joined: 22 Aug 06
Posts: 4018
Credit: 0
RAC: 0
Message 96338 - Posted: 10 May 2020, 14:59:25 UTC - in response to Message 96336.  

Another recent change was to add checkpointing between completed models in one of the search protocols. When you say the WUs were not saved correctly... what are you looking at to define that? Are you familiar with the CPU time since last checkpoint shown in the task properties? When you "exit" (rather than "close") BOINC Manager, the active tasks are ended, and will revert to their last checkpoints when they restart. The setting for leaving tasks in memory only applies when BOINC is still running, but has suspended the task to run another project, or because the user requested it.
Rosetta Moderator: Mod.Sense
ID: 96338 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Profile Folding Proteins

Send message
Joined: 27 Mar 20
Posts: 2
Credit: 349,986
RAC: 0
Message 96342 - Posted: 10 May 2020, 19:58:27 UTC - in response to Message 96338.  
Last modified: 10 May 2020, 19:59:24 UTC

To explain further:
I crunch during the bigger part of the day but then have to shut down the machine overnight (for about 8 hours or so).
My routine is as follows: while running, update the Rosetta project so all work is check-pointed, then simply exit the BOINC manager (with the option to suspend all work on exit enabled), then power off my PC. The next day I just power on the machine with running on startup settings and usually the WUs just continue from where they were before shut down.
What happens now is when I power on the PC, all WUs are gone (marked as failed tasks) and new ones start from scratch.
Since the URL/Rosetta 4.20 change I have 60/40 portion in completed and failed tasks.
ID: 96342 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Profile robertmiles

Send message
Joined: 16 Jun 08
Posts: 1233
Credit: 14,338,560
RAC: 2,014
Message 96344 - Posted: 10 May 2020, 20:46:15 UTC - in response to Message 96342.  
Last modified: 10 May 2020, 20:50:41 UTC

To explain further:
I crunch during the bigger part of the day but then have to shut down the machine overnight (for about 8 hours or so).
My routine is as follows: while running, update the Rosetta project so all work is check-pointed, then simply exit the BOINC manager (with the option to suspend all work on exit enabled), then power off my PC. The next day I just power on the machine with running on startup settings and usually the WUs just continue from where they were before shut down.
What happens now is when I power on the PC, all WUs are gone (marked as failed tasks) and new ones start from scratch.
Since the URL/Rosetta 4.20 change I have 60/40 portion in completed and failed tasks.

Most computers do not have main memory that will retain its contents with the power turned off.

Updating Rosetta@home does NOT automatically checkpoint all work.

You may need to look into telling BOINC to suspend all work, then telling your computer to Sleep instead of Shut down, so it can write the entire contents of its memory to the hard drive, and then write this back into main memory when you turn the computer on again. This lets it resume any programs that were suspended rather than aborted.

I suspect that you previously had your computer set to use sleep instead of shut down, and some change since then (possibly 4.20) has turned off this setting.

It could, however, also mean that 4.20 has timing routines that cannot properly handle very long delays well, or a resume from checkpoint section that fails to work properly 40% of the time it is used.
ID: 96344 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Mod.Sense
Volunteer moderator

Send message
Joined: 22 Aug 06
Posts: 4018
Credit: 0
RAC: 0
Message 96346 - Posted: 11 May 2020, 1:27:57 UTC

As robertmiles points out, you cannot force checkpoints. And using the "sleep" function (where memory remains active and everything is preserved) or the "hibernate" function (where memory contents are purged to disk and memory is powered off) would be good ideas to maximize the work done on your machine. It should also avoid whatever this error condition is that is being encountered.

You mention that you "...exit the BOINC manager (with the option to suspend all work on exit enabled)". I am not familiar with such an option. What is the wording on the screen for this option? Are you referring to the activity option for when to run? And setting it to suspend?
Rosetta Moderator: Mod.Sense
ID: 96346 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
rholliday
Avatar

Send message
Joined: 24 Aug 07
Posts: 1
Credit: 3,534,413
RAC: 0
Message 96396 - Posted: 12 May 2020, 15:16:29 UTC
Last modified: 12 May 2020, 15:44:35 UTC

Not sure if this is the correct place for this, but I just noticed some work units starting with 8CHARACTER_UNIQUENAME instead of a project name or an actual 8 character unique identifier. Is this potentially a problem? They do appear to have unique identifiers later, albeit after the ignore flag.
8CHARACTER_UNIQUENAME_SAVE_ALL_OUT_IGNORE_THE_REST_3id7ts5f_928224_2
8CHARACTER_UNIQUENAME_SAVE_ALL_OUT_IGNORE_THE_REST_1gd4cn8l_927782_2
8CHARACTER_UNIQUENAME_SAVE_ALL_OUT_IGNORE_THE_REST_4sl0rv2n_928224_2
ID: 96396 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Sven

Send message
Joined: 7 Feb 16
Posts: 8
Credit: 222,005
RAC: 0
Message 96399 - Posted: 12 May 2020, 16:12:04 UTC

Hi all,

I just found an old outsourced computer which would work pretty well just for crunching Rosetta tasks.
Unfortunately the computation stopps after some seconds with "computation error".

Does anyone have an idea, where the problem could be? See the event log below:


12/05/2020 13:41:12 | | cc_config.xml not found - using defaults
12/05/2020 13:41:12 | | Starting BOINC client version 7.14.2 for windows_intelx86
12/05/2020 13:41:12 | | log flags: file_xfer, sched_ops, task
12/05/2020 13:41:12 | | Libraries: libcurl/7.47.1 OpenSSL/1.0.2g zlib/1.2.8
12/05/2020 13:41:12 | | Data directory: C:Programmeboinc
12/05/2020 13:41:12 | | Running under account Sven
12/05/2020 13:41:12 | | No usable GPUs found
12/05/2020 13:41:12 | | Creating new client state file
12/05/2020 13:41:12 | | Host name: viperle
12/05/2020 13:41:12 | | Processor: 4 GenuineIntel Intel(R) Core(TM)2 Quad CPU Q6600 @ 2.40GHz [Family 6 Model 15 Stepping 11]
12/05/2020 13:41:12 | | Processor features: fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush dts acpi mmx fxsr sse sse2 ss htt tm pni ssse3 cx16 nx lm vmx tm2 pbe
12/05/2020 13:41:12 | | OS: Microsoft Windows XP: Professional x86 Edition, Service Pack 3, (05.01.2600.00)
12/05/2020 13:41:12 | | Memory: 3.25 GB physical, 5.09 GB virtual
12/05/2020 13:41:12 | | Disk: 146.48 GB total, 116.06 GB free
12/05/2020 13:41:12 | | Local time is UTC +2 hours
12/05/2020 13:41:12 | | Last benchmark was 18394 days 11:41:12 ago
12/05/2020 13:41:17 | | No general preferences found - using defaults
12/05/2020 13:41:17 | | Preferences:
12/05/2020 13:41:17 | | max memory usage when active: 1663.49 MB
12/05/2020 13:41:17 | | max memory usage when idle: 2994.28 MB
12/05/2020 13:41:17 | | max disk usage: 115.97 GB
12/05/2020 13:41:17 | | don't use GPU while active
12/05/2020 13:41:17 | | suspend work if non-BOINC CPU load exceeds 25%
12/05/2020 13:41:17 | | (to change preferences, visit a project web site or select Preferences in the Manager)
12/05/2020 13:41:17 | | Setting up project and slot directories
12/05/2020 13:41:17 | | Checking active tasks
12/05/2020 13:41:17 | | Setting up GUI RPC socket
12/05/2020 13:41:17 | | Checking presence of 0 project files
12/05/2020 13:41:17 | | This computer is not attached to any projects
12/05/2020 13:41:26 | | Fetching configuration file from https://boinc.bakerlab.org/rosetta/get_project_config.php
12/05/2020 13:41:45 | | Running CPU benchmarks
12/05/2020 13:41:45 | | Suspending computation - CPU benchmarks in progress
12/05/2020 13:42:16 | | Benchmark results:
12/05/2020 13:42:16 | | Number of CPUs: 4
12/05/2020 13:42:16 | | 2341 floating point MIPS (Whetstone) per CPU
12/05/2020 13:42:16 | | 5245 integer MIPS (Dhrystone) per CPU
12/05/2020 13:42:17 | | Resuming computation
12/05/2020 13:42:19 | Rosetta@home | Master file download succeeded
12/05/2020 13:42:24 | Rosetta@home | Sending scheduler request: Project initialization.
12/05/2020 13:42:24 | Rosetta@home | Requesting new tasks for CPU
12/05/2020 13:42:26 | Rosetta@home | Scheduler request completed: got 1 new tasks
12/05/2020 13:42:26 | Rosetta@home | New computer location: home
12/05/2020 13:42:31 | | General prefs: from http://lhcathomeclassic.cern.ch/sixtrack/ (last modified 06-Mar-2015 20:46:54)
12/05/2020 13:42:31 | | Host location: none
12/05/2020 13:42:31 | | General prefs: using your defaults
12/05/2020 13:42:31 | | Preferences:
12/05/2020 13:42:31 | | max memory usage when active: 1663.49 MB
12/05/2020 13:42:31 | | max memory usage when idle: 2994.28 MB
12/05/2020 13:42:31 | | max disk usage: 73.24 GB
12/05/2020 13:42:31 | | Number of usable CPUs has changed from 4 to 2.
12/05/2020 13:42:31 | | max CPUs used: 2
12/05/2020 13:42:31 | | don't use GPU while active
12/05/2020 13:42:31 | | suspend work if non-BOINC CPU load exceeds 25%
12/05/2020 13:42:31 | | (to change preferences, visit a project web site or select Preferences in the Manager)
12/05/2020 13:42:32 | Rosetta@home | Started download of rosetta_4.21_windows_intelx86.exe
12/05/2020 13:42:32 | Rosetta@home | Started download of rosetta_graphics_4.21_windows_intelx86.exe
12/05/2020 13:43:01 | Rosetta@home | Sending scheduler request: To fetch work.
12/05/2020 13:43:01 | Rosetta@home | Requesting new tasks for CPU
12/05/2020 13:43:03 | Rosetta@home | Scheduler request completed: got 1 new tasks
12/05/2020 13:44:14 | | General prefs: from http://lhcathomeclassic.cern.ch/sixtrack/ (last modified 06-Mar-2015 20:46:54)
12/05/2020 13:44:14 | | Host location: none
12/05/2020 13:44:14 | | General prefs: using your defaults
12/05/2020 13:44:14 | | Reading preferences override file
12/05/2020 13:44:14 | | Preferences:
12/05/2020 13:44:14 | | max memory usage when active: 1663.49 MB
12/05/2020 13:44:14 | | max memory usage when idle: 2994.28 MB
12/05/2020 13:44:14 | | max disk usage: 73.24 GB
12/05/2020 13:44:14 | | max CPUs used: 2
12/05/2020 13:44:14 | | suspend work if non-BOINC CPU load exceeds 25%
12/05/2020 13:44:14 | | (to change preferences, visit a project web site or select Preferences in the Manager)
12/05/2020 13:45:23 | Rosetta@home | Finished download of rosetta_graphics_4.21_windows_intelx86.exe
12/05/2020 13:45:23 | Rosetta@home | Started download of database_357d5d93529_n_methyl.zip
12/05/2020 13:45:29 | Rosetta@home | Finished download of rosetta_4.21_windows_intelx86.exe
12/05/2020 13:45:29 | Rosetta@home | Started download of LiberationSans-Regular.ttf
12/05/2020 13:45:31 | Rosetta@home | Finished download of LiberationSans-Regular.ttf
12/05/2020 13:45:31 | Rosetta@home | Started download of flags_covid_groove2
12/05/2020 13:45:32 | Rosetta@home | Finished download of flags_covid_groove2
12/05/2020 13:45:32 | Rosetta@home | Started download of Mini_Protein_binds_COVID-19_groove_design1_9_SAVE_ALL_OUT_IGNORE_THE_REST_6gv4fh9s.zip
12/05/2020 13:45:38 | Rosetta@home | Finished download of Mini_Protein_binds_COVID-19_groove_design1_9_SAVE_ALL_OUT_IGNORE_THE_REST_6gv4fh9s.zip
12/05/2020 13:45:38 | Rosetta@home | Started download of Mini_Protein_binds_COVID-19_groove_design1_9_SAVE_ALL_OUT_IGNORE_THE_REST_6gv4fh9s.flags
12/05/2020 13:45:39 | Rosetta@home | Finished download of Mini_Protein_binds_COVID-19_groove_design1_9_SAVE_ALL_OUT_IGNORE_THE_REST_6gv4fh9s.flags
12/05/2020 13:45:39 | Rosetta@home | Started download of r3x_0904_data.zip
12/05/2020 13:45:40 | Rosetta@home | Finished download of r3x_0904_data.zip
12/05/2020 14:07:32 | Rosetta@home | Finished download of database_357d5d93529_n_methyl.zip
12/05/2020 14:15:53 | Rosetta@home | Starting task Mini_Protein_binds_COVID-19_groove_design1_9_SAVE_ALL_OUT_IGNORE_THE_REST_6gv4fh9s_927641_8_0
12/05/2020 14:16:08 | Rosetta@home | Computation for task Mini_Protein_binds_COVID-19_groove_design1_9_SAVE_ALL_OUT_IGNORE_THE_REST_6gv4fh9s_927641_8_0 finished
12/05/2020 14:16:08 | Rosetta@home | Output file Mini_Protein_binds_COVID-19_groove_design1_9_SAVE_ALL_OUT_IGNORE_THE_REST_6gv4fh9s_927641_8_0_r924213539_0 for task Mini_Protein_binds_COVID-19_groove_design1_9_SAVE_ALL_OUT_IGNORE_THE_REST_6gv4fh9s_927641_8_0 absent
12/05/2020 14:16:12 | Rosetta@home | Starting task r3x_0904_fold_SAVE_ALL_OUT_920537_1458_0
12/05/2020 14:16:16 | Rosetta@home | Computation for task r3x_0904_fold_SAVE_ALL_OUT_920537_1458_0 finished
12/05/2020 14:16:16 | Rosetta@home | Output file r3x_0904_fold_SAVE_ALL_OUT_920537_1458_0_r1563406117_0 for task r3x_0904_fold_SAVE_ALL_OUT_920537_1458_0 absent
12/05/2020 14:16:34 | Rosetta@home | update requested by user
12/05/2020 14:16:39 | Rosetta@home | Sending scheduler request: Requested by user.
12/05/2020 14:16:39 | Rosetta@home | Reporting 2 completed tasks
12/05/2020 14:16:39 | Rosetta@home | Requesting new tasks for CPU
12/05/2020 14:16:42 | Rosetta@home | Scheduler request completed: got 2 new tasks
12/05/2020 14:16:44 | Rosetta@home | Started download of flags_dummy
12/05/2020 14:16:44 | Rosetta@home | Started download of 8CHARACTER_UNIQUENAME_SAVE_ALL_OUT_IGNORE_THE_REST_4qw8of3n.zip
12/05/2020 14:16:46 | Rosetta@home | Finished download of flags_dummy
12/05/2020 14:16:46 | Rosetta@home | Started download of 8CHARACTER_UNIQUENAME_SAVE_ALL_OUT_IGNORE_THE_REST_4qw8of3n.flags
12/05/2020 14:16:47 | Rosetta@home | Finished download of 8CHARACTER_UNIQUENAME_SAVE_ALL_OUT_IGNORE_THE_REST_4qw8of3n.flags
12/05/2020 14:16:47 | Rosetta@home | Started download of fp200511_C2_pair139_X_fnr_9_193_fragments_fold_data.zip
12/05/2020 14:16:55 | Rosetta@home | Finished download of 8CHARACTER_UNIQUENAME_SAVE_ALL_OUT_IGNORE_THE_REST_4qw8of3n.zip
12/05/2020 14:17:06 | Rosetta@home | Finished download of fp200511_C2_pair139_X_fnr_9_193_fragments_fold_data.zip
12/05/2020 14:22:42 | Rosetta@home | Starting task fp200511_C2_pair139_X_fnr_9_193_fragments_abinitio_SAVE_ALL_OUT_927867_682_0
12/05/2020 14:22:50 | Rosetta@home | Computation for task fp200511_C2_pair139_X_fnr_9_193_fragments_abinitio_SAVE_ALL_OUT_927867_682_0 finished
12/05/2020 14:22:50 | Rosetta@home | Output file fp200511_C2_pair139_X_fnr_9_193_fragments_abinitio_SAVE_ALL_OUT_927867_682_0_r1926040998_0 for task fp200511_C2_pair139_X_fnr_9_193_fragments_abinitio_SAVE_ALL_OUT_927867_682_0 absent
12/05/2020 14:22:53 | Rosetta@home | Starting task 8CHARACTER_UNIQUENAME_SAVE_ALL_OUT_IGNORE_THE_REST_4qw8of3n_928224_2_0
12/05/2020 14:23:00 | Rosetta@home | Computation for task 8CHARACTER_UNIQUENAME_SAVE_ALL_OUT_IGNORE_THE_REST_4qw8of3n_928224_2_0 finished
12/05/2020 14:23:00 | Rosetta@home | Output file 8CHARACTER_UNIQUENAME_SAVE_ALL_OUT_IGNORE_THE_REST_4qw8of3n_928224_2_0_r336126669_0 for task 8CHARACTER_UNIQUENAME_SAVE_ALL_OUT_IGNORE_THE_REST_4qw8of3n_928224_2_0 absent
12/05/2020 14:25:24 | Rosetta@home | Sending scheduler request: To fetch work.
12/05/2020 14:25:24 | Rosetta@home | Reporting 2 completed tasks
12/05/2020 14:25:24 | Rosetta@home | Requesting new tasks for CPU
12/05/2020 14:25:26 | Rosetta@home | Scheduler request completed: got 2 new tasks
12/05/2020 14:25:28 | Rosetta@home | Started download of new_3cl_10aa_6lu7_modified_AVLstub_relaxed_renumbered_0674_33_extract_B.zip
12/05/2020 14:25:28 | Rosetta@home | Started download of new_3cl_10aa_6lu7_modified_AVLstub_relaxed_renumbered_0674_33_extract_B.flags
12/05/2020 14:25:30 | Rosetta@home | Finished download of new_3cl_10aa_6lu7_modified_AVLstub_relaxed_renumbered_0674_33_extract_B.zip
12/05/2020 14:25:30 | Rosetta@home | Finished download of new_3cl_10aa_6lu7_modified_AVLstub_relaxed_renumbered_0674_33_extract_B.flags
12/05/2020 14:25:30 | Rosetta@home | Started download of rb_05_11_24048_24276_ab_t000__h002_robetta_FLAGS
12/05/2020 14:25:30 | Rosetta@home | Started download of rb_05_11_24048_24276_ab_t000__h002_robetta.zip
12/05/2020 14:25:33 | Rosetta@home | Starting task new_3cl_10aa_6lu7_modified_AVLstub_relaxed_renumbered_0674_33_extract_B_SAVE_ALL_OUT_928500_1_1
12/05/2020 14:25:34 | Rosetta@home | Finished download of rb_05_11_24048_24276_ab_t000__h002_robetta_FLAGS
12/05/2020 14:25:34 | Rosetta@home | Finished download of rb_05_11_24048_24276_ab_t000__h002_robetta.zip
12/05/2020 14:25:34 | Rosetta@home | Started download of rb_05_11_24048_24276_ab_t000__h002_robetta.200.3mers.index.gz
12/05/2020 14:25:34 | Rosetta@home | Started download of rb_05_11_24048_24276_ab_t000__h002_robetta.200.5mers.index.gz
12/05/2020 14:25:36 | Rosetta@home | Finished download of rb_05_11_24048_24276_ab_t000__h002_robetta.200.3mers.index.gz
12/05/2020 14:25:36 | Rosetta@home | Finished download of rb_05_11_24048_24276_ab_t000__h002_robetta.200.5mers.index.gz
12/05/2020 14:25:39 | Rosetta@home | Starting task rb_05_11_24048_24276_ab_t000__h002_robetta_IGNORE_THE_REST_03_05_927766_1_1
12/05/2020 14:25:49 | Rosetta@home | Computation for task rb_05_11_24048_24276_ab_t000__h002_robetta_IGNORE_THE_REST_03_05_927766_1_1 finished
12/05/2020 14:25:49 | Rosetta@home | Output file rb_05_11_24048_24276_ab_t000__h002_robetta_IGNORE_THE_REST_03_05_927766_1_1_r858648371_0 for task rb_05_11_24048_24276_ab_t000__h002_robetta_IGNORE_THE_REST_03_05_927766_1_1 absent
12/05/2020 14:27:49 | Rosetta@home | Sending scheduler request: To fetch work.
12/05/2020 14:27:49 | Rosetta@home | Reporting 1 completed tasks
12/05/2020 14:27:49 | Rosetta@home | Requesting new tasks for CPU
12/05/2020 14:27:51 | Rosetta@home | Scheduler request completed: got 1 new tasks
12/05/2020 14:27:53 | Rosetta@home | Started download of flags_covid_groove2
12/05/2020 14:27:53 | Rosetta@home | Started download of Mini_Protein_binds_COVID-19_groove_design1_3_SAVE_ALL_OUT_IGNORE_THE_REST_8dv5ad8c.zip
12/05/2020 14:27:55 | Rosetta@home | Finished download of flags_covid_groove2
12/05/2020 14:27:55 | Rosetta@home | Started download of Mini_Protein_binds_COVID-19_groove_design1_3_SAVE_ALL_OUT_IGNORE_THE_REST_8dv5ad8c.flags
12/05/2020 14:27:56 | Rosetta@home | Finished download of Mini_Protein_binds_COVID-19_groove_design1_3_SAVE_ALL_OUT_IGNORE_THE_REST_8dv5ad8c.flags
12/05/2020 14:28:00 | Rosetta@home | Finished download of Mini_Protein_binds_COVID-19_groove_design1_3_SAVE_ALL_OUT_IGNORE_THE_REST_8dv5ad8c.zip
12/05/2020 14:28:48 | Rosetta@home | Computation for task new_3cl_10aa_6lu7_modified_AVLstub_relaxed_renumbered_0674_33_extract_B_SAVE_ALL_OUT_928500_1_1 finished
12/05/2020 14:28:48 | Rosetta@home | Output file new_3cl_10aa_6lu7_modified_AVLstub_relaxed_renumbered_0674_33_extract_B_SAVE_ALL_OUT_928500_1_1_r1150183738_0 for task new_3cl_10aa_6lu7_modified_AVLstub_relaxed_renumbered_0674_33_extract_B_SAVE_ALL_OUT_928500_1_1 absent
12/05/2020 14:28:51 | Rosetta@home | Starting task Mini_Protein_binds_COVID-19_groove_design1_3_SAVE_ALL_OUT_IGNORE_THE_REST_8dv5ad8c_927633_8_0
12/05/2020 14:28:53 | Rosetta@home | Computation for task Mini_Protein_binds_COVID-19_groove_design1_3_SAVE_ALL_OUT_IGNORE_THE_REST_8dv5ad8c_927633_8_0 finished
12/05/2020 14:28:53 | Rosetta@home | Output file Mini_Protein_binds_COVID-19_groove_design1_3_SAVE_ALL_OUT_IGNORE_THE_REST_8dv5ad8c_927633_8_0_r1359986480_0 for task Mini_Protein_binds_COVID-19_groove_design1_3_SAVE_ALL_OUT_IGNORE_THE_REST_8dv5ad8c_927633_8_0 absent
12/05/2020 14:31:28 | Rosetta@home | Sending scheduler request: To fetch work.
12/05/2020 14:31:28 | Rosetta@home | Reporting 2 completed tasks
12/05/2020 14:31:28 | Rosetta@home | Requesting new tasks for CPU
12/05/2020 14:31:31 | Rosetta@home | Scheduler request completed: got 2 new tasks
12/05/2020 14:31:33 | Rosetta@home | Started download of flags_dummy
12/05/2020 14:31:33 | Rosetta@home | Started download of 8CHARACTER_UNIQUENAME_SAVE_ALL_OUT_IGNORE_THE_REST_3gh9pd8e.zip
12/05/2020 14:31:35 | Rosetta@home | Finished download of flags_dummy
12/05/2020 14:31:35 | Rosetta@home | Started download of 8CHARACTER_UNIQUENAME_SAVE_ALL_OUT_IGNORE_THE_REST_3gh9pd8e.flags
12/05/2020 14:31:36 | Rosetta@home | Finished download of 8CHARACTER_UNIQUENAME_SAVE_ALL_OUT_IGNORE_THE_REST_3gh9pd8e.flags
12/05/2020 14:31:36 | Rosetta@home | Started download of r3x_3679_data.zip
12/05/2020 14:31:37 | Rosetta@home | Finished download of r3x_3679_data.zip
12/05/2020 14:31:40 | Rosetta@home | Starting task r3x_3679_fold_SAVE_ALL_OUT_919997_1459_0
12/05/2020 14:31:48 | Rosetta@home | Finished download of 8CHARACTER_UNIQUENAME_SAVE_ALL_OUT_IGNORE_THE_REST_3gh9pd8e.zip
12/05/2020 14:31:50 | Rosetta@home | Starting task 8CHARACTER_UNIQUENAME_SAVE_ALL_OUT_IGNORE_THE_REST_3gh9pd8e_927911_2_0
12/05/2020 14:31:51 | Rosetta@home | Computation for task r3x_3679_fold_SAVE_ALL_OUT_919997_1459_0 finished
12/05/2020 14:31:51 | Rosetta@home | Output file r3x_3679_fold_SAVE_ALL_OUT_919997_1459_0_r460780872_0 for task r3x_3679_fold_SAVE_ALL_OUT_919997_1459_0 absent
12/05/2020 14:31:59 | Rosetta@home | Computation for task 8CHARACTER_UNIQUENAME_SAVE_ALL_OUT_IGNORE_THE_REST_3gh9pd8e_927911_2_0 finished
12/05/2020 14:31:59 | Rosetta@home | Output file 8CHARACTER_UNIQUENAME_SAVE_ALL_OUT_IGNORE_THE_REST_3gh9pd8e_927911_2_0_r529209938_0 for task 8CHARACTER_UNIQUENAME_SAVE_ALL_OUT_IGNORE_THE_REST_3gh9pd8e_927911_2_0 absent
12/05/2020 14:35:52 | Rosetta@home | Sending scheduler request: To fetch work.
12/05/2020 14:35:52 | Rosetta@home | Reporting 2 completed tasks
12/05/2020 14:35:52 | Rosetta@home | Requesting new tasks for CPU
12/05/2020 14:35:55 | Rosetta@home | Scheduler request completed: got 2 new tasks
12/05/2020 14:35:57 | Rosetta@home | Started download of flags_covid_groove2
12/05/2020 14:35:57 | Rosetta@home | Started download of Mini_Protein_binds_COVID-19_groove_design1_8_SAVE_ALL_OUT_IGNORE_THE_REST_4tx4gb5k.zip
12/05/2020 14:35:59 | Rosetta@home | Finished download of flags_covid_groove2
12/05/2020 14:35:59 | Rosetta@home | Started download of Mini_Protein_binds_COVID-19_groove_design1_8_SAVE_ALL_OUT_IGNORE_THE_REST_4tx4gb5k.flags
12/05/2020 14:36:00 | Rosetta@home | Finished download of Mini_Protein_binds_COVID-19_groove_design1_8_SAVE_ALL_OUT_IGNORE_THE_REST_4tx4gb5k.flags
12/05/2020 14:36:00 | Rosetta@home | Started download of fp200511_allo2_pair125_X_fnr_8_042_fragments_fold_data.zip
12/05/2020 14:36:04 | Rosetta@home | Finished download of Mini_Protein_binds_COVID-19_groove_design1_8_SAVE_ALL_OUT_IGNORE_THE_REST_4tx4gb5k.zip
12/05/2020 14:36:07 | Rosetta@home | Starting task Mini_Protein_binds_COVID-19_groove_design1_8_SAVE_ALL_OUT_IGNORE_THE_REST_4tx4gb5k_927640_8_0
12/05/2020 14:36:11 | Rosetta@home | Computation for task Mini_Protein_binds_COVID-19_groove_design1_8_SAVE_ALL_OUT_IGNORE_THE_REST_4tx4gb5k_927640_8_0 finished
12/05/2020 14:36:11 | Rosetta@home | Output file Mini_Protein_binds_COVID-19_groove_design1_8_SAVE_ALL_OUT_IGNORE_THE_REST_4tx4gb5k_927640_8_0_r1031802064_0 for task Mini_Protein_binds_COVID-19_groove_design1_8_SAVE_ALL_OUT_IGNORE_THE_REST_4tx4gb5k_927640_8_0 absent
12/05/2020 14:36:24 | Rosetta@home | Finished download of fp200511_allo2_pair125_X_fnr_8_042_fragments_fold_data.zip
12/05/2020 14:36:27 | Rosetta@home | Starting task fp200511_allo2_pair125_X_fnr_8_042_fragments_abinitio_SAVE_ALL_OUT_927809_693_0
12/05/2020 14:36:34 | Rosetta@home | Computation for task fp200511_allo2_pair125_X_fnr_8_042_fragments_abinitio_SAVE_ALL_OUT_927809_693_0 finished
12/05/2020 14:36:34 | Rosetta@home | Output file fp200511_allo2_pair125_X_fnr_8_042_fragments_abinitio_SAVE_ALL_OUT_927809_693_0_r585054160_0 for task fp200511_allo2_pair125_X_fnr_8_042_fragments_abinitio_SAVE_ALL_OUT_927809_693_0 absent
12/05/2020 14:36:52 | Rosetta@home | work fetch suspended by user
12/05/2020 14:38:53 | Rosetta@home | Sending scheduler request: To report completed tasks.
12/05/2020 14:38:53 | Rosetta@home | Reporting 2 completed tasks
12/05/2020 14:38:53 | Rosetta@home | Not requesting tasks: "no new tasks" requested via Manager
12/05/2020 14:38:55 | Rosetta@home | Scheduler request completed
12/05/2020 18:04:27 | | Suspending computation - CPU is busy
12/05/2020 18:05:37 | | Resuming computation
12/05/2020 18:06:47 | Rosetta@home | work fetch resumed by user
12/05/2020 18:06:50 | Rosetta@home | Sending scheduler request: To fetch work.
12/05/2020 18:06:50 | Rosetta@home | Requesting new tasks for CPU
12/05/2020 18:06:53 | Rosetta@home | Scheduler request completed: got 2 new tasks
12/05/2020 18:06:55 | Rosetta@home | Started download of flags
12/05/2020 18:06:55 | Rosetta@home | Started download of Junior_HalfRoid_design5_COVID-19_SAVE_ALL_OUT_IGNORE_THE_REST_4vi4qx8m.zip
12/05/2020 18:06:57 | Rosetta@home | Finished download of flags
12/05/2020 18:06:57 | Rosetta@home | Started download of Junior_HalfRoid_design5_COVID-19_SAVE_ALL_OUT_IGNORE_THE_REST_4vi4qx8m.flags
12/05/2020 18:06:58 | Rosetta@home | Finished download of Junior_HalfRoid_design5_COVID-19_SAVE_ALL_OUT_IGNORE_THE_REST_4vi4qx8m.zip
12/05/2020 18:06:58 | Rosetta@home | Finished download of Junior_HalfRoid_design5_COVID-19_SAVE_ALL_OUT_IGNORE_THE_REST_4vi4qx8m.flags
12/05/2020 18:06:58 | Rosetta@home | Started download of rb_05_12_24836_24417_ab_t000__robetta_FLAGS
12/05/2020 18:06:58 | Rosetta@home | Started download of rb_05_12_24836_24417_ab_t000__robetta.zip
12/05/2020 18:07:00 | Rosetta@home | Finished download of rb_05_12_24836_24417_ab_t000__robetta_FLAGS
12/05/2020 18:07:00 | Rosetta@home | Finished download of rb_05_12_24836_24417_ab_t000__robetta.zip
12/05/2020 18:07:00 | Rosetta@home | Started download of rb_05_12_24836_24417_ab_t000__robetta.200.3mers.index.gz
12/05/2020 18:07:00 | Rosetta@home | Started download of rb_05_12_24836_24417_ab_t000__robetta.200.10mers.index.gz
12/05/2020 18:07:01 | Rosetta@home | Finished download of rb_05_12_24836_24417_ab_t000__robetta.200.3mers.index.gz
12/05/2020 18:07:01 | Rosetta@home | Finished download of rb_05_12_24836_24417_ab_t000__robetta.200.10mers.index.gz
12/05/2020 18:07:01 | Rosetta@home | Started download of rb_05_12_24836_24417_ab_t000__robetta.200.6mers.index.gz
12/05/2020 18:07:02 | Rosetta@home | Finished download of rb_05_12_24836_24417_ab_t000__robetta.200.6mers.index.gz
12/05/2020 18:07:05 | Rosetta@home | Starting task rb_05_12_24836_24417_ab_t000__robetta_cstwt_5.0_FT_IGNORE_THE_REST_06_10_928596_23_0
12/05/2020 18:07:17 | Rosetta@home | Computation for task rb_05_12_24836_24417_ab_t000__robetta_cstwt_5.0_FT_IGNORE_THE_REST_06_10_928596_23_0 finished
12/05/2020 18:07:17 | Rosetta@home | Output file rb_05_12_24836_24417_ab_t000__robetta_cstwt_5.0_FT_IGNORE_THE_REST_06_10_928596_23_0_r1393946342_0 for task rb_05_12_24836_24417_ab_t000__robetta_cstwt_5.0_FT_IGNORE_THE_REST_06_10_928596_23_0 absent
ID: 96399 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Profile robertmiles

Send message
Joined: 16 Jun 08
Posts: 1233
Credit: 14,338,560
RAC: 2,014
Message 96416 - Posted: 13 May 2020, 0:21:13 UTC - in response to Message 96399.  

Hi all,

I just found an old outsourced computer which would work pretty well just for crunching Rosetta tasks.
Unfortunately the computation stopps after some seconds with "computation error".

Does anyone have an idea, where the problem could be? See the event log below:

[snip]

You're looking at the wrong log file to see much about THIS problem.

To see the one with more information for this problem:

If you're using the simple view of the BOINC Manager, find View near the top line, click on it, then click on Advanced View.

Click on Projects in one of the top lines, then Rosetta@home, then Your tasks.

In the Status column, find one of the failed tasks. Ignore those shown as In progress - they don't have the other log file yet. Ignore those shown as Completed and validated for now unless you want to see one without errors for comparison purposes.

When you find one, click on the number in this line, but in the Task column. This gives the log file specific to that task.

Scroll down as needed. In this case, look at the line starting with unzip, which shows the error triggering all other errors for this task,

I've looked at this file for some of your failed tasks, and have thought of three possibilities:

1. The task was built improperly, and the list of files it needs left out one or two of the zip files. It may have assumed, incorrectly, that some previous task had downloaded it or them.

2. Your antivirus program hid one or both of those files. You might check the log file of your antivirus program, if it has one.

3. These tasks used version v4.21 of the Rosetta application, which I have not seen mentioned before. That version might have the wrong builtin names for files to send to unzip.

For 1, about all you can do is wait for more tasks that fix this problem.

For 2, you might have to tell your antivirus program not to scan the directories for BOINC.

For 3, you might watch the forums for mentions of that version, to see if similar problems are reported by others.

On another subject, I also looked at the specs for that computer, which runs Windows XP. You might look for threads on whether the current versions of Rosetta are compatible with Windows XP.

The current tasks take up to 2 GB of memory each and sometimes more, so you may have problems with the tasks running out of memory once you can get them to run longer, if you allow one task each for the 4 virtual CPU cores on that computer. You may have to limit it to only one task at a time. That computer has only 4 GB of main memory, and some of it is reserved for the Windows operating system.
ID: 96416 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Profile robertmiles

Send message
Joined: 16 Jun 08
Posts: 1233
Credit: 14,338,560
RAC: 2,014
Message 96417 - Posted: 13 May 2020, 0:28:05 UTC - in response to Message 96396.  

Not sure if this is the correct place for this, but I just noticed some work units starting with 8CHARACTER_UNIQUENAME instead of a project name or an actual 8 character unique identifier. Is this potentially a problem? They do appear to have unique identifiers later, albeit after the ignore flag.
8CHARACTER_UNIQUENAME_SAVE_ALL_OUT_IGNORE_THE_REST_3id7ts5f_928224_2
8CHARACTER_UNIQUENAME_SAVE_ALL_OUT_IGNORE_THE_REST_1gd4cn8l_927782_2
8CHARACTER_UNIQUENAME_SAVE_ALL_OUT_IGNORE_THE_REST_4sl0rv2n_928224_2

Probably not a problem as long as only ONE batch has names starting that way. Likely to cause confusion in the server database if there is more than one such batch, and therefore there are often two or more workunits with the same full name
ID: 96417 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
MarkJ

Send message
Joined: 28 Mar 20
Posts: 72
Credit: 25,238,680
RAC: 0
Message 96426 - Posted: 13 May 2020, 6:15:09 UTC - in response to Message 96417.  

8CHARACTER_UNIQUENAME_SAVE_ALL_OUT_IGNORE_THE_REST_3id7ts5f_928224_2
8CHARACTER_UNIQUENAME_SAVE_ALL_OUT_IGNORE_THE_REST_1gd4cn8l_927782_2
8CHARACTER_UNIQUENAME_SAVE_ALL_OUT_IGNORE_THE_REST_4sl0rv2n_928224_2

Probably not a problem as long as only ONE batch has names starting that way. Likely to cause confusion in the server database if there is more than one such batch, and therefore there are often two or more workunits with the same full name

There are a bunch of "yournamehere" ones as well. Probably the person submitting them didn't know what to do when filling out the job template.
BOINC blog
ID: 96426 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Profile Grant (SSSF)

Send message
Joined: 28 Mar 20
Posts: 1725
Credit: 18,378,164
RAC: 20,578
Message 96427 - Posted: 13 May 2020, 6:15:37 UTC - in response to Message 96399.  

Does anyone have an idea, where the problem could be? See the event log below
Some people are able to run using WInXP without problems, others are unable to get things to run on XP.
I'd suggest using a more recent OS.
I would also limit the number of cores you use to process work with- even with all the RAM available using a 64bit OS, you will still not have enough RAM to run some Tasks while using all cores- you generally need to allow 1.3GB RAM per Task to avoid out of memory issues.
Grant
Darwin NT
ID: 96427 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Rob R

Send message
Joined: 20 May 14
Posts: 2
Credit: 2,785,677
RAC: 1,593
Message 96431 - Posted: 13 May 2020, 6:51:20 UTC

In the last couple days I've noticed a number of failed tasks, they all start with 3cl in the name, here is an example.
https://boinc.bakerlab.org/rosetta/workunit.php?wuid=1057932888
The tasks run for about 8-15 seconds then return compute error. System is a Ryzen 5 3600, 16gb ram.
Hope this helps with troubleshooting!
ID: 96431 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Previous · 1 . . . 47 · 48 · 49 · 50 · 51 · 52 · 53 . . . 309 · Next

Message boards : Number crunching : Problems and Technical Issues with Rosetta@home



©2024 University of Washington
https://www.bakerlab.org