Very long wus

Message boards : Number crunching : Very long wus

To post messages, you must log in.

AuthorMessage
Profile [VENETO] boboviz

Send message
Joined: 1 Dec 05
Posts: 1994
Credit: 9,623,704
RAC: 7,594
Message 96175 - Posted: 6 May 2020, 19:58:06 UTC

All my "Junior_HalfRoid_design5_COVID-19_" are very long.
My default run time is (temporarily, i need to empty the queue to change the url) 2hs, and all these are over 12hs (for only 1 decoy).
Complex simulations!! :-O
ID: 96175 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Mod.Sense
Volunteer moderator

Send message
Joined: 22 Aug 06
Posts: 4018
Credit: 0
RAC: 0
Message 96186 - Posted: 6 May 2020, 23:28:50 UTC - in response to Message 96175.  
Last modified: 6 May 2020, 23:29:56 UTC

How well are they checkpointing for you? Can you point to some of the results files?
Rosetta Moderator: Mod.Sense
ID: 96186 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Profile [VENETO] boboviz

Send message
Joined: 1 Dec 05
Posts: 1994
Credit: 9,623,704
RAC: 7,594
Message 96205 - Posted: 7 May 2020, 8:33:37 UTC - in response to Message 96186.  

How well are they checkpointing for you?[/url]
I don't reboot the pc

[quote]Can you point to some of the results files?

Sure.
1171231755
1171231757
1171231759
1171261792
etc..
ID: 96205 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Mod.Sense
Volunteer moderator

Send message
Joined: 22 Aug 06
Posts: 4018
Credit: 0
RAC: 0
Message 96229 - Posted: 7 May 2020, 14:00:05 UTC
Last modified: 7 May 2020, 14:06:02 UTC

They've been cleaning up completed tasks very quickly, three were already gone. But the first one you linked shows this output, I just wanted to get it into the thread before the WU is gone.

It seems that a 2hr runtime plus a 10hr watchdog was not enough to complete the first model.

<core_client_version>7.16.5</core_client_version>
<![CDATA[
<stderr_txt>
command: projects/boinc.bakerlab.org_rosetta/rosetta_4.20_windows_x86_64.exe -run:protocol jd2_scripting -parser:protocol jhr_boinc_v4.xml @flags -in:file:silent Junior_HalfRoid_design5_COVID-19_SAVE_ALL_OUT_IGNORE_THE_REST_7ri9uc0b.silent -in:file:silent_struct_type binary -silent_gz -mute all -out:file:silent_struct_type binary -out:file:silent default.out -in:file:boinc_wu_zip Junior_HalfRoid_design5_COVID-19_SAVE_ALL_OUT_IGNORE_THE_REST_7ri9uc0b.zip @Junior_HalfRoid_design5_COVID-19_SAVE_ALL_OUT_IGNORE_THE_REST_7ri9uc0b.flags -nstruct 10000 -cpu_run_time 28800 -boinc:max_nstruct 20000 -checkpoint_interval 120 -database minirosetta_database -in::file::zip minirosetta_database.zip -boinc::watchdog -boinc::cpu_run_timeout 36000 -run::rng mt19937 -constant_seed -jran 2472631
Using database: database_357d5d93529_n_methylminirosetta_database
BOINC:: CPU time: 43253.1s, 36000s + 7200s[2020- 5- 6 19:53:56:] :: BOINC 
WARNING! cannot get file size for default.out.gz: could not open file.
Output exists: default.out.gz Size: -1
InternalDecoyCount: 0 (GZ)
-----
0
-----
Stream information inconsistent.
Writing W_0000001
======================================================
DONE ::     1 starting structures  43253.1 cpu seconds
This process generated      1 decoys from       1 attempts
======================================================
19:53:56 (3924): called boinc_finish(0)

</stderr_txt>
]]>

Rosetta Moderator: Mod.Sense
ID: 96229 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote

Message boards : Number crunching : Very long wus



©2024 University of Washington
https://www.bakerlab.org