Mini Rosetta 3.45

Message boards : Number crunching : Mini Rosetta 3.45

To post messages, you must log in.

Previous · 1 · 2 · 3 · 4 · Next

AuthorMessage
Mad_Max

Send message
Joined: 31 Dec 09
Posts: 207
Credit: 23,317,169
RAC: 11,448
Message 74522 - Posted: 23 Nov 2012, 1:31:22 UTC
Last modified: 23 Nov 2012, 1:35:19 UTC

Last error due to the double slash in the path to file? :)
minirosetta_database//sampling/filtered.vall.dat.2006-05-05

Oh, no - in addition to double slash it try to use file that was removed from current database revision(minirosetta_database_rev52077.zip).
It was in previos revision (minirosetta_database_rev52076.zip) but was removed with 3.43 version of minirosetta.
So all WUs from this series will fail.
ID: 74522 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
P . P . L .

Send message
Joined: 20 Aug 06
Posts: 581
Credit: 4,865,274
RAC: 0
Message 74523 - Posted: 23 Nov 2012, 6:54:12 UTC

Another two failed quickly with the same problem.

https://boinc.bakerlab.org/rosetta/workunit.php?wuid=495892637

hyb_am_bench_4G9Q_SAVE_ALL_OUT_IGNORE_THE_REST_65001_17_1

------------------------------------

https://boinc.bakerlab.org/rosetta/workunit.php?wuid=495890875

hyb_am_bench_4F8X_SAVE_ALL_OUT_IGNORE_THE_REST_64927_41_1

ERROR: can't open file: minirosetta_database//sampling/filtered.vall.dat.2006-05-05
ERROR:: Exit from: src/core/fragment/picking_old/vall/vall_io.cc line: 63
BOINC:: Error reading and gzipping output datafile: default.out
called boinc_finish
# cpu_run_time_pref: 21600

</stderr_txt>
]]>

ID: 74523 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
P . P . L .

Send message
Joined: 20 Aug 06
Posts: 581
Credit: 4,865,274
RAC: 0
Message 74550 - Posted: 24 Nov 2012, 20:52:14 UTC

Theses two failed quick ~ 5 sec.

https://boinc.bakerlab.org/rosetta/workunit.php?wuid=496220220

R2x2_UM21_K18_DE05_relax_SAVE_ALL_OUT_65314_137_0

------------------------------------------------------------

https://boinc.bakerlab.org/rosetta/workunit.php?wuid=496218708

R2x2_UM21_K18_DE05_abinitio_SAVE_ALL_OUT_65314_16_0

======================================

ERROR: unrecognized aa LG1
ERROR:: Exit from: src/core/io/pdb/file_data.cc line: 1238
BOINC:: Error reading and gzipping output datafile: default.out
called boinc_finish

</stderr_txt>
]]>

ID: 74550 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Profile Greg_BE
Avatar

Send message
Joined: 30 May 06
Posts: 5662
Credit: 5,700,157
RAC: 2,063
Message 74553 - Posted: 24 Nov 2012, 21:51:10 UTC

Ploop4 tasks are defective as well
<message>
couldn't start CreateProcess() failed - Access is denied. (0x5): -148
</message>

Total rubbish tasks..should not have been sent out in the first place.
Test on Ralph first before putting them here...that's what Raph is for isn't it?
Beta testing.....
ID: 74553 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Profile Greg_BE
Avatar

Send message
Joined: 30 May 06
Posts: 5662
Credit: 5,700,157
RAC: 2,063
Message 74571 - Posted: 25 Nov 2012, 22:44:39 UTC

Sorry about comments for Ploop4. That seems to be ok.
Found out in another thread what was going on.

On a different note, why are LP5 tasks ignoring the max run time of 6hrs and going for 8?
ID: 74571 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Profile Greg_BE
Avatar

Send message
Joined: 30 May 06
Posts: 5662
Credit: 5,700,157
RAC: 2,063
Message 74574 - Posted: 25 Nov 2012, 22:52:20 UTC - in response to Message 74571.  

Sorry about comments for Ploop4. That seems to be ok.
Found out in another thread what was going on.

On a different note, why are LP5 tasks ignoring the max run time of 6hrs and going for 8?



Never mind this either..timer on BOINC is way off on these tasks.
Actual run time is 5.95 hrs. BOINC was showing 8hrs.
ID: 74574 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Profile dcdc

Send message
Joined: 3 Nov 05
Posts: 1829
Credit: 115,475,644
RAC: 55,959
Message 74576 - Posted: 25 Nov 2012, 22:57:25 UTC - in response to Message 74571.  
Last modified: 25 Nov 2012, 22:58:20 UTC

On a different note, why are LP5 tasks ignoring the max run time of 6hrs and going for 8?

It seems I'm following you around the threads!

It's just a run-time preference, not max run time- it'll not start any more tasks if it calculates that it won't finish them within that period, but if it's started one then it won't dump the work it's done to keep within the period. I just pulled up this LP5 one of yours:

https://boinc.bakerlab.org/rosetta/result.php?resultid=545820019

which ran for 5.98 hours!

[edit]
Ah - you beat me to it ;)
ID: 74576 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Link
Avatar

Send message
Joined: 4 May 07
Posts: 352
Credit: 382,349
RAC: 0
Message 74583 - Posted: 26 Nov 2012, 16:13:29 UTC - in response to Message 74576.  

It's just a run-time preference, not max run time- it'll not start any more tasks if it calculates that it won't finish them within that period, but if it's started one then it won't dump the work it's done to keep within the period.

Also note the wording "Target CPU run time". It's CPU time per WU preference, not run (elapsed) time preference. I have 18 hours preference, with the CPU doing a lot of other things most tasks finish for me after 20 hours elapsed time with <17 hours CPU time.
.
ID: 74583 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Profile Greg_BE
Avatar

Send message
Joined: 30 May 06
Posts: 5662
Credit: 5,700,157
RAC: 2,063
Message 74628 - Posted: 30 Nov 2012, 15:08:48 UTC

and the hits keep on coming...ploop4 strikes again:

ERROR: Cannot open PDB file "y367.pdb.gz"
ERROR:: Exit from: ......srccoreimport_poseimport_pose.cc line: 198
BOINC:: Error reading and gzipping output datafile: default.out
called boinc_finish

https://boinc.bakerlab.org/rosetta/result.php?resultid=547070758
ID: 74628 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
P . P . L .

Send message
Joined: 20 Aug 06
Posts: 581
Credit: 4,865,274
RAC: 0
Message 74669 - Posted: 4 Dec 2012, 23:56:50 UTC

This one failed after 1min, 40sec's.

https://boinc.bakerlab.org/rosetta/workunit.php?wuid=498118247

rb_12_03_34782_65400__t000__1_C1_SAVE_ALL_OUT_IGNORE_THE_REST_66465_163_0


Unpacking zip data: ../../projects/boinc.bakerlab.org_rosetta/minirosetta_database_rev52077.zip
Unpacking WU data ...
Unpacking data: ../../projects/boinc.bakerlab.org_rosetta/input_rb_12_03_34782_65400__t000__1_C1_robetta.zip
Setting database description ...
Setting up checkpointing ...
Setting up graphics native ...
BOINC:: Worker startup.
Starting watchdog...
Watchdog active.
# cpu_run_time_pref: 21600
dof_atom1 atomno= 3 rsd= 1
atom1 atomno= 1 rsd= 1
atom2 atomno= 2 rsd= 1
atom3 atomno= 5 rsd= 1
atom4 atomno= 6 rsd= 1
THETA1 nan
THETA3 nan
PHI2 0

ERROR: AtomTree::torsion_angle_dof_id: angle range error
ERROR:: Exit from: src/core/kinematics/AtomTree.cc line: 780
BOINC:: Error reading and gzipping output datafile: default.out
called boinc_finish

</stderr_txt>
]]>

ID: 74669 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
P . P . L .

Send message
Joined: 20 Aug 06
Posts: 581
Credit: 4,865,274
RAC: 0
Message 74711 - Posted: 11 Dec 2012, 7:06:45 UTC

This thing ran for over ten hours(10), on my 6hr limit then failed.

Others had had this fail as well, Thanks for nothing!

hyb_af_bench_4aimA_SAVE_ALL_OUT_IGNORE_THE_REST_57052_456_2

https://boinc.bakerlab.org/rosetta/workunit.php?wuid=496983312

Setting database description ...
Setting up checkpointing ...
Setting up graphics native ...
BOINC:: Worker startup.
Starting watchdog...
Watchdog active.
# cpu_run_time_pref: 21600
BOINC:: CPU time: 36354.1s, 14400s + 21600s[2012-12-11 17:50: 2:] :: BOINC
WARNING! cannot get file size for default.out.gz: could not open file.
Output exists: default.out.gz Size: -1
InternalDecoyCount: 0 (GZ)
-----
0
-----
Stream information inconsistent.
Writing W_0000001
======================================================
DONE :: 1 starting structures 36354.1 cpu seconds
This process generated 1 decoys from 1 attempts
======================================================
called boinc_finish

</stderr_txt>
]]>

Validate state Workunit error - check skipped
Claimed credit 277.111340820468
Granted credit 0
application version 3.45

ID: 74711 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Warped

Send message
Joined: 15 Jan 06
Posts: 48
Credit: 1,788,185
RAC: 0
Message 74714 - Posted: 11 Dec 2012, 17:26:47 UTC - in response to Message 74711.  

This thing ran for over ten hours(10), on my 6hr limit then failed.

Others had had this fail as well, Thanks for nothing!

hyb_af_bench_4aimA_SAVE_ALL_OUT_IGNORE_THE_REST_57052_456_2

https://boinc.bakerlab.org/rosetta/workunit.php?wuid=496983312

Setting database description ...
Setting up checkpointing ...
Setting up graphics native ...
BOINC:: Worker startup.
Starting watchdog...
Watchdog active.
# cpu_run_time_pref: 21600
BOINC:: CPU time: 36354.1s, 14400s + 21600s[2012-12-11 17:50: 2:] :: BOINC
WARNING! cannot get file size for default.out.gz: could not open file.
Output exists: default.out.gz Size: -1
InternalDecoyCount: 0 (GZ)
-----
0
-----
Stream information inconsistent.
Writing W_0000001
======================================================
DONE :: 1 starting structures 36354.1 cpu seconds
This process generated 1 decoys from 1 attempts
======================================================
called boinc_finish

</stderr_txt>
]]>

Validate state Workunit error - check skipped
Claimed credit 277.111340820468
Granted credit 0
application version 3.45


I try to remember to abort these "hyb" tasks as soon as they arrive.
However, I just discovered one and it's already way over limit without a single checkpoint :-(
Warped

ID: 74714 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Profile [VENETO] boboviz

Send message
Joined: 1 Dec 05
Posts: 1860
Credit: 8,152,819
RAC: 8,197
Message 74715 - Posted: 11 Dec 2012, 22:34:33 UTC

Screensaver crash on all "endo_aa_Pan" wus (on win xp and win7)
ID: 74715 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Mad_Max

Send message
Joined: 31 Dec 09
Posts: 207
Credit: 23,317,169
RAC: 11,448
Message 74716 - Posted: 12 Dec 2012, 0:18:14 UTC - in response to Message 74715.  

Screensaver crash on all "endo_aa_Pan" wus (on win xp and win7)

I confirm this
Screensaver(/show grapchics) crash (or hangs in some cases) on endo_aa_Pan... WUs
ID: 74716 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Profile [VENETO] boboviz

Send message
Joined: 1 Dec 05
Posts: 1860
Credit: 8,152,819
RAC: 8,197
Message 74742 - Posted: 16 Dec 2012, 10:09:48 UTC - in response to Message 74716.  

I confirm this
Screensaver(/show grapchics) crash (or hangs in some cases) on endo_aa_Pan... WUs


And also problem with checkpoint in endo_aa_Pan...
ID: 74742 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Mad_Max

Send message
Joined: 31 Dec 09
Posts: 207
Credit: 23,317,169
RAC: 11,448
Message 74743 - Posted: 16 Dec 2012, 11:20:51 UTC

Confirm promlems with checkpoint too:
All my endo_aa_ WUs fall back to 0.0% progress after restart.
And not only "endo_aa_Pan..." but all "endo_aa_..." (endo_aa_AaeI... endo_aa_BcuI... etc.)
ID: 74743 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
P . P . L .

Send message
Joined: 20 Aug 06
Posts: 581
Credit: 4,865,274
RAC: 0
Message 74754 - Posted: 19 Dec 2012, 7:48:44 UTC

This failed after about 2min.


rb_12_17_34962_66206__t000__1_C1_SAVE_ALL_OUT_IGNORE_THE_REST_68957_485_0

https://boinc.bakerlab.org/rosetta/workunit.php?wuid=500612816


# cpu_run_time_pref: 21600
dof_atom1 atomno= 3 rsd= 1
atom1 atomno= 1 rsd= 1
atom2 atomno= 2 rsd= 1
atom3 atomno= 5 rsd= 1
atom4 atomno= 6 rsd= 1
THETA1 nan
THETA3 nan
PHI2 0

ERROR: AtomTree::torsion_angle_dof_id: angle range error
ERROR:: Exit from: src/core/kinematics/AtomTree.cc line: 780
BOINC:: Error reading and gzipping output datafile: default.out
called boinc_finish

</stderr_txt>

ID: 74754 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Profile [VENETO] boboviz

Send message
Joined: 1 Dec 05
Posts: 1860
Credit: 8,152,819
RAC: 8,197
Message 74774 - Posted: 23 Dec 2012, 17:24:09 UTC - in response to Message 74743.  

Confirm promlems with checkpoint too:
All my endo_aa_ WUs fall back to 0.0% progress after restart.
And not only "endo_aa_Pan..." but all "endo_aa_..." (endo_aa_AaeI... endo_aa_BcuI... etc.)


Again....
Please, fix it

ID: 74774 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Profile [VENETO] boboviz

Send message
Joined: 1 Dec 05
Posts: 1860
Credit: 8,152,819
RAC: 8,197
Message 74775 - Posted: 24 Dec 2012, 8:12:25 UTC - in response to Message 74743.  

Confirm promlems with checkpoint too:


Same problems (screen and checkpoint) on all my CDPK_aa_

ID: 74775 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Mad_Max

Send message
Joined: 31 Dec 09
Posts: 207
Credit: 23,317,169
RAC: 11,448
Message 74776 - Posted: 24 Dec 2012, 11:32:39 UTC

Confirm problem with graphics app on CDPK_aa_... WUs too.
Not sure about checkpoints yet. (only 2 WUs atm)
ID: 74776 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Previous · 1 · 2 · 3 · 4 · Next

Message boards : Number crunching : Mini Rosetta 3.45



©2024 University of Washington
https://www.bakerlab.org