ABORT THESE JOBS

Message boards : Number crunching : ABORT THESE JOBS

To post messages, you must log in.

AuthorMessage
Profile dag
Avatar

Send message
Joined: 16 Dec 05
Posts: 106
Credit: 1,000,020
RAC: 0
Message 14690 - Posted: 26 Apr 2006, 17:46:08 UTC
Last modified: 26 Apr 2006, 17:47:04 UTC

I would like to see a sticky thread that is read-only for the general user that would contain only alerts from the Rosetta staff to abort jobs or similar advisories.

I could subscribe to the thread and know that the email advisory was important.
dag
--Finding aliens is cool, but understanding the structure of proteins is useful.
ID: 14690 · Rating: 3 · rate: Rate + / Rate - Report as offensive    Reply Quote
Profile Los Alcoholicos~La Muis

Send message
Joined: 4 Nov 05
Posts: 34
Credit: 1,041,724
RAC: 0
Message 14727 - Posted: 27 Apr 2006, 6:51:20 UTC
Last modified: 27 Apr 2006, 7:10:01 UTC

I support this request!

I simply don't have time to read all the threads and find the scattered warnings about bad or (possible) problemgiving wu's.

I just had to abort 2 HBLR_1.0_xxx wu's that held 2 pc hostage for days. And just now, when I reported these errors, I find that somewhere in a thread on this board there is this warning by Rhiju:
Please abort these jobs. The workunits with the following names appear to be causing problems on some machines:

HBLR_1.0_XXX_ROT_TRIALS_TRIE_449...

Its a bit strange, since we didn't see this problem in our Ralph tests. But just to be safe, go ahead and abort!


And that is a ridiculous situation. These kind of warnings should reach all of us as quick as posible and not only those who have the time to scan all of the treads.


ID: 14727 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Profile Tribaal
Avatar

Send message
Joined: 6 Feb 06
Posts: 80
Credit: 2,754,607
RAC: 0
Message 14729 - Posted: 27 Apr 2006, 7:21:05 UTC

I support this request

I agree. WU's that need to be aborted NEED to be listed in ONE convenient place, that can be bookmarked and checked easily.


- trib'
ID: 14729 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
MAOJC

Send message
Joined: 19 Jan 06
Posts: 15
Credit: 2,727,567
RAC: 0
Message 14752 - Posted: 27 Apr 2006, 13:56:03 UTC

Ditto from me. Running a farm and having to baby sit these inds issues is a real pain
ID: 14752 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Profile dcdc

Send message
Joined: 3 Nov 05
Posts: 1832
Credit: 119,664,803
RAC: 11,191
Message 14753 - Posted: 27 Apr 2006, 14:00:06 UTC

A purge function would reduce the need for this greatly - is that a possibility?
ID: 14753 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Lucky Angel~AES_koetje

Send message
Joined: 18 Mar 06
Posts: 4
Credit: 0
RAC: 0
Message 14755 - Posted: 27 Apr 2006, 14:34:07 UTC - in response to Message 14729.  

I support this request

I agree. WU's that need to be aborted NEED to be listed in ONE convenient place, that can be bookmarked and checked easily.


- trib'

I do as well. Seems very easy to implement. So .....?

Another important improvement to the community could be An One Place list of common error messages. I posted four days ago this message 14465. It is very silent.
ID: 14755 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Profile Los Alcoholicos~La Muis

Send message
Joined: 4 Nov 05
Posts: 34
Credit: 1,041,724
RAC: 0
Message 14793 - Posted: 27 Apr 2006, 20:31:11 UTC
Last modified: 27 Apr 2006, 20:34:01 UTC

One of the pc's I just freed from a 55 hour loop caused by a HBLR_1.0_xxx wu was at 1,26% after 2 hours on a FACONTACTS_RECENTER wu. Quiet worried because of the previous time waste I used the search and found this:

Posted by Rhiju on 22 april
I'm stopping the sendout of these jobs. For some machines,
there appears to be an incompatibility of 5.01 with these
jobs. Please abort FACONTACTS_RECENTER jobs and HBLR1.0 jobs.


I found a total of 14 of these wu's in the queue of 4 different pc's. So this time I can abort these wu's on time, but what about the next...?

ID: 14793 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Profile rbpeake

Send message
Joined: 25 Sep 05
Posts: 168
Credit: 247,828
RAC: 0
Message 14804 - Posted: 28 Apr 2006, 0:08:25 UTC

Yes, for instance now that 5.06 has been released, should we abort workunits in progress under the earlier 5.01 application?

I am guessing no, that the 5.01 workunits are valuable too, and we should process them to completion, but it would be nice for a confirmation. :)
Regards,
Bob P.
ID: 14804 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Rhiju
Volunteer moderator

Send message
Joined: 8 Jan 06
Posts: 223
Credit: 3,546
RAC: 0
Message 14808 - Posted: 28 Apr 2006, 0:48:13 UTC - in response to Message 14804.  

Yes, go ahead and keep crunching 5.01 workunits unless you suspect that they're stuck. Thanks for checking.

Yes, for instance now that 5.06 has been released, should we abort workunits in progress under the earlier 5.01 application?

I am guessing no, that the 5.01 workunits are valuable too, and we should process them to completion, but it would be nice for a confirmation. :)


ID: 14808 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Lee Carre

Send message
Joined: 6 Oct 05
Posts: 96
Credit: 79,331
RAC: 0
Message 14858 - Posted: 28 Apr 2006, 12:27:22 UTC

I see that, as requested, a moderator has created a thread for this
Want to search the BOINC Wiki, BOINCstats, or various BOINC forums from within firefox? Try the BOINC related Firefox Search Plugins
ID: 14858 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Moderator9
Volunteer moderator

Send message
Joined: 22 Jan 06
Posts: 1014
Credit: 0
RAC: 0
Message 14873 - Posted: 28 Apr 2006, 14:19:43 UTC - in response to Message 14858.  

I see that, as requested, a moderator has created a thread for this


We do try ;>)

Moderator9
ROSETTA@home FAQ
Moderator Contact
ID: 14873 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Lee Carre

Send message
Joined: 6 Oct 05
Posts: 96
Credit: 79,331
RAC: 0
Message 14874 - Posted: 28 Apr 2006, 14:34:35 UTC - in response to Message 14873.  

I see that, as requested, a moderator has created a thread for this


We do try ;>)

i just wanted to point it out to those who'd requested it, incase they hadn't seen it already

i must say it's very nice having a highly organised forum, i for one appreciate your efforts, whoever you might be Mr. Moderator :p
Want to search the BOINC Wiki, BOINCstats, or various BOINC forums from within firefox? Try the BOINC related Firefox Search Plugins
ID: 14874 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Profile Los Alcoholicos~La Muis

Send message
Joined: 4 Nov 05
Posts: 34
Credit: 1,041,724
RAC: 0
Message 14906 - Posted: 28 Apr 2006, 19:30:33 UTC - in response to Message 14873.  

I see that, as requested, a moderator has created a thread for this


We do try ;>)

Thanks
ID: 14906 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Profile dag
Avatar

Send message
Joined: 16 Dec 05
Posts: 106
Credit: 1,000,020
RAC: 0
Message 14910 - Posted: 28 Apr 2006, 20:06:21 UTC - in response to Message 14873.  
Last modified: 28 Apr 2006, 20:06:54 UTC

I see that, as requested, a moderator has created a thread for this


We do try ;>)

It works, too.

Let's kill this thread, OK?
dag
--Finding aliens is cool, but understanding the structure of proteins is useful.
ID: 14910 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Moderator9
Volunteer moderator

Send message
Joined: 22 Jan 06
Posts: 1014
Credit: 0
RAC: 0
Message 14911 - Posted: 28 Apr 2006, 20:14:12 UTC - in response to Message 14910.  

...
Let's kill this thread, OK?


Are you kidding? If the idea is a flop I want people to know it was not mine;>0

Thanks for the suggestion people. Keep them coming


Moderator9
ROSETTA@home FAQ
Moderator Contact
ID: 14911 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote

Message boards : Number crunching : ABORT THESE JOBS



©2024 University of Washington
https://www.bakerlab.org