Report stuck & aborted 5.01 WU here please - III

Rebel Alliance

Joined: 4 Nov 05
Posts: 50
Credit: 3,579,531
RAC: 0
Message 14684 - Posted: 26 Apr 2006, 16:19:40 UTC

Result ID 17981175
Name FACONTACTS_RECENTER_NOFILTERS_1wit__448_181_1
Workunit 14525452
Created 23 Apr 2006 2:00:28 UTC
Sent 23 Apr 2006 9:28:17 UTC
Received 26 Apr 2006 16:11:30 UTC
Server state Over
Outcome Client error
Client state Computing
Exit status -197 (0xffffff3b)
Computer ID 107679
Report deadline 7 May 2006 9:28:17 UTC
CPU time 83730.416483
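
A note on the "Exit status" line: the two values shown are the same number, once as a signed integer and once as its 32-bit two's-complement hex form. A minimal Python sketch of the conversion follows; the helper names `to_unsigned32` and `to_signed32` are made up for illustration and are not part of BOINC or Rosetta.

```python
def to_unsigned32(status: int) -> str:
    """Format a signed exit status as 32-bit two's-complement hex."""
    return f"0x{status & 0xFFFFFFFF:08x}"


def to_signed32(hex_text: str) -> int:
    """Turn the hex form back into the signed value."""
    value = int(hex_text, 16)
    # Values with the top bit set represent negative numbers.
    return value - (1 << 32) if value >= (1 << 31) else value


print(to_unsigned32(-197))        # hex form of the status in this post
print(to_signed32("0xffffff4f"))  # the other status seen in this thread
```

In this thread, -197 shows up on the units people aborted by hand, and -177 shows up next to "Maximum CPU time exceeded".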

EdMulock
Joined: 14 Mar 06
Posts: 30
Credit: 2,347,485
RAC: 0
Message 14689 - Posted: 26 Apr 2006, 17:30:23 UTC

Work unit 12214678 aborted after 35 hours, showing 4% completion.

Claimed credit 430.094033921575

[DPC]TeamGrazzie~APCIII
Joined: 17 Mar 06
Posts: 1
Credit: 271,636
RAC: 0
Message 14700 - Posted: 26 Apr 2006, 20:25:08 UTC

Aborted HBLR_1.0_1hz6_420_9212 https://boinc.bakerlab.org/rosetta/workunit.php?wuid=13425964 on computer https://boinc.bakerlab.org/rosetta/show_host_detail.php?hostid=184836.

The WU jumps back to "Ab initio" after model 1, at around step 34500 (full atom relax).
This repeats every time the WU passes step 345xx.

Result ID https://boinc.bakerlab.org/rosetta/result.php?resultid=18221537

[DPC]Division_Brabant~OldButNotSoWise
Joined: 23 Jan 06
Posts: 42
Credit: 371,797
RAC: 0
Message 14701 - Posted: 26 Apr 2006, 20:55:44 UTC
Last modified: 26 Apr 2006, 20:57:39 UTC

https://boinc.bakerlab.org/rosetta/result.php?resultid=17773392
Maximum CPU time exceeded.
That's no fun: more than 5 days of crunching and suddenly it errors out :(

*peep* happens, just joking, that's the risk when you're crunching :)

This one gave me beautiful red dots in the graphics :D

Los Alcoholicos~La Muis
Joined: 4 Nov 05
Posts: 34
Credit: 1,041,724
RAC: 0
Message 14704 - Posted: 26 Apr 2006, 22:16:38 UTC
Last modified: 26 Apr 2006, 22:21:42 UTC

Aborted HBLR_1.0_1mky_ROT_TRIALS_TRIE_449_22_0

cpu time 45:16 at 8,55%


And another one HBLR_1.0_1di2_420_4823_1

cpu time 55:25 at 33,92%

KWSN - Sir Brian - err sorry - wrong film!
Joined: 23 Feb 06
Posts: 1
Credit: 353,945
RAC: 0
Message 14705 - Posted: 26 Apr 2006, 22:18:14 UTC

Here ye go a few from me to add to the list...


https://boinc.bakerlab.org/rosetta/result.php?resultid=18309752

https://boinc.bakerlab.org/rosetta/result.php?resultid=18068362

https://boinc.bakerlab.org/rosetta/result.php?resultid=17878733



brianwc
Joined: 7 Dec 05
Posts: 3
Credit: 701,894
RAC: 0
Message 14718 - Posted: 27 Apr 2006, 3:50:14 UTC

Aborted work unit: 14555727

https://boinc.bakerlab.org/rosetta/workunit.php?wuid=14555727

Three people errored-out on this one.

Bones
Joined: 16 Sep 05
Posts: 3
Credit: 713,317
RAC: 0
Message 14728 - Posted: 27 Apr 2006, 7:19:17 UTC
Last modified: 27 Apr 2006, 7:19:42 UTC

And another one: 13331599, going really slowly at 3.02%, aborted after 15 hours.

msr-berlin
Joined: 28 Nov 05
Posts: 2
Credit: 8,058
RAC: 0
Message 14730 - Posted: 27 Apr 2006, 7:27:02 UTC

Aborted the following WU PROD_ABINITIO_1tul__447_80279
https://boinc.bakerlab.org/rosetta/workunit.php?wuid=15073495

0.0% after 22 hours of work


anders n
Joined: 19 Sep 05
Posts: 403
Credit: 537,991
RAC: 0
Message 14740 - Posted: 27 Apr 2006, 10:25:54 UTC

Stuck at 1% with several "red dots" in the graphics.

https://boinc.bakerlab.org/rosetta/result.php?resultid=18246367

Anders n

mind
Joined: 20 Feb 06
Posts: 1
Credit: 50,095
RAC: 0
Message 14742 - Posted: 27 Apr 2006, 11:04:19 UTC - in response to Message 14740.  
Last modified: 27 Apr 2006, 11:04:40 UTC

FACONTACTS_RECENTER_NOFILTER_1cei__448_591_2.

running 60 hours, aborted.

https://boinc.bakerlab.org/rosetta/workunit.php?wuid=14553294
https://boinc.bakerlab.org/rosetta/result.php?resultid=18081512

(Not sure which link I had to post, so I posted both.)
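
On which link is which: a `workunit.php?wuid=` link points at the work unit as a whole, which may have been sent to several hosts, while a `result.php?resultid=` link points at one host's attempt. A rough Python sketch for telling the two apart, based only on the URL shapes that appear in this thread; `classify_link` is a hypothetical helper, not a project tool.

```python
from urllib.parse import urlparse, parse_qs


def classify_link(url: str):
    """Return ("workunit", id) or ("result", id) for the two URL forms seen here."""
    parsed = urlparse(url)
    query = parse_qs(parsed.query)
    if parsed.path.endswith("workunit.php") and "wuid" in query:
        return ("workunit", int(query["wuid"][0]))
    if parsed.path.endswith("result.php") and "resultid" in query:
        return ("result", int(query["resultid"][0]))
    raise ValueError(f"not a workunit or result link: {url}")


print(classify_link("https://boinc.bakerlab.org/rosetta/workunit.php?wuid=14553294"))
print(classify_link("https://boinc.bakerlab.org/rosetta/result.php?resultid=18081512"))
```

Posting the result link is probably the more specific of the two, since it identifies the host and the run that failed.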

Rebel Alliance
Joined: 4 Nov 05
Posts: 50
Credit: 3,579,531
RAC: 0
Message 14757 - Posted: 27 Apr 2006, 14:47:49 UTC

Result ID 18047453
Name HBLR_1.0_1ogw_420_7359_2
Workunit 13416696
Created 23 Apr 2006 19:40:01 UTC
Sent 24 Apr 2006 2:10:22 UTC
Received 27 Apr 2006 14:46:09 UTC
Server state Over
Outcome Client error
Client state Computing
Exit status -197 (0xffffff3b)
Computer ID 77284
Report deadline 8 May 2006 2:10:22 UTC
CPU time 127774.709382
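
For anyone collecting these reports, the pasted result records are regular enough to parse. Below is a minimal Python sketch, assuming the field names are exactly the ones shown in this thread and that "CPU time" is in seconds; `parse_record` and `FIELDS` are illustrative names, not anything provided by the project.

```python
# Field labels as they appear in the records pasted in this thread.
FIELDS = [
    "Result ID", "Name", "Workunit", "Created", "Sent", "Received",
    "Server state", "Outcome", "Client state", "Exit status",
    "Computer ID", "Report deadline", "CPU time",
]

SAMPLE = """Result ID 18047453
Name HBLR_1.0_1ogw_420_7359_2
Workunit 13416696
Created 23 Apr 2006 19:40:01 UTC
Sent 24 Apr 2006 2:10:22 UTC
Received 27 Apr 2006 14:46:09 UTC
Server state Over
Outcome Client error
Client state Computing
Exit status -197 (0xffffff3b)
Computer ID 77284
Report deadline 8 May 2006 2:10:22 UTC
CPU time 127774.709382"""


def parse_record(text: str) -> dict:
    """Split each line into a known field label and the rest of the line."""
    record = {}
    for line in text.splitlines():
        line = line.strip()
        for field in FIELDS:
            if line.startswith(field + " "):
                record[field] = line[len(field) + 1:]
                break
    return record


record = parse_record(SAMPLE)
cpu_hours = float(record["CPU time"]) / 3600.0  # assumes seconds
print(record["Name"], f"{cpu_hours:.2f} CPU hours")
```

Converted this way, the unit above used roughly 35.5 CPU hours before it was aborted.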

Rebel Alliance
Joined: 4 Nov 05
Posts: 50
Credit: 3,579,531
RAC: 0
Message 14758 - Posted: 27 Apr 2006, 15:20:05 UTC

Result ID 17930605
Name HBLR_1.0_1ogw_420_1370_2
Workunit 13336463
Created 22 Apr 2006 12:53:08 UTC
Sent 22 Apr 2006 19:30:38 UTC
Received 27 Apr 2006 15:18:39 UTC
Server state Over
Outcome Client error
Client state Computing
Exit status -197 (0xffffff3b)
Computer ID 148992
Report deadline 6 May 2006 19:30:38 UTC
CPU time 107976.460834

Rebel Alliance
Joined: 4 Nov 05
Posts: 50
Credit: 3,579,531
RAC: 0
Message 14759 - Posted: 27 Apr 2006, 15:25:59 UTC

Result ID 17915761
Name FACONTACTS_RECENTER_NOFILTERS_1bk2__448_398_1
Workunit 14540163
Created 22 Apr 2006 9:00:23 UTC
Sent 22 Apr 2006 15:49:23 UTC
Received 27 Apr 2006 15:25:11 UTC
Server state Over
Outcome Client error
Client state Computing
Exit status -197 (0xffffff3b)
Computer ID 155638
Report deadline 6 May 2006 15:49:23 UTC
CPU time 129242.046875

Charlie
Joined: 25 Mar 06
Posts: 53
Credit: 424,472
RAC: 0
Message 14776 - Posted: 27 Apr 2006, 17:52:05 UTC

AB_CASP6_t272__456_3679: aborted due to Rosetta@home causing a Windows error. I was unable to track down the error because it caused Windows to reboot.

Jimi@0wned.org.uk
Joined: 10 Mar 06
Posts: 29
Credit: 335,252
RAC: 0
Message 14783 - Posted: 27 Apr 2006, 18:51:50 UTC

This didn't have a problem except it wouldn't restart after a reboot. Don't know why, don't think it's the unit's fault tbh.

Result ID 18286555
Name PROD_ABINITIO_FAST_1tul__447_82848_0
Workunit 15088910
Created 26 Apr 2006 6:21:10 UTC
Sent 26 Apr 2006 11:16:48 UTC
Received 26 Apr 2006 19:15:45 UTC
Server state Over
Outcome Client error
Client state Computing
Exit status -197 (0xffffff3b)
Computer ID 190981
Report deadline 10 May 2006 11:16:48 UTC
CPU time 4557.71875

Hassan
Joined: 7 Mar 06
Posts: 4
Credit: 750,146
RAC: 0
Message 14784 - Posted: 27 Apr 2006, 18:57:32 UTC
Last modified: 27 Apr 2006, 18:59:11 UTC

https://boinc.bakerlab.org/rosetta/result.php?resultid=18122796

2006-04-26 21:35:57 [rosetta@home] Scheduler request to https://boinc.bakerlab.org/rosetta_cgi/cgi succeeded
2006-04-26 22:23:18 [rosetta@home] Aborting result HBLR_1.0_1dtj_420_3452_3: exceeded CPU time limit 130783.200508
2006-04-26 22:23:18 [rosetta@home] Unrecoverable error for result HBLR_1.0_1dtj_420_3452_3 (Maximum CPU time exceeded)
2006-04-26 22:23:19 [---] request_reschedule_cpus: process exited
2006-04-26 22:23:19 [rosetta@home] Computation for result HBLR_1.0_1dtj_420_3452_3 finished
2006-04-26 22:23:19 [rosetta@home] Starting result PROD_ABINITIO_ALPHABETABAR_1tul__447_85936_0 using rosetta version 501
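
If you want to find these auto-aborts in your own client log, the lines above follow a regular "timestamp [project] message" shape. Here is a small Python sketch that pulls out the result name and the CPU time limit; the regular expressions are fitted to the lines in this post only, so other client versions may word the message differently, and `find_cpu_limit_aborts` is a hypothetical name.

```python
import re

# "YYYY-MM-DD HH:MM:SS [project] message", as in the excerpt above.
LOG_LINE = re.compile(r"^(\S+ \S+) \[([^\]]+)\] (.*)$")
ABORT = re.compile(r"^Aborting result (\S+): exceeded CPU time limit ([0-9.]+)$")

SAMPLE_LOG = """2006-04-26 22:23:18 [rosetta@home] Aborting result HBLR_1.0_1dtj_420_3452_3: exceeded CPU time limit 130783.200508
2006-04-26 22:23:18 [rosetta@home] Unrecoverable error for result HBLR_1.0_1dtj_420_3452_3 (Maximum CPU time exceeded)
2006-04-26 22:23:19 [---] request_reschedule_cpus: process exited"""


def find_cpu_limit_aborts(log_text: str):
    """Return (result name, CPU time limit) for each CPU-limit abort line."""
    aborts = []
    for line in log_text.splitlines():
        entry = LOG_LINE.match(line.strip())
        if not entry:
            continue
        hit = ABORT.match(entry.group(3))
        if hit:
            aborts.append((hit.group(1), float(hit.group(2))))
    return aborts


for name, limit in find_cpu_limit_aborts(SAMPLE_LOG):
    print(f"{name} hit the limit of {limit:.0f} s ({limit / 3600:.1f} h)")
```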

Mine appears to have auto-aborted. That's almost 1200 claimed credit and 0 granted, so is that a waste, or do I still get credit?

Result ID 18122796
Name HBLR_1.0_1dtj_420_3452_3
Workunit 13389346
Created 24 Apr 2006 15:15:38 UTC
Sent 24 Apr 2006 20:33:06 UTC
Received 27 Apr 2006 5:24:19 UTC
Server state Over
Outcome Client error
Client state Computing
Exit status -177 (0xffffff4f)
Computer ID 175797
Report deadline 8 May 2006 20:33:06 UTC
CPU time 130784.75
stderr out <core_client_version>5.2.13</core_client_version>
<message>Maximum CPU time exceeded
</message>
<stderr_txt>
# random seed: 1597253
# cpu_run_time_pref: 7200
# random seed: 1597253
# random seed: 1597253

</stderr_txt>


Validate state Invalid
Claimed credit 1165.54456988046
Granted credit 0
application version 5.01
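
To put the stderr output in perspective, here is a small Python sketch that reads the `# key: value` lines and compares `cpu_run_time_pref` with the reported CPU time. It assumes `cpu_run_time_pref` is a target run time in seconds, which the output itself does not state; `parse_comment_lines` is an illustrative name.

```python
import re

STDERR_TXT = """# random seed: 1597253
# cpu_run_time_pref: 7200
# random seed: 1597253
# random seed: 1597253
"""
REPORTED_CPU_TIME = 130784.75  # "CPU time" from the record above


def parse_comment_lines(text: str):
    """Return (key, value) pairs for lines shaped like '# key: value'."""
    return re.findall(r"^# ([^:]+): (.*)$", text, flags=re.MULTILINE)


pairs = parse_comment_lines(STDERR_TXT)
target = float(dict(pairs)["cpu_run_time_pref"])  # assumed to be seconds
seed_lines = sum(1 for key, _ in pairs if key == "random seed")
ratio = REPORTED_CPU_TIME / target

print(f"seed lines: {seed_lines}, CPU time is {ratio:.1f}x the preference")
```

Under that assumption, the result ran for about 18 times the preferred run time before the client stopped it.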


tralala
Joined: 8 Apr 06
Posts: 376
Credit: 581,806
RAC: 0
Message 14787 - Posted: 27 Apr 2006, 19:08:30 UTC - in response to Message 14784.  

https://boinc.bakerlab.org/rosetta/result.php?resultid=18122796


Mine appears to have auto-aborted. That's almost 1200 claimed credit and 0 granted, so is that a waste, or do I still get credit?

Result ID 18122796
Name HBLR_1.0_1dtj_420_3452_3
Workunit 13389346


That's an amazing WU. It was first sent out on April 6th, and after the deadline passed without a result it was sent out three more times, and all three failed. It was then returned as VALID from the first host, with a reported runtime of only 6800 seconds and with app 4.83. Whoppa!

[DPC]Division_Brabant~OldButNotSoWise
Joined: 23 Jan 06
Posts: 42
Credit: 371,797
RAC: 0
Message 14794 - Posted: 27 Apr 2006, 20:36:28 UTC
Last modified: 27 Apr 2006, 20:46:51 UTC

https://boinc.bakerlab.org/rosetta/result.php?resultid=18190274
https://boinc.bakerlab.org/rosetta/result.php?resultid=18190275

Both ran for more than 15 hours and claimed to be at 5%.

I learned my lesson after a job crunched for more than 5 days and ended with a CPU time exceeded error, so I aborted the jobs.

This happens more and more, very annoying.




[DPC]FOKschaap~_mcintosh_
Joined: 4 Dec 05
Posts: 5
Credit: 118,303
RAC: 0
Message 14798 - Posted: 27 Apr 2006, 22:16:13 UTC
Last modified: 27 Apr 2006, 22:17:05 UTC

AB_CASP6_t216__458_807 stuck at 1%, aborted.

AB_CASP6_t216__456_2307 stuck at 1%, but it seems that the job was successfully crunched by another user.

This is strange because I have a much faster CPU than the one it was completed with, and he finished in 9,834.69 sec. while mine was still at 1% after 5,698.12 sec.
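
One rough way to judge whether a unit is worth waiting for is to extrapolate total run time from elapsed time and the reported percentage. The Python sketch below does that under the assumption that progress is linear, which these stuck units suggest is often not the case, so treat the number as a warning sign rather than a prediction. `projected_total_seconds` is a hypothetical helper.

```python
def projected_total_seconds(elapsed_seconds: float, percent_done: float) -> float:
    """Naive linear extrapolation of total run time from reported progress."""
    if percent_done <= 0:
        raise ValueError("no progress reported, cannot extrapolate")
    return elapsed_seconds * 100.0 / percent_done


# Figures from this post: 1% after 5,698.12 s, versus a 9,834.69 s finish elsewhere.
mine = projected_total_seconds(5698.12, 1.0)
other_host = 9834.69
print(f"projected: {mine:.0f} s, about {mine / other_host:.0f}x the other host's run")

# Figures from an earlier post in the thread: 35 hours at 4%.
print(f"projected: {projected_total_seconds(35 * 3600, 4.0) / 3600:.0f} hours")
```

A projection tens of times longer than another host's completed run of the same job is a fair hint that the unit is stuck rather than slow.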



©2025 University of Washington
https://www.bakerlab.org