Dodgy wu's?

Message boards : Number crunching : Dodgy wu's?

To post messages, you must log in.

AuthorMessage
Profile adrianxw
Avatar

Send message
Joined: 18 Sep 05
Posts: 652
Credit: 11,662,550
RAC: 1,276
Message 35284 - Posted: 22 Jan 2007, 11:49:22 UTC
Last modified: 22 Jan 2007, 12:13:33 UTC

I have received 3 wu's with names like PSH_0134_looprlx_GP120_OD1_115_136_0694_1506_27, (the numbers vary but the general format is the same). All have crashed after about 20 seconds. All have been resent, and crashed at the second host as well.

Error

<core_client_version>5.4.11</core_client_version>
<message>
Forkert funktion. (0x1) - exit code 1 (0x1)
</message>
<stderr_txt>
# random seed: 2081524
ERROR:: Exit at: .fragments.cc line:459

</stderr_txt>

Wu's

52044006
52055125
52059224

More...

52064313
52066797
52067270

... have suspended Rosetta.
Wave upon wave of demented avengers march cheerfully out of obscurity into the dream.
ID: 35284 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
murky

Send message
Joined: 24 Sep 06
Posts: 9
Credit: 214,896
RAC: 0
Message 35289 - Posted: 22 Jan 2007, 12:04:17 UTC - in response to Message 35284.  

[quote]I have received 3 wu's with names like PSH_0134_looprlx_GP120_OD1_115_136_0694_1506_27, (the numbers vary but the general format is the same). All have crashed after about 20 seconds. All have been resent, and crashed at the second host as well.

The same situation here on one machine. & WU's like this:

PSH_0134_looprlx_GP120_OD1_115_136_0723_1506_14

They were reurned after 17 seconds!!!!
ID: 35289 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Profile Wang Solutions
Avatar

Send message
Joined: 16 Jul 06
Posts: 3
Credit: 1,909,342
RAC: 0
Message 35290 - Posted: 22 Jan 2007, 12:24:47 UTC

The last dozen or so work units I have downloaded have all had a name beginning with something like PSH_0135_looprlx_ and all have failed after 21 seconds both for me and the other machine that downloaded them.

All showing the same error messages as in this thread. I have also suspended Rosetta.

Join the No.1 Australian Team!
ID: 35290 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
288VKYUjwsXfAaTXn6SFJC4LVPRf

Send message
Joined: 16 Dec 05
Posts: 31
Credit: 153,110
RAC: 0
Message 35291 - Posted: 22 Jan 2007, 13:04:39 UTC

Same problem here

https://boinc.bakerlab.org/rosetta/workunit.php?wuid=52067585
ID: 35291 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Aukusti

Send message
Joined: 16 Jul 06
Posts: 2
Credit: 5,469,295
RAC: 0
Message 35292 - Posted: 22 Jan 2007, 13:29:51 UTC - in response to Message 35291.  

Same here, last two jobs were psh 0133 or0134 and both crashed under 20 seconds

https://boinc.bakerlab.org/rosetta/results.php?userid=100296

ID: 35292 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Profile danmark1966

Send message
Joined: 26 Aug 06
Posts: 1
Credit: 785,436
RAC: 0
Message 35299 - Posted: 22 Jan 2007, 14:38:15 UTC

Same here.
ID: 35299 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Profile anders n

Send message
Joined: 19 Sep 05
Posts: 403
Credit: 537,991
RAC: 0
Message 35320 - Posted: 22 Jan 2007, 18:29:26 UTC

I hope the team sees this so they can take them out of the que.
ID: 35320 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
The_Bad_Penguin
Avatar

Send message
Joined: 5 Jun 06
Posts: 2751
Credit: 4,271,025
RAC: 0
Message 35324 - Posted: 22 Jan 2007, 19:27:19 UTC - in response to Message 35290.  

Same with: PSH_0139_looprlx_GP120_OD1_115_136_1780_1506_28

The last dozen or so work units I have downloaded have all had a name beginning with something like PSH_0135_looprlx_ and all have failed after 21 seconds both for me and the other machine that downloaded them.

ID: 35324 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Astro
Avatar

Send message
Joined: 2 Oct 05
Posts: 987
Credit: 500,253
RAC: 0
Message 35327 - Posted: 22 Jan 2007, 19:36:13 UTC
Last modified: 22 Jan 2007, 19:49:28 UTC

Thank You, I thought I was going to be left out of this fun, but I've gotten 3 so far. I didn't even see them error out on the manager, but when I checked my results I saw them. 13-15 seconds per, what a waste of my cpu time (giggle, chuckle, snicker.......cough, cough)

I hope they take the time to manually grant my .0625 credits for each of those three....... (slapping knee.....ROFL)
ID: 35327 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Ocean Archer
Avatar

Send message
Joined: 22 Sep 05
Posts: 32
Credit: 49,302
RAC: 0
Message 35352 - Posted: 22 Jan 2007, 23:50:47 UTC

Oh well - with everyone getting their fair (?) share, who am I to complain ???
ID: 35352 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Mod.Tymbrimi
Volunteer moderator
Avatar

Send message
Joined: 22 Aug 06
Posts: 148
Credit: 153
RAC: 0
Message 35355 - Posted: 23 Jan 2007, 0:56:20 UTC

I passed this on to the programming team. Hopefully they can track down the source of the errant WUs, remove them from the queue and replace them with good ones quickly.


Rosetta Moderator: Mod.Tymbrimi
ROSETTA@home FAQ
Moderator Contact
ID: 35355 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Profile Feet1st
Avatar

Send message
Joined: 30 Dec 05
Posts: 1755
Credit: 4,690,520
RAC: 0
Message 35362 - Posted: 23 Jan 2007, 4:36:53 UTC

Quote of Chu:
A bad batch (PSH_003?_looprlx...) slipped through and we were purging it from the database this morning. As it failed right away after it is started, we don't expect any left out there now...

Add this signature to your EMail:
Running Microsoft's "System Idle Process" will never help cure cancer, AIDS nor Alzheimer's. But running Rosetta@home just might!
https://boinc.bakerlab.org/rosetta/
ID: 35362 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote

Message boards : Number crunching : Dodgy wu's?



©2024 University of Washington
https://www.bakerlab.org