Miscellaneous Work Unit Errors

Lee Carre
Joined: 6 Oct 05
Posts: 96
Credit: 79,331
RAC: 0
Message 13191 - Posted: 7 Apr 2006, 21:54:39 UTC

Not a processing error but a download error; details are in the "problem downloading a WU - HTTP 416" thread.
Want to search the BOINC Wiki, BOINCstats, or various BOINC forums from within firefox? Try the BOINC related Firefox Search Plugins
casio7131
Joined: 10 Oct 05
Posts: 35
Credit: 149,748
RAC: 0
Message 13206 - Posted: 8 Apr 2006, 3:55:36 UTC

8/04/2006 1:44:51 PM|rosetta@home|Unrecoverable error for result HBLR_1.0_1hz6_420_4766_0 ( - exit code -1073741811 (0xc000000d))
resultid=16362541
according to the boinc manager, it was at 33.64% after 5h26min of cpu time.

note, i had the same error 5 days ago (from below on this thread):
Message 12979 - Posted 3 Apr 2006 13:48:31 UTC

3/04/2006 11:38:45 PM|rosetta@home|Unrecoverable error for result HB_BARCODE_30_4ubpA_351_49332_0 ( - exit code -1073741811 (0xc000000d))

resultid=15780509


my system is a dual p3 933, 768 mb ram, winxp sp2.
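
For readers puzzling over the two numbers in each log line: the negative exit code and the hex value in parentheses are the same 32-bit Windows status code, printed once as a signed integer and once as unsigned hex. A minimal sketch of the conversion follows. The helper names and the small lookup table are illustrative assumptions, not part of BOINC or Rosetta; only the two codes seen in this thread are listed.

```python
# Hypothetical helper, not part of BOINC: convert the signed exit code
# shown in the message log into the unsigned NTSTATUS value.
KNOWN_STATUS = {
    0xC0000005: "STATUS_ACCESS_VIOLATION",
    0xC000000D: "STATUS_INVALID_PARAMETER",
}

def to_ntstatus(exit_code: int) -> int:
    """Reinterpret a signed 32-bit exit code as an unsigned 32-bit value."""
    return exit_code & 0xFFFFFFFF

def describe(exit_code: int) -> str:
    """Return the hex form plus a name, if the code is in the small table above."""
    status = to_ntstatus(exit_code)
    name = KNOWN_STATUS.get(status, "unknown")
    return f"0x{status:08x} ({name})"

if __name__ == "__main__":
    for code in (-1073741811, -1073741819):
        print(code, "->", describe(code))
```

So the two codes reported above are different failures: 0xc000000d is an invalid-parameter status, while 0xc0000005 is an access violation.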
casio7131
Joined: 10 Oct 05
Posts: 35
Credit: 149,748
RAC: 0
Message 13219 - Posted: 8 Apr 2006, 7:00:34 UTC

another HB workunit error:
8/04/2006 4:34:36 PM|rosetta@home|Unrecoverable error for result HBLR_1.0_1ogw_425_8951_0 ( - exit code -1073741819 (0xc0000005))
https://boinc.bakerlab.org/rosetta/result.php?resultid=16550140

i think i'll give up reporting any more of these HB errors for the time being (i've reported about 5, on both ralph and rosetta, over the past few hours).

dgnuff
Joined: 1 Nov 05
Posts: 350
Credit: 24,773,605
RAC: 0
Message 13223 - Posted: 8 Apr 2006, 10:35:08 UTC - in response to Message 13219.  

Dunno if it's significant or not. I just had a machine error out an HBLR WU. What's interesting is that I've had a number of them run just fine in the last few days, all with the 4.83 client. This is the first one that failed, but it's also the first WU I've done with the new 4.97 client. This is in no way a statistically significant sample. Not yet. I'll keep an eye on this, since the same system is now merrily crunching away on another HBLR 4.97 WU.
Team_Elteor_Borislavj~BensPlace
Joined: 15 Mar 06
Posts: 1
Credit: 53,458
RAC: 0
Message 13225 - Posted: 8 Apr 2006, 11:09:14 UTC

same here
8-4-2006 12:39:49|rosetta@home|Unrecoverable error for result HBLR_1.0_1ogw_426_2393_0 ( - exit code -1073741819 (0xc0000005))

got 8 of those exits now.

all with 4.97

Ben
RThigpen
Joined: 5 Apr 06
Posts: 1
Credit: 2,447
RAC: 0
Message 13227 - Posted: 8 Apr 2006, 12:33:32 UTC - in response to Message 13225.  

Same here...I'm new to Rosetta@home (just started on the 5th) but I've had no problems until the switch to 4.97. I haven't had a successful WU since then.

Ray
Jimi@0wned.org.uk
Joined: 10 Mar 06
Posts: 29
Credit: 335,252
RAC: 0
Message 13228 - Posted: 8 Apr 2006, 12:35:10 UTC

Computer 190981

All WUs on 8th April ended with memory errors - this is my Crucial Ballistix 2GB memory kit starting to die, I'm afraid. Now offline for testing.
biodoc
Joined: 19 Feb 06
Posts: 14
Credit: 30,717,792
RAC: 0
Message 13231 - Posted: 8 Apr 2006, 14:19:44 UTC
Last modified: 8 Apr 2006, 14:42:58 UTC

Since the upgrade to version 4.97, I've had most of the WU's failing with client errors on my 4 windoze boxes (42 failures & counting!). My linux box is OK. FYI, I had absolutely no errors with 4.83 since it was released. These are the errors I'm seeing:

***UNHANDLED EXCEPTION****
Reason: Access Violation (0xc0000005) at address 0x00599FF4 read attempt to address 0x06C2FFF0
***UNHANDLED EXCEPTION****
Reason: Access Violation (0xc0000005) at address 0x00599FF4 read attempt to address 0x06C2FC8C
***UNHANDLED EXCEPTION****
Reason: Access Violation (0xc0000005) at address 0x007022EA read attempt to address 0x06AAFF34
***UNHANDLED EXCEPTION****
Reason: Access Violation (0xc0000005) at address 0x00599FF4 read attempt to address 0x06CBFA98
***UNHANDLED EXCEPTION****
Reason: Access Violation (0xc0000005) at address 0x007022EA read attempt to address 0x0EB4FF90
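
The reports above show only two distinct faulting instruction addresses, while the address being read changes every time. One way to check that across many stderr dumps is to group the exception lines by the "at address" value. This is a rough sketch, assuming the exception line format shown in this post; the function name is made up for illustration, and allowing "write" as well as "read" is a guess about what other dumps might contain.

```python
import re
from collections import Counter

# Matches the exception line format quoted in this thread.
# Accepting "write" as well as "read" is an assumption.
AV_RE = re.compile(
    r"Access Violation \(0x[0-9a-fA-F]+\) at address 0x([0-9a-fA-F]+) "
    r"(?:read|write) attempt to address 0x([0-9a-fA-F]+)"
)

def count_fault_sites(stderr_text: str) -> Counter:
    """Count access violations per faulting instruction address."""
    return Counter(m.group(1).upper() for m in AV_RE.finditer(stderr_text))

SAMPLE = """\
Reason: Access Violation (0xc0000005) at address 0x00599FF4 read attempt to address 0x06C2FFF0
Reason: Access Violation (0xc0000005) at address 0x00599FF4 read attempt to address 0x06C2FC8C
Reason: Access Violation (0xc0000005) at address 0x007022EA read attempt to address 0x06AAFF34
Reason: Access Violation (0xc0000005) at address 0x00599FF4 read attempt to address 0x06CBFA98
Reason: Access Violation (0xc0000005) at address 0x007022EA read attempt to address 0x0EB4FF90
"""

if __name__ == "__main__":
    for site, n in count_fault_sites(SAMPLE).most_common():
        print(f"0x{site}: {n} crash(es)")
```

A small number of recurring fault sites across unrelated machines would point at the application build rather than at individual hardware, though that is an inference and not something the logs prove.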


426hemi
Joined: 8 Apr 06
Posts: 1
Credit: 62,774
RAC: 0
Message 13234 - Posted: 8 Apr 2006, 14:45:02 UTC

I am new here and using version 4.97. I too have almost all my WU's failing with similar codes. ***unrecoverable error for result HBLR_1.0_2reb_426_1061_0 (-exit code -1073741819 (0xc0000005))***
adrianxw
Joined: 18 Sep 05
Posts: 655
Credit: 11,902,696
RAC: 2,481
Message 13237 - Posted: 8 Apr 2006, 14:52:32 UTC
Last modified: 8 Apr 2006, 15:23:02 UTC

I've had 5 wu's crash all running 4.97, some from each of my two machines, (one machine is Intel P-IV 2.533GHz Northwood running NT4, the other an Intel P-IV 3.2GHz Prescott running XP neither OC'd). All but one crashed with the same error diagnostic...

<core_client_version>5.2.6</core_client_version>
<message> - exit code -1073741819 (0xc0000005)
</message>
<stderr_txt>
# random seed: 1055990
# cpu_run_time_pref: 14400

***UNHANDLED EXCEPTION****
Reason: Access Violation (0xc0000005) at address 0x00599FF4 read attempt to address 0x0A51FFD8

LoadLibrary( "dbghelp.dll" ): GetLastError = 126

Dump of the Worker(offending) thread:
1: 04/08/06 13:03:07
1: SymGetLineFromAddr(): GetLastError = 126


Dump of the Timer thread:
2: 04/08/06 13:03:07


Dump of the Graphics thread:
3: 04/08/06 13:03:07


Exiting...

</stderr_txt>

... three of them crashed after a little over 300 seconds, the other at a little over 600 seconds.

16611251
16600930
16590438
16520360

The other had run for 5,668 seconds and gave this diagnostic...
<core_client_version>5.2.6</core_client_version>
<message> - exit code -1073741819 (0xc0000005)
</message>
<stderr_txt>
# random seed: 1387795
# cpu_run_time_pref: 7200

***UNHANDLED EXCEPTION****
Reason: Access Violation (0xc0000005) at address 0x007022EA read attempt to address 0x0A69FD64

LoadLibrary( "dbghelp.dll" ): GetLastError = 126

Dump of the Worker(offending) thread:
1: 04/08/06 11:33:10


Dump of the Timer thread:
2: 04/08/06 11:33:11


Dump of the Graphics thread:
3: 04/08/06 11:33:11


Exiting...

</stderr_txt>


During the same period, I have had 2 wu's complete normally.

The error (126) is a "module not found" error: dbghelp.dll could not be loaded, so no symbols are available for the dump.
Wave upon wave of demented avengers march cheerfully out of obscurity into the dream.
David Baker
Volunteer moderator
Project administrator
Project developer
Project scientist
Joined: 17 Sep 05
Posts: 705
Credit: 559,847
RAC: 0
Message 13238 - Posted: 8 Apr 2006, 14:52:43 UTC - in response to Message 13234.  

I'm really sorry about these problems. I checked yesterday on RALPH and everything seemed fine, but there clearly is a problem. Unfortunately, I'm just leaving for a family weekend trip so can't figure things out right away. Please bear with us for a couple of days.
Rebel Alliance
Joined: 4 Nov 05
Posts: 50
Credit: 3,579,531
RAC: 0
Message 13241 - Posted: 8 Apr 2006, 15:06:43 UTC

These are starting to get to my machines as well. On the one machine that has them, 3 out of the 4 work units crunched have failed with the same messages as the other people.
"***UNHANDLED EXCEPTION****
Reason: Access Violation (0xc0000005) at address 0x00599FF4 read attempt to address 0x07CDFF48"

"***UNHANDLED EXCEPTION****
Reason: Access Violation (0xc0000005) at address 0x00599FF4 read attempt to address 0x07CDFF60"

and

"***UNHANDLED EXCEPTION****
Reason: Access Violation (0xc0000005) at address 0x007022EA read attempt to address 0x07B5FC7C"

This machine is an AMD 2000xp and has never had a problem with work units before.
[DPC]Charley
Joined: 18 Mar 06
Posts: 9
Credit: 295,915
RAC: 0
Message 13242 - Posted: 8 Apr 2006, 15:10:12 UTC
Last modified: 8 Apr 2006, 15:11:28 UTC

I'm getting tons of errors on the HBLR_1* stuff as well.
Out of 11 work units on one box (193403, for the admins), 9 returned an error after running anywhere from about 1 minute to a couple of minutes short of an hour.

Error codes:
8-4-2006 5:56:24|rosetta@home|Unrecoverable error for result HBLR_1.0_1mky_425_5187_0 ( - exit code -1073741819 (0xc0000005))
8-4-2006 6:03:32|rosetta@home|Unrecoverable error for result HBLR_1.0_2tif_425_7375_0 ( - exit code -1073741819 (0xc0000005))
8-4-2006 6:04:41|rosetta@home|Unrecoverable error for result HBLR_1.0_1n0u_425_9208_0 ( - exit code -1073741819 (0xc0000005))
8-4-2006 7:02:35|rosetta@home|Unrecoverable error for result HBLR_1.0_1mky_425_9364_0 ( - exit code -1073741819 (0xc0000005))
8-4-2006 7:30:31|rosetta@home|Unrecoverable error for result HBLR_1.0_1ogw_425_9448_0 ( - exit code -1073741819 (0xc0000005))
8-4-2006 7:43:45|rosetta@home|Unrecoverable error for result HBLR_1.0_1di2_426_203_0 ( - exit code -1073741819 (0xc0000005))
8-4-2006 12:05:59|rosetta@home|Unrecoverable error for result HBLR_1.0_2tif_426_571_0 ( - exit code -1073741819 (0xc0000005))
8-4-2006 16:08:16|rosetta@home|Unrecoverable error for result HBLR_1.0_2tif_426_3762_0 ( - exit code -1073741819 (0xc0000005))
8-4-2006 16:10:55|rosetta@home|Unrecoverable error for result HBLR_1.0_2reb_426_4608_0 ( - exit code -1073741819 (0xc0000005))


Second machine (181715) is also pumping out errors, each after 150 to 1230 seconds. These are the first units it's doing with 4.97.
Error codes:
08/04/2006 16:26:31|rosetta@home|Unrecoverable error for result HBLR_1.0_1ogw_426_283_1 ( - exit code -1073741819 (0xc0000005))
08/04/2006 16:49:21|rosetta@home|Unrecoverable error for result HBLR_1.0_1r69_426_428_1 ( - exit code -1073741819 (0xc0000005))
08/04/2006 16:53:59|rosetta@home|Unrecoverable error for result HBLR_1.0_1mky_426_4883_0 ( - exit code -1073741819 (0xc0000005))


Number three (193007) isn't doing any better, 4 out of 4 errors. Can't reach those error codes right now.

Number four (187877) is doing slightly better, with only 1 error so far out of 4 units.
Error codes:
8-4-2006 10:52:59|rosetta@home|Unrecoverable error for result HBLR_1.0_1mky_426_753_0 ( - exit code -1073741819 (0xc0000005))


All boxen are running windows XP home or pro and Rosetta 4.97.
The differences I can make out on my four boxen:
SP1 generates fewer errors than SP2. (Box 4 is still on SP1)
Pentium generates fewer errors than AMD. (Box 4 is a P3 733MHz, other boxen are AMD XP 2500+ and AMD 64 3700+).
Important note: not statistically relevant data of course, need more people for that ;)
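
For anyone collecting these from several machines, the message log lines can be tallied mechanically instead of by eye. The sketch below assumes the pipe-separated log format pasted in this thread and ignores the timestamp, since the date format differs between machines. Treating the third underscore-separated field of an HBLR result name (for example `1mky`) as the target is a guess based on the names above, and the function names are made up for illustration.

```python
import re
from collections import Counter

# Pipe-separated message log line as pasted in this thread.
LOG_RE = re.compile(
    r"^[^|]*\|[^|]*\|Unrecoverable error for result "
    r"(?P<result>\S+) \( - exit code (?P<code>-?\d+) \(0x[0-9a-fA-F]+\)\)"
)

def parse_errors(lines):
    """Yield (result_name, exit_code) for each matching log line."""
    for line in lines:
        m = LOG_RE.match(line.strip())
        if m:
            yield m.group("result"), int(m.group("code"))

def tally_by_target(lines):
    """Count errors per target; the field position is an assumption."""
    counts = Counter()
    for result, _code in parse_errors(lines):
        fields = result.split("_")
        target = fields[2] if len(fields) > 2 else result
        counts[target] += 1
    return counts

SAMPLE = [
    "8-4-2006 5:56:24|rosetta@home|Unrecoverable error for result HBLR_1.0_1mky_425_5187_0 ( - exit code -1073741819 (0xc0000005))",
    "8-4-2006 6:03:32|rosetta@home|Unrecoverable error for result HBLR_1.0_2tif_425_7375_0 ( - exit code -1073741819 (0xc0000005))",
    "8-4-2006 7:02:35|rosetta@home|Unrecoverable error for result HBLR_1.0_1mky_425_9364_0 ( - exit code -1073741819 (0xc0000005))",
]

if __name__ == "__main__":
    for target, n in tally_by_target(SAMPLE).most_common():
        print(target, n)
```

The field-position guess does not hold for other naming schemes seen earlier in the thread, such as the HB_BARCODE results, so the grouping would need adjusting for those.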

Species8472
Joined: 7 Apr 06
Posts: 1
Credit: 55,732
RAC: 0
Message 13243 - Posted: 8 Apr 2006, 15:14:18 UTC
Last modified: 8 Apr 2006, 15:55:40 UTC

4.97 WU's have an 85% failure rate on my A64 X2 3800+, running at stock speed...
4.83 WU's finished fine.

Errors are all of the same type:

***UNHANDLED EXCEPTION****
Reason: Access Violation (0xc0000005) at address 0x007022EA read attempt to address 0x0704FFA0

Time to failure ranges from 10 seconds to 90 minutes per unit.
Buffalo Bill
Joined: 25 Mar 06
Posts: 71
Credit: 1,630,458
RAC: 0
Message 13244 - Posted: 8 Apr 2006, 15:39:13 UTC

Lots here too...

Errors: Desktop
08/04/2006 7:09:37 AM|rosetta@home|Unrecoverable error for result HBLR_1.0_1r69_426_2525_0 ( - exit code -1073741819 (0xc0000005))
08/04/2006 9:11:01 AM|rosetta@home|Unrecoverable error for result HBLR_1.0_1dcj_426_4094_0 ( - exit code -1073741819 (0xc0000005))
08/04/2006 9:12:53 AM|rosetta@home|Unrecoverable error for result HBLR_1.0_1r69_426_4262_0 ( - exit code -1073741819 (0xc0000005))

Errors: Laptop
4/8/2006 12:20:26 AM|rosetta@home|Unrecoverable error for result HBLR_1.0_1n0u_425_4428_0 ( - exit code -1073741819 (0xc0000005))
4/8/2006 12:22:28 AM|rosetta@home|Unrecoverable error for result HBLR_1.0_2tif_425_9497_0 ( - exit code -1073741819 (0xc0000005))
4/8/2006 12:30:01 AM|rosetta@home|Unrecoverable error for result HBLR_1.0_1n0u_426_963_0 ( - exit code -1073741819 (0xc0000005))
4/8/2006 4:57:17 AM|rosetta@home|Unrecoverable error for result HBLR_1.0_1ogw_426_1087_0 ( - exit code -1073741819 (0xc0000005))
4/8/2006 5:02:37 AM|rosetta@home|Unrecoverable error for result HBLR_1.0_1ogw_425_7374_1 ( - exit code -1073741819 (0xc0000005))
4/8/2006 5:14:59 AM|rosetta@home|Unrecoverable error for result HBLR_1.0_2tif_425_5274_1 ( - exit code -1073741819 (0xc0000005))
[DPC] C0w Crunch3rz
Joined: 11 Feb 06
Posts: 1
Credit: 286,166
RAC: 0
Message 13246 - Posted: 8 Apr 2006, 16:29:01 UTC

Is it (technically) possible to switch back to 4.83? It seems that these errors only occur in 4.97.
genes
Joined: 8 Oct 05
Posts: 60
Credit: 704,566
RAC: 0
Message 13247 - Posted: 8 Apr 2006, 16:54:51 UTC
Last modified: 8 Apr 2006, 16:56:25 UTC

I've gotten 8 errors with 4.97 over the last 2 days on several machines, and that's just with Rosetta! There's also Ralph, which is currently using 4.97, and I'm having errors there as well.

They are ALL 0xC0000005 errors (access violation). I could list them here, but there are already plenty to look at. Just checking in.

I think I have had only one finish without errors.
Jimi@0wned.org.uk
Joined: 10 Mar 06
Posts: 29
Credit: 335,252
RAC: 0
Message 13249 - Posted: 8 Apr 2006, 17:07:11 UTC

When the last 4.83 WU finishes, I'm pulling my boxes til this gets fixed. All the 4.97s have failed; some have failed elsewhere before, others have gone on to fail elsewhere. It's a show-stopper, whatever the change was.
Nite Owl
Joined: 2 Nov 05
Posts: 87
Credit: 3,019,449
RAC: 0
Message 13251 - Posted: 8 Apr 2006, 17:22:06 UTC
Last modified: 8 Apr 2006, 17:26:36 UTC

Somehow I get the feeling 4.97 is NOT ready for prime time.... 24 of 25 WU's failed running at Rosetta since about 5:20 EDT yesterday, and 15 of 16 running at Ralph since it was released...... Hopefully this can be resolved without having to wait until Monday..... Hopefully
Join the Teddies@WCG
Dimitris Hatzopoulos
Joined: 5 Jan 06
Posts: 336
Credit: 80,939
RAC: 0
Message 13253 - Posted: 8 Apr 2006, 17:42:49 UTC
Last modified: 8 Apr 2006, 18:26:13 UTC

Mostly failures with v4.97 here too.

PS: It'd been almost 2 months since I had errors on my machines, so it's probably a 4.97 thing...
Best UFO Resources
Wikipedia R@h
How-To: Join Distributed Computing projects that benefit humanity