Miscellaneous Work Unit Errors

Message boards : Number crunching : Miscellaneous Work Unit Errors

To post messages, you must log in.

Previous · 1 . . . 4 · 5 · 6 · 7 · 8 · 9 · 10 · Next

AuthorMessage
hugothehermit

Send message
Joined: 26 Sep 05
Posts: 238
Credit: 314,893
RAC: 0
Message 12972 - Posted: 3 Apr 2006, 7:42:55 UTC
Last modified: 3 Apr 2006, 8:07:00 UTC


Hi hugotheherit

This is one of your results. A win 98 right?

https://boinc.bakerlab.org/rosetta/result.php?resultid=15296368

Anders n


Yep, I have three machines running Rosetta@Home and my smoothwall router/internet sever etc... running seti cause it's a bit small for anything else :)

P4 3.0 Ghz, 1GB Ram Win XP Home SP2 (running Rosetta and Ralph, Ralphs out of work at the moment)
P4 1.0 Ghz, 256 Ram Win 98se (running Rosetta)
P3 933Mhz, 256 Ram Win 98se (running Rosetta)
P3 500Mhz, 128 Ram GNU/Linux (running smoothwall and seti@home)

Edit: GNU linux not Redhat :oops
ID: 12972 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
casio7131

Send message
Joined: 10 Oct 05
Posts: 35
Credit: 149,748
RAC: 0
Message 12979 - Posted: 3 Apr 2006, 13:48:31 UTC

3/04/2006 11:38:45 PM|rosetta@home|Unrecoverable error for result HB_BARCODE_30_4ubpA_351_49332_0 ( - exit code -1073741811 (0xc000000d))

resultid=15780509

ID: 12979 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Profile Charles Dennett
Avatar

Send message
Joined: 27 Sep 05
Posts: 102
Credit: 2,070,914
RAC: 0
Message 13004 - Posted: 3 Apr 2006, 19:23:12 UTC
Last modified: 3 Apr 2006, 19:24:02 UTC

Please check out the the thread:

https://boinc.bakerlab.org/rosetta/forum_thread.php?id=1323

There seems to be a new kind of problem that has popped up the past few days on older Win98* machines where the the workunits are not reporting the cpu time back to the core client.

I know these older machines do not meet the minimum specs of the project, but at least mine and those of another person who report the same problem have been working fine up to now.

Just wanted to make sure this was brought to the attention of the project leaders.

Charlie

-Charlie
ID: 13004 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Monitor-Man

Send message
Joined: 19 Dec 05
Posts: 4
Credit: 6,034,589
RAC: 0
Message 13040 - Posted: 4 Apr 2006, 10:18:14 UTC

win 98 machines running 4.83 now completes WU's with 0 time and no errors roports as sucess but no credit as cpu time shows zero. but they do go so far through and the time & work done does increment but seems to suddenly say 100% and report.

I have 2 of these machines too old and not enough memory to run XP, this may be related to 4.83 as it seems to be a recent problem.

Machine names

dick.workgroup
piii-1g.WorkGroup

Regards

Rich
ID: 13040 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Profile anders n

Send message
Joined: 19 Sep 05
Posts: 403
Credit: 537,991
RAC: 0
Message 13041 - Posted: 4 Apr 2006, 10:48:07 UTC

The first time I saw the 0 time problem on Win 98 was with
Rosetta 4.82 Windows.

Anders n
ID: 13041 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
David Baker
Volunteer moderator
Project administrator
Project developer
Project scientist

Send message
Joined: 17 Sep 05
Posts: 705
Credit: 559,847
RAC: 0
Message 13068 - Posted: 5 Apr 2006, 6:14:25 UTC - in response to Message 13004.  

Please check out the the thread:

https://boinc.bakerlab.org/rosetta/forum_thread.php?id=1323

There seems to be a new kind of problem that has popped up the past few days on older Win98* machines where the the workunits are not reporting the cpu time back to the core client.

I know these older machines do not meet the minimum specs of the project, but at least mine and those of another person who report the same problem have been working fine up to now.

Just wanted to make sure this was brought to the attention of the project leaders.

Charlie


Thanks. I asked Rom about this today, and he said that boinc had a special fix to deal with win98 lack of a timer function, and that his fix to the "leave in memory" problem might have messed up the boinc time keeper. he is looking into fixing it, but in the mean time all the results win98 computers are producing are getting properly collected and are helping us, and we will award credit for all of these jobs, so please bear with the problem for a bit longer.

ID: 13068 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
jomebrew

Send message
Joined: 31 Mar 06
Posts: 2
Credit: 25,914,516
RAC: 0
Message 13084 - Posted: 5 Apr 2006, 16:14:56 UTC

I have had this a three times on this machine since I started Rosetta 3/31. I get WIndowes XP dialog box that athe application errored. This is what is in the event log:

Faulting application rosetta_4.83_windows_intelx86.exe, version 0.0.0.0, faulting module rosetta_4.83_windows_intelx86.exe, version 0.0.0.0, fault address 0x004da3d4.

This has happened a few times on this machine. A P4 3ghz with WIndows XP Pro. It has also occurred on my AMD64 X2, but I do not know if the evnt log says the same thing.



ID: 13084 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Profile anders n

Send message
Joined: 19 Sep 05
Posts: 403
Credit: 537,991
RAC: 0
Message 13085 - Posted: 5 Apr 2006, 18:08:25 UTC

This WU https://boinc.bakerlab.org/rosetta/result.php?resultid=16041300

was aborted sins it did not count up the steps and did not regester the energi changes.

Anders n
ID: 13085 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Kevin

Send message
Joined: 15 Jan 06
Posts: 21
Credit: 109,496
RAC: 0
Message 13148 - Posted: 7 Apr 2006, 3:35:31 UTC - in response to Message 13085.  

This WU https://boinc.bakerlab.org/rosetta/result.php?resultid=16041300

was aborted sins it did not count up the steps and did not regester the energi changes.

Anders n



I just got one of these workunits and saw no energies or steps are being registered. How will this affect the workunit and what is returned to the server?
ID: 13148 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Lee Carre

Send message
Joined: 6 Oct 05
Posts: 96
Credit: 79,331
RAC: 0
Message 13191 - Posted: 7 Apr 2006, 21:54:39 UTC

not a processing error, but a download error, details in the problem downloading a WU - HTTP 416 thread
Want to search the BOINC Wiki, BOINCstats, or various BOINC forums from within firefox? Try the BOINC related Firefox Search Plugins
ID: 13191 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
casio7131

Send message
Joined: 10 Oct 05
Posts: 35
Credit: 149,748
RAC: 0
Message 13206 - Posted: 8 Apr 2006, 3:55:36 UTC

8/04/2006 1:44:51 PM|rosetta@home|Unrecoverable error for result HBLR_1.0_1hz6_420_4766_0 ( - exit code -1073741811 (0xc000000d))
resultid=16362541
according to the boinc manager, it was at 33.64% after 5h26min of cpu time.

note, i had the same error 5 days ago (from below on this thread):
Message 12979 - Posted 3 Apr 2006 13:48:31 UTC

3/04/2006 11:38:45 PM|rosetta@home|Unrecoverable error for result HB_BARCODE_30_4ubpA_351_49332_0 ( - exit code -1073741811 (0xc000000d))

resultid=15780509


my system is a dual p3 933, 768 mb ram, winxp sp2.
ID: 13206 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
casio7131

Send message
Joined: 10 Oct 05
Posts: 35
Credit: 149,748
RAC: 0
Message 13219 - Posted: 8 Apr 2006, 7:00:34 UTC

another HB workunit error:
8/04/2006 4:34:36 PM|rosetta@home|Unrecoverable error for result HBLR_1.0_1ogw_425_8951_0 ( - exit code -1073741819 (0xc0000005))
https://boinc.bakerlab.org/rosetta/result.php?resultid=16550140

i think i'll give up reporting any more of these HB errors for the time being (i've reported about 5, on both ralph and rosetta, over the past few hours).

ID: 13219 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Profile dgnuff
Avatar

Send message
Joined: 1 Nov 05
Posts: 350
Credit: 24,773,605
RAC: 0
Message 13223 - Posted: 8 Apr 2006, 10:35:08 UTC - in response to Message 13219.  

another HB workunit error:
8/04/2006 4:34:36 PM|rosetta@home|Unrecoverable error for result HBLR_1.0_1ogw_425_8951_0 ( - exit code -1073741819 (0xc0000005))
https://boinc.bakerlab.org/rosetta/result.php?resultid=16550140

i think i'll give up reporting any more of these HB errors for the time being (i've reported about 5, on both ralph and rosetta, over the past few hours).


Dunno if it's significant or not. I just had a machine error out an HLBR WU. What's interesting is that I've had a number of them run just fine in the last few days, all with the 4.83 client. This is the first one that failed, but it's also the first WU I've done with the new 4.97 client. This is in no way a statistically significant sample. Not yet. I'll keep an eye on this, since the same system is now merrily crunching away on another HLBR 4.97 WU.
ID: 13223 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Team_Elteor_Borislavj~BensPlace

Send message
Joined: 15 Mar 06
Posts: 1
Credit: 53,458
RAC: 0
Message 13225 - Posted: 8 Apr 2006, 11:09:14 UTC

same here
8-4-2006 12:39:49|rosetta@home|Unrecoverable error for result HBLR_1.0_1ogw_426_2393_0 ( - exit code -1073741819 (0xc0000005))

got 8 off those exits now.

all with 4.97

Ben
ID: 13225 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Profile RThigpen

Send message
Joined: 5 Apr 06
Posts: 1
Credit: 2,447
RAC: 0
Message 13227 - Posted: 8 Apr 2006, 12:33:32 UTC - in response to Message 13225.  

same here
8-4-2006 12:39:49|rosetta@home|Unrecoverable error for result HBLR_1.0_1ogw_426_2393_0 ( - exit code -1073741819 (0xc0000005))

got 8 off those exits now.

all with 4.97

Ben


Same here...I'm new to Rosetta@home (just started on the 5th) but I've had no problems until the switch to 4.97. I haven't had a successful WU since then.

Ray
ID: 13227 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Jimi@0wned.org.uk

Send message
Joined: 10 Mar 06
Posts: 29
Credit: 335,252
RAC: 0
Message 13228 - Posted: 8 Apr 2006, 12:35:10 UTC

Computer 190981

All WUs on 8th April ended with memory errors - this is my Crucial Ballistix 2GB memory kit starting to die, I'm afraid. Now offline for testing.
ID: 13228 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
biodoc

Send message
Joined: 19 Feb 06
Posts: 14
Credit: 30,298,003
RAC: 8,546
Message 13231 - Posted: 8 Apr 2006, 14:19:44 UTC
Last modified: 8 Apr 2006, 14:42:58 UTC

Since the upgrade to version 4.97, I've had most of the WU's failing with client errors on my 4 windoze boxes (42 failures & counting!). My linux box is OK. FYI, I had absolutely no errors with 4.83 since it was released. These are the errors I'm seeing:

***UNHANDLED EXCEPTION****
Reason: Access Violation (0xc0000005) at address 0x00599FF4 read attempt to address 0x06C2FFF0
***UNHANDLED EXCEPTION****
Reason: Access Violation (0xc0000005) at address 0x00599FF4 read attempt to address 0x06C2FC8C
***UNHANDLED EXCEPTION****
Reason: Access Violation (0xc0000005) at address 0x007022EA read attempt to address 0x06AAFF34
***UNHANDLED EXCEPTION****
Reason: Access Violation (0xc0000005) at address 0x00599FF4 read attempt to address 0x06CBFA98
***UNHANDLED EXCEPTION****
Reason: Access Violation (0xc0000005) at address 0x007022EA read attempt to address 0x0EB4FF90


ID: 13231 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Profile 426hemi
Avatar

Send message
Joined: 8 Apr 06
Posts: 1
Credit: 62,774
RAC: 0
Message 13234 - Posted: 8 Apr 2006, 14:45:02 UTC

I am new here and using version 4.97. I too have almost all my WU's failing with similar codes. ***unrecoverable error for result HBLR_1.0_2reb_426_1061_0 (-exit code -1073741819 (0xc0000005))***
ID: 13234 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Profile adrianxw
Avatar

Send message
Joined: 18 Sep 05
Posts: 652
Credit: 11,662,550
RAC: 1,276
Message 13237 - Posted: 8 Apr 2006, 14:52:32 UTC
Last modified: 8 Apr 2006, 15:23:02 UTC

I've had 5 wu's crash all running 4.97, some from each of my two machines, (one machine is Intel P-IV 2.533GHz Northwood running NT4, the other an Intel P-IV 3.2GHz Prescott running XP neither OC'd). All but one crashed with the same error diagnostic...

<core_client_version>5.2.6</core_client_version>
<message> - exit code -1073741819 (0xc0000005)
</message>
<stderr_txt>
# random seed: 1055990
# cpu_run_time_pref: 14400

***UNHANDLED EXCEPTION****
Reason: Access Violation (0xc0000005) at address 0x00599FF4 read attempt to address 0x0A51FFD8

LoadLibrary( "dbghelp.dll" ): GetLastError = 126

Dump of the Worker(offending) thread:
1: 04/08/06 13:03:07
1: SymGetLineFromAddr(): GetLastError = 126


Dump of the Timer thread:
2: 04/08/06 13:03:07


Dump of the Graphics thread:
3: 04/08/06 13:03:07


Exiting...

</stderr_txt>

... three of them crashed after a little over 300 seconds, the other at a little over 600 seconds.

16611251
16600930
16590438
16520360

The other had run for 5,668 seconds and gave this diagnostic...
<core_client_version>5.2.6</core_client_version>
<message> - exit code -1073741819 (0xc0000005)
</message>
<stderr_txt>
# random seed: 1387795
# cpu_run_time_pref: 7200

***UNHANDLED EXCEPTION****
Reason: Access Violation (0xc0000005) at address 0x007022EA read attempt to address 0x0A69FD64

LoadLibrary( "dbghelp.dll" ): GetLastError = 126

Dump of the Worker(offending) thread:
1: 04/08/06 11:33:10


Dump of the Timer thread:
2: 04/08/06 11:33:11


Dump of the Graphics thread:
3: 04/08/06 11:33:11


Exiting...

</stderr_txt>


During the same period, I have had 2 wu's complete normally.

The error, (126), is a "module not found" error.
Wave upon wave of demented avengers march cheerfully out of obscurity into the dream.
ID: 13237 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
David Baker
Volunteer moderator
Project administrator
Project developer
Project scientist

Send message
Joined: 17 Sep 05
Posts: 705
Credit: 559,847
RAC: 0
Message 13238 - Posted: 8 Apr 2006, 14:52:43 UTC - in response to Message 13234.  

I am new here and using version 4.97. I too have almost all my WU's failing with similar codes. ***unrecoverable error for result HBLR_1.0_2reb_426_1061_0 (-exit code -1073741819 (0xc0000005))***



I'm really sorry about these problems. I checked yesterday on RALPH and everything seemed fine, but there clearly is a problem. Unfortunately, I'm just leaving for a family weekend trip so can't figure things out right away. Please bear with us for a couple of days.
ID: 13238 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Previous · 1 . . . 4 · 5 · 6 · 7 · 8 · 9 · 10 · Next

Message boards : Number crunching : Miscellaneous Work Unit Errors



©2024 University of Washington
https://www.bakerlab.org