Loads and loads of computing errors today

Message boards : Number crunching : Loads and loads of computing errors today

To post messages, you must log in.

1 · 2 · 3 · 4 . . . 5 · Next

AuthorMessage
Profile [BAT] tutta55
Avatar

Send message
Joined: 16 Sep 05
Posts: 59
Credit: 99,832
RAC: 0
Message 1587 - Posted: 21 Oct 2005, 22:21:00 UTC

On 2 of my machines I'm producing dozens of computing errors today. On the others I don't :-(

I doubt it has anything to do with the machines themselves, because for most of these work units others are producing errors too.

An example https://boinc.bakerlab.org/rosetta/workunit.php?wuid=342005

BOINC.BE: For Belgians who love the smell of glowing red cpu's in the morning
Tutta55's Lair
ID: 1587 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
[AF>france>pas-de-calais]symaski62

Send message
Joined: 19 Sep 05
Posts: 47
Credit: 33,871
RAC: 0
Message 1588 - Posted: 21 Oct 2005, 23:19:13 UTC - in response to Message 1587.  

On 2 of my machines I'm producing dozens of computing errors today. On the others I don't :-(

I doubt it has anything to do with the machines themselves, because for most of these work units others are producing errors too.

An example https://boinc.bakerlab.org/rosetta/workunit.php?wuid=342005


<core_client_version>4.19</core_client_version>
<message>Incorrect function. (0x1) - exit code 1 (0x1)
</message>
<active_task_state>1</active_task_state>
<signal>0</signal>
<stderr_txt>


ID: 1588 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Profile David E K
Volunteer moderator
Project administrator
Project developer
Project scientist

Send message
Joined: 1 Jul 05
Posts: 1018
Credit: 4,334,829
RAC: 0
Message 1589 - Posted: 21 Oct 2005, 23:46:42 UTC
Last modified: 21 Oct 2005, 23:52:43 UTC

Can you try the updated client? or at least 4.4+?

Also, see the boinc wiki page about this error.
ID: 1589 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Profile wdsmia
Avatar

Send message
Joined: 25 Sep 05
Posts: 3
Credit: 2,109,013
RAC: 0
Message 1590 - Posted: 22 Oct 2005, 0:11:47 UTC
Last modified: 22 Oct 2005, 0:17:30 UTC

I just started having computing errors on one box looks like the 4.78 app is not playing well with others or aleast with the 4.19 client.

<core_client_version>4.19</core_client_version>
<message>Incorrect function. (0x1) - exit code 1 (0x1)
</message>
<active_task_state>1</active_task_state>
<signal>0</signal>
<stderr_txt>

application version 4.78

ID: 1590 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Profile Fuzzy Hollynoodles
Avatar

Send message
Joined: 7 Oct 05
Posts: 234
Credit: 15,020
RAC: 0
Message 1593 - Posted: 22 Oct 2005, 0:42:48 UTC - in response to Message 1590.  

@wdsmia and others.

My 4.72 is working really fine and is really stable.

You can download it from this, where you can find all versions of BOINC.

The advantage of 4.72 in preference to 4.45 is that the time estimate is better, so it adjust itself after some WU's. This means that it downloads relatively correct instead of suddenly going in panic mode about some WU's which are estimated wrongly to take more time.


[b]"I'm trying to maintain a shred of dignity in this world." - Me[/b]

ID: 1593 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Profile wdsmia
Avatar

Send message
Joined: 25 Sep 05
Posts: 3
Credit: 2,109,013
RAC: 0
Message 1594 - Posted: 22 Oct 2005, 1:00:32 UTC

@Hollynoodles
looking at several results from different people its looking more like there may be a problem with the new 4.78 app Im finding the same errors on boxes running 4.19, 4.45 and 4.72.
The Windows/x86 4.78 app was released 21 Oct 2005 1:36:59 UTC If you are running with more than one days cache you may not have run one using the new app.
ID: 1594 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
mscharmack
Avatar

Send message
Joined: 29 Sep 05
Posts: 2
Credit: 11,323
RAC: 0
Message 1595 - Posted: 22 Oct 2005, 1:07:21 UTC

I just had 42 computational errors in a row, then two test WU's were run and I am currently into my next WU. It seems to be running okay??????
ID: 1595 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Profile wdsmia
Avatar

Send message
Joined: 25 Sep 05
Posts: 3
Credit: 2,109,013
RAC: 0
Message 1597 - Posted: 22 Oct 2005, 2:06:17 UTC

ok after investagating a bit more.
Its looking like most of these errors are on wu's with the name of xxxxx_abrelaxmode_random_gauss_xxxxx.
why some boxes can process them with no errors and others error out?
I havent a clue.
ID: 1597 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
AnRM
Avatar

Send message
Joined: 18 Sep 05
Posts: 123
Credit: 1,355,486
RAC: 0
Message 1601 - Posted: 22 Oct 2005, 6:35:23 UTC
Last modified: 22 Oct 2005, 6:41:37 UTC

I had three boxes sharing R@H and LHC@H and had to run BOINC ver.4.19 because LHC@H have not upgraded to 5.x yet.....R@H started giving computing errors when R@H ver.4.78 came along and, unfortunately, rattled off a whole herd of failed WU's on all three before I caught it. Soooo.... I had to upgrade to BOINC ver5.2.2 to keep R@H running and wave goodbye to LHC@H until they get their act together. Bottom line: ver. 4.78 won't run on BOINC ver.4.19. FYI, the other boxes are using BOINC ver.5.2.1 and are happy.
ID: 1601 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
KWSN_Dagger

Send message
Joined: 19 Oct 05
Posts: 5
Credit: 2,803
RAC: 0
Message 1604 - Posted: 22 Oct 2005, 6:54:54 UTC

I noticed that with these errors, it depends on what machine your running. ie if you run an AMD and it errored out, a person with a pentium would have success and vice versa.
ID: 1604 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Profile [BAT] tutta55
Avatar

Send message
Joined: 16 Sep 05
Posts: 59
Credit: 99,832
RAC: 0
Message 1605 - Posted: 22 Oct 2005, 8:17:24 UTC

Upgrading to 4.45 does not help. A message like this occurs for those same work units:

<core_client_version>4.45</core_client_version>
<message>CreateProcess() failed - The process cannot access the file because it is being used by another process. (0x20)
</message>

BOINC.BE: For Belgians who love the smell of glowing red cpu's in the morning
Tutta55's Lair
ID: 1605 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Ulrich Metzner
Avatar

Send message
Joined: 17 Sep 05
Posts: 22
Credit: 405,640
RAC: 0
Message 1613 - Posted: 22 Oct 2005, 13:17:06 UTC
Last modified: 22 Oct 2005, 13:36:27 UTC


greetz, Uli

ID: 1613 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Profile Webmaster Yoda
Avatar

Send message
Joined: 17 Sep 05
Posts: 161
Credit: 162,253
RAC: 0
Message 1615 - Posted: 22 Oct 2005, 13:58:16 UTC
Last modified: 22 Oct 2005, 14:26:09 UTC

Same here, both my AMD Athlon powered PCs have had a number of work units crash with oxC0000005 errors. I upgraded both to BOINC 5.2.2 to see if it made any difference, which has reduced the number of errors but not eliminated them.

My Pentium 4 (2.4, 2.8 and 3.4 GHz) machines have been fine.

EDIT: The Athlon 64 has been stable for the last 10 or so WU, but the Athlon XP is having more errors than successes and I do keep the WU in memory.
*** Join BOINC@Australia today ***
ID: 1615 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
AnRM
Avatar

Send message
Joined: 18 Sep 05
Posts: 123
Credit: 1,355,486
RAC: 0
Message 1616 - Posted: 22 Oct 2005, 14:02:29 UTC - in response to Message 1613.  
Last modified: 22 Oct 2005, 14:09:16 UTC

....Could it be related to some optimizations in the new executable 4.78? This error occurs on an Athlon TB 1400 without SSE support.[/quote]
>Most of my boxes are AMD ie. Durons, Semprons, Athlons and since upgrading to BOINC ver.5.x I have not had any problems with the new R@H ver.4.78. IMHO, the main heavy in the piece is the version of BOINC used.....Cheers, Rog.
ID: 1616 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Robert Nelson

Send message
Joined: 17 Sep 05
Posts: 4
Credit: 1,307,087
RAC: 1,520
Message 1617 - Posted: 22 Oct 2005, 14:20:59 UTC - in response to Message 1616.  

....Could it be related to some optimizations in the new executable 4.78? This error occurs on an Athlon TB 1400 without SSE support.

>Most of my boxes are AMD ie. Durons, Semprons, Athlons and since upgrading to BOINC ver.5.x I have not had any problems with the new R@H ver.4.78. IMHO, the main heavy in the piece is the version of BOINC used.....Cheers, Rog.[/quote]
Have not had any errors of the type mentioned, using BOINC 5.2.2 the units lately have been 4.78, windows boxs AMD and Intel processors. I do keep processs in memory. Note has been problem mentioned in many threads concerning not keeping rosetta in memory.
ID: 1617 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Ulrich Metzner
Avatar

Send message
Joined: 17 Sep 05
Posts: 22
Credit: 405,640
RAC: 0
Message 1618 - Posted: 22 Oct 2005, 14:25:56 UTC - in response to Message 1617.  

Have not had any errors of the type mentioned, using BOINC 5.2.2 the units lately have been 4.78, windows boxs AMD and Intel processors. I do keep processs in memory. Note has been problem mentioned in many threads concerning not keeping rosetta in memory.

I presume, you are talking about Athlon XP's? Because the Athon XP has SSE instructions. The older Athlon Thunderbirds don't have SSE instructions. That's my point...
greetz, Uli

ID: 1618 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
AnRM
Avatar

Send message
Joined: 18 Sep 05
Posts: 123
Credit: 1,355,486
RAC: 0
Message 1622 - Posted: 22 Oct 2005, 15:22:22 UTC - in response to Message 1618.  
Last modified: 22 Oct 2005, 15:32:42 UTC

.....I presume, you are talking about Athlon XP's? Because the Athon XP has SSE instructions. The older Athlon Thunderbirds don't have SSE instructions. That's my point...[/quote]

>Point well taken!.....my Athlon is a XP3000+ with 1Gb RAM and running WinXP. I have not had any problems since upgrading to BOINC 5.2.1. I also leave Rosetta in memory and run R@H and CPDN 90/10 with 120 min changeover. Your idea certainly seems intresting. I am not having any problems with my 1.8Gz Durons or 2.8GHz Semprons, howerver. I don't know if they have SSE instructions capability but I doubt it?? Hope this helps.....Cheers,Rog.
ID: 1622 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Profile kb7rzf
Avatar

Send message
Joined: 7 Oct 05
Posts: 16
Credit: 35,427
RAC: 0
Message 1629 - Posted: 22 Oct 2005, 20:08:07 UTC

I've done 2 of these WU's that have been talked about, and so far no problem, running 5.2.2, both WU's finished between 45 minutes and an hours worth of time. And did not error out. My computer is an Intel Celeron 2.6ghz with 512mb ram on WinXP home.

Jeremy

ID: 1629 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Ulrich Metzner
Avatar

Send message
Joined: 17 Sep 05
Posts: 22
Credit: 405,640
RAC: 0
Message 1633 - Posted: 23 Oct 2005, 1:05:27 UTC


greetz, Uli

ID: 1633 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Beezlebub
Avatar

Send message
Joined: 18 Oct 05
Posts: 40
Credit: 260,375
RAC: 0
Message 1634 - Posted: 23 Oct 2005, 5:03:07 UTC

I have 6 errors out of 55 WU returned 4 from a P4 3.4ghz w/HT and 2 from a AMD XP 2800+ 2.1 ghz. running Boinc 5.2.2 and none from my P4 2.0 ghz

Two were the old style WU(4.77) and the other 4 are the new WU(4.78).

This is from 18 oct. thru 22 oct.
e6600 quad @ 2.5ghz
2418 floating point
5227 integer

e6750 dual @ 3.71ghz
3598 floating point
7918 integer


ID: 1634 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
1 · 2 · 3 · 4 . . . 5 · Next

Message boards : Number crunching : Loads and loads of computing errors today



©2024 University of Washington
https://www.bakerlab.org