Message boards : Number crunching : Client Errors
Author | Message |
---|---|
Bok Joined: 17 Sep 05 Posts: 54 Credit: 3,514,973 RAC: 0 |
I got quite a few at around the same time on a new install on an AMD 3800+. Reset the project and they started working OK. I guess there was a bad batch out there? Bok |
Paul D. Buck Joined: 17 Sep 05 Posts: 815 Credit: 1,812,737 RAC: 0 |
I don't know ... :( I looked again today, and I have not had one yet and I have 60 some work units on the two computers. I guess I should count my blessings ... |
Jeff Joined: 21 Sep 05 Posts: 20 Credit: 380,889 RAC: 0 |
Hmmm... I have Rosetta running on 3 AMD3800x2 systems and 4 other various systems, and none of these kinds of errors yet. WinXP SP2 is running on all of them, and they are all running Rosetta exclusively in BOINC 24/7. Half of them also have the DIMES project running. Strange problem here for some folks... Jeff's Computer Farm |
FZB Joined: 17 Sep 05 Posts: 84 Credit: 4,948,999 RAC: 0 |
you don't get the error when you run it exclusive... it happens (on some systems) when switching from rosetta to something else (actually seems to happen after switching...) -- Florian www.domplatz1.de |
rbpeake Joined: 25 Sep 05 Posts: 168 Credit: 247,828 RAC: 0 |
"you don't get the error when you run it exclusive... it happens (on some systems) when switching from rosetta to something else (actually seems to happen after switching...)" That explains why I have had no problems. After hearing DB's plea for more computational power, I dedicated one machine to exclusively running R@home. :) Regards, Bob P. |
Jeff Joined: 21 Sep 05 Posts: 20 Credit: 380,889 RAC: 0 |
"you don't get the error when you run it exclusive... it happens (on some systems) when switching from rosetta to something else (actually seems to happen after switching...)" ;o) Makes sense then in my case. Jeff's Computer Farm |
JaRski-S60R Joined: 24 Sep 05 Posts: 4 Credit: 608,548 RAC: 0 |
"you don't get the error when you run it exclusive... it happens (on some systems) when switching from rosetta to something else (actually seems to happen after switching...)" But I often get an error when the WU is 100% done (using BOINC v4.72 and also other projects). So far none successful (9) :-( |
Neil Woodvine Joined: 16 Sep 05 Posts: 3 Credit: 30,708 RAC: 0 |
had something similar on my p4 3.2ghz ht box. noticed about 2 hours ago that one of the wu's had been stuck at 1% for 4 days and the other for a day and a half =/. i reset the project on the box just in case it was a bad batch of wu's, and it happily crunched away for 2 hours, then did the 4-day benchmark and errored the two wu's it had been running. I've only seen this problem on my ht box; all the "single" cpu's are suspending the wu and coming back fine. maybe it's a problem with suspending 2 running rosetta wu's at the same time? (ht / dual cpu's) |
UBT - Halifax--lad Joined: 17 Sep 05 Posts: 157 Credit: 2,687 RAC: 0 |
Going to give it one last go, and if it fails I will stop allowing new work until something is sorted out in the future. |
FZB Joined: 17 Sep 05 Posts: 84 Credit: 4,948,999 RAC: 0 |
"But I often get an error when the WU is 100% done (using BOINC v4.72 and also other projects). So far none successful (9) :-(" what was your exit code? might be a different error? the one i see while/after switching is 0xc0000005 -- Florian www.domplatz1.de |
JaRski-S60R Joined: 24 Sep 05 Posts: 4 Credit: 608,548 RAC: 0 |
"But I often get an error when the WU is 100% done (using BOINC v4.72 and also other projects). So far none successful (9) :-(" Sorry, I didn't write it down, and I have had to restart my PC since then, so the messages were cleared. I'll give an update soon: 1 WU is at 83.33% complete. I'm keeping my settings the same for the moment (so still allowing ALL projects to work), but I did switch to "leave in memory" (I have 1 GB of RAM), so let's just wait :-) |
JaRski-S60R Joined: 24 Sep 05 Posts: 4 Credit: 608,548 RAC: 0 |
mmm...that's funky :-S It resumed, but now it has been standing at the same % (83.33, that is) for over an hour or so :-( I'll keep you updated, because I noticed before that it eventually resumed with the earlier WU. Just waiting to see what the resulting error message will be. |
JaRski-S60R Joined: 24 Sep 05 Posts: 4 Credit: 608,548 RAC: 0 |
ROFL omg ... as soon as I wrote the last message I checked BOINC again, and it was running and showed 91.97% |
rbpeake Joined: 25 Sep 05 Posts: 168 Credit: 247,828 RAC: 0 |
"mmm...that's funky :-S" I had the same stuck percentage (or approximately). R@h was only using about 0.290 MB of RAM at the time, in other words next to nothing. So R@h seemed to be "lost in space" or something. I suspended the wu, started the next, unsuspended the wu (the program continued on with the new one), and turned in for the night. When I checked this morning, both wu's had been successfully completed. Very mysterious! It was as though the wu had lost its way in molecular space somewhere... but then found its way back again! Regards, Bob P. |
UBT - Halifax--lad Joined: 17 Sep 05 Posts: 157 Credit: 2,687 RAC: 0 |
I'm holding off on this project for now; just had another failed WU. 01/10/2005 16:03:48|rosetta@home|Unrecoverable error for result 1pvaA_abrelax_no_cst_21829_0 ( - exit code -1073741819 (0xc0000005)) Will wait for the problems to be sorted. |
Paul D. Buck Joined: 17 Sep 05 Posts: 815 Credit: 1,812,737 RAC: 0 |
Interestingly enough, I had one work unit with a client error. Issued 3 times: 2 client errors and one success ... more to be puzzled about ... |
JimB Joined: 17 Sep 05 Posts: 19 Credit: 228,111 RAC: 0 |
Almost the same here - issued 3 times, one success, one error, one still out. *Right now* I seem to get errors only every 5 days when benchmarks run, only on rosetta wu's. 2005-09-30 14:56:17 [---] Suspending computation and network activity - running CPU benchmarks 2005-09-30 14:56:17 [rosetta@home] Pausing result 1btn__abrelax_no_cst_14765_1 (removed from memory) 2005-09-30 14:56:17 [rosetta@home] Pausing result 1btn__abrelax_no_cst_13275_1 (removed from memory) 2005-09-30 14:56:18 [rosetta@home] Unrecoverable error for result 1btn__abrelax_no_cst_14765_1 ( - exit code -1073741819 (0xc0000005)) 2005-09-30 14:56:18 [rosetta@home] Unrecoverable error for result 1btn__abrelax_no_cst_13275_1 ( - exit code -1073741819 (0xc0000005)) 2005-09-30 14:56:18 [---] request_reschedule_cpus: process exited 2005-09-30 14:56:19 [---] Running CPU benchmarks "Be all that you can be...considering." Harold Green |
FZB Joined: 17 Sep 05 Posts: 84 Credit: 4,948,999 RAC: 0 |
"I've only seen this problem on my ht box; all the 'single' cpu's are suspending the wu and coming back fine. maybe it's a problem with suspending 2 running rosetta wu's at the same time? (ht / dual cpu's)" https://boinc.bakerlab.org/rosetta/workunit.php?wuid=103170 while i see this on my two boxes (both multi proc/core) when i don't have "leave in memory" on, the above wu i returned successful was returned by a pentium m with 1 cpu, so it does not seem to be a multi proc exclusive issue -- Florian www.domplatz1.de |
Paul D. Buck Joined: 17 Sep 05 Posts: 815 Credit: 1,812,737 RAC: 0 |
Ah Ha! Forgot to check the logs ... yes, mine was the x05 error, with the benchmarks noted as aborted as tasks are active. So ... Tentative conclusion: Rosetta@Home does not like to be stopped ... Ugh ... |
Joe Joined: 26 Sep 05 Posts: 3 Credit: 605,002 RAC: 2,085 |
I also get a client error every time the application is removed from memory to run benchmarks. |
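
For anyone wanting to check their own logs for the same pattern: the exit code quoted in several posts, -1073741819, is just 0xc0000005 read as a signed 32-bit integer, and on Windows that value is the status code for an access violation. Below is a minimal sketch in Python, not part of BOINC or Rosetta, that takes the log lines from JimB's post and lists the results that errored after being paused with "removed from memory". The regular expressions and the helper names `to_unsigned_hex` and `failures_after_removal` are assumptions made for illustration, based only on the log format shown in this thread.

```python
import re

# Sample lines copied from JimB's post in this thread.
LOG = """\
2005-09-30 14:56:17 [---] Suspending computation and network activity - running CPU benchmarks
2005-09-30 14:56:17 [rosetta@home] Pausing result 1btn__abrelax_no_cst_14765_1 (removed from memory)
2005-09-30 14:56:17 [rosetta@home] Pausing result 1btn__abrelax_no_cst_13275_1 (removed from memory)
2005-09-30 14:56:18 [rosetta@home] Unrecoverable error for result 1btn__abrelax_no_cst_14765_1 ( - exit code -1073741819 (0xc0000005))
2005-09-30 14:56:18 [rosetta@home] Unrecoverable error for result 1btn__abrelax_no_cst_13275_1 ( - exit code -1073741819 (0xc0000005))
"""

# Assumed patterns, inferred from the sample lines above (not an official log spec).
PAUSED = re.compile(r"Pausing result (\S+) \(removed from memory\)")
FAILED = re.compile(r"Unrecoverable error for result (\S+) \( - exit code (-?\d+)")


def to_unsigned_hex(exit_code):
    """Reinterpret a signed 32-bit exit code as unsigned and format it as hex."""
    return "0x{:08x}".format(exit_code & 0xFFFFFFFF)


def failures_after_removal(log_text):
    """Return (result_name, hex_code) for results that errored after removal from memory."""
    removed = set()
    hits = []
    for line in log_text.splitlines():
        paused = PAUSED.search(line)
        if paused:
            removed.add(paused.group(1))
            continue
        failed = FAILED.search(line)
        if failed and failed.group(1) in removed:
            hits.append((failed.group(1), to_unsigned_hex(int(failed.group(2)))))
    return hits


for name, code in failures_after_removal(LOG):
    print(name, code)
```

On the sample lines this lists both 1btn__ results with 0xc0000005, which matches what people are reporting here: the error shows up right after the work unit is removed from memory, not while it is running.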
©2024 University of Washington
https://www.bakerlab.org