Rosetta Beta 6.00

Message boards : Number crunching : Rosetta Beta 6.00

[VENETO] boboviz
Joined: 1 Dec 05
Posts: 1914
Credit: 8,870,806
RAC: 10,755
Message 108697 - Posted: 15 Nov 2023, 8:42:52 UTC

All errors.
And, obviously, these work units didn't pass through Ralph@home...

[VENETO] boboviz
Joined: 1 Dec 05
Posts: 1914
Credit: 8,870,806
RAC: 10,755
Message 108747 - Posted: 6 Dec 2023, 5:51:41 UTC

So far, over 180 WUs (new7snme) without errors.
Well done, guys.

[VENETO] boboviz
Joined: 1 Dec 05
Posts: 1914
Credit: 8,870,806
RAC: 10,755
Message 108755 - Posted: 11 Dec 2023, 11:27:31 UTC

I think the 6.xx branch of the code does the same things as the 4.xx branch, plus more.
But it has been in beta since the beginning of April.
Why not abandon the 4.xx branch (whose code is over 3 years old)?

Gray Handcock
Joined: 26 Sep 05
Posts: 20
Credit: 2,018,415
RAC: 0
Message 108756 - Posted: 12 Dec 2023, 9:09:18 UTC

Hi

my computer specs:

Intel(R) Core(TM) i7-4790 CPU @ 3.60GHz [Family 6 Model 60 Stepping 3]
Number of processors 8
Operating System Debian GNU/Linux 12 (bookworm) [6.1.0-15-amd64|libc 2.36]
BOINC version 7.20.5
Memory 7833.52 MB

Rosetta v4.20 x86_64-pc-linux-gnu tasks are completing normally, as expected.
Rosetta Beta v6.05 x86_64-pc-linux-gnu tasks error out; every single one gives "Error while computing".

Is there any option to block the "beta 6*" series until this is sorted out?

thanks

Gray Handcock
Joined: 26 Sep 05
Posts: 20
Credit: 2,018,415
RAC: 0
Message 108758 - Posted: 12 Dec 2023, 13:41:52 UTC - in response to Message 108756.  

UPDATE: currently the errors are at 116 and counting.
Just for the record, this is a headless install of Debian stable with no overclocking applied to the hardware,
which has been set up just for Rosetta & WCG.

Sid Celery
Joined: 11 Feb 08
Posts: 2031
Credit: 39,945,516
RAC: 19,756
Message 108759 - Posted: 12 Dec 2023, 16:52:41 UTC - in response to Message 108756.  

Hi

my computer specs:

Intel(R) Core(TM) i7-4790 CPU @ 3.60GHz [Family 6 Model 60 Stepping 3]
Number of processors 8
Operating System Debian GNU/Linux 12 (bookworm) [6.1.0-15-amd64|libc 2.36]
BOINC version 7.20.5
Memory 7833.52 MB

Rosetta v4.20 x86_64-pc-linux-gnu are completing normally as expected
Rosetta Beta v6.05 x86_64-pc-linux-gnu errors out, every single one gives "Error while computing"

Is there any option to block the "beta 6*" series until this is sorted out ?

thanks

Sadly not.
On the plus side they're erroring out immediately, so no processing time is wasted - just bandwidth in downloading and returning them.
Tasks that error out go to other users so they're not wasted, though that's no consolation to you. Sorry

Jean-David Beyer
Joined: 2 Nov 05
Posts: 180
Credit: 5,932,567
RAC: 4,799
Message 108760 - Posted: 12 Dec 2023, 17:07:21 UTC - in response to Message 108758.  

I do not seem to be having problems with beta tasks.

State: All (111) · In progress (19) · Validation pending (0) · Validation inconclusive (0) · Valid (91) · Invalid (0) · Error (1)
Application: All (111) · Rosetta (9) · Rosetta Beta (102) · Rosetta Mini (0) · rosetta python projects (0) 


My machine is:

CPU type 	GenuineIntel
Intel(R) Xeon(R) W-2245 CPU @ 3.90GHz [Family 6 Model 85 Stepping 7]
Number of processors 	16
Operating System 	Linux Red Hat Enterprise Linux
Red Hat Enterprise Linux 8.9 (Ootpa) [4.18.0-513.9.1.el8_9.x86_64|libc 2.28]
BOINC version 	7.20.2
Memory 	128073.86 MB
Cache 	16896 KB
Swap space 	15992 MB
Total disk space 	488.04 GB
Free Disk Space 	480.6 GB
Measured floating point speed 	5955.12 million ops/sec
Measured integer speed 	24244.4 million ops/sec


And this is the most recently completed task:

Task 1540123327 · Work unit 1370471503 · Sent 10 Dec 2023, 18:01:22 UTC · Reported 12 Dec 2023, 16:30:58 UTC · Completed and validated · Run time 28,991.88 s · CPU time 28,785.55 s · Credit 488.23 · Rosetta Beta v6.05 x86_64-pc-linux-gnu


And this is the only one with an error:
Task 1538969660 · Work unit 1368095704 · Sent 8 Dec 2023, 4:25:57 UTC · Reported 8 Dec 2023, 4:56:41 UTC · Cancelled by server · Run time 0.00 s · CPU time 0.00 s · Credit --- · Rosetta Beta v6.05 x86_64-pc-linux-gnu


Bryn Mawr
Joined: 26 Dec 18
Posts: 380
Credit: 11,394,399
RAC: 9,962
Message 108761 - Posted: 12 Dec 2023, 19:47:06 UTC - in response to Message 108758.  

Hi

my computer specs:

Intel(R) Core(TM) i7-4790 CPU @ 3.60GHz [Family 6 Model 60 Stepping 3]
Number of processors 8
Operating System Debian GNU/Linux 12 (bookworm) [6.1.0-15-amd64|libc 2.36]
BOINC version 7.20.5
Memory 7833.52 MB

Rosetta v4.20 x86_64-pc-linux-gnu are completing normally as expected
Rosetta Beta v6.05 x86_64-pc-linux-gnu errors out, every single one gives "Error while computing"

Is there any option to block the "beta 6*" series until this is sorted out ?

thanks


UPDATE: currently the errors are at 116 and counting.
Just for the record, this is a headless install of Debian stable and no overclocking applied to the hardware,
which has been set up just for Rosetta & WCG.


What is the error that shows in the stderr output? Click on the workunit in the tasks link of your account to see it.

Grant (SSSF)
Joined: 28 Mar 20
Posts: 1556
Credit: 16,013,897
RAC: 17,537
Message 108763 - Posted: 13 Dec 2023, 5:19:51 UTC - in response to Message 108761.  

What is the error that shows in the stderror file - click on the workunit in the tasks link of your account.
Here's the output of one Task.

<core_client_version>7.20.5</core_client_version>
<![CDATA[
<message>
process exited with code 127 (0x7f, -129)</message>
<stderr_txt>
../../projects/boinc.bakerlab.org_rosetta/rosetta_beta_6.05_x86_64-pc-linux-gnu: error while loading shared libraries: libGL.so.1: cannot open shared object file: No such file or directory

</stderr_txt>
]]>

Installation/permissions issue?
Grant
Darwin NT
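
A loader message like the one above means the dynamic linker could not resolve a shared library before the program even started, which is why the task exits at once with code 127. As a rough sketch (not something the project provides), the standard `ldd` tool can list unresolved libraries for a binary, and a few lines of Python can pick out the "not found" entries. The function names and the sample `ldd` text below are illustrative assumptions; only the `libGL.so.1` name comes from the stderr output quoted here.

```python
import subprocess


def missing_libraries(ldd_output: str) -> list:
    """Return the library names that ldd reports as 'not found'."""
    missing = []
    for line in ldd_output.splitlines():
        # ldd prints unresolved entries as "<name> => not found"
        name, sep, target = line.strip().partition(" => ")
        if sep and target.strip() == "not found":
            missing.append(name)
    return missing


def check_binary(path: str) -> list:
    """Run ldd on a binary (Linux only) and list its unresolved libraries."""
    result = subprocess.run(
        ["ldd", path], capture_output=True, text=True, check=False
    )
    return missing_libraries(result.stdout)


# Hypothetical ldd output, written by hand for illustration only.
SAMPLE = """\
    linux-vdso.so.1 (0x00007ffd5a3f2000)
    libGL.so.1 => not found
    libc.so.6 => /lib/x86_64-linux-gnu/libc.so.6 (0x00007f1c2a400000)
"""

print(missing_libraries(SAMPLE))  # ['libGL.so.1']
```

On a real host, `check_binary()` would be pointed at the Rosetta Beta executable inside the BOINC project directory; an empty list would suggest the problem lies elsewhere.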

Bryn Mawr
Joined: 26 Dec 18
Posts: 380
Credit: 11,394,399
RAC: 9,962
Message 108764 - Posted: 13 Dec 2023, 9:49:28 UTC - in response to Message 108763.  

What is the error that shows in the stderror file - click on the workunit in the tasks link of your account.
Here's the output of one Task.

<core_client_version>7.20.5</core_client_version>
<![CDATA[
<message>
process exited with code 127 (0x7f, -129)</message>
<stderr_txt>
../../projects/boinc.bakerlab.org_rosetta/rosetta_beta_6.05_x86_64-pc-linux-gnu: error while loading shared libraries: libGL.so.1: cannot open shared object file: No such file or directory

</stderr_txt>
]]>

Installation/permissions issue?


Certainly looks like it.

Gray Handcock
Joined: 26 Sep 05
Posts: 20
Credit: 2,018,415
RAC: 0
Message 108765 - Posted: 13 Dec 2023, 10:35:52 UTC - in response to Message 108764.  

Hi - see below for the last 6.05 task, which obviously failed to proceed:

<core_client_version>7.20.5</core_client_version>
<![CDATA[
<message>
process exited with code 127 (0x7f, -129)</message>
<stderr_txt>
../../projects/boinc.bakerlab.org_rosetta/rosetta_beta_6.05_x86_64-pc-linux-gnu: error while loading shared libraries: libGL.so.1: cannot open shared object file: No such file or directory

</stderr_txt>
]]>

Hope it helps to fix this issue - the 4.20 WUs are all fine and validated, and the failed 6.05 WUs are at 118 now.

Thanks

Gray Handcock
Joined: 26 Sep 05
Posts: 20
Credit: 2,018,415
RAC: 0
Message 108766 - Posted: 13 Dec 2023, 10:45:36 UTC - in response to Message 108765.  

I am wondering if not having a GUI installed might be the problem - the box is accessed via ssh, using the command line only.

Gray Handcock
Joined: 26 Sep 05
Posts: 20
Credit: 2,018,415
RAC: 0
Message 108767 - Posted: 13 Dec 2023, 11:42:26 UTC - in response to Message 108766.  

Hi

I have just installed libgl1 (with a bunch of dependencies). I will allow more work and see if that fixes the problem

Thanks

Gray Handcock
Joined: 26 Sep 05
Posts: 20
Credit: 2,018,415
RAC: 0
Message 108769 - Posted: 14 Dec 2023, 10:54:14 UTC - in response to Message 108767.  
Last modified: 14 Dec 2023, 10:55:37 UTC

Hi

So, to update the info for anyone else with this problem:

Prior to the "fix" for Rosetta Beta v6.05 x86_64-pc-linux-gnu, all 118 of my 118 WUs errored out.

Error message:
<stderr_txt>
../../projects/boinc.bakerlab.org_rosetta/rosetta_beta_6.05_x86_64-pc-linux-gnu: error while loading shared libraries: libGL.so.1: cannot open shared object file: No such file or directory

</stderr_txt>

The fix for my problem would appear to be the installation of libgl1,
which also installed the following dependencies:

libxcb-present0, libxxf86vm1, libglx-mesa0, libglvnd0, libx11-xcb1,
libxshmfence1, libxcb-dri2-0, libxcb-dri3-0, libpciaccess0, libglx0,
libdrm-nouveau2, libllvm15, libz3-4, libgl1-mesa-dri, libdrm-common,
libxcb-glx0, libglapi-mesa, libdrm-amdgpu1, libdrm-radeon1, libdrm2,
libxcb-randr0, libxcb-shm0, libxcb-sync1, libdrm-intel1, libxfixes3,
libxcb-xfixes0

I received 11 WUs for Rosetta Beta v6.05 x86_64-pc-linux-gnu yesterday after allowing
more work. 6 of them have completed and been validated - the rest are in progress.

As a comment it would seem a bit strange to require what would appear to be
GUI-related files for CPU work - but the fix does seem to be functioning at this time, so I am happy :)

Thanks
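
For anyone who wants to spot this case automatically, here is a minimal sketch that pulls the missing library name out of a task's stderr text and looks up a package hint. The helper names are made up for illustration, and the hint table holds only the one pairing reported in this thread (`libGL.so.1` fixed by installing `libgl1` on Debian 12); other libraries or distributions would need their own entries.

```python
import re

# Matches the dynamic loader message seen in the failed tasks' stderr.
LOADER_ERROR = re.compile(
    r"error while loading shared libraries: (?P<lib>[^\s:]+): "
    r"cannot open shared object file"
)

# Only the pairing reported in this thread; extend at your own risk.
PACKAGE_HINTS = {"libGL.so.1": "libgl1"}


def missing_library(stderr_text: str):
    """Return the missing shared library named in stderr, or None."""
    match = LOADER_ERROR.search(stderr_text)
    return match.group("lib") if match else None


def package_hint(library: str):
    """Return the Debian package reported to provide the library, if known."""
    return PACKAGE_HINTS.get(library)


stderr_text = (
    "../../projects/boinc.bakerlab.org_rosetta/"
    "rosetta_beta_6.05_x86_64-pc-linux-gnu: error while loading shared "
    "libraries: libGL.so.1: cannot open shared object file: "
    "No such file or directory"
)

lib = missing_library(stderr_text)
print(lib, "->", package_hint(lib))  # libGL.so.1 -> libgl1
```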

Aurum
Joined: 12 Jul 17
Posts: 32
Credit: 38,158,977
RAC: 2
Message 108774 - Posted: 20 Dec 2023, 16:03:12 UTC - in response to Message 108310.  

I use an app_config file to run all my projects. I can set it to run just 1 beta WU; I have never tried setting max tasks to 0.
Right now I'm running Milky Way until Beta is more stable.

<max_concurrent>Zero</max_concurrent> = <max_concurrent>Infinity</max_concurrent>
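
To make the point concrete, here is a small sketch that writes an `app_config.xml` limiting one application to a single running task. It assumes the usual BOINC layout (`<app_config>` containing `<app>` with `<name>` and `<max_concurrent>`); the application name used below is a placeholder, not the real short name of Rosetta Beta, which would have to be looked up on your own client. Because a value of 0 is described above as meaning "no limit", the helper refuses anything below 1 rather than silently producing a file that limits nothing.

```python
import xml.etree.ElementTree as ET


def build_app_config(app_name: str, max_concurrent: int) -> str:
    """Build app_config.xml text limiting one app to N concurrent tasks."""
    if max_concurrent < 1:
        # Per the post above, 0 would not block the app, so reject it.
        raise ValueError("max_concurrent must be at least 1")
    root = ET.Element("app_config")
    app = ET.SubElement(root, "app")
    ET.SubElement(app, "name").text = app_name
    ET.SubElement(app, "max_concurrent").text = str(max_concurrent)
    ET.indent(root)  # pretty-print; requires Python 3.9 or newer
    return ET.tostring(root, encoding="unicode")


# "placeholder_app_name" is hypothetical; substitute the app's short name.
print(build_app_config("placeholder_app_name", 1))
```

The resulting text would normally be saved as `app_config.xml` in the project's folder under the BOINC data directory, after which the client has to re-read its config files.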

Jean-David Beyer
Joined: 2 Nov 05
Posts: 180
Credit: 5,932,567
RAC: 4,799
Message 108775 - Posted: 20 Dec 2023, 16:56:21 UTC - in response to Message 108769.  

I have a bunch of beta 6.05 tasks running. And a bunch earlier have also run.
They have one to five hours on them.

State: All (107) · In progress (20) · Validation pending (0) · Validation inconclusive (0) · Valid (87) · Invalid (0) · Error (0)
Application: All (107) · Rosetta (39) · Rosetta Beta (68) · Rosetta Mini (0) · rosetta python projects (0)


mikey
Joined: 5 Jan 06
Posts: 1894
Credit: 8,866,740
RAC: 1,585
Message 108777 - Posted: 22 Dec 2023, 1:52:19 UTC - in response to Message 108774.  

I use an app_config file to run all my projects. I can set it to run just 1 beta WU; I have never tried setting max tasks to 0.
Right now I'm running Milky Way until Beta is more stable.

<max_concurrent>Zero</max_concurrent> = <max_concurrent>Infinity</max_concurrent>


I don't know what your settings are, but I'm getting nearly 100% valid tasks here on both my Windows and Linux PCs.

State: All (2097) · In progress (576) · Validation pending (0) · Validation inconclusive (0) · Valid (1514) · Invalid (0) · Error (7)
Application: All (2097) · Rosetta (102) · Rosetta Beta (1995) · Rosetta Mini (0) · rosetta python projects (0)
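
As a quick check of the "nearly 100%" figure, the counts in the State line can be parsed and turned into a success rate over finished tasks (valid, invalid and error), ignoring those still in progress. This is only a sketch; the function names are invented here, and the input is the State line pasted above.

```python
import re


def parse_state_counts(line: str) -> dict:
    """Parse 'State: All (N) · In progress (M) · ...' into a name->count dict."""
    _, _, body = line.partition(":")
    counts = {}
    for part in body.split("·"):
        match = re.fullmatch(r"\s*(.+?)\s*\((\d+)\)\s*", part)
        if match:
            counts[match.group(1)] = int(match.group(2))
    return counts


def valid_fraction(counts: dict) -> float:
    """Fraction of finished tasks (valid + invalid + error) that were valid."""
    finished = counts["Valid"] + counts["Invalid"] + counts["Error"]
    return counts["Valid"] / finished if finished else float("nan")


state_line = (
    "State: All (2097) · In progress (576) · Validation pending (0) · "
    "Validation inconclusive (0) · Valid (1514) · Invalid (0) · Error (7)"
)

counts = parse_state_counts(state_line)
print(f"{valid_fraction(counts):.2%}")  # 99.54%
```

With 1514 valid and 7 errored tasks, that is 1514 / 1521, or about 99.5% of finished tasks.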

Jean-David Beyer
Joined: 2 Nov 05
Posts: 180
Credit: 5,932,567
RAC: 4,799
Message 108778 - Posted: 22 Dec 2023, 6:27:31 UTC - in response to Message 108777.  

I don't know what your settings are, but I'm getting nearly 100% valid tasks here on both my Windows and Linux PCs.

State: All (2097) · In progress (576) · Validation pending (0) · Validation inconclusive (0) · Valid (1514) · Invalid (0) · Error (7)
Application: All (2097) · Rosetta (102) · Rosetta Beta (1995) · Rosetta Mini (0) · rosetta python projects (0)


I get the same kind of thing. I get no errors on my fast Linux machine, but I got 5 on my slower Windows 10 machine.

State: All (171) · In progress (37) · Validation pending (0) · Validation inconclusive (0) · Valid (129) · Invalid (0) · Error (5)
Application: All (171) · Rosetta (50) · Rosetta Beta (121) · Rosetta Mini (0) · rosetta python projects (0) 


The Ancient One
Joined: 4 Oct 05
Posts: 11
Credit: 751,429
RAC: 1
Message 108779 - Posted: 24 Dec 2023, 16:11:53 UTC

Hi, Microsoft Windows reported the following error:

rosetta_beta_6.04_windows_x86_64.exe

Description
Faulting Application Path: C:\ProgramData\BOINC\projects\boinc.bakerlab.org_rosetta\rosetta_beta_6.04_windows_x86_64.exe
Creation Time: 24/12/2023 12:57:52
Problem: Stopped working
Status: Report sent

Problem signature
Problem Event Name: APPCRASH
Application Name: rosetta_beta_6.04_windows_x86_64.exe
Application Version: 0.0.0.0
Application Timestamp: 650a8b67
Fault Module Name: StackHash_0000
Fault Module Version: 10.0.19041.3636
Fault Module Timestamp: 9b64aa6f
Exception Code: c0000374
Exception Offset: PCH_84

Extra information about the problem
Bucket ID: 94f0a8d188c87f8f34609c10c9d5462c (1468345074442389036)
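
For reading reports like this one: the Exception Code field is a Windows NTSTATUS value, and c0000374 is STATUS_HEAP_CORRUPTION, which points at memory corruption inside the process rather than a missing file. A minimal lookup sketch follows; the table covers only three well-known codes and the function name is invented for illustration.

```python
# A few well-known NTSTATUS exception codes; far from a complete list.
NTSTATUS_NAMES = {
    0xC0000005: "STATUS_ACCESS_VIOLATION",
    0xC00000FD: "STATUS_STACK_OVERFLOW",
    0xC0000374: "STATUS_HEAP_CORRUPTION",
}


def describe_exception_code(code_hex: str) -> str:
    """Translate a hex exception code from a crash report into a name."""
    code = int(code_hex, 16)  # accepts 'c0000374' as well as '0xC0000374'
    return NTSTATUS_NAMES.get(code, f"unknown (0x{code:08X})")


print(describe_exception_code("c0000374"))  # STATUS_HEAP_CORRUPTION
```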

Aurum
Joined: 12 Jul 17
Posts: 32
Credit: 38,158,977
RAC: 2
Message 108781 - Posted: 26 Dec 2023, 13:11:40 UTC - in response to Message 108777.  

I use an app_config file to run all my projects. I can set it to run just 1 beta WU; I have never tried setting max tasks to 0.
Right now I'm running Milky Way until Beta is more stable.

<max_concurrent>Zero</max_concurrent> = <max_concurrent>Infinity</max_concurrent>


I don't know what your settings are, but I'm getting nearly 100% valid tasks here on both my Windows and Linux PCs.
Me too. Send more betas.
My comment was meant to say that the syntax <max_concurrent>0</max_concurrent> is meaningless to BOINC.



©2024 University of Washington
https://www.bakerlab.org