Problems and Technical Issues with Rosetta@home

Ross Parlette

Joined: 10 Nov 05
Posts: 32
Credit: 2,165,044
RAC: 0
Message 102348 - Posted: 7 Aug 2021, 18:52:55 UTC

Are we out of work units? I haven't gotten any in a while.
ID: 102348 · Rating: 0
robertmiles
Joined: 16 Jun 08
Posts: 1233
Credit: 14,283,319
RAC: 1,049
Message 102349 - Posted: 7 Aug 2021, 19:00:05 UTC - in response to Message 102348.  

Are we out of work units? I haven't gotten any in a while.

I have noticed that completing a GPU task tends to restrict your computer from downloading any CPU tasks for a few days.
ID: 102349 · Rating: 0
Falconet
Joined: 9 Mar 09
Posts: 353
Credit: 1,227,479
RAC: 617
Message 102350 - Posted: 7 Aug 2021, 19:19:59 UTC - in response to Message 102348.  

Yes, work ran out a couple of days ago.
ID: 102350 · Rating: 0
Grant (SSSF)
Joined: 28 Mar 20
Posts: 1686
Credit: 18,014,297
RAC: 23,705
Message 102353 - Posted: 7 Aug 2021, 21:36:14 UTC
Last modified: 7 Aug 2021, 21:44:17 UTC

Latest batch of work units, gb10_3CL_3CL_AVLstub_reversed_
Looking at around a 60% or higher error rate. Crash & burn within seconds of starting.

e.g.
gb10_3CL_3CL_AVLstub_reversed_renumbered_293_002400_extract_A_SAVE_ALL_OUT_1728505_321_0


              Outcome Computation error
         Client state Compute error
          Exit status -1073741819 (0xC0000005) STATUS_ACCESS_VIOLATION
          Computer ID 3933928
             Run time 7 sec
             CPU time 1 sec
       Validate state Invalid
               Credit 0.00
    Device peak FLOPS 4.88 GFLOPS
  Application version Rosetta v4.20 windows_x86_64
Peak working set size 42.54 MB
       Peak swap size 16.57 MB
      Peak disk usage 0.01 MB
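
A note on the exit status: the task summary shows -1073741819, while the stderr output for the same task reports 3221225477. Both are the same 32-bit value, 0xC0000005 (STATUS_ACCESS_VIOLATION), read once as a signed and once as an unsigned integer. A minimal Python sketch of the conversion (the helper name `to_unsigned32` is made up for illustration):

```python
def to_unsigned32(code: int) -> int:
    """Reinterpret a possibly negative 32-bit exit code as unsigned."""
    return code & 0xFFFFFFFF


signed_status = -1073741819          # value shown in the task summary
unsigned_status = to_unsigned32(signed_status)

# Prints the unsigned form and its hexadecimal representation
print(unsigned_status, hex(unsigned_status))
```

Masking with 0xFFFFFFFF keeps only the low 32 bits, which is why the negative number and the large positive one map to the same hexadecimal code.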




[pre]Stderr output
<core_client_version>7.16.11</core_client_version>
<![CDATA[
<message>
(unknown error) - exit code 3221225477 (0xc0000005)</message>
<stderr_txt>
command: projects/boinc.bakerlab.org_rosetta/rosetta_4.20_windows_x86_64.exe @gb10_3CL_3CL_AVLstub_reversed_renumbered_293_002400_extract_A.flags -nstruct 10000 -cpu_run_time 28800 -boinc:max_nstruct 20000 -checkpoint_interval 120 -mute all -database minirosetta_database -in::file::zip minirosetta_database.zip -boinc::watchdog -boinc::cpu_run_timeout 36000 -run::rng mt19937 -constant_seed -jran 3657876
Using database: database_357d5d93529_n_methylminirosetta_database


Unhandled Exception Detected...

- Unhandled Exception Record -
Reason: Access Violation (0xc0000005) at address 0x00007FF64E658698

Engaging BOINC Windows Runtime Debugger...



********************


BOINC Windows Runtime Debugger Version 7.9.0


Dump Timestamp : 08/08/21 06:55:47
Install Directory : C:\Program Files\BOINC
Data Directory : C:\ProgramData\BOINC
Project Symstore : https://boinc.bakerlab.org/rosetta/symstore
LoadLibraryA( C:\ProgramData\BOINC\dbghelp.dll ): GetLastError = 126
Loaded Library : dbghelp.dll
LoadLibraryA( C:\ProgramData\BOINC\symsrv.dll ): GetLastError = 126
LoadLibraryA( symsrv.dll ): GetLastError = 126
LoadLibraryA( C:\ProgramData\BOINC\srcsrv.dll ): GetLastError = 126
LoadLibraryA( srcsrv.dll ): GetLastError = 126
LoadLibraryA( C:\ProgramData\BOINC\version.dll ): GetLastError = 126
Loaded Library : version.dll
Debugger Engine : 4.0.5.0
Symbol Search Path: C:\ProgramData\BOINC\slots\9;C:\ProgramData\BOINC\projects\boinc.bakerlab.org_rosetta;srv*C:\ProgramData\BOINC\projects\boinc.bakerlab.org_rosetta\symbols*http://msdl.microsoft.com/download/symbols;srv*C:\ProgramData\BOINC\projects\boinc.bakerlab.org_rosetta\symbols*https://boinc.bakerlab.org/rosetta/symstore


ModLoad: 000000004aaa0000 00000000057ef000 C:\ProgramData\BOINC\projects\boinc.bakerlab.org_rosetta\rosetta_4.20_windows_x86_64.exe (-exported- Symbols Loaded)
Linked PDB Filename : C:cygwin64homeboinc4.17RosettamainsourceideVisualStudiox64BoincReleaserosetta_4.20_windows_x86_64.pdb

ModLoad: 00000000bff30000 00000000001f5000 C:\WINDOWS\SYSTEM32\ntdll.dll (6.2.19041.1081) (-exported- Symbols Loaded)
Linked PDB Filename : ntdll.pdb
File Version : 10.0.19041.1023 (WinBuild.160101.0800)
Company Name : Microsoft Corporation
Product Name : Microsoft® Windows® Operating System
Product Version : 10.0.19041.1023

ModLoad: 00000000bf370000 00000000000bd000 C:\WINDOWS\System32\KERNEL32.DLL (6.2.19041.1023) (-exported- Symbols Loaded)
Linked PDB Filename : kernel32.pdb
File Version : 10.0.19041.1023 (WinBuild.160101.0800)
Company Name : Microsoft Corporation
Product Name : Microsoft® Windows® Operating System
Product Version : 10.0.19041.1023

ModLoad: 00000000bd670000 00000000002c9000 C:\WINDOWS\System32\KERNELBASE.dll (6.2.19041.1081) (-exported- Symbols Loaded)
Linked PDB Filename : kernelbase.pdb
File Version : 10.0.19041.1023 (WinBuild.160101.0800)
Company Name : Microsoft Corporation
Product Name : Microsoft® Windows® Operating System
Product Version : 10.0.19041.1023

ModLoad: 00000000bf220000 000000000006b000 C:\WINDOWS\System32\WS2_32.dll (6.2.19041.546) (-exported- Symbols Loaded)
Linked PDB Filename : ws2_32.pdb
File Version : 10.0.19041.1081 (WinBuild.160101.0800)
Company Name : Microsoft Corporation
Product Name : Microsoft® Windows® Operating System
Product Version : 10.0.19041.1081

ModLoad: 00000000bede0000 000000000012a000 C:\WINDOWS\System32\RPCRT4.dll (6.2.19041.1081) (-exported- Symbols Loaded)
Linked PDB Filename : rpcrt4.pdb
File Version : 10.0.19041.1 (WinBuild.160101.0800)
Company Name : Microsoft Corporation
Product Name : Microsoft® Windows® Operating System
Product Version : 10.0.19041.1

ModLoad: 00000000be840000 00000000001a0000 C:\WINDOWS\System32\USER32.dll (6.2.19041.906) (-exported- Symbols Loaded)
Linked PDB Filename : user32.pdb
File Version : 10.0.19038.1 (WinBuild.160101.0800)
Company Name : Microsoft Corporation
Product Name : Microsoft® Windows® Operating System
Product Version : 10.0.19038.1

ModLoad: 00000000bdbb0000 0000000000022000 C:\WINDOWS\System32\win32u.dll (6.2.19041.1081) (-exported- Symbols Loaded)
Linked PDB Filename : win32u.pdb
File Version : 10.0.19041.1081 (WinBuild.160101.0800)
Company Name : Microsoft Corporation
Product Name : Microsoft® Windows® Operating System
Product Version : 10.0.19041.1081

ModLoad: 00000000bf980000 000000000002a000 C:\WINDOWS\System32\GDI32.dll (6.2.19041.746) (-exported- Symbols Loaded)
Linked PDB Filename : gdi32.pdb
File Version : 10.0.19041.746 (WinBuild.160101.0800)
Company Name : Microsoft Corporation
Product Name : Microsoft® Windows® Operating System
Product Version : 10.0.19041.746

ModLoad: 00000000bde90000 000000000010b000 C:\WINDOWS\System32\gdi32full.dll (6.2.19041.928) (-exported- Symbols Loaded)
Linked PDB Filename : gdi32full.pdb
File Version : 10.0.19041.928 (WinBuild.160101.0800)
Company Name : Microsoft Corporation
Product Name : Microsoft® Windows® Operating System
Product Version : 10.0.19041.928

ModLoad: 00000000bddf0000 000000000009d000 C:\WINDOWS\System32\msvcp_win.dll (6.2.19041.789) (-exported- Symbols Loaded)
Linked PDB Filename : msvcp_win.pdb
File Version : 10.0.19041.789 (WinBuild.160101.0800)
Company Name : Microsoft Corporation
Product Name : Microsoft® Windows® Operating System
Product Version : 10.0.19041.789

ModLoad: 00000000bdcf0000 0000000000100000 C:\WINDOWS\System32\ucrtbase.dll (6.2.19041.789) (-exported- Symbols Loaded)
Linked PDB Filename : ucrtbase.pdb
File Version : 10.0.19041.789 (WinBuild.160101.0800)
Company Name : Microsoft Corporation
Product Name : Microsoft® Windows® Operating System
Product Version : 10.0.19041.789

ModLoad: 00000000bf040000 00000000000ac000 C:\WINDOWS\System32\ADVAPI32.dll (6.2.19041.1052) (-exported- Symbols Loaded)
Linked PDB Filename : advapi32.pdb
File Version : 10.0.19041.1 (WinBuild.160101.0800)
Company Name : Microsoft Corporation
Product Name : Microsoft® Windows® Operating System
Product Version : 10.0.19041.1

ModLoad: 00000000bed40000 000000000009e000 C:\WINDOWS\System32\msvcrt.dll (7.0.19041.546) (-exported- Symbols Loaded)
Linked PDB Filename : msvcrt.pdb
File Version : 7.0.19041.546 (WinBuild.160101.0800)
Company Name : Microsoft Corporation
Product Name : Microsoft® Windows® Operating System
Product Version : 7.0.19041.546

ModLoad: 00000000bf7b0000 000000000009b000 C:\WINDOWS\System32\sechost.dll (6.2.19041.906) (-exported- Symbols Loaded)
Linked PDB Filename : sechost.pdb
File Version : 10.0.19041.1 (WinBuild.160101.0800)
Company Name : Microsoft Corporation
Product Name : Microsoft® Windows® Operating System
Product Version : 10.0.19041.1

ModLoad: 00000000bf160000 0000000000030000 C:\WINDOWS\System32\IMM32.DLL (6.2.19041.546) (-exported- Symbols Loaded)
Linked PDB Filename : imm32.pdb
File Version : 10.0.19041.546 (WinBuild.160101.0800)
Company Name : Microsoft Corporation
Product Name : Microsoft® Windows® Operating System
Product Version : 10.0.19041.546

ModLoad: 00000000bb5c0000 0000000000012000 C:\WINDOWS\SYSTEM32\kernel.appcore.dll (6.2.19041.546) (-exported- Symbols Loaded)
Linked PDB Filename : Kernel.Appcore.pdb
File Version : 10.0.19041.546 (WinBuild.160101.0800)
Company Name : Microsoft Corporation
Product Name : Microsoft® Windows® Operating System
Product Version : 10.0.19041.546

ModLoad: 00000000bc3a0000 0000000000033000 C:\WINDOWS\SYSTEM32\ntmarta.dll (6.2.19041.546) (-exported- Symbols Loaded)
Linked PDB Filename : ntmarta.pdb
File Version : 10.0.19041.1 (WinBuild.160101.0800)
Company Name : Microsoft Corporation
Product Name : Microsoft® Windows® Operating System
Product Version : 10.0.19041.1

ModLoad: 00000000b8100000 00000000001e4000 C:\WINDOWS\SYSTEM32\dbghelp.dll (6.2.19041.867) (-exported- Symbols Loaded)
Linked PDB Filename : dbghelp.pdb
File Version : 10.0.19041.867 (WinBuild.160101.0800)
Company Name : Microsoft Corporation
Product Name : Microsoft® Windows® Operating System
Product Version : 10.0.19041.867

ModLoad: 00000000b8c20000 000000000000a000 C:\WINDOWS\SYSTEM32\version.dll (6.2.19041.546) (-exported- Symbols Loaded)
Linked PDB Filename : version.pdb
File Version : 10.0.19041.546 (WinBuild.160101.0800)
Company Name : Microsoft Corporation
Product Name : Microsoft® Windows® Operating System
Product Version : 10.0.19041.546

ModLoad: 00000000bdaa0000 0000000000083000 C:\WINDOWS\System32\bcryptPrimitives.dll (6.2.19041.1023) (-exported- Symbols Loaded)
Linked PDB Filename : bcryptprimitives.pdb
File Version : 10.0.19041.1023 (WinBuild.160101.0800)
Company Name : Microsoft Corporation
Product Name : Microsoft® Windows® Operating System
Product Version : 10.0.19041.1023



*** Dump of the Process Statistics: ***

- I/O Operations Counters -
Read: 5000, Write: 656, Other 13726

- I/O Transfers Counters -
Read: 14493717, Write: 11103, Other 6808

- Paged Pool Usage -
QuotaPagedPoolUsage: 317096, QuotaPeakPagedPoolUsage: 317376
QuotaNonPagedPoolUsage: 7200, QuotaPeakNonPagedPoolUsage: 7352

- Virtual Memory Usage -
VirtualSize: 83091456, PeakVirtualSize: 895533056

- Pagefile Usage -
PagefileUsage: 83091456, PeakPagefileUsage: 83091456

- Working Set Size -
WorkingSetSize: 109625344, PeakWorkingSetSize: 109629440, PageFaultCount: 27283

*** Dump of thread ID 8252 (state: Initialized): ***

- Information -
Status: Base Priority: Normal, Priority: Normal, , Kernel Time: 0.000000, User Time: 0.000000, Wait Time: 0.000000

- Unhandled Exception Record -
Reason: Access Violation (0xc0000005) at address 0x00007FF64E658698

- Registers -
rax=000000000000003a rbx=00000000c78e41b0 rcx=00000000c82eeac0 rdx=00000000c83cebf8 rsi=000000000000000b rdi=00000000c82eeac0
r8=000000000000003a r9=0000000000000421 r10=000000004e646e80 r11=0000000042145580 r12=000000004aaa0000 r13=000000004215fcc0
r14=0000000042145cc0 r15=000000000048b215 rip=000000004e658698 rsp=00000000421455f8 rbp=0000000000000000
cs=0033 ss=002b ds=002b es=002b fs=0053 gs=002b efl=00010202

- Callstack -
ChildEBP RetAddr Args to Child
421455f0 4af7831c 00000000 4e646d60 4e646e80 421455d8 rosetta_4.20_windows_x86_64!xmlValidateNotationDecl+0x0
42145620 4af3935d c78e41b0 421456c0 4af2b215 00000000 rosetta_4.20_windows_x86_64!xmlParserInputRead+0x0
42145650 4e0a7f10 4ef90150 4215fcc0 00000000 00000001 rosetta_4.20_windows_x86_64!xmlParserInputRead+0x0
42145680 4af239e8 5003a32c 4aaa0000 42145770 bff60e7b rosetta_4.20_windows_x86_64!xmlValidateNotationDecl+0x0
421456f0 bffd217f 00000000 42145c70 42146330 00000000 rosetta_4.20_windows_x86_64!xmlParserInputRead+0x0
42145720 bff81454 00000000 42145c70 42146330 00000000 ntdll!__chkstk+0x0
42145e30 bffd0cae 00000000 00000030 4e71a450 00000008 ntdll!RtlRaiseException+0x0
421465e0 4b1e3e2b fffffffe cbe393c8 ffffffff 4b1f18c5 ntdll!KiUserExceptionDispatcher+0x0
42146630 4b1f3690 4e71a3a0 cbe39120 4e71a3a0 42146729 rosetta_4.20_windows_x86_64!cppdb::session::is_open+0x0
42146760 4b309ee8 cb568798 cbcc5df0 cbe39120 cbcc5df0 rosetta_4.20_windows_x86_64!cppdb::session::is_open+0x0
42147310 4b2a4b6c cc019fe0 bff5b3c7 cd830000 00000000 rosetta_4.20_windows_x86_64!cppdb::backend::statement::cache+0x0
42147510 4b2a488e 421475f8 00000000 421477e0 00000000 rosetta_4.20_windows_x86_64!cppdb::backend::statement::cache+0x0
42147670 4b203da1 421477e8 00000000 c78e3b70 421478b0 rosetta_4.20_windows_x86_64!cppdb::backend::statement::cache+0x0
42147a30 4b209f08 42147d80 42147d80 42147d80 00000000 rosetta_4.20_windows_x86_64!cppdb::backend::statement::cache+0x0
42148080 4b2084db c7dc3d00 421480e0 c7e73880 c7e73880 rosetta_4.20_windows_x86_64!cppdb::backend::statement::cache+0x0
421481e0 4b171fb7 00000000 421482f0 c7e73880 421484f0 rosetta_4.20_windows_x86_64!cppdb::backend::statement::cache+0x0
42148350 4b1757a6 00000005 4af15190 c7ea87e0 c7ea87e0 rosetta_4.20_windows_x86_64!cppdb::session::is_open+0x0
421483c0 4b1756cc 421486c8 42148539 421486c8 c7e73880 rosetta_4.20_windows_x86_64!cppdb::session::is_open+0x0
42148470 4b23b6f5 421486c8 42148a41 00000000 4af375e8 rosetta_4.20_windows_x86_64!cppdb::session::is_open+0x0
42148590 4b23a592 00000005 421486c8 421488a0 00000000 rosetta_4.20_windows_x86_64!cppdb::backend::statement::cache+0x0
42148660 4b23ad06 00000000 00000000 42148f80 cd830000 rosetta_4.20_windows_x86_64!cppdb::backend::statement::cache+0x0
42148800 4b6971a3 421488a0 42148f80 ffffff01 4af23e73 rosetta_4.20_windows_x86_64!cppdb::backend::statement::cache+0x0
42148af0 4b699d09 00000000 00000001 42148c00 42148f80 rosetta_4.20_windows_x86_64!cppdb::backend::statement::cache+0x0
42148e80 4b692f8a 42148ec0 42148f80 cb3b3de0 c7ee9630 rosetta_4.20_windows_x86_64!cppdb::backend::statement::cache+0x0
42148ee0 4b8acc70 42148f80 421496a8 c7ea87e0 00000000 rosetta_4.20_windows_x86_64!cppdb::backend::statement::cache+0x0
42149670 4b8ac6e4 cbf93460 cbee9050 4ff15cc0 4af175a6 rosetta_4.20_windows_x86_64!cppdb::backend::statement::cache+0x0
421496d0 4b8b603e 421497c0 cbf93190 421497e0 42149f30 rosetta_4.20_windows_x86_64!cppdb::backend::statement::cache+0x0
42149e50 4b8b56d4 5ef76948 5ef76a58 4fe87f70 4b8d6cb4 rosetta_4.20_windows_x86_64!cppdb::backend::statement::cache+0x0
42149ee0 4b8b578e 00000005 4214a488 c7ee9630 00000001 rosetta_4.20_windows_x86_64!cppdb::backend::statement::cache+0x0
4214a080 4af2081d c81ab820 c81ab820 c7ee9630 c78e5701 rosetta_4.20_windows_x86_64!cppdb::backend::statement::cache+0x0
4215fcb0 4af2b215 00000000 00000000 4fe4ccf8 00000000 rosetta_4.20_windows_x86_64!xmlParserInputRead+0x0
4215fcf0 bf387034 00000000 00000000 00000000 00000000 rosetta_4.20_windows_x86_64!xmlParserInputRead+0x0
4215fd20 bff82651 00000000 00000000 00000000 00000000 KERNEL32!BaseThreadInitThunk+0x0
4215fda0 00000000 00000000 00000000 00000000 00000000 ntdll!RtlUserThreadStart+0x0

*** Dump of thread ID 32766 (state: Initialized): ***

- Information -
Status: Base Priority: Normal, Priority: Unknown, , Kernel Time: 6.000000, User Time: 0.000000, Wait Time: 3265596160.000000

- Registers -
rax=0000000000000000 rbx=0000000000000000 rcx=0000000000000000 rdx=0000000000000000 rsi=0000000000000000 rdi=0000000000000000
r8=0000000000000000 r9=0000000000000000 r10=0000000000000000 r11=0000000000000000 r12=0000000000000000 r13=0000000000000000
r14=0000000000000000 r15=0000000000000000 rip=0000000000000000 rsp=0000000000000000 rbp=0000000000000000
cs=0000 ss=0000 ds=0000 es=0000 fs=0000 gs=0000 efl=00000000

- Callstack -
ChildEBP RetAddr Args to Child
(-nosymbols- PC == 0)
00000000 00000000 00000000 00000000 00000000 00000000 !+0x0

*** Dump of thread ID 30903250 (state: Unknown): ***

- Information -
Status: Base Priority: Normal, Priority: Unknown, , Kernel Time: 17179869184.000000, User Time: 21474836480.000000, Wait Time: 0.000000

- Registers -
rax=0000000000000000 rbx=0000000000000000 rcx=0000000000000000 rdx=0000000000000000 rsi=0000000000000000 rdi=0000000000000000
r8=0000000000000000 r9=0000000000000000 r10=0000000000000000 r11=0000000000000000 r12=0000000000000000 r13=0000000000000000
r14=0000000000000000 r15=0000000000000000 rip=0000000000000000 rsp=0000000000000000 rbp=0000000000000000
cs=0000 ss=0000 ds=0000 es=0000 fs=0000 gs=0000 efl=00000000

- Callstack -
ChildEBP RetAddr Args to Child
(-nosymbols- PC == 0)
00000000 00000000 00000000 00000000 00000000 00000000 !+0x0


*** Debug Message Dump ****


*** Foreground Window Data ***
Window Name :
Window Class :
Window Process ID: 0
Window Thread ID : 0

Exiting...

</stderr_txt>
]]>[/pre]
Grant
Darwin NT
ID: 102353 · Rating: 0
TribbleRED
Joined: 24 Jun 10
Posts: 2
Credit: 27,867,968
RAC: 5,052
Message 102355 - Posted: 7 Aug 2021, 21:39:22 UTC

Seems like a lot of subjects going on in this thread but here we go:

Node Config:
Win10 Pro (10.0.19043)
Gigabyte x570 Aorus Xtreme
5950x
128GB (4x32GB) Trident Royal Z 3600 @ 16-22-22-42
1x Gigabyte RTX 3090 Gaming OC
3x Western Digital Black sn850 in RAID0 (AMD-RAID)
ALL drivers up-to-date
No Gigabyte settings software installed

Problem: all WUs fail within seconds, producing the following log:

<core_client_version>7.16.11</core_client_version>
<![CDATA[
<message>
Incorrect function.
(0x1) - exit code 1 (0x1)</message>
<stderr_txt>
command: projects/boinc.bakerlab.org_rosetta/rosetta_4.20_windows_x86_64.exe @gb10_3CL_3CL_AVLstub_reversed_renumbered_12_000155_extract_A.flags -nstruct 10000 -cpu_run_time 28800 -boinc:max_nstruct 20000 -checkpoint_interval 120 -mute all -database minirosetta_database -in::file::zip minirosetta_database.zip -boinc::watchdog -boinc::cpu_run_timeout 36000 -run::rng mt19937 -constant_seed -jran 3682047
Using database: database_357d5d93529_n_methylminirosetta_database

ERROR: Residue topology file 'D:\B_DATA\projects\boinc.bakerlab.org_rosetta\database_357d5d93529_n_methyl\minirosetta_database\chemical/residue_type_sets/fa_standard/residue_types/metal_ions/FE.params' does not contain valid ATOM records.
ERROR:: Exit from: ..\..\..\src\core\chemical\residue_io.cc line: 696
BOINC:: Error reading and gzipping output datafile: default.out
15:08:11 (8808): called boinc_finish(1)

</stderr_txt>
]]>


This is a new node.

Any help would be appreciated.
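
One way to narrow down an error like "does not contain valid ATOM records" is to check whether the named file on disk is empty or cut short, which would point to a bad download rather than a bad task. The Python below is only a sketch under stated assumptions: it assumes that atom records in a residue params file are lines beginning with `ATOM`, the function names are hypothetical, and the demo files contain made-up content rather than real Rosetta data.

```python
from pathlib import Path
import tempfile


def count_atom_records(params_path: Path) -> int:
    """Count lines that start with 'ATOM' (assumed record prefix)."""
    with params_path.open("r", errors="replace") as handle:
        return sum(1 for line in handle if line.startswith("ATOM"))


def looks_damaged(params_path: Path) -> bool:
    """True when the file is missing, empty, or has no ATOM lines."""
    if not params_path.is_file() or params_path.stat().st_size == 0:
        return True
    return count_atom_records(params_path) == 0


# Demonstration on throwaway files with made-up contents
with tempfile.TemporaryDirectory() as tmp:
    good = Path(tmp) / "good.params"
    good.write_text("NAME XX\nATOM  A1  B  C  0.00\n")
    empty = Path(tmp) / "empty.params"
    empty.write_text("")
    print(looks_damaged(good), looks_damaged(empty))
```

If the real file turned out to be empty or missing, that would be consistent with a project file that did not download completely.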
ID: 102355 · Rating: 0
Grant (SSSF)
Joined: 28 Mar 20
Posts: 1686
Credit: 18,014,297
RAC: 23,705
Message 102356 - Posted: 7 Aug 2021, 21:56:13 UTC - in response to Message 102355.  
Last modified: 7 Aug 2021, 21:57:06 UTC

This is a new node.

Any help would be appreciated.
That's a completely different error from the one I'm getting. Most of my Tasks are failing, but many of them still process OK. All of yours failed, and with a different error.
I would suggest resetting Rosetta; it could be that it missed a file it needed when you attached and first got work.
In BOINC Manager, select Rosetta, then Reset Project.

That will make it dump all the application & support files and re-download them.
It could be a few days before work is readily available here, and hopefully by then most of it won't error out as the present batch is doing.
If you reset the project and are able to pick up some more work that also errors out, check whether the errors are the same type you are getting now or the type I posted above. If it's the type I posted above, then some Tasks should run OK. If it's still the same as your current errors, then all Tasks will error out again regardless, and I'd suggest waiting until there is plenty of work available that doesn't produce a high percentage of errors before resetting the project (yet again).
Grant
Darwin NT
ID: 102356 · Rating: 0
Gandolph1
Joined: 3 Aug 21
Posts: 7
Credit: 587,066
RAC: 470
Message 102357 - Posted: 7 Aug 2021, 23:06:36 UTC - in response to Message 102356.  

Finally got my machine to start downloading tasks again (Problem at Boincstats) and now everything it's downloading is failing after a short computation time. Is there a way around this issue?
ID: 102357 · Rating: 0
Grant (SSSF)
Joined: 28 Mar 20
Posts: 1686
Credit: 18,014,297
RAC: 23,705
Message 102358 - Posted: 7 Aug 2021, 23:10:28 UTC - in response to Message 102357.  
Last modified: 7 Aug 2021, 23:18:12 UTC

Finally got my machine to start downloading tasks again (Problem at Boincstats) and now everything it's downloading is failing after a short computation time. Is there a way around this issue?
As mentioned a few posts earlier, the majority of the new Tasks will error out in a few seconds, but the rest will run OK.

As it is, it looks like that batch has all been sent out, so it'll just be resends again until another larger batch of work is released.



NB: since you are running more than one project, I would suggest reducing your cache size to half a day or less. At times like these, when Rosetta is out of work, your system will run just Einstein; when Rosetta gets more work, it will do more Rosetta until your Resource share settings are met. There is no need to cache work.

e.g.
Computing preferences, Other:
Store at least 0.4 days of work
Store up to an additional 0.01 days of work
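
For anyone who prefers a file to the Manager dialog, the same two values can, as far as I know, be expressed in BOINC's `global_prefs_override.xml` as `work_buf_min_days` and `work_buf_additional_days`. The Python below is a sketch that only builds the XML text; the function name is hypothetical, and writing the file into the BOINC data directory and asking the client to re-read local preferences are left out.

```python
import xml.etree.ElementTree as ET


def build_prefs_override(min_days: float, extra_days: float) -> str:
    """Return override XML for the two work-buffer (cache) settings."""
    root = ET.Element("global_preferences")
    ET.SubElement(root, "work_buf_min_days").text = str(min_days)
    ET.SubElement(root, "work_buf_additional_days").text = str(extra_days)
    return ET.tostring(root, encoding="unicode")


# Values taken from the example settings above
print(build_prefs_override(0.4, 0.01))
```

Keeping both numbers small means the client asks for work more often but holds little of it, which suits a machine attached to several projects.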

Grant
Darwin NT
ID: 102358 · Rating: 0
robertmiles
Joined: 16 Jun 08
Posts: 1233
Credit: 14,283,319
RAC: 1,049
Message 102359 - Posted: 7 Aug 2021, 23:17:44 UTC - in response to Message 102357.  

Finally got my machine to start downloading tasks again (Problem at Boincstats) and now everything it's downloading is failing after a short computation time. Is there a way around this issue?

Errors after very short computation times are usually due to errors in one of the input files. In that case, not much can be done except to have the project staff cancel every task that shares the defective input file. Getting them to notice the need to do so is usually quite difficult.

I saw two of the last batch I got fail quickly, but the other five have been running for an hour. This may mean that they have filtered out some, but not all, of the tasks with defective inputs.
ID: 102359 · Rating: 0
TribbleRED
Joined: 24 Jun 10
Posts: 2
Credit: 27,867,968
RAC: 5,052
Message 102360 - Posted: 8 Aug 2021, 1:57:27 UTC - in response to Message 102356.  

Thank you, Grant. I'll give it a go.
ID: 102360 · Rating: 0
Albert H.
Joined: 31 Jan 14
Posts: 2
Credit: 8,917,754
RAC: 475
Message 102361 - Posted: 8 Aug 2021, 9:45:51 UTC

Hi,
No new tasks for 3 days. Is there a problem on my side?

08/08/2021 11:13:17 | Rosetta@home | Project requested delay of 31 seconds
08/08/2021 11:13:52 | Rosetta@home | Sending scheduler request: To fetch work.
08/08/2021 11:13:52 | Rosetta@home | Requesting new tasks for CPU
08/08/2021 11:13:53 | Rosetta@home | Scheduler request completed: got 0 new tasks
08/08/2021 11:13:53 | Rosetta@home | No tasks sent
08/08/2021 11:13:53 | Rosetta@home | Project requested delay of 31 seconds
08/08/2021 11:28:33 | Rosetta@home | Sending scheduler request: To fetch work.
08/08/2021 11:28:33 | Rosetta@home | Requesting new tasks for CPU
08/08/2021 11:28:34 | Rosetta@home | Scheduler request completed: got 0 new tasks
08/08/2021 11:28:34 | Rosetta@home | No tasks sent

Thanks
Albert H.
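
A log like the one above can be summarized with a few lines of Python. This is only a sketch: it assumes the `date time | project | message` layout seen in the pasted lines, and the function name is hypothetical.

```python
import re
from typing import List

# Sample lines copied from the event log in the post above
LOG = """\
08/08/2021 11:13:52 | Rosetta@home | Requesting new tasks for CPU
08/08/2021 11:13:53 | Rosetta@home | Scheduler request completed: got 0 new tasks
08/08/2021 11:28:33 | Rosetta@home | Requesting new tasks for CPU
08/08/2021 11:28:34 | Rosetta@home | Scheduler request completed: got 0 new tasks
"""

GOT_TASKS = re.compile(r"Scheduler request completed: got (\d+) new tasks")


def tasks_per_request(log_text: str, project: str) -> List[int]:
    """Return the number of tasks received on each completed request."""
    counts = []
    for line in log_text.splitlines():
        parts = [part.strip() for part in line.split("|", 2)]
        if len(parts) != 3 or parts[1] != project:
            continue  # skip malformed lines and other projects
        match = GOT_TASKS.search(parts[2])
        if match:
            counts.append(int(match.group(1)))
    return counts


counts = tasks_per_request(LOG, "Rosetta@home")
print(f"{len(counts)} requests, {sum(counts)} tasks received")
```

Requests that complete but return zero tasks mean the client reached the scheduler and simply was not given any work, which matches a project that is out of work rather than a fault on the client side.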
ID: 102361 · Rating: 0
Grant (SSSF)
Joined: 28 Mar 20
Posts: 1686
Credit: 18,014,297
RAC: 23,705
Message 102362 - Posted: 8 Aug 2021, 10:00:17 UTC - in response to Message 102361.  

Hi,
No new tasks for 3 days. Is there a problem on my side?
The project is presently out of work.
There are some resends, and a couple of small batches of work being released, but it's down to luck as to whether you get any or not.
Hopefully come the start of the new work week in the US more work will be loaded up at some stage and will be readily available.
Grant
Darwin NT
ID: 102362 · Rating: 0
Kissagogo27
Joined: 31 Mar 20
Posts: 86
Credit: 2,939,678
RAC: 2,829
Message 102367 - Posted: 8 Aug 2021, 19:28:18 UTC

I only got resends ^^

and some gb10_3CL tasks errored out too.
ID: 102367 · Rating: 0
Sid Celery
Joined: 11 Feb 08
Posts: 2127
Credit: 41,266,340
RAC: 8,573
Message 102369 - Posted: 8 Aug 2021, 21:32:53 UTC - in response to Message 102362.  

No new tasks for 3 days. Is there a problem on my side?
The project is presently out of work.
There are some resends, and a couple of small batches of work being released, but it's down to luck as to whether you get any or not.
Hopefully come the start of the new work week in the US more work will be loaded up at some stage and will be readily available.

I managed to get 2.64 seconds total runtime from my last 40 errored tasks #Blessed
Time to give WCG its head until they fix it - sometime whenever
ID: 102369 · Rating: 0
Stevie G
Joined: 15 Dec 18
Posts: 107
Credit: 842,208
RAC: 1,004
Message 102370 - Posted: 8 Aug 2021, 23:27:21 UTC - in response to Message 102369.  

I also had nothing from Rosetta for almost a week. Then I got a string of tasks that ended as "errors while computing."

Must be something amiss with the new setup?

S. Gaber
ID: 102370 · Rating: 0
[VENETO] boboviz
Joined: 1 Dec 05
Posts: 1995
Credit: 9,650,240
RAC: 7,012
Message 102371 - Posted: 9 Aug 2021, 8:44:47 UTC - in response to Message 102370.  

I also had nothing from Rosetta for almost a week. Then I got a string of tasks that ended as "errors while computing."

No work here and problems on Ralph
Seems: "Closed for holiday" :-P
ID: 102371 · Rating: 0
Admin
Project administrator
Joined: 1 Jul 05
Posts: 4805
Credit: 0
RAC: 0
Message 102372 - Posted: 9 Aug 2021, 15:51:57 UTC

I've notified the researcher that submitted the jobs that are failing.

We are looking into the HTTP download issues at the moment.
ID: 102372 · Rating: 0
xroule
Joined: 9 Feb 15
Posts: 4
Credit: 58,799,283
RAC: 5,892
Message 102373 - Posted: 10 Aug 2021, 0:27:57 UTC - in response to Message 102372.  

Not a new problem. It happens often and it lasts many days. Not very serious! :-(
I had to go back to W.C.G.
ID: 102373 · Rating: 0
robertmiles
Joined: 16 Jun 08
Posts: 1233
Credit: 14,283,319
RAC: 1,049
Message 102374 - Posted: 10 Aug 2021, 2:45:18 UTC - in response to Message 102373.  

Not a new problem. It happens often and it lasts many days. Not very serious! :-(
I had to go back to W.C.G.

W.C.G. is now almost out of tasks, so you may need to add another project.
ID: 102374 · Rating: 0
xroule
Joined: 9 Feb 15
Posts: 4
Credit: 58,799,283
RAC: 5,892
Message 102375 - Posted: 10 Aug 2021, 2:53:17 UTC - in response to Message 102374.  

I have lots of WCG W.U. in store. And yes, I can join another project. I can stop crunching altogether. None of that changes the fact that this project runs out of W.U. far too often without explanation. That shows how much they care about the crunchers. And no, I do not know who "they" are.
ID: 102375 · Rating: 0



©2024 University of Washington
https://www.bakerlab.org