Problems and Technical Issues with Rosetta@home

Message boards : Number crunching : Problems and Technical Issues with Rosetta@home

To post messages, you must log in.

Previous · 1 . . . 32 · 33 · 34 · 35 · 36 · 37 · 38 . . . 309 · Next

AuthorMessage
Profile yoerik
Avatar

Send message
Joined: 24 Mar 20
Posts: 128
Credit: 169,525
RAC: 0
Message 92827 - Posted: 31 Mar 2020, 22:43:46 UTC - in response to Message 92821.  
Last modified: 31 Mar 2020, 22:50:14 UTC

Due to my posts being removed for suggesting China started Coronavirus, I am withdrawing my computers from your project. I will not be told what I can and cannot say.


I hope the door doesn't hit your rear too hard on the way out.


So you think it's ok for a country to screw over the whole world? It came from Chinese meat markets. So did SARS, yet they didn't learn. They must be held responsible.


China didn't intentionally start it.

Right now, we need to focus on solving it. We can deal with blame when no one is dying from this disease. The mods are just doing their job.
ID: 92827 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Mr P Hucker
Avatar

Send message
Joined: 12 Aug 06
Posts: 1600
Credit: 12,116,986
RAC: 9,863
Message 92828 - Posted: 31 Mar 2020, 22:45:49 UTC - in response to Message 92826.  

HPE Belgium, something that may help on the server problem:

Some versions of windows automatically lock their licenses to the first hard drive they are installed on.

Has a hard drive of the server with the problem been replaced? If so, you'll need to talk to Microsoft, about how to get the license transferred to the replacement hard drive.


Surely if the license became invalid, there would be lots of moaning by the OS about it. Also it would show as unactivated in the control panel.


Something else to check: Is the version of Windows on the server able to use more than one CPU in the same computer?


Yes, some are very restricted, eg:
https://en.wikipedia.org/wiki/Windows_Server_2012#Editions
ID: 92828 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Mr P Hucker
Avatar

Send message
Joined: 12 Aug 06
Posts: 1600
Credit: 12,116,986
RAC: 9,863
Message 92829 - Posted: 31 Mar 2020, 22:46:50 UTC - in response to Message 92827.  
Last modified: 31 Mar 2020, 22:48:10 UTC

Due to my posts being removed for suggesting China started Coronavirus, I am withdrawing my computers from your project. I will not be told what I can and cannot say.


I hope the door doesn't hit your rear too hard on the way out.


So you think it's ok for a country to screw over the whole world? It came from Chinese meat markets. So did SARS, yet they didn't learn. They must be held responsible.


China didn't intentionally start it.

Right now, we need to focus on solving it. We can deal with blame when no one is dying from this disease.


They started it through negligence. And I'm not overly concerned with what is just a nasty flu bug. It's the government lockdowns that are screwing us over. Better for 96% of us to survive than for 100% of us to have our lives messed up.
ID: 92829 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Profile robertmiles

Send message
Joined: 16 Jun 08
Posts: 1234
Credit: 14,338,560
RAC: 2,014
Message 92834 - Posted: 31 Mar 2020, 23:21:18 UTC - in response to Message 92829.  

[snip]

They started it through negligence. And I'm not overly concerned with what is just a nasty flu bug. It's the government lockdowns that are screwing us over. Better for 96% of us to survive than for 100% of us to have our lives messed up.


So you wouldn't mind being one of the other 4%?
ID: 92834 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Mod.Sense
Volunteer moderator

Send message
Joined: 22 Aug 06
Posts: 4018
Credit: 0
RAC: 0
Message 92847 - Posted: 1 Apr 2020, 1:19:49 UTC

This thread is for reporting problems and technical issues with Rosetta@home

Please take other discussions and bantering to the cafe. Keeping in mind the posting guidelines.

The developers need concise information about problem reports. That shouldn't be three pages a day of posts about other things.
Rosetta Moderator: Mod.Sense
ID: 92847 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
MarkJ

Send message
Joined: 28 Mar 20
Posts: 72
Credit: 25,238,680
RAC: 0
Message 92855 - Posted: 1 Apr 2020, 2:58:12 UTC
Last modified: 1 Apr 2020, 3:54:27 UTC

I'm noticing scheduler requests failing. HTTP gateway errors or they time out. Another indicator is others reporting ghost tasks. That is a sign the scheduler is being overwhelmed by the number of requests. I would suggest increasing the back off interval from 6 seconds to say 30 or even 60 seconds.

Einstein uses 60 second back off
Seti uses 301 seconds back off
ID: 92855 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Profile Grant (SSSF)

Send message
Joined: 28 Mar 20
Posts: 1725
Credit: 18,399,907
RAC: 19,807
Message 92862 - Posted: 1 Apr 2020, 6:47:44 UTC

Another bunch of errors.
All of my errors so far have occurred with the Rosetta v4.07 windows_intelx86 application.

rb_03_30_20012_19819__t000__1_C1_SAVE_ALL_OUT_IGNORE_THE_REST_904726_807_0
<core_client_version>7.6.33</core_client_version>
<![CDATA[
<message>
Incorrect function.
 (0x1) - exit code 1 (0x1)
</message>
<stderr_txt>
command: projects/boinc.bakerlab.org_rosetta/rosetta_4.07_windows_intelx86.exe -run:protocol jd2_scripting @flags_rb_03_30_20012_19819__t000__1_C1_robetta -silent_gz -mute all -out:file:silent default.out -in:file:boinc_wu_zip input_rb_03_30_20012_19819__t000__1_C1_robetta.zip -nstruct 10000 -cpu_run_time 28800 -watchdog -boinc:max_nstruct 600 -checkpoint_interval 120 -database minirosetta_database -in::file::zip minirosetta_database.zip -boinc::watchdog -run::rng mt19937 -constant_seed -jran 1202891
Starting watchdog...
Watchdog active.

</stderr_txt>
]]>




rb_03_30_20012_19819__t000__3_C1_SAVE_ALL_OUT_IGNORE_THE_REST_904726_817_0
<core_client_version>7.6.33</core_client_version>
<![CDATA[
<message>
Incorrect function.
 (0x1) - exit code 1 (0x1)
</message>
<stderr_txt>
command: projects/boinc.bakerlab.org_rosetta/rosetta_4.07_windows_intelx86.exe -run:protocol jd2_scripting @flags_rb_03_30_20012_19819__t000__3_C1_robetta -silent_gz -mute all -out:file:silent default.out -in:file:boinc_wu_zip input_rb_03_30_20012_19819__t000__3_C1_robetta.zip -nstruct 10000 -cpu_run_time 28800 -watchdog -boinc:max_nstruct 600 -checkpoint_interval 120 -database minirosetta_database -in::file::zip minirosetta_database.zip -boinc::watchdog -run::rng mt19937 -constant_seed -jran 1200381
Starting watchdog...
Watchdog active.

</stderr_txt>
]]>




0gj7fl7x_jhr_design1_COVID-19_SAVE_ALL_OUT_903184_1_1
<core_client_version>7.6.33</core_client_version>
<![CDATA[
<message>
(unknown error) - exit code -529697949 (0xe06d7363)
</message>
<stderr_txt>
command: projects/boinc.bakerlab.org_rosetta/rosetta_4.07_windows_intelx86.exe -run:protocol jd2_scripting -parser:protocol jhr_boinc.xml @flags -in:file:silent 0gj7fl7x_jhr_design1_COVID-19.silent -in:file:silent_struct_type binary -silent_gz -mute all -out:file:silent_struct_type binary -out:file:silent default.out -in:file:boinc_wu_zip 0gj7fl7x_jhr_design1_COVID-19.zip -nstruct 10000 -cpu_run_time 28800 -watchdog -boinc:max_nstruct 600 -checkpoint_interval 120 -database minirosetta_database -in::file::zip minirosetta_database.zip -boinc::watchdog -run::rng mt19937 -constant_seed -jran 2210503
Starting watchdog...
Watchdog active.


Unhandled Exception Detected...

- Unhandled Exception Record -
Reason: Out Of Memory (C++ Exception) (0xe06d7363) at address 0x76484192

Engaging BOINC Windows Runtime Debugger...



********************


BOINC Windows Runtime Debugger Version 7.9.0


Dump Timestamp    : 04/01/20 07:52:11
Install Directory : 
Data Directory    : C:ProgramDataBOINC
Project Symstore  : https://boinc.bakerlab.org/rosetta/symstore
LoadLibraryA( C:ProgramDataBOINCdbghelp.dll ): GetLastError = 126
Loaded Library    : dbghelp.dll
LoadLibraryA( C:ProgramDataBOINCsymsrv.dll ): GetLastError = 126
LoadLibraryA( symsrv.dll ): GetLastError = 126
LoadLibraryA( C:ProgramDataBOINCsrcsrv.dll ): GetLastError = 126
LoadLibraryA( srcsrv.dll ): GetLastError = 126
LoadLibraryA( C:ProgramDataBOINCversion.dll ): GetLastError = 126
Loaded Library    : version.dll
Debugger Engine   : 4.0.5.0
Symbol Search Path: C:ProgramDataBOINCslots5;C:ProgramDataBOINCprojectsboinc.bakerlab.org_rosetta;srv*C:ProgramDataBOINCprojectsboinc.bakerlab.org_rosettasymbols*http://msdl.microsoft.com/download/symbols;srv*C:ProgramDataBOINCprojectsboinc.bakerlab.org_rosettasymbols*https://boinc.bakerlab.org/rosetta/symstore


ModLoad: 0000000000200000 0000000003413000 C:ProgramDataBOINCprojectsboinc.bakerlab.org_rosettarosetta_4.07_windows_intelx86.exe (-exported- Symbols Loaded)
    Linked PDB Filename   : C:cygwinhomeboincRosetta_4.07mainsourceideVisualStudioBoincReleaserosetta_4.07_windows_intelx86.pdb

ModLoad: 0000000077030000 000000000019a000 C:WINDOWSSYSTEM32ntdll.dll (6.2.18362.719) (-exported- Symbols Loaded)
    Linked PDB Filename   : wntdll.pdb
    File Version          : 10.0.18362.329 (WinBuild.160101.0800)
    Company Name          : Microsoft Corporation
    Product Name          : Microsoft&#174; Windows&#174; Operating System
    Product Version       : 10.0.18362.329

ModLoad: 0000000075b60000 00000000000e0000 C:WINDOWSSystem32KERNEL32.DLL (6.2.18362.329) (-exported- Symbols Loaded)
    Linked PDB Filename   : wkernel32.pdb
    File Version          : 10.0.18362.329 (WinBuild.160101.0800)
    Company Name          : Microsoft Corporation
    Product Name          : Microsoft&#174; Windows&#174; Operating System
    Product Version       : 10.0.18362.329

ModLoad: 0000000076370000 00000000001fe000 C:WINDOWSSystem32KERNELBASE.dll (6.2.18362.719) (-exported- Symbols Loaded)
    Linked PDB Filename   : wkernelbase.pdb
    File Version          : 10.0.18362.329 (WinBuild.160101.0800)
    Company Name          : Microsoft Corporation
    Product Name          : Microsoft&#174; Windows&#174; Operating System
    Product Version       : 10.0.18362.329

ModLoad: 0000000076880000 000000000005e000 C:WINDOWSSystem32WS2_32.dll (6.2.18362.387) (-exported- Symbols Loaded)
    Linked PDB Filename   : ws2_32.pdb
    File Version          : 10.0.18362.1 (WinBuild.160101.0800)
    Company Name          : Microsoft Corporation
    Product Name          : Microsoft&#174; Windows&#174; Operating System
    Product Version       : 10.0.18362.1

ModLoad: 0000000076a50000 00000000000bb000 C:WINDOWSSystem32RPCRT4.dll (6.2.18362.628) (-exported- Symbols Loaded)
    Linked PDB Filename   : wrpcrt4.pdb
    File Version          : 10.0.18362.1 (WinBuild.160101.0800)
    Company Name          : Microsoft Corporation
    Product Name          : Microsoft&#174; Windows&#174; Operating System
    Product Version       : 10.0.18362.1

ModLoad: 0000000074800000 0000000000020000 C:WINDOWSSystem32SspiCli.dll (6.2.18362.1) (-exported- Symbols Loaded)
    Linked PDB Filename   : wsspicli.pdb
    File Version          : 10.0.18362.1 (WinBuild.160101.0800)
    Company Name          : Microsoft Corporation
    Product Name          : Microsoft&#174; Windows&#174; Operating System
    Product Version       : 10.0.18362.1

ModLoad: 00000000747f0000 000000000000a000 C:WINDOWSSystem32CRYPTBASE.dll (6.2.18362.1) (-exported- Symbols Loaded)
    Linked PDB Filename   : cryptbase.pdb
    File Version          : 10.0.18362.1 (WinBuild.160101.0800)
    Company Name          : Microsoft Corporation
    Product Name          : Microsoft&#174; Windows&#174; Operating System
    Product Version       : 10.0.18362.1

ModLoad: 0000000076b10000 000000000005f000 C:WINDOWSSystem32bcryptPrimitives.dll (6.2.18362.295) (-exported- Symbols Loaded)
    Linked PDB Filename   : bcryptprimitives.pdb
    File Version          : 10.0.18362.295 (WinBuild.160101.0800)
    Company Name          : Microsoft Corporation
    Product Name          : Microsoft&#174; Windows&#174; Operating System
    Product Version       : 10.0.18362.295

ModLoad: 0000000075ea0000 0000000000076000 C:WINDOWSSystem32sechost.dll (6.2.18362.693) (-exported- Symbols Loaded)
    Linked PDB Filename   : sechost.pdb
    File Version          : 10.0.18362.1 (WinBuild.160101.0800)
    Company Name          : Microsoft Corporation
    Product Name          : Microsoft&#174; Windows&#174; Operating System
    Product Version       : 10.0.18362.1

ModLoad: 0000000074eb0000 0000000000197000 C:WINDOWSSystem32USER32.dll (6.2.18362.719) (-exported- Symbols Loaded)
    Linked PDB Filename   : wuser32.pdb
    File Version          : 10.0.17134.343 (WinBuild.160101.0800)
    Company Name          : Microsoft Corporation
    Product Name          : Microsoft&#174; Windows&#174; Operating System
    Product Version       : 10.0.17134.343

ModLoad: 0000000075180000 0000000000017000 C:WINDOWSSystem32win32u.dll (6.2.18362.719) (-exported- Symbols Loaded)
    Linked PDB Filename   : wwin32u.pdb
    File Version          : 10.0.18362.719 (WinBuild.160101.0800)
    Company Name          : Microsoft Corporation
    Product Name          : Microsoft&#174; Windows&#174; Operating System
    Product Version       : 10.0.18362.719

ModLoad: 0000000075310000 0000000000021000 C:WINDOWSSystem32GDI32.dll (6.2.18362.1) (-exported- Symbols Loaded)
    Linked PDB Filename   : wgdi32.pdb
    File Version          : 10.0.18362.1 (WinBuild.160101.0800)
    Company Name          : Microsoft Corporation
    Product Name          : Microsoft&#174; Windows&#174; Operating System
    Product Version       : 10.0.18362.1

ModLoad: 00000000751b0000 000000000015a000 C:WINDOWSSystem32gdi32full.dll (6.2.18362.719) (-exported- Symbols Loaded)
    Linked PDB Filename   : wgdi32full.pdb
    File Version          : 10.0.18362.719 (WinBuild.160101.0800)
    Company Name          : Microsoft Corporation
    Product Name          : Microsoft&#174; Windows&#174; Operating System
    Product Version       : 10.0.18362.719

ModLoad: 0000000075050000 000000000007c000 C:WINDOWSSystem32msvcp_win.dll (6.2.18362.387) (-exported- Symbols Loaded)
    Linked PDB Filename   : msvcp_win.pdb
    File Version          : 10.0.18362.387 (WinBuild.160101.0800)
    Company Name          : Microsoft Corporation
    Product Name          : Microsoft&#174; Windows&#174; Operating System
    Product Version       : 10.0.18362.387

ModLoad: 0000000075340000 000000000011f000 C:WINDOWSSystem32ucrtbase.dll (6.2.18362.387) (-exported- Symbols Loaded)
    Linked PDB Filename   : ucrtbase.pdb
    File Version          : 10.0.18362.387 (WinBuild.160101.0800)
    Company Name          : Microsoft Corporation
    Product Name          : Microsoft&#174; Windows&#174; Operating System
    Product Version       : 10.0.18362.387

ModLoad: 00000000767c0000 0000000000079000 C:WINDOWSSystem32ADVAPI32.dll (6.2.18362.329) (-exported- Symbols Loaded)
    Linked PDB Filename   : advapi32.pdb
    File Version          : 10.0.18362.1 (WinBuild.160101.0800)
    Company Name          : Microsoft Corporation
    Product Name          : Microsoft&#174; Windows&#174; Operating System
    Product Version       : 10.0.18362.1

ModLoad: 0000000074820000 00000000000bf000 C:WINDOWSSystem32msvcrt.dll (7.0.18362.1) (-exported- Symbols Loaded)
    Linked PDB Filename   : msvcrt.pdb
    File Version          : 7.0.18362.1 (WinBuild.160101.0800)
    Company Name          : Microsoft Corporation
    Product Name          : Microsoft&#174; Windows&#174; Operating System
    Product Version       : 7.0.18362.1

ModLoad: 0000000076c70000 0000000000025000 C:WINDOWSSystem32IMM32.DLL (6.2.18362.387) (-exported- Symbols Loaded)
    Linked PDB Filename   : wimm32.pdb
    File Version          : 10.0.18362.387 (WinBuild.160101.0800)
    Company Name          : Microsoft Corporation
    Product Name          : Microsoft&#174; Windows&#174; Operating System
    Product Version       : 10.0.18362.387

ModLoad: 0000000075da0000 000000000000f000 C:WINDOWSSystem32kernel.appcore.dll (6.2.18362.1) (-exported- Symbols Loaded)
    Linked PDB Filename   : Kernel.Appcore.pdb
    File Version          : 10.0.18362.1 (WinBuild.160101.0800)
    Company Name          : Microsoft Corporation
    Product Name          : Microsoft&#174; Windows&#174; Operating System
    Product Version       : 10.0.18362.1

ModLoad: 00000000747c0000 0000000000029000 C:WINDOWSSYSTEM32ntmarta.dll (6.2.18362.1) (-exported- Symbols Loaded)
    Linked PDB Filename   : ntmarta.pdb
    File Version          : 10.0.18362.1 (WinBuild.160101.0800)
    Company Name          : Microsoft Corporation
    Product Name          : Microsoft&#174; Windows&#174; Operating System
    Product Version       : 10.0.18362.1

ModLoad: 00000000744c0000 000000000018f000 C:WINDOWSSYSTEM32dbghelp.dll (6.2.18362.1) (-exported- Symbols Loaded)
    Linked PDB Filename   : dbghelp.pdb
    File Version          : 10.0.18362.1 (WinBuild.160101.0800)
    Company Name          : Microsoft Corporation
    Product Name          : Microsoft&#174; Windows&#174; Operating System
    Product Version       : 10.0.18362.1

ModLoad: 00000000744b0000 0000000000008000 C:WINDOWSSYSTEM32version.dll (6.2.18362.1) (-exported- Symbols Loaded)
    Linked PDB Filename   : version.pdb
    File Version          : 10.0.18362.1 (WinBuild.160101.0800)
    Company Name          : Microsoft Corporation
    Product Name          : Microsoft&#174; Windows&#174; Operating System
    Product Version       : 10.0.18362.1



*** Dump of the Process Statistics: ***

- I/O Operations Counters -
Read: 45296, Write: 0, Other 13383

- I/O Transfers Counters -
Read: 0, Write: 201934, Other 0

- Paged Pool Usage -
QuotaPagedPoolUsage: 247216, QuotaPeakPagedPoolUsage: 247520
QuotaNonPagedPoolUsage: 33240, QuotaPeakNonPagedPoolUsage: 38000

- Virtual Memory Usage -
VirtualSize: 2118262784, PeakVirtualSize: 2138513408

- Pagefile Usage -
PagefileUsage: 408424448, PeakPagefileUsage: 1467297792

- Working Set Size -
WorkingSetSize: 419426304, PeakWorkingSetSize: 1472090112, PageFaultCount: 18753581

*** Dump of thread ID 7904 (state: Waiting): ***

- Information -
Status: Wait Reason: UserRequest, , Kernel Time: 342500000.000000, User Time: 167920320512.000000, Wait Time: 9382121.000000

- Unhandled Exception Record -
Reason: Out Of Memory (C++ Exception) (0xe06d7363) at address 0x76484192

- Registers -
eax=0995d678 ebx=0995d724 ecx=00000003 edx=00000000 esi=02971c60 edi=02da4c54
eip=76484192 esp=0995d678 ebp=0995d6d0
cs=0023  ss=002b  ds=002b  es=002b  fs=0053  gs=002b             efl=00000216

- Callstack -
ChildEBP RetAddr  Args to Child
0995d6d0 004dac4b e06d7363 00000001 00000003 0995d708 KERNELBASE!RaiseException+0x0 
0995d714 004e2854 0995d724 02da4c54 029729a0 029729a8 rosetta_4.07_windows_intelx86!xmlParserInputRead+0x0 
0995d730 004e1b6e 0995d74c 00b5df10 00068704 3b36c1a0 rosetta_4.07_windows_intelx86!xmlParserInputRead+0x0 
0995d738 00b5df10 00068704 3b36c1a0 0995d900 0995d778 rosetta_4.07_windows_intelx86!xmlParserInputRead+0x0 
0995d74c 00b5de44 0995d900 43b09dcd 3b36c1a0 10a73700 rosetta_4.07_windows_intelx86!cppdb::backend::static_driver::in_use+0x0 
0995d778 02496c41 0995d900 43b09219 3b36c1a0 3b36c198 rosetta_4.07_windows_intelx86!cppdb::backend::static_driver::in_use+0x0 
0995d8ac 0083db19 10a72e10 10a73700 0aaa4fb8 4a9bb548 rosetta_4.07_windows_intelx86!cppdb::mutex::mutex+0x0 
0995d8d8 00b14595 10a72e10 10a73700 0aaa4fb8 0995d900 rosetta_4.07_windows_intelx86!cppdb::backend::statement::cache+0x0 
0995da04 00b115a7 0aaa4fb8 4a9bb548 10a15398 1c3fe668 rosetta_4.07_windows_intelx86!cppdb::backend::static_driver::in_use+0x0 
0995da78 00b2e198 0aaa4fb8 4a9bb548 10a15398 1c3fe668 rosetta_4.07_windows_intelx86!cppdb::backend::static_driver::in_use+0x0 
0995dabc 00b2de34 0995daf0 0e61bae8 45590cf0 0aaa4fb8 rosetta_4.07_windows_intelx86!cppdb::backend::static_driver::in_use+0x0 
0995db20 00b0b7f1 0995db68 0e61bae8 45590cf0 0aaa4fb8 rosetta_4.07_windows_intelx86!cppdb::backend::static_driver::in_use+0x0 
0995dbc8 00c6d009 0aaa4fb8 4a9bb548 0e61bae8 3d7ad3f0 rosetta_4.07_windows_intelx86!cppdb::backend::static_driver::in_use+0x0 
0995dc48 00c6ba06 0aaa4fb8 43b097ed 49348ed0 0aaa4fb8 rosetta_4.07_windows_intelx86!cppdb::backend::static_driver::in_use+0x0 
0995dd58 00c8a95b 0aaa4fb8 43b0ba21 53f021a8 49348ed0 rosetta_4.07_windows_intelx86!cppdb::backend::static_driver::in_use+0x0 
0995f094 00c3979a 0aaa4fb8 43b0bb89 53f021a8 78efd5dc rosetta_4.07_windows_intelx86!cppdb::backend::static_driver::in_use+0x0 
0995f13c 00c38ce7 0aaa4fb8 78efd5dc 43b0bb4d 0a153de8 rosetta_4.07_windows_intelx86!cppdb::backend::static_driver::in_use+0x0 
0995f1f8 00bfaf25 0aaa4fb8 43b0b929 5e8380d3 0a153de8 rosetta_4.07_windows_intelx86!cppdb::backend::static_driver::in_use+0x0 
0995f39c 00bf86df 0995f484 5e8380d3 00000000 0995f43c rosetta_4.07_windows_intelx86!cppdb::backend::static_driver::in_use+0x0 
0995f47c 00c3fa05 53f021a8 1c209a30 43b0be1d 00004e21 rosetta_4.07_windows_intelx86!cppdb::backend::static_driver::in_use+0x0 
0995f4a8 00c33dcf 09de20c0 09fa5d08 43b0be5d 09e78c60 rosetta_4.07_windows_intelx86!cppdb::backend::static_driver::in_use+0x0 
0995f4e8 004d94aa 09de20c0 09fa5d08 43b0b311 032a14a4 rosetta_4.07_windows_intelx86!cppdb::backend::static_driver::in_use+0x0 
0995f9a4 004e2267 00000026 099f3210 099ea7b8 43b0b359 rosetta_4.07_windows_intelx86!xmlParserInputRead+0x0 
0995f9ec 75b76359 0397e000 75b76340 0995fa58 77097b74 rosetta_4.07_windows_intelx86!xmlParserInputRead+0x0 
0995f9fc 77097b74 0397e000 08cf4be6 00000000 00000000 KERNEL32!BaseThreadInitThunk+0x0 
0995fa58 77097b44 ffffffff 770b8f3f 00000000 00000000 ntdll!RtlGetAppContainerNamedObjectPath+0x0 
0995fa68 00000000 004e22dd 0397e000 00000000 00000000 ntdll!RtlGetAppContainerNamedObjectPath+0x0 

*** Dump of thread ID 2420 (state: Waiting): ***

- Information -
Status: Wait Reason: ExecutionDelay, , Kernel Time: 0.000000, User Time: 0.000000, Wait Time: 9382116.000000

- Registers -
eax=00000000 ebx=0000000a ecx=00000000 edx=00000000 esi=00000000 edi=2ef8fc9c
eip=770a20bc esp=2ef8fc5c ebp=2ef8fcc0
cs=0023  ss=002b  ds=002b  es=002b  fs=0053  gs=002b             efl=00000206

- Callstack -
ChildEBP RetAddr  Args to Child
2ef8fcc0 7647f32f 00000064 00000000 2ef8fefc 016ed11b ntdll!ZwDelayExecution+0x0 
2ef8fcd0 016ed11b 00000064 016ed0f0 016ed0f0 00000000 KERNELBASE!Sleep+0x0 
2ef8fefc 75b76359 00000000 75b76340 2ef8ff68 77097b74 rosetta_4.07_windows_intelx86!xmlMutexLock+0x0 
2ef8ff0c 77097b74 00000000 2fa24ed6 00000000 00000000 KERNEL32!BaseThreadInitThunk+0x0 
2ef8ff68 77097b44 ffffffff 770b8f3f 00000000 00000000 ntdll!RtlGetAppContainerNamedObjectPath+0x0 
2ef8ff78 00000000 016ed0f0 00000000 00000000 00000000 ntdll!RtlGetAppContainerNamedObjectPath+0x0 

*** Dump of thread ID 3664 (state: Waiting): ***

- Information -
Status: Wait Reason: ExecutionDelay, , Kernel Time: 0.000000, User Time: 0.000000, Wait Time: 9382095.000000

- Registers -
eax=00000000 ebx=0a869b01 ecx=00000000 edx=00000000 esi=00000000 edi=366cfa1c
eip=770a20bc esp=366cf9dc ebp=366cfa40
cs=0023  ss=002b  ds=002b  es=002b  fs=0053  gs=002b             efl=00000202

- Callstack -
ChildEBP RetAddr  Args to Child
366cfa40 7647f32f 000007d0 00000000 366cfb38 00f68d91 ntdll!ZwDelayExecution+0x0 
366cfa50 00f68d91 000007d0 7c49b18d 0a869b40 00f68f70 KERNELBASE!Sleep+0x0 
366cfb38 00f68f77 00000000 016da2f5 00000000 7c49b1c9 rosetta_4.07_windows_intelx86!xmlMutexLock+0x0 
366cfb7c 75b76359 0a869b40 75b76340 366cfbe8 77097b74 rosetta_4.07_windows_intelx86!xmlMutexLock+0x0 
366cfb8c 77097b74 0a869b40 37364a56 00000000 00000000 KERNEL32!BaseThreadInitThunk+0x0 
366cfbe8 77097b44 ffffffff 770b8f3f 00000000 00000000 ntdll!RtlGetAppContainerNamedObjectPath+0x0 
366cfbf8 00000000 016da29e 0a869b40 00000000 00000000 ntdll!RtlGetAppContainerNamedObjectPath+0x0 


*** Debug Message Dump ****


*** Foreground Window Data ***
    Window Name      : 
    Window Class     : 
    Window Process ID: 0
    Window Thread ID : 0

Exiting...

</stderr_txt>
]]>

Grant
Darwin NT
ID: 92862 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
marek

Send message
Joined: 31 Mar 20
Posts: 2
Credit: 201,719
RAC: 0
Message 92871 - Posted: 1 Apr 2020, 8:16:39 UTC

-3 tasks in progress about 50% progress each
-suspend project, turn off computer
-next day turn it on, run boinc manager, resume project
-1 task magically is 100% and uploading, other 2 running normal

Is this jump from 50% to 100% normal ????
ID: 92871 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Jonathan

Send message
Joined: 4 Oct 17
Posts: 43
Credit: 1,337,472
RAC: 0
Message 92876 - Posted: 1 Apr 2020, 9:33:59 UTC - in response to Message 92871.  

I just exit Boinc Manager and check the box for stopping all running Boinc tasks.
FILE -> Exit Boinc

This will cleanly stop the tasks and when started again, they take off from the last checkpoints. If you are running Virtual Box tasks, give the computer a minute or two to save those tasks then you can power off or reboot.
ID: 92876 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
marek

Send message
Joined: 31 Mar 20
Posts: 2
Credit: 201,719
RAC: 0
Message 92884 - Posted: 1 Apr 2020, 10:40:12 UTC - in response to Message 92876.  
Last modified: 1 Apr 2020, 10:49:03 UTC

Thank you for the tip.
However, should i inform someone about this one task with this strange jump??

PS
1.There is not this option to stop end exit in "advanced view". It is only present in "simple view". I have boinc from debian testing repository.
2. Jumping between view types: advanced -> simple -> advanced - > window disappear, no boinc manager process, calculations still running.
ID: 92884 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
HPE Belgium

Send message
Joined: 27 Mar 20
Posts: 16
Credit: 367,648,439
RAC: 0
Message 92917 - Posted: 1 Apr 2020, 14:52:25 UTC - in response to Message 92884.  

So what's up with the credits for the task? I read it here somewhere here I think but can't find it.

Latest tasks only credit for 6-8 points. Older tasks were 100-300....

Is this something being looked into? Any comments on this? Wondering when this will be solved...

TIA
ID: 92917 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
koetjesreep

Send message
Joined: 24 Mar 20
Posts: 5
Credit: 495,994
RAC: 0
Message 92931 - Posted: 1 Apr 2020, 15:34:32 UTC

Rosetta v4.12 x86_64-apple-darwin task crashed due to

<core_client_version>7.14.2</core_client_version>
<![CDATA[
<message>
process got signal 11</message>
<stderr_txt>
etta_4.12_x86_64-apple-darwin(74386,0x7fffac05d380) malloc: *** error for object 0x10dbc2000: pointer being freed was not allocated
*** set a breakpoint in malloc_error_break to debug
rosetta_4.12_x86_64-apple-darwin(74386,0x7fffac05d380) malloc: *** error for object 0x10dbc2000: pointer being freed was not allocated
*** set a breakpoint in malloc_error_break to debug
rosetta_4.12_x86_64-apple-darwin(74386,0x7fffac05d380) malloc: *** error for object 0x10dbc2000: pointer being freed was not allocated
*** set a breakpoint in malloc_error_break to debug
rosetta_4.12_x86_64-apple-darwin(74386,0x7fffac05d380) malloc: *** error for object 0x10dbc2000: pointer being freed was not allocated
*** set a breakpoint in malloc_error_break to debug
[snip]

https://boinc.bakerlab.org/rosetta/result.php?resultid=1138137769
ID: 92931 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Keith Myers
Avatar

Send message
Joined: 29 Mar 20
Posts: 97
Credit: 332,619
RAC: 25
Message 92937 - Posted: 1 Apr 2020, 15:48:19 UTC

My failures mostly seem to be on the 4.08 X86_64 application. Seems that process signal exit 11 is caused by the application trying to execute an instruction that that cpu does not support.
See this thread.
https://boinc.bakerlab.org/rosetta/forum_thread.php?id=13658
ID: 92937 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
rlpm

Send message
Joined: 23 Mar 20
Posts: 13
Credit: 84
RAC: 0
Message 92938 - Posted: 1 Apr 2020, 15:48:39 UTC - in response to Message 92931.  

Signal 11 is SEGV (segmentation fault). This is typically due to a programming bug. Per stderr, looks like a few double frees as well, perhaps related. Anyone know how to report this to the boffins that write the software? Moderators?
ID: 92938 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
GLadi

Send message
Joined: 21 Jan 07
Posts: 3
Credit: 303,172
RAC: 0
Message 92941 - Posted: 1 Apr 2020, 15:52:23 UTC - in response to Message 92871.  

-3 tasks in progress about 50% progress each
-suspend project, turn off computer
-next day turn it on, run boinc manager, resume project
-1 task magically is 100% and uploading, other 2 running normal

Is this jump from 50% to 100% normal ????

It happened to me few days ago. For some WUs progress changed from let's say 50% to 100% immediately after resuming (even when BOINC Manager was switching between tasks from other projects). I haven't noticed it again.

BTW I cannot see some of my tasks between 24-28 Mar 2020 in my account.
ID: 92941 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
koetjesreep

Send message
Joined: 24 Mar 20
Posts: 5
Credit: 495,994
RAC: 0
Message 92942 - Posted: 1 Apr 2020, 15:54:44 UTC - in response to Message 92938.  
Last modified: 1 Apr 2020, 15:56:17 UTC

Signal 11 is SEGV (segmentation fault). This is typically due to a programming bug. Per stderr, looks like a few double frees as well, perhaps related. Anyone know how to report this to the boffins that write the software? Moderators?

https://boinc.bakerlab.org/rosetta/forum_thread.php?id=6893&postid=92847#92847 says this thread is for problem reports, hence I posted it here :-)
ID: 92942 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
rlpm

Send message
Joined: 23 Mar 20
Posts: 13
Credit: 84
RAC: 0
Message 92943 - Posted: 1 Apr 2020, 16:02:42 UTC - in response to Message 92942.  

Yep, good call. I'm also wondering if anyone on this thread has access to the code or know anyone who does and can hunt down this bug.
ID: 92943 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Keith Myers
Avatar

Send message
Joined: 29 Mar 20
Posts: 97
Credit: 332,619
RAC: 25
Message 92945 - Posted: 1 Apr 2020, 16:08:58 UTC - in response to Message 92943.  

Yep, good call. I'm also wondering if anyone on this thread has access to the code or know anyone who does and can hunt down this bug.

This thread seems to have the best explanation of the errors.
https://boinc.bakerlab.org/rosetta/forum_thread.php?id=13658
Seems they are not parsing the cpu features correctly and attempting to run instructions that the cpu does not support.
ID: 92945 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
vowelmarauder

Send message
Joined: 22 Mar 20
Posts: 2
Credit: 2,114,237
RAC: 0
Message 92985 - Posted: 1 Apr 2020, 22:20:51 UTC

I just noticed that my tasks are taking almost twice as long as the ETA says. The time is either standing still with 1-2 seconds either way or counting *up*... I don't think I've tinkered with any settings and boinc is using all its cores fully. Is this normal? What's going on?

https://i.imgur.com/3uwyfAU.jpg
ID: 92985 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Mod.Sense
Volunteer moderator

Send message
Joined: 22 Aug 06
Posts: 4018
Credit: 0
RAC: 0
Message 92986 - Posted: 1 Apr 2020, 22:30:11 UTC - in response to Message 92985.  
Last modified: 1 Apr 2020, 22:31:29 UTC

We'll have to see one of 'em report back in to see for sure, but it sounds like you may have changed the Preference for the workunit runtime from the 8 hour default up to 12 or 24 hours. The watchdog will keep an eye on them for you if they run too long. I suggest letting them run to completion.
Rosetta Moderator: Mod.Sense
ID: 92986 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Previous · 1 . . . 32 · 33 · 34 · 35 · 36 · 37 · 38 . . . 309 · Next

Message boards : Number crunching : Problems and Technical Issues with Rosetta@home



©2024 University of Washington
https://www.bakerlab.org