Message boards : Number crunching : Rosetta 4.1+ and 4.2+
Author | Message |
---|---|
Brian Nixon Joined: 12 Apr 20 Posts: 293 Credit: 8,432,366 RAC: 0 |
I just leave ’em be… That said: if tasks that are expected to take 8 hours only take 2¾ (the 12V ones seem to be targeting 10 000 s), it could potentially mess up BOINC’s work calculation and cause hosts with infrequent Internet connections to run out of work prematurely… |
James W Joined: 25 Nov 12 Posts: 130 Credit: 1,766,254 RAC: 0 |
[quote]I just leave ’em be… That said: if tasks that are expected to take 8 hours only take 2¾ (the 12V ones seem to be targeting 10 000 s), it could potentially mess up BOINC’s work calculation and cause hosts with infrequent Internet connections to run out of work prematurely…[/quote] Though 12V3AL*** tasks may have been running "short," other 12V*** tasks I've processed have run the full 8 hours or more on my two hosts. I therefore doubt that the observed short-running tasks would have a major impact on overall BOINC work calculations, etc. |
Grant (SSSF) Joined: 28 Mar 20 Posts: 1734 Credit: 18,532,940 RAC: 17,945 |
Yep. [quote]Though 12V3AL*** tasks may have been running "short," other 12V*** tasks I've processed have run the full 8 hours or more on my two hosts. I therefore doubt that the observed short-running tasks would have a major impact on overall BOINC work calculations, etc.[/quote] [quote]I just leave ’em be… That said: if tasks that are expected to take 8 hours only take 2¾ (the 12V ones seem to be targeting 10 000 s), it could potentially mess up BOINC’s work calculation and cause hosts with infrequent Internet connections to run out of work prematurely…[/quote] Although I've been getting plenty of the shorter-running Tasks, most Tasks run close to the Target CPU Runtime and my Estimated completion times are still at 7:59:59. An Estimated completion time within 1 second of the Target CPU Runtime is pretty good in my opinion. Grant Darwin NT |
Tomcat雄猫 Joined: 20 Dec 14 Posts: 180 Credit: 5,386,173 RAC: 0 |
Got this error: [url=https://boinc.bakerlab.org/rosetta/result.php?resultid=1199104948]Junior_HalfRoid_design6_cart_COVID-19_SAVE_ALL_OUT_IGNORE_THE_REST_4pd4vc0f_929576_40_0[/url] <core_client_version>7.16.7</core_client_version> <![CDATA[ <message> Incorrect function. (0x1) - exit code 1 (0x1)</message> <stderr_txt> command: projects/boinc.bakerlab.org_rosetta/rosetta_4.20_windows_x86_64.exe -run:protocol jd2_scripting -parser:protocol jhr_boinc_v4_cart.xml @flags -in:file:silent Junior_HalfRoid_design6_cart_COVID-19_SAVE_ALL_OUT_IGNORE_THE_REST_4pd4vc0f.silent -in:file:silent_struct_type binary -silent_gz -mute all -silent_read_through_errors true -out:file:silent_struct_type binary -out:file:silent default.out -in:file:boinc_wu_zip Junior_HalfRoid_design6_cart_COVID-19_SAVE_ALL_OUT_IGNORE_THE_REST_4pd4vc0f.zip @Junior_HalfRoid_design6_cart_COVID-19_SAVE_ALL_OUT_IGNORE_THE_REST_4pd4vc0f.flags -nstruct 10000 -cpu_run_time 28800 -boinc:max_nstruct 20000 -checkpoint_interval 120 -database minirosetta_database -in::file::zip minirosetta_database.zip -boinc::watchdog -boinc::cpu_run_timeout 36000 -run::rng mt19937 -constant_seed -jran 1052475 Using database: database_357d5d93529_n_methylminirosetta_database ERROR: [ERROR] Unable to open constraints file: 731194738833825cfcfe8f26a04b26f3_n1_c0_1_0001.MSAcst ERROR:: Exit from: ..\..\..\src\core\scoring\constraints\ConstraintIO.cc line: 457 BOINC:: Error reading and gzipping output datafile: default.out 18:29:44 (18508): called boinc_finish(1) </stderr_txt> ]]> |
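The `command:` line in the stderr output above carries the run's numeric settings. As a rough sketch (the helper name `numeric_flags` is hypothetical, and the command string is shortened from the log), the integer-valued flags can be pulled out and the two time values converted to hours. What each flag does inside Rosetta is not stated in this thread, so the code only reports the numbers.

```python
import shlex


def numeric_flags(command: str) -> dict[str, int]:
    """Collect '-flag <integer>' pairs from a command line (hypothetical helper)."""
    tokens = shlex.split(command)
    flags = {}
    # Walk adjacent token pairs; keep those shaped like "-name 12345".
    for name, value in zip(tokens, tokens[1:]):
        if name.startswith("-") and value.isdigit():
            flags[name] = int(value)
    return flags


# Shortened copy of the command line quoted in the post above.
command = (
    "rosetta_4.20_windows_x86_64.exe -nstruct 10000 -cpu_run_time 28800 "
    "-boinc:max_nstruct 20000 -checkpoint_interval 120 "
    "-boinc::cpu_run_timeout 36000 -run::rng mt19937 -constant_seed -jran 1052475"
)

flags = numeric_flags(command)
print(flags["-cpu_run_time"] / 3600)            # 8.0 (hours)
print(flags["-boinc::cpu_run_timeout"] / 3600)  # 10.0 (hours)
```

The 28800 s value matches the 8-hour target runtime discussed earlier in the thread.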
crystalsys Joined: 11 Aug 09 Posts: 8 Credit: 1,645,976 RAC: 404 |
Android - I have multiple tasks that have been saying uploading for days, and it isn't getting new tasks. And it says 'Nothing to do'. Also some that have been saying downloading, but nothing is happening. Maybe in wrong thread? Can't delete? |
monk_duck Joined: 17 Nov 09 Posts: 11 Credit: 284,039 RAC: 0 |
[quote]Android - I have multiple tasks that have been saying uploading for days, and it isn't getting new tasks. And it says 'Nothing to do'. Also some that have been saying downloading, but nothing is happening.[/quote] You want this thread: https://boinc.bakerlab.org/rosetta/forum_thread.php?id=14006 There is a certificate issue with the BOINC software (a lot of companies were caught out by this); we're awaiting a new build to hit Google Play. |
Sid Celery Joined: 11 Feb 08 Posts: 2146 Credit: 41,570,180 RAC: 8,210 |
[quote]Yep. [quote]Though 12V3AL*** tasks may have been running "short," other 12V*** tasks I've processed have run the full 8 hours or more on my two hosts. I therefore doubt that the observed short-running tasks would have a major impact on overall BOINC work calculations, etc.[/quote] [quote]I just leave ’em be… That said: if tasks that are expected to take 8 hours only take 2¾ (the 12V ones seem to be targeting 10 000 s), it could potentially mess up BOINC’s work calculation and cause hosts with infrequent Internet connections to run out of work prematurely…[/quote][/quote] I understand, and agree with, the premise of the question, but I've just reported a task that ran 3 hr 15 mins and my estimated times are still 8:00:00. Fact is, none of my tasks run this amount of time, now or ever in the past. But my estimated times have been within 1 or 2 secs of 8 hrs for maybe a week. I don't think there's anything to complain about (unless you've set a vastly different non-default CPU runtime, in which case there certainly is) - it's just odd. But neither do I think it's an average of past performance like it used to be. Something's definitely changed, but I'll leave it to those who have problems to report it. |
Grant (SSSF) Joined: 28 Mar 20 Posts: 1734 Credit: 18,532,940 RAC: 17,945 |
[quote]I don't think there's anything to complain about (unless you've set a vastly different non-default CPU runtime, in which case there certainly is) - it's just odd. But neither do I think it's an average of past performance like it used to be.[/quote] It was changed by the project to stop systems from getting too much work whenever a new cruncher joined up, or a new application was released. Since Rosetta is unlike other projects, and work is processed for a selected period of time, having the Estimated completion time set to match the Target CPU time makes sense. If you change the Target CPU time, then the Estimated completion time should also end up matching the newly selected Target CPU time. Grant Darwin NT |
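The concern raised earlier in the thread (a cache sized for 8-hour tasks draining early when tasks finish near 10 000 s) can be put into numbers. This is a deliberately simplified model, not BOINC's actual work-fetch logic; the function names, the 4-core host and the 1-day buffer are all assumptions made for illustration.

```python
def tasks_requested(buffer_days: float, estimated_runtime_s: float, cores: int) -> int:
    """Tasks needed to fill the buffer if every task ran for the estimated time."""
    return int(buffer_days * 86_400 * cores // estimated_runtime_s)


def buffer_hours(tasks: int, actual_runtime_s: float, cores: int) -> float:
    """Wall-clock hours the cached tasks last at their actual runtime."""
    return tasks * actual_runtime_s / cores / 3600


cores = 4                                      # hypothetical host
tasks = tasks_requested(1.0, 28_800, cores)    # estimate = 8 h target runtime
print(tasks)                                   # 12
print(buffer_hours(tasks, 28_800, cores))      # 24.0
print(round(buffer_hours(tasks, 10_000, cores), 1))  # 8.3
```

Under these assumptions a one-day cache would last about a third of a day if every task were a short one, which is the scenario Brian Nixon described; a mix of short and full-length tasks, as other posters report, would land somewhere in between.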
Sid Celery Joined: 11 Feb 08 Posts: 2146 Credit: 41,570,180 RAC: 8,210 |
[quote][quote]I don't think there's anything to complain about (unless you've set a vastly different non-default CPU runtime, in which case there certainly is) - it's just odd. But neither do I think it's an average of past performance like it used to be.[/quote] It was changed by the project to stop systems from getting too much work whenever a new cruncher joined up, or a new application was released.[/quote] Oh. I was aware of that, but seeing as it didn't apply to me at the time, I ignored and obviously forgot it. So that's what it does. Ok, ta |
Grant (SSSF) Joined: 28 Mar 20 Posts: 1734 Credit: 18,532,940 RAC: 17,945 |
Looks like another batch of faulty Work units, all crashed and burned in a matter of seconds. 061020SR_YAAAAAAXO_2-11_199_72102125_2mers_0001_0001_SAVE_ALL_OUT_947694_237_0 061020SR_YAAAAAAXO_2-11_199_72102125_2mers_0001_0001_SAVE_ALL_OUT_947694_323_1 061020SR_YAAAAAAXO_2-11_455_8524850_2mers_0001_0001_SAVE_ALL_OUT_947733_329_0 So far it's 50/50 with these Work Units- half have processed OK and Validated, then there's this half that just errored out. <core_client_version>7.6.22</core_client_version> <![CDATA[ <message> (unknown error) - exit code -1073741819 (0xc0000005) </message> <stderr_txt> command: projects/boinc.bakerlab.org_rosetta/rosetta_4.20_windows_x86_64.exe @061020SR_YAAAAAAXO_2-11_455_8524850_2mers_0001_0001.flags -nstruct 10000 -cpu_run_time 28800 -boinc:max_nstruct 20000 -checkpoint_interval 120 -mute all -database minirosetta_database -in::file::zip minirosetta_database.zip -boinc::watchdog -boinc::cpu_run_timeout 36000 -run::rng mt19937 -constant_seed -jran 1089012 Using database: database_357d5d93529_n_methylminirosetta_database Unhandled Exception Detected... - Unhandled Exception Record - Reason: Access Violation (0xc0000005) at address 0x000001C12A257AD8 Engaging BOINC Windows Runtime Debugger... 
******************** BOINC Windows Runtime Debugger Version 7.9.0 Dump Timestamp : 06/13/20 00:34:32 Install Directory : C:Program FilesBOINC Data Directory : C:ProgramDataBOINC Project Symstore : https://boinc.bakerlab.org/rosetta/symstore LoadLibraryA( C:ProgramDataBOINCdbghelp.dll ): GetLastError = 126 Loaded Library : dbghelp.dll LoadLibraryA( C:ProgramDataBOINCsymsrv.dll ): GetLastError = 126 LoadLibraryA( symsrv.dll ): GetLastError = 126 LoadLibraryA( C:ProgramDataBOINCsrcsrv.dll ): GetLastError = 126 LoadLibraryA( srcsrv.dll ): GetLastError = 126 LoadLibraryA( C:ProgramDataBOINCversion.dll ): GetLastError = 126 Loaded Library : version.dll Debugger Engine : 4.0.5.0 Symbol Search Path: C:ProgramDataBOINCslots ;C:ProgramDataBOINCprojectsboinc.bakerlab.org_rosetta;srv*C:ProgramDataBOINCprojectsboinc.bakerlab.org_rosettasymbols*http://msdl.microsoft.com/download/symbols;srv*C:ProgramDataBOINCprojectsboinc.bakerlab.org_rosettasymbols*https://boinc.bakerlab.org/rosetta/symstore ModLoad: 00000000aea60000 00000000057ef000 C:ProgramDataBOINCprojectsboinc.bakerlab.org_rosettarosetta_4.20_windows_x86_64.exe (-exported- Symbols Loaded) Linked PDB Filename : C:cygwin64homeboinc4.17RosettamainsourceideVisualStudiox64BoincReleaserosetta_4.20_windows_x86_64.pdb ModLoad: 000000008bea0000 00000000001f0000 C:WINDOWSSYSTEM32ntdll.dll (6.2.18362.778) (-exported- Symbols Loaded) Linked PDB Filename : ntdll.pdb File Version : 10.0.18362.329 (WinBuild.160101.0800) Company Name : Microsoft Corporation Product Name : Microsoft® Windows® Operating System Product Version : 10.0.18362.329 ModLoad: 000000008acc0000 00000000000b2000 C:WINDOWSSystem32KERNEL32.DLL (6.2.18362.778) (-exported- Symbols Loaded) Linked PDB Filename : kernel32.pdb File Version : 10.0.18362.329 (WinBuild.160101.0800) Company Name : Microsoft Corporation Product Name : Microsoft® Windows® Operating System Product Version : 10.0.18362.329 ModLoad: 0000000089600000 00000000002a3000 C:WINDOWSSystem32KERNELBASE.dll 
(6.2.18362.778) (-exported- Symbols Loaded) Linked PDB Filename : kernelbase.pdb File Version : 10.0.18362.329 (WinBuild.160101.0800) Company Name : Microsoft Corporation Product Name : Microsoft® Windows® Operating System Product Version : 10.0.18362.329 ModLoad: 000000008b9a0000 000000000006f000 C:WINDOWSSystem32WS2_32.dll (6.2.18362.387) (-exported- Symbols Loaded) Linked PDB Filename : ws2_32.pdb File Version : 10.0.18362.1 (WinBuild.160101.0800) Company Name : Microsoft Corporation Product Name : Microsoft® Windows® Operating System Product Version : 10.0.18362.1 ModLoad: 000000008ade0000 0000000000120000 C:WINDOWSSystem32RPCRT4.dll (6.2.18362.628) (-exported- Symbols Loaded) Linked PDB Filename : rpcrt4.pdb File Version : 10.0.18362.1 (WinBuild.160101.0800) Company Name : Microsoft Corporation Product Name : Microsoft® Windows® Operating System Product Version : 10.0.18362.1 ModLoad: 000000008aa00000 0000000000194000 C:WINDOWSSystem32USER32.dll (6.2.18362.778) (-exported- Symbols Loaded) Linked PDB Filename : user32.pdb File Version : 10.0.17763.802 (WinBuild.160101.0800) Company Name : Microsoft Corporation Product Name : Microsoft® Windows® Operating System Product Version : 10.0.17763.802 ModLoad: 0000000089bd0000 0000000000021000 C:WINDOWSSystem32win32u.dll (6.2.18362.778) (-exported- Symbols Loaded) Linked PDB Filename : win32u.pdb File Version : 10.0.18362.778 (WinBuild.160101.0800) Company Name : Microsoft Corporation Product Name : Microsoft® Windows® Operating System Product Version : 10.0.18362.778 ModLoad: 000000008b590000 0000000000026000 C:WINDOWSSystem32GDI32.dll (6.2.18362.1) (-exported- Symbols Loaded) Linked PDB Filename : gdi32.pdb File Version : 10.0.18362.1 (WinBuild.160101.0800) Company Name : Microsoft Corporation Product Name : Microsoft® Windows® Operating System Product Version : 10.0.18362.1 ModLoad: 0000000089c00000 0000000000194000 C:WINDOWSSystem32gdi32full.dll (6.2.18362.778) (-exported- Symbols Loaded) Linked PDB Filename : 
gdi32full.pdb File Version : 10.0.18362.778 (WinBuild.160101.0800) Company Name : Microsoft Corporation Product Name : Microsoft® Windows® Operating System Product Version : 10.0.18362.778 ModLoad: 0000000089e50000 000000000009e000 C:WINDOWSSystem32msvcp_win.dll (6.2.18362.387) (-exported- Symbols Loaded) Linked PDB Filename : msvcp_win.pdb File Version : 10.0.18362.387 (WinBuild.160101.0800) Company Name : Microsoft Corporation Product Name : Microsoft® Windows® Operating System Product Version : 10.0.18362.387 ModLoad: 0000000089960000 00000000000fa000 C:WINDOWSSystem32ucrtbase.dll (6.2.18362.387) (-exported- Symbols Loaded) Linked PDB Filename : ucrtbase.pdb File Version : 10.0.18362.387 (WinBuild.160101.0800) Company Name : Microsoft Corporation Product Name : Microsoft® Windows® Operating System Product Version : 10.0.18362.387 ModLoad: 000000008af20000 00000000000a3000 C:WINDOWSSystem32ADVAPI32.dll (6.2.18362.752) (-exported- Symbols Loaded) Linked PDB Filename : advapi32.pdb File Version : 10.0.18362.1 (WinBuild.160101.0800) Company Name : Microsoft Corporation Product Name : Microsoft® Windows® Operating System Product Version : 10.0.18362.1 ModLoad: 000000008aba0000 000000000009e000 C:WINDOWSSystem32msvcrt.dll (7.0.18362.1) (-exported- Symbols Loaded) Linked PDB Filename : msvcrt.pdb File Version : 7.0.18362.1 (WinBuild.160101.0800) Company Name : Microsoft Corporation Product Name : Microsoft® Windows® Operating System Product Version : 7.0.18362.1 ModLoad: 000000008afd0000 0000000000097000 C:WINDOWSSystem32sechost.dll (6.2.18362.693) (-exported- Symbols Loaded) Linked PDB Filename : sechost.pdb File Version : 10.0.18362.1 (WinBuild.160101.0800) Company Name : Microsoft Corporation Product Name : Microsoft® Windows® Operating System Product Version : 10.0.18362.1 ModLoad: 000000008a0b0000 000000000002e000 C:WINDOWSSystem32IMM32.DLL (6.2.18362.387) (-exported- Symbols Loaded) Linked PDB Filename : imm32.pdb File Version : 10.0.18362.387 
(WinBuild.160101.0800) Company Name : Microsoft Corporation Product Name : Microsoft® Windows® Operating System Product Version : 10.0.18362.387 ModLoad: 0000000088d70000 0000000000011000 C:WINDOWSSystem32kernel.appcore.dll (6.2.18362.1) (-exported- Symbols Loaded) Linked PDB Filename : Kernel.Appcore.pdb File Version : 10.0.18362.1 (WinBuild.160101.0800) Company Name : Microsoft Corporation Product Name : Microsoft® Windows® Operating System Product Version : 10.0.18362.1 ModLoad: 0000000087dc0000 0000000000031000 C:WINDOWSSYSTEM32ntmarta.dll (6.2.18362.1) (-exported- Symbols Loaded) Linked PDB Filename : ntmarta.pdb File Version : 10.0.18362.1 (WinBuild.160101.0800) Company Name : Microsoft Corporation Product Name : Microsoft® Windows® Operating System Product Version : 10.0.18362.1 ModLoad: 00000000822e0000 00000000001f4000 C:WINDOWSSYSTEM32dbghelp.dll (6.2.18362.1) (-exported- Symbols Loaded) Linked PDB Filename : dbghelp.pdb File Version : 10.0.18362.1 (WinBuild.160101.0800) Company Name : Microsoft Corporation Product Name : Microsoft® Windows® Operating System Product Version : 10.0.18362.1 ModLoad: 0000000089da0000 0000000000080000 C:WINDOWSSystem32bcryptPrimitives.dll (6.2.18362.295) (-exported- Symbols Loaded) Linked PDB Filename : bcryptprimitives.pdb File Version : 10.0.18362.295 (WinBuild.160101.0800) Company Name : Microsoft Corporation Product Name : Microsoft® Windows® Operating System Product Version : 10.0.18362.295 ModLoad: 0000000083400000 000000000000a000 C:WINDOWSSYSTEM32version.dll (6.2.18362.1) (-exported- Symbols Loaded) Linked PDB Filename : version.pdb File Version : 10.0.18362.1 (WinBuild.160101.0800) Company Name : Microsoft Corporation Product Name : Microsoft® Windows® Operating System Product Version : 10.0.18362.1 *** Dump of the Process Statistics: *** - I/O Operations Counters - Read: 5002, Write: 646, Other 13717 - I/O Transfers Counters - Read: 14512312, Write: 19118, Other 6542 - Paged Pool Usage - QuotaPagedPoolUsage: 317448, 
QuotaPeakPagedPoolUsage: 317576 QuotaNonPagedPoolUsage: 6792, QuotaPeakNonPagedPoolUsage: 7352 - Virtual Memory Usage - VirtualSize: 83120128, PeakVirtualSize: 895655936 - Pagefile Usage - PagefileUsage: 83120128, PeakPagefileUsage: 83128320 - Working Set Size - WorkingSetSize: 103743488, PeakWorkingSetSize: 103747584, PageFaultCount: 25733 *** Dump of thread ID 3488 (state: Initialized): *** - Information - Status: Base Priority: Normal, Priority: Normal, , Kernel Time: 0.000000, User Time: 0.000000, Wait Time: 0.000000 - Unhandled Exception Record - Reason: Access Violation (0xc0000005) at address 0x000001C12A257AD8 - Registers - rax=000000000000003a rbx=00000000296147b0 rcx=000000002a00cad0 rdx=000000002a0ecc08 rsi=000000000000000b rdi=000000002a00cad0 r8=000000000000003a r9=0000000000000421 r10=00000000b2606e80 r11=000000000a1450c0 r12=00000000aea60000 r13=000000000a15f7d0 r14=000000000a145800 r15=000000000048b215 rip=000000002a257ad8 rsp=000000000a145138 rbp=0000000000000000 cs=0033 ss=002b ds=002b es=002b fs=0053 gs=002b efl=00010202 - Callstack - ChildEBP RetAddr Args to Child 0a145130 aef3831c 00000000 b2606d60 b2606e80 b25ebe78 !+0x0 SymFromAddr(): GetLastError = '126' SymGetModuleInfo(): GetLastError = '126' Address = '2a257ad8' 0a145160 aeef935d 296147b0 0a145200 0a145980 aeee355d rosetta_4.20_windows_x86_64!xmlParserInputRead+0x0 SymFromAddr(): GetLastError = '126' SymGetModuleInfo(): GetLastError = '126' Address = 'aef3831c' 0a145190 b2067f10 b2f50150 0a15f7d0 00000000 aeee3265 rosetta_4.20_windows_x86_64!xmlParserInputRead+0x0 SymFromAddr(): GetLastError = '126' SymGetModuleInfo(): GetLastError = '126' Address = 'aeef935d' 0a1451c0 aeee39e8 0a145e70 041c3000 0a1457c8 0a145850 rosetta_4.20_windows_x86_64!xmlValidateNotationDecl+0x0 SymFromAddr(): GetLastError = '126' SymGetModuleInfo(): GetLastError = '126' Address = 'b2067f10' 0a145230 8bf411cf 00000000 0a1457b0 0a145e70 0a145e70 rosetta_4.20_windows_x86_64!xmlParserInputRead+0x0 SymFromAddr(): 
GetLastError = '126' SymGetModuleInfo(): GetLastError = '126' Address = 'aeee39e8' 0a145260 8bf0a209 00000001 aea60000 00000000 b3ffa32c ntdll!__chkstk+0x0 SymFromAddr(): GetLastError = '126' SymGetModuleInfo(): GetLastError = '126' Address = '8bf411cf' 0a145970 8bf3fe3e 29600000 8bedb997 b26da450 8bedc43f ntdll!RtlRaiseException+0x0 SymFromAddr(): GetLastError = '126' SymGetModuleInfo(): GetLastError = '126' Address = '8bf0a209' 0a1460f0 af1a3e2b fffffffe 2da77558 ffffffff af1b18c5 ntdll!KiUserExceptionDispatcher+0x0 SymFromAddr(): GetLastError = '126' SymGetModuleInfo(): GetLastError = '126' Address = '8bf3fe3e' 0a146140 af1b3690 b26da3a0 2da772b0 b26da3a0 0a146239 rosetta_4.20_windows_x86_64!cppdb::session::is_open+0x0 SymFromAddr(): GetLastError = '126' SymGetModuleInfo(): GetLastError = '126' Address = 'af1a3e2b' 0a146270 af2c9ee8 2d2c83d8 2da09e10 2da772b0 2da09e10 rosetta_4.20_windows_x86_64!cppdb::session::is_open+0x0 SymFromAddr(): GetLastError = '126' SymGetModuleInfo(): GetLastError = '126' Address = 'af1b3690' 0a146e20 af264b6c 2dd8e950 8bedb997 29540000 00000000 rosetta_4.20_windows_x86_64!cppdb::backend::statement::cache+0x0 SymFromAddr(): GetLastError = '126' SymGetModuleInfo(): GetLastError = '126' Address = 'af2c9ee8' 0a147020 af26488e 0a147108 00000000 0a1472f0 00000000 rosetta_4.20_windows_x86_64!cppdb::backend::statement::cache+0x0 SymFromAddr(): GetLastError = '126' SymGetModuleInfo(): GetLastError = '126' Address = 'af264b6c' 0a147180 af1c3da1 0a1472f8 00000000 29614000 0a1473c0 rosetta_4.20_windows_x86_64!cppdb::backend::statement::cache+0x0 SymFromAddr(): GetLastError = '126' SymGetModuleInfo(): GetLastError = '126' Address = 'af26488e' 0a147540 af1c9f08 0a147890 0a147890 0a147890 00000000 rosetta_4.20_windows_x86_64!cppdb::backend::statement::cache+0x0 SymFromAddr(): GetLastError = '126' SymGetModuleInfo(): GetLastError = '126' Address = 'af1c3da1' 0a147b90 af1c84db 29cf21d0 0a147bf0 29bc0080 29bc0080 
rosetta_4.20_windows_x86_64!cppdb::backend::statement::cache+0x0 SymFromAddr(): GetLastError = '126' SymGetModuleInfo(): GetLastError = '126' Address = 'af1c9f08' 0a147cf0 af131fb7 00000000 0a147e00 29bc0080 0a148000 rosetta_4.20_windows_x86_64!cppdb::backend::statement::cache+0x0 SymFromAddr(): GetLastError = '126' SymGetModuleInfo(): GetLastError = '126' Address = 'af1c84db' 0a147e60 af1357a6 00000005 aeed5190 29fc8ba0 29fc8ba0 rosetta_4.20_windows_x86_64!cppdb::session::is_open+0x0 SymFromAddr(): GetLastError = '126' SymGetModuleInfo(): GetLastError = '126' Address = 'af131fb7' 0a147ed0 af1356cc 0a1481d8 0a148049 0a1481d8 29bc0080 rosetta_4.20_windows_x86_64!cppdb::session::is_open+0x0 SymFromAddr(): GetLastError = '126' SymGetModuleInfo(): GetLastError = '126' Address = 'af1357a6' 0a147f80 af1fb6f5 0a1481d8 0a148541 00000000 aeef75e8 rosetta_4.20_windows_x86_64!cppdb::session::is_open+0x0 SymFromAddr(): GetLastError = '126' SymGetModuleInfo(): GetLastError = '126' Address = 'af1356cc' 0a1480a0 af1fa592 00000005 0a1481d8 0a1483b0 00000000 rosetta_4.20_windows_x86_64!cppdb::backend::statement::cache+0x0 SymFromAddr(): GetLastError = '126' SymGetModuleInfo(): GetLastError = '126' Address = 'af1fb6f5' 0a148170 af1fad06 00000000 00000000 0a148a90 29540000 rosetta_4.20_windows_x86_64!cppdb::backend::statement::cache+0x0 SymFromAddr(): GetLastError = '126' SymGetModuleInfo(): GetLastError = '126' Address = 'af1fa592' 0a148310 af6571a3 0a1483b0 0a148a90 ffffff01 aeee3e73 rosetta_4.20_windows_x86_64!cppdb::backend::statement::cache+0x0 SymFromAddr(): GetLastError = '126' SymGetModuleInfo(): GetLastError = '126' Address = 'af1fad06' 0a148600 af659d09 00000000 00000001 0a148710 0a148a90 rosetta_4.20_windows_x86_64!cppdb::backend::statement::cache+0x0 SymFromAddr(): GetLastError = '126' SymGetModuleInfo(): GetLastError = '126' Address = 'af6571a3' 0a148990 af652f8a 0a1489d0 0a148a90 2ce1b570 29c22450 rosetta_4.20_windows_x86_64!cppdb::backend::statement::cache+0x0 
SymFromAddr(): GetLastError = '126' SymGetModuleInfo(): GetLastError = '126' Address = 'af659d09' 0a1489f0 af86cc70 0a148a90 0a1491b8 29fc8ba0 00000000 rosetta_4.20_windows_x86_64!cppdb::backend::statement::cache+0x0 SymFromAddr(): GetLastError = '126' SymGetModuleInfo(): GetLastError = '126' Address = 'af652f8a' 0a149180 af86c6e4 2dc91a70 2dbaeb90 b3ed5cc0 aeed75a6 rosetta_4.20_windows_x86_64!cppdb::backend::statement::cache+0x0 SymFromAddr(): GetLastError = '126' SymGetModuleInfo(): GetLastError = '126' Address = 'af86cc70' 0a1491e0 af87603e 0a1492d0 2dc917a0 0a1492f0 0a149a40 rosetta_4.20_windows_x86_64!cppdb::backend::statement::cache+0x0 SymFromAddr(): GetLastError = '126' SymGetModuleInfo(): GetLastError = '126' Address = 'af86c6e4' 0a149960 af8756d4 b27e312e b27e301e b3e47f70 af896cb4 rosetta_4.20_windows_x86_64!cppdb::backend::statement::cache+0x0 SymFromAddr(): GetLastError = '126' SymGetModuleInfo(): GetLastError = '126' Address = 'af87603e' 0a1499f0 af87578e 00000005 0a149f98 29c22450 00000001 rosetta_4.20_windows_x86_64!cppdb::backend::statement::cache+0x0 SymFromAddr(): GetLastError = '126' SymGetModuleInfo(): GetLastError = '126' Address = 'af8756d4' 0a149b90 aeee081d 29eeb820 29eeb820 29c22450 29615d01 rosetta_4.20_windows_x86_64!cppdb::backend::statement::cache+0x0 SymFromAddr(): GetLastError = '126' SymGetModuleInfo(): GetLastError = '126' Address = 'af87578e' 0a15f7c0 aeeeb215 00000000 00000000 b3e0ccf8 00000000 rosetta_4.20_windows_x86_64!xmlParserInputRead+0x0 SymFromAddr(): GetLastError = '126' SymGetModuleInfo(): GetLastError = '126' Address = 'aeee081d' 0a15f800 8acd7bd4 00000000 00000000 00000000 00000000 rosetta_4.20_windows_x86_64!xmlParserInputRead+0x0 SymFromAddr(): GetLastError = '126' SymGetModuleInfo(): GetLastError = '126' Address = 'aeeeb215' 0a15f830 8bf0ce51 00000000 00000000 00000000 00000000 KERNEL32!BaseThreadInitThunk+0x0 SymFromAddr(): GetLastError = '126' SymGetModuleInfo(): GetLastError = '126' Address = '8acd7bd4' 0a15f8b0 
00000000 00000000 00000000 00000000 00000000 ntdll!RtlUserThreadStart+0x0 SymFromAddr(): GetLastError = '126' SymGetModuleInfo(): GetLastError = '126' Address = '8bf0ce51' *** Dump of thread ID 32763 (state: Initialized): *** - Information - Status: Base Priority: Normal, Priority: Unknown, , Kernel Time: 6.000000, User Time: 0.000000, Wait Time: 3280771328.000000 - Registers - rax=0000000000000000 rbx=0000000000000000 rcx=0000000000000000 rdx=0000000000000000 rsi=0000000000000000 rdi=0000000000000000 r8=0000000000000000 r9=0000000000000000 r10=0000000000000000 r11=0000000000000000 r12=0000000000000000 r13=0000000000000000 r14=0000000000000000 r15=0000000000000000 rip=0000000000000000 rsp=0000000000000000 rbp=0000000000000000 cs=0000 ss=0000 ds=0000 es=0000 fs=0000 gs=0000 efl=00000000 - Callstack - ChildEBP RetAddr Args to Child (-nosymbols- PC == 0) 00000000 00000000 00000000 00000000 00000000 00000000 !+0x0 *** Dump of thread ID 30818506 (state: Unknown): *** - Information - Status: Base Priority: Normal, Priority: Unknown, , Kernel Time: 17179869184.000000, User Time: 21474836480.000000, Wait Time: 0.000000 - Registers - rax=0000000000000000 rbx=0000000000000000 rcx=0000000000000000 rdx=0000000000000000 rsi=0000000000000000 rdi=0000000000000000 r8=0000000000000000 r9=0000000000000000 r10=0000000000000000 r11=0000000000000000 r12=0000000000000000 r13=0000000000000000 r14=0000000000000000 r15=0000000000000000 rip=0000000000000000 rsp=0000000000000000 rbp=0000000000000000 cs=0000 ss=0000 ds=0000 es=0000 fs=0000 gs=0000 efl=00000000 - Callstack - ChildEBP RetAddr Args to Child (-nosymbols- PC == 0) 00000000 00000000 00000000 00000000 00000000 00000000 !+0x0 *** Debug Message Dump **** *** Foreground Window Data *** Window Name : Window Class : Window Process ID: 0 Window Thread ID : 0 Exiting... </stderr_txt> ]]> Grant Darwin NT |
Tomcat雄猫 Joined: 20 Dec 14 Posts: 180 Credit: 5,386,173 RAC: 0 |
I wonder what this is? Ran for about half an hour before erroring out. rb_06_13_29145_28622_ab_t000__robetta_cstwt_5.0_FT_IGNORE_THE_REST_06_05_947971_63_1 <core_client_version>7.16.7</core_client_version> <![CDATA[ <message> Incorrect function. (0x1) - exit code 1 (0x1)</message> <stderr_txt> command: projects/boinc.bakerlab.org_rosetta/rosetta_4.20_windows_x86_64.exe @rb_06_13_29145_28622_ab_t000__robetta_FLAGS -in::file::fasta t000_.fasta -jumps:pairing_file t000_.fasta.bbcontacts.jumps -jumps:random_sheets 9 1 -constraints::cst_file t000_.fasta.CB.cst -constraints:cst_weight 5.0 -constraints::cst_fa_file t000_.fasta.MIN.cst -constraints:cst_fa_weight 5.0 -in:file:boinc_wu_zip rb_06_13_29145_28622_ab_t000__robetta.zip -frag3 rb_06_13_29145_28622_ab_t000__robetta.200.3mers.index.gz -fragA rb_06_13_29145_28622_ab_t000__robetta.200.5mers.index.gz -fragB rb_06_13_29145_28622_ab_t000__robetta.200.6mers.index.gz -nstruct 10000 -cpu_run_time 28800 -boinc:max_nstruct 20000 -checkpoint_interval 120 -mute all -database minirosetta_database -in::file::zip minirosetta_database.zip -boinc::watchdog -boinc::cpu_run_timeout 36000 -run::rng mt19937 -constant_seed -jran 2496535 Using database: database_357d5d93529_n_methylminirosetta_database [ ERROR ]: Caught exception: File: C:cygwin64homeboinc4.17Rosettamainsourcesrccore/pack/dunbrack/SingleResidueDunbrackLibrary.hh:306 chi angle must be between -180 and 180: -nan(ind) ------------------------ Begin developer's backtrace ------------------------- BACKTRACE: ------------------------- End developer's backtrace -------------------------- AN INTERNAL ERROR HAS OCCURED. PLEASE SEE THE CONTENTS OF ROSETTA_CRASH.log FOR DETAILS. </stderr_txt> ]]> |
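The message "chi angle must be between -180 and 180: -nan(ind)" shows a range check being handed a NaN (`-nan(ind)` is how the Microsoft C runtime prints an indeterminate NaN). The sketch below is not Rosetta's code; `chi_in_range` is a hypothetical stand-in that shows why any such check rejects NaN: every ordered comparison with NaN is false.

```python
import math


def chi_in_range(chi: float) -> bool:
    """Range check in the spirit of the quoted error: -180 <= chi <= 180."""
    return -180.0 <= chi <= 180.0


print(chi_in_range(-179.5))    # True
print(chi_in_range(math.nan))  # False: comparisons with NaN are always false
print(math.isnan(math.nan))    # True: the explicit way to detect it
```

Where the NaN came from in this task is not visible in the log; the snippet only explains why the check fired.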
James W Joined: 25 Nov 12 Posts: 130 Credit: 1,766,254 RAC: 0 |
This morning (6/25) I had 10 errors on my 2 hosts (one with device 3710630 and 9 with device 1759960). Name: Series "rb_06_25_30616_30000__t000__ab_robetta_IGNORE_THE_REST_****" Application: Rosetta v4.20 windows_x86_64 Sample Task for Host 3710630: 1210226936. Sample WU for same host: 1086224610. Status: Error while downloading. Exit status: -186 (0xFFFFFF46) ERR_RESULT_DOWNLOAD Stderr output: WU download error: couldn't get input files: Sample Task for Host 1759960: 1210207254. Sample WU for same host: 1086211597. Status: Error while downloading. Exit status: -186 (0xFFFFFF46) ERR_RESULT_DOWNLOAD Stderr output: WU download error: couldn't get input files: Currently using BOINC Manager V7.16.5 on both hosts. Would the updated Manager V7.16.7 fix this particular problem? |
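The exit codes in this thread are shown both as negative decimals and as hex, for example -186 (0xFFFFFF46) here and -1073741819 (0xc0000005) in the access-violation report above. The two forms are the same 32-bit value read as signed or unsigned. A small sketch of the conversion (the helper names are mine, not BOINC's):

```python
def as_unsigned32(code: int) -> str:
    """Render a signed 32-bit exit code as unsigned hex."""
    return f"0x{code & 0xFFFFFFFF:08X}"


def as_signed32(value: int) -> int:
    """Interpret an unsigned 32-bit value as a signed (two's complement) integer."""
    return value - (1 << 32) if value & 0x80000000 else value


print(as_unsigned32(-186))         # 0xFFFFFF46
print(as_unsigned32(-1073741819))  # 0xC0000005
print(as_signed32(0xC0000005))     # -1073741819
```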
Grant (SSSF) Joined: 28 Mar 20 Posts: 1734 Credit: 18,532,940 RAC: 17,945 |
[quote]Would the updated Manager V7.16.7 fix this particular problem?[/quote] Nope. I'd suggest rebooting your system & modem. If download issues are still occurring, I'd check your AV/security software to see if there have been any recent updates that are now clobbering Rosetta downloads (although you'll have to wait for some new work to be loaded before you'll be able to see if that helps). Grant Darwin NT |
James W Joined: 25 Nov 12 Posts: 130 Credit: 1,766,254 RAC: 0 |
I did get a batch of tasks after these failed ones, including some in same series as problem ones, including WU 1086231903. I'm the wingman, with original task failing due to downloading issue I had previously. This one has been crunching away for over 5 hrs. and 40 mins. so far, with expected 2 hrs. & 50 mins. to go. I have 3 other tasks in this series (rb_06_25_30623_30025_ab_t000__h001_robetta_****) running as well on host 1759960. |
Grant (SSSF) Joined: 28 Mar 20 Posts: 1734 Credit: 18,532,940 RAC: 17,945 |
[quote]I'm the wingman, with original task failing due to downloading issue I had previously.[/quote] With the other copy also giving a download error & the "missing input file" part of the error message, it could have been a server issue: the file missing from the download server (or at least not where it was expected to be). Grant Darwin NT |
James W Joined: 25 Nov 12 Posts: 130 Credit: 1,766,254 RAC: 0 |
I just noted that 9 of the 10 download error tasks I had were reissued to a host (3791293) which has not contacted server since 6/25. This host also timed out multiple tasks (252). Not a good choice to use as wingman! These 9 tasks will no doubt time out as well, as last valid tasks for this host were on 6/23/20! |
James W Joined: 25 Nov 12 Posts: 130 Credit: 1,766,254 RAC: 0 |
Appears your assumption is correct, as my host validly processed these previously erroneous tasks. [quote][quote]I'm the wingman, with original task failing due to downloading issue I had previously.[/quote] With the other copy also giving a download error & the "missing input file" part of the error message, it could have been a server issue- the file missing from the download server (or at least not where it was expected to be).[/quote] |
James W Joined: 25 Nov 12 Posts: 130 Credit: 1,766,254 RAC: 0 |
[quote]7/8/2020 3:58:48 PM | Rosetta@home | Task 81efa213_fold_SAVE_ALL_OUT_951627_1211_0 exited with zero status but no 'finished' file[/quote] This is the first time I've seen this happen on Rosetta 4.20. It happened all the time with Mini tasks. Host: 1759960 WU: 1091525816 Task: 1216339486 At first I thought another Mini app had snuck in. Anyone else seen this "error" with 4.20? |
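For anyone wanting to check how often this message shows up, here is a minimal sketch that splits an event-log line of the shape quoted above into timestamp, project and task name. The function name `parse_event` is hypothetical, and the month/day/year order is an assumption based on the US-style dates used elsewhere in this thread; other locales may format the timestamp differently.

```python
import re
from datetime import datetime


def parse_event(line: str):
    """Split 'timestamp | project | message' and pull out the task name, if any."""
    stamp, project, message = (part.strip() for part in line.split("|", 2))
    # Assumed format: month/day/year with a 12-hour clock, as in the quoted line.
    when = datetime.strptime(stamp, "%m/%d/%Y %I:%M:%S %p")
    match = re.match(
        r"Task (\S+) exited with zero status but no 'finished' file", message
    )
    task = match.group(1) if match else None
    return when, project, task


line = ("7/8/2020 3:58:48 PM | Rosetta@home | Task "
        "81efa213_fold_SAVE_ALL_OUT_951627_1211_0 exited with zero status "
        "but no 'finished' file")
when, project, task = parse_event(line)
print(when.isoformat(), project, task)
```

Lines that do not carry this particular message come back with `task` set to `None`, so the same function can be run over a whole log and filtered.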
Sid Celery Joined: 11 Feb 08 Posts: 2146 Credit: 41,570,180 RAC: 8,210 |
[quote][quote]7/8/2020 3:58:48 PM | Rosetta@home | Task 81efa213_fold_SAVE_ALL_OUT_951627_1211_0 exited with zero status but no 'finished' file[/quote] This is the first time I've seen this happen on Rosetta 4.20. It happened all the time with Mini tasks.[/quote] Are you sure that's the same task? Your link goes to COVID_jJHRs_perturb_SAVE_ALL_OUT_IGNORE_THE_REST_9za7zv9d_953350_4 and it looks fine, as do all your reported tasks. |
James W Joined: 25 Nov 12 Posts: 130 Credit: 1,766,254 RAC: 0 |
[quote]Are you sure that's the same task? Your link goes to COVID_jJHRs_perturb_SAVE_ALL_OUT_IGNORE_THE_REST_9za7zv9d_953350_4 and it looks fine, as do all your reported tasks.[/quote] Sorry, picked the wrong WU and task somehow. Should be: WU: 1091720486 Task: 1216562014 It's currently still in process. I note now that the error happened about 3 minutes into crunching and did not occur again. Hopefully just a fluke. |
©2024 University of Washington
https://www.bakerlab.org