Rosetta 4.1+ and 4.2+

Message boards : Number crunching : Rosetta 4.1+ and 4.2+

To post messages, you must log in.

Previous · 1 . . . 14 · 15 · 16 · 17 · 18 · 19 · 20 . . . 34 · Next

AuthorMessage
Brian Nixon

Send message
Joined: 12 Apr 20
Posts: 293
Credit: 8,432,366
RAC: 0
Message 97268 - Posted: 7 Jun 2020, 11:37:26 UTC - in response to Message 97227.  

I just leave ’em be…
That said: if tasks that are expected to take 8 hours only take 2¾ (the 12V ones seem to be targeting 10 000 s), it could potentially mess up BOINC’s work calculation and cause hosts with infrequent Internet connections to run out of work prematurely…
ID: 97268 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
James W

Send message
Joined: 25 Nov 12
Posts: 130
Credit: 1,766,254
RAC: 0
Message 97270 - Posted: 7 Jun 2020, 12:10:34 UTC - in response to Message 97268.  

I just leave ’em be…
That said: if tasks that are expected to take 8 hours only take 2¾ (the 12V ones seem to be targeting 10 000 s), it could potentially mess up BOINC’s work calculation and cause hosts with infrequent Internet connections to run out of work prematurely…

Though 12V3AL*** tasks may have been running "short," other 12V*** tasks I've processed have run the full 8 hours or more on my two hosts. I therefore doubt that the observed short-running tasks would have a major impact on overall BOINC work calculations, etc.
ID: 97270 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Profile Grant (SSSF)

Send message
Joined: 28 Mar 20
Posts: 1680
Credit: 17,850,850
RAC: 22,496
Message 97273 - Posted: 7 Jun 2020, 20:10:14 UTC - in response to Message 97270.  

I just leave ’em be…
That said: if tasks that are expected to take 8 hours only take 2¾ (the 12V ones seem to be targeting 10 000 s), it could potentially mess up BOINC’s work calculation and cause hosts with infrequent Internet connections to run out of work prematurely…
Though 12V3AL*** tasks may have been running "short," other 12V*** tasks I've processed have run the full 8 hours or more on my two hosts. I therefore doubt that the observed short-running tasks would have a major impact on overall BOINC work calculations, etc.
Yep.
Although i've been getting plenty of the shorter running Tasks, most Tasks run close to the Target CPU Runtime and my Estimated completion times are still at 7:59:59.
An Estimated completion time within 1 second of the Target CPU Runtime is pretty good in my opinion.
Grant
Darwin NT
ID: 97273 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Tomcat雄猫

Send message
Joined: 20 Dec 14
Posts: 180
Credit: 5,386,173
RAC: 0
Message 97279 - Posted: 8 Jun 2020, 1:58:29 UTC

Got this error
[url]Junior_HalfRoid_design6_cart_COVID-19_SAVE_ALL_OUT_IGNORE_THE_REST_4pd4vc0f_929576_40_0 url=https://boinc.bakerlab.org/rosetta/result.php?resultid=1199104948[/url]
<core_client_version>7.16.7</core_client_version>
<![CDATA[
<message>
Incorrect function.
 (0x1) - exit code 1 (0x1)</message>
<stderr_txt>
command: projects/boinc.bakerlab.org_rosetta/rosetta_4.20_windows_x86_64.exe -run:protocol jd2_scripting -parser:protocol jhr_boinc_v4_cart.xml @flags -in:file:silent Junior_HalfRoid_design6_cart_COVID-19_SAVE_ALL_OUT_IGNORE_THE_REST_4pd4vc0f.silent -in:file:silent_struct_type binary -silent_gz -mute all -silent_read_through_errors true -out:file:silent_struct_type binary -out:file:silent default.out -in:file:boinc_wu_zip Junior_HalfRoid_design6_cart_COVID-19_SAVE_ALL_OUT_IGNORE_THE_REST_4pd4vc0f.zip @Junior_HalfRoid_design6_cart_COVID-19_SAVE_ALL_OUT_IGNORE_THE_REST_4pd4vc0f.flags -nstruct 10000 -cpu_run_time 28800 -boinc:max_nstruct 20000 -checkpoint_interval 120 -database minirosetta_database -in::file::zip minirosetta_database.zip -boinc::watchdog -boinc::cpu_run_timeout 36000 -run::rng mt19937 -constant_seed -jran 1052475
Using database: database_357d5d93529_n_methylminirosetta_database

ERROR: [ERROR] Unable to open constraints file: 731194738833825cfcfe8f26a04b26f3_n1_c0_1_0001.MSAcst
ERROR:: Exit from: ......srccorescoringconstraintsConstraintIO.cc line: 457
BOINC:: Error reading and gzipping output datafile: default.out
18:29:4
4 (18508): called boinc_finish(1)

</stderr_txt>
]]>[/quote][/code]
ID: 97279 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
crystalsys
Avatar

Send message
Joined: 11 Aug 09
Posts: 8
Credit: 1,630,028
RAC: 416
Message 97284 - Posted: 8 Jun 2020, 12:07:41 UTC
Last modified: 8 Jun 2020, 12:17:30 UTC

Android - I have multiple tasks that have been saying uploading for days, and it isn't getting new tasks. And it says 'Nothing to do'. Also some that have been saying downloading, but nothing is happening.

Maybe in wrong thread? Can't delete?
ID: 97284 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
monk_duck

Send message
Joined: 17 Nov 09
Posts: 11
Credit: 284,039
RAC: 0
Message 97293 - Posted: 8 Jun 2020, 16:24:10 UTC - in response to Message 97284.  

Android - I have multiple tasks that have been saying uploading for days, and it isn't getting new tasks. And it says 'Nothing to do'. Also some that have been saying downloading, but nothing is happening.

Maybe in wrong thread? Can't delete?



You want this thread https://boinc.bakerlab.org/rosetta/forum_thread.php?id=14006 there is a certificate issue with the boinc software (a lot of companies were caught out by this), we're awaiting a new build to hit Google Play.
ID: 97293 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Sid Celery

Send message
Joined: 11 Feb 08
Posts: 2124
Credit: 41,228,077
RAC: 11,048
Message 97299 - Posted: 9 Jun 2020, 5:53:14 UTC - in response to Message 97273.  

I just leave ’em be…
That said: if tasks that are expected to take 8 hours only take 2¾ (the 12V ones seem to be targeting 10 000 s), it could potentially mess up BOINC’s work calculation and cause hosts with infrequent Internet connections to run out of work prematurely…
Though 12V3AL*** tasks may have been running "short," other 12V*** tasks I've processed have run the full 8 hours or more on my two hosts. I therefore doubt that the observed short-running tasks would have a major impact on overall BOINC work calculations, etc.
Yep.
Although i've been getting plenty of the shorter running Tasks, most Tasks run close to the Target CPU Runtime and my Estimated completion times are still at 7:59:59.
An Estimated completion time within 1 second of the Target CPU Runtime is pretty good in my opinion.

I understand - and agree - with the premise of the question, but I've just reported a task that ran 3hr 15mins and my estimated times are still 8:00:00.
Fact is, none of my tasks run this amount of time, now or ever in the past. But my estimated times have been between 1 or 2secs of 8hrs for maybe a week.
I don't think there's anything to complain about (unless you've set a vastly different non-default CPU runtime, in which case there certainly is) - it's just odd. But neither do I think it's an average of past performance like it used to be.
Something's definitely changed, but I'll leave it to those who have problems to report it
ID: 97299 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Profile Grant (SSSF)

Send message
Joined: 28 Mar 20
Posts: 1680
Credit: 17,850,850
RAC: 22,496
Message 97301 - Posted: 9 Jun 2020, 6:02:01 UTC - in response to Message 97299.  

I don't think there's anything to complain about (unless you've set a vastly different non-default CPU runtime, in which case there certainly is) - it's just odd. But neither do I think it's an average of past performance like it used to be.
Something's definitely changed, but I'll leave it to those who have problems to report it
It was changed by the project to stop systems from getting too much work whenever a new cruncher joined up, or a new application was released.
Since Rosetta is unlike other projects, and work is processed for a selected period of time, having the Estimated completion time being set to be the same as the Target CPU time makes sense.

If you change the Target CPU time, then the Estimated completion time should also end up matching that newly selected Target CPU time.
Grant
Darwin NT
ID: 97301 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Sid Celery

Send message
Joined: 11 Feb 08
Posts: 2124
Credit: 41,228,077
RAC: 11,048
Message 97315 - Posted: 10 Jun 2020, 8:11:45 UTC - in response to Message 97301.  

I don't think there's anything to complain about (unless you've set a vastly different non-default CPU runtime, in which case there certainly is) - it's just odd. But neither do I think it's an average of past performance like it used to be.
Something's definitely changed, but I'll leave it to those who have problems to report it
It was changed by the project to stop systems from getting too much work whenever a new cruncher joined up, or a new application was released.
Since Rosetta is unlike other projects, and work is processed for a selected period of time, having the Estimated completion time being set to be the same as the Target CPU time makes sense.

If you change the Target CPU time, then the Estimated completion time should also end up matching that newly selected Target CPU time.

Oh. I was aware of that, but seeing as it didn't apply to me at the time, I ignored and obviously forgot it.
So that's what it does. Ok, ta
ID: 97315 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Profile Grant (SSSF)

Send message
Joined: 28 Mar 20
Posts: 1680
Credit: 17,850,850
RAC: 22,496
Message 97349 - Posted: 12 Jun 2020, 22:10:56 UTC
Last modified: 12 Jun 2020, 22:16:09 UTC

Looks like another batch of faulty Work units, all crashed and burned in a matter of seconds.

061020SR_YAAAAAAXO_2-11_199_72102125_2mers_0001_0001_SAVE_ALL_OUT_947694_237_0
061020SR_YAAAAAAXO_2-11_199_72102125_2mers_0001_0001_SAVE_ALL_OUT_947694_323_1
061020SR_YAAAAAAXO_2-11_455_8524850_2mers_0001_0001_SAVE_ALL_OUT_947733_329_0

So far it's 50/50 with these Work Units- half have processed OK and Validated, then there's this half that just errored out.


<core_client_version>7.6.22</core_client_version>
<![CDATA[
<message>
(unknown error) - exit code -1073741819 (0xc0000005)
</message>
<stderr_txt>
command: projects/boinc.bakerlab.org_rosetta/rosetta_4.20_windows_x86_64.exe @061020SR_YAAAAAAXO_2-11_455_8524850_2mers_0001_0001.flags -nstruct 10000 -cpu_run_time 28800 -boinc:max_nstruct 20000 -checkpoint_interval 120 -mute all -database minirosetta_database -in::file::zip minirosetta_database.zip -boinc::watchdog -boinc::cpu_run_timeout 36000 -run::rng mt19937 -constant_seed -jran 1089012
Using database: database_357d5d93529_n_methylminirosetta_database


Unhandled Exception Detected...

- Unhandled Exception Record -
Reason: Access Violation (0xc0000005) at address 0x000001C12A257AD8 

Engaging BOINC Windows Runtime Debugger...



********************


BOINC Windows Runtime Debugger Version 7.9.0


Dump Timestamp    : 06/13/20 00:34:32
Install Directory : C:Program FilesBOINC
Data Directory    : C:ProgramDataBOINC
Project Symstore  : https://boinc.bakerlab.org/rosetta/symstore
LoadLibraryA( C:ProgramDataBOINCdbghelp.dll ): GetLastError = 126
Loaded Library    : dbghelp.dll
LoadLibraryA( C:ProgramDataBOINCsymsrv.dll ): GetLastError = 126
LoadLibraryA( symsrv.dll ): GetLastError = 126
LoadLibraryA( C:ProgramDataBOINCsrcsrv.dll ): GetLastError = 126
LoadLibraryA( srcsrv.dll ): GetLastError = 126
LoadLibraryA( C:ProgramDataBOINCversion.dll ): GetLastError = 126
Loaded Library    : version.dll
Debugger Engine   : 4.0.5.0
Symbol Search Path: C:ProgramDataBOINCslots;C:ProgramDataBOINCprojectsboinc.bakerlab.org_rosetta;srv*C:ProgramDataBOINCprojectsboinc.bakerlab.org_rosettasymbols*http://msdl.microsoft.com/download/symbols;srv*C:ProgramDataBOINCprojectsboinc.bakerlab.org_rosettasymbols*https://boinc.bakerlab.org/rosetta/symstore


ModLoad: 00000000aea60000 00000000057ef000 C:ProgramDataBOINCprojectsboinc.bakerlab.org_rosettarosetta_4.20_windows_x86_64.exe (-exported- Symbols Loaded)
    Linked PDB Filename   : C:cygwin64homeboinc4.17RosettamainsourceideVisualStudiox64BoincReleaserosetta_4.20_windows_x86_64.pdb

ModLoad: 000000008bea0000 00000000001f0000 C:WINDOWSSYSTEM32ntdll.dll (6.2.18362.778) (-exported- Symbols Loaded)
    Linked PDB Filename   : ntdll.pdb
    File Version          : 10.0.18362.329 (WinBuild.160101.0800)
    Company Name          : Microsoft Corporation
    Product Name          : Microsoft&#174; Windows&#174; Operating System
    Product Version       : 10.0.18362.329

ModLoad: 000000008acc0000 00000000000b2000 C:WINDOWSSystem32KERNEL32.DLL (6.2.18362.778) (-exported- Symbols Loaded)
    Linked PDB Filename   : kernel32.pdb
    File Version          : 10.0.18362.329 (WinBuild.160101.0800)
    Company Name          : Microsoft Corporation
    Product Name          : Microsoft&#174; Windows&#174; Operating System
    Product Version       : 10.0.18362.329

ModLoad: 0000000089600000 00000000002a3000 C:WINDOWSSystem32KERNELBASE.dll (6.2.18362.778) (-exported- Symbols Loaded)
    Linked PDB Filename   : kernelbase.pdb
    File Version          : 10.0.18362.329 (WinBuild.160101.0800)
    Company Name          : Microsoft Corporation
    Product Name          : Microsoft&#174; Windows&#174; Operating System
    Product Version       : 10.0.18362.329

ModLoad: 000000008b9a0000 000000000006f000 C:WINDOWSSystem32WS2_32.dll (6.2.18362.387) (-exported- Symbols Loaded)
    Linked PDB Filename   : ws2_32.pdb
    File Version          : 10.0.18362.1 (WinBuild.160101.0800)
    Company Name          : Microsoft Corporation
    Product Name          : Microsoft&#174; Windows&#174; Operating System
    Product Version       : 10.0.18362.1

ModLoad: 000000008ade0000 0000000000120000 C:WINDOWSSystem32RPCRT4.dll (6.2.18362.628) (-exported- Symbols Loaded)
    Linked PDB Filename   : rpcrt4.pdb
    File Version          : 10.0.18362.1 (WinBuild.160101.0800)
    Company Name          : Microsoft Corporation
    Product Name          : Microsoft&#174; Windows&#174; Operating System
    Product Version       : 10.0.18362.1

ModLoad: 000000008aa00000 0000000000194000 C:WINDOWSSystem32USER32.dll (6.2.18362.778) (-exported- Symbols Loaded)
    Linked PDB Filename   : user32.pdb
    File Version          : 10.0.17763.802 (WinBuild.160101.0800)
    Company Name          : Microsoft Corporation
    Product Name          : Microsoft&#174; Windows&#174; Operating System
    Product Version       : 10.0.17763.802

ModLoad: 0000000089bd0000 0000000000021000 C:WINDOWSSystem32win32u.dll (6.2.18362.778) (-exported- Symbols Loaded)
    Linked PDB Filename   : win32u.pdb
    File Version          : 10.0.18362.778 (WinBuild.160101.0800)
    Company Name          : Microsoft Corporation
    Product Name          : Microsoft&#174; Windows&#174; Operating System
    Product Version       : 10.0.18362.778

ModLoad: 000000008b590000 0000000000026000 C:WINDOWSSystem32GDI32.dll (6.2.18362.1) (-exported- Symbols Loaded)
    Linked PDB Filename   : gdi32.pdb
    File Version          : 10.0.18362.1 (WinBuild.160101.0800)
    Company Name          : Microsoft Corporation
    Product Name          : Microsoft&#174; Windows&#174; Operating System
    Product Version       : 10.0.18362.1

ModLoad: 0000000089c00000 0000000000194000 C:WINDOWSSystem32gdi32full.dll (6.2.18362.778) (-exported- Symbols Loaded)
    Linked PDB Filename   : gdi32full.pdb
    File Version          : 10.0.18362.778 (WinBuild.160101.0800)
    Company Name          : Microsoft Corporation
    Product Name          : Microsoft&#174; Windows&#174; Operating System
    Product Version       : 10.0.18362.778

ModLoad: 0000000089e50000 000000000009e000 C:WINDOWSSystem32msvcp_win.dll (6.2.18362.387) (-exported- Symbols Loaded)
    Linked PDB Filename   : msvcp_win.pdb
    File Version          : 10.0.18362.387 (WinBuild.160101.0800)
    Company Name          : Microsoft Corporation
    Product Name          : Microsoft&#174; Windows&#174; Operating System
    Product Version       : 10.0.18362.387

ModLoad: 0000000089960000 00000000000fa000 C:WINDOWSSystem32ucrtbase.dll (6.2.18362.387) (-exported- Symbols Loaded)
    Linked PDB Filename   : ucrtbase.pdb
    File Version          : 10.0.18362.387 (WinBuild.160101.0800)
    Company Name          : Microsoft Corporation
    Product Name          : Microsoft&#174; Windows&#174; Operating System
    Product Version       : 10.0.18362.387

ModLoad: 000000008af20000 00000000000a3000 C:WINDOWSSystem32ADVAPI32.dll (6.2.18362.752) (-exported- Symbols Loaded)
    Linked PDB Filename   : advapi32.pdb
    File Version          : 10.0.18362.1 (WinBuild.160101.0800)
    Company Name          : Microsoft Corporation
    Product Name          : Microsoft&#174; Windows&#174; Operating System
    Product Version       : 10.0.18362.1

ModLoad: 000000008aba0000 000000000009e000 C:WINDOWSSystem32msvcrt.dll (7.0.18362.1) (-exported- Symbols Loaded)
    Linked PDB Filename   : msvcrt.pdb
    File Version          : 7.0.18362.1 (WinBuild.160101.0800)
    Company Name          : Microsoft Corporation
    Product Name          : Microsoft&#174; Windows&#174; Operating System
    Product Version       : 7.0.18362.1

ModLoad: 000000008afd0000 0000000000097000 C:WINDOWSSystem32sechost.dll (6.2.18362.693) (-exported- Symbols Loaded)
    Linked PDB Filename   : sechost.pdb
    File Version          : 10.0.18362.1 (WinBuild.160101.0800)
    Company Name          : Microsoft Corporation
    Product Name          : Microsoft&#174; Windows&#174; Operating System
    Product Version       : 10.0.18362.1

ModLoad: 000000008a0b0000 000000000002e000 C:WINDOWSSystem32IMM32.DLL (6.2.18362.387) (-exported- Symbols Loaded)
    Linked PDB Filename   : imm32.pdb
    File Version          : 10.0.18362.387 (WinBuild.160101.0800)
    Company Name          : Microsoft Corporation
    Product Name          : Microsoft&#174; Windows&#174; Operating System
    Product Version       : 10.0.18362.387

ModLoad: 0000000088d70000 0000000000011000 C:WINDOWSSystem32kernel.appcore.dll (6.2.18362.1) (-exported- Symbols Loaded)
    Linked PDB Filename   : Kernel.Appcore.pdb
    File Version          : 10.0.18362.1 (WinBuild.160101.0800)
    Company Name          : Microsoft Corporation
    Product Name          : Microsoft&#174; Windows&#174; Operating System
    Product Version       : 10.0.18362.1

ModLoad: 0000000087dc0000 0000000000031000 C:WINDOWSSYSTEM32ntmarta.dll (6.2.18362.1) (-exported- Symbols Loaded)
    Linked PDB Filename   : ntmarta.pdb
    File Version          : 10.0.18362.1 (WinBuild.160101.0800)
    Company Name          : Microsoft Corporation
    Product Name          : Microsoft&#174; Windows&#174; Operating System
    Product Version       : 10.0.18362.1

ModLoad: 00000000822e0000 00000000001f4000 C:WINDOWSSYSTEM32dbghelp.dll (6.2.18362.1) (-exported- Symbols Loaded)
    Linked PDB Filename   : dbghelp.pdb
    File Version          : 10.0.18362.1 (WinBuild.160101.0800)
    Company Name          : Microsoft Corporation
    Product Name          : Microsoft&#174; Windows&#174; Operating System
    Product Version       : 10.0.18362.1

ModLoad: 0000000089da0000 0000000000080000 C:WINDOWSSystem32bcryptPrimitives.dll (6.2.18362.295) (-exported- Symbols Loaded)
    Linked PDB Filename   : bcryptprimitives.pdb
    File Version          : 10.0.18362.295 (WinBuild.160101.0800)
    Company Name          : Microsoft Corporation
    Product Name          : Microsoft&#174; Windows&#174; Operating System
    Product Version       : 10.0.18362.295

ModLoad: 0000000083400000 000000000000a000 C:WINDOWSSYSTEM32version.dll (6.2.18362.1) (-exported- Symbols Loaded)
    Linked PDB Filename   : version.pdb
    File Version          : 10.0.18362.1 (WinBuild.160101.0800)
    Company Name          : Microsoft Corporation
    Product Name          : Microsoft&#174; Windows&#174; Operating System
    Product Version       : 10.0.18362.1



*** Dump of the Process Statistics: ***

- I/O Operations Counters -
Read: 5002, Write: 646, Other 13717

- I/O Transfers Counters -
Read: 14512312, Write: 19118, Other 6542

- Paged Pool Usage -
QuotaPagedPoolUsage: 317448, QuotaPeakPagedPoolUsage: 317576
QuotaNonPagedPoolUsage: 6792, QuotaPeakNonPagedPoolUsage: 7352

- Virtual Memory Usage -
VirtualSize: 83120128, PeakVirtualSize: 895655936

- Pagefile Usage -
PagefileUsage: 83120128, PeakPagefileUsage: 83128320

- Working Set Size -
WorkingSetSize: 103743488, PeakWorkingSetSize: 103747584, PageFaultCount: 25733

*** Dump of thread ID 3488 (state: Initialized): ***

- Information -
Status: Base Priority: Normal, Priority: Normal, , Kernel Time: 0.000000, User Time: 0.000000, Wait Time: 0.000000

- Unhandled Exception Record -
Reason: Access Violation (0xc0000005) at address 0x000001C12A257AD8 

- Registers -
rax=000000000000003a rbx=00000000296147b0 rcx=000000002a00cad0 rdx=000000002a0ecc08 rsi=000000000000000b rdi=000000002a00cad0
r8=000000000000003a r9=0000000000000421 r10=00000000b2606e80 r11=000000000a1450c0 r12=00000000aea60000 r13=000000000a15f7d0
r14=000000000a145800 r15=000000000048b215 rip=000000002a257ad8 rsp=000000000a145138 rbp=0000000000000000
cs=0033  ss=002b  ds=002b  es=002b  fs=0053  gs=002b             efl=00010202

- Callstack -
ChildEBP RetAddr  Args to Child
0a145130 aef3831c 00000000 b2606d60 b2606e80 b25ebe78 !+0x0 SymFromAddr(): GetLastError = '126'  SymGetModuleInfo(): GetLastError = '126' Address = '2a257ad8'
0a145160 aeef935d 296147b0 0a145200 0a145980 aeee355d rosetta_4.20_windows_x86_64!xmlParserInputRead+0x0 SymFromAddr(): GetLastError = '126'  SymGetModuleInfo(): GetLastError = '126' Address = 'aef3831c'
0a145190 b2067f10 b2f50150 0a15f7d0 00000000 aeee3265 rosetta_4.20_windows_x86_64!xmlParserInputRead+0x0 SymFromAddr(): GetLastError = '126'  SymGetModuleInfo(): GetLastError = '126' Address = 'aeef935d'
0a1451c0 aeee39e8 0a145e70 041c3000 0a1457c8 0a145850 rosetta_4.20_windows_x86_64!xmlValidateNotationDecl+0x0 SymFromAddr(): GetLastError = '126'  SymGetModuleInfo(): GetLastError = '126' Address = 'b2067f10'
0a145230 8bf411cf 00000000 0a1457b0 0a145e70 0a145e70 rosetta_4.20_windows_x86_64!xmlParserInputRead+0x0 SymFromAddr(): GetLastError = '126'  SymGetModuleInfo(): GetLastError = '126' Address = 'aeee39e8'
0a145260 8bf0a209 00000001 aea60000 00000000 b3ffa32c ntdll!__chkstk+0x0 SymFromAddr(): GetLastError = '126'  SymGetModuleInfo(): GetLastError = '126' Address = '8bf411cf'
0a145970 8bf3fe3e 29600000 8bedb997 b26da450 8bedc43f ntdll!RtlRaiseException+0x0 SymFromAddr(): GetLastError = '126'  SymGetModuleInfo(): GetLastError = '126' Address = '8bf0a209'
0a1460f0 af1a3e2b fffffffe 2da77558 ffffffff af1b18c5 ntdll!KiUserExceptionDispatcher+0x0 SymFromAddr(): GetLastError = '126'  SymGetModuleInfo(): GetLastError = '126' Address = '8bf3fe3e'
0a146140 af1b3690 b26da3a0 2da772b0 b26da3a0 0a146239 rosetta_4.20_windows_x86_64!cppdb::session::is_open+0x0 SymFromAddr(): GetLastError = '126'  SymGetModuleInfo(): GetLastError = '126' Address = 'af1a3e2b'
0a146270 af2c9ee8 2d2c83d8 2da09e10 2da772b0 2da09e10 rosetta_4.20_windows_x86_64!cppdb::session::is_open+0x0 SymFromAddr(): GetLastError = '126'  SymGetModuleInfo(): GetLastError = '126' Address = 'af1b3690'
0a146e20 af264b6c 2dd8e950 8bedb997 29540000 00000000 rosetta_4.20_windows_x86_64!cppdb::backend::statement::cache+0x0 SymFromAddr(): GetLastError = '126'  SymGetModuleInfo(): GetLastError = '126' Address = 'af2c9ee8'
0a147020 af26488e 0a147108 00000000 0a1472f0 00000000 rosetta_4.20_windows_x86_64!cppdb::backend::statement::cache+0x0 SymFromAddr(): GetLastError = '126'  SymGetModuleInfo(): GetLastError = '126' Address = 'af264b6c'
0a147180 af1c3da1 0a1472f8 00000000 29614000 0a1473c0 rosetta_4.20_windows_x86_64!cppdb::backend::statement::cache+0x0 SymFromAddr(): GetLastError = '126'  SymGetModuleInfo(): GetLastError = '126' Address = 'af26488e'
0a147540 af1c9f08 0a147890 0a147890 0a147890 00000000 rosetta_4.20_windows_x86_64!cppdb::backend::statement::cache+0x0 SymFromAddr(): GetLastError = '126'  SymGetModuleInfo(): GetLastError = '126' Address = 'af1c3da1'
0a147b90 af1c84db 29cf21d0 0a147bf0 29bc0080 29bc0080 rosetta_4.20_windows_x86_64!cppdb::backend::statement::cache+0x0 SymFromAddr(): GetLastError = '126'  SymGetModuleInfo(): GetLastError = '126' Address = 'af1c9f08'
0a147cf0 af131fb7 00000000 0a147e00 29bc0080 0a148000 rosetta_4.20_windows_x86_64!cppdb::backend::statement::cache+0x0 SymFromAddr(): GetLastError = '126'  SymGetModuleInfo(): GetLastError = '126' Address = 'af1c84db'
0a147e60 af1357a6 00000005 aeed5190 29fc8ba0 29fc8ba0 rosetta_4.20_windows_x86_64!cppdb::session::is_open+0x0 SymFromAddr(): GetLastError = '126'  SymGetModuleInfo(): GetLastError = '126' Address = 'af131fb7'
0a147ed0 af1356cc 0a1481d8 0a148049 0a1481d8 29bc0080 rosetta_4.20_windows_x86_64!cppdb::session::is_open+0x0 SymFromAddr(): GetLastError = '126'  SymGetModuleInfo(): GetLastError = '126' Address = 'af1357a6'
0a147f80 af1fb6f5 0a1481d8 0a148541 00000000 aeef75e8 rosetta_4.20_windows_x86_64!cppdb::session::is_open+0x0 SymFromAddr(): GetLastError = '126'  SymGetModuleInfo(): GetLastError = '126' Address = 'af1356cc'
0a1480a0 af1fa592 00000005 0a1481d8 0a1483b0 00000000 rosetta_4.20_windows_x86_64!cppdb::backend::statement::cache+0x0 SymFromAddr(): GetLastError = '126'  SymGetModuleInfo(): GetLastError = '126' Address = 'af1fb6f5'
0a148170 af1fad06 00000000 00000000 0a148a90 29540000 rosetta_4.20_windows_x86_64!cppdb::backend::statement::cache+0x0 SymFromAddr(): GetLastError = '126'  SymGetModuleInfo(): GetLastError = '126' Address = 'af1fa592'
0a148310 af6571a3 0a1483b0 0a148a90 ffffff01 aeee3e73 rosetta_4.20_windows_x86_64!cppdb::backend::statement::cache+0x0 SymFromAddr(): GetLastError = '126'  SymGetModuleInfo(): GetLastError = '126' Address = 'af1fad06'
0a148600 af659d09 00000000 00000001 0a148710 0a148a90 rosetta_4.20_windows_x86_64!cppdb::backend::statement::cache+0x0 SymFromAddr(): GetLastError = '126'  SymGetModuleInfo(): GetLastError = '126' Address = 'af6571a3'
0a148990 af652f8a 0a1489d0 0a148a90 2ce1b570 29c22450 rosetta_4.20_windows_x86_64!cppdb::backend::statement::cache+0x0 SymFromAddr(): GetLastError = '126'  SymGetModuleInfo(): GetLastError = '126' Address = 'af659d09'
0a1489f0 af86cc70 0a148a90 0a1491b8 29fc8ba0 00000000 rosetta_4.20_windows_x86_64!cppdb::backend::statement::cache+0x0 SymFromAddr(): GetLastError = '126'  SymGetModuleInfo(): GetLastError = '126' Address = 'af652f8a'
0a149180 af86c6e4 2dc91a70 2dbaeb90 b3ed5cc0 aeed75a6 rosetta_4.20_windows_x86_64!cppdb::backend::statement::cache+0x0 SymFromAddr(): GetLastError = '126'  SymGetModuleInfo(): GetLastError = '126' Address = 'af86cc70'
0a1491e0 af87603e 0a1492d0 2dc917a0 0a1492f0 0a149a40 rosetta_4.20_windows_x86_64!cppdb::backend::statement::cache+0x0 SymFromAddr(): GetLastError = '126'  SymGetModuleInfo(): GetLastError = '126' Address = 'af86c6e4'
0a149960 af8756d4 b27e312e b27e301e b3e47f70 af896cb4 rosetta_4.20_windows_x86_64!cppdb::backend::statement::cache+0x0 SymFromAddr(): GetLastError = '126'  SymGetModuleInfo(): GetLastError = '126' Address = 'af87603e'
0a1499f0 af87578e 00000005 0a149f98 29c22450 00000001 rosetta_4.20_windows_x86_64!cppdb::backend::statement::cache+0x0 SymFromAddr(): GetLastError = '126'  SymGetModuleInfo(): GetLastError = '126' Address = 'af8756d4'
0a149b90 aeee081d 29eeb820 29eeb820 29c22450 29615d01 rosetta_4.20_windows_x86_64!cppdb::backend::statement::cache+0x0 SymFromAddr(): GetLastError = '126'  SymGetModuleInfo(): GetLastError = '126' Address = 'af87578e'
0a15f7c0 aeeeb215 00000000 00000000 b3e0ccf8 00000000 rosetta_4.20_windows_x86_64!xmlParserInputRead+0x0 SymFromAddr(): GetLastError = '126'  SymGetModuleInfo(): GetLastError = '126' Address = 'aeee081d'
0a15f800 8acd7bd4 00000000 00000000 00000000 00000000 rosetta_4.20_windows_x86_64!xmlParserInputRead+0x0 SymFromAddr(): GetLastError = '126'  SymGetModuleInfo(): GetLastError = '126' Address = 'aeeeb215'
0a15f830 8bf0ce51 00000000 00000000 00000000 00000000 KERNEL32!BaseThreadInitThunk+0x0 SymFromAddr(): GetLastError = '126'  SymGetModuleInfo(): GetLastError = '126' Address = '8acd7bd4'
0a15f8b0 00000000 00000000 00000000 00000000 00000000 ntdll!RtlUserThreadStart+0x0 SymFromAddr(): GetLastError = '126'  SymGetModuleInfo(): GetLastError = '126' Address = '8bf0ce51'

*** Dump of thread ID 32763 (state: Initialized): ***

- Information -
Status: Base Priority: Normal, Priority: Unknown, , Kernel Time: 6.000000, User Time: 0.000000, Wait Time: 3280771328.000000

- Registers -
rax=0000000000000000 rbx=0000000000000000 rcx=0000000000000000 rdx=0000000000000000 rsi=0000000000000000 rdi=0000000000000000
r8=0000000000000000 r9=0000000000000000 r10=0000000000000000 r11=0000000000000000 r12=0000000000000000 r13=0000000000000000
r14=0000000000000000 r15=0000000000000000 rip=0000000000000000 rsp=0000000000000000 rbp=0000000000000000
cs=0000  ss=0000  ds=0000  es=0000  fs=0000  gs=0000             efl=00000000

- Callstack -
ChildEBP RetAddr  Args to Child
(-nosymbols- PC == 0)
00000000 00000000 00000000 00000000 00000000 00000000 !+0x0 

*** Dump of thread ID 30818506 (state: Unknown): ***

- Information -
Status: Base Priority: Normal, Priority: Unknown, , Kernel Time: 17179869184.000000, User Time: 21474836480.000000, Wait Time: 0.000000

- Registers -
rax=0000000000000000 rbx=0000000000000000 rcx=0000000000000000 rdx=0000000000000000 rsi=0000000000000000 rdi=0000000000000000
r8=0000000000000000 r9=0000000000000000 r10=0000000000000000 r11=0000000000000000 r12=0000000000000000 r13=0000000000000000
r14=0000000000000000 r15=0000000000000000 rip=0000000000000000 rsp=0000000000000000 rbp=0000000000000000
cs=0000  ss=0000  ds=0000  es=0000  fs=0000  gs=0000             efl=00000000

- Callstack -
ChildEBP RetAddr  Args to Child
(-nosymbols- PC == 0)
00000000 00000000 00000000 00000000 00000000 00000000 !+0x0 


*** Debug Message Dump ****


*** Foreground Window Data ***
    Window Name      : 
    Window Class     : 
    Window Process ID: 0
    Window Thread ID : 0

Exiting...

</stderr_txt>
]]>

Grant
Darwin NT
ID: 97349 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Tomcat雄猫

Send message
Joined: 20 Dec 14
Posts: 180
Credit: 5,386,173
RAC: 0
Message 97371 - Posted: 14 Jun 2020, 4:11:18 UTC
Last modified: 14 Jun 2020, 4:12:11 UTC

I wonder what this is? Ran for about half an hour before erroring out.
rb_06_13_29145_28622_ab_t000__robetta_cstwt_5.0_FT_IGNORE_THE_REST_06_05_947971_63_1
<core_client_version>7.16.7</core_client_version>
<![CDATA[
<message>
Incorrect function.
 (0x1) - exit code 1 (0x1)</message>
<stderr_txt>
command: projects/boinc.bakerlab.org_rosetta/rosetta_4.20_windows_x86_64.exe @rb_06_13_29145_28622_ab_t000__robetta_FLAGS -in::file::fasta t000_.fasta -jumps:pairing_file t000_.fasta.bbcontacts.jumps -jumps:random_sheets 9 1 -constraints::cst_file t000_.fasta.CB.cst -constraints:cst_weight 5.0 -constraints::cst_fa_file t000_.fasta.MIN.cst -constraints:cst_fa_weight 5.0 -in:file:boinc_wu_zip rb_06_13_29145_28622_ab_t000__robetta.zip -frag3 rb_06_13_29145_28622_ab_t000__robetta.200.3mers.index.gz -fragA rb_06_13_29145_28622_ab_t000__robetta.200.5mers.index.gz -fragB rb_06_13_29145_28622_ab_t000__robetta.200.6mers.index.gz -nstruct 10000 -cpu_run_time 28800 -boinc:max_nstruct 20000 -checkpoint_interval 120 -mute all -database minirosetta_database -in::file::zip minirosetta_database.zip -boinc::watchdog -boinc::cpu_run_timeout 36000 -run::rng mt19937 -constant_seed -jran 2496535
Using database: database_357d5d93529_n_methylminirosetta_database

[ ERROR ]: Caught exception:


File: C:cygwin64homeboinc4.17Rosettamainsourcesrccore/pack/dunbrack/SingleResidueDunbrackLibrary.hh:306
chi angle must be between -180 and 180: -nan(ind)
 ------------------------ Begin developer's backtrace ------------------------- 
BACKTRACE:
 ------------------------- End developer's backtrace -------------------------- 


AN INTERNAL ERROR HAS OCCURED. PLEASE SEE THE CONTENTS OF ROSETTA_CRASH.log FOR DETAILS.



</stderr_txt>
]]>
ID: 97371 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
James W

Send message
Joined: 25 Nov 12
Posts: 130
Credit: 1,766,254
RAC: 0
Message 97629 - Posted: 26 Jun 2020, 9:27:14 UTC

This morning (6/25) had 10 errors on my 2 hosts (one with device 3710630 and 9 with device 1759960
Name: Series "rb_06_25_30616_30000__t000__ab_robetta_IGNORE_THE_REST_****"
Application: Rosetta v4.20 windows_x86_64
Sample Task for Host 3710630: 1210226936.
Sample WU for same host: 1086224610.
Status: Error while downloading.
Exit status: -186 (0xFFFFFF46) ERR_RESULT_DOWNLOAD
Stderr output:
WU download error: couldn't get input files:
<file_xfer_error>
<file_name>flags_rb_06_25_30616_30000__t000__ab_robetta</file_name>
<error_code>-224 (permanent HTTP error)</error_code>
<error_message>permanent HTTP error</error_message>
<file_xfer_error>
<file_name>input_rb_06_25_30616_30000__t000__ab_robetta.zip</file_name>
<error_code>-224 (permanent HTTP error)</error_code>
<error_message>permanent HTTP error</error_message>
Sample Task for Host 1759960: 1210207254.
Sample WU for same host: 1086211597.
Status: Error while downloading.
Exit status: -186 (0xFFFFFF46) ERR_RESULT_DOWNLOAD
Stderr output:
WU download error: couldn't get input files:
<file_xfer_error>
<file_name>flags_rb_06_25_30584_29981__t000__0_C8_robetta</file_name>
<error_code>-224 (permanent HTTP error)</error_code>
<error_message>permanent HTTP error</error_message>
<file_xfer_error>
<file_name>input_rb_06_25_30584_29981__t000__0_C8_robetta.zip</file_name>
<error_code>-224 (permanent HTTP error)</error_code>
<error_message>permanent HTTP error</error_message>
Currently using BOINC Manager V7.16.5 on both hosts. Would the updated Manager V7.16.7 fix this particular problem?
ID: 97629 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Profile Grant (SSSF)

Send message
Joined: 28 Mar 20
Posts: 1680
Credit: 17,850,850
RAC: 22,496
Message 97630 - Posted: 26 Jun 2020, 9:42:42 UTC - in response to Message 97629.  
Last modified: 26 Jun 2020, 9:43:38 UTC

Would the updated Manager V7.16.7 fix this particular problem?
Nope.
I'd suggest re-booting your system & modem.
If download issues are still occurring, i'd check your AV/security software to see if there have been any recent updates that are now clobbering Rosetta downloads (although you'll have to wat for some new work to be loaded before you'll be able to see if that does help things).
Grant
Darwin NT
ID: 97630 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
James W

Send message
Joined: 25 Nov 12
Posts: 130
Credit: 1,766,254
RAC: 0
Message 97633 - Posted: 26 Jun 2020, 10:13:42 UTC - in response to Message 97630.  

I did get a batch of tasks after these failed ones, including some in same series as problem ones, including WU 1086231903. I'm the wingman, with original task failing due to downloading issue I had previously. This one has been crunching away for over 5 hrs. and 40 mins. so far, with expected 2 hrs. & 50 mins. to go. I have 3 other tasks in this series (rb_06_25_30623_30025_ab_t000__h001_robetta_****) running as well on host 1759960.
ID: 97633 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Profile Grant (SSSF)

Send message
Joined: 28 Mar 20
Posts: 1680
Credit: 17,850,850
RAC: 22,496
Message 97634 - Posted: 26 Jun 2020, 10:22:06 UTC - in response to Message 97633.  

I'm the wingman, with original task failing due to downloading issue I had previously.
With the other copy also giving a download error & the "missing input file" part of the error message, it could have been a server issue- the file missing from the download server (or at least not where it was expected to be).
Grant
Darwin NT
ID: 97634 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
James W

Send message
Joined: 25 Nov 12
Posts: 130
Credit: 1,766,254
RAC: 0
Message 97672 - Posted: 27 Jun 2020, 10:20:07 UTC - in response to Message 97629.  
Last modified: 27 Jun 2020, 10:24:00 UTC

I just noted that 9 of the 10 download error tasks I had were reissued to a host (3791293) which has not contacted server since 6/25. This host also timed out multiple tasks (252). Not a good choice to use as wingman! These 9 tasks will no doubt time out as well, as last valid tasks for this host were on 6/23/20!
ID: 97672 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
James W

Send message
Joined: 25 Nov 12
Posts: 130
Credit: 1,766,254
RAC: 0
Message 97673 - Posted: 27 Jun 2020, 10:33:07 UTC - in response to Message 97634.  

I'm the wingman, with original task failing due to downloading issue I had previously.
With the other copy also giving a download error & the "missing input file" part of the error message, it could have been a server issue- the file missing from the download server (or at least not where it was expected to be).
Appears your assumption is correct, as my host validly processed these previously erroneous tasks.
ID: 97673 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
James W

Send message
Joined: 25 Nov 12
Posts: 130
Credit: 1,766,254
RAC: 0
Message 97971 - Posted: 9 Jul 2020, 0:09:24 UTC

7/8/2020 3:58:48 PM | Rosetta@home | Task 81efa213_fold_SAVE_ALL_OUT_951627_1211_0 exited with zero status but no 'finished' file
7/8/2020 3:58:48 PM | Rosetta@home | If this happens repeatedly you may need to reset the project.
This is the first time I've seen this happen on Rosetta 4.20. It happened all the time with Mini tasks.
Host: 1759960
WU: 1091525816
Task: 1216339486
At first I thought another Mini app had snuck in. Anyone else seen this "error" with 4.20?
ID: 97971 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Sid Celery

Send message
Joined: 11 Feb 08
Posts: 2124
Credit: 41,228,077
RAC: 11,048
Message 97973 - Posted: 9 Jul 2020, 1:58:54 UTC - in response to Message 97971.  

7/8/2020 3:58:48 PM | Rosetta@home | Task 81efa213_fold_SAVE_ALL_OUT_951627_1211_0 exited with zero status but no 'finished' file
7/8/2020 3:58:48 PM | Rosetta@home | If this happens repeatedly you may need to reset the project.
This is the first time I've seen this happen on Rosetta 4.20. It happened all the time with Mini tasks.
Host: 1759960
WU: 1091525816
Task: 1216339486
At first I thought another Mini app had snuck in. Anyone else seen this "error" with 4.20?

Are you sure that's the same task? Your link goes to COVID_jJHRs_perturb_SAVE_ALL_OUT_IGNORE_THE_REST_9za7zv9d_953350_4 and it looks fine, as do all your reported tasks
ID: 97973 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
James W

Send message
Joined: 25 Nov 12
Posts: 130
Credit: 1,766,254
RAC: 0
Message 97976 - Posted: 9 Jul 2020, 7:22:11 UTC - in response to Message 97973.  

Are you sure that's the same task? Your link goes to COVID_jJHRs_perturb_SAVE_ALL_OUT_IGNORE_THE_REST_9za7zv9d_953350_4 and it looks fine, as do all your reported tasks.
Sorry, picked the wrong WU and task somehow. Should be:
WU: 1091720486
Task: 1216562014
It's currently still in process. I note now that error happened about 3 minutes into crunching and did not occur again. Hopefully just a fluke.
ID: 97976 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Previous · 1 . . . 14 · 15 · 16 · 17 · 18 · 19 · 20 . . . 34 · Next

Message boards : Number crunching : Rosetta 4.1+ and 4.2+



©2024 University of Washington
https://www.bakerlab.org