Your browser doesn't support javascript.
loading
Mostrar: 20 | 50 | 100
Resultados 1 - 20 de 30
Filtrar
1.
Cancer Med ; 12(19): 19394-19405, 2023 10.
Artigo em Inglês | MEDLINE | ID: mdl-37712677

RESUMO

BACKGROUND: Roughly 5% of metastatic cancers present with uncertain origin, for which molecular classification could influence subsequent management; however, prior studies of molecular diagnostic classifiers have reported mixed results with regard to clinical impact. In this retrospective study, we evaluated the utility of a novel molecular diagnostic classifier by assessing theoretical changes in treatment and additional testing recommendations from oncologists before and after the review of classifier predictions. METHODS: We retrospectively analyzed de-identified records from 289 patients with a consensus diagnosis of cancer of uncertain/unknown primary (CUP). Two (or three, if adjudication was required) independent oncologists separately reviewed patient clinical information to determine the course of treatment before they reviewed results from the molecular diagnostic classifier and subsequently evaluated whether the predicted diagnosis would alter their treatment plan. RESULTS: Results from the molecular diagnostic classifier changed the consensus oncologist-reported treatment recommendations for 235 out of 289 patients (81.3%). At the level of individual oncologist reviews (n = 414), 64.7% (n = 268) of treatment recommendations were based on CUP guidelines prior to review of results from the molecular diagnostic classifier. After seeing classifier results, 98.1% (n = 207) of the reviews, where treatment was specified (n = 211), were guided by the tissue of origin-specific guidelines. Overall, 89.9% of the 414 total reviews either expressed strong agreement (n = 242) or agreement (n = 130) that the molecular diagnostic classifier result increased confidence in selecting the most appropriate treatment regimen. CONCLUSIONS: A retrospective review of CUP cases demonstrates that a novel molecular diagnostic classifier could affect treatment in the majority of patients, supporting its clinical utility. Further studies are needed to prospectively evaluate whether the use of molecular diagnostic classifiers improves clinical outcomes in CUP patients.


Assuntos
Segunda Neoplasia Primária , Neoplasias Primárias Desconhecidas , Humanos , Neoplasias Primárias Desconhecidas/diagnóstico , Neoplasias Primárias Desconhecidas/genética , Neoplasias Primárias Desconhecidas/patologia , Estudos Retrospectivos , Patologia Molecular
2.
Mol Diagn Ther ; 27(4): 499-511, 2023 07.
Artigo em Inglês | MEDLINE | ID: mdl-37099070

RESUMO

INTRODUCTION: Cancers assume a variety of distinct histologies, and may originate from a myriad of sites including solid organs, hematopoietic cells, and connective tissue. Clinical decision-making based on consensus guidelines such as the National Comprehensive Cancer Network (NCCN) is often predicated on a specific histologic and anatomic diagnosis, supported by clinical features and pathologist interpretation of morphology and immunohistochemical (IHC) staining patterns. However, in patients with nonspecific morphologic and IHC findings-in addition to ambiguous clinical presentations such as recurrence versus new primary-a definitive diagnosis may not be possible, resulting in the patient being categorized as having a cancer of unknown primary (CUP). Therapeutic options and clinical outcomes are poor for patients with CUP, with a median survival of 8-11 months. METHODS: Here, we describe and validate the Tempus Tumor Origin (Tempus TO) assay, an RNA-sequencing-based machine learning classifier capable of discriminating between 68 clinically relevant cancer subtypes. Model accuracy was assessed using primary and/or metastatic samples with known subtype. RESULTS: We show that the Tempus TO model is 91% accurate when assessed on both a retrospectively held out cohort and a set of samples sequenced after model freeze that collectively contained 9210 total samples with known diagnoses. When evaluated on a cohort of CUPs, the model recapitulated established associations between genomic alterations and cancer subtype. DISCUSSION: Combining diagnostic prediction tests (e.g., Tempus TO) with sequencing-based variant reporting (e.g., Tempus xT) may expand therapeutic options for patients with cancers of unknown primary or uncertain histology.


Assuntos
Neoplasias Primárias Desconhecidas , Transcriptoma , Humanos , Neoplasias Primárias Desconhecidas/diagnóstico , Neoplasias Primárias Desconhecidas/genética , Neoplasias Primárias Desconhecidas/patologia , Perfilação da Expressão Gênica/métodos , Estudos Retrospectivos , Genômica
4.
Genet Med ; 21(10): 2407-2408, 2019 Oct.
Artigo em Inglês | MEDLINE | ID: mdl-31089271

RESUMO

The original version of this Article contained an error in Figure 3. Specifically, the result "3 (67%) TOP" should read "2 (67%) TOP." This has now been corrected in both the PDF and HTML versions of the Article.

5.
Genet Med ; 21(11): 2569-2576, 2019 11.
Artigo em Inglês | MEDLINE | ID: mdl-31036917

RESUMO

PURPOSE: Medical society guidelines recommend offering genotyping-based cystic fibrosis (CF) carrier screening to pregnant women or women considering pregnancy. We assessed the performance of sequencing-based CF screening relative to genotyping, in terms of analytical validity, clinical validity, clinical impact, and clinical utility. METHODS: Analytical validity was assessed using orthogonal confirmation and reference samples. Clinical validity was evaluated using the CFTR2 database. Clinical impact was assessed using ~100,000 screened patients. Three screening strategies were compared: genotyping 23 guideline-recommended variants ("CF23"), sequencing all coding bases in CFTR ("NGS"), and sequencing with large copy-number variant (CNV) identification ("NGS + CNV"). Clinical utility was determined via self-reported actions of at-risk couples (ARCs). RESULTS: Analytical accuracy of NGS + CNV was 100% for SNVs, indels, and CNVs; interpretive clinical specificity relative to CFTR2 was 99.5%. NGS + CNV detected 58 ARCs, 18 of whom would have gone undetected with CF23 alone. Most ARCs (89% screened preconceptionally, 56% prenatally) altered pregnancy management, and no significant differences were observed between ARCs with or without at least one non-CF23 variant. CONCLUSION: Modern NGS and variant interpretation enable accurate sequencing-based CF screening. Limiting screening to 23 variants does not improve analytical validity, clinical validity, or clinical utility, but does fail to detect approximately 30% (18/58) of ARCs.


Assuntos
Fibrose Cística/diagnóstico , Fibrose Cística/genética , Testes Genéticos/métodos , Adulto , Variações do Número de Cópias de DNA/genética , Feminino , Sequenciamento de Nucleotídeos em Larga Escala/métodos , Humanos , Mutação INDEL/genética , Mutação/genética , Gravidez , Sensibilidade e Especificidade
6.
Elife ; 82019 05 13.
Artigo em Inglês | MEDLINE | ID: mdl-31081496

RESUMO

Elucidating the conformational heterogeneity of proteins is essential for understanding protein function and developing exogenous ligands. With the rapid development of experimental and computational methods, it is of great interest to integrate these approaches to illuminate the conformational landscapes of target proteins. SETD8 is a protein lysine methyltransferase (PKMT), which functions in vivo via the methylation of histone and nonhistone targets. Utilizing covalent inhibitors and depleting native ligands to trap hidden conformational states, we obtained diverse X-ray structures of SETD8. These structures were used to seed distributed atomistic molecular dynamics simulations that generated a total of six milliseconds of trajectory data. Markov state models, built via an automated machine learning approach and corroborated experimentally, reveal how slow conformational motions and conformational states are relevant to catalysis. These findings provide molecular insight on enzymatic catalysis and allosteric mechanisms of a PKMT via its detailed conformational landscape.


Assuntos
Histona-Lisina N-Metiltransferase/química , Histona-Lisina N-Metiltransferase/metabolismo , Regulação Alostérica , Cristalografia por Raios X , Simulação de Dinâmica Molecular , Conformação Proteica
7.
Genet Med ; 21(9): 1948-1957, 2019 09.
Artigo em Inglês | MEDLINE | ID: mdl-30760891

RESUMO

PURPOSE: Carrier screening identifies couples at high risk for conceiving offspring affected with serious heritable conditions. Minimal guidelines recommend offering testing for cystic fibrosis and spinal muscular atrophy, but expanded carrier screening (ECS) assesses hundreds of conditions simultaneously. Although medical societies consider ECS an acceptable practice, the health economics of ECS remain incompletely characterized. METHODS: Preconception screening was modeled using a decision tree comparing minimal screening and a 176-condition ECS panel. Carrier rates from >60,000 patients, primarily with private insurance, informed disease incidence estimates, while cost and life-years-lost data were aggregated from the literature and a cost-of-care database. Model robustness was evaluated using one-way and probabilistic sensitivity analyses. RESULTS: For every 100,000 pregnancies, 290 are predicted to be affected by ECS-panel conditions, which, on average, increase mortality by 26 undiscounted life-years and individually incur $1,100,000 in lifetime costs. Relative to minimal screening, preconception ECS reduces the affected birth rate and is estimated to be cost-effective (i.e.,<$50,000 incremental cost per life-year), findings robust to perturbation. CONCLUSION: Based on screened patients predominantly with private coverage, preconception ECS is predicted to reduce the burden of Mendelian disease in a cost-effective manner compared with minimal screening. The data and framework herein may facilitate similar assessments in other cohorts.


Assuntos
Triagem de Portadores Genéticos/métodos , Doenças Genéticas Inatas/genética , Modelos Teóricos , Diagnóstico Pré-Natal , Tomada de Decisão Clínica/métodos , Análise Custo-Benefício/economia , Feminino , Triagem de Portadores Genéticos/economia , Aconselhamento Genético/economia , Doenças Genéticas Inatas/classificação , Doenças Genéticas Inatas/economia , Humanos , Gravidez
8.
Genet Med ; 21(5): 1041-1048, 2019 05.
Artigo em Inglês | MEDLINE | ID: mdl-30310157

RESUMO

PURPOSE: Expanded carrier screening (ECS) informs couples of their risk of having offspring affected by certain genetic conditions. Limited data exists assessing the actions and reproductive outcomes of at-risk couples (ARCs). We describe the impact of ECS on planned and actual pregnancy management in the largest sample of ARCs studied to date. METHODS: Couples who elected ECS and were found to be at high risk of having a pregnancy affected by at least one of 176 genetic conditions were invited to complete a survey about their actions and pregnancy management. RESULTS: Three hundred ninety-one ARCs completed the survey. Among those screened before becoming pregnant, 77% planned or pursued actions to avoid having affected offspring. Among those screened during pregnancy, 37% elected prenatal diagnostic testing (PNDx) for that pregnancy. In subsequent pregnancies that occurred in both the preconception and prenatal screening groups, PNDx was pursued in 29%. The decision to decline PNDx was most frequently based on the fear of procedure-related miscarriage, as well as the belief that termination would not be pursued in the event of a positive diagnosis. CONCLUSION: ECS results impacted couples' reproductive decision-making and led to altered pregnancy management that effectively eliminates the risk of having affected offspring.


Assuntos
Serviços de Planejamento Familiar , Triagem de Portadores Genéticos , Adulto , Estudos de Coortes , Feminino , Aconselhamento Genético , Humanos , Pessoa de Meia-Idade , Gravidez , Resultado da Gravidez , Diagnóstico Pré-Natal , Inquéritos e Questionários
9.
Entropy (Basel) ; 20(5)2018 May.
Artigo em Inglês | MEDLINE | ID: mdl-30393452

RESUMO

While Langevin integrators are popular in the study of equilibrium properties of complex systems, it is challenging to estimate the timestep-induced discretization error: the degree to which the sampled phase-space or configuration-space probability density departs from the desired target density due to the use of a finite integration timestep. Sivak et al., introduced a convenient approach to approximating a natural measure of error between the sampled density and the target equilibrium density, the Kullback-Leibler (KL) divergence, in phase space, but did not specifically address the issue of configuration-space properties, which are much more commonly of interest in molecular simulations. Here, we introduce a variant of this near-equilibrium estimator capable of measuring the error in the configuration-space marginal density, validating it against a complex but exact nested Monte Carlo estimator to show that it reproduces the KL divergence with high fidelity. To illustrate its utility, we employ this new near-equilibrium estimator to assess a claim that a recently proposed Langevin integrator introduces extremely small configuration-space density errors up to the stability limit at no extra computational expense. Finally, we show how this approach to quantifying sampling bias can be applied to a wide variety of stochastic integrators by following a straightforward procedure to compute the appropriate shadow work, and describe how it can be extended to quantify the error in arbitrary marginal or conditional distributions of interest.

10.
J Chem Theory Comput ; 14(11): 6076-6092, 2018 Nov 13.
Artigo em Inglês | MEDLINE | ID: mdl-30351006

RESUMO

Traditional approaches to specifying a molecular mechanics force field encode all the information needed to assign force field parameters to a given molecule into a discrete set of atom types. This is equivalent to a representation consisting of a molecular graph comprising a set of vertices, which represent atoms labeled by atom type, and unlabeled edges, which represent chemical bonds. Bond stretch, angle bend, and dihedral parameters are then assigned by looking up bonded pairs, triplets, and quartets of atom types in parameter tables to assign valence terms and using the atom types themselves to assign nonbonded parameters. This approach, which we call indirect chemical perception because it operates on the intermediate graph of atom-typed nodes, creates a number of technical problems. For example, atom types must be sufficiently complex to encode all necessary information about the molecular environment, making it difficult to extend force fields encoded this way. Atom typing also results in a proliferation of redundant parameters applied to chemically equivalent classes of valence terms, needlessly increasing force field complexity. Here, we describe a new approach to assigning force field parameters via direct chemical perception. Rather than working through the intermediary of the atom-typed graph, direct chemical perception operates directly on the unmodified chemical graph of the molecule to assign parameters. In particular, parameters are assigned to each type of force field term (e.g., bond stretch, angle bend, torsion, and Lennard-Jones) based on standard chemical substructure queries implemented via the industry-standard SMARTS chemical perception language, using SMIRKS extensions that permit labeling of specific atoms within a chemical pattern. We use this to implement a new force field format, called the SMIRKS Native Open Force Field (SMIRNOFF) format. We demonstrate the power and generality of this approach using examples of specific molecules that pose problems for indirect chemical perception and construct and validate a minimalist yet very general force field, SMIRNOFF99Frosst. We find that a parameter definition file only ∼300 lines long provides coverage of all but <0.02% of a 5 million molecule drug-like test set. Despite its simplicity, the accuracy of SMIRNOFF99Frosst for small molecule hydration free energies and selected properties of pure organic liquids is similar to that of the General Amber Force Field, whose specification requires thousands of parameters. This force field provides a starting point for further optimization and refitting work to follow.

11.
Clin Chem ; 64(7): 1063-1073, 2018 07.
Artigo em Inglês | MEDLINE | ID: mdl-29760218

RESUMO

BACKGROUND: By identifying pathogenic variants across hundreds of genes, expanded carrier screening (ECS) enables prospective parents to assess the risk of transmitting an autosomal recessive or X-linked condition. Detection of at-risk couples depends on the number of conditions tested, the prevalence of the respective diseases, and the screen's analytical sensitivity for identifying disease-causing variants. Disease-level analytical sensitivity is often <100% in ECS tests because copy number variants (CNVs) are typically not interrogated because of their technical complexity. METHODS: We present an analytical validation and preliminary clinical characterization of a 235-gene sequencing-based ECS with full coverage across coding regions, targeted assessment of pathogenic noncoding variants, panel-wide CNV calling, and specialized assays for technically challenging genes. Next-generation sequencing, customized bioinformatics, and expert manual call review were used to identify single-nucleotide variants, short insertions and deletions, and CNVs for all genes except FMR1 and those whose low disease incidence or high technical complexity precluded novel variant identification or interpretation. RESULTS: Screening of 36859 patients' blood or saliva samples revealed the substantial impact on fetal disease-risk detection attributable to novel CNVs (9.19% of risk) and technically challenging conditions (20.2% of risk), such as congenital adrenal hyperplasia. Of the 7498 couples screened, 335 were identified as at risk for an affected pregnancy, underscoring the clinical importance of the test. Validation of our ECS demonstrated >99% analytical sensitivity and >99% analytical specificity. CONCLUSIONS: Validated high-fidelity identification of different variant types-especially for diseases with complicated molecular genetics-maximizes at-risk couple detection.


Assuntos
Variações do Número de Cópias de DNA , Éxons , Triagem de Portadores Genéticos , Estudos de Coortes , Humanos , Mutação INDEL , Polimorfismo de Nucleotídeo Único
12.
Genet Med ; 20(1): 55-63, 2018 Jan.
Artigo em Inglês | MEDLINE | ID: mdl-28640244

RESUMO

PurposeThe recent growth in pan-ethnic expanded carrier screening (ECS) has raised questions about how such panels might be designed and evaluated systematically. Design principles for ECS panels might improve clinical detection of at-risk couples and facilitate objective discussions of panel choice.MethodsGuided by medical-society statements, we propose a method for the design of ECS panels that aims to maximize the aggregate and per-disease sensitivity and specificity across a range of Mendelian disorders considered serious by a systematic classification scheme. We evaluated this method retrospectively using results from 474,644 de-identified carrier screens. We then constructed several idealized panels to highlight strengths and limitations of different ECS methodologies.ResultsBased on modeled fetal risks for "severe" and "profound" diseases, a commercially available ECS panel (Counsyl) is expected to detect 183 affected conceptuses per 100,000 US births. A screen's sensitivity is greatly impacted by two factors: (i) the methodology used (e.g., full-exon sequencing finds more affected conceptuses than targeted genotyping) and (ii) the detection rate of the screen for diseases with high prevalence and complex molecular genetics (e.g., fragile X syndrome).ConclusionThe described approaches enable principled, quantitative evaluation of which diseases and methodologies are appropriate for pan-ethnic expanded carrier screening.


Assuntos
Triagem de Portadores Genéticos/métodos , Triagem de Portadores Genéticos/normas , Doenças Genéticas Inatas/diagnóstico , Doenças Genéticas Inatas/genética , Feminino , Estudos de Associação Genética , Predisposição Genética para Doença , Testes Genéticos/métodos , Testes Genéticos/normas , Genômica/métodos , Genômica/normas , Fidelidade a Diretrizes , Humanos , Reprodutibilidade dos Testes
13.
PLoS Comput Biol ; 13(7): e1005659, 2017 Jul.
Artigo em Inglês | MEDLINE | ID: mdl-28746339

RESUMO

OpenMM is a molecular dynamics simulation toolkit with a unique focus on extensibility. It allows users to easily add new features, including forces with novel functional forms, new integration algorithms, and new simulation protocols. Those features automatically work on all supported hardware types (including both CPUs and GPUs) and perform well on all of them. In many cases they require minimal coding, just a mathematical description of the desired function. They also require no modification to OpenMM itself and can be distributed independently of OpenMM. This makes it an ideal tool for researchers developing new simulation methods, and also allows those new methods to be immediately available to the larger community.


Assuntos
Algoritmos , Biologia Computacional/métodos , Simulação de Dinâmica Molecular , Software
14.
J Phys Chem B ; 121(16): 4023-4039, 2017 04 27.
Artigo em Inglês | MEDLINE | ID: mdl-28306259

RESUMO

The increasing availability of high-quality experimental data and first-principles calculations creates opportunities for developing more accurate empirical force fields for simulation of proteins. We developed the AMBER-FB15 protein force field by building a high-quality quantum chemical data set consisting of comprehensive potential energy scans and employing the ForceBalance software package for parameter optimization. The optimized potential surface allows for more significant thermodynamic fluctuations away from local minima. In validation studies where simulation results are compared to experimental measurements, AMBER-FB15 in combination with the updated TIP3P-FB water model predicts equilibrium properties with equivalent accuracy, and temperature dependent properties with significantly improved accuracy, in comparison with published models. We also discuss the effect of changing the protein force field and water model on the simulation results.


Assuntos
Proteínas/química , Bases de Dados de Proteínas , Simulação de Dinâmica Molecular , Desnaturação Proteica , Teoria Quântica , Software , Termodinâmica , Água/química
15.
Biophys J ; 112(1): 10-15, 2017 Jan 10.
Artigo em Inglês | MEDLINE | ID: mdl-28076801

RESUMO

MSMBuilder is a software package for building statistical models of high-dimensional time-series data. It is designed with a particular focus on the analysis of atomistic simulations of biomolecular dynamics such as protein folding and conformational change. MSMBuilder is named for its ability to construct Markov state models (MSMs), a class of models that has gained favor among computational biophysicists. In addition to both well-established and newer MSM methods, the package includes complementary algorithms for understanding time-series data such as hidden Markov models and time-structure based independent component analysis. MSMBuilder boasts an easy to use command-line interface, as well as clear and consistent abstractions through its Python application programming interface. MSMBuilder was developed with careful consideration for compatibility with the broader machine learning community by following the design of scikit-learn. The package is used primarily by practitioners of molecular dynamics, but is just as applicable to other computational or experimental time-series measurements.


Assuntos
Modelos Estatísticos , Simulação de Dinâmica Molecular , Software , Proteína Tirosina Quinase CSK , Cadeias de Markov , Conformação Proteica , Quinases da Família src/química , Quinases da Família src/metabolismo
16.
PLoS Comput Biol ; 12(6): e1004728, 2016 06.
Artigo em Inglês | MEDLINE | ID: mdl-27337644

RESUMO

The rapidly expanding body of available genomic and protein structural data provides a rich resource for understanding protein dynamics with biomolecular simulation. While computational infrastructure has grown rapidly, simulations on an omics scale are not yet widespread, primarily because software infrastructure to enable simulations at this scale has not kept pace. It should now be possible to study protein dynamics across entire (super)families, exploiting both available structural biology data and conformational similarities across homologous proteins. Here, we present a new tool for enabling high-throughput simulation in the genomics era. Ensembler takes any set of sequences-from a single sequence to an entire superfamily-and shepherds them through various stages of modeling and refinement to produce simulation-ready structures. This includes comparative modeling to all relevant PDB structures (which may span multiple conformational states of interest), reconstruction of missing loops, addition of missing atoms, culling of nearly identical structures, assignment of appropriate protonation states, solvation in explicit solvent, and refinement and filtering with molecular simulation to ensure stable simulation. The output of this pipeline is an ensemble of structures ready for subsequent molecular simulations using computer clusters, supercomputers, or distributed computing projects like Folding@home. Ensembler thus automates much of the time-consuming process of preparing protein models suitable for simulation, while allowing scalability up to entire superfamilies. A particular advantage of this approach can be found in the construction of kinetic models of conformational dynamics-such as Markov state models (MSMs)-which benefit from a diverse array of initial configurations that span the accessible conformational states to aid sampling. We demonstrate the power of this approach by constructing models for all catalytic domains in the human tyrosine kinase family, using all available kinase catalytic domain structures from any organism as structural templates. Ensembler is free and open source software licensed under the GNU General Public License (GPL) v2. It is compatible with Linux and OS X. The latest release can be installed via the conda package manager, and the latest source can be downloaded from https://github.com/choderalab/ensembler.


Assuntos
Algoritmos , Modelos Químicos , Simulação de Acoplamento Molecular/métodos , Proteínas Tirosina Quinases/química , Proteínas Tirosina Quinases/ultraestrutura , Análise de Sequência de Proteína/métodos , Sítios de Ligação , Simulação por Computador , Ativação Enzimática , Ensaios de Triagem em Larga Escala/métodos , Ligação Proteica , Software
17.
Biophys J ; 109(8): 1528-32, 2015 Oct 20.
Artigo em Inglês | MEDLINE | ID: mdl-26488642

RESUMO

As molecular dynamics (MD) simulations continue to evolve into powerful computational tools for studying complex biomolecular systems, the necessity of flexible and easy-to-use software tools for the analysis of these simulations is growing. We have developed MDTraj, a modern, lightweight, and fast software package for analyzing MD simulations. MDTraj reads and writes trajectory data in a wide variety of commonly used formats. It provides a large number of trajectory analysis capabilities including minimal root-mean-square-deviation calculations, secondary structure assignment, and the extraction of common order parameters. The package has a strong focus on interoperability with the wider scientific Python ecosystem, bridging the gap between MD data and the rapidly growing collection of industry-standard statistical analysis and visualization tools in Python. MDTraj is a powerful and user-friendly software package that simplifies the analysis of MD data and connects these datasets with the modern interactive data science software ecosystem in Python.


Assuntos
Simulação de Dinâmica Molecular , Software , Internet
18.
J Phys Chem B ; 119(40): 12912-20, 2015 Oct 08.
Artigo em Inglês | MEDLINE | ID: mdl-26339862

RESUMO

Atomistic molecular simulations are a powerful way to make quantitative predictions, but the accuracy of these predictions depends entirely on the quality of the force field employed. Although experimental measurements of fundamental physical properties offer a straightforward approach for evaluating force field quality, the bulk of this information has been tied up in formats that are not machine-readable. Compiling benchmark data sets of physical properties from non-machine-readable sources requires substantial human effort and is prone to the accumulation of human errors, hindering the development of reproducible benchmarks of force-field accuracy. Here, we examine the feasibility of benchmarking atomistic force fields against the NIST ThermoML data archive of physicochemical measurements, which aggregates thousands of experimental measurements in a portable, machine-readable, self-annotating IUPAC-standard format. As a proof of concept, we present a detailed benchmark of the generalized Amber small-molecule force field (GAFF) using the AM1-BCC charge model against experimental measurements (specifically, bulk liquid densities and static dielectric constants at ambient pressure) automatically extracted from the archive and discuss the extent of data available for use in larger scale (or continuously performed) benchmarks. The results of even this limited initial benchmark highlight a general problem with fixed-charge force fields in the representation low-dielectric environments, such as those seen in binding cavities or biological membranes.


Assuntos
Automação , Eletricidade , Armazenamento e Recuperação da Informação
19.
Proc Natl Acad Sci U S A ; 111(15): E1473-80, 2014 Apr 15.
Artigo em Inglês | MEDLINE | ID: mdl-24706812

RESUMO

Direct experimental measurements of conformational ensembles are critical for understanding macromolecular function, but traditional biophysical methods do not directly report the solution ensemble of a macromolecule. Small-angle X-ray scattering interferometry has the potential to overcome this limitation by providing the instantaneous distance distribution between pairs of gold-nanocrystal probes conjugated to a macromolecule in solution. Our X-ray interferometry experiments reveal an increasing bend angle of DNA duplexes with bulges of one, three, and five adenosine residues, consistent with previous FRET measurements, and further reveal an increasingly broad conformational ensemble with increasing bulge length. The distance distributions for the AAA bulge duplex (3A-DNA) with six different Au-Au pairs provide strong evidence against a simple elastic model in which fluctuations occur about a single conformational state. Instead, the measured distance distributions suggest a 3A-DNA ensemble with multiple conformational states predominantly across a region of conformational space with bend angles between 24 and 85 degrees and characteristic bend directions and helical twists and displacements. Additional X-ray interferometry experiments revealed perturbations to the ensemble from changes in ionic conditions and the bulge sequence, effects that can be understood in terms of electrostatic and stacking contributions to the ensemble and that demonstrate the sensitivity of X-ray interferometry. Combining X-ray interferometry ensemble data with molecular dynamics simulations gave atomic-level models of representative conformational states and of the molecular interactions that may shape the ensemble, and fluorescence measurements with 2-aminopurine-substituted 3A-DNA provided initial tests of these atomistic models. More generally, X-ray interferometry will provide powerful benchmarks for testing and developing computational methods.


Assuntos
DNA/química , Modelos Moleculares , Nanoestruturas/química , Conformação de Ácido Nucleico , Biofísica/métodos , Fluorescência , Ouro , Interferometria/métodos , Simulação de Dinâmica Molecular , Espalhamento a Baixo Ângulo
20.
Biophys J ; 106(6): 1359-70, 2014 Mar 18.
Artigo em Inglês | MEDLINE | ID: mdl-24655511

RESUMO

The folding mechanism of the N-terminal domain of ribosomal protein L9 (NTL91-39) is studied using temperature-jump (T-jump) amide I' two-dimensional infrared (2D IR) spectroscopy in combination with spectral simulations based on a Markov state model (MSM) built from millisecond-long molecular dynamics trajectories. The results provide evidence for a compact well-structured folded state and a heterogeneous fast-exchanging denatured state ensemble exhibiting residual secondary structure. The folding rate of 26.4 µs(-1) (at 80°C), extracted from the T-jump response of NTL91-39, compares favorably with the 18 µs(-1) obtained from the MSM. Structural decomposition of the MSM and analysis along the folding coordinate indicates that helix-formation nucleates the global folding. Simulated difference spectra, corresponding to the global folding transition of the MSM, are in qualitative agreement with measured T-jump 2D IR spectra. The experiments demonstrate the use of T-jump 2D IR spectroscopy as a valuable tool for studying protein folding, with direct connections to simulations. The results suggest that in addition to predicting the correct native structure and folding time constant, molecular dynamics simulations carried out with modern force fields provide an accurate description of folding mechanisms in small proteins.


Assuntos
Dobramento de Proteína , Proteínas Ribossômicas/química , Sequência de Aminoácidos , Interpretação Estatística de Dados , Cadeias de Markov , Simulação de Dinâmica Molecular , Dados de Sequência Molecular , Estrutura Terciária de Proteína , Espectroscopia de Infravermelho com Transformada de Fourier
SELEÇÃO DE REFERÊNCIAS
DETALHE DA PESQUISA
...