Your browser doesn't support javascript.
loading
Mostrar: 20 | 50 | 100
Resultados 1 - 13 de 13
Filtrar
1.
Mol Syst Biol ; 20(7): 825-844, 2024 Jul.
Artigo em Inglês | MEDLINE | ID: mdl-38849565

RESUMO

Nonsense and missense mutations in the transcription factor PAX6 cause a wide range of eye development defects, including aniridia, microphthalmia and coloboma. To understand how changes of PAX6:DNA binding cause these phenotypes, we combined saturation mutagenesis of the paired domain of PAX6 with a yeast one-hybrid (Y1H) assay in which expression of a PAX6-GAL4 fusion gene drives antibiotic resistance. We quantified binding of more than 2700 single amino-acid variants to two DNA sequence elements. Mutations in DNA-facing residues of the N-terminal subdomain and linker region were most detrimental, as were mutations to prolines and to negatively charged residues. Many variants caused sequence-specific molecular gain-of-function effects, including variants in position 71 that increased binding to the LE9 enhancer but decreased binding to a SELEX-derived binding site. In the absence of antibiotic selection, variants that retained DNA binding slowed yeast growth, likely because such variants perturbed the yeast transcriptome. Benchmarking against known patient variants and applying ACMG/AMP guidelines to variant classification, we obtained supporting-to-moderate evidence that 977 variants are likely pathogenic and 1306 are likely benign. Our analysis shows that most pathogenic mutations in the paired domain of PAX6 can be explained simply by the effects of these mutations on PAX6:DNA association, and establishes Y1H as a generalisable assay for the interpretation of variant effects in transcription factors.


Assuntos
DNA , Fator de Transcrição PAX6 , Fator de Transcrição PAX6/genética , Fator de Transcrição PAX6/metabolismo , Humanos , DNA/genética , DNA/metabolismo , Sítios de Ligação , Ligação Proteica , Mutação , Técnicas do Sistema de Duplo-Híbrido , Saccharomyces cerevisiae/genética , Saccharomyces cerevisiae/metabolismo , Mutação de Sentido Incorreto , Fatores de Transcrição/genética , Fatores de Transcrição/metabolismo , Análise Mutacional de DNA
2.
Mol Syst Biol ; 19(8): e11474, 2023 08 08.
Artigo em Inglês | MEDLINE | ID: mdl-37310135

RESUMO

The assessment of variant effect predictor (VEP) performance is fraught with biases introduced by benchmarking against clinical observations. In this study, building on our previous work, we use independently generated measurements of protein function from deep mutational scanning (DMS) experiments for 26 human proteins to benchmark 55 different VEPs, while introducing minimal data circularity. Many top-performing VEPs are unsupervised methods including EVE, DeepSequence and ESM-1v, a protein language model that ranked first overall. However, the strong performance of recent supervised VEPs, in particular VARITY, shows that developers are taking data circularity and bias issues seriously. We also assess the performance of DMS and unsupervised VEPs for discriminating between known pathogenic and putatively benign missense variants. Our findings are mixed, demonstrating that some DMS datasets perform exceptionally at variant classification, while others are poor. Notably, we observe a striking correlation between VEP agreement with DMS data and performance in identifying clinically relevant variants, strongly supporting the validity of our rankings and the utility of DMS for independent benchmarking.


Assuntos
Benchmarking , Mutação de Sentido Incorreto , Humanos , Mutação , Proteínas/genética
3.
PLoS Comput Biol ; 18(2): e1009858, 2022 02.
Artigo em Inglês | MEDLINE | ID: mdl-35120134

RESUMO

The assembly of proteins into complexes and their interactions with other biomolecules are often vital for their biological function. While it is known that mutations at protein interfaces have a high potential to be damaging and cause human genetic disease, there has been relatively little consideration for how this varies between different types of interfaces. Here we investigate the properties of human pathogenic and putatively benign missense variants at homomeric (isologous and heterologous), heteromeric, DNA, RNA and other ligand interfaces, and at different regions in proteins with respect to those interfaces. We find that different types of interfaces vary greatly in their propensity to be associated with pathogenic mutations, with homomeric heterologous and DNA interfaces being particularly enriched in disease. We also find that residues that do not directly participate in an interface, but are close in three-dimensional space, show a significant disease enrichment. Finally, we observe that mutations at different types of interfaces tend to have distinct property changes when undergoing amino acid substitutions associated with disease, and that this is linked to substantial variability in their identification by computational variant effect predictors.


Assuntos
Mutação , Proteínas/genética , Humanos , Mapeamento de Interação de Proteínas
4.
BMC Ophthalmol ; 22(1): 140, 2022 Mar 26.
Artigo em Inglês | MEDLINE | ID: mdl-35346118

RESUMO

BACKGROUND: Usher Syndrome is the commonest cause of inherited blindness and deafness. The condition is clinically and genetically heterogeneous, with no current treatment. We report a case carrying novel biallelic variants in USH2A causing progressive early adolescent onset visual and hearing impairment consistent with Usher Syndrome Type IIA. CASE PRESENTATION: Our patient presented at age 13 with progressive visual field loss and hearing loss, associated with early onset of cataract in her 40s requiring lens extraction. Now 52 years old, latest best corrected visual acuity (BCVA) stands at Logmar Right Eye (RE) 0.8 and Left Eye (LE) 0.2, with significantly constricted visual fields bilaterally. She was registered partially sighted age 46. Clinical and molecular genetic assessment of the proband was consistent with a diagnosis of Usher Syndrome Type IIA. Genetic testing identified two novel USH2A variants, resulting in the premature termination codon p.Leu30Ter and a missense mutation p.Cys3251Tyr. Segregation analysis confirmed that these variants were biallelic in the affected case. Comprehensive in silico analysis confirmed that these mutations are the probable cause of Usher Syndrome Type IIA in this individual. CONCLUSIONS: The identification of novel mutations in USH2A increases the spectrum of genetic variations that lead to Usher Syndrome, aiding genetic diagnosis, assessment of patient prognosis, and emphasising the importance of genetic testing to identify new mutations in patients with undiagnosed progressive visual loss.


Assuntos
Síndromes de Usher , Adolescente , Códon sem Sentido , Proteínas da Matriz Extracelular/genética , Feminino , Humanos , Pessoa de Meia-Idade , Mutação , Mutação de Sentido Incorreto , Síndromes de Usher/diagnóstico , Síndromes de Usher/genética
5.
Mol Syst Biol ; 16(7): e9380, 2020 07.
Artigo em Inglês | MEDLINE | ID: mdl-32627955

RESUMO

To deal with the huge number of novel protein-coding variants identified by genome and exome sequencing studies, many computational variant effect predictors (VEPs) have been developed. Such predictors are often trained and evaluated using different variant data sets, making a direct comparison between VEPs difficult. In this study, we use 31 previously published deep mutational scanning (DMS) experiments, which provide quantitative, independent phenotypic measurements for large numbers of single amino acid substitutions, in order to benchmark and compare 46 different VEPs. We also evaluate the ability of DMS measurements and VEPs to discriminate between pathogenic and benign missense variants. We find that DMS experiments tend to be superior to the top-ranking predictors, demonstrating the tremendous potential of DMS for identifying novel human disease mutations. Among the VEPs, DeepSequence clearly stood out, showing both the strongest correlations with DMS data and having the best ability to predict pathogenic mutations, which is especially remarkable given that it is an unsupervised method. We further recommend SNAP2, DEOGEN2, SNPs&GO, SuSPect and REVEL based upon their performance in these analyses.


Assuntos
Substituição de Aminoácidos/genética , Biologia Computacional/métodos , Predisposição Genética para Doença , Proteínas/metabolismo , Benchmarking , Correlação de Dados , Cristalografia por Raios X , Bases de Dados Genéticas , Bases de Dados de Proteínas , Escherichia coli/genética , Escherichia coli/metabolismo , Genótipo , Sequenciamento de Nucleotídeos em Larga Escala , Humanos , Mutação de Sentido Incorreto , Fenótipo , Polimorfismo de Nucleotídeo Único , Domínios Proteicos/genética , Proteínas/química , Proteínas/genética , Saccharomyces cerevisiae/genética , Saccharomyces cerevisiae/metabolismo , Software
6.
Genet Med ; 22(3): 598-609, 2020 03.
Artigo em Inglês | MEDLINE | ID: mdl-31700164

RESUMO

PURPOSE: Most classical aniridia is caused by PAX6 haploinsufficiency. PAX6 missense variants can be hypomorphic or mimic haploinsufficiency. We hypothesized that missense variants also cause previously undescribed disease by altering the affinity and/or specificity of PAX6 genomic interactions. METHODS: We screened PAX6 in 372 individuals with bilateral microphthalmia, anophthalmia, or coloboma (MAC) from the Medical Research Council Human Genetics Unit eye malformation cohort (HGUeye) and reviewed data from the Deciphering Developmental Disorders study. We performed cluster analysis on PAX6-associated ocular phenotypes by variant type and molecular modeling of the structural impact of 86 different PAX6 causative missense variants. RESULTS: Eight different PAX6 missense variants were identified in 17 individuals (15 families) with MAC, accounting for 4% (15/372) of our cohort. Seven altered the paired domain (p.[Arg26Gln]x1, p.[Gly36Val]x1, p.[Arg38Trp]x2, p.[Arg38Gln]x1, p.[Gly51Arg]x2, p.[Ser54Arg]x2, p.[Asn124Lys]x5) and one the homeodomain (p.[Asn260Tyr]x1). p.Ser54Arg and p.Asn124Lys were exclusively associated with severe bilateral microphthalmia. MAC-associated variants were predicted to alter but not ablate DNA interaction, consistent with the electrophoretic mobility shifts observed using mutant paired domains with well-characterized PAX6-binding sites. We found no strong evidence for novel PAX6-associated extraocular disease. CONCLUSION: Altering the affinity and specificity of PAX6-binding genome-wide provides a plausible mechanism for the worse-than-null effects of MAC-associated missense variants.


Assuntos
Anormalidades do Olho/genética , Predisposição Genética para Doença , Microftalmia/genética , Fator de Transcrição PAX6/genética , Adolescente , Adulto , Sítios de Ligação/genética , Criança , Pré-Escolar , Estudos de Coortes , Proteínas de Ligação a DNA/genética , Anormalidades do Olho/patologia , Feminino , Heterozigoto , Humanos , Lactente , Masculino , Microftalmia/patologia , Mutação de Sentido Incorreto/genética , Linhagem , Adulto Jovem
7.
J Chem Phys ; 150(22): 225102, 2019 Jun 14.
Artigo em Inglês | MEDLINE | ID: mdl-31202237

RESUMO

A majority of cellular proteins function as part of multimeric complexes of two or more subunits. Multimer formation requires interactions between protein surfaces that lead to closed structures, such as dimers and tetramers. If proteins interact in an open-ended way, uncontrolled growth of fibrils can occur, which is likely to be detrimental in most cases. We present a statistical physics model that allows aggregation of proteins as either closed dimers or open fibrils of all lengths. We use pairwise amino-acid contact energies to calculate the energies of interacting protein surfaces. The probabilities of all possible aggregate configurations can be calculated for any given sequence of surface amino acids. We link the statistical physics model to a population genetics model that describes the evolution of the surface residues. When proteins evolve neutrally, without selection for or against multimer formation, we find that a majority of proteins remain as monomers at moderate concentrations, but strong dimer-forming or fibril-forming sequences are also possible. If selection is applied in favor of dimers or in favor of fibrils, then it is easy to select either dimer-forming or fibril-forming sequences. It is also possible to select for oriented fibrils with protein subunits all aligned in the same direction. We measure the propensities of amino acids to occur at interfaces relative to noninteracting surfaces and show that the propensities in our model are strongly correlated with those that have been measured in real protein structures. We also show that there are significant differences between amino acid frequencies at isologous and heterologous interfaces in our model, and we observe that similar effects occur in real protein structures.


Assuntos
Evolução Molecular , Modelos Biológicos , Agregados Proteicos , Multimerização Proteica , Proteínas/química , Aminoácidos/química , Cadeias de Markov , Método de Monte Carlo , Termodinâmica
8.
ArXiv ; 2024 Apr 16.
Artigo em Inglês | MEDLINE | ID: mdl-38699161

RESUMO

Computational methods for assessing the likely impacts of mutations, known as variant effect predictors (VEPs), are widely used in the assessment and interpretation of human genetic variation, as well as in other applications like protein engineering. Many different VEPs have been released to date, and there is tremendous variability in their underlying algorithms and outputs, and in the ways in which the methodologies and predictions are shared. This leads to considerable challenges for end users in knowing which VEPs to use and how to use them. Here, to address these issues, we provide guidelines and recommendations for the release of novel VEPs. Emphasising open-source availability, transparent methodologies, clear variant effect score interpretations, standardised scales, accessible predictions, and rigorous training data disclosure, we aim to improve the usability and interpretability of VEPs, and promote their integration into analysis and evaluation pipelines. We also provide a large, categorised list of currently available VEPs, aiming to facilitate the discovery and encourage the usage of novel methods within the scientific community.

9.
Protein Sci ; 32(7): e4688, 2023 07.
Artigo em Inglês | MEDLINE | ID: mdl-37243972

RESUMO

Many methodologically diverse computational methods have been applied to the growing challenge of predicting and interpreting the effects of protein variants. As many pathogenic mutations have a perturbing effect on protein stability or intermolecular interactions, one highly interpretable approach is to use protein structural information to model the physical impacts of variants and predict their likely effects on protein stability and interactions. Previous efforts have assessed the accuracy of stability predictors in reproducing thermodynamically accurate values and evaluated their ability to distinguish between known pathogenic and benign mutations. Here, we take an alternate approach, and explore how well stability predictor scores correlate with functional impacts derived from deep mutational scanning (DMS) experiments. In this work, we compare the predictions of 9 protein stability-based tools against mutant protein fitness values from 49 independent DMS datasets, covering 170,940 unique single amino acid variants. We find that FoldX and Rosetta show the strongest correlations with DMS-based functional scores, similar to their previous top performance in distinguishing between pathogenic and benign variants. For both methods, performance is considerably improved when considering intermolecular interactions from protein complex structures, when available. Furthermore, using these two predictors, we derive a "Foldetta" consensus score, which improves upon the performance of both, and manages to match dedicated variant effect predictors in reflecting variant functional impacts. Finally, we also highlight that predicted stability effects show consistently higher correlations with certain DMS experimental phenotypes, particularly those based upon protein abundance, and, in certain cases, can significantly outcompete sequence-based variant effect prediction methodologies for predicting functional scores from DMS experiments.


Assuntos
Aminoácidos , Proteínas , Mutação , Proteínas/genética , Proteínas/química , Aminoácidos/genética , Estabilidade Proteica
10.
Dis Model Mech ; 15(6)2022 06 01.
Artigo em Inglês | MEDLINE | ID: mdl-35736673

RESUMO

Computational predictors of genetic variant effect have advanced rapidly in recent years. These programs provide clinical and research laboratories with a rapid and scalable method to assess the likely impacts of novel variants. However, it can be difficult to know to what extent we can trust their results. To benchmark their performance, predictors are often tested against large datasets of known pathogenic and benign variants. These benchmarking data may overlap with the data used to train some supervised predictors, which leads to data re-use or circularity, resulting in inflated performance estimates for those predictors. Furthermore, new predictors are usually found by their authors to be superior to all previous predictors, which suggests some degree of computational bias in their benchmarking. Large-scale functional assays known as deep mutational scans provide one possible solution to this problem, providing independent datasets of variant effect measurements. In this Review, we discuss some of the key advances in predictor methodology, current benchmarking strategies and how data derived from deep mutational scans can be used to overcome the issue of data circularity. We also discuss the ability of such functional assays to directly predict clinical impacts of mutations and how this might affect the future need for variant effect predictors.


Assuntos
Biologia Computacional , Biologia Computacional/métodos , Mutação/genética
11.
Nat Commun ; 13(1): 3895, 2022 07 06.
Artigo em Inglês | MEDLINE | ID: mdl-35794153

RESUMO

Most known pathogenic mutations occur in protein-coding regions of DNA and change the way proteins are made. Taking protein structure into account has therefore provided great insight into the molecular mechanisms underlying human genetic disease. While there has been much focus on how mutations can disrupt protein structure and thus cause a loss of function (LOF), alternative mechanisms, specifically dominant-negative (DN) and gain-of-function (GOF) effects, are less understood. Here, we investigate the protein-level effects of pathogenic missense mutations associated with different molecular mechanisms. We observe striking differences between recessive vs dominant, and LOF vs non-LOF mutations, with dominant, non-LOF disease mutations having much milder effects on protein structure, and DN mutations being highly enriched at protein interfaces. We also find that nearly all computational variant effect predictors, even those based solely on sequence conservation, underperform on non-LOF mutations. However, we do show that non-LOF mutations could potentially be identified by their tendency to cluster in three-dimensional space. Overall, our work suggests that many pathogenic mutations that act via DN and GOF mechanisms are likely being missed by current variant prioritisation strategies, but that there is considerable scope to improve computational predictions through consideration of molecular disease mechanisms.


Assuntos
Mutação com Ganho de Função , Mutação de Sentido Incorreto , Humanos , Mutação , Fases de Leitura Aberta
12.
Nat Med ; 28(7): 1439-1446, 2022 07.
Artigo em Inglês | MEDLINE | ID: mdl-35788175

RESUMO

Clonal hematopoiesis of indeterminate potential (CHIP) increases rapidly in prevalence beyond age 60 and has been associated with increased risk for malignancy, heart disease and ischemic stroke. CHIP is driven by somatic mutations in hematopoietic stem and progenitor cells (HSPCs). Because mutations in HSPCs often drive leukemia, we hypothesized that HSPC fitness substantially contributes to transformation from CHIP to leukemia. HSPC fitness is defined as the proliferative advantage over cells carrying no or only neutral mutations. If mutations in different genes lead to distinct fitness advantages, this could enable patient stratification. We quantified the fitness effects of mutations over 12 years in older age using longitudinal sequencing and developed a filtering method that considers individual mutational context alongside mutation co-occurrence to quantify the growth potential of variants within individuals. We found that gene-specific fitness differences can outweigh inter-individual variation and, therefore, could form the basis for personalized clinical management.


Assuntos
Hematopoese , Leucemia , Hematopoiese Clonal , Hematopoese/genética , Células-Tronco Hematopoéticas/patologia , Humanos , Leucemia/patologia , Pessoa de Meia-Idade , Mutação/genética
13.
Nat Genet ; 55(9): 1426-1427, 2023 09.
Artigo em Inglês | MEDLINE | ID: mdl-37563330

Assuntos
Encéfalo , Idioma
SELEÇÃO DE REFERÊNCIAS
Detalhe da pesquisa