Your browser doesn't support javascript.
loading
Mostrar: 20 | 50 | 100
Resultados 1 - 16 de 16
Filtrar
1.
Bioinformatics ; 37(22): 4180-4186, 2021 11 18.
Artigo em Inglês | MEDLINE | ID: mdl-34117883

RESUMO

MOTIVATION: Experimental findings on genetic disease mechanisms are scattered throughout the literature and represented in many ways, including unstructured text, cartoons, pathway diagrams and network graphs. Integration and structuring of such mechanistic information greatly enhances its utility. RESULTS: MecCog is a graphical framework for building integrated representations (mechanism schemas) of mechanisms by which a genetic variant causes a disease phenotype. A MecCog mechanism schema displays the propagation of system perturbations across stages of biological organization, using graphical notations to symbolize perturbed entities and activities, hyperlinked evidence tagging, a mechanism ontology and depiction of knowledge gaps, ambiguities and uncertainties. The web platform enables a user to construct, store, publish, browse, query and comment on schemas. MecCog facilitates the identification of potential biomarkers, therapeutic intervention sites and critical future experiments. AVAILABILITY AND IMPLEMENTATION: The MecCog framework is freely available at http://www.meccog.org. SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online.


Assuntos
Doenças Genéticas Inatas , Fenótipo , Biologia Computacional
2.
Hum Mutat ; 41(2): 347-362, 2020 02.
Artigo em Inglês | MEDLINE | ID: mdl-31680375

RESUMO

Precise identification of causative variants from whole-genome sequencing data, including both coding and noncoding variants, is challenging. The Critical Assessment of Genome Interpretation 5 SickKids clinical genome challenge provided an opportunity to assess our ability to extract such information. Participants in the challenge were required to match each of the 24 whole-genome sequences to the correct phenotypic profile and to identify the disease class of each genome. These are all rare disease cases that have resisted genetic diagnosis in a state-of-the-art pipeline. The patients have a range of eye, neurological, and connective-tissue disorders. We used a gene-centric approach to address this problem, assigning each gene a multiphenotype-matching score. Mutations in the top-scoring genes for each phenotype profile were ranked on a 6-point scale of pathogenicity probability, resulting in an approximately equal number of top-ranked coding and noncoding candidate variants overall. We were able to assign the correct disease class for 12 cases and the correct genome to a clinical profile for five cases. The challenge assessor found genes in three of these five cases as likely appropriate. In the postsubmission phase, after careful screening of the genes in the correct genome, we identified additional potential diagnostic variants, a high proportion of which are noncoding.


Assuntos
Estudos de Associação Genética/métodos , Doenças Genéticas Inatas/diagnóstico , Doenças Genéticas Inatas/genética , Predisposição Genética para Doença , Genoma Humano , Genômica/métodos , Doenças Raras , Algoritmos , Alelos , Variação Genética , Estudo de Associação Genômica Ampla/métodos , Genótipo , Humanos , Modelos Teóricos , Fenótipo , Sequenciamento Completo do Genoma , Fluxo de Trabalho
3.
Hum Mutat ; 40(9): 1495-1506, 2019 09.
Artigo em Inglês | MEDLINE | ID: mdl-31184403

RESUMO

Thermodynamic stability is a fundamental property shared by all proteins. Changes in stability due to mutation are a widespread molecular mechanism in genetic diseases. Methods for the prediction of mutation-induced stability change have typically been developed and evaluated on incomplete and/or biased data sets. As part of the Critical Assessment of Genome Interpretation, we explored the utility of high-throughput variant stability profiling (VSP) assay data as an alternative for the assessment of computational methods and evaluated state-of-the-art predictors against over 7,000 nonsynonymous variants from two proteins. We found that predictions were modestly correlated with actual experimental values. Predictors fared better when evaluated as classifiers of extreme stability effects. While different methods emerging as top performers depending on the metric, it is nontrivial to draw conclusions on their adoption or improvement. Our analyses revealed that only 16% of all variants in VSP assays could be confidently defined as stability-affecting. Furthermore, it is unclear as to what extent VSP abundance scores were reasonable proxies for the stability-related quantities that participating methods were designed to predict. Overall, our observations underscore the need for clearly defined objectives when developing and using both computational and experimental methods in the context of measuring variant impact.


Assuntos
Biologia Computacional/métodos , Metiltransferases/química , Mutação , PTEN Fosfo-Hidrolase/química , Sequenciamento de Nucleotídeos em Larga Escala , Humanos , Metiltransferases/genética , PTEN Fosfo-Hidrolase/genética , Estabilidade Proteica
4.
Hum Mutat ; 40(9): 1330-1345, 2019 09.
Artigo em Inglês | MEDLINE | ID: mdl-31144778

RESUMO

The Critical Assessment of Genome Interpretation-5 intellectual disability challenge asked to use computational methods to predict patient clinical phenotypes and the causal variant(s) based on an analysis of their gene panel sequence data. Sequence data for 74 genes associated with intellectual disability (ID) and/or autism spectrum disorders (ASD) from a cohort of 150 patients with a range of neurodevelopmental manifestations (i.e. ID, autism, epilepsy, microcephaly, macrocephaly, hypotonia, ataxia) have been made available for this challenge. For each patient, predictors had to report the causative variants and which of the seven phenotypes were present. Since neurodevelopmental disorders are characterized by strong comorbidity, tested individuals often present more than one pathological condition. Considering the overall clinical manifestation of each patient, the correct phenotype has been predicted by at least one group for 93 individuals (62%). ID and ASD were the best predicted among the seven phenotypic traits. Also, causative or potentially pathogenic variants were predicted correctly by at least one group. However, the prediction of the correct causative variant seems to be insufficient to predict the correct phenotype. In some cases, the correct prediction has been supported by rare or common variants in genes different from the causative one.


Assuntos
Transtorno do Espectro Autista/genética , Biologia Computacional/métodos , Deficiência Intelectual/genética , Análise de Sequência de DNA/métodos , Feminino , Predisposição Genética para Doença , Humanos , Masculino , Fenótipo , Locos de Características Quantitativas
5.
Hum Mutat ; 40(9): 1373-1391, 2019 09.
Artigo em Inglês | MEDLINE | ID: mdl-31322791

RESUMO

Whole-genome sequencing (WGS) holds great potential as a diagnostic test. However, the majority of patients currently undergoing WGS lack a molecular diagnosis, largely due to the vast number of undiscovered disease genes and our inability to assess the pathogenicity of most genomic variants. The CAGI SickKids challenges attempted to address this knowledge gap by assessing state-of-the-art methods for clinical phenotype prediction from genomes. CAGI4 and CAGI5 participants were provided with WGS data and clinical descriptions of 25 and 24 undiagnosed patients from the SickKids Genome Clinic Project, respectively. Predictors were asked to identify primary and secondary causal variants. In addition, for CAGI5, groups had to match each genome to one of three disorder categories (neurologic, ophthalmologic, and connective), and separately to each patient. The performance of matching genomes to categories was no better than random but two groups performed significantly better than chance in matching genomes to patients. Two of the ten variants proposed by two groups in CAGI4 were deemed to be diagnostic, and several proposed pathogenic variants in CAGI5 are good candidates for phenotype expansion. We discuss implications for improving in silico assessment of genomic variants and identifying new disease genes.


Assuntos
Biologia Computacional/métodos , Variação Genética , Doenças não Diagnosticadas/diagnóstico , Adolescente , Criança , Pré-Escolar , Simulação por Computador , Bases de Dados Genéticas , Feminino , Predisposição Genética para Doença , Humanos , Masculino , Fenótipo , Doenças não Diagnosticadas/genética , Sequenciamento Completo do Genoma
6.
Hum Mutat ; 40(9): 1519-1529, 2019 09.
Artigo em Inglês | MEDLINE | ID: mdl-31342580

RESUMO

The NAGLU challenge of the fourth edition of the Critical Assessment of Genome Interpretation experiment (CAGI4) in 2016, invited participants to predict the impact of variants of unknown significance (VUS) on the enzymatic activity of the lysosomal hydrolase α-N-acetylglucosaminidase (NAGLU). Deficiencies in NAGLU activity lead to a rare, monogenic, recessive lysosomal storage disorder, Sanfilippo syndrome type B (MPS type IIIB). This challenge attracted 17 submissions from 10 groups. We observed that top models were able to predict the impact of missense mutations on enzymatic activity with Pearson's correlation coefficients of up to .61. We also observed that top methods were significantly more correlated with each other than they were with observed enzymatic activity values, which we believe speaks to the importance of sequence conservation across the different methods. Improved functional predictions on the VUS will help population-scale analysis of disease epidemiology and rare variant association analysis.


Assuntos
Acetilglucosaminidase/metabolismo , Biologia Computacional/métodos , Mutação de Sentido Incorreto , Acetilglucosaminidase/genética , Humanos , Modelos Genéticos , Análise de Regressão
7.
PLoS Comput Biol ; 14(12): e1006540, 2018 12.
Artigo em Inglês | MEDLINE | ID: mdl-30586388

RESUMO

Mechanism is a widely used concept in biology. In 2017, more than 10% of PubMed abstracts used the term. Therefore, searching for and reasoning about mechanisms is fundamental to much of biomedical research, but until now there has been almost no computational infrastructure for this purpose. Recent work in the philosophy of science has explored the central role that the search for mechanistic accounts of biological phenomena plays in biomedical research, providing a conceptual basis for representing and analyzing biological mechanism. The foundational categories for components of mechanisms-entities and activities-guide the development of general, abstract types of biological mechanism parts. Building on that analysis, we have developed a formal framework for describing and representing biological mechanism, MecCog, and applied it to describing mechanisms underlying human genetic disease. Mechanisms are depicted using a graphical notation. Key features are assignment of mechanism components to stages of biological organization and classes; visual representation of uncertainty, ignorance, and ambiguity; and tight integration with literature sources. The MecCog framework facilitates analysis of many aspects of disease mechanism, including the prioritization of future experiments, probing of gene-drug and gene-environment interactions, identification of possible new drug targets, personalized drug choice, analysis of nonlinear interactions between relevant genetic loci, and classification of diseases based on mechanism.


Assuntos
Classificação/métodos , Biologia Computacional/métodos , Doença/classificação , Fenômenos Biológicos , Pesquisa Biomédica , Biologia Computacional/normas , Bases de Dados Factuais , Humanos , Fenômenos Fisiológicos/fisiologia
8.
Hum Mutat ; 38(9): 1169-1181, 2017 09.
Artigo em Inglês | MEDLINE | ID: mdl-28512736

RESUMO

Compared with earlier more restricted sequencing technologies, identification of rare disease variants using whole-genome sequence has the possibility of finding all causative variants, but issues of data quality and an overwhelming level of background variants complicate the analysis. The CAGI4 SickKids clinical genome challenge provided an opportunity to assess the landscape of variants found in a difficult set of 25 unsolved rare disease cases. To address the challenge, we developed a three-stage pipeline, first carefully analyzing data quality, then classifying high-quality gene-specific variants into seven categories, and finally examining each candidate variant for compatibility with the often complex phenotypes of these patients for final prioritization. Variants consistent with the phenotypes were found in 24 out of the 25 cases, and in a number of these, there are prioritized variants in multiple genes. Data quality analysis suggests that some of the selected variants are likely incorrect calls, complicating interpretation. The data providers followed up on three suggested variants with Sanger sequencing, and in one case, a prioritized variant was confirmed as likely causative by the referring physician, providing a diagnosis in a previously intractable case.


Assuntos
Variação Genética , Genômica/métodos , Doenças Raras/genética , Criança , Predisposição Genética para Doença , Humanos , Análise de Sequência de DNA , Software
9.
Hum Mutat ; 38(9): 1225-1234, 2017 09.
Artigo em Inglês | MEDLINE | ID: mdl-28512778

RESUMO

Understanding the basis of complex trait disease is a fundamental problem in human genetics. The CAGI Crohn's Exome challenges are providing insight into the adequacy of current disease models by requiring participants to identify which of a set of individuals has been diagnosed with the disease, given exome data. For the CAGI4 round, we developed a method that used the genotypes from exome sequencing data only to impute the status of genome wide association studies marker SNPs. We then used the imputed genotypes as input to several machine learning methods that had been trained to predict disease status from marker SNP information. We achieved the best performance using Naïve Bayes and with a consensus machine learning method, obtaining an area under the curve of 0.72, larger than other methods used in CAGI4. We also developed a model that incorporated the contribution from rare missense variants in the exome data, but this performed less well. Future progress is expected to come from the use of whole genome data rather than exomes.


Assuntos
Doença de Crohn/genética , Sequenciamento do Exoma/métodos , Polimorfismo de Nucleotídeo Único , Algoritmos , Área Sob a Curva , Marcadores Genéticos , Predisposição Genética para Doença , Estudo de Associação Genômica Ampla , Humanos , Aprendizado de Máquina , Fenótipo
10.
Hum Mutat ; 38(9): 1109-1122, 2017 09.
Artigo em Inglês | MEDLINE | ID: mdl-28544272

RESUMO

CAGI (Critical Assessment of Genome Interpretation) conducts community experiments to determine the state of the art in relating genotype to phenotype. Here, we report results obtained using newly developed ensemble methods to address two CAGI4 challenges: enzyme activity for population missense variants found in NAGLU (Human N-acetyl-glucosaminidase) and random missense mutations in Human UBE2I (Human SUMO E2 ligase), assayed in a high-throughput competitive yeast complementation procedure. The ensemble methods are effective, ranked second for SUMO-ligase and third for NAGLU, according to the CAGI independent assessors. However, in common with other methods used in CAGI, there are large discrepancies between predicted and experimental activities for a subset of variants. Analysis of the structural context provides some insight into these. Post-challenge analysis shows that the ensemble methods are also effective at assigning pathogenicity for the NAGLU variants. In the clinic, providing an estimate of the reliability of pathogenic assignments is the key. We have also used the NAGLU dataset to show that ensemble methods have considerable potential for this task, and are already reliable enough for use with a subset of mutations.


Assuntos
Acetilglucosaminidase/genética , Biologia Computacional/métodos , Mutação de Sentido Incorreto , Enzimas de Conjugação de Ubiquitina/genética , Bases de Dados Genéticas , Humanos , Aprendizado de Máquina , Fenótipo , Curva ROC , Reprodutibilidade dos Testes
11.
Hum Mutat ; 38(9): 1201-1216, 2017 09.
Artigo em Inglês | MEDLINE | ID: mdl-28497567

RESUMO

The use of gene panel sequence for diagnostic and prognostic testing is now widespread, but there are so far few objective tests of methods to interpret these data. We describe the design and implementation of a gene panel sequencing data analysis pipeline (VarP) and its assessment in a CAGI4 community experiment. The method was applied to clinical gene panel sequencing data of 106 patients, with the goal of determining which of 14 disease classes each patient has and the corresponding causative variant(s). The disease class was correctly identified for 36 cases, including 10 where the original clinical pipeline did not find causative variants. For a further seven cases, we found strong evidence of an alternative disease to that tested. Many of the potentially causative variants are missense, with no previous association with disease, and these proved the hardest to correctly assign pathogenicity or otherwise. Post analysis showed that three-dimensional structure data could have helped for up to half of these cases. Over-reliance on HGMD annotation led to a number of incorrect disease assignments. We used a largely ad hoc method to assign probabilities of pathogenicity for each variant, and there is much work still to be done in this area.


Assuntos
Doença/classificação , Sequenciamento do Exoma/métodos , Variação Genética , Sequenciamento de Nucleotídeos em Larga Escala/métodos , Biologia Computacional , Bases de Dados Genéticas , Doença/genética , Predisposição Genética para Doença , Humanos , Modelos Moleculares , Mutação de Sentido Incorreto , Fenótipo , Proteínas/química , Proteínas/genética
12.
Hum Mutat ; 38(9): 1155-1168, 2017 09.
Artigo em Inglês | MEDLINE | ID: mdl-28397312

RESUMO

The CAGI-4 Hopkins clinical panel challenge was an attempt to assess state-of-the-art methods for clinical phenotype prediction from DNA sequence. Participants were provided with exonic sequences of 83 genes for 106 patients from the Johns Hopkins DNA Diagnostic Laboratory. Five groups participated in the challenge, predicting both the probability that each patient had each of the 14 possible classes of disease, as well as one or more causal variants. In cases where the Hopkins laboratory reported a variant, at least one predictor correctly identified the disease class in 36 of the 43 patients (84%). Even in cases where the Hopkins laboratory did not find a variant, at least one predictor correctly identified the class in 39 of the 63 patients (62%). Each prediction group correctly diagnosed at least one patient that was not successfully diagnosed by any other group. We discuss the causal variant predictions by different groups and their implications for further development of methods to assess variants of unknown significance. Our results suggest that clinically relevant variants may be missed when physicians order small panels targeted on a specific phenotype. We also quantify the false-positive rate of DNA-guided analysis in the absence of prior phenotypic indication.


Assuntos
Biologia Computacional/métodos , Análise de Sequência de DNA/métodos , Bases de Dados Genéticas , Predisposição Genética para Doença , Testes Genéticos , Humanos , Fenótipo
13.
Hum Mutat ; 38(9): 1182-1192, 2017 09.
Artigo em Inglês | MEDLINE | ID: mdl-28634997

RESUMO

Precision medicine aims to predict a patient's disease risk and best therapeutic options by using that individual's genetic sequencing data. The Critical Assessment of Genome Interpretation (CAGI) is a community experiment consisting of genotype-phenotype prediction challenges; participants build models, undergo assessment, and share key findings. For CAGI 4, three challenges involved using exome-sequencing data: Crohn's disease, bipolar disorder, and warfarin dosing. Previous CAGI challenges included prior versions of the Crohn's disease challenge. Here, we discuss the range of techniques used for phenotype prediction as well as the methods used for assessing predictive models. Additionally, we outline some of the difficulties associated with making predictions and evaluating them. The lessons learned from the exome challenges can be applied to both research and clinical efforts to improve phenotype prediction from genotype. In addition, these challenges serve as a vehicle for sharing clinical and research exome data in a secure manner with scientists who have a broad range of expertise, contributing to a collaborative effort to advance our understanding of genotype-phenotype relationships.


Assuntos
Transtorno Bipolar/genética , Doença de Crohn/genética , Sequenciamento do Exoma/métodos , Medicina de Precisão/métodos , Varfarina/uso terapêutico , Biologia Computacional/métodos , Bases de Dados Genéticas , Predisposição Genética para Doença , Humanos , Disseminação de Informação , Variantes Farmacogenômicos , Fenótipo , Varfarina/farmacologia
14.
J Clin Immunol ; 35(2): 227-33, 2015 Feb.
Artigo em Inglês | MEDLINE | ID: mdl-25677497

RESUMO

PURPOSE: Severe combined immunodeficiency (SCID) encompasses a group of disorders characterized by reduced or absent T-cell number and function and identified by newborn screening utilizing T-cell receptor excision circles (TRECs). This screening has also identified infants with T lymphopenia who lack mutations in typical SCID genes. We report an infant with low TRECs and non-SCID T lymphopenia, who proved upon whole exome sequencing to have Nijmegen breakage syndrome (NBS). METHODS: Exome sequencing of DNA from the infant and his parents was performed. Genomic analysis revealed deleterious variants in the NBN gene. Confirmatory testing included Sanger sequencing and immunoblotting and radiosensitivity testing of patient lymphocytes. RESULTS: Two novel nonsense mutations in NBN were identified in genomic DNA from the family. Immunoblotting showed absence of nibrin protein. A colony survival assay demonstrated radiosensitivity comparable to patients with ataxia telangiectasia. CONCLUSIONS: Although TREC screening was developed to identify newborns with SCID, it has also identified T lymphopenic disorders that may not otherwise be diagnosed until later in life. Timely identification of an infant with T lymphopenia allowed for prompt pursuit of underlying etiology, making possible a diagnosis of NBS, genetic counseling, and early intervention to minimize complications.


Assuntos
Triagem Neonatal , Síndrome de Quebra de Nijmegen/diagnóstico , Síndrome de Quebra de Nijmegen/genética , Receptores de Antígenos de Linfócitos T/genética , Linfócitos T/metabolismo , Proteínas de Ciclo Celular/genética , Proteínas de Ciclo Celular/metabolismo , DNA Circular , Exoma , Rearranjo Gênico do Linfócito T , Sequenciamento de Nucleotídeos em Larga Escala , Humanos , Lactente , Recém-Nascido , Masculino , Síndrome de Quebra de Nijmegen/imunologia , Proteínas Nucleares/genética , Proteínas Nucleares/metabolismo , Linfócitos T/imunologia
15.
Nat Med ; 29(6): 1389-1399, 2023 Jun.
Artigo em Inglês | MEDLINE | ID: mdl-37322116

RESUMO

Despite no apparent defects in T cell priming and recruitment to tumors, a large subset of T cell rich tumors fail to respond to immune checkpoint blockade (ICB). We leveraged a neoadjuvant anti-PD-1 trial in patients with hepatocellular carcinoma (HCC), as well as additional samples collected from patients treated off-label, to explore correlates of response to ICB within T cell-rich tumors. We show that ICB response correlated with the clonal expansion of intratumoral CXCL13+CH25H+IL-21+PD-1+CD4+ T helper cells ("CXCL13+ TH") and Granzyme K+ PD-1+ effector-like CD8+ T cells, whereas terminally exhausted CD39hiTOXhiPD-1hiCD8+ T cells dominated in nonresponders. CD4+ and CD8+ T cell clones that expanded post-treatment were found in pretreatment biopsies. Notably, PD-1+TCF-1+ (Progenitor-exhausted) CD8+ T cells shared clones mainly with effector-like cells in responders or terminally exhausted cells in nonresponders, suggesting that local CD8+ T cell differentiation occurs upon ICB. We found that these Progenitor CD8+ T cells interact with CXCL13+ TH within cellular triads around dendritic cells enriched in maturation and regulatory molecules, or "mregDC". These results suggest that discrete intratumoral niches that include mregDC and CXCL13+ TH control the differentiation of tumor-specific Progenitor exhasuted CD8+ T cells following ICB.


Assuntos
Carcinoma Hepatocelular , Neoplasias Hepáticas , Humanos , Carcinoma Hepatocelular/tratamento farmacológico , Linfócitos T CD8-Positivos , Neoplasias Hepáticas/patologia , Receptor de Morte Celular Programada 1 , Linfócitos T Auxiliares-Indutores , Diferenciação Celular , Células Dendríticas/patologia
16.
Nat Med ; 26(9): 1392-1397, 2020 09.
Artigo em Inglês | MEDLINE | ID: mdl-32778825

RESUMO

Public health newborn screening (NBS) programs provide population-scale ascertainment of rare, treatable conditions that require urgent intervention. Tandem mass spectrometry (MS/MS) is currently used to screen newborns for a panel of rare inborn errors of metabolism (IEMs)1-4. The NBSeq project evaluated whole-exome sequencing (WES) as an innovative methodology for NBS. We obtained archived residual dried blood spots and data for nearly all IEM cases from the 4.5 million infants born in California between mid-2005 and 2013 and from some infants who screened positive by MS/MS, but were unaffected upon follow-up testing. WES had an overall sensitivity of 88% and specificity of 98.4%, compared to 99.0% and 99.8%, respectively for MS/MS, although effectiveness varied among individual IEMs. Thus, WES alone was insufficiently sensitive or specific to be a primary screen for most NBS IEMs. However, as a secondary test for infants with abnormal MS/MS screens, WES could reduce false-positive results, facilitate timely case resolution and in some instances even suggest more appropriate or specific diagnosis than that initially obtained. This study represents the largest, to date, sequencing effort of an entire population of IEM-affected cases, allowing unbiased assessment of current capabilities of WES as a tool for population screening.


Assuntos
Sequenciamento do Exoma/métodos , Exoma/genética , Erros Inatos do Metabolismo/diagnóstico , Erros Inatos do Metabolismo/genética , Triagem Neonatal/métodos , Testes Genéticos , Humanos , Recém-Nascido , Erros Inatos do Metabolismo/epidemiologia , Espectrometria de Massas em Tandem
SELEÇÃO DE REFERÊNCIAS
DETALHE DA PESQUISA