Search | VHL Regional Portal

1.

Intratumoral dendritic cell-CD4⁺ T helper cell niches enable CD8⁺ T cell differentiation following PD-1 blockade in hepatocellular carcinoma.

Magen, Assaf; Hamon, Pauline; Fiaschi, Nathalie; Soong, Brian Y; Park, Matthew D; Mattiuz, Raphaël; Humblin, Etienne; Troncoso, Leanna; D'souza, Darwin; Dawson, Travis; Kim, Joel; Hamel, Steven; Buckup, Mark; Chang, Christie; Tabachnikova, Alexandra; Schwartz, Hara; Malissen, Nausicaa; Lavin, Yonit; Soares-Schanoski, Alessandra; Giotti, Bruno; Hegde, Samarth; Ioannou, Giorgio; Gonzalez-Kozlova, Edgar; Hennequin, Clotilde; Le Berichel, Jessica; Zhao, Zhen; Ward, Stephen C; Fiel, Isabel; Kou, Baijun; Dobosz, Michael; Li, Lianjie; Adler, Christina; Ni, Min; Wei, Yi; Wang, Wei; Atwal, Gurinder S; Kundu, Kunal; Cygan, Kamil J; Tsankov, Alexander M; Rahman, Adeeb; Price, Colles; Fernandez, Nicolas; He, Jiang; Gupta, Namita T; Kim-Schulze, Seunghee; Gnjatic, Sacha; Kenigsberg, Ephraim; Deering, Raquel P; Schwartz, Myron; Marron, Thomas U.

Nat Med ; 29(6): 1389-1399, 2023 Jun.

Article in English | MEDLINE | ID: mdl-37322116

ABSTRACT

Despite no apparent defects in T cell priming and recruitment to tumors, a large subset of T cell rich tumors fail to respond to immune checkpoint blockade (ICB). We leveraged a neoadjuvant anti-PD-1 trial in patients with hepatocellular carcinoma (HCC), as well as additional samples collected from patients treated off-label, to explore correlates of response to ICB within T cell-rich tumors. We show that ICB response correlated with the clonal expansion of intratumoral CXCL13+CH25H+IL-21+PD-1+CD4+ T helper cells ("CXCL13+ TH") and Granzyme K+ PD-1+ effector-like CD8+ T cells, whereas terminally exhausted CD39hiTOXhiPD-1hiCD8+ T cells dominated in nonresponders. CD4+ and CD8+ T cell clones that expanded post-treatment were found in pretreatment biopsies. Notably, PD-1+TCF-1+ (Progenitor-exhausted) CD8+ T cells shared clones mainly with effector-like cells in responders or terminally exhausted cells in nonresponders, suggesting that local CD8+ T cell differentiation occurs upon ICB. We found that these Progenitor CD8+ T cells interact with CXCL13+ TH within cellular triads around dendritic cells enriched in maturation and regulatory molecules, or "mregDC". These results suggest that discrete intratumoral niches that include mregDC and CXCL13+ TH control the differentiation of tumor-specific Progenitor exhasuted CD8+ T cells following ICB.

Subject(s)

Carcinoma, Hepatocellular , Liver Neoplasms , Humans , Carcinoma, Hepatocellular/drug therapy , CD8-Positive T-Lymphocytes , Liver Neoplasms/pathology , Programmed Cell Death 1 Receptor , T-Lymphocytes, Helper-Inducer , Cell Differentiation , Dendritic Cells/pathology

2.

MecCog: a knowledge representation framework for genetic disease mechanism.

Kundu, Kunal; Darden, Lindley; Moult, John.

Bioinformatics ; 37(22): 4180-4186, 2021 11 18.

Article in English | MEDLINE | ID: mdl-34117883

ABSTRACT

MOTIVATION: Experimental findings on genetic disease mechanisms are scattered throughout the literature and represented in many ways, including unstructured text, cartoons, pathway diagrams and network graphs. Integration and structuring of such mechanistic information greatly enhances its utility. RESULTS: MecCog is a graphical framework for building integrated representations (mechanism schemas) of mechanisms by which a genetic variant causes a disease phenotype. A MecCog mechanism schema displays the propagation of system perturbations across stages of biological organization, using graphical notations to symbolize perturbed entities and activities, hyperlinked evidence tagging, a mechanism ontology and depiction of knowledge gaps, ambiguities and uncertainties. The web platform enables a user to construct, store, publish, browse, query and comment on schemas. MecCog facilitates the identification of potential biomarkers, therapeutic intervention sites and critical future experiments. AVAILABILITY AND IMPLEMENTATION: The MecCog framework is freely available at http://www.meccog.org. SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online.

Subject(s)

Genetic Diseases, Inborn , Phenotype , Computational Biology

3.

The role of exome sequencing in newborn screening for inborn errors of metabolism.

Adhikari, Aashish N; Gallagher, Renata C; Wang, Yaqiong; Currier, Robert J; Amatuni, George; Bassaganyas, Laia; Chen, Flavia; Kundu, Kunal; Kvale, Mark; Mooney, Sean D; Nussbaum, Robert L; Randi, Savanna S; Sanford, Jeremy; Shieh, Joseph T; Srinivasan, Rajgopal; Sunderam, Uma; Tang, Hao; Vaka, Dedeepya; Zou, Yangyun; Koenig, Barbara A; Kwok, Pui-Yan; Risch, Neil; Puck, Jennifer M; Brenner, Steven E.

Nat Med ; 26(9): 1392-1397, 2020 09.

Article in English | MEDLINE | ID: mdl-32778825

ABSTRACT

Public health newborn screening (NBS) programs provide population-scale ascertainment of rare, treatable conditions that require urgent intervention. Tandem mass spectrometry (MS/MS) is currently used to screen newborns for a panel of rare inborn errors of metabolism (IEMs)1-4. The NBSeq project evaluated whole-exome sequencing (WES) as an innovative methodology for NBS. We obtained archived residual dried blood spots and data for nearly all IEM cases from the 4.5 million infants born in California between mid-2005 and 2013 and from some infants who screened positive by MS/MS, but were unaffected upon follow-up testing. WES had an overall sensitivity of 88% and specificity of 98.4%, compared to 99.0% and 99.8%, respectively for MS/MS, although effectiveness varied among individual IEMs. Thus, WES alone was insufficiently sensitive or specific to be a primary screen for most NBS IEMs. However, as a secondary test for infants with abnormal MS/MS screens, WES could reduce false-positive results, facilitate timely case resolution and in some instances even suggest more appropriate or specific diagnosis than that initially obtained. This study represents the largest, to date, sequencing effort of an entire population of IEM-affected cases, allowing unbiased assessment of current capabilities of WES as a tool for population screening.

Subject(s)

Exome Sequencing/methods , Exome/genetics , Metabolism, Inborn Errors/diagnosis , Metabolism, Inborn Errors/genetics , Neonatal Screening/methods , Genetic Testing , Humans , Infant, Newborn , Metabolism, Inborn Errors/epidemiology , Tandem Mass Spectrometry

4.

Matching whole genomes to rare genetic disorders: Identification of potential causative variants using phenotype-weighted knowledge in the CAGI SickKids5 clinical genomes challenge.

Pal, Lipika R; Kundu, Kunal; Yin, Yizhou; Moult, John.

Hum Mutat ; 41(2): 347-362, 2020 02.

Article in English | MEDLINE | ID: mdl-31680375

ABSTRACT

Precise identification of causative variants from whole-genome sequencing data, including both coding and noncoding variants, is challenging. The Critical Assessment of Genome Interpretation 5 SickKids clinical genome challenge provided an opportunity to assess our ability to extract such information. Participants in the challenge were required to match each of the 24 whole-genome sequences to the correct phenotypic profile and to identify the disease class of each genome. These are all rare disease cases that have resisted genetic diagnosis in a state-of-the-art pipeline. The patients have a range of eye, neurological, and connective-tissue disorders. We used a gene-centric approach to address this problem, assigning each gene a multiphenotype-matching score. Mutations in the top-scoring genes for each phenotype profile were ranked on a 6-point scale of pathogenicity probability, resulting in an approximately equal number of top-ranked coding and noncoding candidate variants overall. We were able to assign the correct disease class for 12 cases and the correct genome to a clinical profile for five cases. The challenge assessor found genes in three of these five cases as likely appropriate. In the postsubmission phase, after careful screening of the genes in the correct genome, we identified additional potential diagnostic variants, a high proportion of which are noncoding.

Subject(s)

Genetic Association Studies/methods , Genetic Diseases, Inborn/diagnosis , Genetic Diseases, Inborn/genetics , Genetic Predisposition to Disease , Genome, Human , Genomics/methods , Rare Diseases , Algorithms , Alleles , Genetic Variation , Genome-Wide Association Study/methods , Genotype , Humans , Models, Theoretical , Phenotype , Whole Genome Sequencing , Workflow

5.

Assessment of predicted enzymatic activity of α-N-acetylglucosaminidase variants of unknown significance for CAGI 2016.

Clark, Wyatt T; Kasak, Laura; Bakolitsa, Constantina; Hu, Zhiqiang; Andreoletti, Gaia; Babbi, Giulia; Bromberg, Yana; Casadio, Rita; Dunbrack, Roland; Folkman, Lukas; Ford, Colby T; Jones, David; Katsonis, Panagiotis; Kundu, Kunal; Lichtarge, Olivier; Martelli, Pier L; Mooney, Sean D; Nodzak, Conor; Pal, Lipika R; Radivojac, Predrag; Savojardo, Castrense; Shi, Xinghua; Zhou, Yaoqi; Uppal, Aneeta; Xu, Qifang; Yin, Yizhou; Pejaver, Vikas; Wang, Meng; Wei, Liping; Moult, John; Yu, Guoying Karen; Brenner, Steven E; LeBowitz, Jonathan H.

Hum Mutat ; 40(9): 1519-1529, 2019 09.

Article in English | MEDLINE | ID: mdl-31342580

ABSTRACT

The NAGLU challenge of the fourth edition of the Critical Assessment of Genome Interpretation experiment (CAGI4) in 2016, invited participants to predict the impact of variants of unknown significance (VUS) on the enzymatic activity of the lysosomal hydrolase α-N-acetylglucosaminidase (NAGLU). Deficiencies in NAGLU activity lead to a rare, monogenic, recessive lysosomal storage disorder, Sanfilippo syndrome type B (MPS type IIIB). This challenge attracted 17 submissions from 10 groups. We observed that top models were able to predict the impact of missense mutations on enzymatic activity with Pearson's correlation coefficients of up to .61. We also observed that top methods were significantly more correlated with each other than they were with observed enzymatic activity values, which we believe speaks to the importance of sequence conservation across the different methods. Improved functional predictions on the VUS will help population-scale analysis of disease epidemiology and rare variant association analysis.

Subject(s)

Acetylglucosaminidase/metabolism , Computational Biology/methods , Mutation, Missense , Acetylglucosaminidase/genetics , Humans , Models, Genetic , Regression Analysis

6.

CAGI SickKids challenges: Assessment of phenotype and variant predictions derived from clinical and genomic data of children with undiagnosed diseases.

Kasak, Laura; Hunter, Jesse M; Udani, Rupa; Bakolitsa, Constantina; Hu, Zhiqiang; Adhikari, Aashish N; Babbi, Giulia; Casadio, Rita; Gough, Julian; Guerrero, Rafael F; Jiang, Yuxiang; Joseph, Thomas; Katsonis, Panagiotis; Kotte, Sujatha; Kundu, Kunal; Lichtarge, Olivier; Martelli, Pier Luigi; Mooney, Sean D; Moult, John; Pal, Lipika R; Poitras, Jennifer; Radivojac, Predrag; Rao, Aditya; Sivadasan, Naveen; Sunderam, Uma; Saipradeep, V G; Yin, Yizhou; Zaucha, Jan; Brenner, Steven E; Meyn, M Stephen.

Hum Mutat ; 40(9): 1373-1391, 2019 09.

Article in English | MEDLINE | ID: mdl-31322791

ABSTRACT

Whole-genome sequencing (WGS) holds great potential as a diagnostic test. However, the majority of patients currently undergoing WGS lack a molecular diagnosis, largely due to the vast number of undiscovered disease genes and our inability to assess the pathogenicity of most genomic variants. The CAGI SickKids challenges attempted to address this knowledge gap by assessing state-of-the-art methods for clinical phenotype prediction from genomes. CAGI4 and CAGI5 participants were provided with WGS data and clinical descriptions of 25 and 24 undiagnosed patients from the SickKids Genome Clinic Project, respectively. Predictors were asked to identify primary and secondary causal variants. In addition, for CAGI5, groups had to match each genome to one of three disorder categories (neurologic, ophthalmologic, and connective), and separately to each patient. The performance of matching genomes to categories was no better than random but two groups performed significantly better than chance in matching genomes to patients. Two of the ten variants proposed by two groups in CAGI4 were deemed to be diagnostic, and several proposed pathogenic variants in CAGI5 are good candidates for phenotype expansion. We discuss implications for improving in silico assessment of genomic variants and identifying new disease genes.

Subject(s)

Computational Biology/methods , Genetic Variation , Undiagnosed Diseases/diagnosis , Adolescent , Child , Child, Preschool , Computer Simulation , Databases, Genetic , Female , Genetic Predisposition to Disease , Humans , Male , Phenotype , Undiagnosed Diseases/genetics , Whole Genome Sequencing

7.

Assessment of methods for predicting the effects of PTEN and TPMT protein variants.

Pejaver, Vikas; Babbi, Giulia; Casadio, Rita; Folkman, Lukas; Katsonis, Panagiotis; Kundu, Kunal; Lichtarge, Olivier; Martelli, Pier Luigi; Miller, Maximilian; Moult, John; Pal, Lipika R; Savojardo, Castrense; Yin, Yizhou; Zhou, Yaoqi; Radivojac, Predrag; Bromberg, Yana.

Hum Mutat ; 40(9): 1495-1506, 2019 09.

Article in English | MEDLINE | ID: mdl-31184403

ABSTRACT

Thermodynamic stability is a fundamental property shared by all proteins. Changes in stability due to mutation are a widespread molecular mechanism in genetic diseases. Methods for the prediction of mutation-induced stability change have typically been developed and evaluated on incomplete and/or biased data sets. As part of the Critical Assessment of Genome Interpretation, we explored the utility of high-throughput variant stability profiling (VSP) assay data as an alternative for the assessment of computational methods and evaluated state-of-the-art predictors against over 7,000 nonsynonymous variants from two proteins. We found that predictions were modestly correlated with actual experimental values. Predictors fared better when evaluated as classifiers of extreme stability effects. While different methods emerging as top performers depending on the metric, it is nontrivial to draw conclusions on their adoption or improvement. Our analyses revealed that only 16% of all variants in VSP assays could be confidently defined as stability-affecting. Furthermore, it is unclear as to what extent VSP abundance scores were reasonable proxies for the stability-related quantities that participating methods were designed to predict. Overall, our observations underscore the need for clearly defined objectives when developing and using both computational and experimental methods in the context of measuring variant impact.

Subject(s)

Computational Biology/methods , Methyltransferases/chemistry , Mutation , PTEN Phosphohydrolase/chemistry , High-Throughput Nucleotide Sequencing , Humans , Methyltransferases/genetics , PTEN Phosphohydrolase/genetics , Protein Stability

8.

Assessment of patient clinical descriptions and pathogenic variants from gene panel sequences in the CAGI-5 intellectual disability challenge.

Carraro, Marco; Monzon, Alexander Miguel; Chiricosta, Luigi; Reggiani, Francesco; Aspromonte, Maria Cristina; Bellini, Mariagrazia; Pagel, Kymberleigh; Jiang, Yuxiang; Radivojac, Predrag; Kundu, Kunal; Pal, Lipika R; Yin, Yizhou; Limongelli, Ivan; Andreoletti, Gaia; Moult, John; Wilson, Stephen J; Katsonis, Panagiotis; Lichtarge, Olivier; Chen, Jingqi; Wang, Yaqiong; Hu, Zhiqiang; Brenner, Steven E; Ferrari, Carlo; Murgia, Alessandra; Tosatto, Silvio C E; Leonardi, Emanuela.

Hum Mutat ; 40(9): 1330-1345, 2019 09.

Article in English | MEDLINE | ID: mdl-31144778

ABSTRACT

The Critical Assessment of Genome Interpretation-5 intellectual disability challenge asked to use computational methods to predict patient clinical phenotypes and the causal variant(s) based on an analysis of their gene panel sequence data. Sequence data for 74 genes associated with intellectual disability (ID) and/or autism spectrum disorders (ASD) from a cohort of 150 patients with a range of neurodevelopmental manifestations (i.e. ID, autism, epilepsy, microcephaly, macrocephaly, hypotonia, ataxia) have been made available for this challenge. For each patient, predictors had to report the causative variants and which of the seven phenotypes were present. Since neurodevelopmental disorders are characterized by strong comorbidity, tested individuals often present more than one pathological condition. Considering the overall clinical manifestation of each patient, the correct phenotype has been predicted by at least one group for 93 individuals (62%). ID and ASD were the best predicted among the seven phenotypic traits. Also, causative or potentially pathogenic variants were predicted correctly by at least one group. However, the prediction of the correct causative variant seems to be insufficient to predict the correct phenotype. In some cases, the correct prediction has been supported by rare or common variants in genes different from the causative one.

Subject(s)

Autism Spectrum Disorder/genetics , Computational Biology/methods , Intellectual Disability/genetics , Sequence Analysis, DNA/methods , Female , Genetic Predisposition to Disease , Humans , Male , Phenotype , Quantitative Trait Loci

9.

Harnessing formal concepts of biological mechanism to analyze human disease.

Darden, Lindley; Kundu, Kunal; Pal, Lipika R; Moult, John.

PLoS Comput Biol ; 14(12): e1006540, 2018 12.

Article in English | MEDLINE | ID: mdl-30586388

ABSTRACT

Mechanism is a widely used concept in biology. In 2017, more than 10% of PubMed abstracts used the term. Therefore, searching for and reasoning about mechanisms is fundamental to much of biomedical research, but until now there has been almost no computational infrastructure for this purpose. Recent work in the philosophy of science has explored the central role that the search for mechanistic accounts of biological phenomena plays in biomedical research, providing a conceptual basis for representing and analyzing biological mechanism. The foundational categories for components of mechanisms-entities and activities-guide the development of general, abstract types of biological mechanism parts. Building on that analysis, we have developed a formal framework for describing and representing biological mechanism, MecCog, and applied it to describing mechanisms underlying human genetic disease. Mechanisms are depicted using a graphical notation. Key features are assignment of mechanism components to stages of biological organization and classes; visual representation of uncertainty, ignorance, and ambiguity; and tight integration with literature sources. The MecCog framework facilitates analysis of many aspects of disease mechanism, including the prioritization of future experiments, probing of gene-drug and gene-environment interactions, identification of possible new drug targets, personalized drug choice, analysis of nonlinear interactions between relevant genetic loci, and classification of diseases based on mechanism.

Subject(s)

Classification/methods , Computational Biology/methods , Disease/classification , Biological Phenomena , Biomedical Research , Computational Biology/standards , Databases, Factual , Humans , Physiological Phenomena/physiology

10.

Working toward precision medicine: Predicting phenotypes from exomes in the Critical Assessment of Genome Interpretation (CAGI) challenges.

Daneshjou, Roxana; Wang, Yanran; Bromberg, Yana; Bovo, Samuele; Martelli, Pier L; Babbi, Giulia; Lena, Pietro Di; Casadio, Rita; Edwards, Matthew; Gifford, David; Jones, David T; Sundaram, Laksshman; Bhat, Rajendra Rana; Li, Xiaolin; Pal, Lipika R; Kundu, Kunal; Yin, Yizhou; Moult, John; Jiang, Yuxiang; Pejaver, Vikas; Pagel, Kymberleigh A; Li, Biao; Mooney, Sean D; Radivojac, Predrag; Shah, Sohela; Carraro, Marco; Gasparini, Alessandra; Leonardi, Emanuela; Giollo, Manuel; Ferrari, Carlo; Tosatto, Silvio C E; Bachar, Eran; Azaria, Johnathan R; Ofran, Yanay; Unger, Ron; Niroula, Abhishek; Vihinen, Mauno; Chang, Billy; Wang, Maggie H; Franke, Andre; Petersen, Britt-Sabina; Pirooznia, Mehdi; Zandi, Peter; McCombie, Richard; Potash, James B; Altman, Russ B; Klein, Teri E; Hoskins, Roger A; Repo, Susanna; Brenner, Steven E.

Hum Mutat ; 38(9): 1182-1192, 2017 09.

Article in English | MEDLINE | ID: mdl-28634997

ABSTRACT

Precision medicine aims to predict a patient's disease risk and best therapeutic options by using that individual's genetic sequencing data. The Critical Assessment of Genome Interpretation (CAGI) is a community experiment consisting of genotype-phenotype prediction challenges; participants build models, undergo assessment, and share key findings. For CAGI 4, three challenges involved using exome-sequencing data: Crohn's disease, bipolar disorder, and warfarin dosing. Previous CAGI challenges included prior versions of the Crohn's disease challenge. Here, we discuss the range of techniques used for phenotype prediction as well as the methods used for assessing predictive models. Additionally, we outline some of the difficulties associated with making predictions and evaluating them. The lessons learned from the exome challenges can be applied to both research and clinical efforts to improve phenotype prediction from genotype. In addition, these challenges serve as a vehicle for sharing clinical and research exome data in a secure manner with scientists who have a broad range of expertise, contributing to a collaborative effort to advance our understanding of genotype-phenotype relationships.

Subject(s)

Bipolar Disorder/genetics , Crohn Disease/genetics , Exome Sequencing/methods , Precision Medicine/methods , Warfarin/therapeutic use , Computational Biology/methods , Databases, Genetic , Genetic Predisposition to Disease , Humans , Information Dissemination , Pharmacogenomic Variants , Phenotype , Warfarin/pharmacology

11.

Ensemble variant interpretation methods to predict enzyme activity and assign pathogenicity in the CAGI4 NAGLU (Human N-acetyl-glucosaminidase) and UBE2I (Human SUMO-ligase) challenges.

Yin, Yizhou; Kundu, Kunal; Pal, Lipika R; Moult, John.

Hum Mutat ; 38(9): 1109-1122, 2017 09.

Article in English | MEDLINE | ID: mdl-28544272

ABSTRACT

CAGI (Critical Assessment of Genome Interpretation) conducts community experiments to determine the state of the art in relating genotype to phenotype. Here, we report results obtained using newly developed ensemble methods to address two CAGI4 challenges: enzyme activity for population missense variants found in NAGLU (Human N-acetyl-glucosaminidase) and random missense mutations in Human UBE2I (Human SUMO E2 ligase), assayed in a high-throughput competitive yeast complementation procedure. The ensemble methods are effective, ranked second for SUMO-ligase and third for NAGLU, according to the CAGI independent assessors. However, in common with other methods used in CAGI, there are large discrepancies between predicted and experimental activities for a subset of variants. Analysis of the structural context provides some insight into these. Post-challenge analysis shows that the ensemble methods are also effective at assigning pathogenicity for the NAGLU variants. In the clinic, providing an estimate of the reliability of pathogenic assignments is the key. We have also used the NAGLU dataset to show that ensemble methods have considerable potential for this task, and are already reliable enough for use with a subset of mutations.

Subject(s)

Acetylglucosaminidase/genetics , Computational Biology/methods , Mutation, Missense , Ubiquitin-Conjugating Enzymes/genetics , Databases, Genetic , Humans , Machine Learning , Phenotype , ROC Curve , Reproducibility of Results

12.

CAGI4 SickKids clinical genomes challenge: A pipeline for identifying pathogenic variants.

Pal, Lipika R; Kundu, Kunal; Yin, Yizhou; Moult, John.

Hum Mutat ; 38(9): 1169-1181, 2017 09.

Article in English | MEDLINE | ID: mdl-28512736

ABSTRACT

Compared with earlier more restricted sequencing technologies, identification of rare disease variants using whole-genome sequence has the possibility of finding all causative variants, but issues of data quality and an overwhelming level of background variants complicate the analysis. The CAGI4 SickKids clinical genome challenge provided an opportunity to assess the landscape of variants found in a difficult set of 25 unsolved rare disease cases. To address the challenge, we developed a three-stage pipeline, first carefully analyzing data quality, then classifying high-quality gene-specific variants into seven categories, and finally examining each candidate variant for compatibility with the often complex phenotypes of these patients for final prioritization. Variants consistent with the phenotypes were found in 24 out of the 25 cases, and in a number of these, there are prioritized variants in multiple genes. Data quality analysis suggests that some of the selected variants are likely incorrect calls, complicating interpretation. The data providers followed up on three suggested variants with Sanger sequencing, and in one case, a prioritized variant was confirmed as likely causative by the referring physician, providing a diagnosis in a previously intractable case.

Subject(s)

Genetic Variation , Genomics/methods , Rare Diseases/genetics , Child , Genetic Predisposition to Disease , Humans , Sequence Analysis, DNA , Software

13.

CAGI4 Crohn's exome challenge: Marker SNP versus exome variant models for assigning risk of Crohn disease.

Pal, Lipika R; Kundu, Kunal; Yin, Yizhou; Moult, John.

Hum Mutat ; 38(9): 1225-1234, 2017 09.

Article in English | MEDLINE | ID: mdl-28512778

ABSTRACT

Understanding the basis of complex trait disease is a fundamental problem in human genetics. The CAGI Crohn's Exome challenges are providing insight into the adequacy of current disease models by requiring participants to identify which of a set of individuals has been diagnosed with the disease, given exome data. For the CAGI4 round, we developed a method that used the genotypes from exome sequencing data only to impute the status of genome wide association studies marker SNPs. We then used the imputed genotypes as input to several machine learning methods that had been trained to predict disease status from marker SNP information. We achieved the best performance using Naïve Bayes and with a consensus machine learning method, obtaining an area under the curve of 0.72, larger than other methods used in CAGI4. We also developed a model that incorporated the contribution from rare missense variants in the exome data, but this performed less well. Future progress is expected to come from the use of whole genome data rather than exomes.

Subject(s)

Crohn Disease/genetics , Exome Sequencing/methods , Polymorphism, Single Nucleotide , Algorithms , Area Under Curve , Genetic Markers , Genetic Predisposition to Disease , Genome-Wide Association Study , Humans , Machine Learning , Phenotype

14.

Determination of disease phenotypes and pathogenic variants from exome sequence data in the CAGI 4 gene panel challenge.

Kundu, Kunal; Pal, Lipika R; Yin, Yizhou; Moult, John.

Hum Mutat ; 38(9): 1201-1216, 2017 09.

Article in English | MEDLINE | ID: mdl-28497567

ABSTRACT

The use of gene panel sequence for diagnostic and prognostic testing is now widespread, but there are so far few objective tests of methods to interpret these data. We describe the design and implementation of a gene panel sequencing data analysis pipeline (VarP) and its assessment in a CAGI4 community experiment. The method was applied to clinical gene panel sequencing data of 106 patients, with the goal of determining which of 14 disease classes each patient has and the corresponding causative variant(s). The disease class was correctly identified for 36 cases, including 10 where the original clinical pipeline did not find causative variants. For a further seven cases, we found strong evidence of an alternative disease to that tested. Many of the potentially causative variants are missense, with no previous association with disease, and these proved the hardest to correctly assign pathogenicity or otherwise. Post analysis showed that three-dimensional structure data could have helped for up to half of these cases. Over-reliance on HGMD annotation led to a number of incorrect disease assignments. We used a largely ad hoc method to assign probabilities of pathogenicity for each variant, and there is much work still to be done in this area.

Subject(s)

Disease/classification , Exome Sequencing/methods , Genetic Variation , High-Throughput Nucleotide Sequencing/methods , Computational Biology , Databases, Genetic , Disease/genetics , Genetic Predisposition to Disease , Humans , Models, Molecular , Mutation, Missense , Phenotype , Proteins/chemistry , Proteins/genetics

15.

Lessons from the CAGI-4 Hopkins clinical panel challenge.

Chandonia, John-Marc; Adhikari, Aashish; Carraro, Marco; Chhibber, Aparna; Cutting, Garry R; Fu, Yao; Gasparini, Alessandra; Jones, David T; Kramer, Andreas; Kundu, Kunal; Lam, Hugo Y K; Leonardi, Emanuela; Moult, John; Pal, Lipika R; Searls, David B; Shah, Sohela; Sunyaev, Shamil; Tosatto, Silvio C E; Yin, Yizhou; Buckley, Bethany A.

Hum Mutat ; 38(9): 1155-1168, 2017 09.

Article in English | MEDLINE | ID: mdl-28397312

ABSTRACT

The CAGI-4 Hopkins clinical panel challenge was an attempt to assess state-of-the-art methods for clinical phenotype prediction from DNA sequence. Participants were provided with exonic sequences of 83 genes for 106 patients from the Johns Hopkins DNA Diagnostic Laboratory. Five groups participated in the challenge, predicting both the probability that each patient had each of the 14 possible classes of disease, as well as one or more causal variants. In cases where the Hopkins laboratory reported a variant, at least one predictor correctly identified the disease class in 36 of the 43 patients (84%). Even in cases where the Hopkins laboratory did not find a variant, at least one predictor correctly identified the class in 39 of the 63 patients (62%). Each prediction group correctly diagnosed at least one patient that was not successfully diagnosed by any other group. We discuss the causal variant predictions by different groups and their implications for further development of methods to assess variants of unknown significance. Our results suggest that clinically relevant variants may be missed when physicians order small panels targeted on a specific phenotype. We also quantify the false-positive rate of DNA-guided analysis in the absence of prior phenotypic indication.

Subject(s)

Computational Biology/methods , Sequence Analysis, DNA/methods , Databases, Genetic , Genetic Predisposition to Disease , Genetic Testing , Humans , Phenotype

16.

Nijmegen breakage syndrome detected by newborn screening for T cell receptor excision circles (TRECs).

Patel, Jay P; Puck, Jennifer M; Srinivasan, Rajgopal; Brown, Christina; Sunderam, Uma; Kundu, Kunal; Brenner, Steven E; Gatti, Richard A; Church, Joseph A.

J Clin Immunol ; 35(2): 227-33, 2015 Feb.

Article in English | MEDLINE | ID: mdl-25677497

ABSTRACT

PURPOSE: Severe combined immunodeficiency (SCID) encompasses a group of disorders characterized by reduced or absent T-cell number and function and identified by newborn screening utilizing T-cell receptor excision circles (TRECs). This screening has also identified infants with T lymphopenia who lack mutations in typical SCID genes. We report an infant with low TRECs and non-SCID T lymphopenia, who proved upon whole exome sequencing to have Nijmegen breakage syndrome (NBS). METHODS: Exome sequencing of DNA from the infant and his parents was performed. Genomic analysis revealed deleterious variants in the NBN gene. Confirmatory testing included Sanger sequencing and immunoblotting and radiosensitivity testing of patient lymphocytes. RESULTS: Two novel nonsense mutations in NBN were identified in genomic DNA from the family. Immunoblotting showed absence of nibrin protein. A colony survival assay demonstrated radiosensitivity comparable to patients with ataxia telangiectasia. CONCLUSIONS: Although TREC screening was developed to identify newborns with SCID, it has also identified T lymphopenic disorders that may not otherwise be diagnosed until later in life. Timely identification of an infant with T lymphopenia allowed for prompt pursuit of underlying etiology, making possible a diagnosis of NBS, genetic counseling, and early intervention to minimize complications.

Subject(s)

Neonatal Screening , Nijmegen Breakage Syndrome/diagnosis , Nijmegen Breakage Syndrome/genetics , Receptors, Antigen, T-Cell/genetics , T-Lymphocytes/metabolism , Cell Cycle Proteins/genetics , Cell Cycle Proteins/metabolism , DNA, Circular , Exome , Gene Rearrangement, T-Lymphocyte , High-Throughput Nucleotide Sequencing , Humans , Infant , Infant, Newborn , Male , Nijmegen Breakage Syndrome/immunology , Nuclear Proteins/genetics , Nuclear Proteins/metabolism , T-Lymphocytes/immunology

ABSTRACT

Subject(s)

ABSTRACT

Subject(s)

ABSTRACT

Subject(s)

ABSTRACT

Subject(s)

ABSTRACT

Subject(s)

ABSTRACT

Subject(s)

ABSTRACT

Subject(s)

ABSTRACT

Subject(s)

ABSTRACT

Subject(s)

ABSTRACT

Subject(s)

ABSTRACT

Subject(s)

ABSTRACT

Subject(s)

ABSTRACT

Subject(s)

ABSTRACT

Subject(s)

ABSTRACT

Subject(s)

ABSTRACT

Subject(s)

SEND TO:

SELECTION OF CITATIONS

SEARCH DETAIL