Búsqueda | Portal Regional de la BVS

Phenotypic overlap between rare disease patients and variant carriers in a large population cohort informs biological mechanisms.

Fitzsimmons, Lane; Beaulieu-Jones, Brett; Kobren, Shilpa Nadimpalli.

medRxiv ; 2024 Apr 19.

Artículo en Inglés | MEDLINE | ID: mdl-38699301

RESUMEN

The biological mechanisms giving rise to the extreme symptoms exhibited by rare disease patients are complex, heterogenous, and difficult to discern. Understanding these mechanisms is critical for developing treatments that address the underlying causes of diseases rather than merely the presenting symptoms. Moreover, the same dysfunctional biological mechanisms implicated in rare recessive diseases may also lead to milder and potentially preventable symptoms in carriers in the general population. Seizures are a common, extreme phenotype that can result from diverse and often elusive biological pathways in patients with ultrarare or undiagnosed disorders. In this pilot study, we present an approach to understand the biological pathways leading to seizures in patients from the Undiagnosed Diseases Network (UDN) by analyzing aggregated genotype and phenotype data from the UK Biobank (UKB). Specifically, we look for enriched phenotypes across UKB participants who harbor rare variants in the same gene known or suspected to be causally implicated in a UDN patient's recessively manifesting disorder. Analyzing these milder but related associated phenotypes in UKB participants can provide insight into the disease-causing molecular mechanisms at play in the rare disease UDN patient. We present six vignettes of undiagnosed patients experiencing seizures as part of their recessive genetic condition, and we discuss the potential mechanisms underlying the spectrum of symptoms associated with UKB participants to the severe presentations exhibited by UDN patients. We find that in our set of rare disease patients, seizures may result from diverse, multi-step pathways that involve multiple body systems. Analyses of large-scale population cohorts such as the UKB can be a critical tool to further our understanding of rare diseases in general.

VarPPUD: Variant post prioritization developed for undiagnosed genetic disorders.

Yin, Rui; Gutierrez, Alba; Kobren, Shilpa Nadimpalli; Avillach, Paul.

medRxiv ; 2024 Apr 20.

Artículo en Inglés | MEDLINE | ID: mdl-38699371

RESUMEN

Rare and ultra-rare genetic conditions are estimated to impact nearly 1 in 17 people worldwide, yet accurately pinpointing the diagnostic variants underlying each of these conditions remains a formidable challenge. Because comprehensive, in vivo functional assessment of all possible genetic variants is infeasible, clinicians instead consider in silico variant pathogenicity predictions to distinguish plausibly disease-causing from benign variants across the genome. However, in the most difficult undiagnosed cases, such as those accepted to the Undiagnosed Diseases Network (UDN), existing pathogenicity predictions cannot reliably discern true etiological variant(s) from other deleterious candidate variants that were prioritized through N-of-1 efforts. Pinpointing the disease-causing variant from a pool of plausible candidates remains a largely manual effort requiring extensive clinical workups, functional and experimental assays, and eventual identification of genotype- and phenotype-matched individuals. Here, we introduce VarPPUD, a tool trained on prioritized variants from UDN cases, that leverages gene-, amino acid-, and nucleotide-level features to discern pathogenic variants from other deleterious variants that are unlikely to be confirmed as disease relevant. VarPPUD achieves a cross-validated accuracy of 79.3% and precision of 77.5% on a held-out subset of uniquely challenging UDN cases, respectively representing an average 18.6% and 23.4% improvement over nine traditional pathogenicity prediction approaches on this task. We validate VarPPUD's ability to discriminate likely from unlikely pathogenic variants on synthetic, GAN-generated candidate variants as well. Finally, we show how VarPPUD can be probed to evaluate each input feature's importance and contribution toward prediction-an essential step toward understanding the distinct characteristics of newly-uncovered disease-causing variants.

RExPRT: a machine learning tool to predict pathogenicity of tandem repeat loci.

Fazal, Sarah; Danzi, Matt C; Xu, Isaac; Kobren, Shilpa Nadimpalli; Sunyaev, Shamil; Reuter, Chloe; Marwaha, Shruti; Wheeler, Matthew; Dolzhenko, Egor; Lucas, Francesca; Wuchty, Stefan; Tekin, Mustafa; Züchner, Stephan; Aguiar-Pulido, Vanessa.

Genome Biol ; 25(1): 39, 2024 Jan 31.

Artículo en Inglés | MEDLINE | ID: mdl-38297326

RESUMEN

Expansions of tandem repeats (TRs) cause approximately 60 monogenic diseases. We expect that the discovery of additional pathogenic repeat expansions will narrow the diagnostic gap in many diseases. A growing number of TR expansions are being identified, and interpreting them is a challenge. We present RExPRT (Repeat EXpansion Pathogenicity pRediction Tool), a machine learning tool for distinguishing pathogenic from benign TR expansions. Our results demonstrate that an ensemble approach classifies TRs with an average precision of 93% and recall of 83%. RExPRT's high precision will be valuable in large-scale discovery studies, which require prioritization of candidate loci for follow-up studies.

Asunto(s)

Aprendizaje Automático , Secuencias Repetidas en Tándem , Virulencia

The contribution of mosaicism to genetic diseases and de novo pathogenic variants.

Tinker, Rory J; Bastarache, Lisa; Ezell, Kimberly; Kobren, Shilpa Nadimpalli; Esteves, Cecilia; Rosenfeld, Jill A; Macnamara, Ellen F; Hamid, Rizwan; Cogan, Joy D; Rinker, David; Mukharjee, Souhrid; Glass, Ian; Dipple, Katrina; Phillips, John A.

Am J Med Genet A ; 191(10): 2482-2492, 2023 10.

Artículo en Inglés | MEDLINE | ID: mdl-37246601

RESUMEN

The contribution of mosaicism to diagnosed genetic disease and presumed de novo variants (DNV) is under investigated. We determined the contribution of mosaic genetic disease (MGD) and diagnosed parental mosaicism (PM) in parents of offspring with reported DNV (in the same variant) in the (1) Undiagnosed Diseases Network (UDN) (N = 1946) and (2) in 12,472 individuals electronic health records (EHR) who underwent genetic testing at an academic medical center. In the UDN, we found 4.51% of diagnosed probands had MGD, and 2.86% of parents of those with DNV exhibited PM. In the EHR, we found 6.03% and 2.99% and (of diagnosed probands) had MGD detected on chromosomal microarray and exome/genome sequencing, respectively. We found 2.34% (of those with a presumed pathogenic DNV) had a parent with PM for the variant. We detected mosaicism (regardless of pathogenicity) in 4.49% of genetic tests performed. We found a broad phenotypic spectrum of MGD with previously unknown phenotypic phenomena. MGD is highly heterogeneous and provides a significant contribution to genetic diseases. Further work is required to improve the diagnosis of MGD and investigate how PM contributes to DNV risk.

Asunto(s)

Variación Genética , Mosaicismo , Humanos , Pruebas Genéticas , Exoma , Padres

Commonalities across computational workflows for uncovering explanatory variants in undiagnosed cases.

Kobren, Shilpa Nadimpalli; Baldridge, Dustin; Velinder, Matt; Krier, Joel B; LeBlanc, Kimberly; Esteves, Cecilia; Pusey, Barbara N; Züchner, Stephan; Blue, Elizabeth; Lee, Hane; Huang, Alden; Bastarache, Lisa; Bican, Anna; Cogan, Joy; Marwaha, Shruti; Alkelai, Anna; Murdock, David R; Liu, Pengfei; Wegner, Daniel J; Paul, Alexander J; Sunyaev, Shamil R; Kohane, Isaac S.

Genet Med ; 23(6): 1075-1085, 2021 06.

Artículo en Inglés | MEDLINE | ID: mdl-33580225

RESUMEN

PURPOSE: Genomic sequencing has become an increasingly powerful and relevant tool to be leveraged for the discovery of genetic aberrations underlying rare, Mendelian conditions. Although the computational tools incorporated into diagnostic workflows for this task are continually evolving and improving, we nevertheless sought to investigate commonalities across sequencing processing workflows to reveal consensus and standard practice tools and highlight exploratory analyses where technical and theoretical method improvements would be most impactful. METHODS: We collected details regarding the computational approaches used by a genetic testing laboratory and 11 clinical research sites in the United States participating in the Undiagnosed Diseases Network via meetings with bioinformaticians, online survey forms, and analyses of internal protocols. RESULTS: We found that tools for processing genomic sequencing data can be grouped into four distinct categories. Whereas well-established practices exist for initial variant calling and quality control steps, there is substantial divergence across sites in later stages for variant prioritization and multimodal data integration, demonstrating a diversity of approaches for solving the most mysterious undiagnosed cases. CONCLUSION: The largest differences across diagnostic workflows suggest that advances in structural variant detection, noncoding variant interpretation, and integration of additional biomedical data may be especially promising for solving chronically undiagnosed cases.

Asunto(s)

Genómica , Enfermedades no Diagnosticadas , Biología Computacional , Pruebas Genéticas , Genoma , Humanos , Programas Informáticos , Flujo de Trabajo

PertInInt: An Integrative, Analytical Approach to Rapidly Uncover Cancer Driver Genes with Perturbed Interactions and Functionalities.

Kobren, Shilpa Nadimpalli; Chazelle, Bernard; Singh, Mona.

Cell Syst ; 11(1): 63-74.e7, 2020 07 22.

Artículo en Inglés | MEDLINE | ID: mdl-32711844

RESUMEN

A major challenge in cancer genomics is to identify genes with functional roles in cancer and uncover their mechanisms of action. We introduce an integrative framework that identifies cancer-relevant genes by pinpointing those whose interaction or other functional sites are enriched in somatic mutations across tumors. We derive analytical calculations that enable us to avoid time-prohibitive permutation-based significance tests, making it computationally feasible to simultaneously consider multiple measures of protein site functionality. Our accompanying software, PertInInt, combines knowledge about sites participating in interactions with DNA, RNA, peptides, ions, or small molecules with domain, evolutionary conservation, and gene-level mutation data. When applied to 10,037 tumor samples, PertInInt uncovers both known and newly predicted cancer genes, while additionally revealing what types of interactions or other functionalities are disrupted. PertInInt's analysis demonstrates that somatic mutations are frequently enriched in interaction sites and domains and implicates interaction perturbation as a pervasive cancer-driving event.

Asunto(s)

Genómica/métodos , Neoplasias/genética , Oncogenes/genética , Humanos

Systematic domain-based aggregation of protein structures highlights DNA-, RNA- and other ligand-binding positions.

Kobren, Shilpa Nadimpalli; Singh, Mona.

Nucleic Acids Res ; 47(2): 582-593, 2019 01 25.

Artículo en Inglés | MEDLINE | ID: mdl-30535108

RESUMEN

Domains are fundamental subunits of proteins, and while they play major roles in facilitating protein-DNA, protein-RNA and other protein-ligand interactions, a systematic assessment of their various interaction modes is still lacking. A comprehensive resource identifying positions within domains that tend to interact with nucleic acids, small molecules and other ligands would expand our knowledge of domain functionality as well as aid in detecting ligand-binding sites within structurally uncharacterized proteins. Here, we introduce an approach to identify per-domain-position interaction 'frequencies' by aggregating protein co-complex structures by domain and ascertaining how often residues mapping to each domain position interact with ligands. We perform this domain-based analysis on â¼91000 co-complex structures, and infer positions involved in binding DNA, RNA, peptides, ions or small molecules across 4128 domains, which we refer to collectively as the InteracDome. Cross-validation testing reveals that ligand-binding positions for 2152 domains are highly consistent and can be used to identify residues facilitating interactions in â¼63-69% of human genes. Our resource of domain-inferred ligand-binding sites should be a great aid in understanding disease etiology: whereas these sites are enriched in Mendelian-associated and cancer somatic mutations, they are depleted in polymorphisms observed across healthy populations. The InteracDome is available at http://interacdome.princeton.edu.

Asunto(s)

Proteínas de Unión al ADN/química , ADN/metabolismo , Dominios Proteicos , Proteínas de Unión al ARN/química , ARN/metabolismo , Sitios de Unión , ADN/química , Proteínas de Unión al ADN/metabolismo , Enfermedad/genética , Genes , Humanos , Ligandos , Modelos Moleculares , Mutación , Unión Proteica , ARN/química , Proteínas de Unión al ARN/metabolismo

RESUMEN

RESUMEN

RESUMEN

Asunto(s)

RESUMEN

Asunto(s)

RESUMEN

Asunto(s)

RESUMEN

Asunto(s)

RESUMEN

Asunto(s)

ENVIAR RESULTADO:

SELECCIÓN DE REFERENCIAS

DETALLE DE LA BÚSQUEDA