Your browser doesn't support javascript.
loading
Mostrar: 20 | 50 | 100
Resultados 1 - 10 de 10
Filtrar
1.
medRxiv ; 2024 Apr 20.
Artículo en Inglés | MEDLINE | ID: mdl-38699371

RESUMEN

Rare and ultra-rare genetic conditions are estimated to impact nearly 1 in 17 people worldwide, yet accurately pinpointing the diagnostic variants underlying each of these conditions remains a formidable challenge. Because comprehensive, in vivo functional assessment of all possible genetic variants is infeasible, clinicians instead consider in silico variant pathogenicity predictions to distinguish plausibly disease-causing from benign variants across the genome. However, in the most difficult undiagnosed cases, such as those accepted to the Undiagnosed Diseases Network (UDN), existing pathogenicity predictions cannot reliably discern true etiological variant(s) from other deleterious candidate variants that were prioritized through N-of-1 efforts. Pinpointing the disease-causing variant from a pool of plausible candidates remains a largely manual effort requiring extensive clinical workups, functional and experimental assays, and eventual identification of genotype- and phenotype-matched individuals. Here, we introduce VarPPUD, a tool trained on prioritized variants from UDN cases, that leverages gene-, amino acid-, and nucleotide-level features to discern pathogenic variants from other deleterious variants that are unlikely to be confirmed as disease relevant. VarPPUD achieves a cross-validated accuracy of 79.3% and precision of 77.5% on a held-out subset of uniquely challenging UDN cases, respectively representing an average 18.6% and 23.4% improvement over nine traditional pathogenicity prediction approaches on this task. We validate VarPPUD's ability to discriminate likely from unlikely pathogenic variants on synthetic, GAN-generated candidate variants as well. Finally, we show how VarPPUD can be probed to evaluate each input feature's importance and contribution toward prediction-an essential step toward understanding the distinct characteristics of newly-uncovered disease-causing variants.

2.
medRxiv ; 2024 Apr 19.
Artículo en Inglés | MEDLINE | ID: mdl-38699301

RESUMEN

The biological mechanisms giving rise to the extreme symptoms exhibited by rare disease patients are complex, heterogenous, and difficult to discern. Understanding these mechanisms is critical for developing treatments that address the underlying causes of diseases rather than merely the presenting symptoms. Moreover, the same dysfunctional biological mechanisms implicated in rare recessive diseases may also lead to milder and potentially preventable symptoms in carriers in the general population. Seizures are a common, extreme phenotype that can result from diverse and often elusive biological pathways in patients with ultrarare or undiagnosed disorders. In this pilot study, we present an approach to understand the biological pathways leading to seizures in patients from the Undiagnosed Diseases Network (UDN) by analyzing aggregated genotype and phenotype data from the UK Biobank (UKB). Specifically, we look for enriched phenotypes across UKB participants who harbor rare variants in the same gene known or suspected to be causally implicated in a UDN patient's recessively manifesting disorder. Analyzing these milder but related associated phenotypes in UKB participants can provide insight into the disease-causing molecular mechanisms at play in the rare disease UDN patient. We present six vignettes of undiagnosed patients experiencing seizures as part of their recessive genetic condition, and we discuss the potential mechanisms underlying the spectrum of symptoms associated with UKB participants to the severe presentations exhibited by UDN patients. We find that in our set of rare disease patients, seizures may result from diverse, multi-step pathways that involve multiple body systems. Analyses of large-scale population cohorts such as the UKB can be a critical tool to further our understanding of rare diseases in general.

3.
NPJ Precis Oncol ; 8(1): 120, 2024 May 25.
Artículo en Inglés | MEDLINE | ID: mdl-38796637

RESUMEN

A small number of cancer patients respond exceptionally well to therapies and survive significantly longer than patients with similar diagnoses. Profiling the germline genetic backgrounds of exceptional responder (ER) patients, with extreme survival times, can yield insights into the germline polymorphisms that influence response to therapy. As ERs showed a high incidence in autoimmune diseases, we hypothesized the differences in autoimmune disease risk could reflect the immune background of ERs and contribute to better cancer treatment responses. We analyzed the germline variants of 51 ERs using polygenic risk score (PRS) analysis. Compared to typical cancer patients, the ERs had significantly elevated PRSs for several autoimmune-related diseases: type 1 diabetes, hypothyroidism, and psoriasis. This indicates that an increased genetic predisposition towards these autoimmune diseases is more prevalent among the ERs. In contrast, ERs had significantly lower PRSs for developing inflammatory bowel disease. The left-skew of type 1 diabetes score was significant for exceptional responders. Variants on genes involved in the T1D PRS model associated with cancer drug response are more likely to co-occur with other variants among ERs. In conclusion, ERs exhibited different risks for autoimmune diseases compared to typical cancer patients, which suggests that changes in a patient's immune set point or immune surveillance specificity could be a potential mechanistic link to their exceptional response. These findings expand upon previous research on immune checkpoint inhibitor-treated patients to include those who received chemotherapy or radiotherapy.

4.
bioRxiv ; 2024 Feb 16.
Artículo en Inglés | MEDLINE | ID: mdl-38405764

RESUMEN

Genomics for rare disease diagnosis has advanced at a rapid pace due to our ability to perform "N-of-1" analyses on individual patients. The increasing sizes of ultra-rare, "N-of-1" disease cohorts internationally newly enables cohort-wide analyses for new discoveries, but well-calibrated statistical genetics approaches for jointly analyzing these patients are still under development.1,2 The Undiagnosed Diseases Network (UDN) brings multiple clinical, research and experimental centers under the same umbrella across the United States to facilitate and scale N-of-1 analyses. Here, we present the first joint analysis of whole genome sequencing data of UDN patients across the network. We apply existing and introduce new, well-calibrated statistical methods for prioritizing disease genes with de novo recurrence and compound heterozygosity. We also detect pathways enriched with candidate and known diagnostic genes. Our computational analysis, coupled with a systematic clinical review, recapitulated known diagnoses and revealed new disease associations. We make our gene-level findings and variant-level information across the cohort available in a public-facing browser (https://dbmi-bgm.github.io/udn-browser/). These results show that N-of-1 efforts should be supplemented by a joint genomic analysis across cohorts.

5.
Genome Biol ; 25(1): 39, 2024 01 31.
Artículo en Inglés | MEDLINE | ID: mdl-38297326

RESUMEN

Expansions of tandem repeats (TRs) cause approximately 60 monogenic diseases. We expect that the discovery of additional pathogenic repeat expansions will narrow the diagnostic gap in many diseases. A growing number of TR expansions are being identified, and interpreting them is a challenge. We present RExPRT (Repeat EXpansion Pathogenicity pRediction Tool), a machine learning tool for distinguishing pathogenic from benign TR expansions. Our results demonstrate that an ensemble approach classifies TRs with an average precision of 93% and recall of 83%. RExPRT's high precision will be valuable in large-scale discovery studies, which require prioritization of candidate loci for follow-up studies.


Asunto(s)
Aprendizaje Automático , Secuencias Repetidas en Tándem , Virulencia
6.
Nat Commun ; 14(1): 6403, 2023 10 12.
Artículo en Inglés | MEDLINE | ID: mdl-37828001

RESUMEN

Rare Mendelian disorders pose a major diagnostic challenge and collectively affect 300-400 million patients worldwide. Many automated tools aim to uncover causal genes in patients with suspected genetic disorders, but evaluation of these tools is limited due to the lack of comprehensive benchmark datasets that include previously unpublished conditions. Here, we present a computational pipeline that simulates realistic clinical datasets to address this deficit. Our framework jointly simulates complex phenotypes and challenging candidate genes and produces patients with novel genetic conditions. We demonstrate the similarity of our simulated patients to real patients from the Undiagnosed Diseases Network and evaluate common gene prioritization methods on the simulated cohort. These prioritization methods recover known gene-disease associations but perform poorly on diagnosing patients with novel genetic disorders. Our publicly-available dataset and codebase can be utilized by medical genetics researchers to evaluate, compare, and improve tools that aid in the diagnostic process.


Asunto(s)
Pacientes , Enfermedades Raras , Humanos , Simulación por Computador , Fenotipo , Enfermedades Raras/diagnóstico , Enfermedades Raras/genética
7.
Am J Med Genet A ; 191(10): 2482-2492, 2023 10.
Artículo en Inglés | MEDLINE | ID: mdl-37246601

RESUMEN

The contribution of mosaicism to diagnosed genetic disease and presumed de novo variants (DNV) is under investigated. We determined the contribution of mosaic genetic disease (MGD) and diagnosed parental mosaicism (PM) in parents of offspring with reported DNV (in the same variant) in the (1) Undiagnosed Diseases Network (UDN) (N = 1946) and (2) in 12,472 individuals electronic health records (EHR) who underwent genetic testing at an academic medical center. In the UDN, we found 4.51% of diagnosed probands had MGD, and 2.86% of parents of those with DNV exhibited PM. In the EHR, we found 6.03% and 2.99% and (of diagnosed probands) had MGD detected on chromosomal microarray and exome/genome sequencing, respectively. We found 2.34% (of those with a presumed pathogenic DNV) had a parent with PM for the variant. We detected mosaicism (regardless of pathogenicity) in 4.49% of genetic tests performed. We found a broad phenotypic spectrum of MGD with previously unknown phenotypic phenomena. MGD is highly heterogeneous and provides a significant contribution to genetic diseases. Further work is required to improve the diagnosis of MGD and investigate how PM contributes to DNV risk.


Asunto(s)
Variación Genética , Mosaicismo , Humanos , Pruebas Genéticas , Exoma , Padres
8.
Genet Med ; 23(6): 1075-1085, 2021 06.
Artículo en Inglés | MEDLINE | ID: mdl-33580225

RESUMEN

PURPOSE: Genomic sequencing has become an increasingly powerful and relevant tool to be leveraged for the discovery of genetic aberrations underlying rare, Mendelian conditions. Although the computational tools incorporated into diagnostic workflows for this task are continually evolving and improving, we nevertheless sought to investigate commonalities across sequencing processing workflows to reveal consensus and standard practice tools and highlight exploratory analyses where technical and theoretical method improvements would be most impactful. METHODS: We collected details regarding the computational approaches used by a genetic testing laboratory and 11 clinical research sites in the United States participating in the Undiagnosed Diseases Network via meetings with bioinformaticians, online survey forms, and analyses of internal protocols. RESULTS: We found that tools for processing genomic sequencing data can be grouped into four distinct categories. Whereas well-established practices exist for initial variant calling and quality control steps, there is substantial divergence across sites in later stages for variant prioritization and multimodal data integration, demonstrating a diversity of approaches for solving the most mysterious undiagnosed cases. CONCLUSION: The largest differences across diagnostic workflows suggest that advances in structural variant detection, noncoding variant interpretation, and integration of additional biomedical data may be especially promising for solving chronically undiagnosed cases.


Asunto(s)
Genómica , Enfermedades no Diagnosticadas , Biología Computacional , Pruebas Genéticas , Genoma , Humanos , Programas Informáticos , Flujo de Trabajo
9.
Cell Syst ; 11(1): 63-74.e7, 2020 07 22.
Artículo en Inglés | MEDLINE | ID: mdl-32711844

RESUMEN

A major challenge in cancer genomics is to identify genes with functional roles in cancer and uncover their mechanisms of action. We introduce an integrative framework that identifies cancer-relevant genes by pinpointing those whose interaction or other functional sites are enriched in somatic mutations across tumors. We derive analytical calculations that enable us to avoid time-prohibitive permutation-based significance tests, making it computationally feasible to simultaneously consider multiple measures of protein site functionality. Our accompanying software, PertInInt, combines knowledge about sites participating in interactions with DNA, RNA, peptides, ions, or small molecules with domain, evolutionary conservation, and gene-level mutation data. When applied to 10,037 tumor samples, PertInInt uncovers both known and newly predicted cancer genes, while additionally revealing what types of interactions or other functionalities are disrupted. PertInInt's analysis demonstrates that somatic mutations are frequently enriched in interaction sites and domains and implicates interaction perturbation as a pervasive cancer-driving event.


Asunto(s)
Genómica/métodos , Neoplasias/genética , Oncogenes/genética , Humanos
10.
Nucleic Acids Res ; 47(2): 582-593, 2019 01 25.
Artículo en Inglés | MEDLINE | ID: mdl-30535108

RESUMEN

Domains are fundamental subunits of proteins, and while they play major roles in facilitating protein-DNA, protein-RNA and other protein-ligand interactions, a systematic assessment of their various interaction modes is still lacking. A comprehensive resource identifying positions within domains that tend to interact with nucleic acids, small molecules and other ligands would expand our knowledge of domain functionality as well as aid in detecting ligand-binding sites within structurally uncharacterized proteins. Here, we introduce an approach to identify per-domain-position interaction 'frequencies' by aggregating protein co-complex structures by domain and ascertaining how often residues mapping to each domain position interact with ligands. We perform this domain-based analysis on ∼91000 co-complex structures, and infer positions involved in binding DNA, RNA, peptides, ions or small molecules across 4128 domains, which we refer to collectively as the InteracDome. Cross-validation testing reveals that ligand-binding positions for 2152 domains are highly consistent and can be used to identify residues facilitating interactions in ∼63-69% of human genes. Our resource of domain-inferred ligand-binding sites should be a great aid in understanding disease etiology: whereas these sites are enriched in Mendelian-associated and cancer somatic mutations, they are depleted in polymorphisms observed across healthy populations. The InteracDome is available at http://interacdome.princeton.edu.


Asunto(s)
Proteínas de Unión al ADN/química , ADN/metabolismo , Dominios Proteicos , Proteínas de Unión al ARN/química , ARN/metabolismo , Sitios de Unión , ADN/química , Proteínas de Unión al ADN/metabolismo , Enfermedad/genética , Genes , Humanos , Ligandos , Modelos Moleculares , Mutación , Unión Proteica , ARN/química , Proteínas de Unión al ARN/metabolismo
SELECCIÓN DE REFERENCIAS
DETALLE DE LA BÚSQUEDA