Your browser doesn't support javascript.
loading
Mostrar: 20 | 50 | 100
Resultados 1 - 9 de 9
Filtrar
Más filtros

Bases de datos
Tipo del documento
Asunto de la revista
Intervalo de año de publicación
1.
HGG Adv ; 5(3): 100310, 2024 Jul 18.
Artículo en Inglés | MEDLINE | ID: mdl-38773771

RESUMEN

Non-protein-coding genetic variants are a major driver of the genetic risk for human disease; however, identifying which non-coding variants contribute to diseases and their mechanisms remains challenging. In silico variant prioritization methods quantify a variant's severity, but for most methods, the specific phenotype and disease context of the prediction remain poorly defined. For example, many commonly used methods provide a single, organism-wide score for each variant, while other methods summarize a variant's impact in certain tissues and/or cell types. Here, we propose a complementary disease-specific variant prioritization scheme, which is motivated by the observation that variants contributing to disease often operate through specific biological mechanisms. We combine tissue/cell-type-specific variant scores (e.g., GenoSkyline, FitCons2, DNA accessibility) into disease-specific scores with a logistic regression approach and apply it to ∼25,000 non-coding variants spanning 111 diseases. We show that this disease-specific aggregation significantly improves the association of common non-coding genetic variants with disease (average precision: 0.151, baseline = 0.09), compared with organism-wide scores (GenoCanyon, LINSIGHT, GWAVA, Eigen, CADD; average precision: 0.129, baseline = 0.09). Further on, disease similarities based on data-driven aggregation weights highlight meaningful disease groups, and it provides information about tissues and cell types that drive these similarities. We also show that so-learned similarities are complementary to genetic similarities as quantified by genetic correlation. Overall, our approach demonstrates the strengths of disease-specific variant prioritization, leads to improvement in non-coding variant prioritization, and enables interpretable models that link variants to disease via specific tissues and/or cell types.


Asunto(s)
Cromatina , Predisposición Genética a la Enfermedad , Estudio de Asociación del Genoma Completo , Humanos , Cromatina/genética , Cromatina/metabolismo , Variación Genética/genética , Polimorfismo de Nucleótido Simple , Biología Computacional/métodos , Algoritmos
2.
Cell Genom ; 4(4): 100536, 2024 Apr 10.
Artículo en Inglés | MEDLINE | ID: mdl-38604126

RESUMEN

Gene regulatory divergence between species can result from cis-acting local changes to regulatory element DNA sequences or global trans-acting changes to the regulatory environment. Understanding how these mechanisms drive regulatory evolution has been limited by challenges in identifying trans-acting changes. We present a comprehensive approach to directly identify cis- and trans-divergent regulatory elements between human and rhesus macaque lymphoblastoid cells using assay for transposase-accessible chromatin coupled to self-transcribing active regulatory region (ATAC-STARR) sequencing. In addition to thousands of cis changes, we discover an unexpected number (∼10,000) of trans changes and show that cis and trans elements exhibit distinct patterns of sequence divergence and function. We further identify differentially expressed transcription factors that underlie ∼37% of trans differences and trace how cis changes can produce cascades of trans changes. Overall, we find that most divergent elements (67%) experienced changes in both cis and trans, revealing a substantial role for trans divergence-alone and together with cis changes-in regulatory differences between species.


Asunto(s)
Regulación de la Expresión Génica , Secuencias Reguladoras de Ácidos Nucleicos , Animales , Humanos , Macaca mulatta/genética , Secuencias Reguladoras de Ácidos Nucleicos/genética , Regulación de la Expresión Génica/genética , Factores de Transcripción/genética , Cromatina/genética
3.
bioRxiv ; 2024 Aug 07.
Artículo en Inglés | MEDLINE | ID: mdl-39149406

RESUMEN

Effective diagnosis and treatment of rare genetic disorders requires the interpretation of a patient's genetic variants of unknown significance (VUSs). Today, clinical decision-making is primarily guided by gene-phenotype association databases and DNA-based scoring methods. Our web-accessible variant analysis pipeline, VUStruct, supplements these established approaches by deeply analyzing the downstream molecular impact of variation in context of 3D protein structure. VUStruct's growing impact is fueled by the co-proliferation of protein 3D structural models, gene sequencing, compute power, and artificial intelligence. Contextualizing VUSs in protein 3D structural models also illuminates longitudinal genomics studies and biochemical bench research focused on VUS, and we created VUStruct for clinicians and researchers alike. We now introduce VUStruct to the broad scientific community as a mature, web-facing, extensible, High Performance Computing (HPC) software pipeline. VUStruct maps missense variants onto automatically selected protein structures and launches a broad range of analyses. These include energy-based assessments of protein folding and stability, pathogenicity prediction through spatial clustering analysis, and machine learning (ML) predictors of binding surface disruptions and nearby post-translational modification sites. The pipeline also considers the entire input set of VUS and identifies genes potentially involved in digenic disease. VUStruct's utility in clinical rare disease genome interpretation has been demonstrated through its analysis of over 175 Undiagnosed Disease Network (UDN) Patient cases. VUStruct-leveraged hypotheses have often informed clinicians in their consideration of additional patient testing, and we report here details from two cases where VUStruct was key to their solution. We also note successes with academic research collaborators, for whom VUStruct has informed research directions in both computational genomics and wet lab studies.

4.
Genome Med ; 16(1): 73, 2024 05 30.
Artículo en Inglés | MEDLINE | ID: mdl-38816749

RESUMEN

BACKGROUND: KCNE1 encodes a 129-residue cardiac potassium channel (IKs) subunit. KCNE1 variants are associated with long QT syndrome and atrial fibrillation. However, most variants have insufficient evidence of clinical consequences and thus limited clinical utility. METHODS: In this study, we leveraged the power of variant effect mapping, which couples saturation mutagenesis with high-throughput sequencing, to ascertain the function of thousands of protein-coding KCNE1 variants. RESULTS: We comprehensively assayed KCNE1 variant cell surface expression (2554/2709 possible single-amino-acid variants) and function (2534 variants). Our study identified 470 loss- or partial loss-of-surface expression and 574 loss- or partial loss-of-function variants. Of the 574 loss- or partial loss-of-function variants, 152 (26.5%) had reduced cell surface expression, indicating that most functionally deleterious variants affect channel gating. Nonsense variants at residues 56-104 generally had WT-like trafficking scores but decreased functional scores, indicating that the latter half of the protein is dispensable for protein trafficking but essential for channel function. 22 of the 30 KCNE1 residues (73%) highly intolerant of variation (with > 70% loss-of-function variants) were in predicted close contact with binding partners KCNQ1 or calmodulin. Our functional assay data were consistent with gold standard electrophysiological data (ρ = - 0.64), population and patient cohorts (32/38 presumed benign or pathogenic variants with consistent scores), and computational predictors (ρ = - 0.62). Our data provide moderate-strength evidence for the American College of Medical Genetics/Association of Molecular Pathology functional criteria for benign and pathogenic variants. CONCLUSIONS: Comprehensive variant effect maps of KCNE1 can both provide insight into I Ks channel biology and help reclassify variants of uncertain significance.


Asunto(s)
Calmodulina , Canales de Potasio con Entrada de Voltaje , Canales de Potasio con Entrada de Voltaje/genética , Canales de Potasio con Entrada de Voltaje/metabolismo , Humanos , Calmodulina/genética , Calmodulina/metabolismo , Arritmias Cardíacas/genética , Secuenciación de Nucleótidos de Alto Rendimiento , Variación Genética , Transporte de Proteínas , Células HEK293
5.
medRxiv ; 2024 May 04.
Artículo en Inglés | MEDLINE | ID: mdl-38746318

RESUMEN

Molecular studies of Alzheimer's disease (AD) implicate potential links between autoimmunity and AD, but the underlying clinical relationships between these conditions remain poorly understood. Electronic health records (EHRs) provide an opportunity to determine the clinical risk relationship between autoimmune disorders and AD and understand whether specific disorders and disorder subtypes affect AD risk at the phenotypic level in human populations. We evaluated relationships between 26 autoimmune disorders and AD across retrospective observational case-control and cohort study designs in the EHR systems at UCSF and Stanford. We quantified overall and sex-specific AD risk effects that these autoimmune disorders confer. We identified significantly increased AD risk in autoimmune disorder patients in both study designs at UCSF and at Stanford. This pattern was driven by specific autoimmunity subtypes including endocrine, gastrointestinal, dermatologic, and musculoskeletal disorders. We also observed increased AD risk from autoimmunity in both women and men, but women with autoimmune disorders continued to have a higher AD prevalence than men, indicating persistent sex-specificity. This study identifies autoimmune disorders as strong risk factors for AD that validate across several study designs and EHR databases. It sets the foundation for exploring how underlying autoimmune mechanisms increase AD risk and contribute to AD pathogenesis.

6.
ACR Open Rheumatol ; 2024 Aug 14.
Artículo en Inglés | MEDLINE | ID: mdl-39143499

RESUMEN

OBJECTIVE: There is an established yet unexplained link between interferon (IFN) and systemic lupus erythematosus (SLE). The expression of sequences derived from transposable elements (TEs) may contribute to SLE phenotypes, specifically production of type I IFNs and generation of autoantibodies. METHODS: We profiled cell-sorted RNA-sequencing data (CD4+ T cells, CD14+ monocytes, CD19+ B cells, and natural killer cells) from peripheral blood mononuclear cells of 120 patients with SLE and quantified TE expression identifying 27,135 TEs. We tested for differential TE expression across 10 SLE phenotypes, including autoantibody production and disease activity. RESULTS: We found 731 differentially expressed (DE) TEs across all SLE phenotypes that were mostly cell specific and phenotype specific. DE TEs were enriched for specific families and open reading frames of viral genes encoded in TE sequences. Increased expression of DE TEs was associated with genes involved in antiviral activity, such as LY6E, ISG15, and TRIM22, and pathways such as IFN signaling. CONCLUSION: These findings suggest that expression of TEs contributes to activation of SLE-related mechanisms in a cell-specific manner, which can impact disease diagnostics and therapeutics.

7.
bioRxiv ; 2024 Jul 18.
Artículo en Inglés | MEDLINE | ID: mdl-39071407

RESUMEN

Mutations in the kinase and juxtamembrane domains of the MET Receptor Tyrosine Kinase are responsible for oncogenesis in various cancers and can drive resistance to MET-directed treatments. Determining the most effective inhibitor for each mutational profile is a major challenge for MET-driven cancer treatment in precision medicine. Here, we used a deep mutational scan (DMS) of ~5,764 MET kinase domain variants to profile the growth of each mutation against a panel of 11 inhibitors that are reported to target the MET kinase domain. We identified common resistance sites across type I, type II, and type I ½ inhibitors, unveiled unique resistance and sensitizing mutations for each inhibitor, and validated non-cross-resistant sensitivities for type I and type II inhibitor pairs. We augment a protein language model with biophysical and chemical features to improve the predictive performance for inhibitor-treated datasets. Together, our study demonstrates a pooled experimental pipeline for identifying resistance mutations, provides a reference dictionary for mutations that are sensitized to specific therapies, and offers insights for future drug development.

8.
Nat Commun ; 15(1): 4380, 2024 May 23.
Artículo en Inglés | MEDLINE | ID: mdl-38782905

RESUMEN

SLC22A10 is an orphan transporter with unknown substrates and function. The goal of this study is to elucidate its substrate specificity and functional characteristics. In contrast to orthologs from great apes, human SLC22A10, tagged with green fluorescent protein, is not expressed on the plasma membrane. Cells expressing great ape SLC22A10 orthologs exhibit significant accumulation of estradiol-17ß-glucuronide, unlike those expressing human SLC22A10. Sequence alignments reveal a proline at position 220 in humans, which is a leucine in great apes. Replacing proline with leucine in SLC22A10-P220L restores plasma membrane localization and uptake function. Neanderthal and Denisovan genomes show proline at position 220, akin to modern humans, indicating functional loss during hominin evolution. Human SLC22A10 is a unitary pseudogene due to a fixed missense mutation, P220, while in great apes, its orthologs transport sex steroid conjugates. Characterizing SLC22A10 across species sheds light on its biological role, influencing organism development and steroid homeostasis.


Asunto(s)
Primates , Animales , Humanos , Secuencia de Aminoácidos , Estradiol/metabolismo , Células HEK293 , Hominidae/genética , Hominidae/metabolismo , Mutación Missense , Proteínas de Transporte de Catión Orgánico/metabolismo , Proteínas de Transporte de Catión Orgánico/genética , Primates/genética , Seudogenes , Especificidad por Sustrato
9.
bioRxiv ; 2023 Dec 23.
Artículo en Inglés | MEDLINE | ID: mdl-38187606

RESUMEN

Understanding variation in chromatin contact patterns across human populations is critical for interpreting non-coding variants and their ultimate effects on gene expression and phenotypes. However, experimental determination of chromatin contacts at a population-scale is prohibitively expensive. To overcome this challenge, we develop and validate a machine learning method to quantify the diversity 3D chromatin contacts at 2 kilobase resolution from genome sequence alone. We then apply this approach to thousands of diverse modern humans and the inferred human-archaic hominin ancestral genome. While patterns of 3D contact divergence genome-wide are qualitatively similar to patterns of sequence divergence, we find that 3D divergence in local 1-megabase genomic windows does not follow sequence divergence. In particular, we identify 392 windows with significantly greater 3D divergence than expected from sequence. Moreover, 26% of genomic windows have rare 3D contact variation observed in a small number of individuals. Using in silico mutagenesis we find that most sequence changes to do not result in changes to 3D chromatin contacts. However in windows with substantial 3D divergence, just one or a few variants can lead to divergent 3D chromatin contacts without the individuals carrying those variants having high sequence divergence. In summary, inferring 3D chromatin contact maps across human populations reveals diverse contact patterns. We anticipate that these genetically diverse maps of 3D chromatin contact will provide a reference for future work on the function and evolution of 3D chromatin contact variation across human populations.

SELECCIÓN DE REFERENCIAS
DETALLE DE LA BÚSQUEDA