Your browser doesn't support javascript.
loading
Mostrar: 20 | 50 | 100
Resultados 1 - 20 de 116
Filtrar
Mais filtros












Base de dados
Intervalo de ano de publicação
1.
HGG Adv ; 5(3): 100310, 2024 May 21.
Artigo em Inglês | MEDLINE | ID: mdl-38773771

RESUMO

Non-protein-coding genetic variants are a major driver of the genetic risk for human disease; however, identifying which non-coding variants contribute to diseases and their mechanisms remains challenging. In silico variant prioritization methods quantify a variant's severity, but for most methods, the specific phenotype and disease context of the prediction remain poorly defined. For example, many commonly used methods provide a single, organism-wide score for each variant, while other methods summarize a variant's impact in certain tissues and/or cell types. Here, we propose a complementary disease-specific variant prioritization scheme, which is motivated by the observation that variants contributing to disease often operate through specific biological mechanisms. We combine tissue/cell-type-specific variant scores (e.g., GenoSkyline, FitCons2, DNA accessibility) into disease-specific scores with a logistic regression approach and apply it to ∼25,000 non-coding variants spanning 111 diseases. We show that this disease-specific aggregation significantly improves the association of common non-coding genetic variants with disease (average precision: 0.151, baseline = 0.09), compared with organism-wide scores (GenoCanyon, LINSIGHT, GWAVA, Eigen, CADD; average precision: 0.129, baseline = 0.09). Further on, disease similarities based on data-driven aggregation weights highlight meaningful disease groups, and it provides information about tissues and cell types that drive these similarities. We also show that so-learned similarities are complementary to genetic similarities as quantified by genetic correlation. Overall, our approach demonstrates the strengths of disease-specific variant prioritization, leads to improvement in non-coding variant prioritization, and enables interpretable models that link variants to disease via specific tissues and/or cell types.

2.
Genome Med ; 16(1): 73, 2024 05 30.
Artigo em Inglês | MEDLINE | ID: mdl-38816749

RESUMO

BACKGROUND: KCNE1 encodes a 129-residue cardiac potassium channel (IKs) subunit. KCNE1 variants are associated with long QT syndrome and atrial fibrillation. However, most variants have insufficient evidence of clinical consequences and thus limited clinical utility. METHODS: In this study, we leveraged the power of variant effect mapping, which couples saturation mutagenesis with high-throughput sequencing, to ascertain the function of thousands of protein-coding KCNE1 variants. RESULTS: We comprehensively assayed KCNE1 variant cell surface expression (2554/2709 possible single-amino-acid variants) and function (2534 variants). Our study identified 470 loss- or partial loss-of-surface expression and 574 loss- or partial loss-of-function variants. Of the 574 loss- or partial loss-of-function variants, 152 (26.5%) had reduced cell surface expression, indicating that most functionally deleterious variants affect channel gating. Nonsense variants at residues 56-104 generally had WT-like trafficking scores but decreased functional scores, indicating that the latter half of the protein is dispensable for protein trafficking but essential for channel function. 22 of the 30 KCNE1 residues (73%) highly intolerant of variation (with > 70% loss-of-function variants) were in predicted close contact with binding partners KCNQ1 or calmodulin. Our functional assay data were consistent with gold standard electrophysiological data (ρ = - 0.64), population and patient cohorts (32/38 presumed benign or pathogenic variants with consistent scores), and computational predictors (ρ = - 0.62). Our data provide moderate-strength evidence for the American College of Medical Genetics/Association of Molecular Pathology functional criteria for benign and pathogenic variants. CONCLUSIONS: Comprehensive variant effect maps of KCNE1 can both provide insight into I Ks channel biology and help reclassify variants of uncertain significance.


Assuntos
Calmodulina , Canais de Potássio de Abertura Dependente da Tensão da Membrana , Canais de Potássio de Abertura Dependente da Tensão da Membrana/genética , Canais de Potássio de Abertura Dependente da Tensão da Membrana/metabolismo , Humanos , Calmodulina/genética , Calmodulina/metabolismo , Arritmias Cardíacas/genética , Sequenciamento de Nucleotídeos em Larga Escala , Variação Genética , Transporte Proteico , Células HEK293
3.
medRxiv ; 2024 May 04.
Artigo em Inglês | MEDLINE | ID: mdl-38746318

RESUMO

Molecular studies of Alzheimer's disease (AD) implicate potential links between autoimmunity and AD, but the underlying clinical relationships between these conditions remain poorly understood. Electronic health records (EHRs) provide an opportunity to determine the clinical risk relationship between autoimmune disorders and AD and understand whether specific disorders and disorder subtypes affect AD risk at the phenotypic level in human populations. We evaluated relationships between 26 autoimmune disorders and AD across retrospective observational case-control and cohort study designs in the EHR systems at UCSF and Stanford. We quantified overall and sex-specific AD risk effects that these autoimmune disorders confer. We identified significantly increased AD risk in autoimmune disorder patients in both study designs at UCSF and at Stanford. This pattern was driven by specific autoimmunity subtypes including endocrine, gastrointestinal, dermatologic, and musculoskeletal disorders. We also observed increased AD risk from autoimmunity in both women and men, but women with autoimmune disorders continued to have a higher AD prevalence than men, indicating persistent sex-specificity. This study identifies autoimmune disorders as strong risk factors for AD that validate across several study designs and EHR databases. It sets the foundation for exploring how underlying autoimmune mechanisms increase AD risk and contribute to AD pathogenesis.

4.
Nat Commun ; 15(1): 4380, 2024 May 23.
Artigo em Inglês | MEDLINE | ID: mdl-38782905

RESUMO

SLC22A10 is an orphan transporter with unknown substrates and function. The goal of this study is to elucidate its substrate specificity and functional characteristics. In contrast to orthologs from great apes, human SLC22A10, tagged with green fluorescent protein, is not expressed on the plasma membrane. Cells expressing great ape SLC22A10 orthologs exhibit significant accumulation of estradiol-17ß-glucuronide, unlike those expressing human SLC22A10. Sequence alignments reveal a proline at position 220 in humans, which is a leucine in great apes. Replacing proline with leucine in SLC22A10-P220L restores plasma membrane localization and uptake function. Neanderthal and Denisovan genomes show proline at position 220, akin to modern humans, indicating functional loss during hominin evolution. Human SLC22A10 is a unitary pseudogene due to a fixed missense mutation, P220, while in great apes, its orthologs transport sex steroid conjugates. Characterizing SLC22A10 across species sheds light on its biological role, influencing organism development and steroid homeostasis.


Assuntos
Primatas , Animais , Humanos , Sequência de Aminoácidos , Estradiol/metabolismo , Células HEK293 , Hominidae/genética , Hominidae/metabolismo , Mutação de Sentido Incorreto , Proteínas de Transporte de Cátions Orgânicos/metabolismo , Proteínas de Transporte de Cátions Orgânicos/genética , Primatas/genética , Pseudogenes , Especificidade por Substrato
5.
Cell Genom ; 4(4): 100536, 2024 Apr 10.
Artigo em Inglês | MEDLINE | ID: mdl-38604126

RESUMO

Gene regulatory divergence between species can result from cis-acting local changes to regulatory element DNA sequences or global trans-acting changes to the regulatory environment. Understanding how these mechanisms drive regulatory evolution has been limited by challenges in identifying trans-acting changes. We present a comprehensive approach to directly identify cis- and trans-divergent regulatory elements between human and rhesus macaque lymphoblastoid cells using assay for transposase-accessible chromatin coupled to self-transcribing active regulatory region (ATAC-STARR) sequencing. In addition to thousands of cis changes, we discover an unexpected number (∼10,000) of trans changes and show that cis and trans elements exhibit distinct patterns of sequence divergence and function. We further identify differentially expressed transcription factors that underlie ∼37% of trans differences and trace how cis changes can produce cascades of trans changes. Overall, we find that most divergent elements (67%) experienced changes in both cis and trans, revealing a substantial role for trans divergence-alone and together with cis changes-in regulatory differences between species.


Assuntos
Regulação da Expressão Gênica , Sequências Reguladoras de Ácido Nucleico , Animais , Humanos , Macaca mulatta/genética , Sequências Reguladoras de Ácido Nucleico/genética , Regulação da Expressão Gênica/genética , Fatores de Transcrição/genética , Cromatina/genética
6.
bioRxiv ; 2023 Nov 28.
Artigo em Inglês | MEDLINE | ID: mdl-38076936

RESUMO

There is an established yet unexplained link between interferon (IFN) and systemic lupus erythematosus (SLE). The expression of sequences derived from transposable elements (TEs) may contribute to production of type I IFNs and generation of autoantibodies. We profiled cell-sorted RNA-seq data (CD4+ T cells, CD14+ monocytes, CD19+ B cells, and NK cells) from PBMCs of 120 SLE patients and quantified TE expression identifying 27,135 TEs. We tested for differential TE expression across 10 SLE phenotypes including autoantibody production and disease activity and discovered 731 differentially expressed (DE) TEs whose effects were mostly cell-specific and phenotype-specific. DE TEs were enriched for specific families and viral genes encoded in TE sequences. Increased expression of DE TEs was associated with genes involved in antiviral activity such as LY6E, ISG15, TRIM22 and pathways such as interferon signaling. These findings suggest that expression of TEs contributes to activation of SLE-related mechanisms in a cell-specific manner, which can impact disease diagnostics and therapeutics.

7.
medRxiv ; 2023 Nov 30.
Artigo em Inglês | MEDLINE | ID: mdl-38077057

RESUMO

Background: Preterm birth (PTB) is the leading cause of infant mortality and follows multiple biological pathways, many of which are poorly understood. Some PTBs result from medically indicated labor following complications from hypertension and/or diabetes, while many others are spontaneous with unknown causes. Previously, investigation of potential risk factors has been limited by lack of data on maternal medical history and the difficulty of classifying PTBs as indicated or spontaneous. Here, we leverage electronic health record (EHR) data (patient health information including demographics, diagnoses, and medications) and a supplemental curated pregnancy database to overcome these limitations. Novel associations may provide new insight into the pathophysiology of PTB as well as help identify individuals who would be at risk of PTB. Methods: We quantified associations between maternal diagnoses and preterm birth using logistic regression controlling for maternal age and socioeconomic factors within a University of California, San Francisco (UCSF), EHR cohort with 10,643 births ( nterm = 9692, nspontaneous_preterm = 449, nindicated_preterm = 418) and maternal pre-conception diagnosis phenotypes derived from International Classification of Diseases (ICD) 9 and 10 codes. Results: Eighteen conditions significantly and robustly (False Discovery Rate (FDR)<0.05) associated with PTBs compared to term. We discovered known (hypertension, diabetes, and chronic kidney disease) and less established (blood, cardiac, gynecological, and liver conditions) associations. Type 1 diabetes was the most significant overall association (adjusted p = 1.6×10 -14 , adjusted OR = 7 (95% CI 5, 12)), and the odds ratios for the significant phenotypes ranged from 3 to 13. We further carried out analysis stratified by spontaneous vs. indicated PTB. No phenotypes significantly associated with spontaneous PTB; however, the results for indicated PTB largely recapitulated the phenotype associations with all PTBs. Conclusions: Our study underscores the limitations of approaches that combine indicated and spontaneous births together. When combined, significant associations were almost entirely driven by indicated PTBs, although our spontaneous and indicated groups were of a similar size. Investigating the spontaneous population has the potential to reveal new pathways and understanding of the heterogeneity of PTB.

8.
Genome Biol Evol ; 15(12)2023 Dec 01.
Artigo em Inglês | MEDLINE | ID: mdl-38095367

RESUMO

When the ancestors of modern Eurasians migrated out of Africa and interbred with Eurasian archaic hominins, namely, Neanderthals and Denisovans, DNA of archaic ancestry integrated into the genomes of anatomically modern humans. This process potentially accelerated adaptation to Eurasian environmental factors, including reduced ultraviolet radiation and increased variation in seasonal dynamics. However, whether these groups differed substantially in circadian biology and whether archaic introgression adaptively contributed to human chronotypes remain unknown. Here, we traced the evolution of chronotype based on genomes from archaic hominins and present-day humans. First, we inferred differences in circadian gene sequences, splicing, and regulation between archaic hominins and modern humans. We identified 28 circadian genes containing variants with potential to alter splicing in archaics (e.g., CLOCK, PER2, RORB, and RORC) and 16 circadian genes likely divergently regulated between present-day humans and archaic hominins, including RORA. These differences suggest the potential for introgression to modify circadian gene expression. Testing this hypothesis, we found that introgressed variants are enriched among expression quantitative trait loci for circadian genes. Supporting the functional relevance of these regulatory effects, we found that many introgressed alleles have associations with chronotype. Strikingly, the strongest introgressed effects on chronotype increase morningness, consistent with adaptations to high latitude in other species. Finally, we identified several circadian loci with evidence of adaptive introgression or latitudinal clines in allele frequency. These findings identify differences in circadian gene regulation between modern humans and archaic hominins and support the contribution of introgression via coordinated effects on variation in human chronotype.


Assuntos
Hominidae , Homem de Neandertal , Animais , Humanos , Raios Ultravioleta , Genoma Humano , Hominidae/genética , Homem de Neandertal/genética , Frequência do Gene
9.
bioRxiv ; 2023 Oct 26.
Artigo em Inglês | MEDLINE | ID: mdl-37961120

RESUMO

Phenotypic divergence between closely related species, including bonobos and chimpanzees (genus Pan), is largely driven by variation in gene regulation. The 3D structure of the genome mediates gene expression; however, genome folding differences in Pan are not well understood. Here, we apply machine learning to predict genome-wide 3D genome contact maps from DNA sequence for 56 bonobos and chimpanzees, encompassing all five extant lineages. We use a pairwise approach to estimate 3D divergence between individuals from the resulting contact maps in 4,420 1 Mb genomic windows. While most pairs were similar, ∼17% were predicted to be substantially divergent in genome folding. The most dissimilar maps were largely driven by single individuals with rare variants that produce unique 3D genome folding in a region. We also identified 89 genomic windows where bonobo and chimpanzee contact maps substantially diverged, including several windows harboring genes associated with traits implicated in Pan phenotypic divergence. We used in silico mutagenesis to identify 51 3D-modifying variants in these bonobo-chimpanzee divergent windows, finding that 34 or 66.67% induce genome folding changes via CTCF binding motif disruption. Our results reveal 3D genome variation at the population-level and identify genomic regions where changes in 3D folding may contribute to phenotypic differences in our closest living relatives.

10.
Res Sq ; 2023 Sep 14.
Artigo em Inglês | MEDLINE | ID: mdl-37790518

RESUMO

SLC22A10 is classified as an orphan transporter with unknown substrates and function. Here we describe the discovery of the substrate specificity and functional characteristics of SLC22A10. The human SLC22A10 tagged with green fluorescent protein was found to be absent from the plasma membrane, in contrast to the SLC22A10 orthologs found in great apes. Estradiol-17ß-glucuronide accumulated in cells expressing great ape SLC22A10 orthologs (over 4-fold, p<0.001). In contrast, human SLC22A10 displayed no uptake function. Sequence alignments revealed two amino acid differences including a proline at position 220 of the human SLC22A10 and a leucine at the same position of great ape orthologs. Site-directed mutagenesis yielding the human SLC22A10-P220L produced a protein with excellent plasma membrane localization and associated uptake function. Neanderthal and Denisovan genomes show human-like sequences at proline 220 position, corroborating that SLC22A10 were rendered nonfunctional during hominin evolution after the divergence from the pan lineage (chimpanzees and bonobos). These findings demonstrate that human SLC22A10 is a unitary pseudogene and was inactivated by a missense mutation that is fixed in humans, whereas orthologs in great apes transport sex steroid conjugates.

11.
bioRxiv ; 2023 Aug 12.
Artigo em Inglês | MEDLINE | ID: mdl-37609337

RESUMO

SLC22A10 is classified as an orphan transporter with unknown substrates and function. Here we describe the discovery of the substrate specificity and functional characteristics of SLC22A10. The human SLC22A10 tagged with green fluorescent protein was found to be absent from the plasma membrane, in contrast to the SLC22A10 orthologs found in great apes. Estradiol-17ß-glucuronide accumulated in cells expressing great ape SLC22A10 orthologs (over 4-fold, p<0.001). In contrast, human SLC22A10 displayed no uptake function. Sequence alignments revealed two amino acid differences including a proline at position 220 of the human SLC22A10 and a leucine at the same position of great ape orthologs. Site-directed mutagenesis yielding the human SLC22A10-P220L produced a protein with excellent plasma membrane localization and associated uptake function. Neanderthal and Denisovan genomes show human-like sequences at proline 220 position, corroborating that SLC22A10 were rendered nonfunctional during hominin evolution after the divergence from the pan lineage (chimpanzees and bonobos). These findings demonstrate that human SLC22A10 is a unitary pseudogene and was inactivated by a missense mutation that is fixed in humans, whereas orthologs in great apes transport sex steroid conjugates.

12.
Adv Exp Med Biol ; 1415: 157-163, 2023.
Artigo em Inglês | MEDLINE | ID: mdl-37440029

RESUMO

Protein function can be impacted by changes in protein structure stability, but determining which change has impact is complex. Stability can be affected by a large change in the tertiary (3D) structure of the protein or due to free-energy changes caused by single amino acid substitutions. Changes in the DNA sequence can have minor or major impact on protein stability, which can lead to disease. Inherited retinal degenerations are generally caused by single mutations which are mostly located in protein-coding regions, while age-related macular degeneration (AMD) is a complex disorder that can be influenced by some genetic variants impacting proteins involved in the disease, although not all AMD risk variants lead to amino acid changes. Here, we review ways that proteins may be affected, the identification and understanding of these changes, and how to identify causal changes that can be targeted to develop treatments to alleviate retinal degenerative disease.


Assuntos
Degeneração Macular , Degeneração Retiniana , Humanos , Degeneração Retiniana/genética , Retina , Degeneração Macular/genética , Mutação , Proteínas/química , Estabilidade Proteica
13.
Genome Biol Evol ; 15(7)2023 07 03.
Artigo em Inglês | MEDLINE | ID: mdl-37410590

RESUMO

Multiple distal cis-regulatory elements (CREs) often cooperate to regulate gene expression, and the presence of multiple CREs for a gene has been proposed to provide redundancy and robustness to variation. However, we do not understand how attributes of a gene's distal CRE landscape-the CREs that contribute to its regulation-relate to its expression and function. Here, we integrate three-dimensional chromatin conformation and functional genomics data to quantify the CRE landscape composition genome-wide across ten human tissues and relate their attributes to the function, constraint, and expression patterns of genes. Within each tissue, we find that expressed genes have larger CRE landscapes than nonexpressed genes and that genes with tissue-specific CREs are more likely to have tissue-specific expression. Controlling for the association between expression level and CRE landscape size, we also find that CRE landscapes around genes under strong constraint (e.g., loss-of-function intolerant and housekeeping genes) are not significantly smaller than other expressed genes as previously proposed; however, they do have more evolutionarily conserved sequences than CREs of expressed genes overall. We also show that CRE landscape size does not associate with expression variability across individuals; nonetheless, genes with larger CRE landscapes have a relative depletion for variants that influence expression levels (expression quantitative trait loci). Overall, this work illustrates how differences in gene function, expression, and evolutionary constraint are reflected in features of CRE landscapes. Thus, considering the CRE landscape of a gene is vital for understanding gene expression dynamics across biological contexts and interpreting the effects of noncoding genetic variants.


Assuntos
Genômica , Sequências Reguladoras de Ácido Nucleico , Humanos , Especificidade de Órgãos , Genoma , Fenótipo
14.
Res Sq ; 2023 May 23.
Artigo em Inglês | MEDLINE | ID: mdl-37292728

RESUMO

Comparing chromatin contact maps is an essential step in quantifying how three-dimensional (3D) genome organization shapes development, evolution, and disease. However, no gold standard exists for comparing contact maps, and even simple methods often disagree. In this study, we propose novel comparison methods and evaluate them alongside existing approaches using genome-wide Hi-C data and 22,500 in silico predicted contact maps. We also quantify the robustness of methods to common sources of biological and technical variation, such as boundary size and noise. We find that simple difference-based methods such as mean squared error are suitable for initial screening, but biologically informed methods are necessary to identify why maps diverge and propose specific functional hypotheses. We provide a reference guide, codebase, and benchmark for rapidly comparing chromatin contact maps at scale to enable biological insights into the 3D organization of the genome.

15.
bioRxiv ; 2023 Apr 29.
Artigo em Inglês | MEDLINE | ID: mdl-37162834

RESUMO

Background: KCNE1 encodes a 129-residue cardiac potassium channel (IKs) subunit. KCNE1 variants are associated with long QT syndrome and atrial fibrillation. However, most variants have insufficient evidence of clinical consequences and thus limited clinical utility. Results: Here, we demonstrate the power of variant effect mapping, which couples saturation mutagenesis with high-throughput sequencing, to ascertain the function of thousands of protein coding KCNE1 variants. We comprehensively assayed KCNE1 variant cell surface expression (2,554/2,709 possible single amino acid variants) and function (2,539 variants). We identified 470 loss-of-surface expression and 588 loss-of-function variants. Out of the 588 loss-of-function variants, only 155 had low cell surface expression. The latter half of the protein is dispensable for protein trafficking but essential for channel function. 22 of the 30 KCNE1 residues (73%) highly intolerant of variation were in predicted close contact with binding partners KCNQ1 or calmodulin. Our data were highly concordant with gold standard electrophysiological data (ρ = -0.65), population and patient cohorts (32/38 concordant variants), and computational metrics (ρ = -0.55). Our data provide moderate-strength evidence for the ACMG/AMP functional criteria for benign and pathogenic variants. Conclusions: Comprehensive variant effect maps of KCNE1 can both provide insight into IKs channel biology and help reclassify variants of uncertain significance.

16.
Nat Ecol Evol ; 7(6): 939-953, 2023 Jun.
Artigo em Inglês | MEDLINE | ID: mdl-37142741

RESUMO

Alternative splicing contributes to adaptation and divergence in many species. However, it has not been possible to directly compare splicing between modern and archaic hominins. Here, we unmask the recent evolution of this previously unobservable regulatory mechanism by applying SpliceAI, a machine-learning algorithm that identifies splice-altering variants (SAVs), to high-coverage genomes from three Neanderthals and a Denisovan. We discover 5,950 putative archaic SAVs, of which 2,186 are archaic-specific and 3,607 also occur in modern humans via introgression (244) or shared ancestry (3,520). Archaic-specific SAVs are enriched in genes that contribute to traits potentially relevant to hominin phenotypic divergence, such as the epidermis, respiration and spinal rigidity. Compared to shared SAVs, archaic-specific SAVs occur in sites under weaker selection and are more common in genes with tissue-specific expression. Further underscoring the importance of negative selection on SAVs, Neanderthal lineages with low effective population sizes are enriched for SAVs compared to Denisovan and shared SAVs. Finally, we find that nearly all introgressed SAVs in humans were shared across the three Neanderthals, suggesting that older SAVs were more tolerated in human genomes. Our results reveal the splicing landscape of archaic hominins and identify potential contributions of splicing to phenotypic differences among hominins.


Assuntos
Hominidae , Homem de Neandertal , Animais , Humanos , Hominidae/genética , Homem de Neandertal/genética , Processamento Alternativo , Genoma Humano , Densidade Demográfica
17.
bioRxiv ; 2023 Apr 04.
Artigo em Inglês | MEDLINE | ID: mdl-37066196

RESUMO

Comparing chromatin contact maps is an essential step in quantifying how three-dimensional (3D) genome organization shapes development, evolution, and disease. However, no gold standard exists for comparing contact maps, and even simple methods often disagree. In this study, we propose novel comparison methods and evaluate them alongside existing approaches using genome-wide Hi-C data and 22,500 in silico predicted contact maps. We also quantify the robustness of methods to common sources of biological and technical variation, such as boundary size and noise. We find that simple difference-based methods such as mean squared error are suitable for initial screening, but biologically informed methods are necessary to identify why maps diverge and propose specific functional hypotheses. We provide a reference guide, codebase, and benchmark for rapidly comparing chromatin contact maps at scale to enable biological insights into the 3D organization of the genome.

18.
JAMIA Open ; 6(1): ooad007, 2023 Apr.
Artigo em Inglês | MEDLINE | ID: mdl-36875690

RESUMO

Enabling discovery across the spectrum of rare and common diseases requires the integration of biological knowledge with clinical data; however, differences in terminologies present a major barrier. For example, the Human Phenotype Ontology (HPO) is the primary vocabulary for describing features of rare diseases, while most clinical encounters use International Classification of Diseases (ICD) billing codes. ICD codes are further organized into clinically meaningful phenotypes via phecodes. Despite their prevalence, no robust phenome-wide disease mapping between HPO and phecodes/ICD exists. Here, we synthesize evidence using diverse sources and methods-including text matching, the National Library of Medicine's Unified Medical Language System (UMLS), Wikipedia, SORTA, and PheMap-to define a mapping between phecodes and HPO terms via 38 950 links. We evaluate the precision and recall for each domain of evidence, both individually and jointly. This flexibility permits users to tailor the HPO-phecode links for diverse applications along the spectrum of monogenic to polygenic diseases.

19.
bioRxiv ; 2023 Feb 15.
Artigo em Inglês | MEDLINE | ID: mdl-36824965

RESUMO

Gene regulatory divergence between species can result from cis-acting local changes to regulatory element DNA sequences or global trans-acting changes to the regulatory environment. Understanding how these mechanisms drive regulatory evolution has been limited by challenges in identifying trans-acting changes. We present a comprehensive approach to directly identify cis- and trans-divergent regulatory elements between human and rhesus macaque lymphoblastoid cells using ATAC-STARR-seq. In addition to thousands of cis changes, we discover an unexpected number (~10,000) of trans changes and show that cis and trans elements exhibit distinct patterns of sequence divergence and function. We further identify differentially expressed transcription factors that underlie >50% of trans differences and trace how cis changes can produce cascades of trans changes. Overall, we find that most divergent elements (67%) experienced changes in both cis and trans, revealing a substantial role for trans divergence-alone and together with cis changes-to regulatory differences between species.

20.
bioRxiv ; 2023 Sep 17.
Artigo em Inglês | MEDLINE | ID: mdl-36778254

RESUMO

Introduction: When the ancestors of modern Eurasians migrated out of Africa and interbred with Eurasian archaic hominins, namely Neanderthals and Denisovans, DNA of archaic ancestry integrated into the genomes of anatomically modern humans. This process potentially accelerated adaptation to Eurasian environmental factors, including reduced ultra-violet radiation and increased variation in seasonal dynamics. However, whether these groups differed substantially in circadian biology, and whether archaic introgression adaptively contributed to human chronotypes remains unknown. Results: Here we traced the evolution of chronotype based on genomes from archaic hominins and present-day humans. First, we inferred differences in circadian gene sequences, splicing, and regulation between archaic hominins and modern humans. We identified 28 circadian genes containing variants with potential to alter splicing in archaics (e.g., CLOCK, PER2, RORB, RORC), and 16 circadian genes likely divergently regulated between present-day humans and archaic hominins, including RORA. These differences suggest the potential for introgression to modify circadian gene expression. Testing this hypothesis, we found that introgressed variants are enriched among eQTLs for circadian genes. Supporting the functional relevance of these regulatory effects, we found that many introgressed alleles have associations with chronotype. Strikingly, the strongest introgressed effects on chronotype increase morningness, consistent with adaptations to high latitude in other species. Finally, we identified several circadian loci with evidence of adaptive introgression or latitudinal clines in allele frequency. Conclusions: These findings identify differences in circadian gene regulation between modern humans and archaic hominins and support the contribution of introgression via coordinated effects on variation in human chronotype.

SELEÇÃO DE REFERÊNCIAS
DETALHE DA PESQUISA
...