Your browser doesn't support javascript.
loading
Mostrar: 20 | 50 | 100
Resultados 1 - 20 de 116
Filtrar
1.
Genome Med ; 16(1): 73, 2024 May 30.
Artículo en Inglés | MEDLINE | ID: mdl-38816749

RESUMEN

BACKGROUND: KCNE1 encodes a 129-residue cardiac potassium channel (IKs) subunit. KCNE1 variants are associated with long QT syndrome and atrial fibrillation. However, most variants have insufficient evidence of clinical consequences and thus limited clinical utility. METHODS: In this study, we leveraged the power of variant effect mapping, which couples saturation mutagenesis with high-throughput sequencing, to ascertain the function of thousands of protein-coding KCNE1 variants. RESULTS: We comprehensively assayed KCNE1 variant cell surface expression (2554/2709 possible single-amino-acid variants) and function (2534 variants). Our study identified 470 loss- or partial loss-of-surface expression and 574 loss- or partial loss-of-function variants. Of the 574 loss- or partial loss-of-function variants, 152 (26.5%) had reduced cell surface expression, indicating that most functionally deleterious variants affect channel gating. Nonsense variants at residues 56-104 generally had WT-like trafficking scores but decreased functional scores, indicating that the latter half of the protein is dispensable for protein trafficking but essential for channel function. 22 of the 30 KCNE1 residues (73%) highly intolerant of variation (with > 70% loss-of-function variants) were in predicted close contact with binding partners KCNQ1 or calmodulin. Our functional assay data were consistent with gold standard electrophysiological data (ρ = - 0.64), population and patient cohorts (32/38 presumed benign or pathogenic variants with consistent scores), and computational predictors (ρ = - 0.62). Our data provide moderate-strength evidence for the American College of Medical Genetics/Association of Molecular Pathology functional criteria for benign and pathogenic variants. CONCLUSIONS: Comprehensive variant effect maps of KCNE1 can both provide insight into I Ks channel biology and help reclassify variants of uncertain significance.


Asunto(s)
Calmodulina , Canales de Potasio con Entrada de Voltaje , Canales de Potasio con Entrada de Voltaje/genética , Canales de Potasio con Entrada de Voltaje/metabolismo , Humanos , Calmodulina/genética , Calmodulina/metabolismo , Arritmias Cardíacas/genética , Secuenciación de Nucleótidos de Alto Rendimiento , Variación Genética , Transporte de Proteínas , Células HEK293
2.
HGG Adv ; : 100310, 2024 May 20.
Artículo en Inglés | MEDLINE | ID: mdl-38773771

RESUMEN

Non-protein-coding genetic variants are a major driver of the genetic risk for human disease; however, identifying which non-coding variants contribute to diseases and their mechanisms remains challenging. In-silico variant prioritization methods quantify a variant's severity, but for most methods the specific phenotype and disease-context of the prediction remain poorly defined. For example, many commonly used methods provide a single, organism-wide score for each variant, while other methods summarize a variant's impact in certain tissues and/or cell-types. Here we propose a complementary disease-specific variant prioritization scheme, which is motivated by the observation that variants contributing to disease often operate through specific biological mechanisms.

3.
medRxiv ; 2024 May 04.
Artículo en Inglés | MEDLINE | ID: mdl-38746318

RESUMEN

Molecular studies of Alzheimer's disease (AD) implicate potential links between autoimmunity and AD, but the underlying clinical relationships between these conditions remain poorly understood. Electronic health records (EHRs) provide an opportunity to determine the clinical risk relationship between autoimmune disorders and AD and understand whether specific disorders and disorder subtypes affect AD risk at the phenotypic level in human populations. We evaluated relationships between 26 autoimmune disorders and AD across retrospective observational case-control and cohort study designs in the EHR systems at UCSF and Stanford. We quantified overall and sex-specific AD risk effects that these autoimmune disorders confer. We identified significantly increased AD risk in autoimmune disorder patients in both study designs at UCSF and at Stanford. This pattern was driven by specific autoimmunity subtypes including endocrine, gastrointestinal, dermatologic, and musculoskeletal disorders. We also observed increased AD risk from autoimmunity in both women and men, but women with autoimmune disorders continued to have a higher AD prevalence than men, indicating persistent sex-specificity. This study identifies autoimmune disorders as strong risk factors for AD that validate across several study designs and EHR databases. It sets the foundation for exploring how underlying autoimmune mechanisms increase AD risk and contribute to AD pathogenesis.

4.
Nat Commun ; 15(1): 4380, 2024 May 23.
Artículo en Inglés | MEDLINE | ID: mdl-38782905

RESUMEN

SLC22A10 is an orphan transporter with unknown substrates and function. The goal of this study is to elucidate its substrate specificity and functional characteristics. In contrast to orthologs from great apes, human SLC22A10, tagged with green fluorescent protein, is not expressed on the plasma membrane. Cells expressing great ape SLC22A10 orthologs exhibit significant accumulation of estradiol-17ß-glucuronide, unlike those expressing human SLC22A10. Sequence alignments reveal a proline at position 220 in humans, which is a leucine in great apes. Replacing proline with leucine in SLC22A10-P220L restores plasma membrane localization and uptake function. Neanderthal and Denisovan genomes show proline at position 220, akin to modern humans, indicating functional loss during hominin evolution. Human SLC22A10 is a unitary pseudogene due to a fixed missense mutation, P220, while in great apes, its orthologs transport sex steroid conjugates. Characterizing SLC22A10 across species sheds light on its biological role, influencing organism development and steroid homeostasis.


Asunto(s)
Primates , Animales , Humanos , Primates/genética , Secuencia de Aminoácidos , Células HEK293 , Proteínas de Transporte de Catión Orgánico/metabolismo , Proteínas de Transporte de Catión Orgánico/genética , Hominidae/genética , Hominidae/metabolismo , Estradiol/metabolismo , Seudogenes , Especificidad por Sustrato , Mutación Missense
5.
Cell Genom ; 4(4): 100536, 2024 Apr 10.
Artículo en Inglés | MEDLINE | ID: mdl-38604126

RESUMEN

Gene regulatory divergence between species can result from cis-acting local changes to regulatory element DNA sequences or global trans-acting changes to the regulatory environment. Understanding how these mechanisms drive regulatory evolution has been limited by challenges in identifying trans-acting changes. We present a comprehensive approach to directly identify cis- and trans-divergent regulatory elements between human and rhesus macaque lymphoblastoid cells using assay for transposase-accessible chromatin coupled to self-transcribing active regulatory region (ATAC-STARR) sequencing. In addition to thousands of cis changes, we discover an unexpected number (∼10,000) of trans changes and show that cis and trans elements exhibit distinct patterns of sequence divergence and function. We further identify differentially expressed transcription factors that underlie ∼37% of trans differences and trace how cis changes can produce cascades of trans changes. Overall, we find that most divergent elements (67%) experienced changes in both cis and trans, revealing a substantial role for trans divergence-alone and together with cis changes-in regulatory differences between species.


Asunto(s)
Regulación de la Expresión Génica , Secuencias Reguladoras de Ácidos Nucleicos , Animales , Humanos , Macaca mulatta/genética , Secuencias Reguladoras de Ácidos Nucleicos/genética , Regulación de la Expresión Génica/genética , Factores de Transcripción/genética , Cromatina/genética
6.
Genome Biol Evol ; 15(12)2023 Dec 01.
Artículo en Inglés | MEDLINE | ID: mdl-38095367

RESUMEN

When the ancestors of modern Eurasians migrated out of Africa and interbred with Eurasian archaic hominins, namely, Neanderthals and Denisovans, DNA of archaic ancestry integrated into the genomes of anatomically modern humans. This process potentially accelerated adaptation to Eurasian environmental factors, including reduced ultraviolet radiation and increased variation in seasonal dynamics. However, whether these groups differed substantially in circadian biology and whether archaic introgression adaptively contributed to human chronotypes remain unknown. Here, we traced the evolution of chronotype based on genomes from archaic hominins and present-day humans. First, we inferred differences in circadian gene sequences, splicing, and regulation between archaic hominins and modern humans. We identified 28 circadian genes containing variants with potential to alter splicing in archaics (e.g., CLOCK, PER2, RORB, and RORC) and 16 circadian genes likely divergently regulated between present-day humans and archaic hominins, including RORA. These differences suggest the potential for introgression to modify circadian gene expression. Testing this hypothesis, we found that introgressed variants are enriched among expression quantitative trait loci for circadian genes. Supporting the functional relevance of these regulatory effects, we found that many introgressed alleles have associations with chronotype. Strikingly, the strongest introgressed effects on chronotype increase morningness, consistent with adaptations to high latitude in other species. Finally, we identified several circadian loci with evidence of adaptive introgression or latitudinal clines in allele frequency. These findings identify differences in circadian gene regulation between modern humans and archaic hominins and support the contribution of introgression via coordinated effects on variation in human chronotype.


Asunto(s)
Hominidae , Hombre de Neandertal , Animales , Humanos , Rayos Ultravioleta , Genoma Humano , Hominidae/genética , Hombre de Neandertal/genética , Frecuencia de los Genes
7.
bioRxiv ; 2023 Nov 28.
Artículo en Inglés | MEDLINE | ID: mdl-38076936

RESUMEN

There is an established yet unexplained link between interferon (IFN) and systemic lupus erythematosus (SLE). The expression of sequences derived from transposable elements (TEs) may contribute to production of type I IFNs and generation of autoantibodies. We profiled cell-sorted RNA-seq data (CD4+ T cells, CD14+ monocytes, CD19+ B cells, and NK cells) from PBMCs of 120 SLE patients and quantified TE expression identifying 27,135 TEs. We tested for differential TE expression across 10 SLE phenotypes including autoantibody production and disease activity and discovered 731 differentially expressed (DE) TEs whose effects were mostly cell-specific and phenotype-specific. DE TEs were enriched for specific families and viral genes encoded in TE sequences. Increased expression of DE TEs was associated with genes involved in antiviral activity such as LY6E, ISG15, TRIM22 and pathways such as interferon signaling. These findings suggest that expression of TEs contributes to activation of SLE-related mechanisms in a cell-specific manner, which can impact disease diagnostics and therapeutics.

8.
medRxiv ; 2023 Nov 30.
Artículo en Inglés | MEDLINE | ID: mdl-38077057

RESUMEN

Background: Preterm birth (PTB) is the leading cause of infant mortality and follows multiple biological pathways, many of which are poorly understood. Some PTBs result from medically indicated labor following complications from hypertension and/or diabetes, while many others are spontaneous with unknown causes. Previously, investigation of potential risk factors has been limited by lack of data on maternal medical history and the difficulty of classifying PTBs as indicated or spontaneous. Here, we leverage electronic health record (EHR) data (patient health information including demographics, diagnoses, and medications) and a supplemental curated pregnancy database to overcome these limitations. Novel associations may provide new insight into the pathophysiology of PTB as well as help identify individuals who would be at risk of PTB. Methods: We quantified associations between maternal diagnoses and preterm birth using logistic regression controlling for maternal age and socioeconomic factors within a University of California, San Francisco (UCSF), EHR cohort with 10,643 births ( nterm = 9692, nspontaneous_preterm = 449, nindicated_preterm = 418) and maternal pre-conception diagnosis phenotypes derived from International Classification of Diseases (ICD) 9 and 10 codes. Results: Eighteen conditions significantly and robustly (False Discovery Rate (FDR)<0.05) associated with PTBs compared to term. We discovered known (hypertension, diabetes, and chronic kidney disease) and less established (blood, cardiac, gynecological, and liver conditions) associations. Type 1 diabetes was the most significant overall association (adjusted p = 1.6×10 -14 , adjusted OR = 7 (95% CI 5, 12)), and the odds ratios for the significant phenotypes ranged from 3 to 13. We further carried out analysis stratified by spontaneous vs. indicated PTB. No phenotypes significantly associated with spontaneous PTB; however, the results for indicated PTB largely recapitulated the phenotype associations with all PTBs. Conclusions: Our study underscores the limitations of approaches that combine indicated and spontaneous births together. When combined, significant associations were almost entirely driven by indicated PTBs, although our spontaneous and indicated groups were of a similar size. Investigating the spontaneous population has the potential to reveal new pathways and understanding of the heterogeneity of PTB.

9.
bioRxiv ; 2023 Oct 26.
Artículo en Inglés | MEDLINE | ID: mdl-37961120

RESUMEN

Phenotypic divergence between closely related species, including bonobos and chimpanzees (genus Pan), is largely driven by variation in gene regulation. The 3D structure of the genome mediates gene expression; however, genome folding differences in Pan are not well understood. Here, we apply machine learning to predict genome-wide 3D genome contact maps from DNA sequence for 56 bonobos and chimpanzees, encompassing all five extant lineages. We use a pairwise approach to estimate 3D divergence between individuals from the resulting contact maps in 4,420 1 Mb genomic windows. While most pairs were similar, ∼17% were predicted to be substantially divergent in genome folding. The most dissimilar maps were largely driven by single individuals with rare variants that produce unique 3D genome folding in a region. We also identified 89 genomic windows where bonobo and chimpanzee contact maps substantially diverged, including several windows harboring genes associated with traits implicated in Pan phenotypic divergence. We used in silico mutagenesis to identify 51 3D-modifying variants in these bonobo-chimpanzee divergent windows, finding that 34 or 66.67% induce genome folding changes via CTCF binding motif disruption. Our results reveal 3D genome variation at the population-level and identify genomic regions where changes in 3D folding may contribute to phenotypic differences in our closest living relatives.

10.
Res Sq ; 2023 Sep 14.
Artículo en Inglés | MEDLINE | ID: mdl-37790518

RESUMEN

SLC22A10 is classified as an orphan transporter with unknown substrates and function. Here we describe the discovery of the substrate specificity and functional characteristics of SLC22A10. The human SLC22A10 tagged with green fluorescent protein was found to be absent from the plasma membrane, in contrast to the SLC22A10 orthologs found in great apes. Estradiol-17ß-glucuronide accumulated in cells expressing great ape SLC22A10 orthologs (over 4-fold, p<0.001). In contrast, human SLC22A10 displayed no uptake function. Sequence alignments revealed two amino acid differences including a proline at position 220 of the human SLC22A10 and a leucine at the same position of great ape orthologs. Site-directed mutagenesis yielding the human SLC22A10-P220L produced a protein with excellent plasma membrane localization and associated uptake function. Neanderthal and Denisovan genomes show human-like sequences at proline 220 position, corroborating that SLC22A10 were rendered nonfunctional during hominin evolution after the divergence from the pan lineage (chimpanzees and bonobos). These findings demonstrate that human SLC22A10 is a unitary pseudogene and was inactivated by a missense mutation that is fixed in humans, whereas orthologs in great apes transport sex steroid conjugates.

11.
bioRxiv ; 2023 Aug 12.
Artículo en Inglés | MEDLINE | ID: mdl-37609337

RESUMEN

SLC22A10 is classified as an orphan transporter with unknown substrates and function. Here we describe the discovery of the substrate specificity and functional characteristics of SLC22A10. The human SLC22A10 tagged with green fluorescent protein was found to be absent from the plasma membrane, in contrast to the SLC22A10 orthologs found in great apes. Estradiol-17ß-glucuronide accumulated in cells expressing great ape SLC22A10 orthologs (over 4-fold, p<0.001). In contrast, human SLC22A10 displayed no uptake function. Sequence alignments revealed two amino acid differences including a proline at position 220 of the human SLC22A10 and a leucine at the same position of great ape orthologs. Site-directed mutagenesis yielding the human SLC22A10-P220L produced a protein with excellent plasma membrane localization and associated uptake function. Neanderthal and Denisovan genomes show human-like sequences at proline 220 position, corroborating that SLC22A10 were rendered nonfunctional during hominin evolution after the divergence from the pan lineage (chimpanzees and bonobos). These findings demonstrate that human SLC22A10 is a unitary pseudogene and was inactivated by a missense mutation that is fixed in humans, whereas orthologs in great apes transport sex steroid conjugates.

12.
Adv Exp Med Biol ; 1415: 157-163, 2023.
Artículo en Inglés | MEDLINE | ID: mdl-37440029

RESUMEN

Protein function can be impacted by changes in protein structure stability, but determining which change has impact is complex. Stability can be affected by a large change in the tertiary (3D) structure of the protein or due to free-energy changes caused by single amino acid substitutions. Changes in the DNA sequence can have minor or major impact on protein stability, which can lead to disease. Inherited retinal degenerations are generally caused by single mutations which are mostly located in protein-coding regions, while age-related macular degeneration (AMD) is a complex disorder that can be influenced by some genetic variants impacting proteins involved in the disease, although not all AMD risk variants lead to amino acid changes. Here, we review ways that proteins may be affected, the identification and understanding of these changes, and how to identify causal changes that can be targeted to develop treatments to alleviate retinal degenerative disease.


Asunto(s)
Degeneración Macular , Degeneración Retiniana , Humanos , Degeneración Retiniana/genética , Retina , Degeneración Macular/genética , Mutación , Proteínas/química , Estabilidad Proteica
13.
Genome Biol Evol ; 15(7)2023 07 03.
Artículo en Inglés | MEDLINE | ID: mdl-37410590

RESUMEN

Multiple distal cis-regulatory elements (CREs) often cooperate to regulate gene expression, and the presence of multiple CREs for a gene has been proposed to provide redundancy and robustness to variation. However, we do not understand how attributes of a gene's distal CRE landscape-the CREs that contribute to its regulation-relate to its expression and function. Here, we integrate three-dimensional chromatin conformation and functional genomics data to quantify the CRE landscape composition genome-wide across ten human tissues and relate their attributes to the function, constraint, and expression patterns of genes. Within each tissue, we find that expressed genes have larger CRE landscapes than nonexpressed genes and that genes with tissue-specific CREs are more likely to have tissue-specific expression. Controlling for the association between expression level and CRE landscape size, we also find that CRE landscapes around genes under strong constraint (e.g., loss-of-function intolerant and housekeeping genes) are not significantly smaller than other expressed genes as previously proposed; however, they do have more evolutionarily conserved sequences than CREs of expressed genes overall. We also show that CRE landscape size does not associate with expression variability across individuals; nonetheless, genes with larger CRE landscapes have a relative depletion for variants that influence expression levels (expression quantitative trait loci). Overall, this work illustrates how differences in gene function, expression, and evolutionary constraint are reflected in features of CRE landscapes. Thus, considering the CRE landscape of a gene is vital for understanding gene expression dynamics across biological contexts and interpreting the effects of noncoding genetic variants.


Asunto(s)
Genómica , Secuencias Reguladoras de Ácidos Nucleicos , Humanos , Especificidad de Órganos , Genoma , Fenotipo
14.
Res Sq ; 2023 May 23.
Artículo en Inglés | MEDLINE | ID: mdl-37292728

RESUMEN

Comparing chromatin contact maps is an essential step in quantifying how three-dimensional (3D) genome organization shapes development, evolution, and disease. However, no gold standard exists for comparing contact maps, and even simple methods often disagree. In this study, we propose novel comparison methods and evaluate them alongside existing approaches using genome-wide Hi-C data and 22,500 in silico predicted contact maps. We also quantify the robustness of methods to common sources of biological and technical variation, such as boundary size and noise. We find that simple difference-based methods such as mean squared error are suitable for initial screening, but biologically informed methods are necessary to identify why maps diverge and propose specific functional hypotheses. We provide a reference guide, codebase, and benchmark for rapidly comparing chromatin contact maps at scale to enable biological insights into the 3D organization of the genome.

15.
bioRxiv ; 2023 Apr 29.
Artículo en Inglés | MEDLINE | ID: mdl-37162834

RESUMEN

Background: KCNE1 encodes a 129-residue cardiac potassium channel (IKs) subunit. KCNE1 variants are associated with long QT syndrome and atrial fibrillation. However, most variants have insufficient evidence of clinical consequences and thus limited clinical utility. Results: Here, we demonstrate the power of variant effect mapping, which couples saturation mutagenesis with high-throughput sequencing, to ascertain the function of thousands of protein coding KCNE1 variants. We comprehensively assayed KCNE1 variant cell surface expression (2,554/2,709 possible single amino acid variants) and function (2,539 variants). We identified 470 loss-of-surface expression and 588 loss-of-function variants. Out of the 588 loss-of-function variants, only 155 had low cell surface expression. The latter half of the protein is dispensable for protein trafficking but essential for channel function. 22 of the 30 KCNE1 residues (73%) highly intolerant of variation were in predicted close contact with binding partners KCNQ1 or calmodulin. Our data were highly concordant with gold standard electrophysiological data (ρ = -0.65), population and patient cohorts (32/38 concordant variants), and computational metrics (ρ = -0.55). Our data provide moderate-strength evidence for the ACMG/AMP functional criteria for benign and pathogenic variants. Conclusions: Comprehensive variant effect maps of KCNE1 can both provide insight into IKs channel biology and help reclassify variants of uncertain significance.

16.
Nat Ecol Evol ; 7(6): 939-953, 2023 Jun.
Artículo en Inglés | MEDLINE | ID: mdl-37142741

RESUMEN

Alternative splicing contributes to adaptation and divergence in many species. However, it has not been possible to directly compare splicing between modern and archaic hominins. Here, we unmask the recent evolution of this previously unobservable regulatory mechanism by applying SpliceAI, a machine-learning algorithm that identifies splice-altering variants (SAVs), to high-coverage genomes from three Neanderthals and a Denisovan. We discover 5,950 putative archaic SAVs, of which 2,186 are archaic-specific and 3,607 also occur in modern humans via introgression (244) or shared ancestry (3,520). Archaic-specific SAVs are enriched in genes that contribute to traits potentially relevant to hominin phenotypic divergence, such as the epidermis, respiration and spinal rigidity. Compared to shared SAVs, archaic-specific SAVs occur in sites under weaker selection and are more common in genes with tissue-specific expression. Further underscoring the importance of negative selection on SAVs, Neanderthal lineages with low effective population sizes are enriched for SAVs compared to Denisovan and shared SAVs. Finally, we find that nearly all introgressed SAVs in humans were shared across the three Neanderthals, suggesting that older SAVs were more tolerated in human genomes. Our results reveal the splicing landscape of archaic hominins and identify potential contributions of splicing to phenotypic differences among hominins.


Asunto(s)
Hominidae , Hombre de Neandertal , Animales , Humanos , Hominidae/genética , Hombre de Neandertal/genética , Empalme Alternativo , Genoma Humano , Densidad de Población
17.
bioRxiv ; 2023 Apr 04.
Artículo en Inglés | MEDLINE | ID: mdl-37066196

RESUMEN

Comparing chromatin contact maps is an essential step in quantifying how three-dimensional (3D) genome organization shapes development, evolution, and disease. However, no gold standard exists for comparing contact maps, and even simple methods often disagree. In this study, we propose novel comparison methods and evaluate them alongside existing approaches using genome-wide Hi-C data and 22,500 in silico predicted contact maps. We also quantify the robustness of methods to common sources of biological and technical variation, such as boundary size and noise. We find that simple difference-based methods such as mean squared error are suitable for initial screening, but biologically informed methods are necessary to identify why maps diverge and propose specific functional hypotheses. We provide a reference guide, codebase, and benchmark for rapidly comparing chromatin contact maps at scale to enable biological insights into the 3D organization of the genome.

18.
JAMIA Open ; 6(1): ooad007, 2023 Apr.
Artículo en Inglés | MEDLINE | ID: mdl-36875690

RESUMEN

Enabling discovery across the spectrum of rare and common diseases requires the integration of biological knowledge with clinical data; however, differences in terminologies present a major barrier. For example, the Human Phenotype Ontology (HPO) is the primary vocabulary for describing features of rare diseases, while most clinical encounters use International Classification of Diseases (ICD) billing codes. ICD codes are further organized into clinically meaningful phenotypes via phecodes. Despite their prevalence, no robust phenome-wide disease mapping between HPO and phecodes/ICD exists. Here, we synthesize evidence using diverse sources and methods-including text matching, the National Library of Medicine's Unified Medical Language System (UMLS), Wikipedia, SORTA, and PheMap-to define a mapping between phecodes and HPO terms via 38 950 links. We evaluate the precision and recall for each domain of evidence, both individually and jointly. This flexibility permits users to tailor the HPO-phecode links for diverse applications along the spectrum of monogenic to polygenic diseases.

19.
bioRxiv ; 2023 Sep 17.
Artículo en Inglés | MEDLINE | ID: mdl-36778254

RESUMEN

Introduction: When the ancestors of modern Eurasians migrated out of Africa and interbred with Eurasian archaic hominins, namely Neanderthals and Denisovans, DNA of archaic ancestry integrated into the genomes of anatomically modern humans. This process potentially accelerated adaptation to Eurasian environmental factors, including reduced ultra-violet radiation and increased variation in seasonal dynamics. However, whether these groups differed substantially in circadian biology, and whether archaic introgression adaptively contributed to human chronotypes remains unknown. Results: Here we traced the evolution of chronotype based on genomes from archaic hominins and present-day humans. First, we inferred differences in circadian gene sequences, splicing, and regulation between archaic hominins and modern humans. We identified 28 circadian genes containing variants with potential to alter splicing in archaics (e.g., CLOCK, PER2, RORB, RORC), and 16 circadian genes likely divergently regulated between present-day humans and archaic hominins, including RORA. These differences suggest the potential for introgression to modify circadian gene expression. Testing this hypothesis, we found that introgressed variants are enriched among eQTLs for circadian genes. Supporting the functional relevance of these regulatory effects, we found that many introgressed alleles have associations with chronotype. Strikingly, the strongest introgressed effects on chronotype increase morningness, consistent with adaptations to high latitude in other species. Finally, we identified several circadian loci with evidence of adaptive introgression or latitudinal clines in allele frequency. Conclusions: These findings identify differences in circadian gene regulation between modern humans and archaic hominins and support the contribution of introgression via coordinated effects on variation in human chronotype.

20.
bioRxiv ; 2023 Feb 15.
Artículo en Inglés | MEDLINE | ID: mdl-36824965

RESUMEN

Gene regulatory divergence between species can result from cis-acting local changes to regulatory element DNA sequences or global trans-acting changes to the regulatory environment. Understanding how these mechanisms drive regulatory evolution has been limited by challenges in identifying trans-acting changes. We present a comprehensive approach to directly identify cis- and trans-divergent regulatory elements between human and rhesus macaque lymphoblastoid cells using ATAC-STARR-seq. In addition to thousands of cis changes, we discover an unexpected number (~10,000) of trans changes and show that cis and trans elements exhibit distinct patterns of sequence divergence and function. We further identify differentially expressed transcription factors that underlie >50% of trans differences and trace how cis changes can produce cascades of trans changes. Overall, we find that most divergent elements (67%) experienced changes in both cis and trans, revealing a substantial role for trans divergence-alone and together with cis changes-to regulatory differences between species.

SELECCIÓN DE REFERENCIAS
DETALLE DE LA BÚSQUEDA
...