Your browser doesn't support javascript.
loading
Mostrar: 20 | 50 | 100
Resultados 1 - 20 de 15.963
Filtrar
1.
Genome Biol ; 25(1): 123, 2024 May 17.
Artigo em Inglês | MEDLINE | ID: mdl-38760655

RESUMO

BACKGROUND: Vision depends on the interplay between photoreceptor cells of the neural retina and the underlying retinal pigment epithelium (RPE). Most genes involved in inherited retinal diseases display specific spatiotemporal expression within these interconnected retinal components through the local recruitment of cis-regulatory elements (CREs) in 3D nuclear space. RESULTS: To understand the role of differential chromatin architecture in establishing tissue-specific expression at inherited retinal disease loci, we mapped genome-wide chromatin interactions using in situ Hi-C and H3K4me3 HiChIP on neural retina and RPE/choroid from human adult donor eyes. We observed chromatin looping between active promoters and 32,425 and 8060 candidate CREs in the neural retina and RPE/choroid, respectively. A comparative 3D genome analysis between these two retinal tissues revealed that 56% of 290 known inherited retinal disease genes were marked by differential chromatin interactions. One of these was ABCA4, which is implicated in the most common autosomal recessive inherited retinal disease. We zoomed in on retina- and RPE-specific cis-regulatory interactions at the ABCA4 locus using high-resolution UMI-4C. Integration with bulk and single-cell epigenomic datasets and in vivo enhancer assays in zebrafish revealed tissue-specific CREs interacting with ABCA4. CONCLUSIONS: Through comparative 3D genome mapping, based on genome-wide, promoter-centric, and locus-specific assays of human neural retina and RPE, we have shown that gene regulation at key inherited retinal disease loci is likely mediated by tissue-specific chromatin interactions. These findings do not only provide insight into tissue-specific regulatory landscapes at retinal disease loci, but also delineate the search space for non-coding genomic variation underlying unsolved inherited retinal diseases.


Assuntos
Cromatina , Retina , Doenças Retinianas , Epitélio Pigmentado da Retina , Humanos , Epitélio Pigmentado da Retina/metabolismo , Cromatina/metabolismo , Doenças Retinianas/genética , Doenças Retinianas/metabolismo , Retina/metabolismo , Transportadores de Cassetes de Ligação de ATP/genética , Transportadores de Cassetes de Ligação de ATP/metabolismo , Animais , Regiões Promotoras Genéticas , Loci Gênicos , Peixe-Zebra/genética , Sequências Reguladoras de Ácido Nucleico , Genoma Humano
2.
Nat Commun ; 15(1): 3839, 2024 May 07.
Artigo em Inglês | MEDLINE | ID: mdl-38714659

RESUMO

Pre-mRNA splicing, a key process in gene expression, can be therapeutically modulated using various drug modalities, including antisense oligonucleotides (ASOs). However, determining promising targets is hampered by the challenge of systematically mapping splicing-regulatory elements (SREs) in their native sequence context. Here, we use the catalytically inactive CRISPR-RfxCas13d RNA-targeting system (dCas13d/gRNA) as a programmable platform to bind SREs and modulate splicing by competing against endogenous splicing factors. SpliceRUSH, a high-throughput screening method, was developed to map SREs in any gene of interest using a lentivirus gRNA library that tiles the genetic region, including distal intronic sequences. When applied to SMN2, a therapeutic target for spinal muscular atrophy, SpliceRUSH robustly identifies not only known SREs but also a previously unknown distal intronic SRE, which can be targeted to alter exon 7 splicing using either dCas13d/gRNA or ASOs. This technology enables a deeper understanding of splicing regulation with applications for RNA-based drug discovery.


Assuntos
Sistemas CRISPR-Cas , Éxons , Íntrons , Splicing de RNA , RNA Guia de Sistemas CRISPR-Cas , Proteína 2 de Sobrevivência do Neurônio Motor , Humanos , Splicing de RNA/genética , Proteína 2 de Sobrevivência do Neurônio Motor/genética , RNA Guia de Sistemas CRISPR-Cas/genética , Íntrons/genética , Éxons/genética , Células HEK293 , Oligonucleotídeos Antissenso/genética , Atrofia Muscular Espinal/genética , Sequências Reguladoras de Ácido Nucleico/genética , Precursores de RNA/genética , Precursores de RNA/metabolismo
3.
BMC Bioinformatics ; 25(1): 179, 2024 May 07.
Artigo em Inglês | MEDLINE | ID: mdl-38714913

RESUMO

BACKGROUND: As genomic studies continue to implicate non-coding sequences in disease, testing the roles of these variants requires insights into the cell type(s) in which they are likely to be mediating their effects. Prior methods for associating non-coding variants with cell types have involved approaches using linkage disequilibrium or ontological associations, incurring significant processing requirements. GaiaAssociation is a freely available, open-source software that enables thousands of genomic loci implicated in a phenotype to be tested for enrichment at regulatory loci of multiple cell types in minutes, permitting insights into the cell type(s) mediating the studied phenotype. RESULTS: In this work, we present Regulatory Landscape Enrichment Analysis (RLEA) by GaiaAssociation and demonstrate its capability to test the enrichment of 12,133 variants across the cis-regulatory regions of 44 cell types. This analysis was completed in 134.0 ± 2.3 s, highlighting the efficient processing provided by GaiaAssociation. The intuitive interface requires only four inputs, offers a collection of customizable functions, and visualizes variant enrichment in cell-type regulatory regions through a heatmap matrix. GaiaAssociation is available on PyPi for download as a command line tool or Python package and the source code can also be installed from GitHub at https://github.com/GreallyLab/gaiaAssociation . CONCLUSIONS: GaiaAssociation is a novel package that provides an intuitive and efficient resource to understand the enrichment of non-coding variants across the cis-regulatory regions of different cells, empowering studies seeking to identify disease-mediating cell types.


Assuntos
Software , Variação Genética , Humanos , Genômica/métodos , Biologia Computacional/métodos , Fenótipo , Sequências Reguladoras de Ácido Nucleico/genética , Desequilíbrio de Ligação
4.
Nat Commun ; 15(1): 3699, 2024 May 02.
Artigo em Inglês | MEDLINE | ID: mdl-38698035

RESUMO

In silico identification of viral anti-CRISPR proteins (Acrs) has relied largely on the guilt-by-association method using known Acrs or anti-CRISPR associated proteins (Acas) as the bait. However, the low number and limited spread of the characterized archaeal Acrs and Aca hinders our ability to identify Acrs using guilt-by-association. Here, based on the observation that the few characterized archaeal Acrs and Aca are transcribed immediately post viral infection, we hypothesize that these genes, and many other unidentified anti-defense genes (ADG), are under the control of conserved regulatory sequences including a strong promoter, which can be used to predict anti-defense genes in archaeal viruses. Using this consensus sequence based method, we identify 354 potential ADGs in 57 archaeal viruses and 6 metagenome-assembled genomes. Experimental validation identified a CRISPR subtype I-A inhibitor and the first virally encoded inhibitor of an archaeal toxin-antitoxin based immune system. We also identify regulatory proteins potentially akin to Acas that can facilitate further identification of ADGs combined with the guilt-by-association approach. These results demonstrate the potential of regulatory sequence analysis for extensive identification of ADGs in viruses of archaea and bacteria.


Assuntos
Archaea , Vírus de Archaea , Vírus de Archaea/genética , Archaea/genética , Archaea/virologia , Archaea/imunologia , Regiões Promotoras Genéticas/genética , Repetições Palindrômicas Curtas Agrupadas e Regularmente Espaçadas/genética , Sequências Reguladoras de Ácido Nucleico/genética , Proteínas Virais/genética , Proteínas Arqueais/genética , Proteínas Arqueais/metabolismo , Metagenoma/genética , Proteínas Associadas a CRISPR/genética , Proteínas Associadas a CRISPR/metabolismo , Sistemas CRISPR-Cas/genética
5.
Mol Biol Rep ; 51(1): 612, 2024 May 05.
Artigo em Inglês | MEDLINE | ID: mdl-38704770

RESUMO

BACKGROUND: The α-Major Regulatory Element (α-MRE), also known as HS-40, is located upstream of the α-globin gene cluster and has a crucial role in the long-range regulation of the α-globin gene expression. This enhancer is polymorphic and several haplotypes were identified in different populations, with haplotype D almost exclusively found in African populations. The purpose of this research was to identify the HS-40 haplotype associated with the 3.7 kb α-thalassemia deletion (-α3.7del) in the Portuguese population, and determine its ancestry and influence on patients' hematological phenotype. METHODS AND RESULTS: We selected 111 Portuguese individuals previously analyzed by Gap-PCR to detect the presence of the -α3.7del: 50 without the -α3.7del, 34 heterozygous and 27 homozygous for the -α3.7del. The HS-40 region was amplified by PCR followed by Sanger sequencing. Four HS-40 haplotypes were found (A to D). The distribution of HS-40 haplotypes and genotypes are significantly different between individuals with and without the -α3.7del, being haplotype D and genotype AD the most prevalent in patients with this deletion in homozygosity. Furthermore, multiple correspondence analysis revealed that individuals without the -α3.7del are grouped with other European populations, while samples with the -α3.7del are separated from these and found more closely related to the African population. CONCLUSION: This study revealed for the first time an association of the HS-40 haplotype D with the -α3.7del in the Portuguese population, and its likely African ancestry. These results may have clinical importance as in vitro analysis of haplotype D showed a decrease in its enhancer activity on α-globin gene.


Assuntos
Haplótipos , Deleção de Sequência , alfa-Globinas , Talassemia alfa , Feminino , Humanos , Masculino , alfa-Globinas/genética , Talassemia alfa/genética , População Negra/genética , Frequência do Gene/genética , Genótipo , Haplótipos/genética , Portugal , Sequências Reguladoras de Ácido Nucleico/genética , Deleção de Sequência/genética
6.
Sci Rep ; 14(1): 10078, 2024 05 02.
Artigo em Inglês | MEDLINE | ID: mdl-38698030

RESUMO

Comparative analyses between traditional model organisms, such as the fruit fly Drosophila melanogaster, and more recent model organisms, such as the red flour beetle Tribolium castaneum, have provided a wealth of insight into conserved and diverged aspects of gene regulation. While the study of trans-regulatory components is relatively straightforward, the study of cis-regulatory elements (CREs, or enhancers) remains challenging outside of Drosophila. A central component of this challenge has been finding a core promoter suitable for enhancer-reporter assays in diverse insect species. Previously, we demonstrated that a Drosophila Synthetic Core Promoter (DSCP) functions in a cross-species manner in Drosophila and Tribolium. Given the over 300 million years of divergence between the Diptera and Coleoptera, we reasoned that DSCP-based reporter constructs will be useful when studying cis-regulation in a variety of insect models across the holometabola and possibly beyond. To this end, we sought to create a suite of new DSCP-based reporter vectors, leveraging dual compatibility with piggyBac and PhiC31-integration, the 3xP3 universal eye marker, GATEWAY cloning, different colors of reporters and markers, as well as Gal4-UAS binary expression. While all constructs functioned properly with a Tc-nub enhancer in Drosophila, complications arose with tissue-specific Gal4-UAS binary expression in Tribolium. Nevertheless, the functionality of these constructs across multiple holometabolous orders suggests a high potential compatibility with a variety of other insects. In addition, we present the piggyLANDR (piggyBac-LoxP AttP Neutralizable Destination Reporter) platform for the establishment of proper PhiC31 landing sites free from position effects. As a proof-of-principle, we demonstrated the workflow for piggyLANDR in Drosophila. The potential utility of these tools ranges from molecular biology research to pest and disease-vector management, and will help advance the study of gene regulation beyond traditional insect models.


Assuntos
Drosophila melanogaster , Genes Reporter , Vetores Genéticos , Regiões Promotoras Genéticas , Tribolium , Animais , Vetores Genéticos/genética , Tribolium/genética , Drosophila melanogaster/genética , Elementos Facilitadores Genéticos , Sequências Reguladoras de Ácido Nucleico/genética , Insetos/genética , Animais Geneticamente Modificados
7.
Sci Data ; 11(1): 467, 2024 May 08.
Artigo em Inglês | MEDLINE | ID: mdl-38719891

RESUMO

Angiogenesis is extensively involved in embryonic development and requires complex regulation networks, whose defects can cause a variety of vascular abnormalities. Cis-regulatory elements control gene expression at all developmental stages, but they have not been studied or profiled in angiogenesis yet. In this study, we exploited public DNase-seq and RNA-seq datasets from a VEGFA-stimulated in vitro angiogenic model, and carried out an integrated analysis of the transcriptome and chromatin accessibility across the entire process. Totally, we generated a bank of 47,125 angiogenic cis-regulatory elements with promoter (marker by H3K4me3) and/or enhancer (marker by H3K27ac) activities. Motif enrichment analysis revealed that these angiogenic cis-regulatory elements interacted preferentially with ETS family TFs. With this tool, we performed an association study using our WES data of TAPVC and identified rs199530718 as a cis-regulatory SNP associated with disease risk. Altogether, this study generated a genome-wide bank of angiogenic cis-regulatory elements and illustrated its utility in identifying novel cis-regulatory SNPs for TAPVC, expanding new horizons of angiogenesis as well as vascular abnormality genetics.


Assuntos
Polimorfismo de Nucleotídeo Único , Humanos , Sequências Reguladoras de Ácido Nucleico , Fator A de Crescimento do Endotélio Vascular/genética , Estudo de Associação Genômica Ampla , Neovascularização Patológica/genética
8.
Front Endocrinol (Lausanne) ; 15: 1368494, 2024.
Artigo em Inglês | MEDLINE | ID: mdl-38745948

RESUMO

Decidualisation, the process whereby endometrial stromal cells undergo morphological and functional transformation in preparation for trophoblast invasion, is often disrupted in women with polycystic ovary syndrome (PCOS) resulting in complications with pregnancy and/or infertility. The transcription factor Wilms tumour suppressor 1 (WT1) is a key regulator of the decidualization process, which is reduced in patients with PCOS, a complex condition characterized by increased expression of androgen receptor in endometrial cells and high presence of circulating androgens. Using genome-wide chromatin immunoprecipitation approaches on primary human endometrial stromal cells, we identify key genes regulated by WT1 during decidualization, including homeobox transcription factors which are important for regulating cell differentiation. Furthermore, we found that AR in PCOS patients binds to the same DNA regions as WT1 in samples from healthy endometrium, suggesting dysregulation of genes important to decidualisation pathways in PCOS endometrium due to competitive binding between WT1 and AR. Integrating RNA-seq and H3K4me3 and H3K27ac ChIP-seq metadata with our WT1/AR data, we identified a number of key genes involved in immune response and angiogenesis pathways that are dysregulated in PCOS patients. This is likely due to epigenetic alterations at distal enhancer regions allowing AR to recruit cofactors such as MAGEA11, and demonstrates the consequences of AR disruption of WT1 in PCOS endometrium.


Assuntos
Endométrio , Síndrome do Ovário Policístico , Receptores Androgênicos , Proteínas WT1 , Humanos , Feminino , Síndrome do Ovário Policístico/metabolismo , Síndrome do Ovário Policístico/genética , Síndrome do Ovário Policístico/patologia , Endométrio/metabolismo , Endométrio/patologia , Proteínas WT1/metabolismo , Proteínas WT1/genética , Receptores Androgênicos/metabolismo , Receptores Androgênicos/genética , Células Estromais/metabolismo , Células Estromais/patologia , Adulto , Sequências Reguladoras de Ácido Nucleico
9.
Nat Commun ; 15(1): 2821, 2024 Apr 01.
Artigo em Inglês | MEDLINE | ID: mdl-38561401

RESUMO

Activation of the p53 tumor suppressor triggers a transcriptional program to control cellular response to stress. However, the molecular mechanisms by which p53 controls gene transcription are not completely understood. Here, we uncover the critical role of spatio-temporal genome architecture in this process. We demonstrate that p53 drives direct and indirect changes in genome compartments, topologically associating domains, and DNA loops prior to one hour of its activation, which escort the p53 transcriptional program. Focusing on p53-bound enhancers, we report 340 genes directly regulated by p53 over a median distance of 116 kb, with 74% of these genes not previously identified. Finally, we showcase that p53 controls transcription of distal genes through newly formed and pre-existing enhancer-promoter loops in a cohesin dependent manner. Collectively, our findings demonstrate a previously unappreciated architectural role of p53 as regulator at distinct topological layers and provide a reliable set of new p53 direct target genes that may help designs of cancer therapies.


Assuntos
Coesinas , Proteína Supressora de Tumor p53 , Proteína Supressora de Tumor p53/genética , Proteína Supressora de Tumor p53/metabolismo , Sequências Reguladoras de Ácido Nucleico , DNA , Cromatina/genética
10.
Cell Genom ; 4(4): 100540, 2024 Apr 10.
Artigo em Inglês | MEDLINE | ID: mdl-38604125

RESUMO

Mechanisms underlying phenotypic divergence across species remain unresolved. In this issue of Cell Genomics, Hansen, Fong, et al.1 systematically dissect human and rhesus macaque gene expression divergence by screening tens of thousands of orthologous elements for enhancer activity in lymphoblastoid cell lines, revealing a much greater role for trans divergence at levels equal to those of cis effects, counter to the prevailing consensus in the field.


Assuntos
Evolução Molecular , Regulação da Expressão Gênica , Animais , Humanos , Macaca mulatta/genética , Sequências Reguladoras de Ácido Nucleico , Genômica
11.
Cell Genom ; 4(4): 100536, 2024 Apr 10.
Artigo em Inglês | MEDLINE | ID: mdl-38604126

RESUMO

Gene regulatory divergence between species can result from cis-acting local changes to regulatory element DNA sequences or global trans-acting changes to the regulatory environment. Understanding how these mechanisms drive regulatory evolution has been limited by challenges in identifying trans-acting changes. We present a comprehensive approach to directly identify cis- and trans-divergent regulatory elements between human and rhesus macaque lymphoblastoid cells using assay for transposase-accessible chromatin coupled to self-transcribing active regulatory region (ATAC-STARR) sequencing. In addition to thousands of cis changes, we discover an unexpected number (∼10,000) of trans changes and show that cis and trans elements exhibit distinct patterns of sequence divergence and function. We further identify differentially expressed transcription factors that underlie ∼37% of trans differences and trace how cis changes can produce cascades of trans changes. Overall, we find that most divergent elements (67%) experienced changes in both cis and trans, revealing a substantial role for trans divergence-alone and together with cis changes-in regulatory differences between species.


Assuntos
Regulação da Expressão Gênica , Sequências Reguladoras de Ácido Nucleico , Animais , Humanos , Macaca mulatta/genética , Sequências Reguladoras de Ácido Nucleico/genética , Regulação da Expressão Gênica/genética , Fatores de Transcrição/genética , Cromatina/genética
12.
Sci Rep ; 14(1): 8642, 2024 04 15.
Artigo em Inglês | MEDLINE | ID: mdl-38622172

RESUMO

Cation exchanger (CAX) genes play an important role in plant growth/development and response to biotic and abiotic stresses. Here, we tried to obtain important information on the functionalities and phenotypic effects of CAX gene family by systematic analyses of their expression patterns, genetic diversity (gene CDS haplotypes, structural variations, gene presence/absence variations) in 3010 rice genomes and nine parents of 496 Huanghuazhan introgression lines, the frequency shifts of the predominant gcHaps at these loci to artificial selection during modern breeding, and their association with tolerances to several abiotic stresses. Significant amounts of variation also exist in the cis-regulatory elements (CREs) of the OsCAX gene promoters in 50 high-quality rice genomes. The functional differentiation of OsCAX gene family were reflected primarily by their tissue and development specific expression patterns and in varied responses to different treatments, by unique sets of CREs in their promoters and their associations with specific agronomic traits/abiotic stress tolerances. Our results indicated that OsCAX1a and OsCAX2 as general signal transporters were in many processes of rice growth/development and responses to diverse environments, but they might be of less value in rice improvement. OsCAX1b, OsCAX1c, OsCAX3 and OsCAX4 was expected to be of potential value in rice improvement because of their associations with specific traits, responsiveness to specific abiotic stresses or phytohormones, and relatively high gcHap and CRE diversity. Our strategy was demonstrated to be highly efficient to obtain important genetic information on genes/alleles of specific gene family and can be used to systematically characterize the other rice gene families.


Assuntos
Oryza , Melhoramento Vegetal , Sequências Reguladoras de Ácido Nucleico , Estresse Fisiológico/genética , Cátions/metabolismo , Variação Genética
13.
Sci Rep ; 14(1): 8743, 2024 04 16.
Artigo em Inglês | MEDLINE | ID: mdl-38627506

RESUMO

The IVa subfamily of glycine-rich proteins (GRPs) comprises a group of glycine-rich RNA binding proteins referred to as GR-RBPa here. Previous studies have demonstrated functions of GR-RBPa proteins in regulating stress response in plants. However, the mechanisms responsible for the differential regulatory functions of GR-RBPa proteins in different plant species have not been fully elucidated. In this study, we identified and comprehensively studied a total of 34 GR-RBPa proteins from five plant species. Our analysis revealed that GR-RBPa proteins were further classified into two branches, with proteins in branch I being relatively more conserved than those in branch II. When subjected to identical stresses, these genes exhibited intensive and differential expression regulation in different plant species, corresponding to the enrichment of cis-acting regulatory elements involving in environmental and internal signaling in these genes. Unexpectedly, all GR-RBPa genes in branch I underwent intensive alternative splicing (AS) regulation, while almost all genes in branch II were only constitutively spliced, despite having more introns. This study highlights the complex and divergent regulations of a group of conserved RNA binding proteins in different plants when exposed to identical stress conditions. These species-specific regulations may have implications for stress responses and adaptations in different plant species.


Assuntos
Plantas , Sequências Reguladoras de Ácido Nucleico , Plantas/genética , Plantas/metabolismo , Estresse Fisiológico/genética , Proteínas de Ligação a RNA/genética , Proteínas de Ligação a RNA/metabolismo , Glicina/metabolismo , Regulação da Expressão Gênica de Plantas , Proteínas de Plantas/genética , Proteínas de Plantas/metabolismo , Filogenia
14.
Nat Genet ; 56(4): 615-626, 2024 Apr.
Artigo em Inglês | MEDLINE | ID: mdl-38594305

RESUMO

Translating genome-wide association study (GWAS) loci into causal variants and genes requires accurate cell-type-specific enhancer-gene maps from disease-relevant tissues. Building enhancer-gene maps is essential but challenging with current experimental methods in primary human tissues. Here we developed a nonparametric statistical method, SCENT (single-cell enhancer target gene mapping), that models association between enhancer chromatin accessibility and gene expression in single-cell or nucleus multimodal RNA sequencing and ATAC sequencing data. We applied SCENT to 9 multimodal datasets including >120,000 single cells or nuclei and created 23 cell-type-specific enhancer-gene maps. These maps were highly enriched for causal variants in expression quantitative loci and GWAS for 1,143 diseases and traits. We identified likely causal genes for both common and rare diseases and linked somatic mutation hotspots to target genes. We demonstrate that application of SCENT to multimodal data from disease-relevant human tissue enables the scalable construction of accurate cell-type-specific enhancer-gene maps, essential for defining noncoding variant function.


Assuntos
Estudo de Associação Genômica Ampla , Sequências Reguladoras de Ácido Nucleico , Humanos , Alelos , Estudo de Associação Genômica Ampla/métodos , Mapeamento Cromossômico , Fenótipo , Cromatina/genética , Polimorfismo de Nucleotídeo Único , Predisposição Genética para Doença/genética
15.
Sci Adv ; 10(15): eadk2082, 2024 Apr 12.
Artigo em Inglês | MEDLINE | ID: mdl-38598634

RESUMO

We report an approach for cancer phenotyping based on targeted sequencing of cell-free DNA (cfDNA) for small cell lung cancer (SCLC). In SCLC, differential activation of transcription factors (TFs), such as ASCL1, NEUROD1, POU2F3, and REST defines molecular subtypes. We designed a targeted capture panel that identifies chromatin organization signatures at 1535 TF binding sites and 13,240 gene transcription start sites and detects exonic mutations in 842 genes. Sequencing of cfDNA from SCLC patient-derived xenograft models captured TF activity and gene expression and revealed individual highly informative loci. Prediction models of ASCL1 and NEUROD1 activity using informative loci achieved areas under the receiver operating characteristic curve (AUCs) from 0.84 to 0.88 in patients with SCLC. As non-SCLC (NSCLC) often transforms to SCLC following targeted therapy, we applied our framework to distinguish NSCLC from SCLC and achieved an AUC of 0.99. Our approach shows promising utility for SCLC subtyping and transformation monitoring, with potential applicability to diverse tumor types.


Assuntos
Carcinoma Pulmonar de Células não Pequenas , Ácidos Nucleicos Livres , Neoplasias Pulmonares , Carcinoma de Pequenas Células do Pulmão , Humanos , Carcinoma de Pequenas Células do Pulmão/metabolismo , Neoplasias Pulmonares/metabolismo , Carcinoma Pulmonar de Células não Pequenas/patologia , Sequências Reguladoras de Ácido Nucleico , Regulação Neoplásica da Expressão Gênica
16.
Nature ; 629(8010): 127-135, 2024 May.
Artigo em Inglês | MEDLINE | ID: mdl-38658750

RESUMO

Phenotypic variation among species is a product of evolutionary changes to developmental programs1,2. However, how these changes generate novel morphological traits remains largely unclear. Here we studied the genomic and developmental basis of the mammalian gliding membrane, or patagium-an adaptative trait that has repeatedly evolved in different lineages, including in closely related marsupial species. Through comparative genomic analysis of 15 marsupial genomes, both from gliding and non-gliding species, we find that the Emx2 locus experienced lineage-specific patterns of accelerated cis-regulatory evolution in gliding species. By combining epigenomics, transcriptomics and in-pouch marsupial transgenics, we show that Emx2 is a critical upstream regulator of patagium development. Moreover, we identify different cis-regulatory elements that may be responsible for driving increased Emx2 expression levels in gliding species. Lastly, using mouse functional experiments, we find evidence that Emx2 expression patterns in gliders may have been modified from a pre-existing program found in all mammals. Together, our results suggest that patagia repeatedly originated through a process of convergent genomic evolution, whereby regulation of Emx2 was altered by distinct cis-regulatory elements in independently evolved species. Thus, different regulatory elements targeting the same key developmental gene may constitute an effective strategy by which natural selection has harnessed regulatory evolution in marsupial genomes to generate phenotypic novelty.


Assuntos
Evolução Molecular , Proteínas de Homeodomínio , Locomoção , Marsupiais , Fatores de Transcrição , Animais , Feminino , Masculino , Camundongos , Epigenômica , Perfilação da Expressão Gênica , Regulação da Expressão Gênica no Desenvolvimento , Genoma/genética , Genômica , Proteínas de Homeodomínio/genética , Proteínas de Homeodomínio/metabolismo , Locomoção/genética , Marsupiais/anatomia & histologia , Marsupiais/classificação , Marsupiais/genética , Marsupiais/crescimento & desenvolvimento , Filogenia , Sequências Reguladoras de Ácido Nucleico/genética , Fatores de Transcrição/metabolismo , Fatores de Transcrição/genética , Fenótipo , Humanos
17.
Nat Commun ; 15(1): 3488, 2024 Apr 25.
Artigo em Inglês | MEDLINE | ID: mdl-38664394

RESUMO

Elucidating the relationship between non-coding regulatory element sequences and gene expression is crucial for understanding gene regulation and genetic variation. We explored this link with the training of interpretable deep learning models predicting gene expression profiles from gene flanking regions of the plant species Arabidopsis thaliana, Solanum lycopersicum, Sorghum bicolor, and Zea mays. With over 80% accuracy, our models enabled predictive feature selection, highlighting e.g. the significant role of UTR regions in determining gene expression levels. The models demonstrated remarkable cross-species performance, effectively identifying both conserved and species-specific regulatory sequence features and their predictive power for gene expression. We illustrated the application of our approach by revealing causal links between genetic variation and gene expression changes across fourteen tomato genomes. Lastly, our models efficiently predicted genotype-specific expression of key functional gene groups, exemplified by underscoring known phenotypic and metabolic differences between Solanum lycopersicum and its wild, drought-resistant relative, Solanum pennellii.


Assuntos
Arabidopsis , Aprendizado Profundo , Regulação da Expressão Gênica de Plantas , Solanum lycopersicum , Sorghum , Zea mays , Solanum lycopersicum/genética , Solanum lycopersicum/metabolismo , Sorghum/genética , Sorghum/metabolismo , Arabidopsis/genética , Arabidopsis/metabolismo , Zea mays/genética , Sequências Reguladoras de Ácido Nucleico/genética , Genoma de Planta , Variação Genética , Especificidade da Espécie
18.
Cell Genom ; 4(4): 100537, 2024 Apr 10.
Artigo em Inglês | MEDLINE | ID: mdl-38604128

RESUMO

Transcriptional dysregulation is a hallmark of diffuse large B cell lymphoma (DLBCL), as transcriptional regulators are frequently mutated. However, our mechanistic understanding of how normal transcriptional programs are co-opted in DLBCL has been hindered by a lack of methodologies that provide the temporal resolution required to separate direct and indirect effects on transcriptional control. We applied a chemical-genetic approach to engineer the inducible degradation of the transcription factor FOXO1, which is recurrently mutated (mFOXO1) in DLBCL. The combination of rapid degradation of mFOXO1, nascent transcript detection, and assessment of chromatin accessibility allowed us to identify the direct targets of mFOXO1. mFOXO1 was required to maintain accessibility at specific enhancers associated with multiple oncogenes, and mFOXO1 degradation impaired RNA polymerase pause-release at some targets. Wild-type FOXO1 appeared to weakly regulate many of the same targets as mFOXO1 and was able to complement the degradation of mFOXO1 in the context of AKT inhibition.


Assuntos
Proteína Forkhead Box O1 , Sequências Reguladoras de Ácido Nucleico , Humanos , Proteína Forkhead Box O1/genética , Linfoma Difuso de Grandes Células B/genética , Fatores de Transcrição/genética
19.
Genome Res ; 34(4): 620-632, 2024 May 15.
Artigo em Inglês | MEDLINE | ID: mdl-38631728

RESUMO

Differential gene expression in response to perturbations is mediated at least in part by changes in binding of transcription factors (TFs) and other proteins at specific genomic regions. Association of these cis-regulatory elements (CREs) with their target genes is a challenging task that is essential to address many biological and mechanistic questions. Many current approaches rely on chromatin conformation capture techniques or single-cell correlational methods to establish CRE-to-gene associations. These methods can be effective but have limitations, including resolution, gaps in detectable association distances, and cost. As an alternative, we have developed DegCre, a nonparametric method that evaluates correlations between measurements of perturbation-induced differential gene expression and differential regulatory signal at CREs to score possible CRE-to-gene associations. It has several unique features, including the ability to use any type of CRE activity measurement, yield probabilistic scores for CRE-to-gene pairs, and assess CRE-to-gene pairings across a wide range of sequence distances. We apply DegCre to six data sets, each using different perturbations and containing a variety of regulatory signal measurements, including chromatin openness, histone modifications, and TF occupancy. To test their efficacy, we compare DegCre associations to Hi-C loop calls and CRISPR-validated CRE-to-gene associations, establishing good performance by DegCre that is comparable or superior to competing methods. DegCre is a novel approach to the association of CREs to genes from a perturbation-differential perspective, with strengths that are complementary to existing approaches and allow for new insights into gene regulation.


Assuntos
Cromatina , Fatores de Transcrição , Humanos , Fatores de Transcrição/metabolismo , Fatores de Transcrição/genética , Cromatina/metabolismo , Cromatina/genética , Regulação da Expressão Gênica , Sequências Reguladoras de Ácido Nucleico , Elementos Reguladores de Transcrição
20.
Genome Biol ; 25(1): 83, 2024 Apr 02.
Artigo em Inglês | MEDLINE | ID: mdl-38566111

RESUMO

BACKGROUND: The rise of large-scale multi-species genome sequencing projects promises to shed new light on how genomes encode gene regulatory instructions. To this end, new algorithms are needed that can leverage conservation to capture regulatory elements while accounting for their evolution. RESULTS: Here, we introduce species-aware DNA language models, which we trained on more than 800 species spanning over 500 million years of evolution. Investigating their ability to predict masked nucleotides from context, we show that DNA language models distinguish transcription factor and RNA-binding protein motifs from background non-coding sequence. Owing to their flexibility, DNA language models capture conserved regulatory elements over much further evolutionary distances than sequence alignment would allow. Remarkably, DNA language models reconstruct motif instances bound in vivo better than unbound ones and account for the evolution of motif sequences and their positional constraints, showing that these models capture functional high-order sequence and evolutionary context. We further show that species-aware training yields improved sequence representations for endogenous and MPRA-based gene expression prediction, as well as motif discovery. CONCLUSIONS: Collectively, these results demonstrate that species-aware DNA language models are a powerful, flexible, and scalable tool to integrate information from large compendia of highly diverged genomes.


Assuntos
DNA , Sequências Reguladoras de Ácido Nucleico , Sítios de Ligação , Alinhamento de Sequência , Algoritmos , Sequência Conservada/genética , Evolução Molecular
SELEÇÃO DE REFERÊNCIAS
DETALHE DA PESQUISA