Your browser doesn't support javascript.
loading
Mostrar: 20 | 50 | 100
Resultados 1 - 10 de 10
Filtrar
1.
Cell ; 184(7): 1724-1739.e16, 2021 04 01.
Artículo en Inglés | MEDLINE | ID: mdl-33667348

RESUMEN

Divergence of gene function is a hallmark of evolution, but assessing functional divergence over deep time is not trivial. The few alleles available for cross-species studies often fail to expose the entire functional spectrum of genes, potentially obscuring deeply conserved pleiotropic roles. Here, we explore the functional divergence of WUSCHEL HOMEOBOX9 (WOX9), suggested to have species-specific roles in embryo and inflorescence development. Using a cis-regulatory editing drive system, we generate a comprehensive allelic series in tomato, which revealed hidden pleiotropic roles for WOX9. Analysis of accessible chromatin and conserved cis-regulatory sequences identifies the regions responsible for this pleiotropic activity, the functions of which are conserved in groundcherry, a tomato relative. Mimicking these alleles in Arabidopsis, distantly related to tomato and groundcherry, reveals new inflorescence phenotypes, exposing a deeply conserved pleiotropy. We suggest that targeted cis-regulatory mutations can uncover conserved gene functions and reduce undesirable effects in crop improvement.


Asunto(s)
Genes de Plantas , Pleiotropía Genética/genética , Proteínas de Homeodominio/genética , Proteínas de Plantas/genética , Secuencias Reguladoras de Ácidos Nucleicos/genética , Alelos , Arabidopsis/genética , Sistemas CRISPR-Cas/genética , Cromatina/metabolismo , Regulación de la Expresión Génica de las Plantas , Inflorescencia/genética , Solanum lycopersicum/genética , Mutagénesis , Desarrollo de la Planta/genética , Plantas Modificadas Genéticamente/genética , Plantas Modificadas Genéticamente/crecimiento & desarrollo , Plantas Modificadas Genéticamente/metabolismo , Regiones Promotoras Genéticas , Solanaceae/genética , Solanaceae/crecimiento & desarrollo
2.
Trends Genet ; 36(4): 243-258, 2020 04.
Artículo en Inglés | MEDLINE | ID: mdl-31954511

RESUMEN

Methods to detect signals of natural selection from genomic data have traditionally emphasized the use of simple summary statistics. Here, we review a new generation of methods that consider combinations of conventional summary statistics and/or richer features derived from inferred gene trees and ancestral recombination graphs (ARGs). We also review recent advances in methods for population genetic simulation and ARG reconstruction. Finally, we describe opportunities for future work on a variety of related topics, including the genetics of speciation, estimation of selection coefficients, and inference of selection on polygenic traits. Together, these emerging methods offer promising new directions in the study of natural selection.


Asunto(s)
Evolución Molecular , Genética de Población/estadística & datos numéricos , Recombinación Genética/genética , Selección Genética/genética , Algoritmos , Simulación por Computador , Modelos Genéticos , Herencia Multifactorial/genética , Filogenia
3.
Bioinformatics ; 37(24): 4727-4736, 2021 12 11.
Artículo en Inglés | MEDLINE | ID: mdl-34382072

RESUMEN

MOTIVATION: Quantification of isoform abundance has been extensively studied at the mature RNA level using RNA-seq but not at the level of precursor RNAs using nascent RNA sequencing. RESULTS: We address this problem with a new computational method called Deconvolution of Expression for Nascent RNA-sequencing data (DENR), which models nascent RNA-sequencing read-counts as a mixture of user-provided isoforms. The baseline algorithm is enhanced by machine-learning predictions of active transcription start sites and an adjustment for the typical 'shape profile' of read-counts along a transcription unit. We show that DENR outperforms simple read-count-based methods for estimating gene and isoform abundances, and that transcription of multiple pre-RNA isoforms per gene is widespread, with frequent differences between cell types. In addition, we provide evidence that a majority of human isoform diversity derives from primary transcription rather than from post-transcriptional processes. AVAILABILITY AND IMPLEMENTATION: DENR and nascentRNASim are freely available at https://github.com/CshlSiepelLab/DENR (version v1.0.0) and https://github.com/CshlSiepelLab/nascentRNASim (version v0.3.0). SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online.


Asunto(s)
Isoformas de ARN , ARN , Humanos , Isoformas de ARN/genética , Programas Informáticos , Isoformas de Proteínas/genética , Análisis de Secuencia de ARN/métodos , Factores Eucarióticos de Iniciación/genética
4.
BMC Biol ; 19(1): 30, 2021 02 15.
Artículo en Inglés | MEDLINE | ID: mdl-33588838

RESUMEN

BACKGROUND: The concentrations of distinct types of RNA in cells result from a dynamic equilibrium between RNA synthesis and decay. Despite the critical importance of RNA decay rates, current approaches for measuring them are generally labor-intensive, limited in sensitivity, and/or disruptive to normal cellular processes. Here, we introduce a simple method for estimating relative RNA half-lives that is based on two standard and widely available high-throughput assays: Precision Run-On sequencing (PRO-seq) and RNA sequencing (RNA-seq). RESULTS: Our method treats PRO-seq as a measure of transcription rate and RNA-seq as a measure of RNA concentration, and estimates the rate of RNA decay required for a steady-state equilibrium. We show that this approach can be used to assay relative RNA half-lives genome-wide, with good accuracy and sensitivity for both coding and noncoding transcription units. Using a structural equation model (SEM), we test several features of transcription units, nearby DNA sequences, and nearby epigenomic marks for associations with RNA stability after controlling for their effects on transcription. We find that RNA splicing-related features are positively correlated with RNA stability, whereas features related to miRNA binding and DNA methylation are negatively correlated with RNA stability. Furthermore, we find that a measure based on U1 binding and polyadenylation sites distinguishes between unstable noncoding and stable coding transcripts but is not predictive of relative stability within the mRNA or lincRNA classes. We also identify several histone modifications that are associated with RNA stability. CONCLUSION: We introduce an approach for estimating the relative half-lives of individual RNAs. Together, our estimation method and systematic analysis shed light on the pervasive impacts of RNA stability on cellular RNA concentrations.


Asunto(s)
Inestabilidad Genómica , Secuenciación de Nucleótidos de Alto Rendimiento/métodos , Estabilidad del ARN , Secuenciación de Nucleótidos de Alto Rendimiento/instrumentación , Humanos , RNA-Seq/métodos
5.
Mol Biol Evol ; 37(7): 2137-2152, 2020 07 01.
Artículo en Inglés | MEDLINE | ID: mdl-32176292

RESUMEN

Evolutionary changes in gene expression are often driven by gains and losses of cis-regulatory elements (CREs). The dynamics of CRE evolution can be examined using multispecies epigenomic data, but so far such analyses have generally been descriptive and model-free. Here, we introduce a probabilistic modeling framework for the evolution of CREs that operates directly on raw chromatin immunoprecipitation and sequencing (ChIP-seq) data and fully considers the phylogenetic relationships among species. Our framework includes a phylogenetic hidden Markov model, called epiPhyloHMM, for identifying the locations of multiply aligned CREs, and a combined phylogenetic and generalized linear model, called phyloGLM, for accounting for the influence of a rich set of genomic features in describing their evolutionary dynamics. We apply these methods to previously published ChIP-seq data for the H3K4me3 and H3K27ac histone modifications in liver tissue from nine mammals. We find that enhancers are gained and lost during mammalian evolution at about twice the rate of promoters, and that turnover rates are negatively correlated with DNA sequence conservation, expression level, and tissue breadth, and positively correlated with distance from the transcription start site, consistent with previous findings. In addition, we find that the predicted dosage sensitivity of target genes positively correlates with DNA sequence constraint in CREs but not with turnover rates, perhaps owing to differences in the effect sizes of the relevant mutations. Altogether, our probabilistic modeling framework enables a variety of powerful new analyses.


Asunto(s)
Epigenómica/métodos , Evolución Molecular , Modelos Genéticos , Filogenia , Elementos Reguladores de la Transcripción , Animales , Secuenciación de Inmunoprecipitación de Cromatina , Código de Histonas/genética , Mamíferos/genética
6.
Genome Res ; 27(11): 1816-1829, 2017 11.
Artículo en Inglés | MEDLINE | ID: mdl-29025894

RESUMEN

Most studies of responses to transcriptional stimuli measure changes in cellular mRNA concentrations. By sequencing nascent RNA instead, it is possible to detect changes in transcription in minutes rather than hours and thereby distinguish primary from secondary responses to regulatory signals. Here, we describe the use of PRO-seq to characterize the immediate transcriptional response in human cells to celastrol, a compound derived from traditional Chinese medicine that has potent anti-inflammatory, tumor-inhibitory, and obesity-controlling effects. Celastrol is known to elicit a cellular stress response resembling the response to heat shock, but the transcriptional basis of this response remains unclear. Our analysis of PRO-seq data for K562 cells reveals dramatic transcriptional effects soon after celastrol treatment at a broad collection of both coding and noncoding transcription units. This transcriptional response occurred in two major waves, one within 10 min, and a second 40-60 min after treatment. Transcriptional activity was generally repressed by celastrol, but one distinct group of genes, enriched for roles in the heat shock response, displayed strong activation. Using a regression approach, we identified key transcription factors that appear to drive these transcriptional responses, including members of the E2F and RFX families. We also found sequence-based evidence that particular transcription factors drive the activation of enhancers. We observed increased polymerase pausing at both genes and enhancers, suggesting that pause release may be widely inhibited during the celastrol response. Our study demonstrates that a careful analysis of PRO-seq time-course data can disentangle key aspects of a complex transcriptional response, and it provides new insights into the activity of a powerful pharmacological agent.


Asunto(s)
Perfilación de la Expresión Génica/métodos , Respuesta al Choque Térmico/efectos de los fármacos , Análisis de Secuencia de ARN/métodos , Triterpenos/farmacología , Factores de Transcripción E2F/genética , Elementos de Facilitación Genéticos/efectos de los fármacos , Regulación de la Expresión Génica/efectos de los fármacos , Humanos , Células K562 , Triterpenos Pentacíclicos , Factores de Transcripción del Factor Regulador X/genética , Factores de Tiempo
7.
Nat Commun ; 13(1): 4312, 2022 07 25.
Artículo en Inglés | MEDLINE | ID: mdl-35879308

RESUMEN

Large-scale genome sequencing has enabled the measurement of strong purifying selection in protein-coding genes. Here we describe a new method, called ExtRaINSIGHT, for measuring such selection in noncoding as well as coding regions of the human genome. ExtRaINSIGHT estimates the prevalence of "ultraselection" by the fractional depletion of rare single-nucleotide variants, after controlling for variation in mutation rates. Applying ExtRaINSIGHT to 71,702 whole genome sequences from gnomAD v3, we find abundant ultraselection in evolutionarily ancient miRNAs and neuronal protein-coding genes, as well as at splice sites. By contrast, we find much less ultraselection in other noncoding RNAs and transcription factor binding sites, and only modest levels in ultraconserved elements. We estimate that ~0.4-0.7% of the human genome is ultraselected, implying ~ 0.26-0.51 strongly deleterious mutations per generation. Overall, our study sheds new light on the genome-wide distribution of fitness effects by combining deep sequencing data and classical theory from population genetics.


Asunto(s)
Genoma Humano , Mutación Puntual , Evolución Molecular , Genética de Población , Genoma Humano/genética , Humanos , Mutación , Selección Genética
8.
Elife ; 92020 06 23.
Artículo en Inglés | MEDLINE | ID: mdl-32573438

RESUMEN

The explosion in population genomic data demands ever more complex modes of analysis, and increasingly, these analyses depend on sophisticated simulations. Recent advances in population genetic simulation have made it possible to simulate large and complex models, but specifying such models for a particular simulation engine remains a difficult and error-prone task. Computational genetics researchers currently re-implement simulation models independently, leading to inconsistency and duplication of effort. This situation presents a major barrier to empirical researchers seeking to use simulations for power analyses of upcoming studies or sanity checks on existing genomic data. Population genetics, as a field, also lacks standard benchmarks by which new tools for inference might be measured. Here, we describe a new resource, stdpopsim, that attempts to rectify this situation. Stdpopsim is a community-driven open source project, which provides easy access to a growing catalog of published simulation models from a range of organisms and supports multiple simulation engine backends. This resource is available as a well-documented python library with a simple command-line interface. We share some examples demonstrating how stdpopsim can be used to systematically compare demographic inference methods, and we encourage a broader community of developers to contribute to this growing resource.


Asunto(s)
Genética de Población , Biblioteca Genómica , Modelos Genéticos , Animales , Arabidopsis/genética , Perros/genética , Drosophila melanogaster/genética , Escherichia coli/genética , Genética de Población/métodos , Genética de Población/organización & administración , Genoma/genética , Genoma Humano/genética , Humanos , Pongo abelii/genética
9.
Nat Ecol Evol ; 2(3): 537-548, 2018 03.
Artículo en Inglés | MEDLINE | ID: mdl-29379187

RESUMEN

How evolutionary changes at enhancers affect the transcription of target genes remains an important open question. Previous comparative studies of gene expression have largely measured the abundance of messenger RNA, which is affected by post-transcriptional regulatory processes, hence limiting inferences about the mechanisms underlying expression differences. Here, we directly measured nascent transcription in primate species, allowing us to separate transcription from post-transcriptional regulation. We used precision run-on and sequencing to map RNA polymerases in resting and activated CD4+ T cells in multiple human, chimpanzee and rhesus macaque individuals, with rodents as outgroups. We observed general conservation in coding and non-coding transcription, punctuated by numerous differences between species, particularly at distal enhancers and non-coding RNAs. Genes regulated by larger numbers of enhancers are more frequently transcribed at evolutionarily stable levels, despite reduced conservation at individual enhancers. Adaptive nucleotide substitutions are associated with lineage-specific transcription and at one locus, SGPP2, we predict and experimentally validate that multiple substitutions contribute to human-specific transcription. Collectively, our findings suggest a pervasive role for evolutionary compensation across ensembles of enhancers that jointly regulate target genes.


Asunto(s)
Macaca mulatta/genética , Pan troglodytes/genética , Elementos Reguladores de la Transcripción , Linfocitos T/metabolismo , Transcripción Genética , Animales , Expresión Génica , Humanos , Macaca mulatta/metabolismo , Masculino , Pan troglodytes/metabolismo
SELECCIÓN DE REFERENCIAS
DETALLE DE LA BÚSQUEDA