RESUMO
RNA-seq is an effective method for studying the transcriptome, but it can be difficult to apply to scarce or degraded RNA from fixed clinical samples, rare cell populations or cadavers. Recent studies have proposed several methods for RNA-seq of low-quality and/or low-quantity samples, but the relative merits of these methods have not been systematically analyzed. Here we compare five such methods using metrics relevant to transcriptome annotation, transcript discovery and gene expression. Using a single human RNA sample, we constructed and sequenced ten libraries with these methods and compared them against two control libraries. We found that the RNase H method performed best for chemically fragmented, low-quality RNA, and we confirmed this through analysis of actual degraded samples. RNase H can even effectively replace oligo(dT)-based methods for standard RNA-seq. SMART and NuGEN had distinct strengths for measuring low-quantity RNA. Our analysis allows biologists to select the most suitable methods and provides a benchmark for future method development.
Assuntos
Algoritmos , Artefatos , Perfilação da Expressão Gênica/métodos , RNA/genética , Tamanho da Amostra , Análise de Sequência de RNA/métodos , Software , Transcriptoma/genéticaRESUMO
Strand-specific, massively parallel cDNA sequencing (RNA-seq) is a powerful tool for transcript discovery, genome annotation and expression profiling. There are multiple published methods for strand-specific RNA-seq, but no consensus exists as to how to choose between them. Here we developed a comprehensive computational pipeline to compare library quality metrics from any RNA-seq method. Using the well-annotated Saccharomyces cerevisiae transcriptome as a benchmark, we compared seven library-construction protocols, including both published and our own methods. We found marked differences in strand specificity, library complexity, evenness and continuity of coverage, agreement with known annotations and accuracy for expression profiling. Weighing each method's performance and ease, we identified the dUTP second-strand marking and the Illumina RNA ligation methods as the leading protocols, with the former benefitting from the current availability of paired-end sequencing. Our analysis provides a comprehensive benchmark, and our computational pipeline is applicable for assessment of future protocols in other organisms.
Assuntos
DNA Complementar/genética , Análise de Sequência de RNA/métodos , Biologia Computacional , Perfilação da Expressão Gênica/métodos , Biblioteca Gênica , Especificidade por SubstratoRESUMO
Chromatin organization plays a major role in gene regulation and can affect the function and evolution of new transcriptional programs. However, it can be difficult to decipher the basis of changes in chromatin organization and their functional effect on gene expression. Here, we present a large-scale comparative genomic analysis of the relationship between chromatin organization and gene expression, by measuring mRNA abundance and nucleosome positions genome-wide in 12 Hemiascomycota yeast species. We found substantial conservation of global and functional chromatin organization in all species, including prominent nucleosome-free regions (NFRs) at gene promoters, and distinct chromatin architecture in growth and stress genes. Chromatin organization has also substantially diverged in both global quantitative features, such as spacing between adjacent nucleosomes, and in functional groups of genes. Expression levels, intrinsic anti-nucleosomal sequences, and trans-acting chromatin modifiers all play important, complementary, and evolvable roles in determining NFRs. We identify five mechanisms that couple chromatin organization to evolution of gene regulation and have contributed to the evolution of respiro-fermentation and other key systems, including (1) compensatory evolution of alternative modifiers associated with conserved chromatin organization, (2) a gradual transition from constitutive to trans-regulated NFRs, (3) a loss of intrinsic anti-nucleosomal sequences accompanying changes in chromatin organization and gene expression, (4) re-positioning of motifs from NFRs to nucleosome-occluded regions, and (5) the expanded use of NFRs by paralogous activator-repressor pairs. Our study sheds light on the molecular basis of chromatin organization, and on the role of chromatin organization in the evolution of gene regulation.
Assuntos
Ascomicetos/genética , Posicionamento Cromossômico/genética , Evolução Molecular , Regulação Fúngica da Expressão Gênica , Nucleossomos/genética , Processamento Alternativo/genética , Ascomicetos/enzimologia , Sequência Conservada , Citoesqueleto/genética , Empacotamento do DNA/genética , RNA Polimerases Dirigidas por DNA/metabolismo , Proteínas Fúngicas/genética , Proteínas Fúngicas/metabolismo , Duplicação Gênica , Genes Fúngicos/genética , Meiose/genética , Mitocôndrias/genética , Poro Nuclear/genética , Fases de Leitura Aberta/genética , Peroxissomos/genética , Filogenia , Complexo de Endopeptidases do Proteassoma/genética , Especificidade da Espécie , Fatores de Transcrição/genética , Fatores de Transcrição/metabolismoRESUMO
Coexpression of genes within a functional module can be conserved at great evolutionary distances, whereas the associated regulatory mechanisms can substantially diverge. For example, ribosomal protein (RP) genes are tightly coexpressed in Saccharomyces cerevisiae, but the cis and trans factors associated with them are surprisingly diverged across Ascomycota fungi. Little is known, however, about the functional impact of such changes on actual expression levels or about the selective pressures that affect them. Here, we address this question in the context of the evolution of the regulation of RP gene expression by using a comparative genomics approach together with cross-species functional assays. We show that an activator (Ifh1) and a repressor (Crf1) that control RP gene regulation in normal and stress conditions in S. cerevisiae are derived from the duplication and subsequent specialization of a single ancestral protein. We provide evidence that this regulatory innovation coincides with the duplication of RP genes in a whole-genome duplication (WGD) event and may have been important for tighter control of higher levels of RP transcripts. We find that subsequent loss of the derived repressor led to the loss of a stress-dependent repression of RPs in the fungal pathogen Candida glabrata. Our comparative computational and experimental approach shows how gene duplication can constrain and drive regulatory evolution and provides a general strategy for reconstructing the evolutionary trajectory of gene regulation across species.
Assuntos
Evolução Molecular , Duplicação Gênica , Proteínas Ribossômicas/genética , Proteínas de Saccharomyces cerevisiae/genética , Saccharomyces cerevisiae/genética , Candida glabrata/genética , Perfilação da Expressão Gênica , Regulação Fúngica da Expressão Gênica , Genes Fúngicos , Genoma Fúngico , Modelos Genéticos , Proteínas Repressoras/genética , Saccharomyces/genética , Especificidade da Espécie , Transativadores/genéticaRESUMO
Defining the transcriptome, the repertoire of transcribed regions encoded in the genome, is a challenging experimental task. Current approaches, relying on sequencing of ESTs or cDNA libraries, are expensive and labor-intensive. Here, we present a general approach for ab initio discovery of the complete transcriptome of the budding yeast, based only on the unannotated genome sequence and millions of short reads from a single massively parallel sequencing run. Using novel algorithms, we automatically construct a highly accurate transcript catalog. Our approach automatically and fully defines 86% of the genes expressed under the given conditions, and discovers 160 previously undescribed transcription units of 250 bp or longer. It correctly demarcates the 5' and 3' UTR boundaries of 86 and 77% of expressed genes, respectively. The method further identifies 83% of known splice junctions in expressed genes, and discovers 25 previously uncharacterized introns, including 2 cases of condition-dependent intron retention. Our framework is applicable to poorly understood organisms, and can lead to greater understanding of the transcribed elements in an explored genome.
Assuntos
Saccharomyces cerevisiae/genética , Sequência de Bases , Simulação por Computador , Perfilação da Expressão Gênica , Splicing de RNA/genética , RNA Mensageiro/genéticaRESUMO
BACKGROUND: Recent studies in budding yeast have shown that antisense transcription occurs at many loci. However, the functional role of antisense transcripts has been demonstrated only in a few cases and it has been suggested that most antisense transcripts may result from promiscuous bi-directional transcription in a dense genome. RESULTS: Here, we use strand-specific RNA sequencing to study anti-sense transcription in Saccharomyces cerevisiae. We detect 1,103 putative antisense transcripts expressed in mid-log phase growth, ranging from 39 short transcripts covering only the 3' UTR of sense genes to 145 long transcripts covering the entire sense open reading frame. Many of these antisense transcripts overlap sense genes that are repressed in mid-log phase and are important in stationary phase, stress response, or meiosis. We validate the differential regulation of 67 antisense transcripts and their sense targets in relevant conditions, including nutrient limitation and environmental stresses. Moreover, we show that several antisense transcripts and, in some cases, their differential expression have been conserved across five species of yeast spanning 150 million years of evolution. Divergence in the regulation of antisense transcripts to two respiratory genes coincides with the evolution of respiro-fermentation. CONCLUSIONS: Our work provides support for a global and conserved role for antisense transcription in yeast gene regulation.
Assuntos
Sequência Conservada , RNA Antissenso/análise , Análise de Sequência de RNA/métodos , Leveduras/genética , Evolução Biológica , Fermentação/genética , Regulação Fúngica da Expressão Gênica , Saccharomyces cerevisiae/genéticaRESUMO
Divergent adaptation can be associated with reproductive isolation in speciation [1]. We recently demonstrated the link between divergent adaptation and the onset of reproductive isolation in experimental populations of the yeast Saccharomyces cerevisiae evolved from a single progenitor in either a high-salt or a low-glucose environment [2]. Here, whole-genome resequencing and comparative genome hybridization of representatives of three populations revealed 17 mutations, six of which explained the adaptive increases in mitotic fitness. In two populations evolved in high salt, two different mutations occurred in the proton efflux pump gene PMA1 and the global transcriptional repressor gene CYC8; the ENA genes encoding sodium efflux pumps were overexpressed once through expansion of this gene cluster and once because of mutation in the regulator CYC8. In the population from low glucose, one mutation occurred in MDS3, which modulates growth at high pH, and one in MKT1, a global regulator of mRNAs encoding mitochondrial proteins, the latter recapitulating a naturally occurring variant. A Dobzhansky-Muller (DM) incompatibility between the evolved alleles of PMA1 and MKT1 strongly depressed fitness in the low-glucose environment. This DM interaction is the first reported between experimentally evolved alleles of known genes and shows how reproductive isolation can arise rapidly when divergent selection is strong.
Assuntos
Adaptação Biológica , Especiação Genética , Saccharomyces cerevisiae/genética , Alelos , Meio Ambiente , Glucose , Mutação , ATPases Translocadoras de Prótons/genética , Proteínas Repressoras/genética , Proteínas de Saccharomyces cerevisiae/genética , Cloreto de SódioRESUMO
Regulatory divergence is likely a major driving force in evolution. Comparative genomics is being increasingly used to infer the evolution of gene regulation. Ascomycota fungi are uniquely suited among eukaryotes for regulatory evolution studies, due to broad phylogenetic scope, many sequenced genomes, and tractability of genomic analysis. Here we review recent advances in the identification of the contribution of cis- and trans-factors to expression divergence. Whereas current strategies have led to the discovery of surprising signatures and mechanisms, we still understand very little about the adaptive role of regulatory evolution. Empirical studies including experimental evolution, comparative functional genomics and hybrid and engineered strains are showing early promise toward deciphering the contribution of regulatory divergence to adaptation.
Assuntos
Ascomicetos/genética , Evolução Molecular , Regulação Fúngica da Expressão Gênica , Redes Reguladoras de Genes , Genoma Fúngico , Ascomicetos/classificação , Variação Genética , Genômica , FilogeniaRESUMO
Regulatory divergence is likely a major driving force in evolution. Comparative transcriptomics provides a new glimpse into the evolution of gene regulation. Ascomycota fungi are uniquely suited among eukaryotes for studies of regulatory evolution, because of broad phylogenetic scope, many sequenced genomes, and facility of genomic analysis. Here we review the substantial divergence in gene expression in Ascomycota and how this is reconciled with the modular organization of transcriptional networks. We show that flexibility and redundancy in both cis-regulation and trans-regulation can lead to changes from altered expression of single genes to wholesale rewiring of regulatory modules. Redundancy thus emerges as a major driving force facilitating expression divergence while preserving the coherent functional organization of a transcriptional response.