RESUMO
Circulating extracellular RNAs (exRNAs) have the potential to serve as biomarkers for a wide range of medical conditions. However, limitations in existing exRNA isolation methods and a lack of knowledge on parameters affecting exRNA variability in human samples may hinder their successful discovery and clinical implementation. Using combinations of denaturants, reducing agents, proteolysis, and revised organic extraction, we developed an automated, high-throughput approach for recovery of exRNAs and exDNA from the same biofluid sample. We applied this method to characterize exRNAs from 312 plasma and serum samples collected from 13 healthy volunteers at 12 time points over a 2-month period. Small RNA cDNA library sequencing identified nearly twofold increased epithelial-, muscle-, and neuroendocrine-cell-specific miRNAs in females, while fasting and hormonal cycle showed little effect. External standardization helped to detect quantitative differences in erythrocyte and platelet-specific miRNA contributions and in miRNA concentrations between biofluids. It also helped to identify a study participant with a unique exRNA phenotype featuring a miRNA signature of up to 20-fold elevated endocrine-cell-specific miRNAs and twofold elevated total miRNA concentrations stable for over 1 year. Collectively, these results demonstrate an efficient and quantitative method to discern exRNA phenotypes and suggest that plasma and serum RNA profiles are stable over months and can be routinely monitored in long-term clinical studies.
Assuntos
Ácidos Nucleicos Livres/sangue , Adulto , Biomarcadores/sangue , Ácidos Nucleicos Livres/genética , Ácidos Nucleicos Livres/isolamento & purificação , Feminino , Biblioteca Gênica , Sequenciamento de Nucleotídeos em Larga Escala , Humanos , Masculino , MicroRNAs/sangue , MicroRNAs/genéticaRESUMO
Freshwater planaria possess extreme regeneration capabilities mediated by abundant, pluripotent stem cells (neoblasts) in adult animals. Although planaria emerged as an attractive in vivo model system for stem cell biology, gene expression in neoblasts has not been profiled comprehensively and it is unknown how molecular mechanisms for pluripotency in neoblasts relate to those in mammalian embryonic stem cells (ESCs). We purified neoblasts and quantified mRNA and protein expression by sequencing and shotgun proteomics. We identified â¼4000 genes specifically expressed in neoblasts, including all â¼30 known neoblast markers. Genes important for pluripotency in ESCs, including regulators as well as targets of OCT4, were well conserved and upregulated in neoblasts. We found conserved expression of epigenetic regulators and demonstrated their requirement for planarian regeneration by knockdown experiments. Post-transcriptional regulatory genes characteristic for germ cells were also enriched in neoblasts, suggesting the existence of a common ancestral state of germ cells and ESCs. We conclude that molecular determinants of pluripotency are conserved throughout evolution and that planaria are an informative model system for human stem cell biology.
Assuntos
Diferenciação Celular , Proliferação de Células , Regulação da Expressão Gênica , Planárias/citologia , Células-Tronco Pluripotentes/fisiologia , Animais , Perfilação da Expressão Gênica , Proteoma/análiseRESUMO
Freshwater planaria are a very attractive model system for stem cell biology, tissue homeostasis, and regeneration. The genome of the planarian Schmidtea mediterranea has recently been sequenced and is estimated to contain >20,000 protein-encoding genes. However, the characterization of its transcriptome is far from complete. Furthermore, not a single proteome of the entire phylum has been assayed on a genome-wide level. We devised an efficient sequencing strategy that allowed us to de novo assemble a major fraction of the S. mediterranea transcriptome. We then used independent assays and massive shotgun proteomics to validate the authenticity of transcripts. In total, our de novo assembly yielded 18,619 candidate transcripts with a mean length of 1118 nt after filtering. A total of 17,564 candidate transcripts could be mapped to 15,284 distinct loci on the current genome reference sequence. RACE confirmed complete or almost complete 5' and 3' ends for 22/24 transcripts. The frequencies of frame shifts, fusion, and fission events in the assembled transcripts were computationally estimated to be 4.2%-13%, 0%-3.7%, and 2.6%, respectively. Our shotgun proteomics produced 16,135 distinct peptides that validated 4200 transcripts (FDR ≤1%). The catalog of transcripts assembled in this study, together with the identified peptides, dramatically expands and refines planarian gene annotation, demonstrated by validation of several previously unknown transcripts with stem cell-dependent expression patterns. In addition, our robust transcriptome characterization pipeline could be applied to other organisms without genome assembly. All of our data, including homology annotation, are freely available at SmedGD, the S. mediterranea genome database.
Assuntos
Perfilação da Expressão Gênica/métodos , Planárias/genética , Proteômica , Análise de Sequência de DNA/métodos , Animais , Mapeamento Cromossômico , Simulação por Computador , Genoma , Hibridização In Situ , Anotação de Sequência Molecular , Planárias/metabolismo , Proteoma/genética , Proteoma/metabolismo , RNA Mensageiro/genética , RNA Mensageiro/metabolismo , Software , Células-Tronco/metabolismoRESUMO
Freshwater planarian flatworms possess uncanny regenerative capacities mediated by abundant and collectively totipotent adult stem cells. Key functions of these cells during regeneration and tissue homeostasis have been shown to depend on PIWI, a molecule required for Piwi-interacting RNA (piRNA) expression in planarians. Nevertheless, the full complement of piRNAs and microRNAs (miRNAs) in this organism has yet to be defined. Here we report on the large-scale cloning and sequencing of small RNAs from the planarian Schmidtea mediterranea, yielding altogether millions of sequenced, unique small RNAs. We show that piRNAs are in part organized in genomic clusters and that they share characteristic features with mammalian and fly piRNAs. We further identify 61 novel miRNA genes and thus double the number of known planarian miRNAs. Sequencing, as well as quantitative PCR of small RNAs, uncovered 10 miRNAs enriched in planarian stem cells. These miRNAs are down-regulated in animals in which stem cells have been abrogated by irradiation, and thus constitute miRNAs likely associated with specific stem-cell functions. Altogether, we present the first comprehensive small RNA analysis in animals belonging to the third animal superphylum, the Lophotrochozoa, and single out a number of miRNAs that may function in regeneration. Several of these miRNAs are deeply conserved in animals.
Assuntos
MicroRNAs/genética , Planárias/genética , RNA Interferente Pequeno/genética , Regeneração/genética , Animais , Sequência de Bases , Clonagem Molecular , MicroRNAs/metabolismo , Dados de Sequência Molecular , Planárias/metabolismo , RNA Interferente Pequeno/metabolismo , Análise de Sequência de DNARESUMO
The capacity of highly parallel sequencing technologies to detect small RNAs at unprecedented depth suggests their value in systematically identifying microRNAs (miRNAs). However, the identification of miRNAs from the large pool of sequenced transcripts from a single deep sequencing run remains a major challenge. Here, we present an algorithm, miRDeep, which uses a probabilistic model of miRNA biogenesis to score compatibility of the position and frequency of sequenced RNA with the secondary structure of the miRNA precursor. We demonstrate its accuracy and robustness using published Caenorhabditis elegans data and data we generated by deep sequencing human and dog RNAs. miRDeep reports altogether approximately 230 previously unannotated miRNAs, of which four novel C. elegans miRNAs are validated by northern blot analysis.
Assuntos
Algoritmos , Caenorhabditis elegans/genética , Bases de Dados Genéticas , MicroRNAs/genética , Alinhamento de Sequência/métodos , Análise de Sequência de RNA/métodos , Animais , Sequência de Bases , Sistemas de Gerenciamento de Base de Dados , Cães , Humanos , Dados de Sequência Molecular , SoftwareRESUMO
Group II intron RNAs fold into catalytically active structures that catalyze their own self-splicing and subsequent transposition into DNA. Because of their remarkable enzymatic properties, it has been of interest to find new group II introns with novel properties. Here we report the cloning, sequencing, and mechanistic characterization of a new group II intron from the bacterium Azotobacter vinelandii (the AV intron). Although it bears the characteristics of the group IIB1 class, the AV intron is unusually G-C rich, and it has unusual insertion sequences and a minimal dependence on the EBS2-IBS2 tertiary interaction. The AV intron is the first bacterial intron that has been found to reside in a housekeeping gene which, in this case, encodes a heat-shock protein (hsp60). Consistent with a potential role in heat-shock regulation, kinetic analysis reveals that AV intron self-splicing is activated only at elevated temperatures. This suggests a novel pathway for the regulation of heat shock in prokaryotes and provides a first example of a thermally tolerant group II intron RNA.
Assuntos
Azotobacter vinelandii/genética , Resposta ao Choque Térmico/genética , Azotobacter vinelandii/metabolismo , Proteínas de Bactérias/genética , Sequência de Bases , Catálise , Chaperonina 60/genética , Íntrons , Modelos Moleculares , Dados de Sequência Molecular , Mutagênese , Conformação de Ácido Nucleico , Óperon , Splicing de RNA , Estabilidade de RNA , RNA Bacteriano/química , RNA Bacteriano/genética , RNA Bacteriano/metabolismo , RNA Catalítico/genética , RNA Catalítico/metabolismo , Deleção de SequênciaRESUMO
Ribonuclease P (RNase P), the ubiquitous endonuclease that catalyzes maturation of the 5'-end of tRNA in bacteria, is a ribonucleoprotein particle composed of one large RNA and one small protein. Two major structural types of bacterial RNase P RNA have been identified by phylogenetic comparative analysis: the A (ancestral) and B (Bacillus) types. The RNase P protein from Thermotoga maritima, a hyperthermophilic bacterium with an A-type RNase P RNA, has been expressed in Escherichia coli. A purification strategy was developed to obtain a protein preparation suitable for crystallization. Protein crystals suitable for diffraction studies were obtained and characterized.