Your browser doesn't support javascript.
loading
Mostrar: 20 | 50 | 100
Resultados 1 - 20 de 1.076
Filtrar
1.
Nat Protoc ; 16(9): 4355-4381, 2021 09.
Artículo en Inglés | MEDLINE | ID: mdl-34331052

RESUMEN

Elongation, splicing and polyadenylation are fundamental steps of transcription, and studying their coordination requires simultaneous monitoring of these dynamic processes on one transcript. We recently developed a full-length nascent RNA sequencing method in the model plant Arabidopsis that simultaneously detects RNA polymerase II position, splicing status, polyadenylation site and poly(A) tail length at genome-wide scale. This method allows calculation of the kinetics of cotranscriptional splicing and detects polyadenylated transcripts with unspliced introns retained at specific positions posttranscriptionally. Here we describe a detailed protocol for this method called FLEP-seq (full-length elongating and polyadenylated RNA sequencing) that is applicable to plants. Library production requires as little as one nanogram of nascent RNA (after rRNA/tRNA removal), and either Nanopore or PacBio platforms can be used for sequencing. We also provide a complete bioinformatic pipeline from raw data processing to downstream analysis. The minimum time required for FLEP-seq, including RNA extraction and library preparation, is 36 h. The subsequent long-read sequencing and initial data analysis ranges between 31 and 40 h, depending on the sequencing platform.


Asunto(s)
Poli A/análisis , ARN Polimerasa II/análisis , Empalme del ARN , Análisis de Secuencia de ARN/métodos , Programas Informáticos , Arabidopsis , Genómica/métodos , Poliadenilación
2.
Mol Hum Reprod ; 27(7)2021 07 01.
Artículo en Inglés | MEDLINE | ID: mdl-34152407

RESUMEN

Mammalian oocytes and embryos rely exclusively on maternal mRNAs to accomplish early developmental processes. Since oocytes and early embryos are transcriptionally silent after meiotic resumption, most of the synthesised maternal mRNA does not undergo immediate translation but is instead stored in the oocyte. Quantitative RT-PCR is commonly used to quantify mRNA levels, and correct quantification relies on reverse transcription and the choice of reference genes. Different methods for reverse transcription may affect gene expression determination in oocytes. In this study, we examined the suitability of either random or oligo(dT) primers for reverse transcription to be used for quantitative RT-PCR. We further looked for changes in poly(A) length of the maternal mRNAs during oocyte maturation. Our data indicate that depending on the method of reverse transcription, the optimal combination of reference genes for normalisation differed. Surprisingly, we observed a shortening of the poly(A) tail lengths of maternal mRNA as oocytes progressed from germinal vesicle to metaphase II. Overall, our findings suggest dynamic maternal regulation of mRNA structure and gene expression during oocyte maturation and early embryo development.


Asunto(s)
Blastómeros/metabolismo , Cartilla de ADN , Regulación del Desarrollo de la Expresión Génica , Mórula/metabolismo , Oocitos/metabolismo , ARN Mensajero/metabolismo , Reacción en Cadena de la Polimerasa de Transcriptasa Inversa/métodos , Transcripción Reversa , Cigoto/metabolismo , Animales , Bovinos , Cartilla de ADN/síntesis química , ADN Complementario/genética , Técnicas de Cultivo de Embriones , Genes , Poli A/análisis , ARN Mensajero/genética , ARN Mensajero/aislamiento & purificación , Estándares de Referencia , Creación de Embriones para Investigación , Alineación de Secuencia , Homología de Secuencia de Ácido Nucleico
3.
Methods Mol Biol ; 2284: 543-567, 2021.
Artículo en Inglés | MEDLINE | ID: mdl-33835463

RESUMEN

The poly(A) tail is a homopolymeric stretch of adenosine at the 3'-end of mature RNA transcripts and its length plays an important role in nuclear export, stability, and translational regulation of mRNA. Existing techniques for genome-wide estimation of poly(A) tail length are based on short-read sequencing. These methods are limited because they sequence a synthetic DNA copy of mRNA instead of the native transcripts. Furthermore, they can identify only a short segment of the transcript proximal to the poly(A) tail which makes it difficult to assign the measured poly(A) length uniquely to a single transcript isoform. With the introduction of native RNA sequencing by Oxford Nanopore Technologies, it is now possible to sequence full-length native RNA. A single long read contains both the transcript and the associated poly(A) tail, thereby making transcriptome-wide isoform-specific poly(A) tail length assessment feasible. We developed tailfindr-an R-based package for estimating poly(A) tail length from Oxford Nanopore sequencing data. In this chapter, we describe in detail the pipeline for transcript isoform-specific poly(A) tail profiling based on native RNA Nanopore sequencing-from library preparation to downstream data analysis with tailfindr.


Asunto(s)
Secuenciación de Nanoporos/métodos , Poli A/análisis , ARN/análisis , Análisis de Secuencia de ARN/métodos , Animales , Estudios de Factibilidad , Perfilación de la Expresión Génica/métodos , Biblioteca de Genes , Secuenciación de Nucleótidos de Alto Rendimiento/métodos , Humanos , Poli A/genética , Isoformas de Proteínas/análisis , Isoformas de Proteínas/genética , ARN/química , ARN/genética , Procesamiento Postranscripcional del ARN , ARN Mensajero/análisis , ARN Mensajero/genética , Transcriptoma , Pez Cebra/genética
4.
Biochemistry ; 59(22): 2041-2046, 2020 06 09.
Artículo en Inglés | MEDLINE | ID: mdl-32412236

RESUMEN

Homopolymeric adenosine RNA plays numerous roles in both cells and noncellular genetic material. We report herein an unusual poly(A) signature in chemical mapping data generated by the Eterna Massive Open Laboratory. Poly(A) sequences of length seven or more show unexpected results in the selective 2'-hydroxyl acylation read out by primer extension (SHAPE) and dimethyl sulfate (DMS) chemical probing. This unusual signature first appears in poly(A) sequences of length seven and grows to its maximum strength at length ∼10. In a long poly(A) sequence, the substitution of a single A by any other nucleotide disrupts the signature, but only for the 6 or so nucleotides on the 5' side of the substitution.


Asunto(s)
Ensayos Analíticos de Alto Rendimiento , Poli A/análisis , ARN/análisis
5.
Nucleic Acids Res ; 48(11): 5926-5938, 2020 06 19.
Artículo en Inglés | MEDLINE | ID: mdl-32421815

RESUMEN

Alternative polyadenylation (APA) produces isoforms with distinct 3'-ends, yet their functional differences remain largely unknown. Here, we introduce the APA-seq method to detect the expression levels of APA isoforms from 3'-end RNA-Seq data by exploiting both paired-end reads for gene isoform identification and quantification. We detected the expression levels of APA isoforms in individual Caenorhabditis elegans embryos at different stages throughout embryogenesis. Examining the correlation between the temporal profiles of isoforms led us to distinguish two classes of genes: those with highly correlated isoforms (HCI) and those with lowly correlated isoforms (LCI) across time. We hypothesized that variants with similar expression profiles may be the product of biological noise, while the LCI variants may be under tighter selection and consequently their distinct 3' UTR isoforms are more likely to have functional consequences. Supporting this notion, we found that LCI genes have significantly more miRNA binding sites, more correlated expression profiles with those of their targeting miRNAs and a relative lack of correspondence between their transcription and protein abundances. Collectively, our results suggest that a lack of coherence among the regulation of 3' UTR isoforms is a proxy for selective pressures acting upon APA usage and consequently for their functional relevance.


Asunto(s)
Regiones no Traducidas 3'/genética , Caenorhabditis elegans/embriología , Caenorhabditis elegans/genética , Regulación del Desarrollo de la Expresión Génica , Poli A/análisis , Poliadenilación , Animales , Drosophila melanogaster , Desarrollo Embrionario/genética , MicroARNs/genética , MicroARNs/metabolismo , Conformación de Ácido Nucleico , Xenopus laevis
6.
Nucleic Acids Res ; 48(D1): D174-D179, 2020 01 08.
Artículo en Inglés | MEDLINE | ID: mdl-31617559

RESUMEN

Generated by 3' end cleavage and polyadenylation at alternative polyadenylation (poly(A)) sites, alternative terminal exons account for much of the variation between human transcript isoforms. More than a dozen protocols have been developed so far for capturing and sequencing RNA 3' ends from a variety of cell types and species. In previous studies, we have used these data to uncover novel regulatory signals and cell type-specific isoforms. Here we present an update of the PolyASite (https://polyasite.unibas.ch) resource of poly(A) sites, constructed from publicly available human, mouse and worm 3' end sequencing datasets by enforcing uniform quality measures, including the flagging of putative internal priming sites. Through integrated processing of all data, we identified and clustered sites that are closely spaced and share polyadenylation signals, as these are likely the result of stochastic variations in processing. For each cluster, we identified the representative - most frequently processed - site and estimated the relative use in the transcriptome across all samples. We have established a modern web portal for efficient finding, exploration and export of data. Database generation is fully automated, greatly facilitating incorporation of new datasets and the updating of underlying genome resources.


Asunto(s)
Bases de Datos de Ácidos Nucleicos , Poliadenilación , Animales , Caenorhabditis elegans/genética , Humanos , Ratones , Poli A/análisis , Análisis de Secuencia de ARN
7.
Nat Commun ; 10(1): 5292, 2019 11 22.
Artículo en Inglés | MEDLINE | ID: mdl-31757970

RESUMEN

Message RNA poly(A) tails are vital for their function and regulation. However, the full-length sequence of mRNA isoforms with their poly(A) tails remains undetermined. Here, we develop a method at single-cell level sensitivity that enables quantification of poly(A) tails along with the full-length cDNA while reading non-adenosine residues within poly(A) tails precisely, which we name poly(A) inclusive RNA isoform sequencing (PAIso-seq). Using this method, we can quantify isoform specific poly(A) tail length. More interestingly, we find that 17% of the mRNAs harbor non-A residues within the body of poly(A) tails in mouse GV oocytes. We show that PAIso-seq is sensitive enough to analyze single GV oocytes. These findings will not only provide an accurate and sensitive tool in studying poly(A) tails, but also open a door for the function and regulation of non-adenosine modifications within the body of poly(A) tails.


Asunto(s)
Nucleótidos de Citosina/química , Nucleótidos de Guanina/química , Oocitos/metabolismo , Poli A/química , ARN Mensajero/química , Análisis de Secuencia de ARN/métodos , Nucleótidos de Uracilo/química , Animales , Proteínas de Ciclo Celular/genética , Ciclina B1/genética , Nucleótidos de Citosina/análisis , ADN (Citosina-5-)-Metiltransferasa 1/genética , Nucleótidos de Guanina/análisis , Ratones , Poli A/análisis , Biosíntesis de Proteínas , Isoformas de ARN , ARN Mensajero/análisis , Análisis de la Célula Individual , Activador de Tejido Plasminógeno/genética , Nucleótidos de Uracilo/análisis
8.
Nucleic Acids Res ; 47(5): 2263-2275, 2019 03 18.
Artículo en Inglés | MEDLINE | ID: mdl-30576549

RESUMEN

Recent studies have shown that tissue-specific transcriptomes contain multiple types of RNAs that are transcribed from intronic and intergenic sequences. The current study presents a tool for the discovery of transcribed, unannotated sequence elements from RNA-seq libraries. This RNA Element (RE) discovery algorithm (REDa) was applied to a spectrum of tissues and cells representing germline, embryonic, and somatic tissues and examined as a function of differentiation through the first set of cell divisions of human development. This highlighted extensive transcription throughout the genome, yielding previously unidentified human spermatogenic RNAs. Both exonic and novel X-chromosome REs were subject to robust meiotic sex chromosome inactivation, although an extensive de-repression occurred in the post-meiotic stages of spermatogenesis. Surprisingly, 2.4% of the 10,395 X chromosome exonic REs were present in mature sperm. Transcribed genomic repetitive sequences, including simple centromeric repeats, HERVE and HSAT1, were also shown to be associated with RE expression during spermatogenesis. These results suggest that pervasive intergenic repetitive sequence expression during human spermatogenesis may play a role in regulating chromatin dynamics. Repetitive REs switching repeat classes during differentiation upon fertilization and embryonic genome activation was evident.


Asunto(s)
Algoritmos , Blastocisto/metabolismo , Oocitos/metabolismo , ARN Mensajero/análisis , ARN Mensajero/genética , Secuencias Reguladoras de Ácido Ribonucleico/genética , Análisis de Secuencia de ARN , Espermatozoides/metabolismo , Blastocisto/citología , Diferenciación Celular , Línea Celular , Cromatina/genética , Cromatina/metabolismo , Cromosomas Humanos X/genética , Desarrollo Embrionario/genética , Exones/genética , Femenino , Fertilización , Regulación del Desarrollo de la Expresión Génica , Genómica , Humanos , Hígado/citología , Hígado/metabolismo , Masculino , Meiosis/genética , Oocitos/citología , Poli A/análisis , Poli A/genética , Poli A/aislamiento & purificación , ARN Mensajero/aislamiento & purificación , Secuencias Repetitivas de Ácidos Nucleicos , Espermatogénesis/genética , Espermatozoides/citología , Transcripción Genética , Inactivación del Cromosoma X
9.
Langmuir ; 34(49): 14940-14945, 2018 12 11.
Artículo en Inglés | MEDLINE | ID: mdl-30462509

RESUMEN

RNA sensing is of vital significance to advance our comprehension of gene expression and to further benefit medical diagnostics. Taking advantage of the excellent sensing capability of the aerolysin nanopore as a single-biomolecule interface, we for the first time achieved the direct characterization of single native RNA of Poly(A)4 and Poly(U)4. Poly(A)4 induces ∼10% larger blockade current amplitude than Poly(U)4. The statistical duration of Poly(A)4 is 18.83 ± 1.08 ms, which is 100 times longer than that of Poly(U)4. Our results demonstrated that the capture of RNA homopolymers is restricted by the biased diffusion. The translocation of RNA needs to overcome a lower free-energy barrier than that of DNA. Moreover, the strong RNA-aerolysin interaction is attributed to the hydroxyl in pentose, which prolongs the translocation time. This study opens an avenue for aerolysin nanopores to directly achieve RNA sensing, including discrimination of RNA epigenetic modification and selective detection of miRNA.


Asunto(s)
Toxinas Bacterianas/química , Nanoporos , Proteínas Citotóxicas Formadoras de Poros/química , ARN/análisis , Aeromonas hydrophila/química , Técnicas Electroquímicas/métodos , Membrana Dobles de Lípidos/química , Fosfatidilcolinas/química , Poli A/análisis , Poli U/análisis
10.
Anal Bioanal Chem ; 410(6): 1667-1677, 2018 Feb.
Artículo en Inglés | MEDLINE | ID: mdl-29313076

RESUMEN

The 3'-polyadenosine (poly A) tail of in vitro transcribed (IVT) mRNA was studied using liquid chromatography coupled to mass spectrometry (LC-MS). Poly A tails were cleaved from the mRNA using ribonuclease T1 followed by isolation with dT magnetic beads. Extracted tails were then analyzed by LC-MS which provided tail length information at single-nucleotide resolution. A 2100-nt mRNA with plasmid-encoded poly A tail lengths of either 27, 64, 100, or 117 nucleotides was used for these studies as enzymatically added poly A tails showed significant length heterogeneity. The number of As observed in the tails closely matched Sanger sequencing results of the DNA template, and even minor plasmid populations with sequence variations were detected. When the plasmid sequence contained a discreet number of poly As in the tail, analysis revealed a distribution that included tails longer than the encoded tail lengths. These observations were consistent with transcriptional slippage of T7 RNAP taking place within a poly A sequence. The type of RNAP did not alter the observed tail distribution, and comparison of T3, T7, and SP6 showed all three RNAPs produced equivalent tail length distributions. The addition of a sequence at the 3' end of the poly A tail did, however, produce narrower tail length distributions which supports a previously described model of slippage where the 3' end can be locked in place by having a G or C after the poly nucleotide region. Graphical abstract Determination of mRNA poly A tail length using magnetic beads and LC-MS.


Asunto(s)
Cromatografía Líquida de Alta Presión/métodos , Espectrometría de Masas/métodos , Poli A/análisis , ARN Mensajero/química , Animales , Línea Celular , Ratones , Plásmidos/química , Plásmidos/genética , Poli A/genética , Poliadenilación , ARN Mensajero/genética , Transcripción Genética
11.
Molecules ; 22(12)2017 Dec 13.
Artículo en Inglés | MEDLINE | ID: mdl-29236076

RESUMEN

We demonstrate here for the first time that a guanidiniocarbonyl-pyrrole (GCP) unit can be applied for the fine recognition of single stranded RNA sequences-an intuitively unexpected result since so far binding of the GCP unit to ds-DNA or ds-RNA relied strongly on minor or major groove interactions, as shown in previous work. Two novel nucleobase-GCP isosteric conjugates differing in the flexibility of GCP unit revealed a fluorimetric recognition of various single stranded RNA, which could be additionally regulated by pH. The more rigid conjugate showed a specific fluorescence increase for poly A only at pH 7, whereby this response could be reversibly switched-off at pH 5. The more flexible derivative revealed selective fluorescence quenching by poly G at pH 7 but no change for poly A, whereas its recognition of poly AH⁺ can be switched-on at pH 5. The computational analysis confirmed the important role of the GCP fragment and its protonation states in the sensing of polynucleotides and revealed that it is affected by the intrinsic dynamical features of conjugates themselves. Both conjugates showed a negligible response to uracil and cytosine ss-RNA as well as ds-RNA at pH 7, and only weak interactions with ds-DNA. Thus, nucleobase-GCP conjugates can be considered as novel lead compounds for the design of ss-RNA or ss-DNA selective fluorimetric probes.


Asunto(s)
Fluorometría/métodos , Guanidinas/química , Sondas Moleculares/química , Pirroles/química , ARN/análisis , ADN/química , Concentración de Iones de Hidrógeno , Poli A/análisis , Poli G/análisis , Soluciones
12.
Nucleic Acids Res ; 45(12): e112, 2017 Jul 07.
Artículo en Inglés | MEDLINE | ID: mdl-28449108

RESUMEN

The recent emergence of alternative polyadenylation (APA) as an engine driving transcriptomic diversity has stimulated the development of sequencing methodologies designed to assess genome-wide polyadenylation events. The goal of these approaches is to enrich, partition, capture and ultimately sequence poly(A) site junctions. However, these methods often require poly(A) enrichment, 3΄ linker ligation steps, and RNA fragmentation, which can necessitate higher levels of starting RNA, increase experimental error and potentially introduce bias. We recently reported a click-chemistry based method for generating RNAseq libraries called 'ClickSeq'. Here, we adapt this method to direct the cDNA synthesis specifically toward the 3΄UTR/poly(A) tail junction of cellular RNA. With this novel approach, we demonstrate sensitive and specific enrichment for poly(A) site junctions without the need for complex sample preparation, fragmentation or purification. Poly(A)-ClickSeq (PAC-seq) is therefore a simple procedure that generates high-quality RNA-seq poly(A) libraries. As a proof-of-principle, we utilized PAC-seq to explore the poly(A) landscape of both human and Drosophila cells in culture and observed outstanding overlap with existing poly(A) databases and also identified previously unannotated poly(A) sites. Moreover, we utilize PAC-seq to quantify and analyze APA events regulated by CFIm25 illustrating how this technology can be harnessed to identify alternatively polyadenylated RNA.


Asunto(s)
Regiones no Traducidas 3' , Química Clic/métodos , Secuenciación de Nucleótidos de Alto Rendimiento , Poli A/análisis , ARN Mensajero/análisis , Transcriptoma , Animales , Secuencia de Bases , ADN Complementario/genética , ADN Complementario/metabolismo , Bases de Datos Genéticas , Drosophila melanogaster/genética , Drosophila melanogaster/metabolismo , Perfilación de la Expresión Génica , Biblioteca de Genes , Células HeLa , Humanos , Anotación de Secuencia Molecular , Poli A/genética , Poli A/metabolismo , Poliadenilación , ARN Mensajero/genética , ARN Mensajero/metabolismo , Análisis de Secuencia de ARN , Factores de Escisión y Poliadenilación de ARNm/genética , Factores de Escisión y Poliadenilación de ARNm/metabolismo
13.
Methods Mol Biol ; 1463: 93-102, 2017.
Artículo en Inglés | MEDLINE | ID: mdl-27734350

RESUMEN

mRNA regulation by poly(A) tail length variations plays an important role in many developmental processes. Recent advances have shown that, in particular, deadenylation (the shortening of mRNA poly(A) tails) is essential for germ-line stem cell biology in the Drosophila ovary. Therefore, a rapid and accurate method to analyze poly(A) tail lengths of specific mRNAs in this tissue is valuable. Several methods of poly(A) test (PAT) assays have been reported to measure mRNA poly(A) tail lengths in vivo. Here, we describe two of these methods (PAT and ePAT) that we have adapted for Drosophila ovarian germ cells and germ-line stem cells.


Asunto(s)
Drosophila/genética , Ovario/química , Poli A/análisis , ARN Mensajero/química , Animales , Femenino , Regulación de la Expresión Génica , Ovario/citología , Poliadenilación , Nicho de Células Madre , Células Madre/química , Células Madre/citología
14.
BMC Genomics ; 16: 511, 2015 Jul 09.
Artículo en Inglés | MEDLINE | ID: mdl-26155789

RESUMEN

BACKGROUND: Messenger RNA polyadenylation is an essential step for the maturation of most eukaryotic mRNAs. Accurate determination of poly(A) sites helps define the 3'-ends of genes, which is important for genome annotation and gene function research. Genomic studies have revealed the presence of poly(A) sites in intergenic regions, which may be attributed to 3'-UTR extensions and novel transcript units. However, there is no systematically evaluation of intergenic poly(A) sites in plants. RESULTS: Approximately 16,000 intergenic poly(A) site clusters (IPAC) in Arabidopsis thaliana were discovered and evaluated at the whole genome level. Based on the distributions of distance from IPACs to nearby sense and antisense genes, these IPACs were classified into three categories. About 70 % of them were from previously unannotated 3'-UTR extensions to known genes, which would extend 6985 transcripts of TAIR10 genome annotation beyond their 3'-ends, with a mean extension of 134 nucleotides. 1317 IPACs were originated from novel intergenic transcripts, 37 of which were likely to be associated with protein coding transcripts. 2957 IPACs corresponded to antisense transcripts for genes on the reverse strand, which might affect 2265 protein coding genes and 39 non-protein-coding genes, including long non-coding RNA genes. The rest of IPACs could be originated from transcriptional read-through or gene mis-annotations. CONCLUSIONS: The identified IPACs corresponding to novel transcripts, 3'-UTR extensions, and antisense transcription should be incorporated into current Arabidopsis genome annotation. Comprehensive characterization of IPACs from this study provides insights of alternative polyadenylation and antisense transcription in plants.


Asunto(s)
Proteínas de Arabidopsis/genética , Arabidopsis/genética , Poli A/análisis , Regiones no Traducidas 3' , Regulación de la Expresión Génica de las Plantas , Genoma de Planta , ARN sin Sentido/genética , ARN de Planta/genética
15.
Comput Biol Chem ; 54: 49-56, 2015 Feb.
Artículo en Inglés | MEDLINE | ID: mdl-25594576

RESUMEN

Polyadenylation is the process of addition of poly(A) tail to mRNA 3' ends. Identification of motifs controlling polyadenylation plays an essential role in improving genome annotation accuracy and better understanding of the mechanisms governing gene regulation. The bioinformatics methods used for poly(A) motifs recognition have demonstrated that information extracted from sequences surrounding the candidate motifs can differentiate true motifs from the false ones greatly. However, these methods depend on either domain features or string kernels. To date, methods combining information from different sources have not been found yet. Here, we proposed an improved poly(A) motifs recognition method by combing different sources based on decision level fusion. First of all, two novel prediction methods was proposed based on support vector machine (SVM): one method is achieved by using the domain-specific features and principle component analysis (PCA) method to eliminate the redundancy (PCA-SVM); the other method is based on Oligo string kernel (Oligo-SVM). Then we proposed a novel machine-learning method for poly(A) motif prediction by marrying four poly(A) motifs recognition methods, including two state-of-the-art methods (Random Forest (RF) and HMM-SVM), and two novel proposed methods (PCA-SVM and Oligo-SVM). A decision level information fusion method was employed to combine the decision values of different classifiers by applying the DS evidence theory. We evaluated our method on a comprehensive poly(A) dataset that consists of 14,740 samples on 12 variants of poly(A) motifs and 2750 samples containing none of these motifs. Our method has achieved accuracy up to 86.13%. Compared with the four classifiers, our evidence theory based method reduces the average error rate by about 30%, 27%, 26% and 16%, respectively. The experimental results suggest that the proposed method is more effective for poly(A) motif recognition.


Asunto(s)
Poli A/química , Poliadenilación , ARN Mensajero/química , Máquina de Vectores de Soporte , Biología Computacional/métodos , Biología Computacional/estadística & datos numéricos , Datos de Secuencia Molecular , Motivos de Nucleótidos , Poli A/análisis , Análisis de Componente Principal , ARN Mensajero/análisis
16.
Bioinformatics ; 31(10): 1671-3, 2015 May 15.
Artículo en Inglés | MEDLINE | ID: mdl-25583118

RESUMEN

MOTIVATION: Polyadenylation is an essential process during eukaryotic gene expression. Prediction of poly(A) sites helps to define the 3' end of genes, which is important for gene annotation and elucidating gene regulation mechanisms. However, due to limited knowledge of poly(A) signals, it is still challenging to predict poly(A) sites in plants and algae. PASPA is a web server for P: oly( A: ) S: ite prediction in P: lants and A: lgae, which integrates many in-house tools as add-ons to facilitate poly(A) site prediction, visualization and mining. This server can predict poly(A) sites for ten species, including seven previously poly(A) signal non-characterized species, with sensitivity and specificity in a range between 0.80 and 0.95.


Asunto(s)
Regiones no Traducidas 3' , Poli A/análisis , Poliadenilación , ARN de Planta/química , Programas Informáticos , Algoritmos , Chlorophyta/genética , Diatomeas/genética , Internet , Rhodophyta/genética
17.
Nucleic Acids Res ; 43(Database issue): D59-67, 2015 Jan.
Artículo en Inglés | MEDLINE | ID: mdl-25378337

RESUMEN

Increasing amounts of genes have been shown to utilize alternative polyadenylation (APA) 3'-processing sites depending on the cell and tissue type and/or physiological and pathological conditions at the time of processing, and the construction of genome-wide database regarding APA is urgently needed for better understanding poly(A) site selection and APA-directed gene expression regulation for a given biology. Here we present a web-accessible database, named APASdb (http://mosas.sysu.edu.cn/utr), which can visualize the precise map and usage quantification of different APA isoforms for all genes. The datasets are deeply profiled by the sequencing alternative polyadenylation sites (SAPAS) method capable of high-throughput sequencing 3'-ends of polyadenylated transcripts. Thus, APASdb details all the heterogeneous cleavage sites downstream of poly(A) signals, and maintains near complete coverage for APA sites, much better than the previous databases using conventional methods. Furthermore, APASdb provides the quantification of a given APA variant among transcripts with different APA sites by computing their corresponding normalized-reads, making our database more useful. In addition, APASdb supports URL-based retrieval, browsing and display of exon-intron structure, poly(A) signals, poly(A) sites location and usage reads, and 3'-untranslated regions (3'-UTRs). Currently, APASdb involves APA in various biological processes and diseases in human, mouse and zebrafish.


Asunto(s)
Bases de Datos de Ácidos Nucleicos , Poliadenilación , Animales , Expresión Génica , Humanos , Internet , Ratones , Poli A/análisis , División del ARN , Pez Cebra/genética
18.
PLoS One ; 9(11): e111780, 2014.
Artículo en Inglés | MEDLINE | ID: mdl-25369024

RESUMEN

The localisation of poly(A) RNA in plant cells containing either reticular (Allium cepa) or chromocentric (Lupinus luteus, Arabidopsis thaliana) nuclei was studied through in situ hybridisation. In both types of nuclei, the amount of poly(A) RNA was much greater in the nucleus than in the cytoplasm. In the nuclei, poly(A) RNA was present in structures resembling nuclear bodies. The molecular composition as well as the characteristic ultrastructure of the bodies containing poly(A) RNA demonstrated that they were Cajal bodies. We showed that some poly(A) RNAs in Cajal bodies code for proteins. However, examination of the localisation of active RNA polymerase II and in situ run-on transcription assays both demonstrated that CBs are not sites of transcription and that BrU-containing RNA accumulates in these structures long after synthesis. In addition, it was demonstrated that accumulation of poly(A) RNA occurs in the nuclei and CBs of hypoxia-treated cells. Our findings indicated that CBs may be involved in the later stages of poly(A) RNA metabolism, playing a role storage or retention.


Asunto(s)
Arabidopsis/citología , Cuerpos Enrollados/ultraestructura , Lupinus/citología , Cebollas/citología , Poli A/análisis , ARN de Planta/análisis , Cuerpos Enrollados/química , ARN Mensajero/análisis , Ribonucleoproteínas Nucleares Pequeñas/análisis
19.
Nature ; 508(7494): 66-71, 2014 Apr 03.
Artículo en Inglés | MEDLINE | ID: mdl-24476825

RESUMEN

Poly(A) tails enhance the stability and translation of most eukaryotic messenger RNAs, but difficulties in globally measuring poly(A)-tail lengths have impeded greater understanding of poly(A)-tail function. Here we describe poly(A)-tail length profiling by sequencing (PAL-seq) and apply it to measure tail lengths of millions of individual RNAs isolated from yeasts, cell lines, Arabidopsis thaliana leaves, mouse liver, and zebrafish and frog embryos. Poly(A)-tail lengths were conserved between orthologous mRNAs, with mRNAs encoding ribosomal proteins and other 'housekeeping' proteins tending to have shorter tails. As expected, tail lengths were coupled to translational efficiencies in early zebrafish and frog embryos. However, this strong coupling diminished at gastrulation and was absent in non-embryonic samples, indicating a rapid developmental switch in the nature of translational control. This switch complements an earlier switch to zygotic transcriptional control and explains why the predominant effect of microRNA-mediated deadenylation concurrently shifts from translational repression to mRNA destabilization.


Asunto(s)
Regulación del Desarrollo de la Expresión Génica/genética , Poli A/análisis , Biosíntesis de Proteínas/genética , ARN Mensajero/genética , Animales , Arabidopsis/genética , Secuencia de Bases , Línea Celular , Drosophila melanogaster/embriología , Drosophila melanogaster/genética , Gastrulación/genética , Humanos , Hígado/metabolismo , Ratones , MicroARNs/genética , MicroARNs/metabolismo , Modelos Genéticos , Hojas de la Planta/genética , Poli A/genética , Estabilidad del ARN/genética , ARN Mensajero/metabolismo , Ribosomas/metabolismo , Análisis de Secuencia de ARN , Especificidad de la Especie , Transcripción Genética , Xenopus/embriología , Xenopus/genética , Levaduras/genética , Pez Cebra/embriología , Pez Cebra/genética , Cigoto/metabolismo
20.
Nucleic Acids Res ; 42(5): 2820-32, 2014 Mar.
Artículo en Inglés | MEDLINE | ID: mdl-24357408

RESUMEN

Recent sequencing technologies that allow massive parallel production of short reads are the method of choice for transcriptome analysis. Particularly, digital gene expression (DGE) technologies produce a large dynamic range of expression data by generating short tag signatures for each cell transcript. These tags can be mapped back to a reference genome to identify new transcribed regions that can be further covered by RNA-sequencing (RNA-Seq) reads. Here, we applied an integrated bioinformatics approach that combines DGE tags, RNA-Seq, tiling array expression data and species-comparison to explore new transcriptional regions and their specific biological features, particularly tissue expression or conservation. We analysed tags from a large DGE data set (designated as 'TranscriRef'). We then annotated 750,000 tags that were uniquely mapped to the human genome according to Ensembl. We retained transcripts originating from both DNA strands and categorized tags corresponding to protein-coding genes, antisense, intronic- or intergenic-transcribed regions and computed their overlap with annotated non-coding transcripts. Using this bioinformatics approach, we identified ∼34,000 novel transcribed regions located outside the boundaries of known protein-coding genes. As demonstrated using sequencing data from human pluripotent stem cells for biological validation, the method could be easily applied for the selection of tissue-specific candidate transcripts. DigitagCT is available at http://cractools.gforge.inria.fr/softwares/digitagct.


Asunto(s)
Perfilación de la Expresión Génica/métodos , Genoma Humano , ARN no Traducido/análisis , Análisis de Secuencia de ARN/métodos , Línea Celular , Humanos , Anotación de Secuencia Molecular , Poli A/análisis , Programas Informáticos , Transcripción Genética
SELECCIÓN DE REFERENCIAS
DETALLE DE LA BÚSQUEDA
...