Your browser doesn't support javascript.
loading
Mostrar: 20 | 50 | 100
Resultados 1 - 20 de 29
Filtrar
Más filtros












Base de datos
Intervalo de año de publicación
1.
Funct Integr Genomics ; 22(4): 451-466, 2022 Aug.
Artículo en Inglés | MEDLINE | ID: mdl-35305194

RESUMEN

The aim of this study was to identify mRNA isoforms and small genetic variants that may be affecting marbling and beef color in Nellore cattle. Longissimus thoracis muscle samples from 20 bulls with different phenotypes (out of 80 bulls set) for marbling (moderate (n = 10) and low (n = 10) groups) and beef color (desirable (n = 10) and undesirable (n = 9) group) traits were used to perform transcriptomic analysis using RNA sequencing. Fourteen and 15 mRNA isoforms were detected as differentially expressed (DE) (P-value ≤ 0.001) between divergent groups for marbling and meat color traits, respectively. Some of those DE mRNA isoforms have shown sites of splicing modified by small structural variants as single nucleotide variant (SNV), insertion, and/or deletion. Enrichment analysis identified metabolic pathways, such as O2/CO2 exchange in erythrocytes, tyrosine biosynthesis, and phenylalanine degradation. The results obtained suggest potential key regulatory genes associated with these economically important traits for the beef industry and for the consumer.


Asunto(s)
Carne , Isoformas de ARN , Animales , Bovinos/genética , Variación Genética , Masculino , Carne/análisis , Músculo Esquelético/metabolismo , Fenotipo , Isoformas de ARN/análisis , Isoformas de ARN/metabolismo , Análisis de Secuencia de ARN
2.
Methods Mol Biol ; 2284: 231-251, 2021.
Artículo en Inglés | MEDLINE | ID: mdl-33835446

RESUMEN

High-throughput sequencing for micro-RNAs (miRNAs) to obtain expression estimates is a central method of molecular biology. Surprisingly, there are a number of different approaches to converting sequencing output into micro-RNA counts. Each has their own strengths and biases that impact on the final data that can be obtained from a sequencing run. This chapter serves to make the reader aware of the trade-offs one must consider in analyzing small RNA sequencing data. It then compares two methods, miRge2.0 and the sRNAbench and the steps utilized to output data from their tools.


Asunto(s)
Biología Computacional/métodos , MicroARNs/genética , Análisis de Secuencia de ARN/métodos , Animales , Simulación por Computador , Conjuntos de Datos como Asunto , Perfilación de la Expresión Génica , Secuenciación de Nucleótidos de Alto Rendimiento/métodos , Humanos , MicroARNs/análisis , Polimorfismo de Nucleótido Simple , Isoformas de ARN/análisis , Isoformas de ARN/genética , Programas Informáticos
3.
BMC Biol ; 18(1): 139, 2020 10 13.
Artículo en Inglés | MEDLINE | ID: mdl-33050904

RESUMEN

BACKGROUND: Some dinoflagellates cause harmful algal blooms, releasing toxic secondary metabolites, to the detriment of marine ecosystems and human health. Our understanding of dinoflagellate toxin biosynthesis has been hampered by their unusually large genomes. To overcome this challenge, for the first time, we sequenced the genome, microRNAs, and mRNA isoforms of a basal dinoflagellate, Amphidinium gibbosum, and employed an integrated omics approach to understand its secondary metabolite biosynthesis. RESULTS: We assembled the ~ 6.4-Gb A. gibbosum genome, and by probing decoded dinoflagellate genomes and transcriptomes, we identified the non-ribosomal peptide synthetase adenylation domain as essential for generation of specialized metabolites. Upon starving the cells of phosphate and nitrogen, we observed pronounced shifts in metabolite biosynthesis, suggestive of post-transcriptional regulation by microRNAs. Using Iso-Seq and RNA-seq data, we found that alternative splicing and polycistronic expression generate different transcripts for secondary metabolism. CONCLUSIONS: Our genomic findings suggest intricate integration of various metabolic enzymes that function iteratively to synthesize metabolites, providing mechanistic insights into how dinoflagellates synthesize secondary metabolites, depending upon nutrient availability. This study provides insights into toxin production associated with dinoflagellate blooms. The genome of this basal dinoflagellate provides important clues about dinoflagellate evolution and overcomes the large genome size, which has been a challenge previously.


Asunto(s)
Dinoflagelados/metabolismo , Genoma de Protozoos , MicroARNs/análisis , Isoformas de ARN/análisis , ARN Protozoario/análisis , Metabolismo Secundario , Dinoflagelados/genética , ARN de Algas/análisis
4.
Nat Biotechnol ; 38(6): 708-714, 2020 06.
Artículo en Inglés | MEDLINE | ID: mdl-32518404

RESUMEN

Large-scale sequencing of RNA from individual cells can reveal patterns of gene, isoform and allelic expression across cell types and states1. However, current short-read single-cell RNA-sequencing methods have limited ability to count RNAs at allele and isoform resolution, and long-read sequencing techniques lack the depth required for large-scale applications across cells2,3. Here we introduce Smart-seq3, which combines full-length transcriptome coverage with a 5' unique molecular identifier RNA counting strategy that enables in silico reconstruction of thousands of RNA molecules per cell. Of the counted and reconstructed molecules, 60% could be directly assigned to allelic origin and 30-50% to specific isoforms, and we identified substantial differences in isoform usage in different mouse strains and human cell types. Smart-seq3 greatly increased sensitivity compared to Smart-seq2, typically detecting thousands more transcripts per cell. We expect that Smart-seq3 will enable large-scale characterization of cell types and states across tissues and organisms.


Asunto(s)
Perfilación de la Expresión Génica/métodos , ARN/análisis , Análisis de Secuencia de ARN/métodos , Análisis de la Célula Individual/métodos , Alelos , Animales , Humanos , Ratones , ARN/genética , Isoformas de ARN/análisis , Isoformas de ARN/genética , Sensibilidad y Especificidad , Transcriptoma/genética
5.
Genome Biol ; 21(1): 74, 2020 03 23.
Artículo en Inglés | MEDLINE | ID: mdl-32293520

RESUMEN

BACKGROUND: Early single-cell RNA-seq (scRNA-seq) studies suggested that it was unusual to see more than one isoform being produced from a gene in a single cell, even when multiple isoforms were detected in matched bulk RNA-seq samples. However, these studies generally did not consider the impact of dropouts or isoform quantification errors, potentially confounding the results of these analyses. RESULTS: In this study, we take a simulation based approach in which we explicitly account for dropouts and isoform quantification errors. We use our simulations to ask to what extent it is possible to study alternative splicing using scRNA-seq. Additionally, we ask what limitations must be overcome to make splicing analysis feasible. We find that the high rate of dropouts associated with scRNA-seq is a major obstacle to studying alternative splicing. In mice and other well-established model organisms, the relatively low rate of isoform quantification errors poses a lesser obstacle to splicing analysis. We find that different models of isoform choice meaningfully change our simulation results. CONCLUSIONS: To accurately study alternative splicing with single-cell RNA-seq, a better understanding of isoform choice and the errors associated with scRNA-seq is required. An increase in the capture efficiency of scRNA-seq would also be beneficial. Until some or all of the above are achieved, we do not recommend attempting to resolve isoforms in individual cells using scRNA-seq.


Asunto(s)
Empalme Alternativo , RNA-Seq , Análisis de la Célula Individual , Simulación por Computador , Modelos Genéticos , Isoformas de ARN/análisis , Isoformas de ARN/metabolismo
6.
Bioinformatics ; 34(3): 524-526, 2018 02 01.
Artículo en Inglés | MEDLINE | ID: mdl-28968682

RESUMEN

Motivation: As sequencing technologies improve their capacity to detect distinct transcripts of the same gene and to address complex experimental designs such as longitudinal studies, there is a need to develop statistical methods for the analysis of isoform expression changes in time series data. Results: Iso-maSigPro is a new functionality of the R package maSigPro for transcriptomics time series data analysis. Iso-maSigPro identifies genes with a differential isoform usage across time. The package also includes new clustering and visualization functions that allow grouping of genes with similar expression patterns at the isoform level, as well as those genes with a shift in major expressed isoform. Availability and implementation: The package is freely available under the LGPL license from the Bioconductor web site. Contact: mj.nueda@ua.es or aconesa@ufl.edu. Supplementary information: Supplementary data are available at Bioinformatics online.


Asunto(s)
Perfilación de la Expresión Génica/métodos , Isoformas de ARN/análisis , Análisis de Secuencia de ARN/métodos , Programas Informáticos , Animales , Linfocitos B/metabolismo , Linfocitos B/fisiología , Diferenciación Celular , Regulación de la Expresión Génica , Ratones , Isoformas de ARN/genética
7.
Int J Mol Sci ; 18(7)2017 Jun 27.
Artículo en Inglés | MEDLINE | ID: mdl-28653984

RESUMEN

The long non-coding RNA ANRIL, antisense to the CDKN2B locus, is transcribed from a gene that encompasses multiple disease-associated polymorphisms. Despite the identification of multiple isoforms of ANRIL, expression of certain transcripts has been found to be tissue-specific and the characterisation of ANRIL transcripts remains incomplete. Several functions have been associated with ANRIL. In our judgement, studies on ANRIL functionality are premature pending a more complete appreciation of the profusion of isoforms. We found differential expression of ANRIL exons, which indicates that multiple isoforms exist in melanoma cells. In addition to linear isoforms, we identified circular forms of ANRIL (circANRIL). Further characterisation of circANRIL in two patient-derived metastatic melanoma cell lines (NZM7 and NZM37) revealed the existence of a rich assortment of circular isoforms. Moreover, in the two melanoma cell lines investigated, the complements of circANRIL isoforms were almost completely different. Novel exons were also discovered. We also found the family of linear ANRIL was enriched in the nucleus, whilst the circular isoforms were enriched in the cytoplasm and they differed markedly in stability. With respect to the variable processing of circANRIL species, bioinformatic analysis indicated that intronic Arthrobacter luteus (Alu) restriction endonuclease inverted repeats and exon skipping were not involved in selection of back-spliced exon junctions. Based on our findings, we hypothesise that "ANRIL" has wholly distinct dual sets of functions in melanoma. This reveals the dynamic nature of the locus and constitutes a basis for investigating the functions of ANRIL in melanoma.


Asunto(s)
Melanoma/genética , Isoformas de ARN/genética , ARN Largo no Codificante/genética , Neoplasias Cutáneas/genética , Línea Celular Tumoral , Exones , Regulación Neoplásica de la Expresión Génica , Humanos , Conformación de Ácido Nucleico , Isoformas de ARN/análisis , Empalme del ARN , ARN Largo no Codificante/análisis
8.
Virology ; 488: 81-7, 2016 Jan 15.
Artículo en Inglés | MEDLINE | ID: mdl-26609938

RESUMEN

In lytically infected B cells Kaposi sarcoma-associated herpesvirus (KSHV) ORF57 gene encodes two RNA isoforms by alternative splicing of its pre-mRNA, which contains a small, constitutive intron in its 5' half and a large, suboptimal intron in its 3's half. The RNA1 isoform encodes full-length ORF57 and is a major isoform derived from splicing of the constitutive small intron, but retaining the suboptimal large intron as the coding region. A small fraction (<5%) of ORF57 RNA undergoes double splicing to produce a smaller non-coding RNA2 due to lack of a translational termination codon. Both RNAs are cleaved and polyadenylated at the same cleavage site CS83636. The insertion of ORF57 RNA1 into a restriction cutting site in certain mammalian expression vectors activates splicing of the subopitmal intron and produces a truncated ORF57 protein.


Asunto(s)
Empalme Alternativo , Regulación Viral de la Expresión Génica , Herpesvirus Humano 8/fisiología , Isoformas de ARN/análisis , Proteínas Reguladoras y Accesorias Virales/genética , Línea Celular , Herpesvirus Humano 8/genética , Humanos
9.
Bioinformatics ; 31(24): 3938-45, 2015 Dec 15.
Artículo en Inglés | MEDLINE | ID: mdl-26338770

RESUMEN

MOTIVATION: Because of the advantages of RNA sequencing (RNA-Seq) over microarrays, it is gaining widespread popularity for highly parallel gene expression analysis. For example, RNA-Seq is expected to be able to provide accurate identification and quantification of full-length splice forms. A number of informatics packages have been developed for this purpose, but short reads make it a difficult problem in principle. Sequencing error and polymorphisms add further complications. It has become necessary to perform studies to determine which algorithms perform best and which if any algorithms perform adequately. However, there is a dearth of independent and unbiased benchmarking studies. Here we take an approach using both simulated and experimental benchmark data to evaluate their accuracy. RESULTS: We conclude that most methods are inaccurate even using idealized data, and that no method is highly accurate once multiple splice forms, polymorphisms, intron signal, sequencing errors, alignment errors, annotation errors and other complicating factors are present. These results point to the pressing need for further algorithm development. AVAILABILITY AND IMPLEMENTATION: Simulated datasets and other supporting information can be found at http://bioinf.itmat.upenn.edu/BEERS/bp2. SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online.


Asunto(s)
Algoritmos , Empalme Alternativo , Perfilación de la Expresión Génica/métodos , Isoformas de ARN/análisis , Análisis de Secuencia de ARN/métodos , Animales , Benchmarking , Humanos , Ratones , ARN Mensajero/análisis
10.
BMC Bioinformatics ; 16: 262, 2015 Aug 19.
Artículo en Inglés | MEDLINE | ID: mdl-26286719

RESUMEN

BACKGROUND: Detecting and quantifying isoforms from RNA-seq data is an important but challenging task. The problem is often ill-posed, particularly at low coverage. One promising direction is to exploit several samples simultaneously. RESULTS: We propose a new method for solving the isoform deconvolution problem jointly across several samples. We formulate a convex optimization problem that allows to share information between samples and that we solve efficiently. We demonstrate the benefits of combining several samples on simulated and real data, and show that our approach outperforms pooling strategies and methods based on integer programming. CONCLUSION: Our convex formulation to jointly detect and quantify isoforms from RNA-seq data of multiple related samples is a computationally efficient approach to leverage the hypotheses that some isoforms are likely to be present in several samples. The software and source code are available at http://cbio.ensmp.fr/flipflop.


Asunto(s)
Isoformas de ARN/análisis , ARN/metabolismo , Algoritmos , Empalme Alternativo , Humanos , Internet , Isoformas de ARN/metabolismo , Análisis de Secuencia de ARN , Transcriptoma , Interfaz Usuario-Computador
11.
Genome Biol ; 16: 150, 2015 Jul 23.
Artículo en Inglés | MEDLINE | ID: mdl-26201343

RESUMEN

BACKGROUND: Understanding the regulation of gene expression, including transcription start site usage, alternative splicing, and polyadenylation, requires accurate quantification of expression levels down to the level of individual transcript isoforms. To comparatively evaluate the accuracy of the many methods that have been proposed for estimating transcript isoform abundance from RNA sequencing data, we have used both synthetic data as well as an independent experimental method for quantifying the abundance of transcript ends at the genome-wide level. RESULTS: We found that many tools have good accuracy and yield better estimates of gene-level expression compared to commonly used count-based approaches, but they vary widely in memory and runtime requirements. Nucleotide composition and intron/exon structure have comparatively little influence on the accuracy of expression estimates, which correlates most strongly with transcript/gene expression levels. To facilitate the reproduction and further extension of our study, we provide datasets, source code, and an online analysis tool on a companion website, where developers can upload expression estimates obtained with their own tool to compare them to those inferred by the methods assessed here. CONCLUSIONS: As many methods for quantifying isoform abundance with comparable accuracy are available, a user's choice will likely be determined by factors such as the memory and runtime requirements, as well as the availability of methods for downstream analyses. Sequencing-based methods to quantify the abundance of specific transcript regions could complement validation schemes based on synthetic data and quantitative PCR in future or ongoing assessments of RNA-seq analysis methods.


Asunto(s)
Perfilación de la Expresión Génica/métodos , Isoformas de ARN/análisis , Análisis de Secuencia de ARN/métodos , Programas Informáticos , Animales , Secuenciación de Nucleótidos de Alto Rendimiento/métodos , Humanos , Células Jurkat , Ratones , Células 3T3 NIH
12.
Curr Protoc Mol Biol ; 110: 4.23.1-4.23.17, 2015 Apr 01.
Artículo en Inglés | MEDLINE | ID: mdl-25827089

RESUMEN

Most eukaryotic genes are transcribed into mRNAs with alternative poly(A) sites. Emerging evidence suggests that mRNA isoforms with alternative poly(A) sites can perform critical regulatory functions in numerous biological processes. In recent years, a number of strategies utilizing high-throughput sequencing technologies have been developed to aid in the identification of genome-wide poly(A) sites. This unit describes a modified protocol for a recently published 3'READS (3' region extraction and deep sequencing) method that accurately identifies genome-wide poly(A) sites and that can be used to quantify the relative abundance of the resulting 3' mRNA isoforms. This approach minimizes nonspecific sequence reads due to internal priming and typically yields a high percentage of sequence reads that are ideally suited for accurate poly(A) identification.


Asunto(s)
Regiones no Traducidas 3' , Biología Molecular/métodos , Poli A/genética , Isoformas de ARN/análisis , Isoformas de ARN/genética , Secuenciación de Nucleótidos de Alto Rendimiento , Isoformas de ARN/aislamiento & purificación
13.
BMC Genomics ; 15: 862, 2014 Oct 06.
Artículo en Inglés | MEDLINE | ID: mdl-25283306

RESUMEN

BACKGROUND: Most genes in mammals generate several transcript isoforms that differ in stability and translational efficiency through alternative splicing. Such alternative splicing can be tissue- and developmental stage-specific, and such specificity is sometimes associated with disease. Thus, detecting differential isoform usage for a gene between tissues or cell lines/types (differences in the fraction of total expression of a gene represented by the expression of each of its isoforms) is potentially important for cell and developmental biology. RESULTS: We present a new method IUTA that is designed to test each gene in the genome for differential isoform usage between two groups of samples. IUTA also estimates isoform usage for each gene in each sample as well as averaged across samples within each group. IUTA is the first method to formulate the testing problem as testing for equal means of two probability distributions under the Aitchison geometry, which is widely recognized as the most appropriate geometry for compositional data (vectors that contain the relative amount of each component comprising the whole). Evaluation using simulated data showed that IUTA was able to provide test results for many more genes than was Cuffdiff2 (version 2.2.0, released in Mar. 2014), and IUTA performed better than Cuffdiff2 for the limited number of genes that Cuffdiff2 did analyze. When applied to actual mouse RNA-Seq datasets from six tissues, IUTA identified 2,073 significant genes with clear patterns of differential isoform usage between a pair of tissues. IUTA is implemented as an R package and is available at http://www.niehs.nih.gov/research/resources/software/biostatistics/iuta/index.cfm. CONCLUSIONS: Both simulation and real-data results suggest that IUTA accurately detects differential isoform usage. We believe that our analysis of RNA-seq data from six mouse tissues represents the first comprehensive characterization of isoform usage in these tissues. IUTA will be a valuable resource for those who study the roles of alternative transcripts in cell development and disease.


Asunto(s)
Biología Computacional/métodos , Isoformas de ARN/análisis , Programas Informáticos , Algoritmos , Animales , Secuencia de Bases , Simulación por Computador , Bases de Datos de Ácidos Nucleicos , Ratones
14.
Hum Mutat ; 35(10): 1249-59, 2014 Oct.
Artículo en Inglés | MEDLINE | ID: mdl-25066652

RESUMEN

Assessment of the functional consequences of variants near splice sites is a major challenge in the diagnostic laboratory. To address this issue, we created expression minigenes (EMGs) to determine the RNA and protein products generated by splice site variants (n = 10) implicated in cystic fibrosis (CF). Experimental results were compared with the splicing predictions of eight in silico tools. EMGs containing the full-length Cystic Fibrosis Transmembrane Conductance Regulator (CFTR) coding sequence and flanking intron sequences generated wild-type transcript and fully processed protein in Human Embryonic Kidney (HEK293) and CF bronchial epithelial (CFBE41o-) cells. Quantification of variant induced aberrant mRNA isoforms was concordant using fragment analysis and pyrosequencing. The splicing patterns of c.1585-1G>A and c.2657+5G>A were comparable to those reported in primary cells from individuals bearing these variants. Bioinformatics predictions were consistent with experimental results for 9/10 variants (MES), 8/10 variants (NNSplice), and 7/10 variants (SSAT and Sroogle). Programs that estimate the consequences of mis-splicing predicted 11/16 (HSF and ASSEDA) and 10/16 (Fsplice and SplicePort) experimentally observed mRNA isoforms. EMGs provide a robust experimental approach for clinical interpretation of splice site variants and refinement of in silico tools.


Asunto(s)
Simulación por Computador , Técnicas Genéticas , Isoformas de ARN/genética , Empalme del ARN , Línea Celular , Fibrosis Quística/genética , Fibrosis Quística/metabolismo , Regulador de Conductancia de Transmembrana de Fibrosis Quística/genética , Humanos , Mutación , Isoformas de ARN/análisis , Sitios de Empalme de ARN/genética
15.
BMC Bioinformatics ; 15: 135, 2014 May 09.
Artículo en Inglés | MEDLINE | ID: mdl-24885830

RESUMEN

BACKGROUND: The main goal of the whole transcriptome analysis is to correctly identify all expressed transcripts within a specific cell/tissue--at a particular stage and condition--to determine their structures and to measure their abundances. RNA-seq data promise to allow identification and quantification of transcriptome at unprecedented level of resolution, accuracy and low cost. Several computational methods have been proposed to achieve such purposes. However, it is still not clear which promises are already met and which challenges are still open and require further methodological developments. RESULTS: We carried out a simulation study to assess the performance of 5 widely used tools, such as: CEM, Cufflinks, iReckon, RSEM, and SLIDE. All of them have been used with default parameters. In particular, we considered the effect of the following three different scenarios: the availability of complete annotation, incomplete annotation, and no annotation at all. Moreover, comparisons were carried out using the methods in three different modes of action. In the first mode, the methods were forced to only deal with those isoforms that are present in the annotation; in the second mode, they were allowed to detect novel isoforms using the annotation as guide; in the third mode, they were operating in fully data driven way (although with the support of the alignment on the reference genome). In the latter modality, precision and recall are quite poor. On the contrary, results are better with the support of the annotation, even though it is not complete. Finally, abundance estimation error often shows a very skewed distribution. The performance strongly depends on the true real abundance of the isoforms. Lowly (and sometimes also moderately) expressed isoforms are poorly detected and estimated. In particular, lowly expressed isoforms are identified mainly if they are provided in the original annotation as potential isoforms. CONCLUSIONS: Both detection and quantification of all isoforms from RNA-seq data are still hard problems and they are affected by many factors. Overall, the performance significantly changes since it depends on the modes of action and on the type of available annotation. Results obtained using complete or partial annotation are able to detect most of the expressed isoforms, even though the number of false positives is often high. Fully data driven approaches require more attention, at least for complex eucaryotic genomes. Improvements are desirable especially for isoform quantification and for isoform detection with low abundance.


Asunto(s)
Isoformas de ARN/análisis , Programas Informáticos , Algoritmos , Perfilación de la Expresión Génica , Humanos , Isoformas de ARN/química , Isoformas de ARN/metabolismo , Análisis de Secuencia de ARN/métodos
16.
Nat Biotechnol ; 32(5): 462-4, 2014 May.
Artículo en Inglés | MEDLINE | ID: mdl-24752080

RESUMEN

We introduce Sailfish, a computational method for quantifying the abundance of previously annotated RNA isoforms from RNA-seq data. Because Sailfish entirely avoids mapping reads, a time-consuming step in all current methods, it provides quantification estimates much faster than do existing approaches (typically 20 times faster) without loss of accuracy. By facilitating frequent reanalysis of data and reducing the need to optimize parameters, Sailfish exemplifies the potential of lightweight algorithms for efficiently processing sequencing reads.


Asunto(s)
Algoritmos , Biología Computacional/métodos , Isoformas de ARN , Análisis de Secuencia de ARN/métodos , Programas Informáticos , Química Encefálica , Humanos , Modelos Biológicos , Isoformas de ARN/análisis , Isoformas de ARN/química , Isoformas de ARN/genética
17.
BMC Cancer ; 14: 195, 2014 Mar 15.
Artículo en Inglés | MEDLINE | ID: mdl-24628780

RESUMEN

BACKGROUND: The serine-threonine kinase AKT1 plays essential roles during normal mammary gland development as well as the initiation and progression of breast cancer. AKT1 is generally considered a ubiquitously expressed gene, and its persistent activation is transcriptionally controlled by regulatory elements characteristic of housekeeping gene promoters. We recently identified a novel Akt1 transcript in mice (Akt1m), which is induced by growth factors and their signal transducers of transcription from a previously unknown promoter. The purpose of this study was to examine whether normal and neoplastic human breast epithelial cells express an orthologous AKT1m transcript and whether its expression is deregulated in cancer cells. METHODS: Initial sequence analyses were performed using the UCSC Genome Browser and GenBank to assess the potential occurrence of an AKT1m transcript variant in human cells and to identify conserved promoter sequences that are orthologous to the murine Akt1m. Quantitative RT-PCR was used to determine the transcriptional activation of AKT1m in mouse mammary tumors as well as 41 normal and neoplastic human breast epithelial cell lines and selected primary breast cancers. RESULTS: We identified four new AKT1 transcript variants in human breast cancer cells that are orthologous to the murine Akt1m and that encode the full-length kinase. These transcripts originate from an alternative promoter that is conserved between humans and mice. Akt1m is upregulated in the majority of luminal-type and basal-type mammary cancers in four different genetically engineered mouse models. Similarly, a subset of human breast cancer cell lines and primary breast cancers exhibited a higher expression of orthologous AKT1m transcripts. CONCLUSIONS: The existence of an alternative promoter that drives the expression of the unique AKT1m transcript may provide a mechanism by which the levels of AKT1 can be temporally and spatially regulated at particular physiological states, such as cancer, where a heightened activity of this kinase is required.


Asunto(s)
Proteína BRCA1/genética , Neoplasias de la Mama/genética , Regiones Promotoras Genéticas , Proteínas Proto-Oncogénicas c-akt/genética , Isoformas de ARN/análisis , Empalme Alternativo , Animales , Secuencia de Bases , Neoplasias de la Mama/patología , Línea Celular Tumoral , Secuencia Conservada , Femenino , Regulación Neoplásica de la Expresión Génica , Humanos , Neoplasias Mamarias Experimentales , Ratones , Datos de Secuencia Molecular , Proteínas Proto-Oncogénicas c-akt/metabolismo , Sitios de Empalme de ARN
18.
Drug Metab Pharmacokinet ; 29(4): 325-32, 2014.
Artículo en Inglés | MEDLINE | ID: mdl-24522200

RESUMEN

Human equilibrative nucleoside transporter 1 (hENT1) transports various nucleoside analogues into cells. Although the single hENT1 promoter region (P1) and the mRNA isoform (a1) have been characterized previously, we have recently identified additional promoter regions P2 and P3 (which primarily generate c1/2/3 mRNAs and d1/2/3/4 mRNAs, respectively) in the human liver. Therefore, this study aimed at identifying the primary hENT1 mRNA isoforms expressed in human hepatocytes, while simultaneously obtaining functional evidence of alternative hENT1 promoter usage. Our results showed that the expressions of hENT1c1, d3, and (to a lesser extent) c2 mRNAs were strikingly predominant over the other mRNA isoforms in human hepatocytes, that the abundant expression of these mRNAs was consistent with the high levels of P2 and P3 promoter activity, and that these promoters were significantly marked by transcriptionally active histone modification in hepatic cells. To summarize, our results demonstrate that, resulting from the manipulated alternative promoter usage, hENT1c1 and d3 (and c2) mRNAs are primarily expressed in human hepatocytes, which suggests that they may play important roles in controlling hENT1 expression levels in those cells. Our findings are expected to provide significant insights into the molecular machinery of hENT1 expression control.


Asunto(s)
Tranportador Equilibrativo 1 de Nucleósido/genética , Regulación de la Expresión Génica/genética , Hepatocitos/metabolismo , Regiones Promotoras Genéticas/genética , Isoformas de ARN/análisis , Línea Celular , Histonas/metabolismo , Humanos , Isoformas de ARN/biosíntesis , Isoformas de ARN/genética
19.
Nucleic Acids Res ; 42(7): 4640-51, 2014 Apr.
Artículo en Inglés | MEDLINE | ID: mdl-24464992

RESUMEN

MicroRNAs (miRNAs) are central regulators of gene expression, and a large fraction of them are encoded in introns of RNA polymerase II transcripts. Thus, the biogenesis of intronic miRNAs by the microprocessor and the splicing of their host introns by the spliceosome require coordination between these processing events. This cross-talk is addressed here. We show that key microprocessor proteins Drosha and DGCR8 as well as pre-miRNAs cosediment with supraspliceosomes, where nuclear posttranscriptional processing is executed. We further show that inhibition of splicing increases miRNAs expression, whereas knock-down of Drosha increases splicing. We identified a novel splicing event in intron 13 of MCM7, where the miR-106b-25 cluster is located. The unique splice isoform includes a hosted pre-miRNA in the extended exon and excludes its processing. This indicates a possible mechanism of altering the levels of different miRNAs originating from the same transcript. Altogether, our study indicates interplay between the splicing and microprocessor machineries within a supraspliceosome context.


Asunto(s)
MicroARNs/metabolismo , Precursores del ARN/metabolismo , Empalme del ARN , ARN Mensajero/metabolismo , Empalmosomas/metabolismo , Empalme Alternativo , Núcleo Celular/química , Citoplasma/química , Células HeLa , Humanos , Intrones , Componente 7 del Complejo de Mantenimiento de Minicromosoma/metabolismo , Piranos/farmacología , Isoformas de ARN/análisis , Ribonucleasa III/antagonistas & inhibidores , Compuestos de Espiro/farmacología
20.
Nucleic Acids Res ; 42(3): e20, 2014 Feb.
Artículo en Inglés | MEDLINE | ID: mdl-24362841

RESUMEN

Correctly estimating isoform-specific gene expression is important for understanding complicated biological mechanisms and for mapping disease susceptibility genes. However, estimating isoform-specific gene expression is challenging because various biases present in RNA-Seq (RNA sequencing) data complicate the analysis, and if not appropriately corrected, can affect isoform expression estimation and downstream analysis. In this article, we present PennSeq, a statistical method that allows each isoform to have its own non-uniform read distribution. Instead of making parametric assumptions, we give adequate weight to the underlying data by the use of a non-parametric approach. Our rationale is that regardless what factors lead to non-uniformity, whether it is due to hexamer priming bias, local sequence bias, positional bias, RNA degradation, mapping bias or other unknown reasons, the probability that a fragment is sampled from a particular region will be reflected in the aligned data. This empirical approach thus maximally reflects the true underlying non-uniform read distribution. We evaluate the performance of PennSeq using both simulated data with known ground truth, and using two real Illumina RNA-Seq data sets including one with quantitative real time polymerase chain reaction measurements. Our results indicate superior performance of PennSeq over existing methods, particularly for isoforms demonstrating severe non-uniformity. PennSeq is freely available for download at http://sourceforge.net/projects/pennseq.


Asunto(s)
Perfilación de la Expresión Génica/métodos , Isoformas de ARN/metabolismo , Análisis de Secuencia de ARN/métodos , Tejido Adiposo/metabolismo , Femenino , Humanos , Modelos Estadísticos , Análisis de Secuencia por Matrices de Oligonucleótidos , Isoformas de ARN/análisis , Estadísticas no Paramétricas
SELECCIÓN DE REFERENCIAS
DETALLE DE LA BÚSQUEDA
...