Your browser doesn't support javascript.
loading
Mostrar: 20 | 50 | 100
Resultados 1 - 20 de 47
Filtrar
Más filtros

Banco de datos
Tipo del documento
Intervalo de año de publicación
1.
Brief Bioinform ; 25(4)2024 May 23.
Artículo en Inglés | MEDLINE | ID: mdl-38960406

RESUMEN

Spatial transcriptomics data play a crucial role in cancer research, providing a nuanced understanding of the spatial organization of gene expression within tumor tissues. Unraveling the spatial dynamics of gene expression can unveil key insights into tumor heterogeneity and aid in identifying potential therapeutic targets. However, in many large-scale cancer studies, spatial transcriptomics data are limited, with bulk RNA-seq and corresponding Whole Slide Image (WSI) data being more common (e.g. TCGA project). To address this gap, there is a critical need to develop methodologies that can estimate gene expression at near-cell (spot) level resolution from existing WSI and bulk RNA-seq data. This approach is essential for reanalyzing expansive cohort studies and uncovering novel biomarkers that have been overlooked in the initial assessments. In this study, we present STGAT (Spatial Transcriptomics Graph Attention Network), a novel approach leveraging Graph Attention Networks (GAT) to discern spatial dependencies among spots. Trained on spatial transcriptomics data, STGAT is designed to estimate gene expression profiles at spot-level resolution and predict whether each spot represents tumor or non-tumor tissue, especially in patient samples where only WSI and bulk RNA-seq data are available. Comprehensive tests on two breast cancer spatial transcriptomics datasets demonstrated that STGAT outperformed existing methods in accurately predicting gene expression. Further analyses using the TCGA breast cancer dataset revealed that gene expression estimated from tumor-only spots (predicted by STGAT) provides more accurate molecular signatures for breast cancer sub-type and tumor stage prediction, and also leading to improved patient survival and disease-free analysis. Availability: Code is available at https://github.com/compbiolabucf/STGAT.


Asunto(s)
Perfilación de la Expresión Génica , RNA-Seq , Transcriptoma , Humanos , RNA-Seq/métodos , Perfilación de la Expresión Génica/métodos , Neoplasias de la Mama/genética , Neoplasias de la Mama/metabolismo , Regulación Neoplásica de la Expresión Génica , Biología Computacional/métodos , Femenino , Biomarcadores de Tumor/genética , Biomarcadores de Tumor/metabolismo
2.
Cell ; 146(3): 384-95, 2011 Aug 05.
Artículo en Inglés | MEDLINE | ID: mdl-21816274

RESUMEN

The SMN complex mediates the assembly of heptameric Sm protein rings on small nuclear RNAs (snRNAs), which are essential for snRNP function. Specific Sm core assembly depends on Sm proteins and snRNA recognition by SMN/Gemin2- and Gemin5-containing subunits, respectively. The mechanism by which the Sm proteins are gathered while preventing illicit Sm assembly on non-snRNAs is unknown. Here, we describe the 2.5 Å crystal structure of Gemin2 bound to SmD1/D2/F/E/G pentamer and SMN's Gemin2-binding domain, a key assembly intermediate. Remarkably, through its extended conformation, Gemin2 wraps around the crescent-shaped pentamer, interacting with all five Sm proteins, and gripping its bottom and top sides and outer perimeter. Gemin2 reaches into the RNA-binding pocket, preventing RNA binding. Interestingly, SMN-Gemin2 interaction is abrogated by a spinal muscular atrophy (SMA)-causing mutation in an SMN helix that mediates Gemin2 binding. These findings provide insight into SMN complex assembly and specificity, linking snRNP biogenesis and SMA pathogenesis.


Asunto(s)
Proteínas del Tejido Nervioso/metabolismo , Proteínas de Unión al ARN/metabolismo , Ribonucleoproteínas Nucleares Pequeñas/metabolismo , Proteínas del Complejo SMN/metabolismo , Secuencia de Aminoácidos , Animales , Cristalografía por Rayos X , Humanos , Modelos Moleculares , Datos de Secuencia Molecular , Atrofia Muscular Espinal/genética , Atrofia Muscular Espinal/metabolismo , Mutación , Proteínas del Tejido Nervioso/genética , Proteínas de Unión al ARN/genética , Alineación de Secuencia
3.
Brief Bioinform ; 24(1)2023 01 19.
Artículo en Inglés | MEDLINE | ID: mdl-36513375

RESUMEN

Type 1 diabetes (T1D) outcome prediction plays a vital role in identifying novel risk factors, ensuring early patient care and designing cohort studies. TEDDY is a longitudinal cohort study that collects a vast amount of multi-omics and clinical data from its participants to explore the progression and markers of T1D. However, missing data in the omics profiles make the outcome prediction a difficult task. TEDDY collected time series gene expression for less than 6% of enrolled participants. Additionally, for the participants whose gene expressions are collected, 79% time steps are missing. This study introduces an advanced bioinformatics framework for gene expression imputation and islet autoimmunity (IA) prediction. The imputation model generates synthetic data for participants with partially or entirely missing gene expression. The prediction model integrates the synthetic gene expression with other risk factors to achieve better predictive performance. Comprehensive experiments on TEDDY datasets show that: (1) Our pipeline can effectively integrate synthetic gene expression with family history, HLA genotype and SNPs to better predict IA status at 2 years (sensitivity 0.622, AUC 0.715) compared with the individual datasets and state-of-the-art results in the literature (AUC 0.682). (2) The synthetic gene expression contains predictive signals as strong as the true gene expression, reducing reliance on expensive and long-term longitudinal data collection. (3) Time series gene expression is crucial to the proposed improvement and shows significantly better predictive ability than cross-sectional gene expression. (4) Our pipeline is robust to limited data availability. Availability: Code is available at https://github.com/compbiolabucf/TEDDY.


Asunto(s)
Diabetes Mellitus Tipo 1 , Islotes Pancreáticos , Humanos , Diabetes Mellitus Tipo 1/genética , Autoinmunidad/genética , Estudios Longitudinales , Factores de Tiempo , Estudios Transversales , Predisposición Genética a la Enfermedad , Expresión Génica
4.
Cell ; 138(2): 328-39, 2009 Jul 23.
Artículo en Inglés | MEDLINE | ID: mdl-19632182

RESUMEN

Here we identify a component of the nuclear RNA cap-binding complex (CBC), Ars2, that is important for miRNA biogenesis and critical for cell proliferation. Unlike other components of the CBC, Ars2 expression is linked to the proliferative state of the cell. Deletion of Ars2 is developmentally lethal, and deletion in adult mice led to bone marrow failure whereas parenchymal organs composed of nonproliferating cells were unaffected. Depletion of Ars2 or CBP80 from proliferating cells impaired miRNA-mediated repression and led to alterations in primary miRNA processing in the nucleus. Ars2 depletion also reduced the levels of several miRNAs, including miR-21, let-7, and miR-155, that are implicated in cellular transformation. These findings provide evidence for a role for Ars2 in RNA interference regulation during cell proliferation.


Asunto(s)
Proliferación Celular , Complejo Proteico Nuclear de Unión a la Caperuza/metabolismo , Proteínas Nucleares/metabolismo , Interferencia de ARN , Animales , Arsénico/toxicidad , Línea Celular , Guanosina/análogos & derivados , Guanosina/metabolismo , Humanos , Ratones , MicroARNs
5.
Nucleic Acids Res ; 50(3): 1465-1483, 2022 02 22.
Artículo en Inglés | MEDLINE | ID: mdl-35037047

RESUMEN

p53-binding protein 1 (53BP1) regulates the DNA double-strand break (DSB) repair pathway and maintains genomic integrity. Here we found that 53BP1 functions as a molecular scaffold for the nucleoside diphosphate kinase-mediated phosphorylation of ATP-citrate lyase (ACLY) which enhances the ACLY activity. This functional association is critical for promoting global histone acetylation and subsequent transcriptome-wide alterations in gene expression. Specifically, expression of a replication-dependent histone biogenesis factor, stem-loop binding protein (SLBP), is dependent upon 53BP1-ACLY-controlled acetylation at the SLBP promoter. This chain of regulation events carried out by 53BP1, ACLY, and SLBP is crucial for both quantitative and qualitative histone biogenesis as well as for the preservation of genomic integrity. Collectively, our findings reveal a previously unknown role for 53BP1 in coordinating replication-dependent histone biogenesis and highlight a DNA repair-independent function in the maintenance of genomic stability through a regulatory network that includes ACLY and SLBP.


Asunto(s)
ATP Citrato (pro-S)-Liasa , Histonas , ATP Citrato (pro-S)-Liasa/genética , ATP Citrato (pro-S)-Liasa/metabolismo , Acetilación , Roturas del ADN de Doble Cadena , Reparación del ADN , Histonas/genética , Histonas/metabolismo , Proteína 1 de Unión al Supresor Tumoral P53/metabolismo
6.
Brief Bioinform ; 22(6)2021 11 05.
Artículo en Inglés | MEDLINE | ID: mdl-34279571

RESUMEN

Deregulation of gene expression is associated with the pathogenesis of numerous human diseases including cancer. Current data analyses on gene expression are mostly focused on differential gene/transcript expression in big data-driven studies. However, a poor connection to the proteome changes is a widespread problem in current data analyses. This is partly due to the complexity of gene regulatory pathways at the post-transcriptional level. In this study, we overcome these limitations and introduce a graph-based learning model, PTNet, which simulates the microRNAs (miRNAs) that regulate gene expression post-transcriptionally in silico. Our model does not require large-scale proteomics studies to measure the protein expression and can successfully predict the protein levels by considering the miRNA-mRNA interaction network, the mRNA expression, and the miRNA expression. Large-scale experiments on simulations and real cancer high-throughput datasets using PTNet validated that (i) the miRNA-mediated interaction network affects the abundance of corresponding proteins and (ii) the predicted protein expression has a higher correlation with the proteomics data (ground-truth) than the mRNA expression data. The classification performance also shows that the predicted protein expression has an improved prediction power on cancer outcomes compared to the prediction done by the mRNA expression data only or considering both mRNA and miRNA. Availability: PTNet toolbox is available at http://github.com/CompbioLabUCF/PTNet.


Asunto(s)
Redes Reguladoras de Genes , MicroARNs/genética , Neoplasias/genética , Algoritmos , Simulación por Computador , Conjuntos de Datos como Asunto , Humanos , Proteómica
7.
BMC Bioinformatics ; 23(Suppl 3): 396, 2022 Sep 28.
Artículo en Inglés | MEDLINE | ID: mdl-36171568

RESUMEN

BACKGROUND: The eukaryotic genome is capable of producing multiple isoforms from a gene by alternative polyadenylation (APA) during pre-mRNA processing. APA in the 3'-untranslated region (3'-UTR) of mRNA produces transcripts with shorter or longer 3'-UTR. Often, 3'-UTR serves as a binding platform for microRNAs and RNA-binding proteins, which affect the fate of the mRNA transcript. Thus, 3'-UTR APA is known to modulate translation and provides a mean to regulate gene expression at the post-transcriptional level. Current bioinformatics pipelines have limited capability in profiling 3'-UTR APA events due to incomplete annotations and a low-resolution analyzing power: widely available bioinformatics pipelines do not reference actionable polyadenylation (cleavage) sites but simulate 3'-UTR APA only using RNA-seq read coverage, causing false positive identifications. To overcome these limitations, we developed APA-Scan, a robust program that identifies 3'-UTR APA events and visualizes the RNA-seq short-read coverage with gene annotations. METHODS: APA-Scan utilizes either predicted or experimentally validated actionable polyadenylation signals as a reference for polyadenylation sites and calculates the quantity of long and short 3'-UTR transcripts in the RNA-seq data. APA-Scan works in three major steps: (i) calculate the read coverage of the 3'-UTR regions of genes; (ii) identify the potential APA sites and evaluate the significance of the events among two biological conditions; (iii) graphical representation of user specific event with 3'-UTR annotation and read coverage on the 3'-UTR regions. APA-Scan is implemented in Python3. Source code and a comprehensive user's manual are freely available at https://github.com/compbiolabucf/APA-Scan . RESULT: APA-Scan was applied to both simulated and real RNA-seq datasets and compared with two widely used baselines DaPars and APAtrap. In simulation APA-Scan significantly improved the accuracy of 3'-UTR APA identification compared to the other baselines. The performance of APA-Scan was also validated by 3'-end-seq data and qPCR on mouse embryonic fibroblast cells. The experiments confirm that APA-Scan can detect unannotated 3'-UTR APA events and improve genome annotation. CONCLUSION: APA-Scan is a comprehensive computational pipeline to detect transcriptome-wide 3'-UTR APA events. The pipeline integrates both RNA-seq and 3'-end-seq data information and can efficiently identify the significant events with a high-resolution short reads coverage plots.


Asunto(s)
MicroARNs , Poliadenilación , Regiones no Traducidas 3'/genética , Animales , Fibroblastos/metabolismo , Ratones , MicroARNs/metabolismo , Isoformas de Proteínas/genética , Precursores del ARN/metabolismo , ARN Mensajero/genética , ARN Mensajero/metabolismo , RNA-Seq
8.
Bioinformatics ; 38(1): 179-186, 2021 12 22.
Artículo en Inglés | MEDLINE | ID: mdl-34415323

RESUMEN

MOTIVATION: Accurate disease phenotype prediction plays an important role in the treatment of heterogeneous diseases like cancer in the era of precision medicine. With the advent of high throughput technologies, more comprehensive multi-omics data is now available that can effectively link the genotype to phenotype. However, the interactive relation of multi-omics datasets makes it particularly challenging to incorporate different biological layers to discover the coherent biological signatures and predict phenotypic outcomes. In this study, we introduce omicsGAN, a generative adversarial network model to integrate two omics data and their interaction network. The model captures information from the interaction network as well as the two omics datasets and fuse them to generate synthetic data with better predictive signals. RESULTS: Large-scale experiments on The Cancer Genome Atlas breast cancer, lung cancer and ovarian cancer datasets validate that (i) the model can effectively integrate two omics data (e.g. mRNA and microRNA expression data) and their interaction network (e.g. microRNA-mRNA interaction network). The synthetic omics data generated by the proposed model has a better performance on cancer outcome classification and patients survival prediction compared to original omics datasets. (ii) The integrity of the interaction network plays a vital role in the generation of synthetic data with higher predictive quality. Using a random interaction network does not allow the framework to learn meaningful information from the omics datasets; therefore, results in synthetic data with weaker predictive signals. AVAILABILITY AND IMPLEMENTATION: Source code is available at: https://github.com/CompbioLabUCF/omicsGAN. SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online.


Asunto(s)
Neoplasias Pulmonares , MicroARNs , Humanos , Multiómica , Programas Informáticos , Genoma , MicroARNs/genética
9.
PLoS Comput Biol ; 17(4): e1008218, 2021 04.
Artículo en Inglés | MEDLINE | ID: mdl-33826608

RESUMEN

High-throughput spatial-transcriptomics RNA sequencing (sptRNA-seq) based on in-situ capturing technologies has recently been developed to spatially resolve transcriptome-wide mRNA expressions mapped to the captured locations in a tissue sample. Due to the low RNA capture efficiency by in-situ capturing and the complication of tissue section preparation, sptRNA-seq data often only provides an incomplete profiling of the gene expressions over the spatial regions of the tissue. In this paper, we introduce a graph-regularized tensor completion model for imputing the missing mRNA expressions in sptRNA-seq data, namely FIST, Fast Imputation of Spatially-resolved transcriptomes by graph-regularized Tensor completion. We first model sptRNA-seq data as a 3-way sparse tensor in genes (p-mode) and the (x, y) spatial coordinates (x-mode and y-mode) of the observed gene expressions, and then consider the imputation of the unobserved entries or fibers as a tensor completion problem in Canonical Polyadic Decomposition (CPD) form. To improve the imputation of highly sparse sptRNA-seq data, we also introduce a protein-protein interaction network to add prior knowledge of gene functions, and a spatial graph to capture the the spatial relations among the capture spots. The tensor completion model is then regularized by a Cartesian product graph of protein-protein interaction network and the spatial graph to capture the high-order relations in the tensor. In the experiments, FIST was tested on ten 10x Genomics Visium spatial transcriptomic datasets of different tissue sections with cross-validation among the known entries in the imputation. FIST significantly outperformed the state-of-the-art methods for single-cell RNAseq data imputation. We also demonstrate that both the spatial graph and PPI network play an important role in improving the imputation. In a case study, we further analyzed the gene clusters obtained from the imputed gene expressions to show that the imputations by FIST indeed capture the spatial characteristics in the gene expressions and reveal functions that are highly relevant to three different kinds of tissues in mouse kidney.


Asunto(s)
Transcriptoma , Algoritmos , Animales , Conjuntos de Datos como Asunto , Perfilación de la Expresión Génica , Riñón/metabolismo , Ratones , Análisis de Secuencia de ARN/métodos
10.
Int J Mol Sci ; 23(20)2022 Oct 17.
Artículo en Inglés | MEDLINE | ID: mdl-36293270

RESUMEN

The mammalian target of rapamycin (mTOR) pathway is crucial in energy metabolism and cell proliferation. Previously, we reported transcriptome-wide 3'-untranslated region (UTR) shortening by alternative polyadenylation upon mTOR activation and its impact on the proteome. Here, we further interrogated the mTOR-activated transcriptome and found that hyperactivation of mTOR promotes transcriptome-wide exon skipping/exclusion, producing short isoform transcripts from genes. This widespread exon skipping confers multifarious regulations in the mTOR-controlled functional proteomics: AS in coding regions widely affects the protein length and functional domains. They also alter the half-life of proteins and affect the regulatory post-translational modifications. Among the RNA processing factors differentially regulated by mTOR signaling, we found that SRSF3 mechanistically facilitates exon skipping in the mTOR-activated transcriptome. This study reveals a role of mTOR in AS regulation and demonstrates that widespread AS is a multifaceted modulator of the mTOR-regulated functional proteome.


Asunto(s)
Empalme Alternativo , Transcriptoma , Proteoma/genética , Serina-Treonina Quinasas TOR/genética , Isoformas de Proteínas/genética , Regiones no Traducidas
11.
Bioinformatics ; 36(8): 2466-2473, 2020 04 15.
Artículo en Inglés | MEDLINE | ID: mdl-31834359

RESUMEN

MOTIVATION: Accurate estimation of transcript isoform abundance is critical for downstream transcriptome analyses and can lead to precise molecular mechanisms for understanding complex human diseases, like cancer. Simplex mRNA Sequencing (RNA-Seq) based isoform quantification approaches are facing the challenges of inherent sampling bias and unidentifiable read origins. A large-scale experiment shows that the consistency between RNA-Seq and other mRNA quantification platforms is relatively low at the isoform level compared to the gene level. In this project, we developed a platform-integrated model for transcript quantification (IntMTQ) to improve the performance of RNA-Seq on isoform expression estimation. IntMTQ, which benefits from the mRNA expressions reported by the other platforms, provides more precise RNA-Seq-based isoform quantification and leads to more accurate molecular signatures for disease phenotype prediction. RESULTS: In the experiments to assess the quality of isoform expression estimated by IntMTQ, we designed three tasks for clustering and classification of 46 cancer cell lines with four different mRNA quantification platforms, including newly developed NanoString's nCounter technology. The results demonstrate that the isoform expressions learned by IntMTQ consistently provide more and better molecular features for downstream analyses compared with five baseline algorithms which consider RNA-Seq data only. An independent RT-qPCR experiment on seven genes in twelve cancer cell lines showed that the IntMTQ improved overall transcript quantification. The platform-integrated algorithms could be applied to large-scale cancer studies, such as The Cancer Genome Atlas (TCGA), with both RNA-Seq and array-based platforms available. AVAILABILITY AND IMPLEMENTATION: Source code is available at: https://github.com/CompbioLabUcf/IntMTQ. SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online.


Asunto(s)
Isoformas de ARN , Programas Informáticos , Algoritmos , Perfilación de la Expresión Génica , Humanos , Isoformas de Proteínas/genética , Isoformas de ARN/genética , ARN Mensajero/genética , Análisis de Secuencia de ARN
12.
Nucleic Acids Res ; 47(19): 10373-10387, 2019 11 04.
Artículo en Inglés | MEDLINE | ID: mdl-31504847

RESUMEN

U2 auxiliary factor 1 (U2AF1) functions in 3'-splice site selection during pre-mRNA processing. Alternative usage of duplicated tandem exons in U2AF1 produces two isoforms, U2AF1a and U2AF1b, but their functional differences are unappreciated due to their homology. Through integrative approaches of genome editing, customized-transcriptome profiling and crosslinking-mediated interactome analyses, we discovered that the expression of U2AF1 isoforms is controlled by mTOR and they exhibit a distinctive molecular profile for the splice site and protein interactomes. Mechanistic dissection of mutually exclusive alternative splicing events revealed that U2AF1 isoforms' inherent differential preferences of nucleotide sequences and their stoichiometry determine the 3'-splice site. Importantly, U2AF1a-driven transcriptomes feature alternative splicing events in the 5'-untranslated region (5'-UTR) that are favorable for translation. These findings unveil distinct roles of duplicated tandem exon-derived U2AF1 isoforms in the regulation of the transcriptome and suggest U2AF1a-driven 5'-UTR alternative splicing as a molecular mechanism of mTOR-regulated translational control.


Asunto(s)
Empalme Alternativo/genética , Biosíntesis de Proteínas , Factor de Empalme U2AF/genética , Serina-Treonina Quinasas TOR/genética , Animales , Secuencia de Bases/genética , Exones/genética , Células HeLa , Humanos , Ratones , Sitios de Empalme de ARN/genética , Empalme del ARN/genética , Transcriptoma/genética
13.
Int J Mol Sci ; 22(18)2021 Sep 07.
Artículo en Inglés | MEDLINE | ID: mdl-34575842

RESUMEN

Microbes and viruses are known to alter host transcriptomes by means of infection. In light of recent challenges posed by the COVID-19 pandemic, a deeper understanding of the disease at the transcriptome level is needed. However, research about transcriptome reprogramming by post-transcriptional regulation is very limited. In this study, computational methods developed by our lab were applied to RNA-seq data to detect transcript variants (i.e., alternative splicing (AS) and alternative polyadenylation (APA) events). The RNA-seq data were obtained from a publicly available source, and they consist of mock-treated and SARS-CoV-2 infected (COVID-19) lung alveolar (A549) cells. Data analysis results show that more AS events are found in SARS-CoV-2 infected cells than in mock-treated cells, whereas fewer APA events are detected in SARS-CoV-2 infected cells. A combination of conventional differential gene expression analysis and transcript variants analysis revealed that most of the genes with transcript variants are not differentially expressed. This indicates that no strong correlation exists between differential gene expression and the AS/APA events in the mock-treated or SARS-CoV-2 infected samples. These genes with transcript variants can be applied as another layer of molecular signatures for COVID-19 studies. In addition, the transcript variants are enriched in important biological pathways that were not detected in the studies that only focused on differential gene expression analysis. Therefore, the pathways may lead to new molecular mechanisms of SARS-CoV-2 pathogenesis.


Asunto(s)
COVID-19/virología , Regulación Viral de la Expresión Génica , Genes Virales , SARS-CoV-2/genética , Transcriptoma/genética , Células A549 , Humanos
14.
Int J Mol Sci ; 22(9)2021 Apr 25.
Artículo en Inglés | MEDLINE | ID: mdl-33922891

RESUMEN

(1) Background: A simplistic understanding of the central dogma falls short in correlating the number of genes in the genome to the number of proteins in the proteome. Post-transcriptional alternative splicing contributes to the complexity of the proteome and is critical in understanding gene expression. mRNA-sequencing (RNA-seq) has been widely used to study the transcriptome and provides opportunity to detect alternative splicing events among different biological conditions. Despite the popularity of studying transcriptome variants with RNA-seq, few efficient and user-friendly bioinformatics tools have been developed for the genome-wide detection and visualization of alternative splicing events. (2) Results: We propose AS-Quant, (Alternative Splicing Quantitation), a robust program to identify alternative splicing events from RNA-seq data. We then extended AS-Quant to visualize the splicing events with short-read coverage plots along with complete gene annotation. The tool works in three major steps: (i) calculate the read coverage of the potential spliced exons and the corresponding gene; (ii) categorize the events into five different categories according to the annotation, and assess the significance of the events between two biological conditions; (iii) generate the short reads coverage plot for user specified splicing events. Our extensive experiments on simulated and real datasets demonstrate that AS-Quant outperforms the other three widely used baselines, SUPPA2, rMATS, and diffSplice for detecting alternative splicing events. Moreover, the significant alternative splicing events identified by AS-Quant between two biological contexts were validated by RT-PCR experiment. (3) Availability: AS-Quant is implemented in Python 3.0. Source code and a comprehensive user's manual are freely available online.


Asunto(s)
Empalme Alternativo , Análisis de Secuencia de ARN/métodos , Programas Informáticos , Animales , Biología Computacional/métodos , Visualización de Datos , Exones , Fibroblastos/citología , Fibroblastos/fisiología , Ratones , Anotación de Secuencia Molecular
15.
BMC Genomics ; 21(1): 272, 2020 Mar 30.
Artículo en Inglés | MEDLINE | ID: mdl-32228441

RESUMEN

BACKGROUND: Most eukaryotic genes produce different transcripts of multiple isoforms by inclusion or exclusion of particular exons. The isoforms of a gene often play diverse functional roles, and thus it is necessary to accurately measure isoform expressions as well as gene expressions. While previous studies have demonstrated the strong agreement between mRNA sequencing (RNA-seq) and array-based gene and/or isoform quantification platforms (Microarray gene expression and Exon-array), the more recently developed NanoString platform has not been systematically evaluated and compared, especially in large-scale studies across different cancer domains. RESULTS: In this paper, we present a large-scale comparative study among RNA-seq, NanoString, array-based, and RT-qPCR platforms using 46 cancer cell lines across different cancer types. The goal is to understand and evaluate the calibers of the platforms for measuring gene and isoform expressions in cancer studies. We first performed NanoString experiments on 59 cancer cell lines with 404 custom-designed probes for measuring the expressions of 478 isoforms in 155 genes, and additional RT-qPCR experiments for a subset of the measured isoforms in 13 cell lines. We then combined the data with the matched RNA-seq, Exon-array, and Microarray data of 46 of the 59 cell lines for the comparative analysis. CONCLUSION: In the comparisons of the platforms for measuring the expressions at both isoform and gene levels, we found that (1) the agreement on isoform expressions is lower than the agreement on gene expressions across the four platforms; (2) NanoString and Exon-array are not consistent on isoform quantification even though both techniques are based on hybridization reactions; (3) RT-qPCR experiments are more consistent with RNA-seq and Exon-array than NanoString in isoform quantification; (4) different RNA-seq isoform quantification methods show varying estimation results, and among the methods, Net-RSTQ and eXpress are more consistent across the platforms; and (5) RNA-seq has the best overall consistency with the other platforms on gene expression quantification.


Asunto(s)
Perfilación de la Expresión Génica/métodos , Algoritmos , Exones/genética , Exones/fisiología , Humanos , Análisis de Secuencia por Matrices de Oligonucleótidos/métodos , Isoformas de Proteínas/genética , Isoformas de Proteínas/metabolismo , Análisis de Secuencia de ARN/métodos , Programas Informáticos
16.
Mol Cell ; 45(1): 87-98, 2012 Jan 13.
Artículo en Inglés | MEDLINE | ID: mdl-22244333

RESUMEN

Ars2 is a component of the nuclear cap-binding complex that contributes to microRNA biogenesis and is required for cellular proliferation. Here, we expand on the repertoire of Ars2-dependent microRNAs and determine that Ars2 regulates a number of mRNAs, the largest defined subset of which code for histones. Histone mRNAs are unique among mammalian mRNAs because they are not normally polyadenylated but, rather, are cleaved following a 3' stem loop. A significant reduction in correctly processed histone mRNAs was observed following Ars2 depletion, concurrent with an increase in polyadenylated histone transcripts. Furthermore, Ars2 physically associated with histone mRNAs and the noncoding RNA 7SK. Knockdown of 7SK led to an enhanced ratio of cleaved to polyadenylated histone transcripts, an effect dependent on Ars2. Together, the data demonstrate that Ars2 contributes to histone mRNA 3' end formation and expression and these functional properties of Ars2 are negatively regulated by interaction with 7SK RNA.


Asunto(s)
Histonas/genética , Proteínas Nucleares/fisiología , Procesamiento de Término de ARN 3' , ARN Mensajero/metabolismo , Células HeLa , Humanos , Metiltransferasas/antagonistas & inhibidores , Metiltransferasas/fisiología , MicroARNs/metabolismo , Proteínas Nucleares/antagonistas & inhibidores , Proteínas Nucleares/genética , Interferencia de ARN , ARN Interferente Pequeño
17.
Nucleic Acids Res ; 46(12): 5996-6008, 2018 07 06.
Artículo en Inglés | MEDLINE | ID: mdl-29733382

RESUMEN

3'-untranslated regions (UTRs) can vary through the use of alternative polyadenylation sites during pre-mRNA processing. Multiple publically available pipelines combining high profiling technologies and bioinformatics tools have been developed to catalog changes in 3'-UTR lengths. In our recent RNA-seq experiments using cells with hyper-activated mammalian target of rapamycin (mTOR), we found that cellular mTOR activation leads to transcriptome-wide alternative polyadenylation (APA), resulting in the activation of multiple cellular pathways. Here, we developed a novel bioinformatics algorithm, IntMAP, which integrates RNA-Seq and PolyA Site (PAS)-Seq data for a comprehensive characterization of APA events. By applying IntMAP to the datasets from cells with hyper-activated mTOR, we identified novel APA events that could otherwise not be identified by either profiling method alone. Several transcription factors including Cebpg (CCAAT/enhancer binding protein gamma) were among the newly discovered APA transcripts, indicating that diverse transcriptional networks may be regulated by mTOR-coordinated APA. The prevention of APA in Cebpg using the CRISPR/cas9-mediated genome editing tool showed that mTOR-driven 3'-UTR shortening in Cebpg is critical in protecting cells from endoplasmic reticulum (ER) stress. Taken together, we present IntMAP as a new bioinformatics algorithm for APA analysis by which we expand our understanding of the physiological role of mTOR-coordinated APA events to ER stress response. IntMAP toolbox is available at http://compbio.cs.umn.edu/IntMAP/.


Asunto(s)
Algoritmos , Estrés del Retículo Endoplásmico/genética , Poliadenilación , Serina-Treonina Quinasas TOR/metabolismo , Regiones no Traducidas 3' , Animales , Proteínas Potenciadoras de Unión a CCAAT/biosíntesis , Proteínas Potenciadoras de Unión a CCAAT/genética , Células Cultivadas , Ratones
18.
Mol Cell ; 38(4): 551-62, 2010 May 28.
Artículo en Inglés | MEDLINE | ID: mdl-20513430

RESUMEN

The SMN complex assembles Sm cores on snRNAs, a key step in the biogenesis of snRNPs, the spliceosome's major components. Here, using SMN complex inhibitors identified by high-throughput screening and a ribo-proteomic strategy on formaldehyde crosslinked RNPs, we dissected this pathway in cells. We show that protein synthesis inhibition impairs the SMN complex, revealing discrete SMN and Gemin subunits and accumulating an snRNA precursor (pre-snRNA)-Gemin5 intermediate. By high-throughput sequencing of this transient intermediate's RNAs, we discovered the previously undetectable precursors of all the snRNAs and identified their Gemin5-binding sites. We demonstrate that pre-snRNA 3' sequences function to enhance snRNP biogenesis. The SMN complex is also inhibited by oxidation, and we show that it stalls an inventory-complete SMN complex containing pre-snRNAs. We propose a stepwise pathway of SMN complex formation and snRNP biogenesis, highlighting Gemin5's function in delivering pre-snRNAs as substrates for Sm core assembly and processing.


Asunto(s)
Precursores de Ácido Nucleico/metabolismo , ARN Nuclear Pequeño/metabolismo , Ribonucleoproteínas Nucleares Pequeñas/genética , Proteínas del Complejo SMN/metabolismo , Sitios de Unión , Células Cultivadas , Células HeLa , Humanos , Reacción en Cadena de la Polimerasa de Transcriptasa Inversa , Ribonucleoproteínas Nucleares Pequeñas/metabolismo
19.
Mol Cell ; 37(5): 668-78, 2010 Mar 12.
Artículo en Inglés | MEDLINE | ID: mdl-20227371

RESUMEN

The specific molecular events that characterize the intrinsic apoptosis pathway have been the subject of intense research due to the pathway's fundamental role in development, homeostasis, and cancer. This pathway is defined by the release of cytochrome c from mitochondria into the cytosol and subsequent binding of cytochrome c to the caspase activator Apaf-1. Here, we report that both mitochondrial and cytosolic transfer RNA (tRNA) bind to cytochrome c. This binding prevents cytochrome c interaction with Apaf-1, blocking Apaf-1 oligomerization and caspase activation. tRNA hydrolysis in living cells and cell lysates enhances apoptosis and caspase activation, whereas microinjection of tRNA into living cells blocks apoptosis. These findings suggest that tRNA, in addition to its well-established role in gene expression, may determine cellular responsiveness to apoptotic stimuli.


Asunto(s)
Apoptosis , Inhibidores de Caspasas , Citocromos c/metabolismo , ARN de Hongos/metabolismo , ARN de Transferencia/metabolismo , Animales , Apoptosis/efectos de los fármacos , Factor Apoptótico 1 Activador de Proteasas/metabolismo , Caspasa 3/metabolismo , Caspasa 9/metabolismo , Bovinos , Supervivencia Celular , Citosol/enzimología , Doxorrubicina/farmacología , Activación Enzimática , Células HeLa , Humanos , Hidrólisis , Células Jurkat , Microinyecciones , Mitocondrias/enzimología , Unión Proteica , Proteínas Recombinantes/metabolismo , Ribonucleasa Pancreática/metabolismo , Ribonucleasas/metabolismo , Transfección
20.
J Biol Chem ; 291(19): 10426-36, 2016 May 06.
Artículo en Inglés | MEDLINE | ID: mdl-26961879

RESUMEN

The intrinsic apoptosis pathway occurs through the release of mitochondrial cytochrome c to the cytosol, where it promotes activation of the caspase family of proteases. The observation that tRNA binds to cytochrome c revealed a previously unexpected mode of apoptotic regulation. However, the molecular characteristics of this interaction, and its impact on each interaction partner, are not well understood. Using a novel fluorescence assay, we show here that cytochrome c binds to tRNA with an affinity comparable with other tRNA-protein binding interactions and with a molecular ratio of ∼3:1. Cytochrome c recognizes the tertiary structural features of tRNA, particularly in the core region. This binding is independent of the charging state of tRNA but is regulated by the redox state of cytochrome c. Compared with reduced cytochrome c, oxidized cytochrome c binds to tRNA with a weaker affinity, which correlates with its stronger pro-apoptotic activity. tRNA binding both facilitates cytochrome c reduction and inhibits the peroxidase activity of cytochrome c, which is involved in its release from mitochondria. Together, these findings provide new insights into the cytochrome c-tRNA interaction and apoptotic regulation.


Asunto(s)
Citocromos c/química , ARN de Transferencia/química , Proteínas de Saccharomyces cerevisiae/química , Saccharomyces cerevisiae/enzimología , Animales , Apoptosis/fisiología , Bovinos , Citocromos c/genética , Citocromos c/metabolismo , Humanos , Unión Proteica , Estructura Cuaternaria de Proteína , ARN de Transferencia/genética , ARN de Transferencia/metabolismo , Saccharomyces cerevisiae/genética , Proteínas de Saccharomyces cerevisiae/genética , Proteínas de Saccharomyces cerevisiae/metabolismo
SELECCIÓN DE REFERENCIAS
DETALLE DE LA BÚSQUEDA