Your browser doesn't support javascript.
loading
Flexible expressed region analysis for RNA-seq with derfinder.
Collado-Torres, Leonardo; Nellore, Abhinav; Frazee, Alyssa C; Wilks, Christopher; Love, Michael I; Langmead, Ben; Irizarry, Rafael A; Leek, Jeffrey T; Jaffe, Andrew E.
Afiliação
  • Collado-Torres L; Department of Biostatistics, Johns Hopkins Bloomberg School of Public Health, Baltimore, MD 21205, USA.
  • Nellore A; Center for Computational Biology, Johns Hopkins University, Baltimore, MD 21205, USA.
  • Frazee AC; Lieber Institute for Brain Development, Johns Hopkins Medical Campus, Baltimore, MD 21205, USA.
  • Wilks C; Department of Biostatistics, Johns Hopkins Bloomberg School of Public Health, Baltimore, MD 21205, USA.
  • Love MI; Center for Computational Biology, Johns Hopkins University, Baltimore, MD 21205, USA.
  • Langmead B; Department of Computer Science, Johns Hopkins University, Baltimore, MD 21218, USA.
  • Irizarry RA; Department of Biostatistics, Johns Hopkins Bloomberg School of Public Health, Baltimore, MD 21205, USA.
  • Leek JT; Center for Computational Biology, Johns Hopkins University, Baltimore, MD 21205, USA.
  • Jaffe AE; Center for Computational Biology, Johns Hopkins University, Baltimore, MD 21205, USA.
Nucleic Acids Res ; 45(2): e9, 2017 01 25.
Article em En | MEDLINE | ID: mdl-27694310
ABSTRACT
Differential expression analysis of RNA sequencing (RNA-seq) data typically relies on reconstructing transcripts or counting reads that overlap known gene structures. We previously introduced an intermediate statistical approach called differentially expressed region (DER) finder that seeks to identify contiguous regions of the genome showing differential expression signal at single base resolution without relying on existing annotation or potentially inaccurate transcript assembly.We present the derfinder software that improves our annotation-agnostic approach to RNA-seq analysis by (i) implementing a computationally efficient bump-hunting approach to identify DERs that permits genome-scale analyses in a large number of samples, (ii) introducing a flexible statistical modeling framework, including multi-group and time-course analyses and (iii) introducing a new set of data visualizations for expressed region analysis. We apply this approach to public RNA-seq data from the Genotype-Tissue Expression (GTEx) project and BrainSpan project to show that derfinder permits the analysis of hundreds of samples at base resolution in R, identifies expression outside of known gene boundaries and can be used to visualize expressed regions at base-resolution. In simulations, our base resolution approaches enable discovery in the presence of incomplete annotation and is nearly as powerful as feature-level methods when the annotation is complete.derfinder analysis using expressed region-level and single base-level approaches provides a compromise between full transcript reconstruction and feature-level analysis. The package is available from Bioconductor at www.bioconductor.org/packages/derfinder.
Assuntos

Texto completo: 1 Coleções: 01-internacional Base de dados: MEDLINE Assunto principal: Software / Perfilação da Expressão Gênica Idioma: En Revista: Nucleic Acids Res Ano de publicação: 2017 Tipo de documento: Article País de afiliação: Estados Unidos

Texto completo: 1 Coleções: 01-internacional Base de dados: MEDLINE Assunto principal: Software / Perfilação da Expressão Gênica Idioma: En Revista: Nucleic Acids Res Ano de publicação: 2017 Tipo de documento: Article País de afiliação: Estados Unidos