Your browser doesn't support javascript.
loading
Outlier detection for improved differential splicing quantification from RNA-Seq experiments with replicates.
Norton, Scott S; Vaquero-Garcia, Jorge; Lahens, Nicholas F; Grant, Gregory R; Barash, Yoseph.
Afiliação
  • Norton SS; Department of Genetics, Perelman School of Medicine.
  • Vaquero-Garcia J; Department of Genetics, Perelman School of Medicine.
  • Lahens NF; Department of Computer and Information Science, School of Engineering.
  • Grant GR; Institute for Translational Medicine and Therapeutics, Perelman School of Medicine, University of Pennsylvania, Philadelphia, PA 19104, USA.
  • Barash Y; Department of Genetics, Perelman School of Medicine.
Bioinformatics ; 34(9): 1488-1497, 2018 05 01.
Article em En | MEDLINE | ID: mdl-29236961
ABSTRACT
Motivation A key component in many RNA-Seq-based studies is contrasting multiple replicates from different experimental conditions. In this setup, replicates play a key role as they allow to capture underlying biological variability inherent to the compared conditions, as well as experimental variability. However, what constitutes a 'bad' replicate is not necessarily well defined. Consequently, researchers might discard valuable data or downstream analysis may be hampered by failed experiments.

Results:

Here we develop a probability model to weigh a given RNA-Seq sample as a representative of an experimental condition when performing alternative splicing analysis. We demonstrate that this model detects outlier samples which are consistently and significantly different compared with other samples from the same condition. Moreover, we show that instead of discarding such samples the proposed weighting scheme can be used to downweight samples and specific splicing variations suspected as outliers, gaining statistical power. These weights can then be used for differential splicing (DS) analysis, where the resulting algorithm offers a generalization of the MAJIQ algorithm. Using both synthetic and real-life data, we perform an extensive evaluation of the improved MAJIQ algorithm in different scenarios involving perturbed samples, mislabeled samples, same condition groups, and different levels of coverage, showing it compares favorably to other tools. Overall, this work offers an outlier detection algorithm that can be combined with any splicing pipeline, a generalized and improved version of MAJIQ for DS detection, and evaluation metrics with matching code and data for DS algorithms. Availability and implementation Software and data are accessible via majiq.biociphers.org/norton_et_al_2017/. Contact yosephb@upenn.edu. Supplementary information Supplementary data are available at Bioinformatics online.
Assuntos

Texto completo: 1 Coleções: 01-internacional Base de dados: MEDLINE Assunto principal: Algoritmos / Análise de Sequência de RNA / Processamento Alternativo Tipo de estudo: Diagnostic_studies Idioma: En Ano de publicação: 2018 Tipo de documento: Article

Texto completo: 1 Coleções: 01-internacional Base de dados: MEDLINE Assunto principal: Algoritmos / Análise de Sequência de RNA / Processamento Alternativo Tipo de estudo: Diagnostic_studies Idioma: En Ano de publicação: 2018 Tipo de documento: Article