Your browser doesn't support javascript.
loading
Alignment and mapping methodology influence transcript abundance estimation.
Srivastava, Avi; Malik, Laraib; Sarkar, Hirak; Zakeri, Mohsen; Almodaresi, Fatemeh; Soneson, Charlotte; Love, Michael I; Kingsford, Carl; Patro, Rob.
Afiliação
  • Srivastava A; Department of Computer Science, Stony Brook University, Stony Brook, USA.
  • Malik L; Department of Computer Science, Stony Brook University, Stony Brook, USA.
  • Sarkar H; Department of Computer Science, University of Maryland, College Park, USA.
  • Zakeri M; Department of Computer Science, University of Maryland, College Park, USA.
  • Almodaresi F; Department of Computer Science, University of Maryland, College Park, USA.
  • Soneson C; Friedrich Miescher Institute for Biomedical Research, Basel, Switzerland.
  • Love MI; SIB Swiss Institute of Bioinformatics, Basel, Switzerland.
  • Kingsford C; Department of Biostatistics, University of North Carolina at Chapel Hill, Chapel Hill, USA.
  • Patro R; Department of Genetics, University of North Carolina at Chapel Hill, Chapel Hill, USA.
Genome Biol ; 21(1): 239, 2020 09 07.
Article em En | MEDLINE | ID: mdl-32894187
ABSTRACT

BACKGROUND:

The accuracy of transcript quantification using RNA-seq data depends on many factors, such as the choice of alignment or mapping method and the quantification model being adopted. While the choice of quantification model has been shown to be important, considerably less attention has been given to comparing the effect of various read alignment approaches on quantification accuracy.

RESULTS:

We investigate the influence of mapping and alignment on the accuracy of transcript quantification in both simulated and experimental data, as well as the effect on subsequent differential expression analysis. We observe that, even when the quantification model itself is held fixed, the effect of choosing a different alignment methodology, or aligning reads using different parameters, on quantification estimates can sometimes be large and can affect downstream differential expression analyses as well. These effects can go unnoticed when assessment is focused too heavily on simulated data, where the alignment task is often simpler than in experimentally acquired samples. We also introduce a new alignment methodology, called selective alignment, to overcome the shortcomings of lightweight approaches without incurring the computational cost of traditional alignment.

CONCLUSION:

We observe that, on experimental datasets, the performance of lightweight mapping and alignment-based approaches varies significantly, and highlight some of the underlying factors. We show this variation both in terms of quantification and downstream differential expression analysis. In all comparisons, we also show the improved performance of our proposed selective alignment method and suggest best practices for performing RNA-seq quantification.
Assuntos
Palavras-chave

Texto completo: 1 Coleções: 01-internacional Base de dados: MEDLINE Assunto principal: Alinhamento de Sequência / Mapeamento Cromossômico Tipo de estudo: Evaluation_studies / Guideline / Prognostic_studies Limite: Animals Idioma: En Revista: Genome Biol Assunto da revista: BIOLOGIA MOLECULAR / GENETICA Ano de publicação: 2020 Tipo de documento: Article País de afiliação: Estados Unidos

Texto completo: 1 Coleções: 01-internacional Base de dados: MEDLINE Assunto principal: Alinhamento de Sequência / Mapeamento Cromossômico Tipo de estudo: Evaluation_studies / Guideline / Prognostic_studies Limite: Animals Idioma: En Revista: Genome Biol Assunto da revista: BIOLOGIA MOLECULAR / GENETICA Ano de publicação: 2020 Tipo de documento: Article País de afiliação: Estados Unidos