Your browser doesn't support javascript.
loading
Impact of human gene annotations on RNA-seq differential expression analysis.
Hamaguchi, Yu; Zeng, Chao; Hamada, Michiaki.
Afiliação
  • Hamaguchi Y; Faculty of Science and Engineering, Waseda University, 55N-06-10, 3-4-1 Okubo Shinjuku-ku, Tokyo, 169-8555, Japan. yh549848@aoni.waseda.jp.
  • Zeng C; Faculty of Science and Engineering, Waseda University, 55N-06-10, 3-4-1 Okubo Shinjuku-ku, Tokyo, 169-8555, Japan.
  • Hamada M; AIST-Waseda University Computational Bio Big-Data Open Innovation Laboratory (CBBD-OIL), 3-4-1, Okubo Shinjuku-ku, Tokyo, 169-8555, Japan.
BMC Genomics ; 22(1): 730, 2021 Oct 08.
Article em En | MEDLINE | ID: mdl-34625021
ABSTRACT

BACKGROUND:

Differential expression (DE) analysis of RNA-seq data typically depends on gene annotations. Different sets of gene annotations are available for the human genome and are continually updated-a process complicated with the development and application of high-throughput sequencing technologies. However, the impact of the complexity of gene annotations on DE analysis remains unclear.

RESULTS:

Using "mappability", a metric of the complexity of gene annotation, we compared three distinct human gene annotations, GENCODE, RefSeq, and NONCODE, and evaluated how mappability affected DE analysis. We found that mappability was significantly different among the human gene annotations. We also found that increasing mappability improved the performance of DE analysis, and the impact of mappability mainly evident in the quantification step and propagated downstream of DE analysis systematically.

CONCLUSIONS:

We assessed how the complexity of gene annotations affects DE analysis using mappability. Our findings indicate that the growth and complexity of gene annotations negatively impact the performance of DE analysis, suggesting that an approach that excludes unnecessary gene models from gene annotations improves the performance of DE analysis.
Assuntos
Palavras-chave

Texto completo: 1 Base de dados: MEDLINE Assunto principal: Genoma Humano / Sequenciamento de Nucleotídeos em Larga Escala Idioma: En Ano de publicação: 2021 Tipo de documento: Article

Texto completo: 1 Base de dados: MEDLINE Assunto principal: Genoma Humano / Sequenciamento de Nucleotídeos em Larga Escala Idioma: En Ano de publicação: 2021 Tipo de documento: Article