Your browser doesn't support javascript.
loading
Normalization of transposon-mutant library sequencing datasets to improve identification of conditionally essential genes.
DeJesus, Michael A; Ioerger, Thomas R.
Afiliação
  • DeJesus MA; 1 Department of Computer Science, Texas A&M University, College Station, Texas 77843, USA.
  • Ioerger TR; 1 Department of Computer Science, Texas A&M University, College Station, Texas 77843, USA.
J Bioinform Comput Biol ; 14(3): 1642004, 2016 06.
Article em En | MEDLINE | ID: mdl-26932272
ABSTRACT
Sequencing of transposon-mutant libraries using next-generation sequencing (TnSeq) has become a popular method for determining which genes and non-coding regions are essential for growth under various conditions in bacteria. For methods that rely on quantitative comparison of counts of reads at transposon insertion sites, proper normalization of TnSeq datasets is vitally important. Real TnSeq datasets are often noisy and exhibit a significant skew that can be dominated by high counts at a small number of sites (often for non-biological reasons). If two datasets that are not appropriately normalized are compared, it might cause the artifactual appearance of Differentially Essential (DE) genes in a statistical test, constituting type I errors (false positives). In this paper, we propose a novel method for normalization of TnSeq datasets that corrects for the skew of read-count distributions by fitting them to a Beta-Geometric distribution. We show that this read-count correction procedure reduces the number of false positives when comparing replicate datasets grown under the same conditions (for which no genuine differences in essentiality are expected). We compare these results to results obtained with other normalization procedures, and show that it results in greater reduction in the number of false positives. In addition we investigate the effects of normalization on the detection of DE genes.
Assuntos
Palavras-chave

Texto completo: 1 Coleções: 01-internacional Base de dados: MEDLINE Assunto principal: Elementos de DNA Transponíveis / Biologia Computacional / Genes Essenciais / Bases de Dados de Ácidos Nucleicos / Sequenciamento de Nucleotídeos em Larga Escala Tipo de estudo: Diagnostic_studies Idioma: En Revista: J Bioinform Comput Biol Assunto da revista: BIOLOGIA / INFORMATICA MEDICA Ano de publicação: 2016 Tipo de documento: Article País de afiliação: Estados Unidos

Texto completo: 1 Coleções: 01-internacional Base de dados: MEDLINE Assunto principal: Elementos de DNA Transponíveis / Biologia Computacional / Genes Essenciais / Bases de Dados de Ácidos Nucleicos / Sequenciamento de Nucleotídeos em Larga Escala Tipo de estudo: Diagnostic_studies Idioma: En Revista: J Bioinform Comput Biol Assunto da revista: BIOLOGIA / INFORMATICA MEDICA Ano de publicação: 2016 Tipo de documento: Article País de afiliação: Estados Unidos