Your browser doesn't support javascript.
loading
CNOGpro: detection and quantification of CNVs in prokaryotic whole-genome sequencing data.
Brynildsrud, Ola; Snipen, Lars-Gustav; Bohlin, Jon.
Afiliação
  • Brynildsrud O; Section for Biostatistics and Epidemiology, Norwegian University of Life Sciences (NMBU), Oslo, Department of Chemistry, Biotechnology and Food Science, Norwegian University of Life Sciences (NMBU), Ås and Norwegian Institute of Public Health, Division of Epidemiology, 0403 Oslo, Norway.
  • Snipen LG; Section for Biostatistics and Epidemiology, Norwegian University of Life Sciences (NMBU), Oslo, Department of Chemistry, Biotechnology and Food Science, Norwegian University of Life Sciences (NMBU), Ås and Norwegian Institute of Public Health, Division of Epidemiology, 0403 Oslo, Norway.
  • Bohlin J; Section for Biostatistics and Epidemiology, Norwegian University of Life Sciences (NMBU), Oslo, Department of Chemistry, Biotechnology and Food Science, Norwegian University of Life Sciences (NMBU), Ås and Norwegian Institute of Public Health, Division of Epidemiology, 0403 Oslo, Norway.
Bioinformatics ; 31(11): 1708-15, 2015 Jun 01.
Article em En | MEDLINE | ID: mdl-25644268
ABSTRACT
MOTIVATION The explosion of whole-genome sequencing (WGS) as a tool in the mapping and understanding of genomes has been accompanied by an equally massive report of tools and pipelines for the analysis of DNA copy number variation (CNV). Most currently available tools are designed specifically for human genomes, with comparatively little literature devoted to CNVs in prokaryotic organisms. However, there are several idiosyncrasies in prokaryotic WGS data. This work proposes a step-by-step approach for detection and quantification of copy number variants specifically aimed at prokaryotes.

RESULTS:

After aligning WGS reads to a reference genome, we count the individual reads in a sliding window and normalize these counts for bias introduced by differences in GC content. We then investigate the coverage in two fundamentally different ways (i) Employing a Hidden Markov Model and (ii) by repeated sampling with replacement (bootstrapping) on each individual gene. The latter bypasses the complex problem of breakpoint determination. To demonstrate our method, we apply it to real and simulated WGS data and benchmark it against two popular methods for CNV detection. The proposed methodology will in some cases represent a significant jump in accuracy from other current methods. AVAILABILITY AND IMPLEMENTATION CNOGpro is written entirely in the R programming language and is available from the CRAN repository (http//cran.r-project.org) under the GNU General Public License.
Assuntos

Texto completo: 1 Coleções: 01-internacional Base de dados: MEDLINE Assunto principal: Software / Genoma Bacteriano / Variações do Número de Cópias de DNA Tipo de estudo: Diagnostic_studies Idioma: En Revista: Bioinformatics Assunto da revista: INFORMATICA MEDICA Ano de publicação: 2015 Tipo de documento: Article País de afiliação: Noruega

Texto completo: 1 Coleções: 01-internacional Base de dados: MEDLINE Assunto principal: Software / Genoma Bacteriano / Variações do Número de Cópias de DNA Tipo de estudo: Diagnostic_studies Idioma: En Revista: Bioinformatics Assunto da revista: INFORMATICA MEDICA Ano de publicação: 2015 Tipo de documento: Article País de afiliação: Noruega