Best practices for evaluating single nucleotide variant calling methods for microbial genomics.

Olson, Nathan D; Lund, Steven P; Colman, Rebecca E; Foster, Jeffrey T; Sahl, Jason W; Schupp, James M; Keim, Paul; Morrow, Jayne B; Salit, Marc L; Zook, Justin M

Olson, Nathan D; Lund, Steven P; Colman, Rebecca E; Foster, Jeffrey T; Sahl, Jason W; Schupp, James M; Keim, Paul; Morrow, Jayne B; Salit, Marc L; Zook, Justin M.

Afiliação

Olson ND; Biosystems and Biomaterials Division, Material Measurement Laboratory, National Institute of Standards and Technology , Gaithersburg, MD, USA.
Lund SP; Statistical Engineering Division, Information Technology Laboratory, National Institute of Standards and Technology , Gaithersburg, MD, USA.
Colman RE; Division of Pathogen Genomics, Translational Genomics Research Institute , Flagstaff, AZ, USA.
Foster JT; Center for Microbial Genetics and Genomics, Northern Arizona University , Flagstaff, AZ, USA.
Sahl JW; Division of Pathogen Genomics, Translational Genomics Research Institute , Flagstaff, AZ, USA ; Center for Microbial Genetics and Genomics, Northern Arizona University , Flagstaff, AZ, USA.
Schupp JM; Division of Pathogen Genomics, Translational Genomics Research Institute , Flagstaff, AZ, USA.
Keim P; Division of Pathogen Genomics, Translational Genomics Research Institute , Flagstaff, AZ, USA ; Center for Microbial Genetics and Genomics, Northern Arizona University , Flagstaff, AZ, USA.
Morrow JB; Biosystems and Biomaterials Division, Material Measurement Laboratory, National Institute of Standards and Technology , Gaithersburg, MD, USA.
Salit ML; Biosystems and Biomaterials Division, Material Measurement Laboratory, National Institute of Standards and Technology , Gaithersburg, MD, USA ; Department of Bioengineering, Stanford University , Stanford, CA, USA.
Zook JM; Biosystems and Biomaterials Division, Material Measurement Laboratory, National Institute of Standards and Technology , Gaithersburg, MD, USA.

Front Genet ; 6: 235, 2015.

Article em En | MEDLINE | ID: mdl-26217378

RESUMO

Innovations in sequencing technologies have allowed biologists to make incredible advances in understanding biological systems. As experience grows, researchers increasingly recognize that analyzing the wealth of data provided by these new sequencing platforms requires careful attention to detail for robust results. Thus far, much of the scientific Communit's focus for use in bacterial genomics has been on evaluating genome assembly algorithms and rigorously validating assembly program performance. Missing, however, is a focus on critical evaluation of variant callers for these genomes. Variant calling is essential for comparative genomics as it yields insights into nucleotide-level organismal differences. Variant calling is a multistep process with a host of potential error sources that may lead to incorrect variant calls. Identifying and resolving these incorrect calls is critical for bacterial genomics to advance. The goal of this review is to provide guidance on validating algorithms and pipelines used in variant calling for bacterial genomics. First, we will provide an overview of the variant calling procedures and the potential sources of error associated with the methods. We will then identify appropriate datasets for use in evaluating algorithms and describe statistical methods for evaluating algorithm performance. As variant calling moves from basic research to the applied setting, standardized methods for performance evaluation and reporting are required; it is our hope that this review provides the groundwork for the development of these standards.

Palavras-chave

indel; next-generation sequencing; performance metrics; single nucleotide variants; variant calling

Texto completo

Imprimir

XML

PubMed Links

Buscar no Google

Texto completo: 1 Base de dados: MEDLINE Tipo de estudo: Guideline Idioma: En Ano de publicação: 2015 Tipo de documento: Article

Texto completo

Imprimir

XML

PubMed Links

Buscar no Google

Texto completo: 1 Base de dados: MEDLINE Tipo de estudo: Guideline Idioma: En Ano de publicação: 2015 Tipo de documento: Article