Direct comparison of performance of single nucleotide variant calling in human genome with alignment-based and assembly-based approaches.
Wu, Leihong; Yavas, Gokhan; Hong, Huixiao; Tong, Weida; Xiao, Wenming.
Affiliation
  • Wu L; National Center for Toxicological Research, US Food and Drug Administration, 3900 NCTR RD, Jefferson, AR, 72079, USA.
  • Yavas G; National Center for Toxicological Research, US Food and Drug Administration, 3900 NCTR RD, Jefferson, AR, 72079, USA.
  • Hong H; National Center for Toxicological Research, US Food and Drug Administration, 3900 NCTR RD, Jefferson, AR, 72079, USA.
  • Tong W; National Center for Toxicological Research, US Food and Drug Administration, 3900 NCTR RD, Jefferson, AR, 72079, USA.
  • Xiao W; National Center for Toxicological Research, US Food and Drug Administration, 3900 NCTR RD, Jefferson, AR, 72079, USA. wenming.xiao@fda.hhs.gov.
Sci Rep ; 7(1): 10963, 2017 09 08.
Article in English | MEDLINE | ID: mdl-28887485
Complementary to reference-based variant detection, recent studies revealed that many novel variants could be detected with de novo assembled genomes. To evaluate the effect of read coverage and the accuracy of assembly-based variant calling, we simulated short reads containing more than 3 million single nucleotide variants (SNVs) from the whole human genome and compared the efficiency of SNV calling between the assembly-based and alignment-based approaches. We assessed the quality of the assembled contigs and found that a minimum of 30X short-read coverage was needed to ensure reliable SNV calling and to generate assembled contigs with good coverage of the genome and genes. In addition, we observed that the assembly-based approach had much lower recall and precision than the alignment-based approach, which recovered 99% of imputed SNVs. We observed similar results with experimental reads for NA24385, an individual whose germline variants are well characterized. Although it adds value for SNV detection, the assembly-based approach carries a high risk of false discovery of novel SNVs. Further improvement of de novo assembly algorithms is needed to ensure a complete, haplotype-resolved genome and high fidelity of the assembled sequences.
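The recall and precision figures mentioned in the abstract can be illustrated with a minimal sketch. It assumes that each SNV is identified by chromosome, position, reference allele, and alternate allele, and that a call counts as correct only when all four match the truth set. The function name `precision_recall` and the toy variants are hypothetical and are not taken from the study.

```python
from typing import Set, Tuple

# Assumed SNV key: (chromosome, 1-based position, reference allele, alternate allele).
Snv = Tuple[str, int, str, str]


def precision_recall(called: Set[Snv], truth: Set[Snv]) -> Tuple[float, float]:
    """Return (precision, recall) of a call set against a truth set."""
    true_pos = len(called & truth)  # calls that exactly match a known SNV
    precision = true_pos / len(called) if called else 0.0
    recall = true_pos / len(truth) if truth else 0.0
    return precision, recall


# Made-up toy data for illustration only.
truth = {
    ("chr1", 100, "A", "G"),
    ("chr1", 250, "C", "T"),
    ("chr2", 75, "G", "A"),
    ("chr2", 900, "T", "C"),
}
called = {
    ("chr1", 100, "A", "G"),
    ("chr1", 250, "C", "T"),
    ("chr2", 80, "G", "C"),  # not in the truth set: a false positive
}

precision, recall = precision_recall(called, truth)
print(f"precision={precision:.3f} recall={recall:.3f}")
```

In this toy case two of the three calls are correct and two of the four true SNVs are found, so low precision corresponds to false discoveries and low recall to missed variants.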
Subject(s)

Full text: 1 Collection: 01-international Database: MEDLINE Main subject: Sequence Alignment / Sequence Analysis, DNA / Contig Mapping / Polymorphism, Single Nucleotide / Genome-Wide Association Study Limits: Humans Language: En Journal: Sci Rep Year: 2017 Document type: Article Affiliation country: United States
...