Your browser doesn't support javascript.
loading
Haplotype estimation using sequencing reads.
Delaneau, Olivier; Howie, Bryan; Cox, Anthony J; Zagury, Jean-François; Marchini, Jonathan.
  • Delaneau O; Department of Statistics, University of Oxford, Oxford OX1 3TG, UK.
Am J Hum Genet ; 93(4): 687-96, 2013 Oct 03.
Article en En | MEDLINE | ID: mdl-24094745
ABSTRACT
High-throughput sequencing technologies produce short sequence reads that can contain phase information if they span two or more heterozygote genotypes. This information is not routinely used by current methods that infer haplotypes from genotype data. We have extended the SHAPEIT2 method to use phase-informative sequencing reads to improve phasing accuracy. Our model incorporates the read information in a probabilistic model through base quality scores within each read. The method is primarily designed for high-coverage sequence data or data sets that already have genotypes called. One important application is phasing of single samples sequenced at high coverage for use in medical sequencing and studies of rare diseases. Our method can also use existing panels of reference haplotypes. We tested the method by using a mother-father-child trio sequenced at high-coverage by Illumina together with the low-coverage sequence data from the 1000 Genomes Project (1000GP). We found that use of phase-informative reads increases the mean distance between switch errors by 22% from 274.4 kb to 328.6 kb. We also used male chromosome X haplotypes from the 1000GP samples to simulate sequencing reads with varying insert size, read length, and base error rate. When using short 100 bp paired-end reads, we found that using mixtures of insert sizes produced the best results. When using longer reads with high error rates (5-20 kb read with 4%-15% error per base), phasing performance was substantially improved.
Asunto(s)

Texto completo: 1 Banco de datos: MEDLINE Asunto principal: Haplotipos / Genoma Humano / Análisis de Secuencia de ADN Tipo de estudio: Prognostic_studies Límite: Child / Female / Humans / Male Idioma: En Año: 2013 Tipo del documento: Article

Texto completo: 1 Banco de datos: MEDLINE Asunto principal: Haplotipos / Genoma Humano / Análisis de Secuencia de ADN Tipo de estudio: Prognostic_studies Límite: Child / Female / Humans / Male Idioma: En Año: 2013 Tipo del documento: Article