Your browser doesn't support javascript.
loading
Whole-genome haplotyping using long reads and statistical methods.
Kuleshov, Volodymyr; Xie, Dan; Chen, Rui; Pushkarev, Dmitry; Ma, Zhihai; Blauwkamp, Tim; Kertesz, Michael; Snyder, Michael.
Affiliation
  • Kuleshov V; Department of Computer Science, Stanford University, Stanford, CA 94305, USA.
  • Xie D; Illumina, Inc., 5200 Illumina Way, San Diego, CA 92199, USA.
  • Chen R; Department of Genetics, Stanford University School of Medicine, Stanford, CA 94305, USA.
  • Pushkarev D; Department of Genetics, Stanford University School of Medicine, Stanford, CA 94305, USA.
  • Ma Z; Illumina, Inc., 5200 Illumina Way, San Diego, CA 92199, USA.
  • Blauwkamp T; Department of Genetics, Stanford University School of Medicine, Stanford, CA 94305, USA.
  • Kertesz M; Illumina, Inc., 5200 Illumina Way, San Diego, CA 92199, USA.
  • Snyder M; Illumina, Inc., 5200 Illumina Way, San Diego, CA 92199, USA.
Nat Biotechnol ; 32(3): 261-266, 2014 Mar.
Article in En | MEDLINE | ID: mdl-24561555
ABSTRACT
The rapid growth of sequencing technologies has greatly contributed to our understanding of human genetics. Yet, despite this growth, mainstream technologies have not been fully able to resolve the diploid nature of the human genome. Here we describe statistically aided, long-read haplotyping (SLRH), a rapid, accurate method that uses a statistical algorithm to take advantage of the partially phased information contained in long genomic fragments analyzed by short-read sequencing. For a human sample, as little as 30 Gbp of additional sequencing data are needed to phase genotypes identified by 50× coverage whole-genome sequencing. Using SLRH, we phase 99% of single-nucleotide variants in three human genomes into long haplotype blocks 0.2-1 Mbp in length. We apply our method to determine allele-specific methylation patterns in a human genome and identify hundreds of differentially methylated regions that were previously unknown. SLRH should facilitate population-scale haplotyping of human genomes.
Subject(s)

Full text: 1 Collection: 01-internacional Database: MEDLINE Main subject: Haplotypes / Sequence Analysis, DNA / Genomics Limits: Humans Language: En Journal: Nat Biotechnol Journal subject: BIOTECNOLOGIA Year: 2014 Type: Article Affiliation country: United States

Full text: 1 Collection: 01-internacional Database: MEDLINE Main subject: Haplotypes / Sequence Analysis, DNA / Genomics Limits: Humans Language: En Journal: Nat Biotechnol Journal subject: BIOTECNOLOGIA Year: 2014 Type: Article Affiliation country: United States