Your browser doesn't support javascript.
loading
De novo assembly and phasing of a Korean human genome.
Seo, Jeong-Sun; Rhie, Arang; Kim, Junsoo; Lee, Sangjin; Sohn, Min-Hwan; Kim, Chang-Uk; Hastie, Alex; Cao, Han; Yun, Ji-Young; Kim, Jihye; Kuk, Junho; Park, Gun Hwa; Kim, Juhyeok; Ryu, Hanna; Kim, Jongbum; Roh, Mira; Baek, Jeonghun; Hunkapiller, Michael W; Korlach, Jonas; Shin, Jong-Yeon; Kim, Changhoon.
Afiliação
  • Seo JS; Genomic Medicine Institute (GMI), Medical Research Center, Seoul National University, Seoul 110-799, South Korea.
  • Rhie A; Department of Biochemistry and Molecular Biology, Seoul National University College of Medicine, Seoul 110-799, South Korea.
  • Kim J; Department of Biomedical Sciences, Seoul National University Graduate School, Seoul 110-799, South Korea.
  • Lee S; Bioinformatics Institute, Macrogen Inc., Seoul 153-023, South Korea.
  • Sohn MH; Genome Institute, Macrogen Inc., Seoul 153-023, South Korea.
  • Kim CU; Genomic Medicine Institute (GMI), Medical Research Center, Seoul National University, Seoul 110-799, South Korea.
  • Hastie A; Department of Biochemistry and Molecular Biology, Seoul National University College of Medicine, Seoul 110-799, South Korea.
  • Cao H; Department of Biomedical Sciences, Seoul National University Graduate School, Seoul 110-799, South Korea.
  • Yun JY; Genomic Medicine Institute (GMI), Medical Research Center, Seoul National University, Seoul 110-799, South Korea.
  • Kim J; Bioinformatics Institute, Macrogen Inc., Seoul 153-023, South Korea.
  • Kuk J; Genomic Medicine Institute (GMI), Medical Research Center, Seoul National University, Seoul 110-799, South Korea.
  • Park GH; Genome Institute, Macrogen Inc., Seoul 153-023, South Korea.
  • Kim J; Genomic Medicine Institute (GMI), Medical Research Center, Seoul National University, Seoul 110-799, South Korea.
  • Ryu H; Department of Biochemistry and Molecular Biology, Seoul National University College of Medicine, Seoul 110-799, South Korea.
  • Kim J; Department of Biomedical Sciences, Seoul National University Graduate School, Seoul 110-799, South Korea.
  • Roh M; Genomic Medicine Institute (GMI), Medical Research Center, Seoul National University, Seoul 110-799, South Korea.
  • Baek J; Department of Biochemistry and Molecular Biology, Seoul National University College of Medicine, Seoul 110-799, South Korea.
  • Hunkapiller MW; Department of Biomedical Sciences, Seoul National University Graduate School, Seoul 110-799, South Korea.
  • Korlach J; BioNano Genomics, San Diego, California 92121, USA.
  • Shin JY; BioNano Genomics, San Diego, California 92121, USA.
  • Kim C; Genomic Medicine Institute (GMI), Medical Research Center, Seoul National University, Seoul 110-799, South Korea.
Nature ; 538(7624): 243-247, 2016 Oct 13.
Article em En | MEDLINE | ID: mdl-27706134
ABSTRACT
Advances in genome assembly and phasing provide an opportunity to investigate the diploid architecture of the human genome and reveal the full range of structural variation across population groups. Here we report the de novo assembly and haplotype phasing of the Korean individual AK1 (ref. 1) using single-molecule real-time sequencing, next-generation mapping, microfluidics-based linked reads, and bacterial artificial chromosome (BAC) sequencing approaches. Single-molecule sequencing coupled with next-generation mapping generated a highly contiguous assembly, with a contig N50 size of 17.9 Mb and a scaffold N50 size of 44.8 Mb, resolving 8 chromosomal arms into single scaffolds. The de novo assembly, along with local assemblies and spanning long reads, closes 105 and extends into 72 out of 190 euchromatic gaps in the reference genome, adding 1.03 Mb of previously intractable sequence. High concordance between the assembly and paired-end sequences from 62,758 BAC clones provides strong support for the robustness of the assembly. We identify 18,210 structural variants by direct comparison of the assembly with the human reference, identifying thousands of breakpoints that, to our knowledge, have not been reported before. Many of the insertions are reflected in the transcriptome and are shared across the Asian population. We performed haplotype phasing of the assembly with short reads, long reads and linked reads from whole-genome sequencing and with short reads from 31,719 BAC clones, thereby achieving phased blocks with an N50 size of 11.6 Mb. Haplotigs assembled from single-molecule real-time reads assigned to haplotypes on phased blocks covered 89% of genes. The haplotigs accurately characterized the hypervariable major histocompatability complex region as well as demonstrating allele configuration in clinically relevant genes such as CYP2D6. This work presents the most contiguous diploid human genome assembly so far, with extensive investigation of unreported and Asian-specific structural variants, and high-quality haplotyping of clinically relevant alleles for precision medicine.
Assuntos

Texto completo: 1 Base de dados: MEDLINE Assunto principal: Haplótipos / Genoma Humano / Análise de Sequência de DNA / Mapeamento de Sequências Contíguas / Genômica / Povo Asiático Limite: Humans País/Região como assunto: Asia Idioma: En Ano de publicação: 2016 Tipo de documento: Article

Texto completo: 1 Base de dados: MEDLINE Assunto principal: Haplótipos / Genoma Humano / Análise de Sequência de DNA / Mapeamento de Sequências Contíguas / Genômica / Povo Asiático Limite: Humans País/Região como assunto: Asia Idioma: En Ano de publicação: 2016 Tipo de documento: Article