Your browser doesn't support javascript.
loading
Discovery and genotyping of structural variation from long-read haploid genome sequence data.
Huddleston, John; Chaisson, Mark J P; Steinberg, Karyn Meltz; Warren, Wes; Hoekzema, Kendra; Gordon, David; Graves-Lindsay, Tina A; Munson, Katherine M; Kronenberg, Zev N; Vives, Laura; Peluso, Paul; Boitano, Matthew; Chin, Chen-Shin; Korlach, Jonas; Wilson, Richard K; Eichler, Evan E.
Afiliação
  • Huddleston J; Department of Genome Sciences, University of Washington School of Medicine, Seattle, Washington 98195, USA.
  • Chaisson MJP; Howard Hughes Medical Institute, University of Washington, Seattle, Washington 98195, USA.
  • Steinberg KM; Department of Genome Sciences, University of Washington School of Medicine, Seattle, Washington 98195, USA.
  • Warren W; McDonnell Genome Institute, Department of Medicine, Department of Genetics, Washington University School of Medicine, St. Louis, Missouri 63108, USA.
  • Hoekzema K; McDonnell Genome Institute, Department of Medicine, Department of Genetics, Washington University School of Medicine, St. Louis, Missouri 63108, USA.
  • Gordon D; Department of Genome Sciences, University of Washington School of Medicine, Seattle, Washington 98195, USA.
  • Graves-Lindsay TA; Department of Genome Sciences, University of Washington School of Medicine, Seattle, Washington 98195, USA.
  • Munson KM; Howard Hughes Medical Institute, University of Washington, Seattle, Washington 98195, USA.
  • Kronenberg ZN; McDonnell Genome Institute, Department of Medicine, Department of Genetics, Washington University School of Medicine, St. Louis, Missouri 63108, USA.
  • Vives L; Department of Genome Sciences, University of Washington School of Medicine, Seattle, Washington 98195, USA.
  • Peluso P; Department of Genome Sciences, University of Washington School of Medicine, Seattle, Washington 98195, USA.
  • Boitano M; Department of Genome Sciences, University of Washington School of Medicine, Seattle, Washington 98195, USA.
  • Chin CS; Pacific Biosciences of California, Incorporated, Menlo Park, California 94025, USA.
  • Korlach J; Pacific Biosciences of California, Incorporated, Menlo Park, California 94025, USA.
  • Wilson RK; Pacific Biosciences of California, Incorporated, Menlo Park, California 94025, USA.
  • Eichler EE; Pacific Biosciences of California, Incorporated, Menlo Park, California 94025, USA.
Genome Res ; 27(5): 677-685, 2017 05.
Article em En | MEDLINE | ID: mdl-27895111
ABSTRACT
In an effort to more fully understand the full spectrum of human genetic variation, we generated deep single-molecule, real-time (SMRT) sequencing data from two haploid human genomes. By using an assembly-based approach (SMRT-SV), we systematically assessed each genome independently for structural variants (SVs) and indels resolving the sequence structure of 461,553 genetic variants from 2 bp to 28 kbp in length. We find that >89% of these variants have been missed as part of analysis of the 1000 Genomes Project even after adjusting for more common variants (MAF > 1%). We estimate that this theoretical human diploid differs by as much as ∼16 Mbp with respect to the human reference, with long-read sequencing data providing a fivefold increase in sensitivity for genetic variants ranging in size from 7 bp to 1 kbp compared with short-read sequence data. Although a large fraction of genetic variants were not detected by short-read approaches, once the alternate allele is sequence-resolved, we show that 61% of SVs can be genotyped in short-read sequence data sets with high accuracy. Uncoupling discovery from genotyping thus allows for the majority of this missed common variation to be genotyped in the human population. Interestingly, when we repeat SV detection on a pseudodiploid genome constructed in silico by merging the two haploids, we find that ∼59% of the heterozygous SVs are no longer detected by SMRT-SV. These results indicate that haploid resolution of long-read sequencing data will significantly increase sensitivity of SV detection.
Assuntos

Texto completo: 1 Coleções: 01-internacional Base de dados: MEDLINE Assunto principal: Genoma Humano / Análise de Sequência de DNA / Mapeamento de Sequências Contíguas / Variação Estrutural do Genoma / Haploidia Limite: Humans Idioma: En Revista: Genome Res Assunto da revista: BIOLOGIA MOLECULAR / GENETICA Ano de publicação: 2017 Tipo de documento: Article País de afiliação: Estados Unidos

Texto completo: 1 Coleções: 01-internacional Base de dados: MEDLINE Assunto principal: Genoma Humano / Análise de Sequência de DNA / Mapeamento de Sequências Contíguas / Variação Estrutural do Genoma / Haploidia Limite: Humans Idioma: En Revista: Genome Res Assunto da revista: BIOLOGIA MOLECULAR / GENETICA Ano de publicação: 2017 Tipo de documento: Article País de afiliação: Estados Unidos