Your browser doesn't support javascript.
loading
Chromosome assembly of large and complex genomes using multiple references.
Kolmogorov, Mikhail; Armstrong, Joel; Raney, Brian J; Streeter, Ian; Dunn, Matthew; Yang, Fengtang; Odom, Duncan; Flicek, Paul; Keane, Thomas M; Thybert, David; Paten, Benedict; Pham, Son.
Affiliation
  • Kolmogorov M; Department of Computer Science and Engineering, University of California, San Diego, California 92093, USA.
  • Armstrong J; Center for Biomolecular Science and Engineering, University of California, Santa Cruz, California 95064, USA.
  • Raney BJ; Center for Biomolecular Science and Engineering, University of California, Santa Cruz, California 95064, USA.
  • Streeter I; European Molecular Biology Laboratory, European Bioinformatics Institute, Wellcome Genome Campus, Hinxton CB10 1SD, United Kingdom.
  • Dunn M; Wellcome Sanger Institute, Wellcome Genome Campus, Hinxton CB10 1SA, United Kingdom.
  • Yang F; Wellcome Sanger Institute, Wellcome Genome Campus, Hinxton CB10 1SA, United Kingdom.
  • Odom D; Wellcome Sanger Institute, Wellcome Genome Campus, Hinxton CB10 1SA, United Kingdom.
  • Flicek P; Cancer Research UK Cambridge Institute, University of Cambridge, CB2 0RE Cambridge, United Kingdom.
  • Keane TM; European Molecular Biology Laboratory, European Bioinformatics Institute, Wellcome Genome Campus, Hinxton CB10 1SD, United Kingdom.
  • Thybert D; Wellcome Sanger Institute, Wellcome Genome Campus, Hinxton CB10 1SA, United Kingdom.
  • Paten B; European Molecular Biology Laboratory, European Bioinformatics Institute, Wellcome Genome Campus, Hinxton CB10 1SD, United Kingdom.
  • Pham S; Wellcome Sanger Institute, Wellcome Genome Campus, Hinxton CB10 1SA, United Kingdom.
Genome Res ; 28(11): 1720-1732, 2018 11.
Article in En | MEDLINE | ID: mdl-30341161
Despite the rapid development of sequencing technologies, the assembly of mammalian-scale genomes into complete chromosomes remains one of the most challenging problems in bioinformatics. To help address this difficulty, we developed Ragout 2, a reference-assisted assembly tool that works for large and complex genomes. By taking one or more target assemblies (generated from an NGS assembler) and one or multiple related reference genomes, Ragout 2 infers the evolutionary relationships between the genomes and builds the final assemblies using a genome rearrangement approach. By using Ragout 2, we transformed NGS assemblies of 16 laboratory mouse strains into sets of complete chromosomes, leaving <5% of sequence unlocalized per set. Various benchmarks, including PCR testing and realigning of long Pacific Biosciences (PacBio) reads, suggest only a small number of structural errors in the final assemblies, comparable with direct assembly approaches. We applied Ragout 2 to the Mus caroli and Mus pahari genomes, which exhibit karyotype-scale variations compared with other genomes from the Muridae family. Chromosome painting maps confirmed most large-scale rearrangements that Ragout 2 detected. We applied Ragout 2 to improve draft sequences of three ape genomes that have recently been published. Ragout 2 transformed three sets of contigs (generated using PacBio reads only) into chromosome-scale assemblies with accuracy comparable to chromosome assemblies generated in the original study using BioNano maps, Hi-C, BAC clones, and FISH.
Subject(s)

Full text: 1 Collection: 01-internacional Database: MEDLINE Main subject: Contig Mapping / Whole Genome Sequencing Limits: Animals Language: En Journal: Genome Res Journal subject: BIOLOGIA MOLECULAR / GENETICA Year: 2018 Document type: Article Affiliation country: United States Country of publication: United States

Full text: 1 Collection: 01-internacional Database: MEDLINE Main subject: Contig Mapping / Whole Genome Sequencing Limits: Animals Language: En Journal: Genome Res Journal subject: BIOLOGIA MOLECULAR / GENETICA Year: 2018 Document type: Article Affiliation country: United States Country of publication: United States