Your browser doesn't support javascript.
loading
Progressive Cactus is a multiple-genome aligner for the thousand-genome era.
Armstrong, Joel; Hickey, Glenn; Diekhans, Mark; Fiddes, Ian T; Novak, Adam M; Deran, Alden; Fang, Qi; Xie, Duo; Feng, Shaohong; Stiller, Josefin; Genereux, Diane; Johnson, Jeremy; Marinescu, Voichita Dana; Alföldi, Jessica; Harris, Robert S; Lindblad-Toh, Kerstin; Haussler, David; Karlsson, Elinor; Jarvis, Erich D; Zhang, Guojie; Paten, Benedict.
Afiliación
  • Armstrong J; UC Santa Cruz Genomics Institute, UC Santa Cruz, Santa Cruz, CA, USA.
  • Hickey G; UC Santa Cruz Genomics Institute, UC Santa Cruz, Santa Cruz, CA, USA.
  • Diekhans M; UC Santa Cruz Genomics Institute, UC Santa Cruz, Santa Cruz, CA, USA.
  • Fiddes IT; UC Santa Cruz Genomics Institute, UC Santa Cruz, Santa Cruz, CA, USA.
  • Novak AM; UC Santa Cruz Genomics Institute, UC Santa Cruz, Santa Cruz, CA, USA.
  • Deran A; UC Santa Cruz Genomics Institute, UC Santa Cruz, Santa Cruz, CA, USA.
  • Fang Q; BGI-Shenzhen, Beishan Industrial Zone, Shenzhen, China.
  • Xie D; Section for Ecology and Evolution, Department of Biology, University of Copenhagen, Copenhagen, Denmark.
  • Feng S; BGI-Shenzhen, Beishan Industrial Zone, Shenzhen, China.
  • Stiller J; BGI Education Center, University of Chinese Academy of Sciences, Shenzhen, China.
  • Genereux D; BGI-Shenzhen, Beishan Industrial Zone, Shenzhen, China.
  • Johnson J; State Key Laboratory of Genetic Resources and Evolution, Kunming Institute of Zoology, Chinese Academy of Sciences, Kunming, China.
  • Marinescu VD; Section for Ecology and Evolution, Department of Biology, University of Copenhagen, Copenhagen, Denmark.
  • Alföldi J; Broad Institute of Harvard and Massachusetts Institute of Technology (MIT), Cambridge, MA, USA.
  • Harris RS; Broad Institute of Harvard and Massachusetts Institute of Technology (MIT), Cambridge, MA, USA.
  • Lindblad-Toh K; Science for Life Laboratory, Department of Medical Biochemistry and Microbiology, Uppsala University, Uppsala, Sweden.
  • Haussler D; Broad Institute of Harvard and Massachusetts Institute of Technology (MIT), Cambridge, MA, USA.
  • Karlsson E; Department of Biology, The Pennsylvania State University, University Park, PA, USA.
  • Jarvis ED; Broad Institute of Harvard and Massachusetts Institute of Technology (MIT), Cambridge, MA, USA.
  • Zhang G; Science for Life Laboratory, Department of Medical Biochemistry and Microbiology, Uppsala University, Uppsala, Sweden.
  • Paten B; UC Santa Cruz Genomics Institute, UC Santa Cruz, Santa Cruz, CA, USA.
Nature ; 587(7833): 246-251, 2020 11.
Article en En | MEDLINE | ID: mdl-33177663
ABSTRACT
New genome assemblies have been arriving at a rapidly increasing pace, thanks to decreases in sequencing costs and improvements in third-generation sequencing technologies1-3. For example, the number of vertebrate genome assemblies currently in the NCBI (National Center for Biotechnology Information) database4 increased by more than 50% to 1,485 assemblies in the year from July 2018 to July 2019. In addition to this influx of assemblies from different species, new human de novo assemblies5 are being produced, which enable the analysis of not only small polymorphisms, but also complex, large-scale structural differences between human individuals and haplotypes. This coming era and its unprecedented amount of data offer the opportunity to uncover many insights into genome evolution but also present challenges in how to adapt current analysis methods to meet the increased scale. Cactus6, a reference-free multiple genome alignment program, has been shown to be highly accurate, but the existing implementation scales poorly with increasing numbers of genomes, and struggles in regions of highly duplicated sequences. Here we describe progressive extensions to Cactus to create Progressive Cactus, which enables the reference-free alignment of tens to thousands of large vertebrate genomes while maintaining high alignment quality. We describe results from an alignment of more than 600 amniote genomes, which is to our knowledge the largest multiple vertebrate genome alignment created so far.
Asunto(s)

Texto completo: 1 Colección: 01-internacional Banco de datos: MEDLINE Asunto principal: Vertebrados / Programas Informáticos / Alineación de Secuencia / Genoma / Genómica Tipo de estudio: Prognostic_studies Límite: Animals / Humans Idioma: En Revista: Nature Año: 2020 Tipo del documento: Article País de afiliación: Estados Unidos

Texto completo: 1 Colección: 01-internacional Banco de datos: MEDLINE Asunto principal: Vertebrados / Programas Informáticos / Alineación de Secuencia / Genoma / Genómica Tipo de estudio: Prognostic_studies Límite: Animals / Humans Idioma: En Revista: Nature Año: 2020 Tipo del documento: Article País de afiliación: Estados Unidos