Complete vertebrate mitogenomes reveal widespread repeats and gene duplications.
Formenti, Giulio; Rhie, Arang; Balacco, Jennifer; Haase, Bettina; Mountcastle, Jacquelyn; Fedrigo, Olivier; Brown, Samara; Capodiferro, Marco Rosario; Al-Ajli, Farooq O; Ambrosini, Roberto; Houde, Peter; Koren, Sergey; Oliver, Karen; Smith, Michelle; Skelton, Jason; Betteridge, Emma; Dolucan, Jale; Corton, Craig; Bista, Iliana; Torrance, James; Tracey, Alan; Wood, Jonathan; Uliano-Silva, Marcela; Howe, Kerstin; McCarthy, Shane; Winkler, Sylke; Kwak, Woori; Korlach, Jonas; Fungtammasan, Arkarachai; Fordham, Daniel; Costa, Vania; Mayes, Simon; Chiara, Matteo; Horner, David S; Myers, Eugene; Durbin, Richard; Achilli, Alessandro; Braun, Edward L; Phillippy, Adam M; Jarvis, Erich D.
Affiliation
  • Formenti G; The Vertebrate Genome Lab, Rockefeller University, New York, NY, USA. gformenti@rockefeller.edu.
  • Rhie A; Laboratory of Neurogenetics of Language, Rockefeller University, New York, NY, USA.
  • Balacco J; The Howard Hughes Medical Institute, Chevy Chase, MD, USA.
  • Haase B; Genome Informatics Section, Computational and Statistical Genomics Branch, National Human Genome Research Institute, National Institutes of Health, Bethesda, MD, USA.
  • Mountcastle J; The Vertebrate Genome Lab, Rockefeller University, New York, NY, USA.
  • Fedrigo O; The Vertebrate Genome Lab, Rockefeller University, New York, NY, USA.
  • Brown S; The Vertebrate Genome Lab, Rockefeller University, New York, NY, USA.
  • Capodiferro MR; The Vertebrate Genome Lab, Rockefeller University, New York, NY, USA.
  • Al-Ajli FO; Laboratory of Neurogenetics of Language, Rockefeller University, New York, NY, USA.
  • Ambrosini R; The Howard Hughes Medical Institute, Chevy Chase, MD, USA.
  • Houde P; Department of Biology and Biotechnology "L. Spallanzani", University of Pavia, Pavia, Italy.
  • Koren S; Monash University Malaysia Genomics Facility, School of Science, Bandar Sunway, Selangor Darul Ehsan, Malaysia.
  • Oliver K; Tropical Medicine and Biology Multidisciplinary Platform, Monash University Malaysia, Bandar Sunway, Selangor Darul Ehsan, Malaysia.
  • Smith M; Qatar Falcon Genome Project, Doha, State of Qatar.
  • Skelton J; Department of Environmental Science and Policy, University of Milan, Milan, Italy.
  • Betteridge E; Department of Biology, New Mexico State University, Las Cruces, NM, USA.
  • Dolucan J; Genome Informatics Section, Computational and Statistical Genomics Branch, National Human Genome Research Institute, National Institutes of Health, Bethesda, MD, USA.
  • Corton C; Wellcome Sanger Institute, Cambridge, UK.
  • Bista I; Wellcome Sanger Institute, Cambridge, UK.
  • Torrance J; Wellcome Sanger Institute, Cambridge, UK.
  • Tracey A; Wellcome Sanger Institute, Cambridge, UK.
  • Wood J; Wellcome Sanger Institute, Cambridge, UK.
  • Uliano-Silva M; Wellcome Sanger Institute, Cambridge, UK.
  • Howe K; Wellcome Sanger Institute, Cambridge, UK.
  • McCarthy S; Department of Genetics, University of Cambridge, Cambridge, UK.
  • Winkler S; Wellcome Sanger Institute, Cambridge, UK.
  • Kwak W; Wellcome Sanger Institute, Cambridge, UK.
  • Korlach J; Wellcome Sanger Institute, Cambridge, UK.
  • Fungtammasan A; Wellcome Sanger Institute, Cambridge, UK.
  • Fordham D; Wellcome Sanger Institute, Cambridge, UK.
  • Costa V; Wellcome Sanger Institute, Cambridge, UK.
  • Mayes S; Department of Genetics, University of Cambridge, Cambridge, UK.
  • Chiara M; Max Planck Institute of Molecular Cell Biology & Genetics, Dresden, Germany.
  • Horner DS; Hoonygen, Seoul, Korea.
  • Myers E; Pacific Biosciences, Menlo Park, CA, USA.
  • Durbin R; DNAnexus Inc., Mountain View, CA, USA.
  • Achilli A; Oxford Nanopore Technologies Ltd, Oxford Science Park, Oxford, UK.
  • Braun EL; Oxford Nanopore Technologies Ltd, Oxford Science Park, Oxford, UK.
  • Phillippy AM; Oxford Nanopore Technologies Ltd, Oxford Science Park, Oxford, UK.
  • Jarvis ED; Department of Biosciences, University of Milan, Milan, Italy.
Genome Biol; 22(1): 120, 2021 04 29.
Article in En | MEDLINE | ID: mdl-33910595
ABSTRACT

BACKGROUND:

Modern sequencing technologies should make the assembly of the relatively small mitochondrial genomes an easy undertaking. However, few tools exist that address mitochondrial assembly directly.

RESULTS:

As part of the Vertebrate Genomes Project (VGP) we develop mitoVGP, a fully automated pipeline for similarity-based identification of mitochondrial reads and de novo assembly of mitochondrial genomes that incorporates both long (> 10 kbp, PacBio or Nanopore) and short (100-300 bp, Illumina) reads. Our pipeline leads to successful complete mitogenome assemblies of 100 vertebrate species of the VGP. We observe that tissue type and library size selection have considerable impact on mitogenome sequencing and assembly. Comparing our assemblies to purportedly complete reference mitogenomes based on short-read sequencing, we identify errors, missing sequences, and incomplete genes in those references, particularly in repetitive regions. Our assemblies also identify novel gene region duplications. The presence of repeats and duplications in over half of the species herein assembled indicates that their occurrence is a principle of mitochondrial structure rather than an exception, shedding new light on mitochondrial genome evolution and organization.
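The idea of similarity-based identification of mitochondrial reads can be illustrated with a toy sketch. This is not the mitoVGP implementation, which the abstract does not detail; it assumes a simple k-mer containment score against a reference mitogenome in place of a real read aligner, and the function names, the `k` value, and the `min_containment` threshold are all hypothetical choices made for illustration.

```python
from typing import Dict, List, Set

_COMPLEMENT = str.maketrans("ACGT", "TGCA")


def reverse_complement(seq: str) -> str:
    """Return the reverse complement of an uppercase DNA string."""
    return seq.translate(_COMPLEMENT)[::-1]


def kmers(seq: str, k: int) -> Set[str]:
    """Return the set of all length-k substrings of seq (empty if seq is shorter than k)."""
    seq = seq.upper()
    return {seq[i:i + k] for i in range(len(seq) - k + 1)}


def reference_index(reference: str, k: int) -> Set[str]:
    """Index the k-mers of both strands of the reference.

    Assumption: the reference is treated as linear, so k-mers spanning
    the junction of a circular mitogenome are not indexed.
    """
    ref = reference.upper()
    return kmers(ref, k) | kmers(reverse_complement(ref), k)


def containment(read: str, index: Set[str], k: int) -> float:
    """Fraction of the read's distinct k-mers that also occur in the index."""
    read_kmers = kmers(read, k)
    if not read_kmers:
        return 0.0
    return len(read_kmers & index) / len(read_kmers)


def select_mito_reads(
    reads: Dict[str, str],
    reference: str,
    k: int = 11,
    min_containment: float = 0.5,
) -> List[str]:
    """Return the names of reads whose containment score meets the threshold."""
    index = reference_index(reference, k)
    return [
        name
        for name, seq in reads.items()
        if containment(seq, index, k) >= min_containment
    ]
```

In practice, noisy long reads would call for an error-tolerant aligner rather than exact k-mer matching, and the selected reads would then be passed to a de novo assembler; both steps are outside the scope of this sketch.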

CONCLUSIONS:

Our results indicate that even in the "simple" case of vertebrate mitogenomes the completeness of many currently available reference sequences can be further improved, and caution should be exercised before claiming the complete assembly of a mitogenome, particularly from short reads alone.

Full text: 1 Collection: 01-internacional Database: MEDLINE Main subject: Vertebrates / Repetitive Sequences, Nucleic Acid / Gene Duplication / Genomics / Genome, Mitochondrial Type of study: Prognostic_studies Limits: Animals Language: En Journal: Genome Biol Journal subject: BIOLOGIA MOLECULAR / GENETICA Year: 2021 Document type: Article Affiliation country:

...