Your browser doesn't support javascript.
loading
TypeTE: a tool to genotype mobile element insertions from whole genome resequencing data.
Goubert, Clément; Thomas, Jainy; Payer, Lindsay M; Kidd, Jeffrey M; Feusier, Julie; Watkins, W Scott; Burns, Kathleen H; Jorde, Lynn B; Feschotte, Cédric.
Afiliação
  • Goubert C; Department of Molecular Biology and Genetics, 215 Tower Rd, Cornell University, Ithaca, NY 14853, USA.
  • Thomas J; Department of Human Genetics, University of Utah School of Medicine, Salt Lake City, UT 84112, USA.
  • Payer LM; Department of Pathology, Johns Hopkins University School of Medicine, Baltimore, MD 21205, USA.
  • Kidd JM; Department of Human Genetics, University of Michigan Medical School, Ann Arbor, MI 48109, USA.
  • Feusier J; Department of Human Genetics, University of Utah School of Medicine, Salt Lake City, UT 84112, USA.
  • Watkins WS; Department of Human Genetics, University of Utah School of Medicine, Salt Lake City, UT 84112, USA.
  • Burns KH; Department of Pathology, Johns Hopkins University School of Medicine, Baltimore, MD 21205, USA.
  • Jorde LB; Department of Human Genetics, University of Utah School of Medicine, Salt Lake City, UT 84112, USA.
  • Feschotte C; Department of Molecular Biology and Genetics, 215 Tower Rd, Cornell University, Ithaca, NY 14853, USA.
Nucleic Acids Res ; 48(6): e36, 2020 04 06.
Article em En | MEDLINE | ID: mdl-32067044
ABSTRACT
Alu retrotransposons account for more than 10% of the human genome, and insertions of these elements create structural variants segregating in human populations. Such polymorphic Alus are powerful markers to understand population structure, and they represent variants that can greatly impact genome function, including gene expression. Accurate genotyping of Alus and other mobile elements has been challenging. Indeed, we found that Alu genotypes previously called for the 1000 Genomes Project are sometimes erroneous, which poses significant problems for phasing these insertions with other variants that comprise the haplotype. To ameliorate this issue, we introduce a new pipeline - TypeTE - which genotypes Alu insertions from whole-genome sequencing data. Starting from a list of polymorphic Alus, TypeTE identifies the hallmarks (poly-A tail and target site duplication) and orientation of Alu insertions using local re-assembly to reconstruct presence and absence alleles. Genotype likelihoods are then computed after re-mapping sequencing reads to the reconstructed alleles. Using a high-quality set of PCR-based genotyping of >200 loci, we show that TypeTE improves genotype accuracy from 83% to 92% in the 1000 Genomes dataset. TypeTE can be readily adapted to other retrotransposon families and brings a valuable toolbox addition for population genomics.
Assuntos

Texto completo: 1 Base de dados: MEDLINE Assunto principal: Software / Mutagênese Insercional / Sequências Repetitivas Dispersas / Sequenciamento Completo do Genoma Idioma: En Ano de publicação: 2020 Tipo de documento: Article País de afiliação: Estados Unidos

Texto completo: 1 Base de dados: MEDLINE Assunto principal: Software / Mutagênese Insercional / Sequências Repetitivas Dispersas / Sequenciamento Completo do Genoma Idioma: En Ano de publicação: 2020 Tipo de documento: Article País de afiliação: Estados Unidos