Your browser doesn't support javascript.
loading
Reconstruction of full-length LINE-1 progenitors from ancestral genomes.
Campitelli, Laura F; Yellan, Isaac; Albu, Mihai; Barazandeh, Marjan; Patel, Zain M; Blanchette, Mathieu; Hughes, Timothy R.
Afiliación
  • Campitelli LF; Department of Molecular Genetics, University of Toronto, Toronto, ON M5S 1A1, Canada.
  • Yellan I; Donnelly Centre, University of Toronto, Toronto, ON M5S 1A1, Canada.
  • Albu M; Department of Molecular Genetics, University of Toronto, Toronto, ON M5S 1A1, Canada.
  • Barazandeh M; Donnelly Centre, University of Toronto, Toronto, ON M5S 1A1, Canada.
  • Patel ZM; Donnelly Centre, University of Toronto, Toronto, ON M5S 1A1, Canada.
  • Blanchette M; Donnelly Centre, University of Toronto, Toronto, ON M5S 1A1, Canada.
  • Hughes TR; Faculty of Pharmaceutical Sciences, University of British Columbia, Vancouver, BC V6T 1Z4, Canada.
Genetics ; 221(3)2022 07 04.
Article en En | MEDLINE | ID: mdl-35552404
ABSTRACT
Sequences derived from the Long INterspersed Element-1 (L1) family of retrotransposons occupy at least 17% of the human genome, with 67 distinct subfamilies representing successive waves of expansion and extinction in mammalian lineages. L1s contribute extensively to gene regulation, but their molecular history is difficult to trace, because most are present only as truncated and highly mutated fossils. Consequently, L1 entries in current databases of repeat sequences are composed mainly of short diagnostic subsequences, rather than full functional progenitor sequences for each subfamily. Here, we have coupled 2 levels of sequence reconstruction (at the level of whole genomes and L1 subfamilies) to reconstruct progenitor sequences for all human L1 subfamilies that are more functionally and phylogenetically plausible than existing models. Most of the reconstructed sequences are at or near the canonical length of L1s and encode uninterrupted ORFs with expected protein domains. We also show that the presence or absence of binding sites for KRAB-C2H2 Zinc Finger Proteins, even in ancient-reconstructed progenitor L1s, mirrors binding observed in human ChIP-exo experiments, thus extending the arms race and domestication model. RepeatMasker searches of the modern human genome suggest that the new models may be able to assign subfamily resolution identities to previously ambiguous L1 instances. The reconstructed L1 sequences will be useful for genome annotation and functional study of both L1 evolution and L1 contributions to host regulatory networks.
Asunto(s)
Palabras clave

Texto completo: 1 Bases de datos: MEDLINE Asunto principal: Retroelementos / Elementos de Nucleótido Esparcido Largo Tipo de estudio: Prognostic_studies Límite: Animals / Humans Idioma: En Revista: Genetics Año: 2022 Tipo del documento: Article País de afiliación: Canadá

Texto completo: 1 Bases de datos: MEDLINE Asunto principal: Retroelementos / Elementos de Nucleótido Esparcido Largo Tipo de estudio: Prognostic_studies Límite: Animals / Humans Idioma: En Revista: Genetics Año: 2022 Tipo del documento: Article País de afiliación: Canadá