Your browser doesn't support javascript.
loading
Discovery of Novel Sequences in 1,000 Swedish Genomes.
Eisfeldt, Jesper; Mårtensson, Gustaf; Ameur, Adam; Nilsson, Daniel; Lindstrand, Anna.
Afiliação
  • Eisfeldt J; Department of Molecular Medicine and Surgery, Center for Molecular Medicine, Karolinska Institute, Stockholm, Sweden.
  • Mårtensson G; Science for Life Laboratory, Karolinska Institutet Science Park, Solna, Sweden.
  • Ameur A; Department of Clinical Genetics, Karolinska University Hospital, Stockholm, Sweden.
  • Nilsson D; Division of Nanobiotechnology, Department of Protein Science, Science for Life Laboratory, School of Engineering Sciences in Chemistry, Biotechnology and Health, KTH Royal Institute of Technology, Stockholm, Sweden.
  • Lindstrand A; Science for Life Laboratory, Department of Immunology, Genetics and Pathology, Uppsala University, Uppsala, Sweden.
Mol Biol Evol ; 37(1): 18-30, 2020 Jan 01.
Article em En | MEDLINE | ID: mdl-31560401
ABSTRACT
Novel sequences (NSs), not present in the human reference genome, are abundant and remain largely unexplored. Here, we utilize de novo assembly to study NS in 1,000 Swedish individuals first sequenced as part of the SweGen project revealing a total of 46 Mb in 61,044 distinct contigs of sequences not present in GRCh38. The contigs were aligned to recently published catalogs of Icelandic and Pan-African NSs, as well as the chimpanzee genome, revealing a great diversity of shared sequences. Analyzing the positioning of NS across the chimpanzee genome, we find that 2,807 NS align confidently within 143 chimpanzee orthologs of human genes. Aligning the whole genome sequencing data to the chimpanzee genome, we discover ancestral NS common throughout the Swedish population. The NSs were searched for repeats and repeat elements revealing a majority of repetitive sequence (56%), and enrichment of simple repeats (28%) and satellites (15%). Lastly, we align the unmappable reads of a subset of the thousand genomes data to our collection of NS, as well as the previously published Pan-African NS revealing that both the Swedish and Pan-African NS are widespread, and that the Swedish NSs are largely a subset of the Pan-African NS. Overall, these results highlight the importance of creating a more diverse reference genome and illustrate that significant amounts of the NS may be of ancestral origin.
Assuntos
Palavras-chave

Texto completo: 1 Base de dados: MEDLINE Assunto principal: Variação Genética / Genoma Humano Limite: Animals / Humans País/Região como assunto: Europa Idioma: En Revista: Mol Biol Evol Assunto da revista: BIOLOGIA MOLECULAR Ano de publicação: 2020 Tipo de documento: Article País de afiliação: Suécia

Texto completo: 1 Base de dados: MEDLINE Assunto principal: Variação Genética / Genoma Humano Limite: Animals / Humans País/Região como assunto: Europa Idioma: En Revista: Mol Biol Evol Assunto da revista: BIOLOGIA MOLECULAR Ano de publicação: 2020 Tipo de documento: Article País de afiliação: Suécia