Your browser doesn't support javascript.
loading
The genome-wide landscape of C:G > T:A polymorphism at the CpG contexts in the human population.
Youk, Jeonghwan; An, Yohan; Park, Seongyeol; Lee, June-Koo; Ju, Young Seok.
Afiliación
  • Youk J; Graduate School of Medical Science and Engineering, Korea Advanced Institute of Science and Technology (KAIST), 291 Daehak-ro, Yuseong-gu, Daejeon, 34141, Republic of Korea.
  • An Y; Biomedical Science and Engineering Interdisciplinary Program, Korea Advanced Institute of Science and Technology (KAIST), 291 Daehak-ro, Yuseong-gu, Daejeon, 34141, Republic of Korea.
  • Park S; Graduate School of Medical Science and Engineering, Korea Advanced Institute of Science and Technology (KAIST), 291 Daehak-ro, Yuseong-gu, Daejeon, 34141, Republic of Korea.
  • Lee JK; Graduate School of Medical Science and Engineering, Korea Advanced Institute of Science and Technology (KAIST), 291 Daehak-ro, Yuseong-gu, Daejeon, 34141, Republic of Korea.
  • Ju YS; Graduate School of Medical Science and Engineering, Korea Advanced Institute of Science and Technology (KAIST), 291 Daehak-ro, Yuseong-gu, Daejeon, 34141, Republic of Korea. ysju@kaist.ac.kr.
BMC Genomics ; 21(1): 270, 2020 Mar 30.
Article en En | MEDLINE | ID: mdl-32228436
ABSTRACT

BACKGROUND:

The CG > TA substitution at the CpG dinucleotide contexts is the most frequent substitution type in genome evolution. The mutational process is obviously ongoing in the human germline; however, its impact on common and rare genomic polymorphisms has not been comprehensively investigated yet. Here we observed the landscape and dynamics of CG > TA substitutions from population-scale human genome sequencing datasets including ~ 4300 whole-genomes from the 1000 Genomes and the pan-cancer analysis of whole genomes (PCAWG) Project and ~ 60,000 whole-exomes from the Exome Aggregation Consortium (ExAC) database.

RESULTS:

Of the 28,084,558 CpG sites in the human reference genome, 26.0% show CG > TA substitution in the dataset. Remarkably, CpGs in CpG islands (CGIs) have a much lower frequency of such mutations (5.6%). Interestingly, the mutation frequency of CGIs is not uniform with a significantly higher CG > TA substitution rate for intragenic CGIs compared to other types. For non-CGI CpGs, the mutation rate was positively correlated with the distance from the nearest CGI up to 2 kb. Finally, we found the impact of negative selection for coding CpG mutations resulting in amino acid change.

CONCLUSIONS:

This study provides the first unbiased rate of CG > TA substitution at the CpG dinucleotide contexts, using population-scale human genome sequencing data. Our findings provide insights into the dynamics of the mutation acquisition in the human genome.
Asunto(s)
Palabras clave

Texto completo: 1 Colección: 01-internacional Banco de datos: MEDLINE Asunto principal: Islas de CpG Límite: Humans Idioma: En Revista: BMC Genomics Asunto de la revista: GENETICA Año: 2020 Tipo del documento: Article

Texto completo: 1 Colección: 01-internacional Banco de datos: MEDLINE Asunto principal: Islas de CpG Límite: Humans Idioma: En Revista: BMC Genomics Asunto de la revista: GENETICA Año: 2020 Tipo del documento: Article