RESUMEN
Extrachromosomal circular DNA (eccDNA) originates from linear chromosomal DNA in various human tissues under physiological and disease conditions. The genomic origins of eccDNA have largely been investigated using in vitro-amplified DNA. However, in vitro amplification obscures quantitative information by skewing the total population stoichiometry. In addition, the analyses have focused on eccDNA stemming from single-copy genomic regions, leaving eccDNA from multicopy regions unexamined. To address these issues, we isolated eccDNA without in vitro amplification (naïve small circular DNA, nscDNA) and assessed the populations quantitatively by integrated genomic, molecular, and cytogenetic approaches. nscDNA of up to tens of kilobases were successfully enriched by our approach and were predominantly derived from multicopy genomic regions including segmental duplications (SDs). SDs, which account for 5% of the human genome and are hotspots for copy number variations, were significantly overrepresented in sperm nscDNA, with three times more sequencing reads derived from SDs than from the entire single-copy regions. SDs were also overrepresented in mouse sperm nscDNA, which we estimated to comprise 0.2% of nuclear DNA. Considering that eccDNA can be integrated into chromosomes, germline-derived nscDNA may be a mediator of genome diversity.
Asunto(s)
ADN Circular , Células Germinativas , Animales , Cromosomas , ADN , Variaciones en el Número de Copia de ADN , Genoma Humano , Células HeLa , Humanos , Masculino , Ratones , Ratones Endogámicos C57BL , Duplicaciones Segmentarias en el Genoma , EspermatozoidesRESUMEN
The human genome contains hundreds of large, structurally diverse blocks that are insufficiently represented in the reference genome and are thus not amenable to genomic analyses. Structural diversity in the human population suggests that these blocks are unstable in the germline; however, whether or not these blocks are also unstable in the cancer genome remains elusive. Here we report that the 500 kb block called KRTAP_region_1 (KRTAP-1) on 17q12-21 recurrently demarcates the amplicon of the ERBB2 (HER2) oncogene in breast tumors. KRTAP-1 carries numerous tandemly-duplicated segments that exhibit diversity within the human population. We evaluated the fragility of the block by cytogenetically measuring the distances between the flanking regions and found that spontaneous distance outliers (i.e DNA breaks) appear more frequently at KRTAP-1 than at the representative common fragile site (CFS) FRA16D. Unlike CFSs, KRTAP-1 is not sensitive to aphidicolin. The exonuclease activity of DNA repair protein Mre11 protects KRTAP-1 from breaks, whereas CtIP does not. Breaks at KRTAP-1 lead to the palindromic duplication of the ERBB2 locus and trigger Breakage-Fusion-Bridge cycles. Our results indicate that an insufficiently investigated area of the human genome is fragile and could play a crucial role in cancer genome evolution.