Your browser doesn't support javascript.
loading
: 20 | 50 | 100
1 - 20 de 951
1.
BMC Bioinformatics ; 25(1): 177, 2024 May 04.
Article En | MEDLINE | ID: mdl-38704528

BACKGROUND: Hepatitis B virus (HBV) integrates into human chromosomes and can lead to genomic instability and hepatocarcinogenesis. Current tools for HBV integration site detection lack accuracy and stability. RESULTS: This study proposes a deep learning-based method, named ViroISDC, for detecting integration sites. ViroISDC generates corresponding grammar rules and encodes the characteristics of the language data to predict integration sites accurately. Compared with Lumpy, Pindel, Seeksv, and SurVirus, ViroISDC exhibits better overall performance and is less sensitive to sequencing depth and integration sequence length, displaying good reliability, stability, and generality. Further downstream analysis of integrated sites detected by ViroISDC reveals the integration patterns and features of HBV. It is observed that HBV integration exhibits specific chromosomal preferences and tends to integrate into cancerous tissue. Moreover, HBV integration frequency was higher in males than females, and high-frequency integration sites were more likely to be present on hepatocarcinogenesis- and anti-cancer-related genes, validating the reliability of the ViroISDC. CONCLUSIONS: ViroISDC pipeline exhibits superior precision, stability, and reliability across various datasets when compared to similar software. It is invaluable in exploring HBV infection in the human body, holding significant implications for the diagnosis, treatment, and prognosis assessment of HCC.


Hepatitis B virus , Virus Integration , Hepatitis B virus/genetics , Humans , Virus Integration/genetics , Software , Deep Learning , Male , Female , Hepatitis B/genetics , Hepatitis B/virology , Liver Neoplasms/genetics , Liver Neoplasms/virology , Computational Biology/methods
2.
mBio ; 15(5): e0072924, 2024 May 08.
Article En | MEDLINE | ID: mdl-38624210

The integration of HPV DNA into human chromosomes plays a pivotal role in the onset of papillomavirus-related cancers. HPV DNA integration often occurs by linearizing the viral DNA in the E1/E2 region, resulting in the loss of a critical viral early polyadenylation signal (PAS), which is essential for the polyadenylation of the E6E7 bicistronic transcripts and for the expression of the viral E6 and E7 oncogenes. Here, we provide compelling evidence that, despite the presence of numerous integrated viral DNA copies, virus-host fusion transcripts originate from only a single integrated HPV DNA in HPV16 and HPV18 cervical cancers and cervical cancer-derived cell lines. The host genomic elements neighboring the integrated HPV DNA are critical for the efficient expression of the viral oncogenes that leads to clonal cell expansion. The fusion RNAs that are produced use a host RNA polyadenylation signal downstream of the integration site, and almost all involve splicing to host sequences. In cell culture, siRNAs specifically targeting the host portion of the virus-host fusion transcripts effectively silenced viral E6 and E7 expression. This, in turn, inhibited cell growth and promoted cell senescence in HPV16+ CaSki and HPV18+ HeLa cells. Showing that HPV E6 and E7 expression from a single integration site is instrumental in clonal cell expansion sheds new light on the mechanisms of HPV-induced carcinogenesis and could be used for the development of precision medicine tailored to combat HPV-related malignancies. IMPORTANCE: Persistent oncogenic HPV infections lead to viral DNA integration into the human genome and the development of cervical, anogenital, and oropharyngeal cancers. The expression of the viral E6 and E7 oncogenes plays a key role in cell transformation and tumorigenesis. However, how E6 and E7 could be expressed from the integrated viral DNA which often lacks a viral polyadenylation signal in the cancer cells remains unknown. By analyzing the integrated HPV DNA sites and expressed HPV RNAs in cervical cancer tissues and cell lines, we show that HPV oncogenes are expressed from only one of multiple chromosomal HPV DNA integrated copies. A host polyadenylation signal downstream of the integrated viral DNA is used for polyadenylation and stabilization of the virus-host chimeric RNAs, making the oncogenic transcripts targetable by siRNAs. This observation provides further understanding of the tumorigenic mechanism of HPV integration and suggests possible therapeutic strategies for the development of precision medicine for HPV cancers.


DNA, Viral , Oncogene Proteins, Viral , Papillomavirus Infections , Uterine Cervical Neoplasms , Virus Integration , Humans , Female , Uterine Cervical Neoplasms/virology , Uterine Cervical Neoplasms/genetics , Virus Integration/genetics , Oncogene Proteins, Viral/genetics , Oncogene Proteins, Viral/metabolism , Papillomavirus Infections/virology , Papillomavirus Infections/genetics , DNA, Viral/genetics , Human papillomavirus 16/genetics , Human papillomavirus 18/genetics , Cell Line, Tumor , Oncogenes/genetics , Polyadenylation
3.
J Med Virol ; 96(4): e29614, 2024 Apr.
Article En | MEDLINE | ID: mdl-38647071

The clearance or transcriptional silencing of integrated HBV DNA is crucial for achieving a functional cure in patients with chronic hepatitis B and reducing the risk of hepatocellular carcinoma development. The PLC/PRF/5 cell line is commonly used as an in vitro model for studying HBV integration. In this study, we employed a range of multi-omics techniques to gain a panoramic understanding of the characteristics of HBV integration in PLC/PRF/5 cells and to reveal the transcriptional regulatory mechanisms of integrated HBV DNA. Transcriptome long-read sequencing (ONT) was conducted to analyze and characterize the transcriptional activity of different HBV DNA integration sites in PLC/PRF/5 cells. Additionally, we collected data related to epigenetic regulation, including whole-genome bisulfite sequencing (WGBS), histone chromatin immunoprecipitation sequencing (ChIP-seq), and assays for transposase-accessible chromatin using sequencing (ATAC-seq), to explore the potential mechanisms involved in the transcriptional regulation of integrated HBV DNA. Long-read RNA sequencing analysis revealed significant transcriptional differences at various integration sites in the PLC/PRF/5 cell line, with higher HBV DNA transcription levels at integration sites on chr11, chr13, and the chr13/chr5 fusion chromosome t (13:5). Combining long-read DNA and RNA sequencing results, we found that transcription of integrated HBV DNA generally starts downstream of the SP1, SP2, or XP promoters. ATAC-seq data confirmed that chromatin accessibility has limited influence on the transcription of integrated HBV DNA in the PLC/PRF/5 cell line. Analysis of WGBS data showed that the methylation intensity of integrated HBV DNA was highly negatively correlated with its transcription level (r = -0.8929, p = 0.0123). After AzaD treatment, the transcription level of integrated HBV DNA significantly increased, especially for the integration chr17, which had the highest level of methylation. Through ChIP-seq data, we observed the association between histone modification of H3K4me3 and H3K9me3 with the transcription of integrated HBV DNA. Our findings suggest that the SP1, SP2 and XP in integrated HBV DNA, methylation level of surrounding host chromosome, and histone modifications affect the transcription of integrated HBV DNA in PLC/PRF/5 cells. This provides important clues for future studies on the expression and regulatory mechanisms of integrated HBV.


Epigenesis, Genetic , Hepatitis B virus , Virus Integration , Humans , Hepatitis B virus/genetics , Hepatitis B virus/physiology , Virus Integration/genetics , DNA, Viral/genetics , Transcription, Genetic , Cell Line , DNA Methylation , Cell Line, Tumor , Histones/genetics , Histones/metabolism , Multiomics
4.
BMC Genomics ; 25(1): 198, 2024 Feb 20.
Article En | MEDLINE | ID: mdl-38378450

BACKGROUND: Cervical cancer (CC) causes more than 311,000 deaths annually worldwide. The integration of human papillomavirus (HPV) is a crucial genetic event that contributes to cervical carcinogenesis. Despite HPV DNA integration is known to disrupt the genomic architecture of both the host and viral genomes in CC, the complexity of this process remains largely unexplored. RESULTS: In this study, we conducted whole-genome sequencing (WGS) at 55-65X coverage utilizing the PacBio long-read sequencing platform in SiHa and HeLa cells, followed by comprehensive analyses of the sequence data to elucidate the complexity of HPV integration. Firstly, our results demonstrated that PacBio long-read sequencing effectively identifies HPV integration breakpoints with comparable accuracy to targeted-capture Next-generation sequencing (NGS) methods. Secondly, we constructed detailed models of complex integrated genome structures that included both the HPV genome and nearby regions of the human genome by utilizing PacBio long-read WGS. Thirdly, our sequencing results revealed the occurrence of a wide variety of genome-wide structural variations (SVs) in SiHa and HeLa cells. Additionally, our analysis further revealed a potential correlation between changes in gene expression levels and SVs on chromosome 13 in the genome of SiHa cells. CONCLUSIONS: Using PacBio long-read sequencing, we have successfully constructed complex models illustrating HPV integrated genome structures in SiHa and HeLa cells. This accomplishment serves as a compelling demonstration of the valuable capabilities of long-read sequencing in detecting and characterizing HPV genomic integration structures within human cells. Furthermore, these findings offer critical insights into the complex process of HPV16 and HPV18 integration and their potential contribution to the development of cervical cancer.


Papillomavirus Infections , Uterine Cervical Neoplasms , Female , Humans , Uterine Cervical Neoplasms/genetics , HeLa Cells , Papillomavirus Infections/genetics , DNA , Genomics , Virus Integration/genetics
5.
Gene ; 896: 148060, 2024 Feb 20.
Article En | MEDLINE | ID: mdl-38048968

Lentivirus containing simian virus 40 large T antigen (SV40T) is routinely used to induce cell immortalization. However, the roles of viral integration itself in this progress is still controversial. Here, we transformed primary mouse embryonic fibroblasts (MEFs) with SV40T lentivirus and studied the roles of viral integration in the immortalization using RNA sequencing (RNA-seq) and whole genome sequencing (WGS). During the immortalization, differentially expressed genes (DGEs) are enriched in viral infection and several diverse activities. However, DEGs between immortalized and aging cells are significantly enriched in DNA/chromosome- and extracellular matrix (ECM)-associated activities. Gene regulatory network (GRN) analysis shows that although p53 is a key regulatory factor, many other transcription factors also play critical roles in the process, like STAT1. Of these DEGs, 32 genes have viral integration in their coding and/or regulatory regions. Our findings suggest that viral integration may promote SV40T-mediated immortalization by disturbing the expression of DNA/chromosome- and ECM-associated genes.


DNA , Fibroblasts , Animals , Mice , Extracellular Matrix/genetics , Chromosomes , Virus Integration/genetics
6.
BMC Cancer ; 23(1): 1052, 2023 Nov 02.
Article En | MEDLINE | ID: mdl-37914994

OBJECTIVE: To detect the HPV genotype and integration sites in patients with high-risk HPV infection at different stages of photodynamic therapy using nanopore technology and to evaluate the treatment effect. METHODS: Four patients with HPV infection were selected and subjected to photodynamic therapy, and cervical exfoliated cell was sampled at before treatment, after three courses of treatment and six courses of treatment, their viral abundance and insertion sites were analyzed by nanopore technology, and pathological examinations were performed before and after treatment. In this study, we developed a novel assay that combined viral sequence enrichment and Nanopore sequencing for identification of HPV genotype and integration sites at once. The assay has obvious advantages over qPCR or NGS-based methods, as it has better sensitivity after viral sequences enrichment and can generate long-reads (kb to Mb) for better detection rate of structure variations, moreover, fast turn-around time for real-time viral sequencing and analysis. RESULTS: The pathological grade was reduced in all four patients after photodynamic therapy. Virus has been cleared in two cases after treatment, the virus amount reduced after treatment but not completely cleared in one case, and two type viruses were cleared and one type virus persisted after treatment in the last patient with multiple infection. Viral abundance and the number of integration sites were positively correlated. Gene enrichment analysis showed complete viral clearance in 1 patient and 3 patients required follow-up. CONCLUSION: Nanopore sequencing can effectively monitor the abundance of HPV viruses and integration sites to show the presence status of viruses, and combined with the results of gene enrichment analysis, the treatment effect can be dynamically assessed.


Nanopore Sequencing , Papillomavirus Infections , Photochemotherapy , Uterine Cervical Neoplasms , Female , Humans , Uterine Cervical Neoplasms/pathology , DNA, Viral/genetics , DNA, Viral/analysis , Virus Integration/genetics
7.
Hum Gene Ther ; 34(21-22): 1081-1094, 2023 Nov.
Article En | MEDLINE | ID: mdl-37930949

Integration of naturally occurring adeno-associated viruses (AAV; wild-type AAV [wtAAV]) and those used in gene therapy (recombinant AAV [rAAV]) into host genomic DNA has been documented for over two decades. Results from mouse and dog studies have raised concerns of insertional mutagenesis and clonal expansion following AAV exposure, particularly in the context of gene therapy. This study aimed to characterize the genomic location, abundance, and expansion of wtAAV and rAAV integrations in macaque and human genomes. Using an unbiased, next-generation sequencing-based approach, we identified the genome-wide integration loci in tissue samples (primarily liver) in 168 nonhuman primates (NHPs) and 85 humans naïve to rAAV exposure and 86 NHPs treated with rAAV in preclinical studies. Our results suggest that rAAV and wtAAV integrations exhibit similar, broad distribution patterns across species, with a higher frequency in genomic regions highly vulnerable to DNA damage or close to highly transcribed genes. rAAV exhibited a higher abundance of unique integration loci, whereas wtAAV integration loci were associated with greater clonal expansion. This expansive and detailed characterization of AAV integration in NHPs and humans provides key translational insights, with important implications for the safety of rAAV as a gene therapy vector.


Dependovirus , Macaca , Animals , Humans , Dependovirus/genetics , Genetic Therapy , Genetic Vectors/genetics , Liver , Macaca/genetics , Virus Integration/genetics
8.
J Virol ; 97(10): e0071623, 2023 10 31.
Article En | MEDLINE | ID: mdl-37737586

IMPORTANCE: Marek's disease virus (MDV) is a ubiquitous chicken pathogen that inflicts a large economic burden on the poultry industry, despite worldwide vaccination programs. MDV is only partially controlled by available vaccines, and the virus retains the ability to replicate and spread between vaccinated birds. Following an initial infection, MDV enters a latent state and integrates into host telomeres and this may be a prerequisite for malignant transformation, which is usually fatal. To understand the mechanism that underlies the dynamic relationship between integrated-latent and reactivated MDV, we have characterized integrated MDV (iMDV) genomes and their associated telomeres. This revealed a single orientation among iMDV genomes and the loss of some terminal sequences that is consistent with integration by homology-directed recombination and excision via a telomere-loop-mediated process.


Chickens , Genome, Viral , Herpesvirus 2, Gallid , Homologous Recombination , Marek Disease , Telomere , Virus Integration , Animals , Chickens/virology , Genome, Viral/genetics , Herpesvirus 2, Gallid/genetics , Marek Disease/genetics , Marek Disease/virology , Poultry Diseases/genetics , Poultry Diseases/virology , Telomere/genetics , Viral Vaccines/immunology , Virus Activation , Virus Latency , Virus Integration/genetics
9.
Commun Biol ; 6(1): 684, 2023 07 03.
Article En | MEDLINE | ID: mdl-37400627

Hepatitis B virus (HBV) may integrate into the genome of infected cells and contribute to hepatocarcinogenesis. However, the role of HBV integration in hepatocellular carcinoma (HCC) development remains unclear. In this study, we apply a high-throughput HBV integration sequencing approach that allows sensitive identification of HBV integration sites and enumeration of integration clones. We identify 3339 HBV integration sites in paired tumour and non-tumour tissue samples from 7 patients with HCC. We detect 2107 clonally expanded integrations (1817 in tumour and 290 in non-tumour tissues), and a significant enrichment of clonal HBV integrations in mitochondrial DNA (mtDNA) preferentially occurring in the oxidative phosphorylation genes (OXPHOS) and D-loop region. We also find that HBV RNA sequences are imported into the mitochondria of hepatoma cells with the involvement of polynucleotide phosphorylase (PNPASE), and that HBV RNA might have a role in the process of HBV integration into mtDNA. Our results suggest a potential mechanism by which HBV integration may contribute to HCC development.


Carcinoma, Hepatocellular , Liver Neoplasms , Humans , Hepatitis B virus/genetics , Carcinoma, Hepatocellular/genetics , Carcinoma, Hepatocellular/pathology , Liver Neoplasms/genetics , Liver Neoplasms/pathology , DNA, Mitochondrial/genetics , Virus Integration/genetics , Mitochondria/genetics
10.
Genome Res ; 33(8): 1395-1408, 2023 08.
Article En | MEDLINE | ID: mdl-37463751

A weak palindromic nucleotide motif is the hallmark of retroviral integration site alignments. Given that the majority of target sequences are not palindromic, the current model explains the symmetry by an overlap of the nonpalindromic motif present on one of the half-sites of the sequences. Here, we show that the implementation of multicomponent mixture models allows for different interpretations consistent with the existence of both palindromic and nonpalindromic submotifs in the sets of integration site sequences. We further show that the weak palindromic motifs result from freely combined site-specific submotifs restricted to only a few positions proximal to the site of integration. The submotifs are formed by either palindrome-forming nucleotide preference or nucleotide exclusion. Using the mixture models, we also identify HIV-1-favored palindromic sequences in Alu repeats serving as local hotspots for integration. The application of the novel statistical approach provides deeper insight into the selection of retroviral integration sites and may prove to be a valuable tool in the analysis of any type of DNA motifs.


Nucleotides , Virus Integration , Virus Integration/genetics , Nucleotide Motifs
11.
Med ; 4(6): 347-352, 2023 Jun 09.
Article En | MEDLINE | ID: mdl-37301195

The majority of oncogenic viruses are capable of integrating into the host genome, posing significant challenges to clinical control. Recent conceptual and technological advances, however, offer promising clinical applications. Here, we summarize the advances in our understanding of oncogenic viral integration, their clinical relevance, and the future perspectives.


Genome , Oncogenic Viruses , Oncogenic Viruses/genetics , Virus Integration/genetics
12.
Nucleic Acids Res ; 51(9): 4237-4251, 2023 05 22.
Article En | MEDLINE | ID: mdl-36864748

Human papillomavirus (HPV) integration is a critical step in cervical cancer development; however, the oncogenic mechanism at the genome-wide transcriptional level is still poorly understood. In this study, we employed integrative analysis on multi-omics data of six HPV-positive and three HPV-negative cell lines. Through HPV integration detection, super-enhancer (SE) identification, SE-associated gene expression and extrachromosomal DNA (ecDNA) investigation, we aimed to explore the genome-wide transcriptional influence of HPV integration. We identified seven high-ranking cellular SEs generated by HPV integration in total (the HPV breakpoint-induced cellular SEs, BP-cSEs), leading to intra-chromosomal and inter-chromosomal regulation of chromosomal genes. The pathway analysis revealed that the dysregulated chromosomal genes were correlated to cancer-related pathways. Importantly, we demonstrated that BP-cSEs existed in the HPV-human hybrid ecDNAs, explaining the above transcriptional alterations. Our results suggest that HPV integration generates cellular SEs that function as ecDNA to regulate unconstrained transcription, expanding the tumorigenic mechanism of HPV integration and providing insights for developing new diagnostic and therapeutic strategies.


DNA , Enhancer Elements, Genetic , Genome, Human , Human Papillomavirus Viruses , Papillomavirus Infections , Transcription, Genetic , Uterine Cervical Neoplasms , Virus Integration , Female , Humans , Human Papillomavirus Viruses/genetics , Papillomavirus Infections/genetics , Papillomavirus Infections/virology , Uterine Cervical Neoplasms/genetics , Uterine Cervical Neoplasms/pathology , Uterine Cervical Neoplasms/virology , Virus Integration/genetics , Enhancer Elements, Genetic/genetics , DNA/genetics , DNA/metabolism , Genome, Human/genetics , Carcinogenesis , Chromosome Breakpoints , Chromosomes, Human/genetics
13.
Cell Rep ; 42(2): 112110, 2023 02 28.
Article En | MEDLINE | ID: mdl-36790927

HIV-1 encounters the hierarchically organized host chromatin to stably integrate and persist in anatomically distinct latent reservoirs. The contribution of genome organization in HIV-1 infection has been largely understudied across different HIV-1 targets. Here, we determine HIV-1 integration sites (ISs), associate them with chromatin and expression signatures at different genomic scales in a microglia cell model, and profile them together with the primary T cell reservoir. HIV-1 insertions into introns of actively transcribed genes with IS hotspots in genic and super-enhancers, characteristic of blood cells, are maintained in the microglia cell model. Genome organization analysis reveals dynamic CCCTC-binding factor (CTCF) clusters in cells with active and repressed HIV-1 transcription, whereas CTCF removal impairs viral integration. We identify CTCF-enriched topologically associated domain (TAD) boundaries with signatures of transcriptionally active chromatin as HIV-1 integration determinants in microglia and CD4+ T cells, highlighting the importance of host genome organization in HIV-1 infection.


HIV-1 , HIV-1/genetics , HIV-1/metabolism , Microglia/metabolism , CCCTC-Binding Factor/metabolism , Chromatin , Genomics , Virus Integration/genetics
14.
Genomics Proteomics Bioinformatics ; 21(2): 300-310, 2023 04.
Article En | MEDLINE | ID: mdl-36804047

Integration of oncogenic DNA viruses into the human genome is a key step in most virus-induced carcinogenesis. Here, we constructed a virus integration site (VIS) Atlas database, an extensive collection of integration breakpoints for three most prevalent oncoviruses, human papillomavirus, hepatitis B virus, and Epstein-Barr virus based on the next-generation sequencing (NGS) data, literature, and experimental data. There are 63,179 breakpoints and 47,411 junctional sequences with full annotations deposited in the VIS Atlas database, comprising 47 virus genotypes and 17 disease types. The VIS Atlas database provides (1) a genome browser for NGS breakpoint quality check, visualization of VISs, and the local genomic context; (2) a novel platform to discover integration patterns; and (3) a statistics interface for a comprehensive investigation of genotype-specific integration features. Data collected in the VIS Atlas aid to provide insights into virus pathogenic mechanisms and the development of novel antitumor drugs. The VIS Atlas database is available at https://www.vis-atlas.tech/.


Epstein-Barr Virus Infections , Humans , Epstein-Barr Virus Infections/genetics , Genome, Human , Herpesvirus 4, Human/genetics , Carcinogenesis/genetics , High-Throughput Nucleotide Sequencing , Virus Integration/genetics
15.
Nat Commun ; 14(1): 16, 2023 01 10.
Article En | MEDLINE | ID: mdl-36627271

APOBEC3 (A3) proteins are host-encoded deoxycytidine deaminases that provide an innate immune barrier to retroviral infection, notably against HIV-1. Low levels of deamination are believed to contribute to the genetic evolution of HIV-1, while intense catalytic activity of these proteins can induce catastrophic hypermutation in proviral DNA leading to near-total HIV-1 restriction. So far, little is known about how A3 cytosine deaminases might impact HIV-1 proviral DNA integration sites in human chromosomal DNA. Using a deep sequencing approach, we analyze the influence of catalytic active and inactive APOBEC3F and APOBEC3G on HIV-1 integration site selections. Here we show that DNA editing is detected at the extremities of the long terminal repeat regions of the virus. Both catalytic active and non-catalytic A3 mutants decrease insertions into gene coding sequences and increase integration sites into SINE elements, oncogenes and transcription-silencing non-B DNA features. Our data implicates A3 as a host factor influencing HIV-1 integration site selection and also promotes what appears to be a more latent expression profile.


HIV Infections , HIV-1 , Humans , Cytidine Deaminase/genetics , Cytidine Deaminase/metabolism , HIV-1/genetics , HIV-1/metabolism , APOBEC-3G Deaminase/metabolism , Cytosine Deaminase/genetics , Cytosine Deaminase/metabolism , Proteins/metabolism , Anti-Retroviral Agents , Virus Integration/genetics , Cytidine/metabolism , APOBEC Deaminases/genetics , APOBEC Deaminases/metabolism
16.
Cancer Discov ; 13(4): 910-927, 2023 04 03.
Article En | MEDLINE | ID: mdl-36715691

The human papillomavirus (HPV) genome is integrated into host DNA in most HPV-positive cancers, but the consequences for chromosomal integrity are unknown. Continuous long-read sequencing of oropharyngeal cancers and cancer cell lines identified a previously undescribed form of structural variation, "heterocateny," characterized by diverse, interrelated, and repetitive patterns of concatemerized virus and host DNA segments within a cancer. Unique breakpoints shared across structural variants facilitated stepwise reconstruction of their evolution from a common molecular ancestor. This analysis revealed that virus and virus-host concatemers are unstable and, upon insertion into and excision from chromosomes, facilitate capture, amplification, and recombination of host DNA and chromosomal rearrangements. Evidence of heterocateny was detected in extrachromosomal and intrachromosomal DNA. These findings indicate that heterocateny is driven by the dynamic, aberrant replication and recombination of an oncogenic DNA virus, thereby extending known consequences of HPV integration to include promotion of intratumoral heterogeneity and clonal evolution. SIGNIFICANCE: Long-read sequencing of HPV-positive cancers revealed "heterocateny," a previously unreported form of genomic structural variation characterized by heterogeneous, interrelated, and repetitive genomic rearrangements within a tumor. Heterocateny is driven by unstable concatemerized HPV genomes, which facilitate capture, rearrangement, and amplification of host DNA, and promotes intratumoral heterogeneity and clonal evolution. See related commentary by McBride and White, p. 814. This article is highlighted in the In This Issue feature, p. 799.


Oropharyngeal Neoplasms , Papillomavirus Infections , Humans , Human Papillomavirus Viruses , Gene Rearrangement , Clonal Evolution/genetics , Virus Integration/genetics , Papillomaviridae/genetics
17.
Nat Commun ; 13(1): 5968, 2022 10 10.
Article En | MEDLINE | ID: mdl-36216793

Small cell cervical carcinoma (SCCC) is a rare but aggressive malignancy. Here, we report human papillomavirus features and genomic landscape in SCCC via high-throughput HPV captured sequencing, whole-genome sequencing, whole-transcriptome sequencing, and OncoScan microarrays. HPV18 infections and integrations are commonly detected. Besides MYC family genes (37.9%), we identify SOX (8.4%), NR4A (6.3%), ANKRD (7.4%), and CEA (3.2%) family genes as HPV-integrated hotspots. We construct the genomic local haplotype around HPV-integrated sites, and find tandem duplications and amplified HPV long control regions (LCR). We propose three prominent HPV integration patterns: duplicating oncogenes (MYCN, MYC, and NR4A2), forming fusions (FGFR3-TACC3 and ANKRD12-NDUFV2), and activating genes (MYC) via the cis-regulations of viral LCRs. Moreover, focal CNA amplification peaks harbor canonical cancer genes including the HPV-integrated hotspots within MYC family, SOX2, and others. Our findings may provide potential molecular criteria for the accurate diagnosis and efficacious therapies for this lethal disease.


Alphapapillomavirus , Carcinoma, Small Cell , Papillomavirus Infections , Uterine Cervical Neoplasms , Female , Humans , Microtubule-Associated Proteins/genetics , N-Myc Proto-Oncogene Protein/genetics , Nuclear Proteins/genetics , Papillomaviridae/genetics , Uterine Cervical Neoplasms/pathology , Virus Integration/genetics
18.
J Virol ; 96(18): e0101122, 2022 09 28.
Article En | MEDLINE | ID: mdl-36094316

HIV-1 DNA is preferentially integrated into chromosomal hot spots by the preintegration complex (PIC). To understand the mechanism, we measured the DNA integration activity of PICs-extracted from infected cells-and intasomes, biochemically assembled PIC substructures using a number of relevant target substrates. We observed that PIC-mediated integration into human chromatin is preferred compared to genomic DNA. Surprisingly, nucleosomes lacking histone modifications were not preferred integration compared to the analogous naked DNA. Nucleosomes containing the trimethylated histone 3 lysine 36 (H3K36me3), an epigenetic mark linked to active transcription, significantly stimulated integration, but the levels remained lower than the naked DNA. Notably, H3K36me3-modified nucleosomes with linker DNA optimally supported integration mediated by the PIC but not by the intasome. Interestingly, optimal intasome-mediated integration required the cellular cofactor LEDGF. Unexpectedly, LEDGF minimally affected PIC-mediated integration into naked DNA but blocked integration into nucleosomes. The block for the PIC-mediated integration was significantly relieved by H3K36me3 modification. Mapping the integration sites in the preferred substrates revealed that specific features of the nucleosome-bound DNA are preferred for integration, whereas integration into naked DNA was random. Finally, biochemical and genetic studies demonstrate that DNA condensation by the H1 protein dramatically reduces integration, providing further evidence that features inherent to the open chromatin are preferred for HIV-1 integration. Collectively, these results identify the optimal target substrate for HIV-1 integration, report a mechanistic link between H3K36me3 and integration preference, and importantly, reveal distinct mechanisms utilized by the PIC for integration compared to the intasomes. IMPORTANCE HIV-1 infection is dependent on integration of the viral DNA into the host chromosomes. The preintegration complex (PIC) containing the viral DNA, the virally encoded integrase (IN) enzyme, and other viral/host factors carries out HIV-1 integration. HIV-1 integration is not dependent on the target DNA sequence, and yet the viral DNA is selectively inserted into specific "hot spots" of human chromosomes. A growing body of literature indicates that structural features of the human chromatin are important for integration targeting. However, the mechanisms that guide the PIC and enable insertion of the PIC-associated viral DNA into specific hot spots of the human chromosomes are not fully understood. In this study, we describe a biochemical mechanism for the preference of the HIV-1 DNA integration into open chromatin. Furthermore, our study defines a direct role for the histone epigenetic mark H3K36me3 in HIV-1 integration preference and identify an optimal substrate for HIV-1 PIC-mediated viral DNA integration.


Chromosomes, Human , HIV-1 , Histone Code , Histones , Nucleosomes , Virus Integration , Chromatin/metabolism , Chromosomes, Human/virology , DNA, Viral/genetics , DNA, Viral/metabolism , HIV Infections/virology , HIV Integrase/genetics , HIV Integrase/metabolism , HIV-1/genetics , Histones/chemistry , Histones/metabolism , Humans , Lysine/genetics , Methylation , Nucleosomes/genetics , Nucleosomes/metabolism , Nucleosomes/virology , Virus Integration/genetics
19.
Hepatol Int ; 16(6): 1339-1352, 2022 Dec.
Article En | MEDLINE | ID: mdl-36123506

BACKGROUND: Integration of HBV DNA into the human genome could progressively contribute to hepatocarcinogenesis. Both intrahepatic cholangiocarcinoma (ICC) and combined hepatocellular-cholangiocarcinoma (CHC) are known to be associated with HBV infection. However, the integration of HBV and mechanism of HBV-induced carcinogenesis in ICC and CHC remains unclear. METHODS: 41 patients with ICC and 20 patients with CHC were recruited in the study. We conducted HIVID analysis on these 61 samples to identify HBV integration sites in both the tumor tissues and adjacent non-tumor liver tissues. To further explore the effect of HBV integration on gene alteration, we selected paired tumors and adjacent non-tumor liver tissues from 3 ICC and 4 CHC patients for RNA-seq and WGS. RESULTS: We detected 493 HBV integration sites in ICC patients, of which 417 were from tumor samples and 76 were from non-tumor samples. And 246 HBV integration sites were detected in CHC patients, of which 156 were located in the genome of tumor samples and 90 were in non-tumor samples. Recurrent HBV integration events were detected in ICC including TERT, ZMAT4, MET, ANKFN1, PLXNB2, and in CHC like TERT, ALKBH5. Together with our established data of HBV-infected hepatocellular carcinoma, we found that HBV preferentially integrates into the specific regions which may affect the gene expression and regulation in cells and involved in carcinogenesis. We further performed genomic and transcriptomic sequencing of three ICC and four CHC patients, and found that HBV fragments could integrate near some important oncogene like TERT, causing large-scale genome variations on nearby genomic sequences, and at the same time changing the expression level of the oncogenes. CONCLUSION: Comparative analysis demonstrates numerous newly discovered mutational events in ICC and CHC resulting from HBV insertions in the host genome. Our study provides an in-depth biological and clinical insights into HBV-induced ICC and CHC.


Bile Duct Neoplasms , Carcinoma, Hepatocellular , Cholangiocarcinoma , Liver Neoplasms , Humans , Carcinoma, Hepatocellular/genetics , Carcinoma, Hepatocellular/pathology , Hepatitis B virus/genetics , Liver Neoplasms/genetics , Liver Neoplasms/pathology , Cholangiocarcinoma/genetics , Virus Integration/genetics , Oncogenes , Carcinogenesis/genetics , Bile Duct Neoplasms/genetics , Bile Duct Neoplasms/pathology , Bile Ducts, Intrahepatic/metabolism , Bile Ducts, Intrahepatic/pathology
...