ABSTRACT
Human papillomavirus (HPV) causes 5% of all cancers and frequently integrates into host chromosomes. The HPV oncoproteins E6 and E7 are necessary but insufficient for cancer formation, indicating that additional secondary genetic events are required. Here, we investigate potential oncogenic impacts of virus integration. Analysis of 105 HPV-positive oropharyngeal cancers by whole-genome sequencing detects virus integration in 77%, revealing five statistically significant sites of recurrent integration near genes that regulate epithelial stem cell maintenance (i.e., SOX2, TP63, FGFR, MYC) and immune evasion (i.e., CD274). Genomic copy number hyperamplification is enriched 16-fold near HPV integrants, and the extent of focal host genomic instability increases with their local density. The frequency of genes expressed at extreme outlier levels is increased 86-fold within ±150 kb of integrants. Across 95% of tumors with integration, host gene transcription is disrupted via intragenic integrants, chimeric transcription, outlier expression, gene breaking, and/or de novo expression of noncoding or imprinted genes. We conclude that virus integration can contribute to carcinogenesis in a large majority of HPV-positive oropharyngeal cancers by inducing extensive disruption of host genome structure and gene expression.
Subject(s)
Alphapapillomavirus , Oncogene Proteins, Viral , Oropharyngeal Neoplasms , Alphapapillomavirus/metabolism , Carcinogenesis , Humans , Oncogene Proteins, Viral/genetics , Oropharyngeal Neoplasms/genetics , Papillomaviridae/genetics , Papillomaviridae/metabolism , Papillomavirus E7 Proteins/genetics , Papillomavirus E7 Proteins/metabolism , Virus Integration/geneticsABSTRACT
Human papillomavirus (HPV) is a necessary but insufficient cause of a subset of oral squamous cell carcinomas (OSCCs) that is increasing markedly in frequency. To identify contributory, secondary genetic alterations in these cancers, we used comprehensive genomics methods to compare 149 HPV-positive and 335 HPV-negative OSCC tumor/normal pairs. Different behavioral risk factors underlying the two OSCC types were reflected in distinctive genomic mutational signatures. In HPV-positive OSCCs, the signatures of APOBEC cytosine deaminase editing, associated with anti-viral immunity, were strongly linked to overall mutational burden. In contrast, in HPV-negative OSCCs, T>C substitutions in the sequence context 5'-ATN-3' correlated with tobacco exposure. Universal expression of HPV E6*1 and E7 oncogenes was a sine qua non of HPV-positive OSCCs. Significant enrichment of somatic mutations was confirmed or newly identified in PIK3CA, KMT2D, FGFR3, FBXW7, DDX3X, PTEN, TRAF3, RB1, CYLD, RIPK4, ZNF750, EP300, CASZ1, TAF5, RBL1, IFNGR1, and NFKBIA Of these, many affect host pathways already targeted by HPV oncoproteins, including the p53 and pRB pathways, or disrupt host defenses against viral infections, including interferon (IFN) and nuclear factor kappa B signaling. Frequent copy number changes were associated with concordant changes in gene expression. Chr 11q (including CCND1) and 14q (including DICER1 and AKT1) were recurrently lost in HPV-positive OSCCs, in contrast to their gains in HPV-negative OSCCs. High-ranking variant allele fractions implicated ZNF750, PIK3CA, and EP300 mutations as candidate driver events in HPV-positive cancers. We conclude that virus-host interactions cooperatively shape the unique genetic features of these cancers, distinguishing them from their HPV-negative counterparts.
Subject(s)
Carcinoma, Squamous Cell , Mouth Neoplasms , Neoplasm Proteins , Oncogene Proteins, Viral , Papillomavirus Infections , Carcinoma, Squamous Cell/genetics , Carcinoma, Squamous Cell/metabolism , Carcinoma, Squamous Cell/pathology , Carcinoma, Squamous Cell/virology , Female , Humans , Male , Mouth Neoplasms/genetics , Mouth Neoplasms/metabolism , Mouth Neoplasms/pathology , Mouth Neoplasms/virology , Mutation , Neoplasm Proteins/biosynthesis , Neoplasm Proteins/genetics , Oncogene Proteins, Viral/biosynthesis , Oncogene Proteins, Viral/genetics , Papillomaviridae/genetics , Papillomaviridae/metabolismABSTRACT
Copy number variation (CNV) is a common chromosomal alteration that can occur during in vitro cultivation of human cells and can be accompanied by the accumulation of mutations in coding region sequences. We describe here a systematic application of current molecular technologies to provide a detailed understanding of genomic and sequence profiles of human embryonic stem cell (hESC) lines that were derived under GMP-compliant conditions. We first examined the overall chromosomal integrity using cytogenetic techniques to determine chromosome count, and to detect the presence of cytogenetically aberrant cells in the culture (mosaicism). Assays of copy number variation, using both microarray and sequence-based analyses, provide a detailed view genomic variation in these lines and shows that in early passage cultures of these lines, the size range and distribution of CNVs are entirely consistent with those seen in the genomes of normal individuals. Similarly, genome sequencing shows variation within these lines that is completely within the range seen in normal genomes. Important gene classes, such as tumor suppressors and genetic disease genes, do not display overtly disruptive mutations that could affect the overall safety of cell-based therapeutics. Complete sequence also allows the analysis of important transplantation antigens, such as ABO and HLA types. The combined application of cytogenetic and molecular technologies provides a detailed understanding of genomic and sequence profiles of GMP produced ES lines for potential use as therapeutic agents.