ABSTRACT
Somatic mutations in cancer genomes are caused by multiple mutational processes, each of which generates a characteristic mutational signature [1]. Here, as part of the Pan-Cancer Analysis of Whole Genomes (PCAWG) Consortium [2] of the International Cancer Genome Consortium (ICGC) and The Cancer Genome Atlas (TCGA), we characterized mutational signatures using 84,729,690 somatic mutations from 4,645 whole-genome and 19,184 exome sequences that encompass most types of cancer. We identified 49 single-base-substitution, 11 doublet-base-substitution, 4 clustered-base-substitution and 17 small insertion-and-deletion signatures. The substantial size of our dataset, compared with previous analyses [3-15], enabled the discovery of new signatures, the separation of overlapping signatures and the decomposition of signatures into components that may represent associated (but distinct) DNA damage, repair and/or replication mechanisms. By estimating the contribution of each signature to the mutational catalogues of individual cancer genomes, we revealed associations of signatures to exogenous or endogenous exposures, as well as to defective DNA-maintenance processes. However, many signatures are of unknown cause. This analysis provides a systematic perspective on the repertoire of mutational processes that contribute to the development of human cancer.
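The abstract above mentions estimating the contribution of each signature to an individual tumor's mutational catalogue. As a rough illustration only, the sketch below fits non-negative exposures by least squares using multiplicative updates. This is an assumed stand-in technique, not the method used by the authors, and the four-channel signatures `sig_a` and `sig_b` and the catalogue are invented toy data.

```python
def attribute_signatures(catalogue, signatures, iterations=2000):
    """Estimate non-negative exposures e so that sum_k e[k] * signatures[k]
    approximates the catalogue in the least-squares sense.

    Uses multiplicative updates, which keep every exposure non-negative.
    """
    n_sig = len(signatures)
    n_chan = len(catalogue)
    eps = 1e-12  # guards against division by zero
    # Start from an even split of the total mutation count.
    exposures = [sum(catalogue) / n_sig] * n_sig
    for _ in range(iterations):
        # Reconstructed catalogue under the current exposures.
        recon = [
            sum(exposures[k] * signatures[k][c] for k in range(n_sig))
            for c in range(n_chan)
        ]
        # Simultaneous update of all exposures from the same reconstruction.
        exposures = [
            exposures[k]
            * sum(signatures[k][c] * catalogue[c] for c in range(n_chan))
            / (sum(signatures[k][c] * recon[c] for c in range(n_chan)) + eps)
            for k in range(n_sig)
        ]
    return exposures


# Toy data (hypothetical): two signatures over four mutation channels.
sig_a = [0.7, 0.1, 0.1, 0.1]
sig_b = [0.1, 0.1, 0.1, 0.7]
# A catalogue built from 100 mutations of sig_a plus 50 of sig_b.
catalogue = [100 * a + 50 * b for a, b in zip(sig_a, sig_b)]

estimated = attribute_signatures(catalogue, [sig_a, sig_b])
```

With real data the catalogue would have many more channels, and the fit would not be exact, so the residual between the catalogue and its reconstruction is worth inspecting.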
Subjects
Mutation/genetics; Neoplasms/genetics; Age Factors; Base Sequence; Exome/genetics; Genome, Human/genetics; Humans; Sequence Analysis, DNA
ABSTRACT
Cisplatin reacts with DNA and thereby likely generates a characteristic pattern of somatic mutations, called a mutational signature. Despite widespread use of cisplatin in cancer treatment and its role in contributing to secondary malignancies, its mutational signature has not been delineated. We hypothesize that cisplatin's mutational signature can serve as a biomarker to identify cisplatin mutagenesis in suspected secondary malignancies. Knowledge of which tissues are at risk of developing cisplatin-induced secondary malignancies could lead to guidelines for noninvasive monitoring for secondary malignancies after cisplatin chemotherapy. We performed whole genome sequencing of 10 independent clones of cisplatin-exposed MCF-10A and HepG2 cells and delineated the patterns of single and dinucleotide mutations in terms of flanking sequence, transcription strand bias, and other characteristics. We used the mSigAct signature presence test and nonnegative matrix factorization to search for cisplatin mutagenesis in hepatocellular carcinomas and esophageal adenocarcinomas. All clones showed highly consistent patterns of single and dinucleotide substitutions. The proportion of dinucleotide substitutions was high: 8.1% of single nucleotide substitutions were part of dinucleotide substitutions, presumably due to cisplatin's propensity to form intra- and interstrand crosslinks between purine bases in DNA. We identified likely cisplatin exposure in nine hepatocellular carcinomas and three esophageal adenocarcinomas. All hepatocellular carcinomas for which clinical data were available and all esophageal cancers indeed had histories of cisplatin treatment. We experimentally delineated the single and dinucleotide mutational signature of cisplatin. This signature enabled us to detect previous cisplatin exposure in human hepatocellular carcinomas and esophageal adenocarcinomas with high confidence.
Subjects
Cisplatin/poisoning; DNA Mutational Analysis/methods; Exome Sequencing/methods; Mutation/drug effects; Adenocarcinoma/genetics; Antineoplastic Agents/poisoning; Carcinoma, Hepatocellular/genetics; Cell Line; Esophageal Neoplasms/genetics; Genome, Human/genetics; Hep G2 Cells; Humans; Liver Neoplasms/genetics; Mutagenesis/drug effects
ABSTRACT
Aflatoxin B1 (AFB1) is a mutagen and IARC (International Agency for Research on Cancer) Group 1 carcinogen that causes hepatocellular carcinoma (HCC). Here, we present the first whole-genome data on the mutational signatures of AFB1 exposure from a total of >40,000 mutations in four experimental systems: two different human cell lines, liver tumors in wild-type mice, and mice that carried a hepatitis B surface antigen transgene (the latter to model the multiplicative effects of aflatoxin exposure and hepatitis B in causing HCC). AFB1 mutational signatures from all four experimental systems were remarkably similar. We integrated the experimental mutational signatures with data from newly sequenced HCCs from Qidong County, China, a region of well-studied aflatoxin exposure. This indicated that COSMIC mutational signature 24, previously hypothesized to stem from aflatoxin exposure, indeed likely represents AFB1 exposure, possibly combined with other exposures. Among published somatic mutation data, we found evidence of AFB1 exposure in 0.7% of HCCs treated in North America, 1% of HCCs from Japan, but 16% of HCCs from Hong Kong. Thus, aflatoxin exposure apparently remains a substantial public health issue in some areas. This aspect of our study exemplifies the promise of future widespread resequencing of tumor genomes in providing new insights into the contribution of mutagenic exposures to cancer incidence.
Subjects
Aflatoxin B1/toxicity; Carcinogens/toxicity; DNA Mutational Analysis; Mutation/drug effects; Animals; Carcinoma, Hepatocellular/chemically induced; Carcinoma, Hepatocellular/genetics; Carcinoma, Hepatocellular/pathology; Cell Line, Tumor; China; Hepatitis B Surface Antigens/genetics; Humans; Liver Neoplasms/chemically induced; Liver Neoplasms/genetics; Liver Neoplasms/pathology; Mice; Mutation/genetics
ABSTRACT
BACKGROUND: Cancer genomes are peppered with somatic mutations imprinted by different mutational processes. The mutational pattern of a cancer genome can be used to identify and understand the etiology of the underlying mutational processes. A plethora of prior research has focused on examining mutational signatures and mutational patterns from single base substitutions and their immediate sequencing context. We recently demonstrated that further classification of small mutational events (including substitutions, insertions, deletions, and doublet substitutions) can be used to provide a deeper understanding of the mutational processes that have molded a cancer genome. However, there has been no standard tool that allows fast, accurate, and comprehensive classification for all types of small mutational events. RESULTS: Here, we present SigProfilerMatrixGenerator, a computational tool designed for optimized exploration and visualization of mutational patterns for all types of small mutational events. SigProfilerMatrixGenerator is written in Python with an R wrapper package provided for users that prefer working in an R environment. SigProfilerMatrixGenerator produces fourteen distinct matrices by considering transcriptional strand bias of individual events and by incorporating distinct classifications for single base substitutions, doublet base substitutions, and small insertions and deletions. While the tool provides a comprehensive classification of mutations, SigProfilerMatrixGenerator is also faster and more memory efficient than existing tools that generate only a single matrix. CONCLUSIONS: SigProfilerMatrixGenerator provides a standardized method for classifying small mutational events that is both efficient and scalable to large datasets. In addition to extending the classification of single base substitutions, the tool is the first to provide support for classifying doublet base substitutions and small insertions and deletions. 
SigProfilerMatrixGenerator is freely available at https://github.com/AlexandrovLab/SigProfilerMatrixGenerator with extensive documentation at https://osf.io/s93d5/wiki/home/ .
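To make the idea of classifying single base substitutions by their immediate sequence context concrete, here is a minimal sketch of the widely used 96-channel convention, in which each substitution is reported relative to the pyrimidine of the mutated base pair. It is an illustration only and does not use or reproduce SigProfilerMatrixGenerator's own code or API; the names `sbs96_channel`, `count_channels` and the example mutations are hypothetical.

```python
from collections import Counter
from itertools import product

BASES = "ACGT"
COMPLEMENT = {"A": "T", "C": "G", "G": "C", "T": "A"}

# All 96 channels: 2 pyrimidine references x 3 alternatives x 16 flanking pairs.
ALL_CHANNELS = [
    f"{five}[{ref}>{alt}]{three}"
    for ref in "CT"
    for alt in BASES
    if alt != ref
    for five, three in product(BASES, repeat=2)
]


def sbs96_channel(five_prime, ref, alt, three_prime):
    """Return a label such as 'A[C>T]G' for one substitution and its flanks."""
    if ref == alt:
        raise ValueError("reference and alternative bases must differ")
    if ref in "AG":
        # Purine reference: describe the event on the opposite strand,
        # which complements the bases and swaps the two flanks.
        ref, alt = COMPLEMENT[ref], COMPLEMENT[alt]
        five_prime, three_prime = COMPLEMENT[three_prime], COMPLEMENT[five_prime]
    return f"{five_prime}[{ref}>{alt}]{three_prime}"


def count_channels(mutations):
    """Turn (5' base, ref, alt, 3' base) tuples into a 96-entry count vector."""
    counts = Counter(sbs96_channel(*m) for m in mutations)
    return [counts[channel] for channel in ALL_CHANNELS]


# Hypothetical mutations; the first two describe the same channel
# seen from opposite strands.
mutations = [("T", "G", "A", "C"), ("G", "C", "T", "A"), ("A", "T", "C", "G")]
row = count_channels(mutations)
```

One such vector per sample, stacked column-wise, gives a mutational matrix of the kind the abstract describes; the tool itself also covers strand bias, doublet substitutions and indels, which this sketch omits.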
Subjects
Mutation; Neoplasms/genetics; Software; Genomics/methods; Humans; INDEL Mutation
ABSTRACT
Introduction: The comorbidity of optic neuritis with multiple sclerosis has been well recognized. However, the causal association between multiple sclerosis and optic neuritis, as well as other eye disorders, remains incompletely understood. To address these gaps, we investigated the genetic relationship between multiple sclerosis and eye disorders, and explored potential drugs. Methods: To elucidate the genetic susceptibility and causal links between multiple sclerosis and eye disorders, we performed two-sample Mendelian randomization analyses. Additionally, causal single-nucleotide polymorphisms were annotated and searched against expression quantitative trait loci data. Pathway enrichment analysis was performed to identify the possible mechanisms responsible for the eye disorders coexisting with multiple sclerosis. Potential therapeutic chemicals were also explored using Cytoscape. Results: Mendelian randomization analysis revealed that multiple sclerosis increased the incidence of optic neuritis while reducing the likelihood of concurrent cataract and macular degeneration. Gene Ontology enrichment analysis implicated lymphocyte proliferation, activation and antigen processing as potential contributors to the pathogenesis of eye disorders coexisting with multiple sclerosis. Furthermore, pharmaceutical agents traditionally employed for allograft rejection exhibited promising therapeutic potential for the eye disorders coexisting with multiple sclerosis. Discussion: Multiple sclerosis genetically contributes to the development of optic neuritis while mitigating the concurrent occurrence of cataract and macular degeneration. Further research is needed to validate these findings and explore additional mechanisms underlying the comorbidity of multiple sclerosis and eye disorders.
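For readers unfamiliar with two-sample Mendelian randomization, the sketch below shows the fixed-effect inverse-variance-weighted (IVW) estimator, a common way to combine per-variant effects into one causal estimate. The abstract does not state which estimator the authors used, so this is an assumption made for illustration, and the summary statistics are invented.

```python
import math


def ivw_estimate(beta_exposure, beta_outcome, se_outcome):
    """Fixed-effect IVW causal estimate from per-variant summary statistics.

    Each variant contributes a ratio beta_outcome / beta_exposure, weighted
    by beta_exposure**2 / se_outcome**2.
    """
    weights = [bx ** 2 / se ** 2 for bx, se in zip(beta_exposure, se_outcome)]
    ratios = [by / bx for bx, by in zip(beta_exposure, beta_outcome)]
    estimate = sum(w * r for w, r in zip(weights, ratios)) / sum(weights)
    std_error = math.sqrt(1.0 / sum(weights))
    return estimate, std_error


# Hypothetical summary statistics for three genetic variants:
# effects on the exposure, effects on the outcome, and the standard
# errors of the outcome effects.
beta_exposure = [0.1, 0.2, 0.3]
beta_outcome = [0.05, 0.10, 0.15]
se_outcome = [0.01, 0.02, 0.01]

estimate, std_error = ivw_estimate(beta_exposure, beta_outcome, se_outcome)
```

The estimate is on the scale of the outcome effects (for a binary outcome, typically a log odds ratio per unit of the exposure). In practice it would be checked against estimators that are more robust to pleiotropy.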
Subjects
Cataract; Macular Degeneration; Multiple Sclerosis; Optic Neuritis; Humans; Genetic Predisposition to Disease; Multiple Sclerosis/epidemiology; Multiple Sclerosis/genetics; Multiple Sclerosis/complications; Optic Neuritis/epidemiology; Optic Neuritis/genetics; Mendelian Randomization Analysis
ABSTRACT
Many traditional pharmacopeias include Aristolochia and related plants, which contain nephrotoxins and mutagens in the form of aristolochic acids and similar compounds (collectively, AA). AA is implicated in multiple cancer types, sometimes with very high mutational burdens, especially in upper tract urothelial cancers (UTUCs). AA-associated kidney failure and UTUCs are prevalent in Taiwan, but AA's role in hepatocellular carcinomas (HCCs) there remains unexplored. Therefore, we sequenced the whole exomes of 98 HCCs from two hospitals in Taiwan and found that 78% showed the distinctive mutational signature of AA exposure, accounting for most of the nonsilent mutations in known cancer driver genes. We then searched for the AA signature in 1400 HCCs from diverse geographic regions. Consistent with exposure through known herbal medicines, 47% of Chinese HCCs showed the signature, albeit with lower mutation loads than in Taiwan. In addition, 29% of HCCs from Southeast Asia showed the signature. The AA signature was also detected in 13 and 2.7% of HCCs from Korea and Japan as well as in 4.8 and 1.7% of HCCs from North America and Europe, respectively, excluding one U.S. hospital where 22% of 87 "Asian" HCCs had the signature. Thus, AA exposure is geographically widespread. Asia, especially Taiwan, appears to be much more extensively affected, which is consistent with other evidence of patterns of AA exposure. We propose that additional measures aimed at primary prevention through avoidance of AA exposure and investigation of possible approaches to secondary prevention are warranted.
Subjects
Aristolochic Acids/therapeutic use; Liver Neoplasms/drug therapy; Asia; Geography; Humans; Immunotherapy; Liver Neoplasms/genetics; Mutagenesis/genetics; Mutation/genetics; Taiwan
ABSTRACT
Cholangiocarcinoma (CCA) is a hepatobiliary malignancy exhibiting high incidence in countries with endemic liver-fluke infection. We analyzed 489 CCAs from 10 countries, combining whole-genome (71 cases), targeted/exome, copy-number, gene expression, and DNA methylation information. Integrative clustering defined 4 CCA clusters: fluke-positive CCAs (clusters 1/2) are enriched in ERBB2 amplifications and TP53 mutations; conversely, fluke-negative CCAs (clusters 3/4) exhibit high copy-number alterations and PD-1/PD-L2 expression, or epigenetic mutations (IDH1/2, BAP1) and FGFR/PRKA-related gene rearrangements. Whole-genome analysis highlighted FGFR2 3' untranslated region deletion as a mechanism of FGFR2 upregulation. Integration of noncoding promoter mutations with protein-DNA binding profiles demonstrates pervasive modulation of H3K27me3-associated sites in CCA. Clusters 1 and 4 exhibit distinct DNA hypermethylation patterns targeting either CpG islands or shores; mutation signature and subclonality analysis suggests that these reflect different mutational pathways. Our results exemplify how genetics, epigenetics, and environmental carcinogens can interplay across different geographies to generate distinct molecular subtypes of cancer. Significance: Integrated whole-genome and epigenomic analysis of CCA on an international scale identifies new CCA driver genes, noncoding promoter mutations, and structural variants. CCA molecular landscapes differ radically by etiology, underscoring how distinct cancer subtypes in the same organ may arise through different extrinsic and intrinsic carcinogenic processes. Cancer Discov; 7(10); 1116-35. ©2017 AACR. This article is highlighted in the In This Issue feature, p. 1047.
Subjects
Bile Duct Neoplasms/genetics; Cholangiocarcinoma/genetics; Epigenomics/methods; Genome-Wide Association Study/methods; CpG Islands; DNA Methylation; Gene Expression Regulation, Neoplastic; Gene Regulatory Networks; Humans; Receptor, ErbB-2/genetics; Receptor, Fibroblast Growth Factor, Type 2/genetics; Tumor Suppressor Protein p53/genetics
ABSTRACT
Microsatellite instability (MSI) is a form of hypermutation that occurs in some tumors due to defects in cellular DNA mismatch repair. MSI is characterized by frequent somatic mutations (i.e., cancer-specific mutations) that change the length of simple repeats (e.g., AAAAA..., GATAGATAGATA...). Clinical MSI tests evaluate the lengths of a handful of simple repeat sites, while next-generation sequencing can assay many more sites and offers a much more complete view of their somatic mutation frequencies. Using somatic mutation data from the exomes of a 361-tumor training set, we developed classifiers to determine MSI status based on four machine-learning frameworks. All frameworks had high accuracy, and after choosing one we determined that it had >98% concordance with clinical tests in a separate 163-tumor test set. Furthermore, this classifier retained high concordance even when classifying tumors based on subsets of whole-exome data. We have released a CRAN R package, MSIseq, based on this classifier. MSIseq is faster and simpler to use than software that requires large files of aligned sequenced reads. MSIseq will be useful for genomic studies in which clinical MSI test results are unavailable and for detecting possible misclassifications by clinical tests.
Subjects
Algorithms; DNA Mutational Analysis/methods; Databases, Genetic; Microsatellite Instability; Sequence Analysis, DNA/methods; Software; Base Sequence; Data Mining/methods; Database Management Systems; Humans; Machine Learning; Molecular Sequence Data; Pattern Recognition, Automated/methods; Reproducibility of Results; Sensitivity and Specificity
ABSTRACT
BACKGROUND: Aristolochic acid (AA) is a natural compound found in many plants of the Aristolochia genus, and these plants are widely used in traditional medicines for numerous conditions and for weight loss. Previous work has connected AA-mutagenesis to upper-tract urothelial cell carcinomas and hepatocellular carcinomas. We hypothesize that AA may also contribute to bladder cancer. METHODS: Here, we investigated the involvement of AA-mutagenesis in bladder cancer by sequencing bladder tumor genomes from two patients with known exposure to AA. After detecting strong mutational signatures of AA exposure in these tumors, we exome-sequenced and analyzed an additional 11 bladder tumors and analyzed publicly available somatic mutation data from a further 336 bladder tumors. RESULTS: The somatic mutations in the bladder tumors from the two patients with known AA exposure showed overwhelming AA signatures. We also detected evidence of AA exposure in 1 out of 11 bladder tumors from Singapore and in 3 out of 99 bladder tumors from China. In addition, 1 out of 194 bladder tumors from North America showed a pattern of mutations that might have resulted from exposure to an unknown mutagen with a heretofore undescribed pattern of A > T mutations. Besides the signature of AA exposure, the bladder tumors also showed the CpG > TpG and activated-APOBEC signatures, which have been previously reported in bladder cancer. CONCLUSIONS: This study demonstrates the utility of inferring mutagenic exposures from somatic mutation spectra. Moreover, AA exposure in bladder cancer appears to be more pervasive in the East, where traditional herbal medicine is more widely used. More broadly, our results suggest that AA exposure is more extensive than previously thought both in terms of populations at risk and in terms of types of cancers involved. 
This appears to be an important public health issue that should be addressed by further investigation and by primary prevention through regulation and education. In addition to opportunities for primary prevention, knowledge of AA exposure would provide opportunities for secondary prevention in the form of intensified screening of patients with known or suspected AA exposure.