Your browser doesn't support javascript.
loading
Mostrar: 20 | 50 | 100
Resultados 1 - 18 de 18
Filtrar
1.
Nature ; 629(8013): 910-918, 2024 May.
Artigo em Inglês | MEDLINE | ID: mdl-38693263

RESUMO

International differences in the incidence of many cancer types indicate the existence of carcinogen exposures that have not yet been identified by conventional epidemiology make a substantial contribution to cancer burden1. In clear cell renal cell carcinoma, obesity, hypertension and tobacco smoking are risk factors, but they do not explain the geographical variation in its incidence2. Underlying causes can be inferred by sequencing the genomes of cancers from populations with different incidence rates and detecting differences in patterns of somatic mutations. Here we sequenced 962 clear cell renal cell carcinomas from 11 countries with varying incidence. The somatic mutation profiles differed between countries. In Romania, Serbia and Thailand, mutational signatures characteristic of aristolochic acid compounds were present in most cases, but these were rare elsewhere. In Japan, a mutational signature of unknown cause was found in more than 70% of cases but in less than 2% elsewhere. A further mutational signature of unknown cause was ubiquitous but exhibited higher mutation loads in countries with higher incidence rates of kidney cancer. Known signatures of tobacco smoking correlated with tobacco consumption, but no signature was associated with obesity or hypertension, suggesting that non-mutagenic mechanisms of action underlie these risk factors. The results of this study indicate the existence of multiple, geographically variable, mutagenic exposures that potentially affect tens of millions of people and illustrate the opportunities for new insights into cancer causation through large-scale global cancer genomics.


Assuntos
Carcinoma de Células Renais , Genoma Humano , Neoplasias Renais , Mutação , Humanos , Neoplasias Renais/genética , Neoplasias Renais/epidemiologia , Neoplasias Renais/induzido quimicamente , Carcinoma de Células Renais/genética , Carcinoma de Células Renais/epidemiologia , Carcinoma de Células Renais/induzido quimicamente , Genoma Humano/genética , Ácidos Aristolóquicos/efeitos adversos , Ácidos Aristolóquicos/toxicidade , Incidência , Tailândia/epidemiologia , Japão/epidemiologia , Mutagênicos/efeitos adversos , Geografia , Fatores de Risco , Romênia/epidemiologia , Obesidade/genética , Obesidade/epidemiologia , Masculino , Hipertensão/genética , Hipertensão/epidemiologia , Fumar Tabaco/efeitos adversos , Fumar Tabaco/genética , Feminino
2.
medRxiv ; 2024 Apr 17.
Artigo em Inglês | MEDLINE | ID: mdl-38699364

RESUMO

Tobacco smoke, alone or combined with alcohol, is the predominant cause of head and neck cancer (HNC). Here, we further explore how tobacco exposure contributes to cancer development by mutational signature analysis of 265 whole-genome sequenced HNC from eight countries. Six tobacco-associated mutational signatures were detected, including some not previously reported. Differences in HNC incidence between countries corresponded with differences in mutation burdens of tobacco-associated signatures, consistent with the dominant role of tobacco in HNC causation. Differences were found in the burden of tobacco-associated signatures between anatomical subsites, suggesting that tissue-specific factors modulate mutagenesis. We identified an association between tobacco smoking and three additional alcohol-related signatures indicating synergism between the two exposures. Tobacco smoking was associated with differences in the mutational spectra and repertoire of driver mutations in cancer genes, and in patterns of copy number change. Together, the results demonstrate the multiple pathways by which tobacco smoke can influence the evolution of cancer cell clones.

3.
bioRxiv ; 2024 Apr 03.
Artigo em Inglês | MEDLINE | ID: mdl-38617360

RESUMO

APOBEC enzymes are part of the innate immunity and are responsible for restricting viruses and retroelements by deaminating cytosine residues1,2. Most solid tumors harbor different levels of somatic mutations attributed to the off-target activities of APOBEC3A (A3A) and/or APOBEC3B (A3B)3-6. However, how APOBEC3A/B enzymes shape the tumor evolution in the presence of exogenous mutagenic processes is largely unknown. Here, by combining deep whole-genome sequencing with multi-omics profiling of 309 lung cancers from smokers with detailed tobacco smoking information, we identify two subtypes defined by low (LAS) and high (HAS) APOBEC mutagenesis. LAS are enriched for A3B-like mutagenesis and KRAS mutations, whereas HAS for A3A-like mutagenesis and TP53 mutations. Unlike APOBEC3A, APOBEC3B expression is strongly associated with an upregulation of the base excision repair pathway. Hypermutation by unrepaired A3A and tobacco smoking mutagenesis combined with TP53-induced genomic instability can trigger senescence7, apoptosis8, and cell regeneration9, as indicated by high expression of pulmonary healing signaling pathway, stemness markers and distal cell-of-origin in HAS. The expected association of tobacco smoking variables (e.g., time to first cigarette) with genomic/epigenomic changes are not observed in HAS, a plausible consequence of frequent cell senescence or apoptosis. HAS have more neoantigens, slower clonal expansion, and older age at onset compared to LAS, particularly in heavy smokers, consistent with high proportions of newly generated, unmutated cells and frequent immuno-editing. These findings show how heterogeneity in mutational burden across co-occurring mutational processes and cell types contributes to tumor development, with important clinical implications.

4.
Nat Genet ; 55(10): 1721-1734, 2023 10.
Artigo em Inglês | MEDLINE | ID: mdl-37735199

RESUMO

The single-stranded DNA cytosine-to-uracil deaminase APOBEC3B is an antiviral protein implicated in cancer. However, its substrates in cells are not fully delineated. Here APOBEC3B proteomics reveal interactions with a surprising number of R-loop factors. Biochemical experiments show APOBEC3B binding to R-loops in cells and in vitro. Genetic experiments demonstrate R-loop increases in cells lacking APOBEC3B and decreases in cells overexpressing APOBEC3B. Genome-wide analyses show major changes in the overall landscape of physiological and stimulus-induced R-loops with thousands of differentially altered regions, as well as binding of APOBEC3B to many of these sites. APOBEC3 mutagenesis impacts genes overexpressed in tumors and splice factor mutant tumors preferentially, and APOBEC3-attributed kataegis are enriched in RTCW motifs consistent with APOBEC3B deamination. Taken together with the fact that APOBEC3B binds single-stranded DNA and RNA and preferentially deaminates DNA, these results support a mechanism in which APOBEC3B regulates R-loops and contributes to R-loop mutagenesis in cancer.


Assuntos
Neoplasias , Estruturas R-Loop , Humanos , DNA de Cadeia Simples/genética , Estudo de Associação Genômica Ampla , Mutagênese , Neoplasias/genética , Neoplasias/patologia , Citidina Desaminase/genética , Antígenos de Histocompatibilidade Menor/genética , Antígenos de Histocompatibilidade Menor/metabolismo
5.
BMC Genomics ; 24(1): 469, 2023 Aug 21.
Artigo em Inglês | MEDLINE | ID: mdl-37605126

RESUMO

BACKGROUND: All cancers harbor somatic mutations in their genomes. In principle, mutations affecting between one and fifty base pairs are generally classified as small mutational events. Conversely, large mutational events affect more than fifty base pairs, and, in most cases, they encompass copy-number and structural variants affecting many thousands of base pairs. Prior studies have demonstrated that examining patterns of somatic mutations can be leveraged to provide both biological and clinical insights, thus, resulting in an extensive repertoire of tools for evaluating small mutational events. Recently, classification schemas for examining large-scale mutational events have emerged and shown their utility across the spectrum of human cancers. However, there has been no computationally efficient bioinformatics tool that allows visualizing and exploring these large-scale mutational events. RESULTS: Here, we present a new version of SigProfilerMatrixGenerator that now delivers integrated capabilities for examining large mutational events. The tool provides support for examining copy-number variants and structural variants under two previously developed classification schemas and it supports data from numerous algorithms and data modalities. SigProfilerMatrixGenerator is written in Python with an R wrapper package provided for users that prefer working in an R environment. CONCLUSIONS: The new version of SigProfilerMatrixGenerator provides the first standardized bioinformatics tool for optimized exploration and visualization of two previously developed classification schemas for copy number and structural variants. The tool is freely available at https://github.com/AlexandrovLab/SigProfilerMatrixGenerator with an extensive documentation at https://osf.io/s93d5/wiki/home/ .


Assuntos
Algoritmos , Biologia Computacional , Humanos , Mutação
6.
Cell Rep ; 42(8): 112930, 2023 08 29.
Artigo em Inglês | MEDLINE | ID: mdl-37540596

RESUMO

The somatic mutations found in a cancer genome are imprinted by different mutational processes. Each process exhibits a characteristic mutational signature, which can be affected by the genome architecture. However, the interplay between mutational signatures and topographical genomic features has not been extensively explored. Here, we integrate mutations from 5,120 whole-genome-sequenced tumors from 40 cancer types with 516 topographical features from ENCODE to evaluate the effect of nucleosome occupancy, histone modifications, CTCF binding, replication timing, and transcription/replication strand asymmetries on the cancer-specific accumulation of mutations from distinct mutagenic processes. Most mutational signatures are affected by topographical features, with signatures of related etiologies being similarly affected. Certain signatures exhibit periodic behaviors or cancer-type-specific enrichments/depletions near topographical features, revealing further information about the processes that imprinted them. Our findings, disseminated via the COSMIC (Catalog of Somatic Mutations in Cancer) signatures database, provide a comprehensive online resource for exploring the interactions between mutational signatures and topographical features across human cancer.


Assuntos
Neoplasias , Humanos , Mutação/genética , Neoplasias/genética , Genômica , Sequência de Bases , Genoma Humano
7.
Nat Genet ; 55(4): 607-618, 2023 04.
Artigo em Inglês | MEDLINE | ID: mdl-36928603

RESUMO

Malignant pleural mesothelioma (MPM) is an aggressive cancer with rising incidence and challenging clinical management. Through a large series of whole-genome sequencing data, integrated with transcriptomic and epigenomic data using multiomics factor analysis, we demonstrate that the current World Health Organization classification only accounts for up to 10% of interpatient molecular differences. Instead, the MESOMICS project paves the way for a morphomolecular classification of MPM based on four dimensions: ploidy, tumor cell morphology, adaptive immune response and CpG island methylator profile. We show that these four dimensions are complementary, capture major interpatient molecular differences and are delimited by extreme phenotypes that-in the case of the interdependent tumor cell morphology and adapted immune response-reflect tumor specialization. These findings unearth the interplay between MPM functional biology and its genomic history, and provide insights into the variations observed in the clinical behavior of patients with MPM.


Assuntos
Neoplasias Pulmonares , Mesotelioma Maligno , Mesotelioma , Neoplasias Pleurais , Humanos , Mesotelioma Maligno/genética , Mesotelioma Maligno/complicações , Mesotelioma/genética , Mesotelioma/patologia , Multiômica , Neoplasias Pleurais/genética , Neoplasias Pleurais/patologia , Neoplasias Pulmonares/patologia , Biomarcadores Tumorais/genética
8.
bioRxiv ; 2023 Feb 04.
Artigo em Inglês | MEDLINE | ID: mdl-36778452

RESUMO

Background: All cancers harbor somatic mutations in their genomes. In principle, mutations affecting between one and fifty base pairs are generally classified as small mutational events. Conversely, large mutational events affect more than fifty base pairs, and, in most cases, they encompass copy-number and structural variants affecting many thousands of base pairs. Prior studies have demonstrated that examining patterns of somatic mutations can be leveraged to provide both biological and clinical insights, thus, resulting in an extensive repertoire of tools for evaluating small mutational events. Recently, classification schemas for examining large-scale mutational events have emerged and shown their utility across the spectrum of human cancers. However, there has been no standard bioinformatics tool that allows visualizing and exploring these large-scale mutational events. Results: Here, we present a new version of SigProfilerMatrixGenerator that now delivers integrated capabilities for examining large mutational events. The tool provides support for examining copy-number variants and structural variants under two previously developed classification schemas and it supports data from numerous algorithms and data modalities. SigProfilerMatrixGenerator is written in Python with an R wrapper package provided for users that prefer working in an R environment. Conclusions: The new version of SigProfilerMatrixGenerator provides the first standardized bioinformatics tool for optimized exploration and visualization of two previously developed classification schemas for copy number and structural variants. The tool is freely available at https://github.com/AlexandrovLab/SigProfilerMatrixGenerator with an extensive documentation at https://osf.io/s93d5/wiki/home/ .

10.
Cell Genom ; 2(11): None, 2022 Nov 09.
Artigo em Inglês | MEDLINE | ID: mdl-36388765

RESUMO

Mutational signature analysis is commonly performed in cancer genomic studies. Here, we present SigProfilerExtractor, an automated tool for de novo extraction of mutational signatures, and benchmark it against another 13 bioinformatics tools by using 34 scenarios encompassing 2,500 simulated signatures found in 60,000 synthetic genomes and 20,000 synthetic exomes. For simulations with 5% noise, reflecting high-quality datasets, SigProfilerExtractor outperforms other approaches by elucidating between 20% and 50% more true-positive signatures while yielding 5-fold less false-positive signatures. Applying SigProfilerExtractor to 4,643 whole-genome- and 19,184 whole-exome-sequenced cancers reveals four novel signatures. Two of the signatures are confirmed in independent cohorts, and one of these signatures is associated with tobacco smoking. In summary, this report provides a reference tool for analysis of mutational signatures, a comprehensive benchmarking of bioinformatics tools for extracting signatures, and several novel mutational signatures, including one putatively attributed to direct tobacco smoking mutagenesis in bladder tissues.

11.
Nature ; 607(7920): 799-807, 2022 07.
Artigo em Inglês | MEDLINE | ID: mdl-35859169

RESUMO

The APOBEC3 family of cytosine deaminases has been implicated in some of the most prevalent mutational signatures in cancer1-3. However, a causal link between endogenous APOBEC3 enzymes and mutational signatures in human cancer genomes has not been established, leaving the mechanisms of APOBEC3 mutagenesis poorly understood. Here, to investigate the mechanisms of APOBEC3 mutagenesis, we deleted implicated genes from human cancer cell lines that naturally generate APOBEC3-associated mutational signatures over time4. Analysis of non-clustered and clustered signatures across whole-genome sequences from 251 breast, bladder and lymphoma cancer cell line clones revealed that APOBEC3A deletion diminished APOBEC3-associated mutational signatures. Deletion of both APOBEC3A and APOBEC3B further decreased APOBEC3 mutation burdens, without eliminating them. Deletion of APOBEC3B increased APOBEC3A protein levels, activity and APOBEC3A-mediated mutagenesis in some cell lines. The uracil glycosylase UNG was required for APOBEC3-mediated transversions, whereas the loss of the translesion polymerase REV1 decreased overall mutation burdens. Together, these data represent direct evidence that endogenous APOBEC3 deaminases generate prevalent mutational signatures in human cancer cells. Our results identify APOBEC3A as the main driver of these mutations, indicate that APOBEC3B can restrain APOBEC3A-dependent mutagenesis while contributing its own smaller mutation burdens and dissect mechanisms that translate APOBEC3 activities into distinct mutational signatures.


Assuntos
Desaminases APOBEC , Mutagênese , Neoplasias , Desaminases APOBEC/deficiência , Desaminases APOBEC/genética , Desaminases APOBEC/metabolismo , Linhagem Celular Tumoral , DNA Polimerase Dirigida por DNA/metabolismo , Deleção de Genes , Genoma Humano , Humanos , Mutagênese/genética , Neoplasias/enzimologia , Neoplasias/genética , Neoplasias/patologia , Uracila-DNA Glicosidase/metabolismo
12.
Bioinformatics ; 38(13): 3470-3473, 2022 06 27.
Artigo em Inglês | MEDLINE | ID: mdl-35595234

RESUMO

MOTIVATION: Clustered mutations are found in the human germline as well as in the genomes of cancer and normal somatic cells. Clustered events can be imprinted by a multitude of mutational processes, and they have been implicated in both cancer evolution and development disorders. Existing tools for identifying clustered mutations have been optimized for a particular subtype of clustered event and, in most cases, relied on a predefined inter-mutational distance (IMD) cutoff combined with a piecewise linear regression analysis. RESULTS: Here, we present SigProfilerClusters, an automated tool for detecting all types of clustered mutations by calculating a sample-dependent IMD threshold using a simulated background model that takes into account extended sequence context, transcriptional strand asymmetries and regional mutation densities. SigProfilerClusters disentangles all types of clustered events from non-clustered mutations and annotates each clustered event into an established subclass, including the widely used classes of doublet-base substitutions, multi-base substitutions, omikli and kataegis. SigProfilerClusters outputs non-clustered mutations and clustered events using standard data formats as well as provides multiple visualizations for exploring the distributions and patterns of clustered mutations across the genome. AVAILABILITY AND IMPLEMENTATION: SigProfilerClusters is supported across most operating systems and made freely available at https://github.com/AlexandrovLab/SigProfilerClusters with an extensive documentation located at https://osf.io/qpmzw/wiki/home/. SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online.


Assuntos
Neoplasias , Software , Humanos , Mutação , Neoplasias/genética , Genoma
13.
Nature ; 602(7897): 510-517, 2022 02.
Artigo em Inglês | MEDLINE | ID: mdl-35140399

RESUMO

Clustered somatic mutations are common in cancer genomes and previous analyses reveal several types of clustered single-base substitutions, which include doublet- and multi-base substitutions1-5, diffuse hypermutation termed omikli6, and longer strand-coordinated events termed kataegis3,7-9. Here we provide a comprehensive characterization of clustered substitutions and clustered small insertions and deletions (indels) across 2,583 whole-genome-sequenced cancers from 30 types of cancer10. Clustered mutations were highly enriched in driver genes and associated with differential gene expression and changes in overall survival. Several distinct mutational processes gave rise to clustered indels, including signatures that were enriched in tobacco smokers and homologous-recombination-deficient cancers. Doublet-base substitutions were caused by at least 12 mutational processes, whereas most multi-base substitutions were generated by either tobacco smoking or exposure to ultraviolet light. Omikli events, which have previously been attributed to APOBEC3 activity6, accounted for a large proportion of clustered substitutions; however, only 16.2% of omikli matched APOBEC3 patterns. Kataegis was generated by multiple mutational processes, and 76.1% of all kataegic events exhibited mutational patterns that are associated with the activation-induced deaminase (AID) and APOBEC3 family of deaminases. Co-occurrence of APOBEC3 kataegis and extrachromosomal DNA (ecDNA), termed kyklonas (Greek for cyclone), was found in 31% of samples with ecDNA. Multiple distinct kyklonic events were observed on most mutated ecDNA. ecDNA containing known cancer genes exhibited both positive selection and kyklonic hypermutation. Our results reveal the diversity of clustered mutational processes in human cancer and the role of APOBEC3 in recurrently mutating and fuelling the evolution of ecDNA.


Assuntos
Neoplasias , Desaminases APOBEC/genética , Genoma , Humanos , Mutação INDEL , Mutagênese/genética , Mutação , Neoplasias/genética
14.
Nat Genet ; 53(11): 1553-1563, 2021 11.
Artigo em Inglês | MEDLINE | ID: mdl-34663923

RESUMO

Esophageal squamous cell carcinoma (ESCC) shows remarkable variation in incidence that is not fully explained by known lifestyle and environmental risk factors. It has been speculated that an unknown exogenous exposure(s) could be responsible. Here we combine the fields of mutational signature analysis with cancer epidemiology to study 552 ESCC genomes from eight countries with varying incidence rates. Mutational profiles were similar across all countries studied. Associations between specific mutational signatures and ESCC risk factors were identified for tobacco, alcohol, opium and germline variants, with modest impacts on mutation burden. We find no evidence of a mutational signature indicative of an exogenous exposure capable of explaining differences in ESCC incidence. Apolipoprotein B mRNA-editing enzyme, catalytic polypeptide-like (APOBEC)-associated mutational signatures single-base substitution (SBS)2 and SBS13 were present in 88% and 91% of cases, respectively, and accounted for 25% of the mutation burden on average, indicating that APOBEC activation is a crucial step in ESCC tumor development.


Assuntos
Neoplasias Esofágicas/epidemiologia , Neoplasias Esofágicas/genética , Carcinoma de Células Escamosas do Esôfago/epidemiologia , Carcinoma de Células Escamosas do Esôfago/genética , Mutação , Desaminases APOBEC/genética , Adulto , Idoso , Idoso de 80 Anos ou mais , Aldeído-Desidrogenase Mitocondrial/genética , Brasil/epidemiologia , China/epidemiologia , Feminino , Humanos , Incidência , Irã (Geográfico)/epidemiologia , Masculino , Pessoa de Meia-Idade , Proteína Supressora de Tumor p53/genética , Reino Unido/epidemiologia , Sequenciamento Completo do Genoma
16.
BMC Bioinformatics ; 21(1): 438, 2020 Oct 07.
Artigo em Inglês | MEDLINE | ID: mdl-33028213

RESUMO

BACKGROUND: Performing a statistical test requires a null hypothesis. In cancer genomics, a key challenge is the fast generation of accurate somatic mutational landscapes that can be used as a realistic null hypothesis for making biological discoveries. RESULTS: Here we present SigProfilerSimulator, a powerful tool that is capable of simulating the mutational landscapes of thousands of cancer genomes at different resolutions within seconds. Applying SigProfilerSimulator to 2144 whole-genome sequenced cancers reveals: (i) that most doublet base substitutions are not due to two adjacent single base substitutions but likely occur as single genomic events; (ii) that an extended sequencing context of ± 2 bp is required to more completely capture the patterns of substitution mutational signatures in human cancer; (iii) information on false-positive discovery rate of commonly used bioinformatics tools for detecting driver genes. CONCLUSIONS: SigProfilerSimulator's breadth of features allows one to construct a tailored null hypothesis and use it for evaluating the accuracy of other bioinformatics tools or for downstream statistical analysis for biological discoveries. SigProfilerSimulator is freely available at https://github.com/AlexandrovLab/SigProfilerSimulator with an extensive documentation at https://osf.io/usxjz/wiki/home/ .


Assuntos
Neoplasias/patologia , Interface Usuário-Computador , Mapeamento Cromossômico , Genômica/métodos , Humanos , Melanoma/genética , Melanoma/patologia , Mutação , Neoplasias/genética , Sequenciamento Completo do Genoma
17.
Nature ; 578(7793): 94-101, 2020 02.
Artigo em Inglês | MEDLINE | ID: mdl-32025018

RESUMO

Somatic mutations in cancer genomes are caused by multiple mutational processes, each of which generates a characteristic mutational signature1. Here, as part of the Pan-Cancer Analysis of Whole Genomes (PCAWG) Consortium2 of the International Cancer Genome Consortium (ICGC) and The Cancer Genome Atlas (TCGA), we characterized mutational signatures using 84,729,690 somatic mutations from 4,645 whole-genome and 19,184 exome sequences that encompass most types of cancer. We identified 49 single-base-substitution, 11 doublet-base-substitution, 4 clustered-base-substitution and 17 small insertion-and-deletion signatures. The substantial size of our dataset, compared with previous analyses3-15, enabled the discovery of new signatures, the separation of overlapping signatures and the decomposition of signatures into components that may represent associated-but distinct-DNA damage, repair and/or replication mechanisms. By estimating the contribution of each signature to the mutational catalogues of individual cancer genomes, we revealed associations of signatures to exogenous or endogenous exposures, as well as to defective DNA-maintenance processes. However, many signatures are of unknown cause. This analysis provides a systematic perspective on the repertoire of mutational processes that contribute to the development of human cancer.


Assuntos
Mutação/genética , Neoplasias/genética , Fatores Etários , Sequência de Bases , Exoma/genética , Genoma Humano/genética , Humanos , Análise de Sequência de DNA
18.
BMC Genomics ; 20(1): 685, 2019 Aug 30.
Artigo em Inglês | MEDLINE | ID: mdl-31470794

RESUMO

BACKGROUND: Cancer genomes are peppered with somatic mutations imprinted by different mutational processes. The mutational pattern of a cancer genome can be used to identify and understand the etiology of the underlying mutational processes. A plethora of prior research has focused on examining mutational signatures and mutational patterns from single base substitutions and their immediate sequencing context. We recently demonstrated that further classification of small mutational events (including substitutions, insertions, deletions, and doublet substitutions) can be used to provide a deeper understanding of the mutational processes that have molded a cancer genome. However, there has been no standard tool that allows fast, accurate, and comprehensive classification for all types of small mutational events. RESULTS: Here, we present SigProfilerMatrixGenerator, a computational tool designed for optimized exploration and visualization of mutational patterns for all types of small mutational events. SigProfilerMatrixGenerator is written in Python with an R wrapper package provided for users that prefer working in an R environment. SigProfilerMatrixGenerator produces fourteen distinct matrices by considering transcriptional strand bias of individual events and by incorporating distinct classifications for single base substitutions, doublet base substitutions, and small insertions and deletions. While the tool provides a comprehensive classification of mutations, SigProfilerMatrixGenerator is also faster and more memory efficient than existing tools that generate only a single matrix. CONCLUSIONS: SigProfilerMatrixGenerator provides a standardized method for classifying small mutational events that is both efficient and scalable to large datasets. In addition to extending the classification of single base substitutions, the tool is the first to provide support for classifying doublet base substitutions and small insertions and deletions. SigProfilerMatrixGenerator is freely available at https://github.com/AlexandrovLab/SigProfilerMatrixGenerator with an extensive documentation at https://osf.io/s93d5/wiki/home/ .


Assuntos
Mutação , Neoplasias/genética , Software , Genômica/métodos , Humanos , Mutação INDEL
SELEÇÃO DE REFERÊNCIAS
DETALHE DA PESQUISA