Your browser doesn't support javascript.
loading
Mostrar: 20 | 50 | 100
Resultados 1 - 20 de 29
Filtrar
1.
Cell ; 165(2): 357-71, 2016 Apr 07.
Artigo em Inglês | MEDLINE | ID: mdl-27058666

RESUMO

We report a mechanism through which the transcription machinery directly controls topoisomerase 1 (TOP1) activity to adjust DNA topology throughout the transcription cycle. By comparing TOP1 occupancy using chromatin immunoprecipitation sequencing (ChIP-seq) versus TOP1 activity using topoisomerase 1 sequencing (TOP1-seq), a method reported here to map catalytically engaged TOP1, TOP1 bound at promoters was discovered to become fully active only after pause-release. This transition coupled the phosphorylation of the carboxyl-terminal-domain (CTD) of RNA polymerase II (RNAPII) with stimulation of TOP1 above its basal rate, enhancing its processivity. TOP1 stimulation is strongly dependent on the kinase activity of BRD4, a protein that phosphorylates Ser2-CTD and regulates RNAPII pause-release. Thus the coordinated action of BRD4 and TOP1 overcame the torsional stress opposing transcription as RNAPII commenced elongation but preserved negative supercoiling that assists promoter melting at start sites. This nexus between transcription and DNA topology promises to elicit new strategies to intercept pathological gene expression.


Assuntos
DNA Topoisomerases Tipo I/metabolismo , DNA/metabolismo , RNA Polimerase II/metabolismo , Transcrição Gênica , DNA/química , DNA Topoisomerases Tipo I/genética , Técnicas de Silenciamento de Genes , Humanos , Regiões Promotoras Genéticas , RNA Polimerase II/química , RNA Polimerase II/isolamento & purificação , Elongação da Transcrição Genética , Fatores de Transcrição/isolamento & purificação , Sítio de Iniciação de Transcrição
2.
Cell ; 153(5): 988-99, 2013 May 23.
Artigo em Inglês | MEDLINE | ID: mdl-23706737

RESUMO

Lymphocyte activation is initiated by a global increase in messenger RNA synthesis. However, the mechanisms driving transcriptome amplification during the immune response are unknown. By monitoring single-stranded DNA genome wide, we show that the genome of naive cells is poised for rapid activation. In G0, ∼90% of promoters from genes to be expressed in cycling lymphocytes are polymerase loaded but unmelted and support only basal transcription. Furthermore, the transition from abortive to productive elongation is kinetically limiting, causing polymerases to accumulate nearer to transcription start sites. Resting lymphocytes also limit the expression of the transcription factor IIH complex, including XPB and XPD helicases involved in promoter melting and open complex extension. To date, two rate-limiting steps have been shown to control global gene expression in eukaryotes: preinitiation complex assembly and polymerase pausing. Our studies identify promoter melting as a third key regulatory step and propose that this mechanism ensures a prompt lymphocyte response to invading pathogens.


Assuntos
Linfócitos B/metabolismo , Regulação da Expressão Gênica , Ativação Linfocitária , Linfócitos/metabolismo , Regiões Promotoras Genéticas , Animais , Linfócitos B/imunologia , Linhagem Celular Tumoral , DNA de Cadeia Simples/metabolismo , Elementos Facilitadores Genéticos , Estudo de Associação Genômica Ampla , Humanos , Linfócitos/citologia , Linfócitos/imunologia , Camundongos , Fator de Transcrição TFIIH/metabolismo , Transcrição Gênica
3.
Bioinformatics ; 34(2): 330-337, 2018 Jan 15.
Artigo em Inglês | MEDLINE | ID: mdl-29028923

RESUMO

MOTIVATION: Cancers arise as the result of somatically acquired changes in the DNA of cancer cells. However, in addition to the mutations that confer a growth advantage, cancer genomes accumulate a large number of somatic mutations resulting from normal DNA damage and repair processes as well as carcinogenic exposures or cancer related aberrations of DNA maintenance machinery. These mutagenic processes often produce characteristic mutational patterns called mutational signatures. The decomposition of a cancer genome's mutation catalog into mutations consistent with such signatures can provide valuable information about cancer etiology. However, the results from different decomposition methods are not always consistent. Hence, one needs to be able to not only decompose a patient's mutational profile into signatures but also establish the accuracy of such decomposition. RESULTS: We proposed two complementary ways of measuring confidence and stability of decomposition results and applied them to analyze mutational signatures in breast cancer genomes. We identified both very stable and highly unstable signatures, as well as signatures that previously have not been associated with breast cancer. We also provided additional support for the novel signatures. Our results emphasize the importance of assessing the confidence and stability of inferred signature contributions. AVAILABILITY AND IMPLEMENTATION: All tools developed in this paper have been implemented in an R package, called SignatureEstimation, which is available from https://www.ncbi.nlm.nih.gov/CBBresearch/Przytycka/index.cgi\#signatureestimation. SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online.

4.
PLoS Comput Biol ; 13(10): e1005695, 2017 Oct.
Artigo em Inglês | MEDLINE | ID: mdl-29023534

RESUMO

The analysis of the mutational landscape of cancer, including mutual exclusivity and co-occurrence of mutations, has been instrumental in studying the disease. We hypothesized that exploring the interplay between co-occurrence, mutual exclusivity, and functional interactions between genes will further improve our understanding of the disease and help to uncover new relations between cancer driving genes and pathways. To this end, we designed a general framework, BeWith, for identifying modules with different combinations of mutation and interaction patterns. We focused on three different settings of the BeWith schema: (i) BeME-WithFun, in which the relations between modules are enriched with mutual exclusivity, while genes within each module are functionally related; (ii) BeME-WithCo, which combines mutual exclusivity between modules with co-occurrence within modules; and (iii) BeCo-WithMEFun, which ensures co-occurrence between modules, while the within module relations combine mutual exclusivity and functional interactions. We formulated the BeWith framework using Integer Linear Programming (ILP), enabling us to find optimally scoring sets of modules. Our results demonstrate the utility of BeWith in providing novel information about mutational patterns, driver genes, and pathways. In particular, BeME-WithFun helped identify functionally coherent modules that might be relevant for cancer progression. In addition to finding previously well-known drivers, the identified modules pointed to other novel findings such as the interaction between NCOR2 and NCOA3 in breast cancer. Additionally, an application of the BeME-WithCo setting revealed that gene groups differ with respect to their vulnerability to different mutagenic processes, and helped us to uncover pairs of genes with potentially synergistic effects, including a potential synergy between mutations in TP53 and the metastasis related DCC gene. Overall, BeWith not only helped us uncover relations between potential driver genes and pathways, but also provided additional insights on patterns of the mutational landscape, going beyond cancer driving mutations. Implementation is available at https://www.ncbi.nlm.nih.gov/CBBresearch/Przytycka/software/bewith.html.


Assuntos
Biologia Computacional/métodos , Redes Reguladoras de Genes/genética , Neoplasias/genética , Neoplasias/metabolismo , Algoritmos , Humanos , Peptídeos e Proteínas de Sinalização Intracelular/análise , Peptídeos e Proteínas de Sinalização Intracelular/genética , Peptídeos e Proteínas de Sinalização Intracelular/metabolismo , Mutação/genética
5.
PLoS Comput Biol ; 12(4): e1004821, 2016 Apr.
Artigo em Inglês | MEDLINE | ID: mdl-27078128

RESUMO

Recent genome-wide analyses have uncovered a high accumulation of RNA polymerase II (Pol II) at the 5' end of genes. This elevated Pol II presence at promoters, referred to here as Poll II poising, is mainly (but not exclusively) attributed to temporal pausing of transcription during early elongation which, in turn, has been proposed to be a regulatory step for processes that need to be activated "on demand". Yet, the full genome-wide regulatory role of Pol II poising is yet to be delineated. To elucidate the role of Pol II poising in B cell activation, we compared Pol II profiles in resting and activated B cells. We found that while Pol II poised genes generally overlap functionally among different B cell states and correspond to the functional groups previously identified for other cell types, non-poised genes are B cell state specific. Focusing on the changes in transcription activity upon B cell activation, we found that the majority of such changes were from poised to non-poised state. The genes showing this type of transition were functionally enriched in translation, RNA processing and mRNA metabolic process. Interestingly, we also observed a transition from non-poised to poised state. Within this set of genes we identified several Immediate Early Genes (IEG), which were highly expressed in resting B cell and shifted from non-poised to poised state after B cell activation. Thus Pol II poising does not only mark genes for rapid expression in the future, but it is also associated with genes that are silenced after a burst of their expression. Finally, we performed comparative analysis of the presence of G4 motifs in the context of poised versus non-poised but active genes. Interestingly we observed a differential enrichment of these motifs upstream versus downstream of TSS depending on poising status. The enrichment of G4 sequence motifs upstream of TSS of non-poised active genes suggests a potential role of quadruplexes in expression regulation.


Assuntos
Linfócitos B/enzimologia , RNA Polimerase II/metabolismo , Animais , Linfócitos B/citologia , Linfócitos B/imunologia , Biologia Computacional , Quadruplex G , Regulação da Expressão Gênica , Genes Precoces , Ativação Linfocitária , Camundongos , Camundongos Endogâmicos C57BL , Regiões Promotoras Genéticas , Elongação da Transcrição Genética , Sítio de Iniciação de Transcrição
6.
Nucleic Acids Res ; 42(20): 12367-79, 2014 Nov 10.
Artigo em Inglês | MEDLINE | ID: mdl-25336616

RESUMO

While individual non-B DNA structures have been shown to impact gene expression, their broad regulatory role remains elusive. We utilized genomic variants and expression quantitative trait loci (eQTL) data to analyze genome-wide variation propensities of potential non-B DNA regions and their relation to gene expression. Independent of genomic location, these regions were enriched in nucleotide variants. Our results are consistent with previously observed mutagenic properties of these regions and counter a previous study concluding that G-quadruplex regions have a reduced frequency of variants. While such mutagenicity might undermine functionality of these elements, we identified in potential non-B DNA regions a signature of negative selection. Yet, we found a depletion of eQTL-associated variants in potential non-B DNA regions, opposite to what might be expected from their proposed regulatory role. However, we also observed that genes downstream of potential non-B DNA regions showed higher expression variation between individuals. This coupling between mutagenicity and tolerance for expression variability of downstream genes may be a result of evolutionary adaptation, which allows reconciling mutagenicity of non-B DNA structures with their location in functionally important regions and their potential regulatory role.


Assuntos
DNA/química , Expressão Gênica , Variação Genética , Genoma Humano , Taxa de Mutação , Humanos , Mutagênese , Conformação de Ácido Nucleico , Nucleotídeos/análise , Locos de Características Quantitativas
7.
Nucleic Acids Res ; 41(12): 5965-77, 2013 Jul.
Artigo em Inglês | MEDLINE | ID: mdl-23620297

RESUMO

Although the right-handed double helical B-form DNA is most common under physiological conditions, DNA is dynamic and can adopt a number of alternative structures, such as the four-stranded G-quadruplex, left-handed Z-DNA, cruciform and others. Active transcription necessitates strand separation and can induce such non-canonical forms at susceptible genomic sequences. Therefore, it has been speculated that these non-B DNA motifs can play regulatory roles in gene transcription. Such conjecture has been supported in higher eukaryotes by direct studies of several individual genes, as well as a number of large-scale analyses. However, the role of non-B DNA structures in many lower organisms, in particular proteobacteria, remains poorly understood and incompletely documented. In this study, we performed the first comprehensive study of the occurrence of B DNA-non-B DNA transition-susceptible sites (non-B DNA motifs) within the context of the operon structure of the Escherichia coli genome. We compared the distributions of non-B DNA motifs in the regulatory regions of operons with those from internal regions. We found an enrichment of some non-B DNA motifs in regulatory regions, and we show that this enrichment cannot be simply explained by base composition bias in these regions. We also showed that the distribution of several non-B DNA motifs within intergenic regions separating divergently oriented operons differs from the distribution found between convergent ones. In particular, we found a strong enrichment of cruciforms in the termination region of operons; this enrichment was observed for operons with Rho-dependent, as well as Rho-independent terminators. Finally, a preference for some non-B DNA motifs was observed near transcription factor-binding sites. Overall, the conspicuous enrichment of transition-susceptible sites in these specific regulatory regions suggests that non-B DNA structures may have roles in the transcriptional regulation of specific operons within the E. coli genome.


Assuntos
DNA Bacteriano/química , Escherichia coli/genética , Óperon , Elementos Reguladores de Transcrição , Transcrição Gênica , Sítios de Ligação , DNA Bacteriano/metabolismo , DNA Cruciforme/análise , DNA Intergênico/química , Genoma Bacteriano , Motivos de Nucleotídeos , Regiões Promotoras Genéticas , Fatores de Transcrição/metabolismo , Terminação da Transcrição Genética
8.
Int J Mol Sci ; 15(7): 13111-22, 2014 Jul 23.
Artigo em Inglês | MEDLINE | ID: mdl-25056547

RESUMO

Genomic DNA is under constant assault by endogenous and exogenous DNA damaging agents. DNA breakage can represent a major threat to genome integrity but can also be necessary for genome function. Here we present approaches to map DNA double-strand breaks (DSBs) and single-strand breaks (SSBs) at the genome-wide scale by two methods called DSB- and SSB-Seq, respectively. We tested these methods in human colon cancer cells and validated the results using the Topoisomerase II (Top2)-poisoning agent etoposide (ETO). Our results show that the combination of ETO treatment with break-mapping techniques is a powerful method to elaborate the pattern of Top2 enzymatic activity across the genome.


Assuntos
Quebras de DNA de Cadeia Dupla , Quebras de DNA de Cadeia Simples , DNA Topoisomerases Tipo II/metabolismo , DNA/metabolismo , Mapeamento Cromossômico , Células HCT116 , Humanos , Análise de Sequência de DNA
9.
PLoS Comput Biol ; 8(8): e1002644, 2012.
Artigo em Inglês | MEDLINE | ID: mdl-22956896

RESUMO

The intrinsic stochasticity of gene expression leads to cell-to-cell variations, noise, in protein abundance. Several processes, including transcription, translation, and degradation of mRNA and proteins, can contribute to these variations. Recent single cell analyses of gene expression in yeast have uncovered a general trend where expression noise scales with protein abundance. This trend is consistent with a stochastic model of gene expression where mRNA copy number follows the random birth and death process. However, some deviations from this basic trend have also been observed, prompting questions about the contribution of gene-specific features to such deviations. For example, recent studies have pointed to the TATA box as a sequence feature that can influence expression noise by facilitating expression bursts. Transcription-originated noise can be potentially further amplified in translation. Therefore, we asked the question of to what extent sequence features known or postulated to accompany translation efficiency can also be associated with increase in noise strength and, on average, how such increase compares to the amplification associated with the TATA box. Untangling different components of expression noise is highly nontrivial, as they may be gene or gene-module specific. In particular, focusing on codon usage as one of the sequence features associated with efficient translation, we found that ribosomal genes display a different relationship between expression noise and codon usage as compared to other genes. Within nonribosomal genes we found that sequence high codon usage is correlated with increased noise relative to the average noise of proteins with the same abundance. Interestingly, by projecting the data on a theoretical model of gene expression, we found that the amplification of noise strength associated with codon usage is comparable to that of the TATA box, suggesting that the effect of translation on noise in eukaryotic gene expression might be more prominent than previously appreciated.


Assuntos
Expressão Gênica , Biossíntese de Proteínas , Processos Estocásticos , Transcrição Gênica , Células Eucarióticas/metabolismo , Conformação de Ácido Nucleico , RNA Mensageiro/química , RNA Mensageiro/genética , TATA Box
10.
Methods Mol Biol ; 2651: 179-193, 2023.
Artigo em Inglês | MEDLINE | ID: mdl-36892768

RESUMO

Z-DNAs are nucleic acid secondary structures that form within a special pattern of nucleotides and are promoted by DNA supercoiling. Through Z-DNA formation, DNA encodes information by dynamic changes in its secondary structure. A growing body of evidence indicates that Z-DNA formation can play a role in gene regulation; it can affect chromatin architecture and demonstrates its association with genomic instability, genetic diseases, and genome evolution. Many functional roles of Z-DNA are yet to be discovered highlighting the need for techniques to detect genome-wide folding of DNA into this structure. Here, we describe an approach to convert linear genome into supercoiled genome sponsoring Z-DNA formation. Applying permanganate-based methodology and high-throughput sequencing to supercoiled genome allows genome-wide detection of single-stranded DNA. Single-stranded DNA is characteristic of the junctions between the classical B-form of DNA and Z-DNA. Consequently, analysis of single-stranded DNA map provides snapshots of the Z-DNA conformation over the whole genome.


Assuntos
DNA Forma Z , DNA de Cadeia Simples , DNA/genética , DNA/química , Conformação de Ácido Nucleico , Cromatina , DNA Super-Helicoidal/genética
11.
Genome Med ; 15(1): 15, 2023 03 06.
Artigo em Inglês | MEDLINE | ID: mdl-36879282

RESUMO

BACKGROUND: There has been a growing appreciation recently that mutagenic processes can be studied through the lenses of mutational signatures, which represent characteristic mutation patterns attributed to individual mutagens. However, the causal links between mutagens and observed mutation patterns as well as other types of interactions between mutagenic processes and molecular pathways are not fully understood, limiting the utility of mutational signatures. METHODS: To gain insights into these relationships, we developed a network-based method, named GENESIGNET that constructs an influence network among genes and mutational signatures. The approach leverages sparse partial correlation among other statistical techniques to uncover dominant influence relations between the activities of network nodes. RESULTS: Applying GENESIGNET to cancer data sets, we uncovered important relations between mutational signatures and several cellular processes that can shed light on cancer-related processes. Our results are consistent with previous findings, such as the impact of homologous recombination deficiency on clustered APOBEC mutations in breast cancer. The network identified by GENESIGNET also suggest an interaction between APOBEC hypermutation and activation of regulatory T Cells (Tregs), as well as a relation between APOBEC mutations and changes in DNA conformation. GENESIGNET also exposed a possible link between the SBS8 signature of unknown etiology and the Nucleotide Excision Repair (NER) pathway. CONCLUSIONS: GENESIGNET provides a new and powerful method to reveal the relation between mutational signatures and gene expression. The GENESIGNET method was implemented in python, and installable package, source codes and the data sets used for and generated during this study are available at the Github site https://github.com/ncbi/GeneSigNet.


Assuntos
Fenômenos Biológicos , Neoplasias da Mama , Humanos , Feminino , Mutação , Mutagênicos , Neoplasias da Mama/genética , Núcleo Celular
12.
Biomolecules ; 12(10)2022 09 27.
Artigo em Inglês | MEDLINE | ID: mdl-36291592

RESUMO

Smoking is a widely recognized risk factor in the emergence of cancers and other lung diseases. Studies of non-cancer lung diseases typically investigate the role that smoking has in chronic changes in lungs that might predispose patients to the diseases, whereas most cancer studies focus on the mutagenic properties of smoking. Large-scale cancer analysis efforts have collected expression data from both tumor and control lung tissues, and studies have used control samples to estimate the impact of smoking on gene expression. However, such analyses may be confounded by tumor-related micro-environments as well as patient-specific exposure to smoking. Thus, in this paper, we explore the utilization of mutational signatures to study environment-induced changes of gene expression in control lung tissues from lung adenocarcinoma samples. We show that a joint computational analysis of mutational signatures derived from sequenced tumor samples, and the gene expression obtained from control samples, can shed light on the combined impact that smoking and tumor-related micro-environments have on gene expression and cell-type composition in non-neoplastic (control) lung tissue. The results obtained through such analysis are both supported by experimental studies, including studies utilizing single-cell technology, and also suggest additional novel insights. We argue that the study provides a proof of principle of the utility of mutational signatures to be used as sensors of environmental exposures not only in the context of the mutational landscape of cancer, but also as a reference for changes in non-cancer lung tissues. It also provides an example of how a database collected with the purpose of understanding cancer can provide valuable information for studies not directly related to the disease.


Assuntos
Pneumopatias , Neoplasias Pulmonares , Neoplasias , Humanos , Mutação , Neoplasias/genética , Fumar/efeitos adversos , Fumar/genética , Pulmão , Neoplasias Pulmonares/induzido quimicamente , Neoplasias Pulmonares/genética , Microambiente Tumoral/genética
13.
Cell Syst ; 12(10): 994-1003.e4, 2021 10 20.
Artigo em Inglês | MEDLINE | ID: mdl-34375586

RESUMO

Cancer genomes accumulate a large number of somatic mutations resulting from a combination of stochastic errors in DNA processing, cancer-related aberrations of the DNA repair machinery, or carcinogenic exposures; each mutagenic process leaves a characteristic mutational signature. A key challenge is understanding the interactions between signatures, particularly as DNA repair deficiencies often modify the effects of other mutagens. Here, we introduce RepairSig, a computational method that explicitly models additive primary mutagenic processes; non-additive secondary processes, which interact with the primary processes; and a mutation opportunity, that is, the distribution of sites across the genome that are vulnerable to damage or preferentially repaired. We demonstrate that RepairSig accurately recapitulates experimentally identified signatures, identifies autonomous signatures of deficient DNA repair processes, and explains mismatch repair deficiency in breast cancer by de novo inference of both primary and secondary signatures from patient data. RepairSig is freely available for download at https://github.com/ncbi/RepairSig.


Assuntos
Neoplasias da Mama , Dano ao DNA , Neoplasias da Mama/genética , DNA , Dano ao DNA/genética , Reparo do DNA/genética , Feminino , Humanos , Mutação/genética
14.
Annu Rev Biomed Data Sci ; 4: 189-206, 2021 07 20.
Artigo em Inglês | MEDLINE | ID: mdl-34465178

RESUMO

Mutations are the driving force of evolution, yet they underlie many diseases, in particular, cancer. They are thought to arise from a combination of stochastic errors in DNA processing, naturally occurring DNA damage (e.g., the spontaneous deamination of methylated CpG sites), replication errors, and dysregulation of DNA repair mechanisms. High-throughput sequencing has made it possible to generate large datasets to study mutational processes in health and disease. Since the emergence of the first mutational process studies in 2012, this field is gaining increasing attention and has already accumulated a host of computational approaches and biomedical applications.


Assuntos
Neoplasias , Dano ao DNA , Reparo do DNA/genética , Sequenciamento de Nucleotídeos em Larga Escala , Humanos , Mutação , Neoplasias/genética
15.
Pac Symp Biocomput ; 25: 262-273, 2020.
Artigo em Inglês | MEDLINE | ID: mdl-31797602

RESUMO

Cancer genomes accumulate a large number of somatic mutations resulting from imperfection of DNA processing during normal cell cycle as well as from carcinogenic exposures or cancer related aberrations of DNA maintenance machinery. These processes often lead to distinctive patterns of mutations, called mutational signatures. Several computational methods have been developed to uncover such signatures from catalogs of somatic mutations. However, cancer mutational signatures are the end-effect of several interplaying factors including carcinogenic exposures and potential deficiencies of the DNA repair mechanism. To fully understand the nature of each signature, it is important to disambiguate the atomic components that contribute to the final signature. Here, we introduce a new descriptor of mutational signatures, DNA Repair FootPrint (RePrint), and show that it can capture common properties of deficiencies in repair mechanisms contributing to diverse signatures. We validate the method with published mutational signatures from cell lines targeted with CRISPR-Cas9-based knockouts of DNA repair genes.


Assuntos
Biologia Computacional , Neoplasias , DNA , Reparo do DNA/genética , Humanos , Mutação , Neoplasias/genética
16.
iScience ; 23(10): 101619, 2020 Oct 23.
Artigo em Inglês | MEDLINE | ID: mdl-33089107

RESUMO

Phenotypic heterogeneity in cancer is often caused by different patterns of genetic alterations. Understanding such phenotype-genotype relationships is fundamental for the advance of personalized medicine. We develop a computational method, named NETPHIX (NETwork-to-PHenotype association with eXclusivity) to identify subnetworks of genes whose genetic alterations are associated with drug response or other continuous cancer phenotypes. Leveraging interaction information among genes and properties of cancer mutations such as mutual exclusivity, we formulate the problem as an integer linear program and solve it optimally to obtain a subnetwork of associated genes. Applied to a large-scale drug screening dataset, NETPHIX uncovered gene modules significantly associated with drug responses. Utilizing interaction information, NETPHIX modules are functionally coherent and can thus provide important insights into drug action. In addition, we show that modules identified by NETPHIX together with their association patterns can be leveraged to suggest drug combinations.

17.
iScience ; 23(3): 100900, 2020 Mar 27.
Artigo em Inglês | MEDLINE | ID: mdl-32088392

RESUMO

The characterization of mutational processes in terms of their signatures of activity relies mostly on the assumption that mutations in a given cancer genome are independent of one another. Recently, it was discovered that certain segments of mutations, termed processive groups, occur on the same DNA strand and are generated by a single process or signature. Here we provide a first probabilistic model of mutational signatures that accounts for their observed stickiness and strand coordination. The model conditions on the observed strand for each mutation and allows the same signature to generate a run of mutations. It can both use known signatures or learn new ones. We show that this model provides a more accurate description of the properties of mutagenic processes than independent-mutation achieving substantially higher likelihood on held-out data. We apply this model to characterize the processivity of mutagenic processes across multiple types of cancer.

18.
Genome Med ; 12(1): 52, 2020 05 29.
Artigo em Inglês | MEDLINE | ID: mdl-32471470

RESUMO

BACKGROUND: Studies of cancer mutations have typically focused on identifying cancer driving mutations that confer growth advantage to cancer cells. However, cancer genomes accumulate a large number of passenger somatic mutations resulting from various endogenous and exogenous causes, including normal DNA damage and repair processes or cancer-related aberrations of DNA maintenance machinery as well as mutations triggered by carcinogenic exposures. Different mutagenic processes often produce characteristic mutational patterns called mutational signatures. Identifying mutagenic processes underlying mutational signatures shaping a cancer genome is an important step towards understanding tumorigenesis. METHODS: To investigate the genetic aberrations associated with mutational signatures, we took a network-based approach considering mutational signatures as cancer phenotypes. Specifically, our analysis aims to answer the following two complementary questions: (i) what are functional pathways whose gene expression activities correlate with the strengths of mutational signatures, and (ii) are there pathways whose genetic alterations might have led to specific mutational signatures? To identify mutated pathways, we adopted a recently developed optimization method based on integer linear programming. RESULTS: Analyzing a breast cancer dataset, we identified pathways associated with mutational signatures on both expression and mutation levels. Our analysis captured important differences in the etiology of the APOBEC-related signatures and the two clock-like signatures. In particular, it revealed that clustered and dispersed APOBEC mutations may be caused by different mutagenic processes. In addition, our analysis elucidated differences between two age-related signatures-one of the signatures is correlated with the expression of cell cycle genes while the other has no such correlation but shows patterns consistent with the exposure to environmental/external processes. CONCLUSIONS: This work investigated, for the first time, a network-level association of mutational signatures and dysregulated pathways. The identified pathways and subnetworks provide novel insights into mutagenic processes that the cancer genomes might have undergone and important clues for developing personalized drug therapies.


Assuntos
Neoplasias da Mama/genética , Desaminases APOBEC/genética , Feminino , Humanos , Mutação , Fenótipo
19.
Methods Mol Biol ; 2035: 369-382, 2019.
Artigo em Inglês | MEDLINE | ID: mdl-31444763

RESUMO

While DNA inside the cells is predominantly canonical right-handed double helix, guanine-rich DNAs have potential to fold into four-stranded structures that contain stacks of G-quartets (G4 DNA quadruplex). Genome sequencing has revealed G4 sequences tend to localize at the gene control regions, especially in the promoters of oncogenes. A growing body of evidence indicates that G4 DNA quadruplexes might have important regulatory roles in genome function, highlighting the need for techniques to detect genome-wide folding of DNA into this structure. Potassium permanganate in vivo treatment of cells results in oxidizing of nucleotides in single-stranded DNA regions that accompany G4 DNA quadruplexes formation, providing an excellent probe for the conformational state of DNA inside the living cells. Here, we describe a permanganate-based methodology to detect G4 DNA quadruplex, genome-wide. This methodology combined with high-throughput sequencing provides a snapshot of the DNA conformation over the whole genome in vivo.


Assuntos
Quadruplex G , Cromatina/química , Genômica , Compostos de Manganês/química , Óxidos/química
20.
Genome Med ; 11(1): 49, 2019 07 26.
Artigo em Inglês | MEDLINE | ID: mdl-31349863

RESUMO

Knowing the activity of the mutational processes shaping a cancer genome may provide insight into tumorigenesis and personalized therapy. It is thus important to characterize the signatures of active mutational processes in patients from their patterns of single base substitutions. However, mutational processes do not act uniformly on the genome, leading to statistical dependencies among neighboring mutations. To account for such dependencies, we develop the first sequence-dependent model, SigMa, for mutation signatures. We apply SigMa to characterize genomic and other factors that influence the activity of mutation signatures in breast cancer. We show that SigMa outperforms previous approaches, revealing novel insights on signature etiology. The source code for SigMa is publicly available at https://github.com/lrgr/sigma.


Assuntos
Biomarcadores Tumorais , Biologia Computacional/métodos , Análise Mutacional de DNA/métodos , Cadeias de Markov , Mutação , Neoplasias/genética , Algoritmos , Neoplasias da Mama/genética , Feminino , Genoma Humano , Genômica/métodos , Humanos , Software
SELEÇÃO DE REFERÊNCIAS
DETALHE DA PESQUISA