Búsqueda | Portal de Búsqueda de la BVS Enfermería

1.

Clustered Mutation Signatures Reveal that Error-Prone DNA Repair Targets Mutations to Active Genes.

Supek, Fran; Lehner, Ben.

Cell ; 170(3): 534-547.e23, 2017 Jul 27.

Artículo en Inglés | MEDLINE | ID: mdl-28753428

RESUMEN

Many processes can cause the same nucleotide change in a genome, making the identification of the mechanisms causing mutations a difficult challenge. Here, we show that clustered mutations provide a more precise fingerprint of mutagenic processes. Of nine clustered mutation signatures identified from >1,000 tumor genomes, three relate to variable APOBEC activity and three are associated with tobacco smoking. An additional signature matches the spectrum of translesion DNA polymerase eta (POLH). In lymphoid cells, these mutations target promoters, consistent with AID-initiated somatic hypermutation. In solid tumors, however, they are associated with UV exposure and alcohol consumption and target the H3K36me3 chromatin of active genes in a mismatch repair (MMR)-dependent manner. These regions normally have a low mutation rate because error-free MMR also targets H3K36me3 chromatin. Carcinogens and error-prone repair therefore redistribute mutations to the more important regions of the genome, contributing a substantial mutation load in many tumors, including driver mutations.

Asunto(s)

Reparación de la Incompatibilidad de ADN , Mutación , Neoplasias/genética , Desaminasas APOBEC , Citidina Desaminasa , Citosina Desaminasa/genética , ADN Polimerasa Dirigida por ADN/genética , Humanos , Neoplasias Hepáticas/inducido químicamente , Neoplasias Hepáticas/genética , Melanoma/genética , Mutagénesis , Fumar/efectos adversos , Rayos Ultravioleta/efectos adversos

2.

Synonymous mutations frequently act as driver mutations in human cancers.

Supek, Fran; Miñana, Belén; Valcárcel, Juan; Gabaldón, Toni; Lehner, Ben.

Cell ; 156(6): 1324-1335, 2014 Mar 13.

Artículo en Inglés | MEDLINE | ID: mdl-24630730

RESUMEN

Synonymous mutations change the sequence of a gene without directly altering the sequence of the encoded protein. Here, we present evidence that these "silent" mutations frequently contribute to human cancer. Selection on synonymous mutations in oncogenes is cancer-type specific, and although the functional consequences of cancer-associated synonymous mutations may be diverse, they recurrently alter exonic motifs that regulate splicing and are associated with changes in oncogene splicing in tumors. The p53 tumor suppressor (TP53) also has recurrent synonymous mutations, but, in contrast to those in oncogenes, these are adjacent to splice sites and inactivate them. We estimate that between one in two and one in five silent mutations in oncogenes have been selected, equating to ~6%- 8% of all selected single-nucleotide changes in these genes. In addition, our analyses suggest that dosage-sensitive oncogenes have selected mutations in their 3' UTRs.

Asunto(s)

Mutación , Neoplasias/genética , Oncogenes , Sitios de Empalme de ARN , Regiones no Traducidas 3' , Secuencia de Bases , Genes p53 , Humanos , Empalme del ARN

3.

Mutation rate heterogeneity at the sub-gene scale due to local DNA hypomethylation.

Mas-Ponte, David; Supek, Fran.

Nucleic Acids Res ; 52(8): 4393-4408, 2024 May 08.

Artículo en Inglés | MEDLINE | ID: mdl-38587182

RESUMEN

Local mutation rates in human are highly heterogeneous, with known variability at the scale of megabase-sized chromosomal domains, and, on the other extreme, at the scale of oligonucleotides. The intermediate, kilobase-scale heterogeneity in mutation risk is less well characterized. Here, by analyzing thousands of somatic genomes, we studied mutation risk gradients along gene bodies, representing a genomic scale spanning roughly 1-10 kb, hypothesizing that different mutational mechanisms are differently distributed across gene segments. The main heterogeneity concerns several kilobases at the transcription start site and further downstream into 5' ends of gene bodies; these are commonly hypomutated with several mutational signatures, most prominently the ubiquitous C > T changes at CpG dinucleotides. The width and shape of this mutational coldspot at 5' gene ends is variable across genes, and corresponds to variable interval of lowered DNA methylation depending on gene activity level and regulation. Such hypomutated loci, at 5' gene ends or elsewhere, correspond to DNA hypomethylation that can associate with various landmarks, including intragenic enhancers, Polycomb-marked regions, or chromatin loop anchor points. Tissue-specific DNA hypomethylation begets tissue-specific local hypomutation. Of note, direction of mutation risk is inverted for AID/APOBEC3 cytosine deaminase activity, whose signatures are enriched in hypomethylated regions.

Asunto(s)

Metilación de ADN , Tasa de Mutación , Humanos , Islas de CpG , Heterogeneidad Genética , Genoma Humano , Mutación , Sitio de Iniciación de la Transcripción

4.

Genomics reveal unknown mutation-promoting agents at global sites.

Franco, Irene; Supek, Fran.

Nature ; 629(8013): 767-768, 2024 May.

Artículo en Inglés | MEDLINE | ID: mdl-38773293

Asunto(s)

Genómica , Mutación , Humanos , Animales

5.

To NMD or Not To NMD: Nonsense-Mediated mRNA Decay in Cancer and Other Genetic Diseases.

Supek, Fran; Lehner, Ben; Lindeboom, Rik G H.

Trends Genet ; 37(7): 657-668, 2021 07.

Artículo en Inglés | MEDLINE | ID: mdl-33277042

RESUMEN

The nonsense-mediated mRNA decay (NMD) pathway degrades some but not all mRNAs bearing premature termination codons (PTCs). Decades of work have elucidated the molecular mechanisms of NMD. More recently, statistical analyses of large genomic datasets have allowed the importance of known and novel 'rules of NMD' to be tested and combined into methods that accurately predict whether PTC-containing mRNAs are degraded or not. We discuss these genomic approaches and how they can be applied to identify diseases and individuals that may benefit from inhibition or activation of NMD. We also discuss the importance of NMD for gene editing and tumor evolution, and how inhibiting NMD may be an effective strategy to increase the efficacy of cancer immunotherapy.

Asunto(s)

Empalme Alternativo/genética , Enfermedades Genéticas Congénitas/genética , Neoplasias/genética , Degradación de ARNm Mediada por Codón sin Sentido/genética , Codón sin Sentido/genética , Humanos , ARN Mensajero/genética

6.

Loss of the abasic site sensor HMCES is synthetic lethal with the activity of the APOBEC3A cytosine deaminase in cancer cells.

Biayna, Josep; Garcia-Cao, Isabel; Álvarez, Miguel M; Salvadores, Marina; Espinosa-Carrasco, Jose; McCullough, Marcel; Supek, Fran; Stracker, Travis H.

PLoS Biol ; 19(3): e3001176, 2021 03.

Artículo en Inglés | MEDLINE | ID: mdl-33788831

RESUMEN

Analysis of cancer mutagenic signatures provides information about the origin of mutations and can inform the use of clinical therapies, including immunotherapy. In particular, APOBEC3A (A3A) has emerged as a major driver of mutagenesis in cancer cells, and its expression results in DNA damage and susceptibility to treatment with inhibitors of the ATR and CHK1 checkpoint kinases. Here, we report the implementation of CRISPR/Cas-9 genetic screening to identify susceptibilities of multiple A3A-expressing lung adenocarcinoma (LUAD) cell lines. We identify HMCES, a protein recently linked to the protection of abasic sites, as a central protein for the tolerance of A3A expression. HMCES depletion results in synthetic lethality with A3A expression preferentially in a TP53-mutant background. Analysis of previous screening data reveals a strong association between A3A mutational signatures and sensitivity to HMCES loss and indicates that HMCES is specialized in protecting against a narrow spectrum of DNA damaging agents in addition to A3A. We experimentally show that both HMCES disruption and A3A expression increase susceptibility of cancer cells to ionizing radiation (IR), oxidative stress, and ATR inhibition, strategies that are often applied in tumor therapies. Overall, our results suggest that HMCES is an attractive target for selective treatment of A3A-expressing tumors.

Asunto(s)

Adenocarcinoma del Pulmón/genética , Citidina Desaminasa/genética , Proteínas de Unión al ADN/genética , Proteínas/genética , Adenocarcinoma del Pulmón/metabolismo , Proteínas de la Ataxia Telangiectasia Mutada/metabolismo , Carcinoma de Pulmón de Células no Pequeñas/genética , Carcinoma de Pulmón de Células no Pequeñas/metabolismo , Línea Celular Tumoral , Quinasa 1 Reguladora del Ciclo Celular (Checkpoint 1)/metabolismo , Citidina Desaminasa/metabolismo , Citosina Desaminasa/genética , Citosina Desaminasa/metabolismo , ADN/genética , ADN/metabolismo , Daño del ADN/genética , Daño del ADN/fisiología , Replicación del ADN/genética , Replicación del ADN/fisiología , Proteínas de Unión al ADN/metabolismo , Humanos , Proteínas/metabolismo

7.

Prevalence, causes and impact of TP53-loss phenocopying events in human tumors.

Fito-Lopez, Bruno; Salvadores, Marina; Alvarez, Miguel-Martin; Supek, Fran.

BMC Biol ; 21(1): 92, 2023 04 24.

Artículo en Inglés | MEDLINE | ID: mdl-37095494

RESUMEN

BACKGROUND: TP53 is a master tumor suppressor gene, mutated in approximately half of all human cancers. Given the many regulatory roles of the corresponding p53 protein, it is possible to infer loss of p53 activity - which may occur due to alterations in trans - from gene expression patterns. Several such alterations that phenocopy p53 loss are known, however additional ones may exist, but their identity and prevalence among human tumors are not well characterized. RESULTS: We perform a large-scale statistical analysis on transcriptomes of ~ 7,000 tumors and ~ 1,000 cell lines, estimating that 12% and 8% of tumors and cancer cell lines, respectively, phenocopy TP53 loss: they are likely deficient in the activity of the p53 pathway, while not bearing obvious TP53 inactivating mutations. While some of these cases are explained by amplifications in the known phenocopying genes MDM2, MDM4 and PPM1D, many are not. An association analysis of cancer genomic scores jointly with CRISPR/RNAi genetic screening data identified an additional common TP53-loss phenocopying gene, USP28. Deletions in USP28 are associated with a TP53 functional impairment in 2.9-7.6% of breast, bladder, lung, liver and stomach tumors, and have comparable effect size to MDM4 amplifications. Additionally, in the known copy number alteration (CNA) segment harboring MDM2, we identify an additional co-amplified gene (CNOT2) that may cooperatively boost the TP53 functional inactivation effect of MDM2. An analysis of cancer cell line drug screens using phenocopy scores suggests that TP53 (in)activity commonly modulates associations between anticancer drug effects and various genetic markers, such as PIK3CA and PTEN mutations, and should thus be considered as a drug activity modifying factor in precision medicine. As a resource, we provide the drug-genetic marker associations that differ depending on TP53 functional status. CONCLUSIONS: Human tumors that do not bear obvious TP53 genetic alterations but that phenocopy p53 activity loss are common, and the USP28 gene deletions are one likely cause.

Asunto(s)

Neoplasias , Proteína p53 Supresora de Tumor , Humanos , Proteína p53 Supresora de Tumor/metabolismo , Proteínas Proto-Oncogénicas c-mdm2/metabolismo , Prevalencia , Neoplasias/genética , Genes p53 , Mutación , Ubiquitina Tiolesterasa/genética , Ubiquitina Tiolesterasa/metabolismo , Proteínas Proto-Oncogénicas/genética , Proteínas de Ciclo Celular/metabolismo , Proteínas Represoras/genética

8.

Spectrum of DNA mismatch repair failures viewed through the lens of cancer genomics and implications for therapy.

Mas-Ponte, David; McCullough, Marcel; Supek, Fran.

Clin Sci (Lond) ; 136(5): 383-404, 2022 03 18.

Artículo en Inglés | MEDLINE | ID: mdl-35274136

RESUMEN

Genome sequencing can be used to detect DNA repair failures in tumors and learn about underlying mechanisms. Here, we synthesize findings from genomic studies that examined deficiencies of the DNA mismatch repair (MMR) pathway. The impairment of MMR results in genome-wide hypermutation and in the 'microsatellite instability' (MSI) phenotype-occurrence of indel mutations at short tandem repeat (microsatellite) loci. The MSI status of tumors was traditionally assessed by molecular testing of a selected set of MS loci or by measuring MMR protein expression levels. Today, genomic data can provide a more complete picture of the consequences on genomic instability. Multiple computational studies examined somatic mutation distributions that result from failed DNA repair pathways in tumors. These include analyzing the commonly studied trinucleotide mutational spectra of single-nucleotide variants (SNVs), as well as of other features such as indels, structural variants, mutation clusters and regional mutation rate redistribution. The identified mutation patterns can be used to rigorously measure prevalence of MMR failures across cancer types, and potentially to subcategorize the MMR deficiencies. Diverse data sources, genomic and pre-genomic, from human and from experimental models, suggest there are different ways in which MMR can fail, and/or that the cell-type or genetic background may result in different types of MMR mutational patterns. The spectrum of MMR failures may direct cancer evolution, generating particular sets of driver mutations. Moreover, MMR affects outcomes of therapy by DNA damaging drugs, antimetabolites, nonsense-mediated mRNA decay (NMD) inhibitors, and immunotherapy by promoting either resistance or sensitivity, depending on the type of therapy.

Asunto(s)

Neoplasias Colorrectales , Reparación de la Incompatibilidad de ADN , Neoplasias Colorrectales/genética , Reparación de la Incompatibilidad de ADN/genética , Reparación del ADN/genética , Genómica , Humanos , Inestabilidad de Microsatélites , Mutación/genética

9.

Differential DNA mismatch repair underlies mutation rate variation across the human genome.

Supek, Fran; Lehner, Ben.

Nature ; 521(7550): 81-4, 2015 May 07.

Artículo en Inglés | MEDLINE | ID: mdl-25707793

RESUMEN

Cancer genome sequencing has revealed considerable variation in somatic mutation rates across the human genome, with mutation rates elevated in heterochromatic late replicating regions and reduced in early replicating euchromatin. Multiple mechanisms have been suggested to underlie this, but the actual cause is unknown. Here we identify variable DNA mismatch repair (MMR) as the basis of this variation. Analysing â¼17 million single-nucleotide variants from the genomes of 652 tumours, we show that regional autosomal mutation rates at megabase resolution are largely stable across cancer types, with differences related to changes in replication timing and gene expression. However, mutations arising after the inactivation of MMR are no longer enriched in late replicating heterochromatin relative to early replicating euchromatin. Thus, differential DNA repair and not differential mutation supply is the primary cause of the large-scale regional mutation rate variation across the human genome.

Asunto(s)

Reparación de la Incompatibilidad de ADN/genética , Genoma Humano/genética , Tasa de Mutación , Línea Celular , Momento de Replicación del ADN/genética , Regulación Neoplásica de la Expresión Génica/genética , Humanos , Mutagénesis/genética , Neoplasias/genética , Neoplasias/patología

10.

Passenger mutations accurately classify human tumors.

Salvadores, Marina; Mas-Ponte, David; Supek, Fran.

PLoS Comput Biol ; 15(4): e1006953, 2019 04.

Artículo en Inglés | MEDLINE | ID: mdl-30986244

RESUMEN

Determining the cancer type and molecular subtype has important clinical implications. The primary site is however unknown for some malignancies discovered in the metastatic stage. Moreover liquid biopsies may be used to screen for tumoral DNA, which upon detection needs to be assigned to a site-of-origin. Classifiers based on genomic features are a promising approach to prioritize the tumor anatomical site, type and subtype. We examined the predictive ability of causal (driver) somatic mutations in this task, comparing it against global patterns of non-selected (passenger) mutations, including features based on regional mutation density (RMD). In the task of distinguishing 18 cancer types, the driver mutations-mutated oncogenes or tumor suppressors, pathways and hotspots-classified 36% of the patients to the correct cancer type. In contrast, the features based on passenger mutations did so at 92% accuracy, with similar contribution from the RMD and the trinucleotide mutation spectra. The RMD and the spectra covered distinct sets of patients with predictions. In particular, introducing the RMD features into a combined classification model increased the fraction of diagnosed patients by 50 percentage points (at 20% FDR). Furthermore, RMD was able to discriminate molecular subtypes and/or anatomical site of six major cancers. The advantage of passenger mutations was upheld under high rates of false negative mutation calls and with exome sequencing, even though overall accuracy decreased. We suggest whole genome sequencing is valuable for classifying tumors because it captures global patterns emanating from mutational processes, which are informative of the underlying tumor biology.

Asunto(s)

Biología Computacional/métodos , Neoplasias/clasificación , Neoplasias/genética , Algoritmos , ADN de Neoplasias/clasificación , ADN de Neoplasias/genética , Exoma/genética , Genómica , Humanos , Aprendizaje Automático , Mutación/genética , Programas Informáticos , Secuenciación del Exoma/métodos , Secuenciación Completa del Genoma/métodos

11.

The landscape of microbial phenotypic traits and associated genes.

Brbic, Maria; Piskorec, Matija; Vidulin, Vedrana; Krisko, Anita; Smuc, Tomislav; Supek, Fran.

Nucleic Acids Res ; 44(21): 10074-10090, 2016 12 01.

Artículo en Inglés | MEDLINE | ID: mdl-27915291

RESUMEN

Bacteria and Archaea display a variety of phenotypic traits and can adapt to diverse ecological niches. However, systematic annotation of prokaryotic phenotypes is lacking. We have therefore developed ProTraits, a resource containing â¼545 000 novel phenotype inferences, spanning 424 traits assigned to 3046 bacterial and archaeal species. These annotations were assigned by a computational pipeline that associates microbes with phenotypes by text-mining the scientific literature and the broader World Wide Web, while also being able to define novel concepts from unstructured text. Moreover, the ProTraits pipeline assigns phenotypes by drawing extensively on comparative genomics, capturing patterns in gene repertoires, codon usage biases, proteome composition and co-occurrence in metagenomes. Notably, we find that gene synteny is highly predictive of many phenotypes, and highlight examples of gene neighborhoods associated with spore-forming ability. A global analysis of trait interrelatedness outlined clusters in the microbial phenotype network, suggesting common genetic underpinnings. Our extended set of phenotype annotations allows detection of 57 088 high confidence gene-trait links, which recover many known associations involving sporulation, flagella, catalase activity, aerobicity, photosynthesis and other traits. Over 99% of the commonly occurring gene families are involved in genetic interactions conditional on at least one phenotype, suggesting that epistasis has a major role in shaping microbial gene content.

Asunto(s)

Archaea/genética , Bacterias/genética , Bases de Datos Genéticas , Fenotipo , Codón , Biología Computacional/métodos , Minería de Datos , Genes Arqueales , Genes Bacterianos , Genoma Arqueal , Genoma Bacteriano , Metagenoma , Anotación de Secuencia Molecular , Herencia Multifactorial , Reproducibilidad de los Resultados

12.

Extensive complementarity between gene function prediction methods.

Vidulin, Vedrana; Smuc, Tomislav; Supek, Fran.

Bioinformatics ; 32(23): 3645-3653, 2016 12 01.

Artículo en Inglés | MEDLINE | ID: mdl-27522084

RESUMEN

MOTIVATION: The number of sequenced genomes rises steadily but we still lack the knowledge about the biological roles of many genes. Automated function prediction (AFP) is thus a necessity. We hypothesized that AFP approaches that draw on distinct genome features may be useful for predicting different types of gene functions, motivating a systematic analysis of the benefits gained by obtaining and integrating such predictions. RESULTS: Our pipeline amalgamates 5 133 543 genes from 2071 genomes in a single massive analysis that evaluates five established genomic AFP methodologies. While 1227 Gene Ontology (GO) terms yielded reliable predictions, the majority of these functions were accessible to only one or two of the methods. Moreover, different methods tend to assign a GO term to non-overlapping sets of genes. Thus, inferences made by diverse genomic AFP methods display a striking complementary, both gene-wise and function-wise. Because of this, a viable integration strategy is to rely on a single most-confident prediction per gene/function, rather than enforcing agreement across multiple AFP methods. Using an information-theoretic approach, we estimate that current databases contain 29.2 bits/gene of known Escherichia coli gene functions. This can be increased by up to 5.5 bits/gene using individual AFP methods or by 11 additional bits/gene upon integration, thereby providing a highly-ranking predictor on the Critical Assessment of Function Annotation 2 community benchmark. Availability of more sequenced genomes boosts the predictive accuracy of AFP approaches and also the benefit from integrating them. AVAILABILITY AND IMPLEMENTATION: The individual and integrated GO predictions for the complete set of genes are available from http://gorbi.irb.hr/ CONTACT: fran.supek@irb.hrSupplementary information: Supplementary materials are available at Bioinformatics online.

Asunto(s)

Biología Computacional/métodos , Ontología de Genes , Genómica/métodos , Algoritmos , Genoma , Aprendizaje Automático , Modelos Teóricos

13.

Hydroxymethylated cytosines are associated with elevated C to G transversion rates.

Supek, Fran; Lehner, Ben; Hajkova, Petra; Warnecke, Tobias.

PLoS Genet ; 10(9): e1004585, 2014 Sep.

Artículo en Inglés | MEDLINE | ID: mdl-25211471

RESUMEN

It has long been known that methylated cytosines deaminate at higher rates than unmodified cytosines and constitute mutational hotspots in mammalian genomes. The repertoire of naturally occurring cytosine modifications, however, extends beyond 5-methylcytosine to include its oxidation derivatives, notably 5-hydroxymethylcytosine. The effects of these modifications on sequence evolution are unknown. Here, we combine base-resolution maps of methyl- and hydroxymethylcytosine in human and mouse with population genomic, divergence and somatic mutation data to show that hydroxymethylated and methylated cytosines show distinct patterns of variation and evolution. Surprisingly, hydroxymethylated sites are consistently associated with elevated C to G transversion rates at the level of segregating polymorphisms, fixed substitutions, and somatic mutations in tumors. Controlling for multiple potential confounders, we find derived C to G SNPs to be 1.43-fold (1.22-fold) more common at hydroxymethylated sites compared to methylated sites in human (mouse). Increased C to G rates are evident across diverse functional and sequence contexts and, in cancer genomes, correlate with the expression of Tet enzymes and specific components of the mismatch repair pathway (MSH2, MSH6, and MBD4). Based on these and other observations we suggest that hydroxymethylation is associated with a distinct mutational burden and that the mismatch repair pathway is implicated in causing elevated transversion rates at hydroxymethylated cytosines.

Asunto(s)

5-Metilcitosina/metabolismo , Islas de CpG/genética , Citosina/análogos & derivados , Citosina/metabolismo , Animales , Metilación de ADN/genética , Humanos , Ratones , Mutación/genética

14.

The Code of Silence: Widespread Associations Between Synonymous Codon Biases and Gene Function.

Supek, Fran.

J Mol Evol ; 82(1): 65-73, 2016 Jan.

Artículo en Inglés | MEDLINE | ID: mdl-26538122

RESUMEN

Some mutations in gene coding regions exchange one synonymous codon for another, and thus do not alter the amino acid sequence of the encoded protein. Even though they are often called 'silent,' these mutations may exhibit a plethora of effects on the living cell. Therefore, they are often selected during evolution, causing synonymous codon usage biases in genomes. Comparative analyses of bacterial, archaeal, fungal, and human cancer genomes have found many links between a gene's biological role and the accrual of synonymous mutations during evolution. In particular, highly expressed genes in certain functional categories are enriched with optimal codons, which are decoded by the abundant tRNAs, thus enhancing the speed and accuracy of the translating ribosome. The set of genes exhibiting codon adaptation differs between genomes, and these differences show robust associations to organismal phenotypes. In addition to selection for translation efficiency, other distinct codon bias patterns have been found in: amino acid starvation genes, cyclically expressed genes, tissue-specific genes in animals and plants, oxidative stress response genes, cellular differentiation genes, and oncogenes. In addition, genomes of organisms harboring tRNA modifications exhibit particular codon preferences. The evolutionary trace of codon bias patterns across orthologous genes may be examined to learn about a gene's relevance to various phenotypes, or, more generally, its function in the cell.

Asunto(s)

Codón , Evolución Molecular , Sistemas de Lectura Abierta , Mutación Silenciosa , Archaea , Bacterias , Secuencia de Consenso , Hongos , Humanos , Neoplasias/genética

15.

Phyletic profiling with cliques of orthologs is enhanced by signatures of paralogy relationships.

Skunca, Nives; Bosnjak, Matko; Krisko, Anita; Panov, Pance; Dzeroski, Saso; Smuc, Tomislav; Supek, Fran.

PLoS Comput Biol ; 9(1): e1002852, 2013.

Artículo en Inglés | MEDLINE | ID: mdl-23308060

RESUMEN

New microbial genomes are sequenced at a high pace, allowing insight into the genetics of not only cultured microbes, but a wide range of metagenomic collections such as the human microbiome. To understand the deluge of genomic data we face, computational approaches for gene functional annotation are invaluable. We introduce a novel model for computational annotation that refines two established concepts: annotation based on homology and annotation based on phyletic profiling. The phyletic profiling-based model that includes both inferred orthologs and paralogs-homologs separated by a speciation and a duplication event, respectively-provides more annotations at the same average Precision than the model that includes only inferred orthologs. For experimental validation, we selected 38 poorly annotated Escherichia coli genes for which the model assigned one of three GO terms with high confidence: involvement in DNA repair, protein translation, or cell wall synthesis. Results of antibiotic stress survival assays on E. coli knockout mutants showed high agreement with our model's estimates of accuracy: out of 38 predictions obtained at the reported Precision of 60%, we confirmed 25 predictions, indicating that our confidence estimates can be used to make informed decisions on experimental validation. Our work will contribute to making experimental validation of computational predictions more approachable, both in cost and time. Our predictions for 998 prokaryotic genomes include ~400000 specific annotations with the estimated Precision of 90%, ~19000 of which are highly specific-e.g. "penicillin binding," "tRNA aminoacylation for protein translation," or "pathogenesis"-and are freely available at http://gorbi.irb.hr/.

Asunto(s)

Perfilación de la Expresión Génica , Filogenia , Escherichia coli/genética , Genes Bacterianos , Modelos Teóricos

16.

Cell cycle gene alterations associate with a redistribution of mutation risk across chromosomal domains in human cancers.

Salvadores, Marina; Supek, Fran.

Nat Cancer ; 5(2): 330-346, 2024 Feb.

Artículo en Inglés | MEDLINE | ID: mdl-38200245

RESUMEN

Mutations in human cells exhibit increased burden in heterochromatic, late DNA replication time (RT) chromosomal domains, with variation in mutation rates between tissues mirroring variation in heterochromatin and RT. We observed that regional mutation risk further varies between individual tumors in a manner independent of cell type, identifying three signatures of domain-scale mutagenesis in >4,000 tumor genomes. The major signature reflects remodeling of heterochromatin and of the RT program domains seen across tumors, tissues and cultured cells, and is robustly linked with higher expression of cell proliferation genes. Regional mutagenesis is associated with loss of activity of the tumor-suppressor genes RB1 and TP53, consistent with their roles in cell cycle control, with distinct mutational patterns generated by the two genes. Loss of regional heterogeneity in mutagenesis is associated with deficiencies in various DNA repair pathways. These mutation risk redistribution processes modify the mutation supply towards important genes, diverting the course of somatic evolution.

Asunto(s)

Genes cdc , Neoplasias , Humanos , Heterocromatina , Mutación/genética , Neoplasias/genética , Mutagénesis/genética

17.

Copy number losses of oncogenes and gains of tumor suppressor genes generate common driver mutations.

Besedina, Elizaveta; Supek, Fran.

Nat Commun ; 15(1): 6139, 2024 Jul 20.

Artículo en Inglés | MEDLINE | ID: mdl-39033140

RESUMEN

Cancer driver genes can undergo positive selection for various types of genetic alterations, including gain-of-function or loss-of-function mutations and copy number alterations (CNA). We investigated the landscape of different types of alterations affecting driver genes in 17,644 cancer exomes and genomes. We find that oncogenes may simultaneously exhibit signatures of positive selection and also negative selection in different gene segments, suggesting a method to identify additional tumor types where an oncogene is a driver or a vulnerability. Next, we characterize the landscape of CNA-dependent selection effects, revealing a general trend of increased positive selection on oncogene mutations not only upon CNA gains but also upon CNA deletions. Similarly, we observe a positive interaction between mutations and CNA gains in tumor suppressor genes. Thus, two-hit events involving point mutations and CNA are universally observed regardless of the type of CNA and may signal new therapeutic opportunities. An analysis with focus on the somatic CNA two-hit events can help identify additional driver genes relevant to a tumor type. By a global inference of point mutation and CNA selection signatures and interactions thereof across genes and tissues, we identify 9 evolutionary archetypes of driver genes, representing different mechanisms of (in)activation by genetic alterations.

Asunto(s)

Variaciones en el Número de Copia de ADN , Genes Supresores de Tumor , Neoplasias , Oncogenes , Humanos , Oncogenes/genética , Variaciones en el Número de Copia de ADN/genética , Neoplasias/genética , Mutación , Mutación Puntual , Exoma/genética , Genoma Humano

18.

Genome-scale quantification and prediction of pathogenic stop codon readthrough by small molecules.

Toledano, Ignasi; Supek, Fran; Lehner, Ben.

Nat Genet ; 56(9): 1914-1924, 2024 Sep.

Artículo en Inglés | MEDLINE | ID: mdl-39174735

RESUMEN

Premature termination codons (PTCs) cause ~10-20% of inherited diseases and are a major mechanism of tumor suppressor gene inactivation in cancer. A general strategy to alleviate the effects of PTCs would be to promote translational readthrough. Nonsense suppression by small molecules has proven effective in diverse disease models, but translation into the clinic is hampered by ineffective readthrough of many PTCs. Here we directly tackle the challenge of defining drug efficacy by quantifying the readthrough of ~5,800 human pathogenic stop codons by eight drugs. We find that different drugs promote the readthrough of complementary subsets of PTCs defined by local sequence context. This allows us to build interpretable models that accurately predict drug-induced readthrough genome-wide, and we validate these models by quantifying endogenous stop codon readthrough. Accurate readthrough quantification and prediction will empower clinical trial design and the development of personalized nonsense suppression therapies.

Asunto(s)

Codón sin Sentido , Codón de Terminación , Humanos , Genoma Humano , Biosíntesis de Proteínas/efectos de los fármacos , Bibliotecas de Moléculas Pequeñas/farmacología

19.

Nucleoid-associated proteins affect mutation dynamics in E. coli in a growth phase-specific manner.

Warnecke, Tobias; Supek, Fran; Lehner, Ben.

PLoS Comput Biol ; 8(12): e1002846, 2012.

Artículo en Inglés | MEDLINE | ID: mdl-23284284

RESUMEN

The binding of proteins can shield DNA from mutagenic processes but also interfere with efficient repair. How the presence of DNA-binding proteins shapes intra-genomic differences in mutability and, ultimately, sequence variation in natural populations, however, remains poorly understood. In this study, we examine sequence evolution in Escherichia coli in relation to the binding of four abundant nucleoid-associated proteins: Fis, H-NS, IhfA, and IhfB. We find that, for a subset of mutations, protein occupancy is associated with both increased and decreased mutability in the underlying sequence depending on when the protein is bound during the bacterial growth cycle. On average, protein-bound DNA exhibits reduced mutability compared to protein-free DNA. However, this net protective effect is weak and can be abolished or even reversed during stages of colony growth where binding coincides - and hence likely interferes with - DNA repair activity. We suggest that the four nucleoid-associated proteins analyzed here have played a minor but significant role in patterning extant sequence variation in E. coli.

Asunto(s)

Proteínas de Escherichia coli/fisiología , Escherichia coli/genética , Mutación , Reparación del ADN , Escherichia coli/crecimiento & desarrollo , Proteínas de Escherichia coli/metabolismo , Evolución Molecular , Filogenia , Unión Proteica

20.

Translational selection is ubiquitous in prokaryotes.

Supek, Fran; Skunca, Nives; Repar, Jelena; Vlahovicek, Kristian; Smuc, Tomislav.

PLoS Genet ; 6(6): e1001004, 2010 Jun 24.

Artículo en Inglés | MEDLINE | ID: mdl-20585573

RESUMEN

Codon usage bias in prokaryotic genomes is largely a consequence of background substitution patterns in DNA, but highly expressed genes may show a preference towards codons that enable more efficient and/or accurate translation. We introduce a novel approach based on supervised machine learning that detects effects of translational selection on genes, while controlling for local variation in nucleotide substitution patterns represented as sequence composition of intergenic DNA. A cornerstone of our method is a Random Forest classifier that outperformed previous distance measure-based approaches, such as the codon adaptation index, in the task of discerning the (highly expressed) ribosomal protein genes by their codon frequencies. Unlike previous reports, we show evidence that translational selection in prokaryotes is practically universal: in 460 of 461 examined microbial genomes, we find that a subset of genes shows a higher codon usage similarity to the ribosomal proteins than would be expected from the local sequence composition. These genes constitute a substantial part of the genome--between 5% and 33%, depending on genome size--while also exhibiting higher experimentally measured mRNA abundances and tending toward codons that match tRNA anticodons by canonical base pairing. Certain gene functional categories are generally enriched with, or depleted of codon-optimized genes, the trends of enrichment/depletion being conserved between Archaea and Bacteria. Prominent exceptions from these trends might indicate genes with alternative physiological roles; we speculate on specific examples related to detoxication of oxygen radicals and ammonia and to possible misannotations of asparaginyl-tRNA synthetases. Since the presence of codon optimizations on genes is a valid proxy for expression levels in fully sequenced genomes, we provide an example of an "adaptome" by highlighting gene functions with expression levels elevated specifically in thermophilic Bacteria and Archaea.

Asunto(s)

Células Procariotas/metabolismo , Biosíntesis de Proteínas , Archaea/genética , Bacterias/genética , Codón , Regulación de la Expresión Génica , Genoma , Modelos Genéticos , Proteínas Ribosómicas/genética

RESUMEN

Asunto(s)

RESUMEN

Asunto(s)

RESUMEN

Asunto(s)

Asunto(s)

RESUMEN

Asunto(s)

RESUMEN

Asunto(s)

RESUMEN

Asunto(s)

RESUMEN

Asunto(s)

RESUMEN

Asunto(s)

RESUMEN

Asunto(s)

RESUMEN

Asunto(s)

RESUMEN

Asunto(s)

RESUMEN

Asunto(s)

RESUMEN

Asunto(s)

RESUMEN

Asunto(s)

RESUMEN

Asunto(s)

RESUMEN

Asunto(s)

RESUMEN

Asunto(s)

RESUMEN

Asunto(s)

RESUMEN

Asunto(s)

ENVIAR RESULTADO:

SELECCIÓN DE REFERENCIAS

DETALLE DE LA BÚSQUEDA