Your browser doesn't support javascript.
loading
Mostrar: 20 | 50 | 100
Resultados 1 - 10 de 10
Filtrar
1.
Nucleic Acids Res ; 52(8): 4393-4408, 2024 May 08.
Artigo em Inglês | MEDLINE | ID: mdl-38587182

RESUMO

Local mutation rates in human are highly heterogeneous, with known variability at the scale of megabase-sized chromosomal domains, and, on the other extreme, at the scale of oligonucleotides. The intermediate, kilobase-scale heterogeneity in mutation risk is less well characterized. Here, by analyzing thousands of somatic genomes, we studied mutation risk gradients along gene bodies, representing a genomic scale spanning roughly 1-10 kb, hypothesizing that different mutational mechanisms are differently distributed across gene segments. The main heterogeneity concerns several kilobases at the transcription start site and further downstream into 5' ends of gene bodies; these are commonly hypomutated with several mutational signatures, most prominently the ubiquitous C > T changes at CpG dinucleotides. The width and shape of this mutational coldspot at 5' gene ends is variable across genes, and corresponds to variable interval of lowered DNA methylation depending on gene activity level and regulation. Such hypomutated loci, at 5' gene ends or elsewhere, correspond to DNA hypomethylation that can associate with various landmarks, including intragenic enhancers, Polycomb-marked regions, or chromatin loop anchor points. Tissue-specific DNA hypomethylation begets tissue-specific local hypomutation. Of note, direction of mutation risk is inverted for AID/APOBEC3 cytosine deaminase activity, whose signatures are enriched in hypomethylated regions.


Assuntos
Metilação de DNA , Taxa de Mutação , Humanos , Ilhas de CpG , Heterogeneidade Genética , Genoma Humano , Mutação , Sítio de Iniciação de Transcrição
2.
Clin Sci (Lond) ; 136(5): 383-404, 2022 03 18.
Artigo em Inglês | MEDLINE | ID: mdl-35274136

RESUMO

Genome sequencing can be used to detect DNA repair failures in tumors and learn about underlying mechanisms. Here, we synthesize findings from genomic studies that examined deficiencies of the DNA mismatch repair (MMR) pathway. The impairment of MMR results in genome-wide hypermutation and in the 'microsatellite instability' (MSI) phenotype-occurrence of indel mutations at short tandem repeat (microsatellite) loci. The MSI status of tumors was traditionally assessed by molecular testing of a selected set of MS loci or by measuring MMR protein expression levels. Today, genomic data can provide a more complete picture of the consequences on genomic instability. Multiple computational studies examined somatic mutation distributions that result from failed DNA repair pathways in tumors. These include analyzing the commonly studied trinucleotide mutational spectra of single-nucleotide variants (SNVs), as well as of other features such as indels, structural variants, mutation clusters and regional mutation rate redistribution. The identified mutation patterns can be used to rigorously measure prevalence of MMR failures across cancer types, and potentially to subcategorize the MMR deficiencies. Diverse data sources, genomic and pre-genomic, from human and from experimental models, suggest there are different ways in which MMR can fail, and/or that the cell-type or genetic background may result in different types of MMR mutational patterns. The spectrum of MMR failures may direct cancer evolution, generating particular sets of driver mutations. Moreover, MMR affects outcomes of therapy by DNA damaging drugs, antimetabolites, nonsense-mediated mRNA decay (NMD) inhibitors, and immunotherapy by promoting either resistance or sensitivity, depending on the type of therapy.


Assuntos
Neoplasias Colorretais , Reparo de Erro de Pareamento de DNA , Neoplasias Colorretais/genética , Reparo de Erro de Pareamento de DNA/genética , Reparo do DNA/genética , Genômica , Humanos , Instabilidade de Microssatélites , Mutação/genética
3.
PLoS Comput Biol ; 15(4): e1006953, 2019 04.
Artigo em Inglês | MEDLINE | ID: mdl-30986244

RESUMO

Determining the cancer type and molecular subtype has important clinical implications. The primary site is however unknown for some malignancies discovered in the metastatic stage. Moreover liquid biopsies may be used to screen for tumoral DNA, which upon detection needs to be assigned to a site-of-origin. Classifiers based on genomic features are a promising approach to prioritize the tumor anatomical site, type and subtype. We examined the predictive ability of causal (driver) somatic mutations in this task, comparing it against global patterns of non-selected (passenger) mutations, including features based on regional mutation density (RMD). In the task of distinguishing 18 cancer types, the driver mutations-mutated oncogenes or tumor suppressors, pathways and hotspots-classified 36% of the patients to the correct cancer type. In contrast, the features based on passenger mutations did so at 92% accuracy, with similar contribution from the RMD and the trinucleotide mutation spectra. The RMD and the spectra covered distinct sets of patients with predictions. In particular, introducing the RMD features into a combined classification model increased the fraction of diagnosed patients by 50 percentage points (at 20% FDR). Furthermore, RMD was able to discriminate molecular subtypes and/or anatomical site of six major cancers. The advantage of passenger mutations was upheld under high rates of false negative mutation calls and with exome sequencing, even though overall accuracy decreased. We suggest whole genome sequencing is valuable for classifying tumors because it captures global patterns emanating from mutational processes, which are informative of the underlying tumor biology.


Assuntos
Biologia Computacional/métodos , Neoplasias/classificação , Neoplasias/genética , Algoritmos , DNA de Neoplasias/classificação , DNA de Neoplasias/genética , Exoma/genética , Genômica , Humanos , Aprendizado de Máquina , Mutação/genética , Software , Sequenciamento do Exoma/métodos , Sequenciamento Completo do Genoma/métodos
4.
RNA ; 23(7): 1080-1087, 2017 07.
Artigo em Inglês | MEDLINE | ID: mdl-28386015

RESUMO

The subcellular localization of long noncoding RNAs (lncRNAs) holds valuable clues to their molecular function. However, measuring localization of newly discovered lncRNAs involves time-consuming and costly experimental methods. We have created "lncATLAS," a comprehensive resource of lncRNA localization in human cells based on RNA-sequencing data sets. Altogether, 6768 GENCODE-annotated lncRNAs are represented across various compartments of 15 cell lines. We introduce relative concentration index (RCI) as a useful measure of localization derived from ensemble RNA-seq measurements. LncATLAS is accessible through an intuitive and informative webserver, from which lncRNAs of interest are accessed using identifiers or names. Localization is presented across cell types and organelles, and may be compared to the distribution of all other genes. Publication-quality figures and raw data tables are automatically generated with each query, and the entire data set is also available to download. LncATLAS makes lncRNA subcellular localization data available to the widest possible number of researchers. It is available at lncatlas.crg.eu.


Assuntos
Núcleo Celular/genética , Citoplasma/genética , RNA Longo não Codificante/metabolismo , Biologia Computacional/métodos , Bases de Dados Genéticas , Humanos , Anotação de Sequência Molecular , Análise de Sequência de RNA , Software
5.
medRxiv ; 2023 Dec 18.
Artigo em Inglês | MEDLINE | ID: mdl-38196581

RESUMO

Variation in DNA repair genes can increase cancer risk by elevating the rate of oncogenic mutation. Defects in one such gene, MUTYH, are known to elevate the incidence of colorectal cancer in a recessive Mendelian manner, and some evidence has also linked MUTYH to elevated incidence of other cancers as well as elevated mutation rates in normal somatic and germline cells. Here, we use whole genome sequencing to measure germline de novo mutation rates in a large extended family affected by pathogenic MUTYH variation and a history of colorectal cancer. Although this family's genotype, p.Y179C/V234M (c.536A>G/700G>A on transcript NM_001128425), contains a variant with conflicting functional interpretations, we use an in vitro cell line assay to determine that it partially attenuates MUTYH's function. In the children of mothers affected by the Y179C/V234M genotype, we identify an elevation of the C>A mutation rate that is weaker than mutator effects previously reported to be caused by other pathogenic MUTYH genotypes, suggesting that mutation rates in normal tissues may be useful for classifying cancer-associated variation along a continuum of severity. Surprisingly, we detect no significant elevation of the C>A mutation rate in children born to a father with the same biallelic MUTYH genotype, despite calculating that we should have adequate power to detect such a mutator effect. This suggests that the oxidative stress repaired by MUTYH may contribute more to female reproductive aging than male reproductive aging in the general population.

6.
Nat Genet ; 52(9): 958-968, 2020 09.
Artigo em Inglês | MEDLINE | ID: mdl-32747826

RESUMO

Certain mutagens, including the APOBEC3 (A3) cytosine deaminase enzymes, can create multiple genetic changes in a single event. Activity of A3s results in striking 'mutation showers' occurring near DNA breakpoints; however, less is known about the mechanisms underlying the majority of A3 mutations. We classified the diverse patterns of clustered mutagenesis in tumor genomes, which identified a new A3 pattern: nonrecurrent, diffuse hypermutation (omikli). This mechanism occurs independently of the known focal hypermutation (kataegis), and is associated with activity of the DNA mismatch-repair pathway, which can provide the single-stranded DNA substrate needed by A3, and contributes to a substantial proportion of A3 mutations genome wide. Because mismatch repair is directed towards early-replicating, gene-rich chromosomal domains, A3 mutagenesis has a high propensity to generate impactful mutations, which exceeds that of other common carcinogens such as tobacco smoke and ultraviolet exposure. Cells direct their DNA repair capacity towards more important genomic regions; thus, carcinogens that subvert DNA repair can be remarkably potent.


Assuntos
Citidina Desaminase/genética , Reparo de Erro de Pareamento de DNA/genética , Mutação/genética , Neoplasias/genética , Desaminases APOBEC , Citosina Desaminase/genética , DNA de Cadeia Simples/genética , Genoma/genética , Humanos , Mutagênese/genética
7.
Commun Biol ; 3(1): 56, 2020 02 05.
Artigo em Inglês | MEDLINE | ID: mdl-32024996

RESUMO

Long non-coding RNAs (lncRNAs) are a growing focus of cancer genomics studies, creating the need for a resource of lncRNAs with validated cancer roles. Furthermore, it remains debated whether mutated lncRNAs can drive tumorigenesis, and whether such functions could be conserved during evolution. Here, as part of the ICGC/TCGA Pan-Cancer Analysis of Whole Genomes (PCAWG) Consortium, we introduce the Cancer LncRNA Census (CLC), a compilation of 122 GENCODE lncRNAs with causal roles in cancer phenotypes. In contrast to existing databases, CLC requires strong functional or genetic evidence. CLC genes are enriched amongst driver genes predicted from somatic mutations, and display characteristic genomic features. Strikingly, CLC genes are enriched for driver mutations from unbiased, genome-wide transposon-mutagenesis screens in mice. We identified 10 tumour-causing mutations in orthologues of 8 lncRNAs, including LINC-PINT and NEAT1, but not MALAT1. Thus CLC represents a dataset of high-confidence cancer lncRNAs. Mutagenesis maps are a novel means for identifying deeply-conserved roles of lncRNAs in tumorigenesis.


Assuntos
Transformação Celular Neoplásica/genética , Suscetibilidade a Doenças , Neoplasias/genética , RNA Longo não Codificante , Animais , Biomarcadores Tumorais , Sistemas CRISPR-Cas , Bases de Dados Genéticas , Evolução Molecular , Genoma Humano , Genômica/métodos , Humanos , Polimorfismo de Nucleotídeo Único
8.
Genome Biol ; 20(1): 285, 2019 12 18.
Artigo em Inglês | MEDLINE | ID: mdl-31849330

RESUMO

BACKGROUND: The lifelong accumulation of somatic mutations underlies age-related phenotypes and cancer. Mutagenic forces are thought to shape the genome of aging cells in a tissue-specific way. Whole genome analyses of somatic mutation patterns, based on both types and genomic distribution of variants, can shed light on specific processes active in different human tissues and their effect on the transition to cancer. RESULTS: To analyze somatic mutation patterns, we compile a comprehensive genetic atlas of somatic mutations in healthy human cells. High-confidence variants are obtained from newly generated and publicly available whole genome DNA sequencing data from single non-cancer cells, clonally expanded in vitro. To enable a well-controlled comparison of different cell types, we obtain single genome data (92% mean coverage) from multi-organ biopsies from the same donors. These data show multiple cell types that are protected from mutagens and display a stereotyped mutation profile, despite their origin from different tissues. Conversely, the same tissue harbors cells with distinct mutation profiles associated to different differentiation states. Analyses of mutation rate in the coding and non-coding portions of the genome identify a cell type bearing a unique mutation pattern characterized by mutation enrichment in active chromatin, regulatory, and transcribed regions. CONCLUSIONS: Our analysis of normal cells from healthy donors identifies a somatic mutation landscape that enhances the risk of tumor transformation in a specific cell population from the kidney proximal tubule. This unique pattern is characterized by high rate of mutation accumulation during adult life and specific targeting of expressed genes and regulatory regions.


Assuntos
Mutagênese , Neoplasias/etiologia , Sequenciamento Completo do Genoma , Idoso , Feminino , Humanos
9.
Eur J Hum Genet ; 26(12): 1859-1866, 2018 12.
Artigo em Inglês | MEDLINE | ID: mdl-30089826

RESUMO

Basques show specific cultural, demographic, and genetic characteristics that have placed them as an isolated and unique population within Europe, such as their non-Indo-European language, Euskara. They have historically lived along the Western Pyrenees, between Spain and France, in one of the most important European glacial refugia during the Last Glacial Maximum. The most striking genetic characteristic is their highest frequency of the RhD blood group negative allele, a variant related to the hemolytic disease of the newborn. Both demographic and adaptive processes have been suggested as possible causes of the high frequency of RhD negative in Basques, but neither hypothesis has been clearly demonstrated. While previous studies on the Rh system in Basques have been mostly focused on serological and genotyping diversity, in this work we analyze genotyping and next generation sequencing data in order to provide a general framework of the genetic scenario of the system in Basques. In particular, we genotyped the most relevant variants of the system (D/d, E/e, and C/c), and sequenced three ~6 kb flanking regions surrounding the Rh genes in Basques and also in other populations for comparison. Our results are in agreement with previous studies, with Basques presenting the highest frequency of the RHD deletion (47.2%). Haplotype analyses of D/d, E/e, and C/c variants confirmed an association between the RhC allele, previously suggested to be under positive selection, and the RhD positive variant in non-sub-Saharan populations, including Basques. We also found extreme differentiation for the C/c variant when comparing sub-Saharan to non-sub-Saharan populations.


Assuntos
Polimorfismo Genético , Sistema do Grupo Sanguíneo Rh-Hr/genética , Feminino , Frequência do Gene , Haplótipos , Humanos , Masculino , Espanha
SELEÇÃO DE REFERÊNCIAS
DETALHE DA PESQUISA