Your browser doesn't support javascript.
loading
Show: 20 | 50 | 100
Results 1 - 20 de 20
Filter
Add more filters











Publication year range
2.
J Mol Biol ; 434(11): 167578, 2022 06 15.
Article in English | MEDLINE | ID: mdl-35398145

ABSTRACT

Expression of mRNA is often regulated by the binding of a small RNA (miRNA, snoRNA, siRNA). While the pairing contribution to the net free energy is well parameterized and can be computed in O(N) time, the cost of removing pre-existing mRNA secondary structure has not received sufficient attention. Conventional methods for computing the unfolding free energy of a target mRNA are costly, scaling like the cube of the number of target bases O(N3). Here we introduce a model to describe the unfolding costs of the binding site, which features surprisingly big differences in the free energy parameters for the four bases. The model is implemented in our O(N) algorithm, BindOligoNet. Donor splice site prediction is more accurate when using our calculation of spliceosomal U1-snRNA to mRNA net binding free energy. Our base-dependent free energies also correlate with efficient ribosome docking near the start codon.


Subject(s)
Peptide Chain Initiation, Translational , RNA Splicing , RNA, Messenger , Algorithms , Binding Sites , Nucleic Acid Conformation , Nucleotides , RNA, Messenger/biosynthesis , RNA, Messenger/chemistry , RNA, Small Nuclear/chemistry , Spliceosomes/chemistry , Thermodynamics
3.
Nature ; 578(7793): 102-111, 2020 02.
Article in English | MEDLINE | ID: mdl-32025015

ABSTRACT

The discovery of drivers of cancer has traditionally focused on protein-coding genes1-4. Here we present analyses of driver point mutations and structural variants in non-coding regions across 2,658 genomes from the Pan-Cancer Analysis of Whole Genomes (PCAWG) Consortium5 of the International Cancer Genome Consortium (ICGC) and The Cancer Genome Atlas (TCGA). For point mutations, we developed a statistically rigorous strategy for combining significance levels from multiple methods of driver discovery that overcomes the limitations of individual methods. For structural variants, we present two methods of driver discovery, and identify regions that are significantly affected by recurrent breakpoints and recurrent somatic juxtapositions. Our analyses confirm previously reported drivers6,7, raise doubts about others and identify novel candidates, including point mutations in the 5' region of TP53, in the 3' untranslated regions of NFKBIZ and TOB1, focal deletions in BRD4 and rearrangements in the loci of AKR1C genes. We show that although point mutations and structural variants that drive cancer are less frequent in non-coding genes and regulatory sequences than in protein-coding genes, additional examples of these drivers will be found as more cancer genomes become available.


Subject(s)
Genome, Human/genetics , Mutation/genetics , Neoplasms/genetics , DNA Breaks , Databases, Genetic , Gene Expression Regulation, Neoplastic , Genome-Wide Association Study , Humans , INDEL Mutation
4.
Cancer Cell ; 36(3): 288-301.e14, 2019 09 16.
Article in English | MEDLINE | ID: mdl-31526759

ABSTRACT

Current statistical models for assessing hotspot significance do not properly account for variation in site-specific mutability, thereby yielding many false-positives. We thus (i) detail a Log-normal-Poisson (LNP) background model that accounts for this variability in a manner consistent with models of mutagenesis; (ii) use it to show that passenger hotspots arise from all common mutational processes; and (iii) apply it to a ∼10,000-patient cohort to nominate driver hotspots with far fewer false-positives compared with conventional methods. Overall, we show that many cancer hotspot mutations recurring at the same genomic site across multiple tumors are actually passenger events, recurring at inherently mutable genomic sites under no positive selection.


Subject(s)
Carcinogenesis/genetics , Genomics/methods , Models, Genetic , Mutagenesis , Neoplasms/genetics , DNA Mutational Analysis , Datasets as Topic , Genes, Tumor Suppressor , Humans , Poisson Distribution , ROC Curve , Selection, Genetic , Exome Sequencing
5.
Science ; 364(6444)2019 Jun 07.
Article in English | MEDLINE | ID: mdl-31171663

ABSTRACT

How somatic mutations accumulate in normal cells is poorly understood. A comprehensive analysis of RNA sequencing data from ~6700 samples across 29 normal tissues revealed multiple somatic variants, demonstrating that macroscopic clones can be found in many normal tissues. We found that sun-exposed skin, esophagus, and lung have a higher mutation burden than other tested tissues, which suggests that environmental factors can promote somatic mosaicism. Mutation burden was associated with both age and tissue-specific cell proliferation rate, highlighting that mutations accumulate over both time and number of cell divisions. Finally, normal tissues were found to harbor mutations in known cancer genes and hotspots. This study provides a broad view of macroscopic clonal expansion in human tissues, thus serving as a foundation for associating clonal expansion with environmental factors, aging, and risk of disease.


Subject(s)
DNA Mutational Analysis/methods , Neoplasms/genetics , Sequence Analysis, RNA/methods , Clone Cells , Female , Humans , Male , Organ Specificity/genetics
6.
Nature ; 569(7757): 503-508, 2019 05.
Article in English | MEDLINE | ID: mdl-31068700

ABSTRACT

Large panels of comprehensively characterized human cancer models, including the Cancer Cell Line Encyclopedia (CCLE), have provided a rigorous framework with which to study genetic variants, candidate targets, and small-molecule and biological therapeutics and to identify new marker-driven cancer dependencies. To improve our understanding of the molecular features that contribute to cancer phenotypes, including drug responses, here we have expanded the characterizations of cancer cell lines to include genetic, RNA splicing, DNA methylation, histone H3 modification, microRNA expression and reverse-phase protein array data for 1,072 cell lines from individuals of various lineages and ethnicities. Integration of these data with functional characterizations such as drug-sensitivity, short hairpin RNA knockdown and CRISPR-Cas9 knockout data reveals potential targets for cancer drugs and associated biomarkers. Together, this dataset and an accompanying public data portal provide a resource for the acceleration of cancer research using model cancer cell lines.


Subject(s)
Cell Line, Tumor , Neoplasms/genetics , Neoplasms/pathology , Antineoplastic Agents/pharmacology , Biomarkers, Tumor , DNA Methylation , Drug Resistance, Neoplasm , Ethnicity/genetics , Gene Editing , Histones/metabolism , Humans , MicroRNAs/genetics , Molecular Targeted Therapy , Neoplasms/metabolism , Protein Array Analysis , RNA Splicing
8.
Cancer Cell ; 34(2): 242-255.e5, 2018 08 13.
Article in English | MEDLINE | ID: mdl-30107175

ABSTRACT

Hürthle cell carcinoma of the thyroid (HCC) is a form of thyroid cancer recalcitrant to radioiodine therapy that exhibits an accumulation of mitochondria. We performed whole-exome sequencing on a cohort of primary, recurrent, and metastatic tumors, and identified recurrent mutations in DAXX, TP53, NRAS, NF1, CDKN1A, ARHGAP35, and the TERT promoter. Parallel analysis of mtDNA revealed recurrent homoplasmic mutations in subunits of complex I of the electron transport chain. Analysis of DNA copy-number alterations uncovered widespread loss of chromosomes culminating in near-haploid chromosomal content in a large fraction of HCC, which was maintained during metastatic spread. This work uncovers a distinct molecular origin of HCC compared with other thyroid malignancies.


Subject(s)
Chromosome Aberrations , DNA, Mitochondrial/genetics , Mutation , Thyroid Neoplasms/genetics , DNA Copy Number Variations , Haploidy , Humans , Neoplasm Metastasis , Telomerase/genetics , Thyroid Neoplasms/pathology , Exome Sequencing
10.
Nat Med ; 24(8): 1292, 2018 Aug.
Article in English | MEDLINE | ID: mdl-29955181

ABSTRACT

In the version of this article originally published, some text above the "Tri-nucleotide sequence motifs" label in Fig. 2a appeared incorrectly. The text was garbled and should have appeared as nucleotide codes.Additionally, the labels on the bars in Fig. 2c were not italicized in the original publication. These are gene symbols, and they should have been italicized.The colored labels above the graphs in Fig. 4b were also erroneously not italicized. These labels represent gene names and loci, and they should have been italicized.

11.
Nat Med ; 24(5): 679-690, 2018 05.
Article in English | MEDLINE | ID: mdl-29713087

ABSTRACT

Diffuse large B cell lymphoma (DLBCL), the most common lymphoid malignancy in adults, is a clinically and genetically heterogeneous disease that is further classified into transcriptionally defined activated B cell (ABC) and germinal center B cell (GCB) subtypes. We carried out a comprehensive genetic analysis of 304 primary DLBCLs and identified low-frequency alterations, captured recurrent mutations, somatic copy number alterations, and structural variants, and defined coordinate signatures in patients with available outcome data. We integrated these genetic drivers using consensus clustering and identified five robust DLBCL subsets, including a previously unrecognized group of low-risk ABC-DLBCLs of extrafollicular/marginal zone origin; two distinct subsets of GCB-DLBCLs with different outcomes and targetable alterations; and an ABC/GCB-independent group with biallelic inactivation of TP53, CDKN2A loss, and associated genomic instability. The genetic features of the newly characterized subsets, their mutational signatures, and the temporal ordering of identified alterations provide new insights into DLBCL pathogenesis. The coordinate genetic signatures also predict outcome independent of the clinical International Prognostic Index and suggest new combination treatment strategies. More broadly, our results provide a roadmap for an actionable DLBCL classification.


Subject(s)
Lymphoma, Large B-Cell, Diffuse/genetics , Lymphoma, Large B-Cell, Diffuse/pathology , DNA Copy Number Variations/genetics , Gene Rearrangement/genetics , Genes, Neoplasm , Genetic Heterogeneity , Humans , Mutation/genetics , Mutation Rate , Treatment Outcome
12.
Cell ; 173(2): 371-385.e18, 2018 04 05.
Article in English | MEDLINE | ID: mdl-29625053

ABSTRACT

Identifying molecular cancer drivers is critical for precision oncology. Multiple advanced algorithms to identify drivers now exist, but systematic attempts to combine and optimize them on large datasets are few. We report a PanCancer and PanSoftware analysis spanning 9,423 tumor exomes (comprising all 33 of The Cancer Genome Atlas projects) and using 26 computational tools to catalog driver genes and mutations. We identify 299 driver genes with implications regarding their anatomical sites and cancer/cell types. Sequence- and structure-based analyses identified >3,400 putative missense driver mutations supported by multiple lines of evidence. Experimental validation confirmed 60%-85% of predicted mutations as likely drivers. We found that >300 MSI tumors are associated with high PD-1/PD-L1, and 57% of tumors analyzed harbor putative clinically actionable events. Our study represents the most comprehensive discovery of cancer genes and mutations to date and will serve as a blueprint for future biological and clinical endeavors.


Subject(s)
Neoplasms/pathology , Algorithms , B7-H1 Antigen/genetics , Computational Biology , Databases, Genetic , Entropy , Humans , Microsatellite Instability , Mutation , Neoplasms/genetics , Neoplasms/immunology , Principal Component Analysis , Programmed Cell Death 1 Receptor/genetics
14.
Nat Biotechnol ; 35(10): 951-959, 2017 Oct.
Article in English | MEDLINE | ID: mdl-28892075

ABSTRACT

Microsatellites (MSs) are tracts of variable-length repeats of short DNA motifs that exhibit high rates of mutation in the form of insertions or deletions (indels) of the repeated motif. Despite their prevalence, the contribution of somatic MS indels to cancer has been largely unexplored, owing to difficulties in detecting them in short-read sequencing data. Here we present two tools: MSMuTect, for accurate detection of somatic MS indels, and MSMutSig, for identification of genes containing MS indels at a higher frequency than expected by chance. Applying MSMuTect to whole-exome data from 6,747 human tumors representing 20 tumor types, we identified >1,000 previously undescribed MS indels in cancer genes. Additionally, we demonstrate that the number and pattern of MS indels can accurately distinguish microsatellite-stable tumors from tumors with microsatellite instability, thus potentially improving classification of clinically relevant subgroups. Finally, we identified seven MS indel driver hotspots: four in known cancer genes (ACVR2A, RNF43, JAK1, and MSH3) and three in genes not previously implicated as cancer drivers (ESRP1, PRDM2, and DOCK3).


Subject(s)
INDEL Mutation/genetics , Microsatellite Repeats/genetics , Neoplasms/genetics , Exome/genetics , Genes, Neoplasm , High-Throughput Nucleotide Sequencing , Humans , Microsatellite Instability , Mutation/genetics , RNA, Messenger/genetics , RNA, Messenger/metabolism , RNA-Binding Proteins/genetics , RNA-Binding Proteins/metabolism
15.
Cancer Cell ; 32(2): 204-220.e15, 2017 08 14.
Article in English | MEDLINE | ID: mdl-28810145

ABSTRACT

Comprehensive multiplatform analysis of 80 uveal melanomas (UM) identifies four molecularly distinct, clinically relevant subtypes: two associated with poor-prognosis monosomy 3 (M3) and two with better-prognosis disomy 3 (D3). We show that BAP1 loss follows M3 occurrence and correlates with a global DNA methylation state that is distinct from D3-UM. Poor-prognosis M3-UM divide into subsets with divergent genomic aberrations, transcriptional features, and clinical outcomes. We report change-of-function SRSF2 mutations. Within D3-UM, EIF1AX- and SRSF2/SF3B1-mutant tumors have distinct somatic copy number alterations and DNA methylation profiles, providing insight into the biology of these low- versus intermediate-risk clinical mutation subtypes.


Subject(s)
Biomarkers, Tumor/genetics , DNA Methylation , Gene Expression Regulation, Neoplastic , Melanoma/genetics , Mutation , Uveal Neoplasms/genetics , DNA Copy Number Variations , Eukaryotic Initiation Factor-1/genetics , Humans , Melanoma/classification , Monosomy , Phosphoproteins/genetics , Prognosis , RNA Splicing Factors/genetics , Serine-Arginine Splicing Factors/genetics , Tumor Suppressor Proteins/genetics , Ubiquitin Thiolesterase/genetics , Uveal Neoplasms/classification
17.
Cell Rep ; 18(11): 2780-2794, 2017 03 14.
Article in English | MEDLINE | ID: mdl-28297679

ABSTRACT

Cholangiocarcinoma (CCA) is an aggressive malignancy of the bile ducts, with poor prognosis and limited treatment options. Here, we describe the integrated analysis of somatic mutations, RNA expression, copy number, and DNA methylation by The Cancer Genome Atlas of a set of predominantly intrahepatic CCA cases and propose a molecular classification scheme. We identified an IDH mutant-enriched subtype with distinct molecular features including low expression of chromatin modifiers, elevated expression of mitochondrial genes, and increased mitochondrial DNA copy number. Leveraging the multi-platform data, we observed that ARID1A exhibited DNA hypermethylation and decreased expression in the IDH mutant subtype. More broadly, we found that IDH mutations are associated with an expanded histological spectrum of liver tumors with molecular features that stratify with CCA. Our studies reveal insights into the molecular pathogenesis and heterogeneity of cholangiocarcinoma and provide classification information of potential therapeutic significance.


Subject(s)
Bile Duct Neoplasms/genetics , Cholangiocarcinoma/genetics , Genomics/methods , Isocitrate Dehydrogenase/genetics , Mutation/genetics , Adult , Aged , Aged, 80 and over , Bile Duct Neoplasms/enzymology , Cholangiocarcinoma/enzymology , Chromatin/metabolism , DNA Methylation/genetics , DNA-Binding Proteins , Female , Gene Expression Regulation, Neoplastic , Humans , Liver/pathology , Liver Neoplasms/genetics , Liver Neoplasms/pathology , Male , Middle Aged , Mitochondria/metabolism , Nuclear Proteins/genetics , Pancreatic Neoplasms/genetics , Pancreatic Neoplasms/pathology , Promoter Regions, Genetic/genetics , RNA, Long Noncoding/genetics , RNA, Long Noncoding/metabolism , RNA, Messenger/genetics , RNA, Messenger/metabolism , Transcription Factors/genetics
18.
Nat Genet ; 49(1): 10-16, 2017 01.
Article in English | MEDLINE | ID: mdl-27869828

ABSTRACT

There is a striking and unexplained male predominance across many cancer types. A subset of X-chromosome genes can escape X-inactivation, which would protect females from complete functional loss by a single mutation. To identify putative 'escape from X-inactivation tumor-suppressor' (EXITS) genes, we examined somatic alterations from >4,100 cancers across 21 tumor types for sex bias. Six of 783 non-pseudoautosomal region (PAR) X-chromosome genes (ATRX, CNKSR2, DDX3X, KDM5C, KDM6A, and MAGEC3) harbored loss-of-function mutations more frequently in males (based on a false discovery rate < 0.1), in comparison to zero of 18,055 autosomal and PAR genes (Fisher's exact P < 0.0001). Male-biased mutations in genes that escape X-inactivation were observed in combined analysis across many cancers and in several individual tumor types, suggesting a generalized phenomenon. We conclude that biallelic expression of EXITS genes in females explains a portion of the reduced cancer incidence in females as compared to males across a variety of tumor types.


Subject(s)
Chromosomes, Human, X/genetics , Genes, Tumor Suppressor , Genes, X-Linked/genetics , Mutation/genetics , Neoplasms/genetics , Sexism/statistics & numerical data , X Chromosome Inactivation/genetics , Female , Humans , Male
19.
Cell ; 164(3): 538-49, 2016 Jan 28.
Article in English | MEDLINE | ID: mdl-26806129

ABSTRACT

Mutational processes constantly shape the somatic genome, leading to immunity, aging, cancer, and other diseases. When cancer is the outcome, we are afforded a glimpse into these processes by the clonal expansion of the malignant cell. Here, we characterize a less explored layer of the mutational landscape of cancer: mutational asymmetries between the two DNA strands. Analyzing whole-genome sequences of 590 tumors from 14 different cancer types, we reveal widespread asymmetries across mutagenic processes, with transcriptional ("T-class") asymmetry dominating UV-, smoking-, and liver-cancer-associated mutations and replicative ("R-class") asymmetry dominating POLE-, APOBEC-, and MSI-associated mutations. We report a striking phenomenon of transcription-coupled damage (TCD) on the non-transcribed DNA strand and provide evidence that APOBEC mutagenesis occurs on the lagging-strand template during DNA replication. As more genomes are sequenced, studying and classifying their asymmetries will illuminate the underlying biological mechanisms of DNA damage and repair.


Subject(s)
DNA Damage , DNA Mutational Analysis , DNA Repair , Neoplasms/genetics , DNA Replication , Genome, Human , Genome-Wide Association Study , Humans , Mutation , Neoplasms/pathology , Transcription, Genetic
20.
Nature ; 526(7574): 525-30, 2015 Oct 22.
Article in English | MEDLINE | ID: mdl-26466571

ABSTRACT

Which genetic alterations drive tumorigenesis and how they evolve over the course of disease and therapy are central questions in cancer biology. Here we identify 44 recurrently mutated genes and 11 recurrent somatic copy number variations through whole-exome sequencing of 538 chronic lymphocytic leukaemia (CLL) and matched germline DNA samples, 278 of which were collected in a prospective clinical trial. These include previously unrecognized putative cancer drivers (RPS15, IKZF3), and collectively identify RNA processing and export, MYC activity, and MAPK signalling as central pathways involved in CLL. Clonality analysis of this large data set further enabled reconstruction of temporal relationships between driver events. Direct comparison between matched pre-treatment and relapse samples from 59 patients demonstrated highly frequent clonal evolution. Thus, large sequencing data sets of clinically informative samples enable the discovery of novel genes associated with cancer, the network of relationships between the driver events, and their impact on disease relapse and clinical outcome.


Subject(s)
Disease Progression , Evolution, Molecular , Leukemia, Lymphocytic, Chronic, B-Cell/genetics , Mutation/genetics , Neoplasm Recurrence, Local/genetics , Cell Transformation, Neoplastic/genetics , Clone Cells/metabolism , Clone Cells/pathology , DNA Copy Number Variations/genetics , Exome/genetics , Genes, myc/genetics , Humans , Ikaros Transcription Factor/genetics , Leukemia, Lymphocytic, Chronic, B-Cell/diagnosis , Leukemia, Lymphocytic, Chronic, B-Cell/pathology , Leukemia, Lymphocytic, Chronic, B-Cell/therapy , MAP Kinase Signaling System/genetics , Prognosis , RNA Processing, Post-Transcriptional/genetics , RNA Transport/genetics , Ribosomal Proteins/genetics , Treatment Outcome
SELECTION OF CITATIONS
SEARCH DETAIL