ABSTRACT
PTEN dysfunction plays a crucial role in the pathogenesis of hereditary and sporadic cancers. Here, we show that PTEN homodimerizes and, in this active conformation, exerts lipid phosphatase activity on PtdIns(3,4,5)P3. We demonstrate that catalytically inactive cancer-associated PTEN mutants heterodimerize with wild-type PTEN and constrain its phosphatase activity in a dominant-negative manner. To study the consequences of homo- and heterodimerization of wild-type and mutant PTEN in vivo, we generated Pten knockin mice harboring two cancer-associated PTEN mutations (PtenC124S and PtenG129E). Heterozygous Pten(C124S/+) and Pten(G129E/+) cells and tissues exhibit increased sensitivity to PI3-K/Akt activation compared to wild-type and Pten(+/-) counterparts, whereas this difference is no longer apparent between Pten(C124S/-) and Pten(-/-) cells. Notably, Pten KI mice are more tumor prone and display features reminiscent of complete Pten loss. Our findings reveal that PTEN loss and PTEN mutations are not synonymous and define a working model for the function and regulation of PTEN.
Subject(s)
PTEN Phosphohydrolase/genetics , PTEN Phosphohydrolase/metabolism , Signal Transduction , Animals , Embryo, Mammalian/cytology , Female , Humans , Loss of Heterozygosity , Male , Mice , Mutation , Protein Multimerization , Proto-Oncogene Proteins c-akt/metabolismABSTRACT
Analysis of de novo mutations (DNMs) from sequencing data of nuclear families has identified risk genes for many complex diseases, including multiple neurodevelopmental and psychiatric disorders. Most of these efforts have focused on mutations in protein-coding sequences. Evidence from genome-wide association studies (GWASs) strongly suggests that variants important to human diseases often lie in non-coding regions. Extending DNM-based approaches to non-coding sequences is challenging, however, because the functional significance of non-coding mutations is difficult to predict. We propose a statistical framework for analyzing DNMs from whole-genome sequencing (WGS) data. This method, TADA-Annotations (TADA-A), is a major advance of the TADA method we developed earlier for DNM analysis in coding regions. TADA-A is able to incorporate many functional annotations such as conservation and enhancer marks, to learn from data which annotations are informative of pathogenic mutations, and to combine both coding and non-coding mutations at the gene level to detect risk genes. It also supports meta-analysis of multiple DNM studies, while adjusting for study-specific technical effects. We applied TADA-A to WGS data of â¼300 autism-affected family trios across five studies and discovered several autism risk genes. The software is freely available for all research uses.
Subject(s)
Chromosome Mapping , Genetic Predisposition to Disease , Mutation/genetics , Statistics as Topic , Whole Genome Sequencing , Autistic Disorder/genetics , Calibration , Enhancer Elements, Genetic/genetics , Humans , Molecular Sequence Annotation , Mutation Rate , RNA Splicing/genetics , Risk Factors , Exome SequencingABSTRACT
Analysis of 4,405 variants in 89,050 European subjects from 41 case-control studies identified three independent association signals for estrogen-receptor-positive tumors at 11q13. The strongest signal maps to a transcriptional enhancer element in which the G allele of the best candidate causative variant rs554219 increases risk of breast cancer, reduces both binding of ELK4 transcription factor and luciferase activity in reporter assays, and may be associated with low cyclin D1 protein levels in tumors. Another candidate variant, rs78540526, lies in the same enhancer element. Risk association signal 2, rs75915166, creates a GATA3 binding site within a silencer element. Chromatin conformation studies demonstrate that these enhancer and silencer elements interact with each other and with their likely target gene, CCND1.
Subject(s)
Breast Neoplasms/genetics , Chromosomes, Human, Pair 11/genetics , Cyclin D1/genetics , Enhancer Elements, Genetic/genetics , Polymorphism, Single Nucleotide/genetics , Binding Sites , Case-Control Studies , Cell Line, Tumor , Chromatin/chemistry , Chromatin/genetics , Chromatin Immunoprecipitation , Cyclin D1/metabolism , Electrophoretic Mobility Shift Assay , Female , GATA3 Transcription Factor/antagonists & inhibitors , GATA3 Transcription Factor/genetics , GATA3 Transcription Factor/metabolism , Gene Expression Regulation, Neoplastic , Humans , Luciferases/metabolism , Promoter Regions, Genetic/genetics , RNA, Messenger/genetics , RNA, Small Interfering/genetics , Real-Time Polymerase Chain Reaction , Reverse Transcriptase Polymerase Chain Reaction , Silencer Elements, Transcriptional/genetics , ets-Domain Protein Elk-4/antagonists & inhibitors , ets-Domain Protein Elk-4/genetics , ets-Domain Protein Elk-4/metabolismABSTRACT
A major goal in translational cancer research is to identify biological signatures driving cancer progression and metastasis. A common technique applied in genomics research is to cluster patients using gene expression data from a candidate prognostic gene set, and if the resulting clusters show statistically significant outcome stratification, to associate the gene set with prognosis, suggesting its biological and clinical importance. Recent work has questioned the validity of this approach by showing in several breast cancer data sets that "random" gene sets tend to cluster patients into prognostically variable subgroups. This work suggests that new rigorous statistical methods are needed to identify biologically informative prognostic gene sets. To address this problem, we developed Significance Analysis of Prognostic Signatures (SAPS) which integrates standard prognostic tests with a new prognostic significance test based on stratifying patients into prognostic subtypes with random gene sets. SAPS ensures that a significant gene set is not only able to stratify patients into prognostically variable groups, but is also enriched for genes showing strong univariate associations with patient prognosis, and performs significantly better than random gene sets. We use SAPS to perform a large meta-analysis (the largest completed to date) of prognostic pathways in breast and ovarian cancer and their molecular subtypes. Our analyses show that only a small subset of the gene sets found statistically significant using standard measures achieve significance by SAPS. We identify new prognostic signatures in breast and ovarian cancer and their corresponding molecular subtypes, and we show that prognostic signatures in ER negative breast cancer are more similar to prognostic signatures in ovarian cancer than to prognostic signatures in ER positive breast cancer. SAPS is a powerful new method for deriving robust prognostic biological signatures from clinically annotated genomic datasets.
Subject(s)
Breast Neoplasms/pathology , Ovarian Neoplasms/pathology , Disease Progression , Female , Humans , Neoplasm Metastasis , PrognosisABSTRACT
INTRODUCTION: Estrogen receptor (ER) and progesterone receptor (PR) testing are performed in the evaluation of breast cancer. While the clinical utility of ER as a predictive biomarker to identify patients likely to benefit from hormonal therapy is well-established, the added value of PR is less well-defined. The primary goals of our study were to assess the distribution, inter-assay reproducibility, and prognostic significance of breast cancer subtypes defined by patterns of ER and PR expression. METHODS: We integrated gene expression microarray (GEM) and clinico-pathologic data from 20 published studies to determine the frequency (n = 4,111) and inter-assay reproducibility (n = 1,752) of ER/PR subtypes (ER+/PR+, ER+/PR-, ER-/PR-, ER-/PR+). To extend our findings, we utilized a cohort of patients from the Nurses' Health Study (NHS) with ER/PR data recorded in the medical record and assessed on tissue microarrays (n = 2,011). In both datasets, we assessed the association of ER and PR expression with survival. RESULTS: In a genome-wide analysis, progesterone receptor was among the least variable genes in ER- breast cancer. The ER-/PR+ subtype was rare (approximately 1 to 4%) and showed no significant reproducibility (Kappa = 0.02 and 0.06, in the GEM and NHS datasets, respectively). The vast majority of patients classified as ER-/PR+ in the medical record (97% and 94%, in the GEM and NHS datasets) were re-classified by a second method. In the GEM dataset (n = 2,731), progesterone receptor mRNA expression was associated with prognosis in ER+ breast cancer (adjusted P <0.001), but not in ER- breast cancer (adjusted P = 0.21). PR protein expression did not contribute significant prognostic information to multivariate models considering ER and other standard clinico-pathologic features in the GEM or NHS datasets. CONCLUSION: ER-/PR+ breast cancer is not a reproducible subtype. PR expression is not associated with prognosis in ER- breast cancer, and PR does not contribute significant independent prognostic information to multivariate models considering ER and other standard clinico-pathologic factors. Given that PR provides no clinically actionable information in ER+ breast cancer, these findings question the utility of routine PR testing in breast cancer.
Subject(s)
Breast Neoplasms/diagnosis , Breast Neoplasms/genetics , Receptors, Estrogen/genetics , Receptors, Progesterone/genetics , Adult , Breast Neoplasms/mortality , Female , Gene Expression Profiling , Gene Expression Regulation, Neoplastic , Humans , Middle Aged , Prognosis , RNA, Messenger/genetics , Receptors, Estrogen/metabolism , Receptors, Progesterone/metabolism , Reproducibility of ResultsABSTRACT
Mendelian randomization (MR) is a valuable tool for detecting causal effects by using genetic variant associations. Opportunities to apply MR are growing rapidly with the increasing number of genome-wide association studies (GWAS). However, existing MR methods rely on strong assumptions that are often violated, leading to false positives. Correlated horizontal pleiotropy, which arises when variants affect both traits through a heritable shared factor, remains a particularly challenging problem. We propose a new MR method, Causal Analysis Using Summary Effect estimates (CAUSE), that accounts for correlated and uncorrelated horizontal pleiotropic effects. We demonstrate, in simulations, that CAUSE avoids more false positives induced by correlated horizontal pleiotropy than other methods. Applied to traits studied in recent GWAS studies, we find that CAUSE detects causal relationships that have strong literature support and avoids identifying most unlikely relationships. Our results suggest that shared heritable factors are common and may lead to many false positives using alternative methods.
Subject(s)
Genetic Pleiotropy , Mendelian Randomization Analysis/methods , Causality , Computer Simulation , Disease , False Positive Reactions , Genome , Models, Statistical , Risk FactorsABSTRACT
An amendment to this paper has been published and can be accessed via a link at the top of the paper.
ABSTRACT
While a genetic component of preterm birth (PTB) has long been recognized and recently mapped by genome-wide association studies (GWASs), the molecular determinants underlying PTB remain elusive. This stems in part from an incomplete availability of functional genomic annotations in human cell types relevant to pregnancy and PTB. We generated transcriptome (RNA-seq), epigenome (ChIP-seq of H3K27ac, H3K4me1, and H3K4me3 histone modifications), open chromatin (ATAC-seq), and chromatin interaction (promoter capture Hi-C) annotations of cultured primary decidua-derived mesenchymal stromal/stem cells and in vitro differentiated decidual stromal cells and developed a computational framework to integrate these functional annotations with results from a GWAS of gestational duration in 56,384 women. Using these resources, we uncovered additional loci associated with gestational duration and target genes of associated loci. Our strategy illustrates how functional annotations in pregnancy-relevant cell types aid in the experimental follow-up of GWAS for PTB and, likely, other pregnancy-related conditions.
Subject(s)
Premature Birth , Transcriptome , Chromatin/genetics , Chromatin/metabolism , Decidua , Female , Genome-Wide Association Study , Humans , Infant, Newborn , Male , Pregnancy , Premature Birth/genetics , Premature Birth/metabolism , Stromal CellsABSTRACT
Identifying driver genes from somatic mutations is a central problem in cancer biology. Existing methods, however, either lack explicit statistical models, or use models based on simplistic assumptions. Here, we present driverMAPS (Model-based Analysis of Positive Selection), a model-based approach to driver gene identification. This method explicitly models positive selection at the single-base level, as well as highly heterogeneous background mutational processes. In particular, the selection model captures elevated mutation rates in functionally important sites using multiple external annotations, and spatial clustering of mutations. Simulations under realistic evolutionary models demonstrate the increased power of driverMAPS over current approaches. Applying driverMAPS to TCGA data of 20 tumor types, we identified 159 new potential driver genes, including the mRNA methyltransferase METTL3-METTL14. We experimentally validated METTL3 as a tumor suppressor gene in bladder cancer, providing support to the important role mRNA modification plays in tumorigenesis.
Subject(s)
Neoplasms/genetics , Oncogenes , Carcinogenesis , Humans , Methyltransferases/genetics , Models, Genetic , Mutation , Mutation RateABSTRACT
Sampling of formalin-fixed paraffin-embedded (FFPE) tissue blocks is a critical initial step in molecular pathology. Image-guided coring (IGC) is a new method for using digital pathology images to guide tissue block coring for molecular analyses. The goal of our study is to evaluate the use of IGC for both tissue-based and nucleic acid-based projects in molecular pathology. First, we used IGC to construct a tissue microarray (TMA); second, we used IGC for FFPE block sampling followed by RNA extraction; and third, we assessed the correlation between nuclear counts quantitated from the IGC images and RNA yields. We used IGC to construct a TMA containing 198 normal and breast cancer cores. Histopathologic analysis showed high accuracy for obtaining tumor and normal breast tissue. Next, we used IGC to obtain normal and tumor breast samples before RNA extraction. We selected a random subset of tumor and normal samples to perform computational image analysis to quantify nuclear density, and we built regression models to estimate RNA yields from nuclear count, age of the block, and core diameter. Number of nuclei and core diameter were the strongest predictors of RNA yields in both normal and tumor tissue. IGC is an effective method for sampling FFPE tissue blocks for TMA construction and nucleic acid extraction. We identify significant associations between quantitative nuclear counts obtained from IGC images and RNA yields, suggesting that the integration of computational image analysis with IGC may be an effective approach for tumor sampling in large-scale molecular studies.
Subject(s)
Pathology, Molecular , Humans , Paraffin EmbeddingABSTRACT
BACKGROUND: Epithelial-stromal crosstalk plays a critical role in invasive breast cancer pathogenesis; however, little is known on a systems level about how epithelial-stromal interactions evolve during carcinogenesis. RESULTS: We develop a framework for building genome-wide epithelial-stromal co-expression networks composed of pairwise co-expression relationships between mRNA levels of genes expressed in the epithelium and stroma across a population of patients. We apply this method to laser capture micro-dissection expression profiling datasets in the setting of breast carcinogenesis. Our analysis shows that epithelial-stromal co-expression networks undergo extensive rewiring during carcinogenesis, with the emergence of distinct network hubs in normal breast, and estrogen receptor-positive and estrogen receptor-negative invasive breast cancer, and the emergence of distinct patterns of functional network enrichment. In contrast to normal breast, the strongest epithelial-stromal co-expression relationships in invasive breast cancer mostly represent self-loops, in which the same gene is co-expressed in epithelial and stromal regions. We validate this observation using an independent laser capture micro-dissection dataset and confirm that self-loop interactions are significantly increased in cancer by performing computational image analysis of epithelial and stromal protein expression using images from the Human Protein Atlas. CONCLUSIONS: Epithelial-stromal co-expression network analysis represents a new approach for systems-level analyses of spatially localized transcriptomic data. The analysis provides new biological insights into the rewiring of epithelial-stromal co-expression networks and the emergence of epithelial-stromal co-expression self-loops in breast cancer. The approach may facilitate the development of new diagnostics and therapeutics targeting epithelial-stromal interactions in cancer.
Subject(s)
Breast Neoplasms/genetics , Breast/metabolism , Epithelial Cells/metabolism , Gene Expression Regulation, Neoplastic , Gene Regulatory Networks , Breast Neoplasms/metabolism , Female , Gene Expression Profiling , Genomics , Humans , Immunohistochemistry , Receptors, Estrogen , Stromal Cells/metabolism , Tissue Array AnalysisABSTRACT
UNLABELLED: The PI3K-AKT signaling pathway regulates all phenotypes that contribute to progression of human cancers, including breast cancer. AKT mediates signal relay by phosphorylating numerous substrates, which are causally implicated in biologic responses such as cell growth, survival, metabolic reprogramming, migration, and invasion. Here a new AKT substrate is identified, the adherens junction protein Afadin, which is phosphorylated by AKT at Ser1718. Importantly, under conditions of physiologic IGF-1 signaling and oncogenic PI3K and AKT, Afadin is phosphorylated by all AKT isoforms, and this phosphorylation elicits a relocalization of Afadin from adherens junctions to the nucleus. Also, phosphorylation of Afadin increased breast cancer cell migration that was dependent on Ser1718 phosphorylation. Finally, nuclear localization of Afadin was observed in clinical breast cancer specimens, indicating that regulation of Afadin by the PI3K-AKT pathway has pathophysiologic significance. IMPLICATIONS: Phosphorylation of the adhesion protein Afadin by AKT downstream of the PI3K pathway, leads to redistribution of Afadin and controls cancer cell migration.
Subject(s)
Adherens Junctions/metabolism , Breast Neoplasms/metabolism , Breast Neoplasms/pathology , Cell Movement/physiology , Microfilament Proteins/metabolism , Proto-Oncogene Proteins c-akt/metabolism , Adherens Junctions/genetics , Breast Neoplasms/genetics , Cell Culture Techniques , Cell Growth Processes/physiology , Cell Line, Tumor , Female , Humans , Microfilament Proteins/genetics , Phosphorylation , Proto-Oncogene Proteins c-akt/genetics , Signal TransductionABSTRACT
The categorization of intraductal proliferative lesions of the breast based on routine light microscopic examination of histopathologic sections is in many cases challenging, even for experienced pathologists. The development of computational tools to aid pathologists in the characterization of these lesions would have great diagnostic and clinical value. As a first step to address this issue, we evaluated the ability of computational image analysis to accurately classify DCIS and UDH and to stratify nuclear grade within DCIS. Using 116 breast biopsies diagnosed as DCIS or UDH from the Massachusetts General Hospital (MGH), we developed a computational method to extract 392 features corresponding to the mean and standard deviation in nuclear size and shape, intensity, and texture across 8 color channels. We used L1-regularized logistic regression to build classification models to discriminate DCIS from UDH. The top-performing model contained 22 active features and achieved an AUC of 0.95 in cross-validation on the MGH data-set. We applied this model to an external validation set of 51 breast biopsies diagnosed as DCIS or UDH from the Beth Israel Deaconess Medical Center, and the model achieved an AUC of 0.86. The top-performing model contained active features from all color-spaces and from the three classes of features (morphology, intensity, and texture), suggesting the value of each for prediction. We built models to stratify grade within DCIS and obtained strong performance for stratifying low nuclear grade vs. high nuclear grade DCIS (AUC = 0.98 in cross-validation) with only moderate performance for discriminating low nuclear grade vs. intermediate nuclear grade and intermediate nuclear grade vs. high nuclear grade DCIS (AUC = 0.83 and 0.69, respectively). These data show that computational pathology models can robustly discriminate benign from malignant intraductal proliferative lesions of the breast and may aid pathologists in the diagnosis and classification of these lesions.
Subject(s)
Breast Neoplasms/pathology , Breast/pathology , Carcinoma, Ductal, Breast/pathology , Carcinoma, Intraductal, Noninfiltrating/pathology , Computational Biology , Hyperplasia/pathology , Image Processing, Computer-Assisted/methods , Breast Neoplasms/classification , Female , Humans , Neoplasm Grading , Prognosis , ROC CurveABSTRACT
Mammographic density reflects the amount of stromal and epithelial tissues in relation to adipose tissue in the breast and is a strong risk factor for breast cancer. Here we report the results from meta-analysis of genome-wide association studies (GWAS) of three mammographic density phenotypes: dense area, non-dense area and percent density in up to 7,916 women in stage 1 and an additional 10,379 women in stage 2. We identify genome-wide significant (P<5 × 10(-8)) loci for dense area (AREG, ESR1, ZNF365, LSP1/TNNT3, IGF1, TMEM184B and SGSM3/MKL1), non-dense area (8p11.23) and percent density (PRDM6, 8p11.23 and TMEM184B). Four of these regions are known breast cancer susceptibility loci, and four additional regions were found to be associated with breast cancer (P<0.05) in a large meta-analysis. These results provide further evidence of a shared genetic basis between mammographic density and breast cancer and illustrate the power of studying intermediate quantitative phenotypes to identify putative disease-susceptibility loci.
Subject(s)
Breast Neoplasms/diagnostic imaging , Breast Neoplasms/genetics , Genetic Loci , Genetic Predisposition to Disease , Genome-Wide Association Study , Mammary Glands, Human/abnormalities , Breast Density , Case-Control Studies , Female , Humans , Polymorphism, Single Nucleotide/genetics , RadiographyABSTRACT
Stressful life experiences are known to be a precipitating factor for many mental disorders. The social defeat model induces behavioral responses in rodents (e.g. reduced social interaction) that are similar to behavioral patterns associated with mood disorders. The model has contributed to the discovery of novel mechanisms regulating behavioral responses to stress, but its utility has been largely limited to males. This is disadvantageous because most mood disorders have a higher incidence in women versus men. Male and female California mice (Peromyscus californicus) aggressively defend territories, which allowed us to observe the effects of social defeat in both sexes. In two experiments, mice were exposed to three social defeat or control episodes. Mice were then behaviorally phenotyped, and indirect markers of brain activity and corticosterone responses to a novel social stimulus were assessed. Sex differences in behavioral responses to social stress were long lasting (4 wks). Social defeat reduced social interaction responses in females but not males. In females, social defeat induced an increase in the number of phosphorylated CREB positive cells in the nucleus accumbens shell after exposure to a novel social stimulus. This effect of defeat was not observed in males. The effects of defeat in females were limited to social contexts, as there were no differences in exploratory behavior in the open field or light-dark box test. These data suggest that California mice could be a useful model for studying sex differences in behavioral responses to stress, particularly in neurobiological mechanisms that are involved with the regulation of social behavior.