Your browser doesn't support javascript.
loading
Modeling clinical and molecular covariates of mutational process activity in cancer.
Robinson, Welles; Sharan, Roded; Leiserson, Mark D M.
Afiliação
  • Robinson W; Center for Bioinformatics and Computational Biology, University of Maryland, College Park, MD, USA.
  • Sharan R; Cancer Data Science Laboratory, National Cancer Institute, NIH, Bethesda, MD, USA.
  • Leiserson MDM; Blavatnik School of Computer Science, Tel Aviv University, Tel Aviv, Israel.
Bioinformatics ; 35(14): i492-i500, 2019 07 15.
Article em En | MEDLINE | ID: mdl-31510643
ABSTRACT
MOTIVATION Somatic mutations result from processes related to DNA replication or environmental/lifestyle exposures. Knowing the activity of mutational processes in a tumor can inform personalized therapies, early detection, and understanding of tumorigenesis. Computational methods have revealed 30 validated signatures of mutational processes active in human cancers, where each signature is a pattern of single base substitutions. However, half of these signatures have no known etiology, and some similar signatures have distinct etiologies, making patterns of mutation signature activity hard to interpret. Existing mutation signature detection methods do not consider tumor-level clinical/demographic (e.g. smoking history) or molecular features (e.g. inactivations to DNA damage repair genes).

RESULTS:

To begin to address these challenges, we present the Tumor Covariate Signature Model (TCSM), the first method to directly model the effect of observed tumor-level covariates on mutation signatures. To this end, our model uses methods from Bayesian topic modeling to change the prior distribution on signature exposure conditioned on a tumor's observed covariates. We also introduce methods for imputing covariates in held-out data and for evaluating the statistical significance of signature-covariate associations. On simulated and real data, we find that TCSM outperforms both non-negative matrix factorization and topic modeling-based approaches, particularly in recovering the ground truth exposure to similar signatures. We then use TCSM to discover five mutation signatures in breast cancer and predict homologous recombination repair deficiency in held-out tumors. We also discover four signatures in a combined melanoma and lung cancer cohort-using cancer type as a covariate-and provide statistical evidence to support earlier claims that three lung cancers from The Cancer Genome Atlas are misdiagnosed metastatic melanomas. AVAILABILITY AND IMPLEMENTATION TCSM is implemented in Python 3 and available at https//github.com/lrgr/tcsm, along with a data workflow for reproducing the experiments in the paper. SUPPLEMENTARY INFORMATION Supplementary data are available at Bioinformatics online.
Assuntos

Texto completo: 1 Base de dados: MEDLINE Assunto principal: Neoplasias da Mama / Mutação / Neoplasias Tipo de estudo: Prognostic_studies / Screening_studies Limite: Humans Idioma: En Revista: Bioinformatics Assunto da revista: INFORMATICA MEDICA Ano de publicação: 2019 Tipo de documento: Article País de afiliação: Estados Unidos

Texto completo: 1 Base de dados: MEDLINE Assunto principal: Neoplasias da Mama / Mutação / Neoplasias Tipo de estudo: Prognostic_studies / Screening_studies Limite: Humans Idioma: En Revista: Bioinformatics Assunto da revista: INFORMATICA MEDICA Ano de publicação: 2019 Tipo de documento: Article País de afiliação: Estados Unidos