RESUMO
Pancreatic ductal adenocarcinoma (PDAC) is a highly aggressive cancer with poor patient survival. Toward understanding the underlying molecular alterations that drive PDAC oncogenesis, we conducted comprehensive proteogenomic analysis of 140 pancreatic cancers, 67 normal adjacent tissues, and 9 normal pancreatic ductal tissues. Proteomic, phosphoproteomic, and glycoproteomic analyses were used to characterize proteins and their modifications. In addition, whole-genome sequencing, whole-exome sequencing, methylation, RNA sequencing (RNA-seq), and microRNA sequencing (miRNA-seq) were performed on the same tissues to facilitate an integrated proteogenomic analysis and determine the impact of genomic alterations on protein expression, signaling pathways, and post-translational modifications. To ensure robust downstream analyses, tumor neoplastic cellularity was assessed via multiple orthogonal strategies using molecular features and verified via pathological estimation of tumor cellularity based on histological review. This integrated proteogenomic characterization of PDAC will serve as a valuable resource for the community, paving the way for early detection and identification of novel therapeutic targets.
Assuntos
Adenocarcinoma/genética , Carcinoma Ductal Pancreático/genética , Neoplasias Pancreáticas/genética , Proteogenômica , Adenocarcinoma/diagnóstico , Adulto , Idoso , Idoso de 80 Anos ou mais , Algoritmos , Carcinoma Ductal Pancreático/diagnóstico , Estudos de Coortes , Células Endoteliais/metabolismo , Epigênese Genética , Feminino , Dosagem de Genes , Genoma Humano , Glicólise , Glicoproteínas/biossíntese , Humanos , Masculino , Pessoa de Meia-Idade , Terapia de Alvo Molecular , Neoplasias Pancreáticas/diagnóstico , Fenótipo , Fosfoproteínas/metabolismo , Fosforilação , Prognóstico , Proteínas Quinases/metabolismo , Proteoma/metabolismo , Especificidade por Substrato , Transcriptoma/genéticaRESUMO
Cancer pharmacogenomics studies provide valuable insights into disease progression and associations between genomic features and drug response. PharmacoDB integrates multiple cancer pharmacogenomics datasets profiling approved and investigational drugs across cell lines from diverse tissue types. The web-application enables users to efficiently navigate across datasets, view and compare drug dose-response data for a specific drug-cell line pair. In the new version of PharmacoDB (version 2.0, https://pharmacodb.ca/), we present (i) new datasets such as NCI-60, the Profiling Relative Inhibition Simultaneously in Mixtures (PRISM) dataset, as well as updated data from the Genomics of Drug Sensitivity in Cancer (GDSC) and the Genentech Cell Line Screening Initiative (gCSI); (ii) implementation of FAIR data pipelines using ORCESTRA and PharmacoDI; (iii) enhancements to drug-response analysis such as tissue distribution of dose-response metrics and biomarker analysis; and (iv) improved connectivity to drug and cell line databases in the community. The web interface has been rewritten using a modern technology stack to ensure scalability and standardization to accommodate growing pharmacogenomics datasets. PharmacoDB 2.0 is a valuable tool for mining pharmacogenomics datasets, comparing and assessing drug-response phenotypes of cancer models.
Assuntos
Bases de Dados Genéticas , Farmacogenética/normas , Testes Farmacogenômicos/métodos , Software , Genômica/métodos , HumanosRESUMO
The goal of precision oncology is to tailor treatment for patients individually using the genomic profile of their tumors. Pharmacogenomics datasets such as cancer cell lines are among the most valuable resources for drug sensitivity prediction, a crucial task of precision oncology. Machine learning methods have been employed to predict drug sensitivity based on the multiple omics data available for large panels of cancer cell lines. However, there are no comprehensive guidelines on how to properly train and validate such machine learning models for drug sensitivity prediction. In this paper, we introduce a set of guidelines for different aspects of training gene expression-based predictors using cell line datasets. These guidelines provide extensive analysis of the generalization of drug sensitivity predictors and challenge many current practices in the community including the choice of training dataset and measure of drug sensitivity. The application of these guidelines in future studies will enable the development of more robust preclinical biomarkers.
Assuntos
Resistencia a Medicamentos Antineoplásicos , Aprendizado de Máquina , Farmacogenética , Algoritmos , Linhagem Celular Tumoral , Conjuntos de Dados como Assunto , HumanosRESUMO
Motivation: Predicting anticancer treatment response from baseline genomic data is a critical obstacle in personalized medicine. Machine learning methods are commonly used for predicting drug response from gene expression data. In the process of constructing these machine learning models, one of the most significant challenges is identifying appropriate features among a massive number of genes. Results: In this study, we utilize features (genes) extracted using the text-mining of scientific literatures. Using two independent cancer pharmacogenomic datasets, we demonstrate that text-mining-based features outperform traditional feature selection techniques in machine learning tasks. In addition, our analysis reveals that text-mining feature-based machine learning models trained on in vitro data also perform well when predicting the response of in vivo cancer models. Our results demonstrate that text-mining-based feature selection is an easy to implement approach that is suitable for building machine learning models for anticancer drug response prediction. Availability and implementation: https://github.com/merlab/text_features.
RESUMO
BACKGROUND: Proteins perform their functions in associated cellular locations. Therefore, the study of protein function can be facilitated by predictions of protein location. Protein location can be predicted either from the sequence of a protein alone by identification of targeting peptide sequences and motifs, or by homology to proteins of known location. A third approach, which is complementary, exploits the differences in amino acid composition of proteins associated to different cellular locations, and can be useful if motif and homology information are missing. Here we expand this approach taking into account amino acid composition at different levels of amino acid exposure. RESULTS: Our method has two stages. For stage one, we trained multiple Support Vector Machines (SVMs) to score eukaryotic protein sequences for membership to each of three categories: nuclear, cytoplasmic and extracellular, plus extra category nucleocytoplasmic, accounting for the fact that a large number of proteins shuttles between those two locations. In stage two we use an artificial neural network (ANN) to propose a category from the scores given to the four locations in stage one. The method reaches an accuracy of 68% when using as input 3D-derived values of amino acid exposure. Calibration of the method using predicted values of amino acid exposure allows classifying proteins without 3D-information with an accuracy of 62% and discerning proteins in different locations even if they shared high levels of identity. CONCLUSIONS: In this study we explored the relationship between residue exposure and protein subcellular location. We developed a new algorithm for subcellular location prediction that uses residue exposure signatures. Our algorithm uses a novel approach to address the multiclass classification problem. The algorithm is implemented as web server 'NYCE' and can be accessed at http://cbdm.mdc-berlin.de/~amer/nyce.
Assuntos
Bases de Dados de Proteínas , Proteínas de Transporte Nucleocitoplasmático/química , Proteínas de Transporte Nucleocitoplasmático/metabolismo , Algoritmos , Motivos de Aminoácidos , Sequência de Aminoácidos , Biologia Computacional/métodos , Humanos , Redes Neurais de Computação , Valor Preditivo dos Testes , Análise de Componente Principal , Proteínas/química , Proteínas/metabolismo , Frações Subcelulares/química , Frações Subcelulares/metabolismo , Máquina de Vetores de SuporteRESUMO
In solid tumors, drug concentrations decrease with distance from blood vessels. However, cellular adaptations accompanying the gradated exposure of cancer cells to drugs are largely unknown. Here, we modeled the spatiotemporal changes promoting chemotherapy resistance in breast cancer. Using pairwise cell competition assays at each step during the acquisition of chemoresistance, we reveal an important priming phase that renders cancer cells previously exposed to sublethal drug concentrations refractory to dose escalation. Therapy-resistant cells throughout the concentration gradient display higher expression of the solute carriers SLC38A7 and SLC46A1 and elevated intracellular concentrations of their associated metabolites. Reduced levels of SLC38A7 and SLC46A1 diminish the proliferative potential of cancer cells, and elevated expression of these SLCs in breast tumors from patients correlates with reduced survival. Our work provides mechanistic evidence to support dose-intensive treatment modalities for patients with solid tumors and reveals two members of the SLC family as potential actionable targets.
Assuntos
Neoplasias da Mama , Neoplasias Mamárias Animais , Animais , Humanos , Feminino , Resistencia a Medicamentos Antineoplásicos/genética , Neoplasias da Mama/tratamento farmacológico , Neoplasias da Mama/genética , Neoplasias da Mama/metabolismo , Mama/metabolismo , Transportador de Folato Acoplado a PrótonRESUMO
Identifying biomarkers predictive of cancer cell response to drug treatment constitutes one of the main challenges in precision oncology. Recent large-scale cancer pharmacogenomic studies have opened new avenues of research to develop predictive biomarkers by profiling thousands of human cancer cell lines at the molecular level and screening them with hundreds of approved drugs and experimental chemical compounds. Many studies have leveraged these data to build predictive models of response using various statistical and machine learning methods. However, a common pitfall to these methods is the lack of interpretability as to how they make predictions, hindering the clinical translation of these models. To alleviate this issue, we used the recent logic modeling approach to develop a new machine learning pipeline that explores the space of bimodally expressed genes in multiple large in vitro pharmacogenomic studies and builds multivariate, nonlinear, yet interpretable logic-based models predictive of drug response. The performance of this approach was showcased in a compendium of the three largest in vitro pharmacogenomic datasets to build robust and interpretable models for 101 drugs that span 17 drug classes with high validation rates in independent datasets. These results along with in vivo and clinical validation support a better translation of gene expression biomarkers between model systems using bimodal gene expression. SIGNIFICANCE: A new machine learning pipeline exploits the bimodality of gene expression to provide a reliable set of candidate predictive biomarkers with a high potential for clinical translatability.
Assuntos
Neoplasias , Biomarcadores , Expressão Gênica , Humanos , Neoplasias/tratamento farmacológico , Neoplasias/genética , Farmacogenética , Medicina de PrecisãoRESUMO
Resistance to targeted therapies is an important clinical problem in HER2-positive (HER2+) breast cancer. "Drug-tolerant persisters" (DTP), a subpopulation of cancer cells that survive via reversible, nongenetic mechanisms, are implicated in resistance to tyrosine kinase inhibitors (TKI) in other malignancies, but DTPs following HER2 TKI exposure have not been well characterized. We found that HER2 TKIs evoke DTPs with a luminal-like or a mesenchymal-like transcriptome. Lentiviral barcoding/single-cell RNA sequencing reveals that HER2+ breast cancer cells cycle stochastically through a "pre-DTP" state, characterized by a G0-like expression signature and enriched for diapause and/or senescence genes. Trajectory analysis/cell sorting shows that pre-DTPs preferentially yield DTPs upon HER2 TKI exposure. Cells with similar transcriptomes are present in HER2+ breast tumors and are associated with poor TKI response. Finally, biochemical experiments indicate that luminal-like DTPs survive via estrogen receptor-dependent induction of SGK3, leading to rewiring of the PI3K/AKT/mTORC1 pathway to enable AKT-independent mTORC1 activation. SIGNIFICANCE: DTPs are implicated in resistance to anticancer therapies, but their ontogeny and vulnerabilities remain unclear. We find that HER2 TKI-DTPs emerge from stochastically arising primed cells ("pre-DTPs") that engage either of two distinct transcriptional programs upon TKI exposure. Our results provide new insights into DTP ontogeny and potential therapeutic vulnerabilities. This article is highlighted in the In This Issue feature, p. 873.
Assuntos
Neoplasias da Mama , Neoplasias da Mama/tratamento farmacológico , Neoplasias da Mama/genética , Neoplasias da Mama/patologia , Linhagem Celular Tumoral , Resistencia a Medicamentos Antineoplásicos , Feminino , Humanos , Fosfatidilinositol 3-Quinases/metabolismo , Receptor ErbB-2/genética , Receptor ErbB-2/metabolismo , Transdução de SinaisRESUMO
Acute myeloid leukemia (AML) is heterogeneous with one common subtype recognized by the presence of recurrent mutation of nucleophosmin-1 (NPM1). Emerging evidence indicates that within NPM1 mutated AML there is variation in outcome which challenges how best to characterize and treat the individual patient. Our recent findings show that there are two distinct (primitive and committed) subtypes within NPM1 mutated AML patients. These subtypes exhibit specific molecular characteristics, disease differentiation states, patient survival, and differential drug responses.
RESUMO
In this meeting overview, we summarise the scientific program and organisation of the 16th International Society for Computational Biology Student Council Symposium in 2020 (ISCB SCS2020). This symposium was the first virtual edition in an uninterrupted series of symposia that has been going on for 15 years, aiming to unite computational biology students and early career researchers across the globe.
Assuntos
Biologia Computacional , Estudantes , Humanos , PesquisadoresRESUMO
Quantifying response to drug treatment in mouse models of human cancer is important for treatment development and assignment, yet remains a challenging task. To be able to translate the results of the experiments more readily, a preferred measure to quantify this response should take into account more of the available experimental data, including both tumor size over time and the variation among replicates. We propose a theoretically grounded measure, KuLGaP, to compute the difference between the treatment and control arms. We test and compare KuLGaP to four widely used response measures using 329 patient-derived xenograft (PDX) models. Our results show that KuLGaP is more selective than currently existing measures, reduces the risk of false-positive calls, and improves translation of the laboratory results to clinical practice. We also show that outcomes of human treatment better align with the results of the KuLGaP measure than other response measures. KuLGaP has the potential to become a measure of choice for quantifying drug treatment in mouse models as it can be easily used via the kulgap.ca website.
Assuntos
Xenoenxertos , Animais , Modelos Animais de Doenças , Humanos , Camundongos , Ensaios Antitumorais Modelo de XenoenxertoRESUMO
In acute myeloid leukemia (AML), molecular heterogeneity across patients constitutes a major challenge for prognosis and therapy. AML with NPM1 mutation is a distinct genetic entity in the revised World Health Organization classification. However, differing patterns of co-mutation and response to therapy within this group necessitate further stratification. Here we report two distinct subtypes within NPM1 mutated AML patients, which we label as primitive and committed based on the respective presence or absence of a stem cell signature. Using gene expression (RNA-seq), epigenomic (ATAC-seq) and immunophenotyping (CyToF) analysis, we associate each subtype with specific molecular characteristics, disease differentiation state and patient survival. Using ex vivo drug sensitivity profiling, we show a differential drug response of the subtypes to specific kinase inhibitors, irrespective of the FLT3-ITD status. Differential drug responses of the primitive and committed subtype are validated in an independent AML cohort. Our results highlight heterogeneity among NPM1 mutated AML patient samples based on stemness and suggest that the addition of kinase inhibitors to the treatment of cases with the primitive signature, lacking FLT3-ITD, could have therapeutic benefit.
Assuntos
Leucemia Mieloide Aguda/tratamento farmacológico , Leucemia Mieloide Aguda/genética , Mutação/genética , Proteínas Nucleares/genética , Cromatina/metabolismo , Análise por Conglomerados , Regulação Leucêmica da Expressão Gênica/efeitos dos fármacos , Humanos , Imunofenotipagem , Nucleofosmina , Fenótipo , Inibidores de Proteínas Quinases/farmacologia , Inibidores de Proteínas Quinases/uso terapêutico , Reprodutibilidade dos Testes , Análise de SobrevidaRESUMO
PURPOSE: Non-small cell lung cancer (NSCLC) is the most common cause of cancer-related deaths worldwide. There is an unmet need to develop novel clinically relevant models of NSCLC to accelerate identification of drug targets and our understanding of the disease. EXPERIMENTAL DESIGN: Thirty surgically resected NSCLC primary patient tissue and 35 previously established patient-derived xenograft (PDX) models were processed for organoid culture establishment. Organoids were histologically and molecularly characterized by cytology and histology, exome sequencing, and RNA-sequencing analysis. Tumorigenicity was assessed through subcutaneous injection of organoids in NOD/SCID mice. Organoids were subjected to drug testing using EGFR, FGFR, and MEK-targeted therapies. RESULTS: We have identified cell culture conditions favoring the establishment of short-term and long-term expansion of NSCLC organoids derived from primary lung patient and PDX tumor tissue. The NSCLC organoids recapitulated the histology of the patient and PDX tumor. They also retained tumorigenicity, as evidenced by cytologic features of malignancy, xenograft formation, preservation of mutations, copy number aberrations, and gene expression profiles between the organoid and matched parental tumor tissue by whole-exome and RNA sequencing. NSCLC organoid models also preserved the sensitivity of the matched parental tumor to targeted therapeutics, and could be used to validate or discover biomarker-drug combinations. CONCLUSIONS: Our panel of NSCLC organoids closely recapitulates the genomics and biology of patient tumors, and is a potential platform for drug testing and biomarker validation.
Assuntos
Biomarcadores Tumorais/genética , Carcinoma Pulmonar de Células não Pequenas/patologia , Modelos Animais de Doenças , Neoplasias Pulmonares/patologia , Terapia de Alvo Molecular/métodos , Mutação , Organoides/patologia , Animais , Biomarcadores Tumorais/metabolismo , Carcinoma Pulmonar de Células não Pequenas/tratamento farmacológico , Carcinoma Pulmonar de Células não Pequenas/genética , Humanos , Neoplasias Pulmonares/tratamento farmacológico , Neoplasias Pulmonares/genética , Camundongos , Camundongos Endogâmicos NOD , Camundongos SCID , Técnicas de Cultura de Órgãos/métodos , Organoides/efeitos dos fármacos , Organoides/metabolismo , Ensaios Antitumorais Modelo de XenoenxertoRESUMO
BACKGROUND: Long non-coding RNA (lncRNA) expression has been implicated in a range of molecular mechanisms that are central in cancer. However, lncRNA expression has not yet been comprehensively characterized in acute myeloid leukemia (AML). Here, we assess to what extent lncRNA expression is prognostic of AML patient overall survival (OS) and determine if there are indications of lncRNA-based molecular subtypes of AML. METHODS: We performed RNA sequencing of 274 intensively treated AML patients in a Swedish cohort and quantified lncRNA expression. Univariate and multivariate time-to-event analysis was applied to determine association between individual lncRNAs with OS. Unsupervised statistical learning was applied to ascertain if lncRNA-based molecular subtypes exist and are prognostic. RESULTS: Thirty-three individual lncRNAs were found to be associated with OS (adjusted p value < 0.05). We established four distinct molecular subtypes based on lncRNA expression using a consensus clustering approach. LncRNA-based subtypes were found to stratify patients into groups with prognostic information (p value < 0.05). Subsequently, lncRNA expression-based subtypes were validated in an independent patient cohort (TCGA-AML). LncRNA subtypes could not be directly explained by any of the recurrent cytogenetic or mutational aberrations, although associations with some of the established genetic and clinical factors were found, including mutations in NPM1, TP53, and FLT3. CONCLUSION: LncRNA expression-based four subtypes, discovered in this study, are reproducible and can effectively stratify AML patients. LncRNA expression profiling can provide valuable information for improved risk stratification of AML patients.
Assuntos
Leucemia Mieloide Aguda/genética , RNA Longo não Codificante/metabolismo , Feminino , Humanos , Masculino , Nucleofosmina , Prognóstico , Resultado do TratamentoRESUMO
Sequencing-based molecular characterization of tumors provides information required for individualized cancer treatment. There are well-defined molecular subtypes of breast cancer that provide improved prognostication compared to routine biomarkers. However, molecular subtyping is not yet implemented in routine breast cancer care. Clinical translation is dependent on subtype prediction models providing high sensitivity and specificity. In this study we evaluate sample size and RNA-sequencing read requirements for breast cancer subtyping to facilitate rational design of translational studies. We applied subsampling to ascertain the effect of training sample size and the number of RNA sequencing reads on classification accuracy of molecular subtype and routine biomarker prediction models (unsupervised and supervised). Subtype classification accuracy improved with increasing sample size up to N = 750 (accuracy = 0.93), although with a modest improvement beyond N = 350 (accuracy = 0.92). Prediction of routine biomarkers achieved accuracy of 0.94 (ER) and 0.92 (Her2) at N = 200. Subtype classification improved with RNA-sequencing library size up to 5 million reads. Development of molecular subtyping models for cancer diagnostics requires well-designed studies. Sample size and the number of RNA sequencing reads directly influence accuracy of molecular subtyping. Results in this study provide key information for rational design of translational studies aiming to bring sequencing-based diagnostics to the clinic.
Assuntos
Neoplasias da Mama/diagnóstico , Neoplasias da Mama/genética , Projetos de Pesquisa , Análise de Sequência de RNA/métodos , Neoplasias da Mama/classificação , Bases de Dados Genéticas , Feminino , Humanos , Receptores de Superfície Celular/metabolismo , Tamanho da AmostraRESUMO
BACKGROUND: Over-representation of predicted miRNA targets in sets of genes regulated by a given transcription factor (e.g. as defined by ChIP-sequencing experiments) helps to identify biologically relevant miRNA targets and is useful to get insight into post-transcriptional regulation. FINDINGS: To facilitate the application of this approach we have created the mBISON web-application. mBISON calculates the significance of over-representation of miRNA targets in a given non-ranked gene set. The gene set can be specified either by a list of genes or by one or more ChIP-seq datasets followed by a user-defined peak-gene association procedure. mBISON is based on predictions from TargetScan and uses a randomization step to calculate False-Discovery-Rates for each miRNA, including a correction for gene set specific properties such as 3'UTR length. The tool can be accessed from the following web-resource: http://cbdm.mdc-berlin.de/~mgebhardt/cgi-bin/mbison/home . CONCLUSION: mBISON is a web-application that helps to extract functional information about miRNAs from gene lists, which is in contrast to comparable applications easy to use by everyone and can be applied on ChIP-seq data directly.
Assuntos
Internet , MicroRNAs/genética , Imunoprecipitação da Cromatina , Análise de Sequência de RNARESUMO
BACKGROUND: NF-κB, a major transcription factor involved in mammalian inflammatory signaling, is primarily involved in regulation of response to inflammatory cytokines and pathogens. Its levels are tightly regulated since uncontrolled inflammatory response can cause serious diseases. Mathematical models have been useful in revealing the underlying mechanisms, the dynamics, and other aspects of regulation in NF-κB signaling. The recognition that miRNAs are important regulators of gene expression, and that a number of miRNAs target different components of the NF-κB network, motivate the incorporation of miRNA regulated steps in existing mathematical models to help understand the quantitative aspects of miRNA mediated regulation. METHODOLOGY/PRINCIPAL FINDINGS: In this study, two separate scenarios of miRNA regulation within an existing model are considered. In the first, miRNAs target adaptor proteins involved in the synthesis of IKK that serves as the NF-κB activator. In the second, miRNAs target different isoforms of IκB that act as NF-κB inhibitors. Simulations are carried out under two different conditions: when all three isoforms of IκB are present (wild type), and when only one isoform (IκBα) is present (knockout type). In both scenarios, oscillations in the NF-κB levels are observed and are found to be dependent on the levels of miRNAs. CONCLUSIONS/SIGNIFICANCE: Computational modeling can provide fresh insights into intricate regulatory processes. The introduction of miRNAs affects the dynamics of the NF-κB signaling pathway in a manner that depends on the role of the target. This "fine-tuning" property of miRNAs helps to keep the system in check and prevents it from becoming uncontrolled. The results are consistent with earlier experimental findings.