RESUMO
BACKGROUND: Nasopharyngeal carcinoma (NPC) is a distinct cancer of the head and neck that is highly prevalent in Southeast Asia and North Africa. Though an extensive analysis of environmental and genetic contributors has been performed, very little is known about the proteome of this disease. A proteomic analysis of formalin-fixed paraffin-embedded (FFPE) tissues can provide valuable information on protein expression and molecular patterns for both increasing our understanding of the disease and for biomarker discovery. To date, very few NPC proteomic studies have been performed, and none focused on patients from Morocco and North Africa. METHODS: Label-free Liquid Chromatography-Tandem Mass Spectrometry (LC-MS/MS) was used to perform a proteomic analysis of FFPE tissue samples from a cohort of 41 NPC tumor samples of Morocco and North Africa origins. The LC-MS/MS data from this cohort were analyzed alongside 21 healthy controls using MaxQuant 2.4.2.0. A differential expression analysis was performed using the MSstats package in R. Gene Ontology (GO) and Kyoto Encyclopedia of Genes and Genomes (KEGG) functional annotations were carried out using the DAVID bioinformatic tool. RESULTS: 3341 proteins were identified across our NPC cases, revealing three main clusters and five DEPs with prognostic significance. The sex disparity of NPC was investigated from a proteomic perspective in which 59 DEPs were found between males and females, with significantly enriched terms associated with the immune response and gene expression. Furthermore, 26 DEPs were observed between patients with early and advanced stages of NPC with a significant cluster related to the immune response, implicating up-regulated DEPs such as IGHA, IGKC, and VAT1. Across both datasets, 6532 proteins were quantified between NPC patients and healthy controls. Among them, 1507 differentially expressed proteins (DEPs) were observed. GO and KEGG pathway analyses showed enriched terms of DEPs related to increased cellular activity, cell proliferation, and survival. PI3K and MAPK proteins as well as RAC1 BCL2 and PPIA were found to be overexpressed between cancer tissues and healthy controls. EBV infection was also one of the enriched pathways implicating its latent genes like LMP1 and LMP2 that activate several proteins and signaling pathways including NF-Kappa B, MAPK, and JAK-STAT pathways. CONCLUSION: Our findings unveil the proteomic landscape of NPC for the first time in the Moroccan population. These studies additionally may provide a foundation for identifying potential biomarkers. Further research is still needed to help develop tools for the early diagnosis and treatment of NPC in Moroccan and North African populations.
RESUMO
Feature attribution, the ability to localize regions of the input data that are relevant for classification, is an important capability for ML models in scientific and biomedical domains. Current methods for feature attribution, which rely on "explaining" the predictions of end-to-end classifiers, suffer from imprecise feature localization and are inadequate for use with small sample sizes and high-dimensional datasets due to computational challenges. We introduce prospector heads, an efficient and interpretable alternative to explanation-based attribution methods that can be applied to any encoder and any data modality. Prospector heads generalize across modalities through experiments on sequences (text), images (pathology), and graphs (protein structures), outperforming baseline attribution methods by up to 26.3 points in mean localization AUPRC. We also demonstrate how prospector heads enable improved interpretation and discovery of class-specific patterns in input data. Through their high performance, flexibility, and generalizability, prospectors provide a framework for improving trust and transparency for ML models in complex domains.
RESUMO
Though the earliest stages of oncogenesis, post initiation, are not well understood, it is generally appreciated that a successful transition from a collection of dysregulated cells to an aggressive tumour requires complex ecological interactions between cancer cells and their environment. One key component of tumorigenesis is immune evasion. To investigate the interplay amongst the ecological behaviour of mutualism and immune evasion, we used a computational simulation framework. Sensitivity analyses of the growth of a virtual tumour implemented as a 2D-hexagonal lattice model suggests tumour survival depends on the interplay between growth rates, mutualism and immune evasion. In 60% of simulations, cancer clones with low growth rates, but exhibiting mutualism were able to evade the immune system and continue progressing suggesting that tumours with equivalent growth rates and no mutualism are more likely to be eliminated than tumours with mutualism. Tumours with faster growth rates showed a lower dependence upon mutualism for progression. Geostatistical analysis showed decreased spatial heterogeneity over time for polyclonal tumours with a high division rate. Overall, these results suggest that in slow growing tumours, mutualism is critical for early tumorigenesis.
RESUMO
BACKGROUND: Loss of AZGP1 expression is a biomarker associated with progression to castration resistance, development of metastasis, and poor disease-specific survival in prostate cancer. However, high expression of AZGP1 cells in prostate cancer has been reported to increase proliferation and invasion. The exact role of AZGP1 in prostate cancer progression remains elusive. METHOD: AZGP1 knockout and overexpressing prostate cancer cells were generated using a lentiviral system. The effects of AZGP1 under- or over-expression in prostate cancer cells were evaluated by in vitro cell proliferation, migration, and invasion assays. Heterozygous AZGP1± mice were obtained from European Mouse Mutant Archive (EMMA), and prostate tissues from homozygous knockout male mice were collected at 2, 6 and 10 months for histological analysis. In vivo xenografts generated from AZGP1 under- or over-expressing prostate cancer cells were used to determine the role of AZGP1 in prostate cancer tumor growth, and subsequent proteomics analysis was conducted to elucidate the mechanisms of AZGP1 action in prostate cancer progression. AZGP1 expression and microvessel density were measured in human prostate cancer samples on a tissue microarray of 215 independent patient samples. RESULT: Neither the knockout nor overexpression of AZGP1 exhibited significant effects on prostate cancer cell proliferation, clonal growth, migration, or invasion in vitro. The prostates of AZGP1-/- mice initially appeared to have grossly normal morphology; however, we observed fibrosis in the periglandular stroma and higher blood vessel density in the mouse prostate by 6 months. In PC3 and DU145 mouse xenografts, over-expression of AZGP1 did not affect tumor growth. Instead, these tumors displayed decreased microvessel density compared to xenografts derived from PC3 and DU145 control cells, suggesting that AZGP1 functions to inhibit angiogenesis in prostate cancer. Proteomics profiling further indicated that, compared to control xenografts, AZGP1 overexpressing PC3 xenografts are enriched with angiogenesis pathway proteins, including YWHAZ, EPHA2, SERPINE1, and PDCD6, MMP9, GPX1, HSPB1, COL18A1, RNH1, and ANXA1. In vitro functional studies show that AZGP1 inhibits human umbilical vein endothelial cell proliferation, migration, tubular formation and branching. Additionally, tumor microarray analysis shows that AZGP1 expression is negatively correlated with blood vessel density in human prostate cancer tissues. CONCLUSION: AZGP1 is a negative regulator of angiogenesis, such that loss of AZGP1 promotes angiogenesis in prostate cancer. AZGP1 likely exerts heterotypical effects on cells in the tumor microenvironment, such as stromal and endothelial cells. This study sheds light on the anti-angiogenic characteristics of AZGP1 in the prostate and provides a rationale to target AZGP1 to inhibit prostate cancer progression.
Assuntos
Movimento Celular , Proliferação de Células , Neovascularização Patológica , Neoplasias da Próstata , Masculino , Animais , Neoplasias da Próstata/patologia , Neoplasias da Próstata/genética , Neoplasias da Próstata/metabolismo , Humanos , Neovascularização Patológica/genética , Neovascularização Patológica/patologia , Linhagem Celular Tumoral , Camundongos Knockout , Glicoproteínas/metabolismo , Invasividade Neoplásica , Camundongos , Regulação Neoplásica da Expressão Gênica , Angiogênese , Glicoproteína Zn-alfa-2RESUMO
Biomarkers remain the highest value proposition in cancer medicine today-especially protein biomarkers. Despite decades of evolving regulatory frameworks to facilitate the review of emerging technologies, biomarkers have been mostly about promise with very little to show for improvements in human health. Cancer is an emergent property of a complex system, and deconvoluting the integrative and dynamic nature of the overall system through biomarkers is a daunting proposition. The last 2 decades have seen an explosion of multiomics profiling and a range of advanced technologies for precision medicine, including the emergence of liquid biopsy, exciting advances in single-cell analysis, artificial intelligence (machine and deep learning) for data analysis, and many other advanced technologies that promise to transform biomarker discovery. Combining multiple omics modalities to acquire a more comprehensive landscape of the disease state, we are increasingly developing biomarkers to support therapy selection and patient monitoring. Furthering precision medicine, especially in oncology, necessitates moving away from the lens of reductionist thinking toward viewing and understanding that complex diseases are, in fact, complex adaptive systems. As such, we believe it is necessary to redefine biomarkers as representations of biological system states at different hierarchical levels of biological order. This definition could include traditional molecular, histologic, radiographic, or physiological characteristics, as well as emerging classes of digital markers and complex algorithms. To succeed in the future, we must move past purely observational individual studies and instead start building a mechanistic framework to enable integrative analysis of new studies within the context of prior studies. Identifying information in complex systems and applying theoretical constructs, such as information theory, to study cancer as a disease of dysregulated communication could prove to be "game changing" for the clinical outcome of cancer patients.
Assuntos
Biomarcadores Tumorais , Neoplasias , Humanos , Inteligência Artificial , Biomarcadores/análiseRESUMO
Introduction: Prostate cancer is a highly heterogeneous disease, presenting varying levels of aggressiveness and response to treatment. Angiogenesis is one of the hallmarks of cancer, providing oxygen and nutrient supply to tumors. Micro vessel density has previously been correlated with higher Gleason score and poor prognosis. Manual segmentation of blood vessels (BVs) In microscopy images is challenging, time consuming and may be prone to inter-rater variabilities. In this study, an automated pipeline is presented for BV detection and distribution analysis in multiplexed prostate cancer images. Methods: A deep learning model was trained to segment BVs by combining CD31, CD34 and collagen IV images. In addition, the trained model was used to analyze the size and distribution patterns of BVs in relation to disease progression in a cohort of prostate cancer patients (N = 215). Results: The model was capable of accurately detecting and segmenting BVs, as compared to ground truth annotations provided by two reviewers. The precision (P), recall (R) and dice similarity coefficient (DSC) were equal to 0.93 (SD 0.04), 0.97 (SD 0.02) and 0.71 (SD 0.07) with respect to reviewer 1, and 0.95 (SD 0.05), 0.94 (SD 0.07) and 0.70 (SD 0.08) with respect to reviewer 2, respectively. BV count was significantly associated with 5-year recurrence (adjusted p = 0.0042), while both count and area of blood vessel were significantly associated with Gleason grade (adjusted p = 0.032 and 0.003 respectively). Discussion: The proposed methodology is anticipated to streamline and standardize BV analysis, offering additional insights into the biology of prostate cancer, with broad applicability to other cancers.
RESUMO
Assessing the efficacy of cancer therapeutics in mouse models is a critical step in treatment development. However, low-resolution measurement tools and small sample sizes make determining drug efficacy in vivo a difficult and time-intensive task. Here, we present a commercially scalable wearable electronic strain sensor that automates the in vivo testing of cancer therapeutics by continuously monitoring the micrometer-scale progression or regression of subcutaneously implanted tumors at the minute time scale. In two in vivo cancer mouse models, our sensor discerned differences in tumor volume dynamics between drug- and vehicle-treated tumors within 5 hours following therapy initiation. These short-term regression measurements were validated through histology, and caliper and bioluminescence measurements taken over weeklong treatment periods demonstrated the correlation with longer-term treatment response. We anticipate that real-time tumor regression datasets could help expedite and automate the process of screening cancer therapies in vivo.
Assuntos
Cognição , Eletrônica , Animais , Modelos Animais de Doenças , Medições Luminescentes , CamundongosRESUMO
Over the past decade, the development of 'simple' blood tests that enable cancer screening, diagnosis or monitoring and facilitate the design of personalized therapies without the need for invasive tumour biopsy sampling has been a core ambition in cancer research. Data emerging from ongoing biomarker development efforts indicate that multiple markers, used individually or as part of a multimodal panel, are required to enhance the sensitivity and specificity of assays for early stage cancer detection. The discovery of cancer-associated molecular alterations that are reflected in blood at multiple dimensions (genome, epigenome, transcriptome, proteome and metabolome) and integration of the resultant multi-omics data have the potential to uncover novel biomarkers as well as to further elucidate the underlying molecular pathways. Herein, we review key advances in multi-omics liquid biopsy approaches and introduce the 'nano-omics' paradigm: the development and utilization of nanotechnology tools for the enrichment and subsequent omics analysis of the blood-circulating cancerome.
Assuntos
Neoplasias , Proteoma , Biomarcadores/análise , Genoma , Humanos , Metaboloma , Nanotecnologia , Neoplasias/diagnóstico , Neoplasias/genética , Neoplasias/terapia , TranscriptomaRESUMO
Fluorescently labeled antibody and aptamer probes are used in biological studies to characterize binding interactions, measure concentrations of analytes, and sort cells. Fluorescent nanoparticle labels offer an excellent alternative to standard fluorescent labeling strategies due to their enhanced brightness, stability and multivalency; however, challenges in functionalization and characterization have impeded their use. This work introduces a straightforward approach for preparation of fluorescent nanoparticle probes using commercially available reagents and common laboratory equipment. Fluorescent polystyrene nanoparticles, Thermo Fisher Scientific FluoSpheres, were used in these proof-of-principle studies. Particle passivation was achieved by covalent attachment of amine-PEG-azide to carboxylated particles, neutralizing the surface charge from - 43 to - 15 mV. A conjugation-annealing handle and DNA aptamer probe were attached to the azide-PEG nanoparticle surface either through reaction of pre-annealed handle and probe or through a stepwise reaction of the nanoparticles with the handle followed by aptamer annealing. Nanoparticles functionalized with DNA aptamers targeting histidine tags and VEGF protein had high affinity (EC50s ranging from 3 to 12 nM) and specificity, and were more stable than conventional labels. This protocol for preparation of nanoparticle probes relies solely on commercially available reagents and common equipment, breaking down the barriers to use nanoparticles in biological experiments.
Assuntos
Técnicas Biossensoriais , Sondas de DNA/química , Corantes Fluorescentes/química , Nanopartículas/química , Peptídeos/análise , Proteínas/análise , Sequência de Aminoácidos , Aptâmeros de Nucleotídeos/química , Sequência de Bases , Humanos , Nanotecnologia , Polietilenoglicóis , Pontos Quânticos , Coloração e RotulagemRESUMO
BACKGROUND: Nasopharyngeal carcinoma is a multifactorial disease mainly affecting the Asian and North African populations including Morocco. This study aimed to determine the epidemiological profile of nasopharyngeal carcinoma in Northern Morocco as well as its clinicopathological, therapeutic, and prognostic characteristics. METHODS: 129 patients with nasopharyngeal carcinoma followed at the regional center of oncology of Tangier in the period between April 2017 and July 2019, and diagnosed elsewhere from March 2000 to February 2019, were included in this study. Statistical analysis of the data was realized using Statistical Package for the Social Sciences (SPSS) software. RESULTS: Nasopharyngeal carcinoma (NPC) represented 5% of all cases with a median age of 50. The most affected age group was 40-54 years (41.1%). Of all patients, 65.9% were men and 34.1% were women with a sex ratio of 1.93 (Male/Female). Undifferentiated nasopharyngeal carcinomas were the most common histological type affecting 96.12% of patients. At diagnosis, the majority of patients (82.2%) had an advanced stage of NPC (III, VIa, b, c) including 5.4% of metastatic cases (IVc). Most cases (86%) had lymph node involvement with cervical mass being the most common clinical presentation. 81.4% of patients received radiotherapy combined with chemotherapy. Among these patients, 54.3% had concurrent radiochemotherapy preceded by induction chemotherapy. The 5-year overall survival (OS) was 86.8% for all patients. It represented 91.3% for early stages, 87.9% for locally advanced stages, and 57.1% for the metastatic stage significantly. The disease-free survival (DFS) at 5 years was 87.6% knowing that relapse occurred in 16 cases. CONCLUSIONS: Nasopharyngeal carcinoma is a particular disease with a late declaration. It is common in Morocco as is the case in other endemic areas with a high prevalence. Patients' survival is significantly influenced by disease staging.
Assuntos
Carcinoma Nasofaríngeo/epidemiologia , Carcinoma Nasofaríngeo/patologia , Neoplasias Nasofaríngeas/epidemiologia , Neoplasias Nasofaríngeas/patologia , Adolescente , Adulto , Idoso , Idoso de 80 Anos ou mais , Criança , Intervalo Livre de Doença , Feminino , Humanos , Masculino , Pessoa de Meia-Idade , Marrocos/epidemiologia , Carcinoma Nasofaríngeo/terapia , Neoplasias Nasofaríngeas/terapia , Recidiva Local de Neoplasia , Estadiamento de Neoplasias , Prognóstico , Análise de Sobrevida , Adulto JovemRESUMO
BACKGROUND: The acquisition of oncogenic drivers is a critical feature of cancer progression. For some carcinomas, it is clear that certain genetic drivers occur early in neoplasia and others late. Why these drivers are selected and how these changes alter the neoplasia's fitness is less understood. METHODS: Here we use spatially oriented genomic approaches to identify transcriptomic and genetic changes at the single-duct level within precursor neoplasia associated with invasive breast cancer. We study HER2 amplification in ductal carcinoma in situ (DCIS) as an event that can be both quantified and spatially located via fluorescence in situ hybridization (FISH) and immunohistochemistry on fixed paraffin-embedded tissue. RESULTS: By combining the HER2-FISH with the laser capture microdissection (LCM) Smart-3SEQ method, we found that HER2 amplification in DCIS alters the transcriptomic profiles and increases diversity of copy number variations (CNVs). Particularly, interferon signaling pathway is activated by HER2 amplification in DCIS, which may provide a prolonged interferon signaling activation in HER2-positive breast cancer. Multiple subclones of HER2-amplified DCIS with distinct CNV profiles are observed, suggesting that multiple events occurred for the acquisition of HER2 amplification. Notably, DCIS acquires key transcriptomic changes and CNV events prior to HER2 amplification, suggesting that pre-amplified DCIS may create a cellular state primed to gain HER2 amplification for growth advantage. CONCLUSION: By using genomic methods that are spatially oriented, this study identifies several features that appear to generate insights into neoplastic progression in precancer lesions at a single-duct level.
Assuntos
Neoplasias da Mama/genética , Carcinoma Intraductal não Infiltrante/genética , Genoma Humano/genética , Receptor ErbB-2/genética , Transcriptoma/genética , Neoplasias da Mama/patologia , Carcinoma Intraductal não Infiltrante/patologia , Variações do Número de Cópias de DNA , Evolução Molecular , Matriz Extracelular/genética , Feminino , Amplificação de Genes , Humanos , Hibridização in Situ Fluorescente , Interferons/metabolismo , Oncogenes/genética , Transdução de Sinais/genéticaRESUMO
Identification of protein biomarkers for cancer diagnosis and prognosis remains a critical unmet clinical need. A major reason is that the dynamic relationship between proliferating and necrotic cell populations during vascularized tumor growth, and the associated extra- and intra-cellular protein outflux from these populations into blood circulation remains poorly understood. Complementary to experimental efforts, mathematical approaches have been employed to effectively simulate the kinetics of detectable surface proteins (e.g., CA-125) shed into the bloodstream. However, existing models can be difficult to tune and may be unable to capture the dynamics of non-extracellular proteins, such as those shed from necrotic and apoptosing cells. The models may also fail to account for intra-tumoral spatial and microenvironmental heterogeneity. We present a new multi-compartment model to simulate heterogeneously vascularized growing tumors and the corresponding protein outflux. Model parameters can be tuned from histology data, including relative vascular volume, mean vessel diameter, and distance from vasculature to necrotic tissue. The model enables evaluating the difference in shedding rates between extra- and non-extracellular proteins from viable and necrosing cells as a function of heterogeneous vascularization. Simulation results indicate that under certain conditions it is possible for non-extracellular proteins to have superior outflux relative to extracellular proteins. This work contributes towards the goal of cancer biomarker identification by enabling simulation of protein shedding kinetics based on tumor tissue-specific characteristics. Ultimately, we anticipate that models like the one introduced herein will enable examining origins and circulating dynamics of candidate biomarkers, thus facilitating marker selection for validation studies.
Assuntos
Modelos Biológicos , Neoplasias/patologia , Neovascularização Patológica/patologia , Humanos , Neoplasias/metabolismo , Neovascularização Patológica/metabolismoRESUMO
BACKGROUND: While technological advances have made it possible to profile the immune system at high resolution, translating high-throughput data into knowledge of immune mechanisms has been challenged by the complexity of the interactions underlying immune processes. Tools to explore the immune network are critical for better understanding the multi-layered processes that underlie immune function and dysfunction, but require a standardized network map of immune interactions. To facilitate this we have developed ImmunoGlobe, a manually curated intercellular immune interaction network extracted from Janeway's Immunobiology textbook. RESULTS: ImmunoGlobe is the first graphical representation of the immune interactome, and is comprised of 253 immune system components and 1112 unique immune interactions with detailed functional and characteristic annotations. Analysis of this network shows that it recapitulates known features of the human immune system and can be used uncover novel multi-step immune pathways, examine species-specific differences in immune processes, and predict the response of immune cells to stimuli. ImmunoGlobe is publicly available through a user-friendly interface at www.immunoglobe.org and can be downloaded as a computable graph and network table. CONCLUSION: While the fields of proteomics and genomics have long benefited from network analysis tools, no such tool yet exists for immunology. ImmunoGlobe provides a ground truth immune interaction network upon which such tools can be built. These tools will allow us to predict the outcome of complex immune interactions, providing mechanistic insight that allows us to precisely modulate immune responses in health and disease.
Assuntos
Comunicação Celular , Curadoria de Dados , Espaço Extracelular/metabolismo , Sistema Imunitário/metabolismo , Mapas de Interação de Proteínas , Software , Biologia de Sistemas , Animais , Humanos , Camundongos , Modelos ImunológicosRESUMO
Integration of transcriptomic and proteomic data should reveal multi-layered regulatory processes governing cancer cell behaviors. Traditional correlation-based analyses have demonstrated limited ability to identify the post-transcriptional regulatory (PTR) processes that drive the non-linear relationship between transcript and protein abundances. In this work, we ideate an integrative approach to explore the variety of post-transcriptional mechanisms that dictate relationships between genes and corresponding proteins. The proposed workflow utilizes the intuitive technique of scatterplot diagnostics or scagnostics, to characterize and examine the diverse scatterplots built from transcript and protein abundances in a proteogenomic experiment. The workflow includes representing gene-protein relationships as scatterplots, clustering on geometric scagnostic features of these scatterplots, and finally identifying and grouping the potential gene-protein relationships according to their disposition to various PTR mechanisms. Our study verifies the efficacy of the implemented approach to excavate possible regulatory mechanisms by utilizing comprehensive tests on a synthetic dataset. We also propose a variety of 2D pattern-specific downstream analyses methodologies such as mixture modeling, and mapping miRNA post-transcriptional effects to explore each mechanism further. This work suggests that the proposed methodology has the potential for discovering and categorizing post-transcriptional regulatory mechanisms, manifesting in proteogenomic trends. These trends subsequently provide evidence for cancer specificity, miRNA targeting, and identification of regulation impacted by biological functionality and different types of degradation. (Supplementary Material - https://github.com/arunima2/PTRE_PSB_2020).
Assuntos
MicroRNAs , Proteogenômica , Biologia Computacional , Regulação da Expressão Gênica , ProteômicaRESUMO
Benchmark challenges, such as the Critical Assessment of Structure Prediction (CASP) and Dialogue for Reverse Engineering Assessments and Methods (DREAM) have been instrumental in driving the development of bioinformatics methods. Typically, challenges are posted, and then competitors perform a prediction based upon blinded test data. Challengers then submit their answers to a central server where they are scored. Recent efforts to automate these challenges have been enabled by systems in which challengers submit Docker containers, a unit of software that packages up code and all of its dependencies, to be run on the cloud. Despite their incredible value for providing an unbiased test-bed for the bioinformatics community, there remain opportunities to further enhance the potential impact of benchmark challenges. Specifically, current approaches only evaluate end-to-end performance; it is nearly impossible to directly compare methodologies or parameters. Furthermore, the scientific community cannot easily reuse challengers' approaches, due to lack of specifics, ambiguity in tools and parameters as well as problems in sharing and maintenance. Lastly, the intuition behind why particular steps are used is not captured, as the proposed workflows are not explicitly defined, making it cumbersome to understand the flow and utilization of data. Here we introduce an approach to overcome these limitations based upon the WINGS semantic workflow system. Specifically, WINGS enables researchers to submit complete semantic workflows as challenge submissions. By submitting entries as workflows, it then becomes possible to compare not just the results and performance of a challenger, but also the methodology employed. This is particularly important when dozens of challenge entries may use nearly identical tools, but with only subtle changes in parameters (and radical differences in results). WINGS uses a component driven workflow design and offers intelligent parameter and data selection by reasoning about data characteristics. This proves to be especially critical in bioinformatics workflows where using default or incorrect parameter values is prone to drastically altering results. Different challenge entries may be readily compared through the use of abstract workflows, which also facilitate reuse. WINGS is housed on a cloud based setup, which stores data, dependencies and workflows for easy sharing and utility. It also has the ability to scale workflow executions using distributed computing through the Pegasus workflow execution system. We demonstrate the application of this architecture to the DREAM proteogenomic challenge.
Assuntos
Benchmarking/métodos , Semântica , Fluxo de Trabalho , Algoritmos , Biologia Computacional/métodos , Perfilação da Expressão Gênica/estatística & dados numéricos , Genômica , Metadados , Proteínas/genética , Proteínas/metabolismo , Reprodutibilidade dos Testes , Análise de Sequência de RNA/estatística & dados numéricosRESUMO
A major goal of proteomics research is the accurate and sensitive identification and quantification of a broad range of proteins within a sample. Data-independent acquisition (DIA) approaches that acquire MS/MS spectra independently of precursor information have been developed to overcome the reproducibility challenges of data-dependent acquisition and the limited breadth of targeted proteomics strategies. Typical DIA implementations use wide MS/MS isolation windows to acquire comprehensive fragment ion data. However, wide isolation windows produce highly chimeric spectra, limiting the achievable sensitivity and accuracy of quantification and identification. Here, we present a DIA strategy in which spectra are collected with overlapping (rather than adjacent or random) windows and then computationally demultiplexed. This approach improves precursor selectivity by nearly a factor of 2, without incurring any loss in mass range, mass resolution, chromatographic resolution, scan speed, or other key acquisition parameters. We demonstrate a 64% improvement in sensitivity and a 17% improvement in peptides detected in a 6-protein bovine mix spiked into a yeast background. To confirm the method's applicability to a realistic biological experiment, we also analyze the regulation of the proteasome in yeast grown in rapamycin and show that DIA experiments with overlapping windows can help elucidate its adaptation toward the degradation of oxidatively damaged proteins. Our integrated computational and experimental DIA strategy is compatible with any DIA-capable instrument. The computational demultiplexing algorithm required to analyze the data has been made available as part of the open-source proteomics software tools Skyline and msconvert (Proteowizard), making it easy to apply as part of standard proteomics workflows. Graphical Abstract.
RESUMO
Recent advances in our understanding of cancer progression have highlighted the roles played by molecular heterogeneity and by the tumor microenvironment in driving drug resistance and metastasis. The coupling of single-cell measurement technologies with algorithms, such as t-sne and SPADE, have enabled deep investigation of tumor heterogeneity. However, such techniques only capture molecular heterogeneity and do not enable the quantification nor visualization of intercellular interactions. They additionally do not allow the visualization of ecological niches that are critical to understanding tumor behavior. Novel computational tools to quantify and visualize spatial patterns in the tumor microenvironment are critically needed. Here, we take a tumor ecology perspective to examine how predation, mutualism, commensalism, and parasitism may impact tumor development and spatial patterning. We additionally quantify local spatial heterogeneity and the emergent global spatial behavior of the models using geostatistics. By visualizing emergent spatial patterns we demonstrate the potential utility of a geostatistical analysis in differentiating amongst cell-cell interactions in the tumor microenvironment. These studies introduce both an ecological framework for characterizing intercellular interactions in cancer and a novel way of quantifying and visualizing spatial patterns in cancer.
RESUMO
Anterior gradient 2 (AGR2) is a member of the protein disulfide isomerase (PDI) family, which plays a role in the regulation of protein homeostasis and the unfolded protein response pathway (UPR). AGR2 has also been characterized as a proto-oncogene and a potential cancer biomarker. Cellular localization of AGR2 is emerging as a key component for understanding the role of AGR2 as a proto-oncogene. Here, we provide evidence that extracellular AGR2 (eAGR2) promotes tumor metastasis in various in vivo models. To further characterize the role of the intracellular-resident versus extracellular protein, we performed a comprehensive protein-protein interaction screen. Based on these results, we identify AGR2 as an interacting partner of the mTORC2 pathway. Importantly, our data indicates that eAGR2 promotes increased phosphorylation of RICTOR (T1135), while intracellular AGR2 (iAGR2) antagonizes its levels and phosphorylation. Localization of AGR2 also has opposing effects on the Hippo pathway, spheroid formation, and response to chemotherapy in vitro. Collectively, our results identify disparate phenotypes predicated on AGR2 localization. Our findings also provide credence for screening of eAGR2 to guide therapeutic decisions.
Assuntos
Retículo Endoplasmático/genética , Alvo Mecanístico do Complexo 2 de Rapamicina/genética , Metástase Neoplásica/genética , Metástase Neoplásica/patologia , Neoplasias/genética , Neoplasias/patologia , Proteínas/genética , Animais , Linhagem Celular , Linhagem Celular Tumoral , Células HEK293 , Humanos , Células MCF-7 , Masculino , Camundongos , Camundongos Nus , Mucoproteínas , Proteínas Oncogênicas , Células PC-3 , Isomerases de Dissulfetos de Proteínas/genética , Proto-Oncogene Mas , Transdução de Sinais/genética , Resposta a Proteínas não Dobradas/genéticaRESUMO
Convolutional neural networks (CNNs) have gained steady popularity as a tool to perform automatic classification of whole slide histology images. While CNNs have proven to be powerful classifiers in this context, they fail to explain this classification, as the network engineered features used for modeling and classification are ONLY interpretable by the CNNs themselves. This work aims at enhancing a traditional neural network model to perform histology image modeling, patient classification, and interpretation of the distinctive features identified by the network within the histology whole slide images (WSIs). We synthesize a workflow which (a) intelligently samples the training data by automatically selecting only image areas that display visible disease-relevant tissue state and (b) isolates regions most pertinent to the trained CNN prediction and translates them to observable and qualitative features such as color, intensity, cell and tissue morphology and texture. We use the Cancer Genome Atlas's Breast Invasive Carcinoma (TCGA-BRCA) histology dataset to build a model predicting patient attributes (disease stage and node status) and the tumor proliferation challenge (TUPAC 2016) breast cancer histology image repository to help identify disease-relevant tissue state (mitotic activity). We find that our enhanced CNN based workflow both increased patient attribute predictive accuracy (~2% increase for disease stage and ~10% increase for node status) and experimentally proved that a data-driven CNN histology model predicting breast invasive carcinoma stages is highly sensitive to features such as color, cell size, and shape, granularity, and uniformity. This work summarizes the need for understanding the widely trusted models built using deep learning and adds a layer of biological context to a technique that functioned as a classification only approach till now.
RESUMO
INTRODUCTION: Despite apparently complete surgical resection, approximately half of resected early-stage lung cancer patients relapse and die of their disease. Adjuvant chemotherapy reduces this risk by only 5% to 8%. Thus, there is a need for better identifying who benefits from adjuvant therapy, the drivers of relapse, and novel targets in this setting. METHODS: RNA sequencing and liquid chromatography/liquid chromatography-mass spectrometry proteomics data were generated from 51 surgically resected non-small cell lung tumors with known recurrence status. RESULTS: We present a rationale and framework for the incorporation of high-content RNA and protein measurements into integrative biomarkers and show the potential of this approach for predicting risk of recurrence in a group of lung adenocarcinomas. In addition, we characterize the relationship between mRNA and protein measurements in lung adenocarcinoma and show that it is outcome specific. CONCLUSIONS: Our results suggest that mRNA and protein data possess independent biological and clinical importance, which can be leveraged to create higher-powered expression biomarkers.