Your browser doesn't support javascript.
loading
Mostrar: 20 | 50 | 100
Resultados 1 - 20 de 20
Filtrar
1.
BMC Bioinformatics ; 24(1): 17, 2023 Jan 16.
Artigo em Inglês | MEDLINE | ID: mdl-36647008

RESUMO

Colorectal cancer (CRC) is the third most common cancer and the second most deathly worldwide. It is a very heterogeneous disease that can develop via distinct pathways where metastasis is the primary cause of death. Therefore, it is crucial to understand the molecular mechanisms underlying metastasis. RNA-sequencing is an essential tool used for studying the transcriptional landscape. However, the high-dimensionality of gene expression data makes selecting novel metastatic biomarkers problematic. To distinguish early-stage CRC patients at risk of developing metastasis from those that are not, three types of binary classification approaches were used: (1) classification methods (decision trees, linear and radial kernel support vector machines, logistic regression, and random forest) using differentially expressed genes (DEGs) as input features; (2) regularized logistic regression based on the Elastic Net penalty and the proposed iTwiner-a network-based regularizer accounting for gene correlation information; and (3) classification methods based on the genes pre-selected using regularized logistic regression. Classifiers using the DEGs as features showed similar results, with random forest showing the highest accuracy. Using regularized logistic regression on the full dataset yielded no improvement in the methods' accuracy. Further classification using the pre-selected genes found by different penalty factors, instead of the DEGs, significantly improved the accuracy of the binary classifiers. Moreover, the use of network-based correlation information (iTwiner) for gene selection produced the best classification results and the identification of more stable and robust gene sets. Some are known to be tumor suppressor genes (OPCML-IT2), to be related to resistance to cancer therapies (RAC1P3), or to be involved in several cancer processes such as genome stability (XRCC6P2), tumor growth and metastasis (MIR602) and regulation of gene transcription (NME2P2). We show that the classification of CRC patients based on pre-selected features by regularized logistic regression is a valuable alternative to using DEGs, significantly increasing the models' predictive performance. Moreover, the use of correlation-based penalization for biomarker selection stands as a promising strategy for predicting patients' groups based on RNA-seq data.


Assuntos
Neoplasias Colorretais , Humanos , Biomarcadores , Modelos Logísticos , Neoplasias Colorretais/genética , Neoplasias Colorretais/patologia , Biomarcadores Tumorais/genética , Biomarcadores Tumorais/metabolismo , Moléculas de Adesão Celular , Proteínas Ligadas por GPI
2.
Cells ; 11(15)2022 07 27.
Artigo em Inglês | MEDLINE | ID: mdl-35954157

RESUMO

Clear cell renal cell carcinoma (ccRCC) is the most common subtype of RCC showing a significant percentage of mortality. One of the priorities of kidney cancer research is to identify RCC-specific biomarkers for early detection and screening of the disease. With the development of high-throughput technology, it is now possible to measure the expression levels of thousands of genes in parallel and assess the molecular profile of individual tumors. Studying the relationship between gene expression and survival outcome has been widely used to find genes associated with cancer survival, providing new information for clinical decision-making. One of the challenges of using transcriptomics data is their high dimensionality which can lead to instability in the selection of gene signatures. Here we identify potential prognostic biomarkers correlated to the survival outcome of ccRCC patients using two network-based regularizers (EN and TCox) applied to Cox models. Some genes always selected by each method were found (COPS7B, DONSON, GTF2E2, HAUS8, PRH2, and ZNF18) with known roles in cancer formation and progression. Afterward, different lists of genes ranked based on distinct metrics (logFC of DEGs or ß coefficients of regression) were analyzed using GSEA to try to find over- or under-represented mechanisms and pathways. Some ontologies were found in common between the gene sets tested, such as nuclear division, microtubule and tubulin binding, and plasma membrane and chromosome regions. Additionally, genes that were more involved in these ontologies and genes selected by the regularizers were used to create a new gene set where we applied the Cox regression model. With this smaller gene set, we were able to significantly split patients into high/low risk groups showing the importance of studying these genes as potential prognostic factors to help clinicians better identify and monitor patients with ccRCC.


Assuntos
Carcinoma de Células Renais , Neoplasias Renais , Biomarcadores Tumorais/genética , Biomarcadores Tumorais/metabolismo , Carcinoma de Células Renais/metabolismo , Humanos , Rim/patologia , Neoplasias Renais/genética , Neoplasias Renais/patologia , Transcriptoma/genética
3.
Stat Methods Med Res ; 31(5): 947-958, 2022 05.
Artigo em Inglês | MEDLINE | ID: mdl-35072570

RESUMO

The extraction of novel information from omics data is a challenging task, in particular, since the number of features (e.g. genes) often far exceeds the number of samples. In such a setting, conventional parameter estimation leads to ill-posed optimization problems, and regularization may be required. In addition, outliers can largely impact classification accuracy.Here we introduce ROSIE, an ensemble classification approach, which combines three sparse and robust classification methods for outlier detection and feature selection and further performs a bootstrap-based validity check. Outliers of ROSIE are determined by the rank product test using outlier rankings of all three methods, and important features are selected as features commonly selected by all methods.We apply ROSIE to RNA-Seq data from The Cancer Genome Atlas (TCGA) to classify observations into Triple-Negative Breast Cancer (TNBC) and non-TNBC tissue samples. The pre-processed dataset consists of 16,600 genes and more than 1,000 samples. We demonstrate that ROSIE selects important features and outliers in a robust way. Identified outliers are concordant with the distribution of the commonly selected genes by the three methods, and results are in line with other independent studies. Furthermore, we discuss the association of some of the selected genes with the TNBC subtype in other investigations. In summary, ROSIE constitutes a robust and sparse procedure to identify outliers and important genes through binary classification. Our approach is ad hoc applicable to other datasets, fulfilling the overall goal of simultaneously identifying outliers and candidate disease biomarkers to the targeted in therapy research and personalized medicine frameworks.


Assuntos
Neoplasias de Mama Triplo Negativas , Humanos , Neoplasias de Mama Triplo Negativas/genética
4.
J Math Biol ; 83(4): 39, 2021 09 22.
Artigo em Inglês | MEDLINE | ID: mdl-34553267

RESUMO

Bone is constantly being renewed: in the adult skeleton, bone resorption and formation are in a tightly coupled balance, allowing for a constant bone density to be maintained. Yet this micro-environment provides the necessary conditions for the growth and proliferation of tumor cells, and thus bone is a common site for the development of metastases, mainly from primary breast and prostate cancer. Mathematical and computational models with differential equations can replicate this bone remodeling process. These models have been extended to include the effects of disruptive tumor pathologies in the bone dynamics, as metastases contribute to the decoupling between bone resorption and formation and to the self-perpetuating tumor growth cycle. Such models may also contemplate the counteraction effects of currently used therapies, and, in the case of treatments with drugs, their pharmocokinetics and pharmacodynamics. We present a thorough overview of biochemical models for bone remodeling, in the presence of a tumour together with anti-cancer and anti-resorptive therapy, formulated as systems of first-order differential equations, or simplified using variable order derivatives. The latter models, of which some are new to this paper, result in equations with fewer parameters, and allow accounting for anomalous diffusion processes. In this way, more compact and parsimonious models, that promptly highlight tumorous bone interactions, are achieved, providing an effective framework to counteract the loss of bone integrity on the affected areas.


Assuntos
Neoplasias Ósseas , Neoplasias da Próstata , Neoplasias Ósseas/tratamento farmacológico , Remodelação Óssea , Humanos , Masculino , Compostos Radiofarmacêuticos , Microambiente Tumoral
5.
Cancers (Basel) ; 13(5)2021 Mar 02.
Artigo em Inglês | MEDLINE | ID: mdl-33801334

RESUMO

Network science has long been recognized as a well-established discipline across many biological domains. In the particular case of cancer genomics, network discovery is challenged by the multitude of available high-dimensional heterogeneous views of data. Glioblastoma (GBM) is an example of such a complex and heterogeneous disease that can be tackled by network science. Identifying the architecture of molecular GBM networks is essential to understanding the information flow and better informing drug development and pre-clinical studies. Here, we review network-based strategies that have been used in the study of GBM, along with the available software implementations for reproducibility and further testing on newly coming datasets. Promising results have been obtained from both bulk and single-cell GBM data, placing network discovery at the forefront of developing a molecularly-informed-based personalized medicine.

6.
Biomedicines ; 8(11)2020 Nov 10.
Artigo em Inglês | MEDLINE | ID: mdl-33182598

RESUMO

Colorectal cancer (CRC) is one of the leading causes of mortality and morbidity in the world. Being a heterogeneous disease, cancer therapy and prognosis represent a significant challenge to medical care. The molecular information improves the accuracy with which patients are classified and treated since similar pathologies may show different clinical outcomes and other responses to treatment. However, the high dimensionality of gene expression data makes the selection of novel genes a problematic task. We propose TCox, a novel penalization function for Cox models, which promotes the selection of genes that have distinct correlation patterns in normal vs. tumor tissues. We compare TCox to other regularized survival models, Elastic Net, HubCox, and OrphanCox. Gene expression and clinical data of CRC and normal (TCGA) patients are used for model evaluation. Each model is tested 100 times. Within a specific run, eighteen of the features selected by TCox are also selected by the other survival regression models tested, therefore undoubtedly being crucial players in the survival of colorectal cancer patients. Moreover, the TCox model exclusively selects genes able to categorize patients into significant risk groups. Our work demonstrates the ability of the proposed weighted regularizer TCox to disclose novel molecular drivers in CRC survival by accounting for correlation-based network information from both tumor and normal tissue. The results presented support the relevance of network information for biomarker identification in high-dimensional gene expression data and foster new directions for the development of network-based feature selection methods in precision oncology.

7.
BMC Bioinformatics ; 21(1): 59, 2020 Feb 18.
Artigo em Inglês | MEDLINE | ID: mdl-32070274

RESUMO

BACKGROUND: Understanding cellular and molecular heterogeneity in glioblastoma (GBM), the most common and aggressive primary brain malignancy, is a crucial step towards the development of effective therapies. Besides the inter-patient variability, the presence of multiple cell populations within tumors calls for the need to develop modeling strategies able to extract the molecular signatures driving tumor evolution and treatment failure. With the advances in single-cell RNA Sequencing (scRNA-Seq), tumors can now be dissected at the cell level, unveiling information from their life history to their clinical implications. RESULTS: We propose a classification setting based on GBM scRNA-Seq data, through sparse logistic regression, where different cell populations (neoplastic and normal cells) are taken as classes. The goal is to identify gene features discriminating between the classes, but also those shared by different neoplastic clones. The latter will be approached via the network-based twiner regularizer to identify gene signatures shared by neoplastic cells from the tumor core and infiltrating neoplastic cells originated from the tumor periphery, as putative disease biomarkers to target multiple neoplastic clones. Our analysis is supported by the literature through the identification of several known molecular players in GBM. Moreover, the relevance of the selected genes was confirmed by their significance in the survival outcomes in bulk GBM RNA-Seq data, as well as their association with several Gene Ontology (GO) biological process terms. CONCLUSIONS: We presented a methodology intended to identify genes discriminating between GBM clones, but also those playing a similar role in different GBM neoplastic clones (including migrating cells), therefore potential targets for therapy research. Our results contribute to a deeper understanding on the genetic features behind GBM, by disclosing novel therapeutic directions accounting for GBM heterogeneity.


Assuntos
Neoplasias Encefálicas/genética , Glioblastoma/genética , RNA-Seq , Neoplasias Encefálicas/metabolismo , Classificação/métodos , Ontologia Genética , Glioblastoma/metabolismo , Humanos , Análise de Célula Única
8.
BMC Bioinformatics ; 20(1): 356, 2019 Jun 25.
Artigo em Inglês | MEDLINE | ID: mdl-31238876

RESUMO

BACKGROUND: Breast and prostate cancers are typical examples of hormone-dependent cancers, showing remarkable similarities at the hormone-related signaling pathways level, and exhibiting a high tropism to bone. While the identification of genes playing a specific role in each cancer type brings invaluable insights for gene therapy research by targeting disease-specific cell functions not accounted so far, identifying a common gene signature to breast and prostate cancers could unravel new targets to tackle shared hormone-dependent disease features, like bone relapse. This would potentially allow the development of new targeted therapies directed to genes regulating both cancer types, with a consequent positive impact in cancer management and health economics. RESULTS: We address the challenge of extracting gene signatures from transcriptomic data of prostate adenocarcinoma (PRAD) and breast invasive carcinoma (BRCA) samples, particularly estrogen positive (ER+), and androgen positive (AR+) triple-negative breast cancer (TNBC), using sparse logistic regression. The introduction of gene network information based on the distances between BRCA and PRAD correlation matrices is investigated, through the proposed twin networks recovery (twiner) penalty, as a strategy to ensure similarly correlated gene features in two diseases to be less penalized during the feature selection procedure. CONCLUSIONS: Our analysis led to the identification of genes that show a similar correlation pattern in BRCA and PRAD transcriptomic data, and are selected as key players in the classification of breast and prostate samples into ER+ BRCA/AR+ TNBC/PRAD tumor and normal tissues, and also associated with survival time distributions. The results obtained are supported by the literature and are expected to unveil the similarities between the diseases, disclose common disease biomarkers, and help in the definition of new strategies for more effective therapies.


Assuntos
Perfilação da Expressão Gênica/métodos , Neoplasias da Próstata/genética , Transcriptoma , Neoplasias de Mama Triplo Negativas/genética , Estrogênios/metabolismo , Feminino , Redes Reguladoras de Genes , Humanos , Modelos Logísticos , Masculino , Análise de Componente Principal , Neoplasias da Próstata/mortalidade , Neoplasias da Próstata/patologia , Receptores Androgênicos/metabolismo , Análise de Sobrevida , Neoplasias de Mama Triplo Negativas/mortalidade , Neoplasias de Mama Triplo Negativas/patologia
9.
BMC Med Inform Decis Mak ; 19(1): 13, 2019 01 17.
Artigo em Inglês | MEDLINE | ID: mdl-30654776

RESUMO

BACKGROUND: Joint models (JM) have emerged as a promising statistical framework to concurrently analyse survival data and multiple longitudinal responses. This is particularly relevant in clinical studies where the goal is to estimate the association between time-to-event data and the biomarkers evolution. In the context of oncological data, JM can indeed provide interesting prognostic markers for the event under study and thus support clinical decisions and treatment choices. However, several problems arise when dealing with this type of data, such as the high-dimensionality of the covariates space, the lack of knowledge about the function structure of the time series and the presence of missing data, facts that may hamper the accurate estimation of the JM. METHODS: We propose to apply JM for the analysis of bone metastatic patients and infer the association of their survival with several covariates, in particular the N-Telopeptide of Type I Collagen (NTX) dynamics. This biomarker has been identified as a relevant prognostic factor in patients with metastatic cancer, but only using static information in some specific time points. RESULTS: We extended this analysis using the full NTX time series for a larger cohort of patients with bone metastasis, and compared the results obtained by the JM and the extended Cox regression model. Imputation based on fuzzy clustering was used to deal with missing values and several functions for NTX evolution were compared, such as rational, exponential and cubic splines. CONCLUSIONS: The JM obtained confirm the association between NTX values and patients' response, attesting the importance of this time series, and additionally provide a deep understanding of the key survival covariates.


Assuntos
Biomarcadores Tumorais/metabolismo , Neoplasias Ósseas/metabolismo , Colágeno Tipo I/metabolismo , Modelos Teóricos , Peptídeos/metabolismo , Análise de Sobrevida , Neoplasias Ósseas/secundário , Humanos , Estudos Longitudinais
10.
Stat Methods Med Res ; 28(10-11): 3042-3056, 2019.
Artigo em Inglês | MEDLINE | ID: mdl-30146936

RESUMO

Correct classification of breast cancer subtypes is of high importance as it directly affects the therapeutic options. We focus on triple-negative breast cancer which has the worst prognosis among breast cancer types. Using cutting edge methods from the field of robust statistics, we analyze Breast Invasive Carcinoma transcriptomic data publicly available from The Cancer Genome Atlas data portal. Our analysis identifies statistical outliers that may correspond to misdiagnosed patients. Furthermore, it is illustrated that classical statistical methods may fail to identify outliers due to their heavy influence, prompting the need for robust statistics. Using robust sparse logistic regression we obtain 36 relevant genes, of which ca. 60% have been previously reported as biologically relevant to triple-negative breast cancer, reinforcing the validity of the method. The remaining 14 genes identified are new potential biomarkers for triple-negative breast cancer. Out of these, JAM3, SFT2D2, and PAPSS1 were previously associated to breast tumors or other types of cancer. The relevance of these genes is confirmed by the new DetectDeviatingCells outlier detection technique. A comparison of gene networks on the selected genes showed significant differences between triple-negative breast cancer and non-triple-negative breast cancer data. The individual role of FOXA1 in triple-negative breast cancer and non-triple-negative breast cancer, and the strong FOXA1-AGR2 connection in triple-negative breast cancer stand out. The goal of our paper is to contribute to the breast cancer/triple-negative breast cancer understanding and management. At the same time it demonstrates that robust regression and outlier detection constitute key strategies to cope with high-dimensional clinical data such as omics data.


Assuntos
Modelos Genéticos , Neoplasias de Mama Triplo Negativas/genética , Biomarcadores Tumorais/genética , Bases de Dados Genéticas , Feminino , Regulação Neoplásica da Expressão Gênica , Redes Reguladoras de Genes , Humanos , Prognóstico
11.
Orthop J Sports Med ; 6(11): 2325967118808242, 2018 Nov.
Artigo em Inglês | MEDLINE | ID: mdl-30505873

RESUMO

BACKGROUND: Painful dysfunctional shoulders with irreparable rotator cuff tears (IRCTs) in active patients are a challenge. Arthroscopic superior capsular reconstruction (ASCR) is a new treatment option originally described using a fascia lata autograft harvested through an open approach. However, concerns about donor site morbidity have discouraged surgeons from using this type of graft. HYPOTHESIS: ASCR using a minimally invasive harvested fascia lata autograft produces good 6-month and 2-year shoulder outcomes in IRCTs, with low-impact thigh morbidity at 2 years. STUDY DESIGN: Case series; Level of evidence, 4. METHODS: From 2015 to 2016, a total of 22 consecutive patients (mean age, 64.8 ± 8.6 years) with chronic IRCTs (Hamada grade 1-2; Goutallier cumulative grade ≥3; Patte stage 1: 2 patients; Patte stage 2: 6 patients; Patte stage 3: 14 patients) underwent ASCR using a minimally invasive harvested fascia lata autograft. All patients completed preoperative and 6-month evaluations consisting of the Simple Shoulder Test (SST), subjective shoulder value (SSV), Constant score (CS), range of motion (ROM), acromiohumeral interval (AHI), and magnetic resonance imaging. Twenty-one patients completed the 2-year shoulder and donor site morbidity assessments. RESULTS: The mean active ROMs improved significantly (P < .001): elevation, from 74.8° ± 55.5° to 104.5° ± 41.9° (6 months) and 143.8° ± 31.7° (2 years); abduction, from 53.2° ± 43.3° to 86.6° ± 32.9° (6 months) and 120.7° ± 37.7° (2 years); external rotation, from 13.2° ± 18.4° to 27.0° ± 16.1° (6 months) and 35.6° ± 17.3° (2 years); and internal rotation, from 1.2 ± 1.5 points to 2.6 ± 1.5 points (6 months) and 3.8 ± 1.2 points (2 years). The mean functional shoulder scores improved significantly (P < .001): SST, from 2.1 ± 2.9 to 6.8 ± 3.5 (6 months) and 8.6 ± 3.5 (2 years); SSV, from 33.0% ± 17.4% to 55.7% ± 25.6% (6 months) and 70.0% ± 23.0% (2 years); CS, from 17.5 ± 13.4 to 42.5 ± 14.9 (6 months) and 64.9 ± 18.0 (2 years). The mean shoulder abduction strength improved significantly (P < .001) from 0.0 to 1.1 ± 1.4 kg (6 months) and 2.8 ± 2.6 kg (2 years). The mean AHI improved from 6.4 ± 3.3 mm to 8.0 ± 2.5 mm (6 months) and decreased to 7.1 ± 2.5 mm (2 years). This 0.7 ± 1.5-mm overall decrease was statistically significant (P = .042). At 6 months, 20 of 22 patients (90.9%) had no graft tears. At 2 years, 12 of 21 patients (57.1%) were bothered by their harvested thigh, 16 (76.2%) noticed donor site changes, 16 (76.2%) considered that the shoulder surgery's end result compensated for the thigh's changes, and 18 (85.7%) would undergo the same surgery again. CONCLUSION: ASCR using a minimally invasive harvested fascia lata autograft produced good 6-month and 2-year shoulder outcomes in IRCTs, with low-impact thigh morbidity at 2 years.

12.
BMC Bioinformatics ; 19(1): 168, 2018 05 04.
Artigo em Inglês | MEDLINE | ID: mdl-29728051

RESUMO

BACKGROUND: Learning accurate models from 'omics data is bringing many challenges due to their inherent high-dimensionality, e.g. the number of gene expression variables, and comparatively lower sample sizes, which leads to ill-posed inverse problems. Furthermore, the presence of outliers, either experimental errors or interesting abnormal clinical cases, may severely hamper a correct classification of patients and the identification of reliable biomarkers for a particular disease. We propose to address this problem through an ensemble classification setting based on distinct feature selection and modeling strategies, including logistic regression with elastic net regularization, Sparse Partial Least Squares - Discriminant Analysis (SPLS-DA) and Sparse Generalized PLS (SGPLS), coupled with an evaluation of the individuals' outlierness based on the Cook's distance. The consensus is achieved with the Rank Product statistics corrected for multiple testing, which gives a final list of sorted observations by their outlierness level. RESULTS: We applied this strategy for the classification of Triple-Negative Breast Cancer (TNBC) RNA-Seq and clinical data from the Cancer Genome Atlas (TCGA). The detected 24 outliers were identified as putative mislabeled samples, corresponding to individuals with discrepant clinical labels for the HER2 receptor, but also individuals with abnormal expression values of ER, PR and HER2, contradictory with the corresponding clinical labels, which may invalidate the initial TNBC label. Moreover, the model consensus approach leads to the selection of a set of genes that may be linked to the disease. These results are robust to a resampling approach, either by selecting a subset of patients or a subset of genes, with a significant overlap of the outlier patients identified. CONCLUSIONS: The proposed ensemble outlier detection approach constitutes a robust procedure to identify abnormal cases and consensus covariates, which may improve biomarker selection for precision medicine applications. The method can also be easily extended to other regression models and datasets.


Assuntos
Neoplasias de Mama Triplo Negativas/genética , Sequenciamento Completo do Genoma/métodos , Feminino , Humanos , Tamanho da Amostra , Neoplasias de Mama Triplo Negativas/patologia
13.
BioData Min ; 11: 1, 2018.
Artigo em Inglês | MEDLINE | ID: mdl-29456628

RESUMO

BACKGROUND: Survival analysis is a statistical technique widely used in many fields of science, in particular in the medical area, and which studies the time until an event of interest occurs. Outlier detection in this context has gained great importance due to the fact that the identification of long or short-term survivors may lead to the detection of new prognostic factors. However, the results obtained using different outlier detection methods and residuals are seldom the same and are strongly dependent of the specific Cox proportional hazards model selected. In particular, when the inherent data have a high number of covariates, dimensionality reduction becomes a key challenge, usually addressed through regularized optimization, e.g. using Lasso, Ridge or Elastic Net regression. In the case of transcriptomics studies, this is an ubiquitous problem, since each observation has a very high number of associated covariates (genes). RESULTS: In order to solve this issue, we propose to use the Rank Product test, a non-parametric technique, as a method to identify discrepant observations independently of the selection method and deviance considered. An example based on the The Cancer Genome Atlas (TCGA) ovarian cancer dataset is presented, where the covariates are patients' gene expressions. Three sub-models were considered, and, for each one, different outliers were obtained. Additionally, a resampling strategy was conducted to demonstrate the methods' consistency and robustness. The Rank Product worked as a consensus method to identify observations that can be influential under survival models, thus potential outliers in the high-dimensional space. CONCLUSIONS: The proposed technique allows us to combine the different results obtained by each sub-model and find which observations are systematically ranked as putative outliers to be explored further from a clinical point of view.

14.
J Theor Biol ; 391: 1-12, 2016 Feb 21.
Artigo em Inglês | MEDLINE | ID: mdl-26657065

RESUMO

Bone is a common site for the development of metastasis, as its microenvironment provides the necessary conditions for the growth and proliferation of cancer cells. Several mathematical models to describe the bone remodeling process and how osteoclasts and osteoblasts coupled action ensures bone homeostasis have been proposed and further extended to include the effect of cancer cells. The model proposed here includes the influence of the parathyroid hormone (PTH) as capable of triggering and regulating the bone remodeling cycle. It also considers the secretion of PTH-related protein (PTHrP) by cancer cells, which stimulates the production of receptor activator of nuclear factor kappa-B ligand (RANKL) by osteoblasts that activates osteoclasts, increasing bone resorption and the subsequent release of growth factors entrapped in the bone matrix, which induce tumor growth, giving rise to a self-perpetuating cycle known as the vicious cycle of bone metastases. The model additionally describes how the presence of metastases contributes to the decoupling between bone resorption and formation. Moreover, the effects of anti-cancer and anti-resorptive treatments, through chemotherapy and the administration of bisphosphonates or denosumab, are also included, along with their corresponding pharmacokinetics (PK) and pharmacodynamics (PD). The simulated models, available at http://sels.tecnico.ulisboa.pt/software/, are able to describe bone remodeling cycles, the growth of bone metastases and how treatment can effectively reduce tumor burden on bone and prevent loss of bone strength.


Assuntos
Neoplasias Ósseas , Denosumab/uso terapêutico , Difosfonatos/uso terapêutico , Modelos Biológicos , Hormônio Paratireóideo/metabolismo , Microambiente Tumoral , Animais , Neoplasias Ósseas/tratamento farmacológico , Neoplasias Ósseas/metabolismo , Neoplasias Ósseas/patologia , Neoplasias Ósseas/secundário , Humanos , Metástase Neoplásica , Osteoblastos/metabolismo , Osteoblastos/patologia , Osteoclastos/metabolismo , Osteoclastos/patologia
15.
BMC Bioinformatics ; 17(Suppl 16): 449, 2016 Dec 13.
Artigo em Inglês | MEDLINE | ID: mdl-28105908

RESUMO

BACKGROUND: Modeling survival oncological data has become a major challenge as the increase in the amount of molecular information nowadays available means that the number of features greatly exceeds the number of observations. One possible solution to cope with this dimensionality problem is the use of additional constraints in the cost function optimization. LASSO and other sparsity methods have thus already been successfully applied with such idea. Although this leads to more interpretable models, these methods still do not fully profit from the relations between the features, specially when these can be represented through graphs. We propose DEGREECOX, a method that applies network-based regularizers to infer Cox proportional hazard models, when the features are genes and the outcome is patient survival. In particular, we propose to use network centrality measures to constrain the model in terms of significant genes. RESULTS: We applied DEGREECOX to three datasets of ovarian cancer carcinoma and tested several centrality measures such as weighted degree, betweenness and closeness centrality. The a priori network information was retrieved from Gene Co-Expression Networks and Gene Functional Maps. When compared with RIDGE and LASSO, DEGREECOX shows an improvement in the classification of high and low risk patients in a par with NET-COX. The use of network information is especially relevant with datasets that are not easily separated. In terms of RMSE and C-index, DEGREECOX gives results that are similar to those of the best performing methods, in a few cases slightly better. CONCLUSIONS: Network-based regularization seems a promising framework to deal with the dimensionality problem. The centrality metrics proposed can be easily expanded to accommodate other topological properties of different biological networks.


Assuntos
Algoritmos , Regulação Neoplásica da Expressão Gênica , Redes Reguladoras de Genes , Neoplasias Ovarianas/genética , Modelos de Riscos Proporcionais , Feminino , Humanos , Modelos Genéticos
16.
Mol Biosyst ; 10(3): 628-39, 2014 Mar 04.
Artigo em Inglês | MEDLINE | ID: mdl-24413179

RESUMO

Biomedical research and biotechnological production are greatly benefiting from the results provided by the development of dynamic models of microbial metabolism. Although several kinetic models of Lactococcus lactis (a Lactic Acid Bacterium (LAB) commonly used in the dairy industry) have been developed so far, most of them are simplified and focus only on specific metabolic pathways. Therefore, the application of mathematical models in the design of an engineering strategy for the production of industrially important products by L. lactis has been very limited. In this work, we extend the existing kinetic model of L. lactis central metabolism to include industrially relevant production pathways such as mannitol and 2,3-butanediol. In this way, we expect to study the dynamics of metabolite production and make predictive simulations in L. lactis. We used a system of ordinary differential equations (ODEs) with approximate Michaelis-Menten-like kinetics for each reaction, where the parameters were estimated from multivariate time-series metabolite concentrations obtained by our team through in vivo Nuclear Magnetic Resonance (NMR). The results show that the model captures observed transient dynamics when validated under a wide range of experimental conditions. Furthermore, we analyzed the model using global perturbations, which corroborate experimental evidence about metabolic responses upon enzymatic changes. These include that mannitol production is very sensitive to lactate dehydrogenase (LDH) in the wild type (W.T.) strain, and to mannitol phosphoenolpyruvate: a phosphotransferase system (PTS(Mtl)) in a LDH mutant strain. LDH reduction has also a positive control on 2,3-butanediol levels. Furthermore, it was found that overproduction of mannitol-1-phosphate dehydrogenase (MPD) in a LDH/PTS(Mtl) deficient strain can increase the mannitol levels. The results show that this model has prediction capability over new experimental conditions and offers promising possibilities to elucidate the effect of alterations in the main metabolism of L. lactis, with application in strain optimization.


Assuntos
Butileno Glicóis/metabolismo , Lactococcus lactis/metabolismo , Manitol/metabolismo , Modelos Biológicos , Análise por Conglomerados , Cinética , L-Lactato Desidrogenase/genética , L-Lactato Desidrogenase/metabolismo , Lactococcus lactis/genética , Redes e Vias Metabólicas , Mutação , Fatores de Tempo
17.
PLoS One ; 8(10): e76300, 2013.
Artigo em Inglês | MEDLINE | ID: mdl-24194833

RESUMO

It is widely agreed that complex diseases are typically caused by the joint effects of multiple instead of a single genetic variation. These genetic variations may show stronger effects when considered together than when considered individually, a phenomenon known as epistasis or multilocus interaction. In this work, we explore the applicability of information interaction to discover pairwise epistatic effects related with complex diseases. We start by showing that traditional approaches such as classification methods or greedy feature selection methods (such as the Fleuret method) do not perform well on this problem. We then compare our information interaction method with BEAM and SNPHarvester in artificial datasets simulating epistatic interactions and show that our method is more powerful to detect pairwise epistatic interactions than its competitors. We show results of the application of information interaction method to the WTCCC breast cancer dataset. Our results are validated using permutation tests. We were able to find 89 statistically significant pairwise interactions with a p-value lower than 10(-3). Even though many recent algorithms have been designed to find epistasis with low marginals, we observed that all (except one) of the SNPs involved in statistically significant interactions have moderate or high marginals. We also report that the interactions found in this work were not present in gene-gene interaction network STRING.


Assuntos
Neoplasias da Mama/epidemiologia , Causalidade , Suscetibilidade a Doenças/epidemiologia , Epistasia Genética/genética , Modelos Teóricos , Biologia Computacional/métodos , Humanos , Polimorfismo de Nucleotídeo Único/genética
18.
BMC Bioinformatics ; 14: 283, 2013 Sep 25.
Artigo em Inglês | MEDLINE | ID: mdl-24067087

RESUMO

BACKGROUND: Existing tools to model cell growth curves do not offer a flexible integrative approach to manage large datasets and automatically estimate parameters. Due to the increase of experimental time-series from microbiology and oncology, the need for a software that allows researchers to easily organize experimental data and simultaneously extract relevant parameters in an efficient way is crucial. RESULTS: BGFit provides a web-based unified platform, where a rich set of dynamic models can be fitted to experimental time-series data, further allowing to efficiently manage the results in a structured and hierarchical way. The data managing system allows to organize projects, experiments and measurements data and also to define teams with different editing and viewing permission. Several dynamic and algebraic models are already implemented, such as polynomial regression, Gompertz, Baranyi, Logistic and Live Cell Fraction models and the user can add easily new models thus expanding current ones. CONCLUSIONS: BGFit allows users to easily manage their data and models in an integrated way, even if they are not familiar with databases or existing computational tools for parameter estimation. BGFit is designed with a flexible architecture that focus on extensibility and leverages free software with existing tools and methods, allowing to compare and evaluate different data modeling techniques. The application is described in the context of bacterial and tumor cells growth data fitting, but it is also applicable to any type of two-dimensional data, e.g. physical chemistry and macroeconomic time series, being fully scalable to high number of projects, data and model complexity.


Assuntos
Biologia Computacional/métodos , Bases de Dados Factuais , Modelos Biológicos , Software , Algoritmos , Proliferação de Células , Internet , Interface Usuário-Computador
19.
World J Gastroenterol ; 18(11): 1243-8, 2012 Mar 21.
Artigo em Inglês | MEDLINE | ID: mdl-22468088

RESUMO

AIM: To investigate whether, under the influence of polypectomy, the incidence of adenoma decreases with age. METHODS: Consecutive patients with colonic adenomas identified at index colonoscopy were retrospectively selected if they had undergone three or more complete colonoscopies, at least 24 mo apart. Patients who had any first-degree relative with colorectal cancer were excluded. Data regarding number of adenomas at each colonoscopy, their location, size and histological classification were recorded. The monthly incidence density of adenomas after the index examination was estimated for the study population, by using the person-years method. Baseline adenomas were excluded from incidence calculations but their characteristics were correlated with recurrence at follow-up, using the χ(2) test. RESULTS: One hundred and fifty-six patients were included (109 male, mean age at index colonoscopy 56.8 ± 10.3 years), with follow-up that ranged from 48 to 232 mo. No significant correlations were observed between the number, the presence of villous component, or the size of adenomas at index colonoscopy and the presence of adenomas at subsequent colonoscopies (P = 0.49, 0.12 and 0.78, respectively). The incidence of colonic adenomas was observed to decay from 1.4% person-months at the beginning of the study to values close to 0%, at 12 years after index colonoscopy. CONCLUSION: Our results suggest the sporadic formation of adenomas occurs within a discrete period and that, when these adenomas are removed, all neoplasia-prone clones may be extinguished.


Assuntos
Adenoma/epidemiologia , Adenoma/cirurgia , Pólipos do Colo/epidemiologia , Pólipos do Colo/cirurgia , Adenoma/patologia , Adulto , Idoso , Pólipos do Colo/patologia , Colonoscopia/métodos , Feminino , Humanos , Masculino , Pessoa de Meia-Idade , Estudos Retrospectivos
20.
J Bioinform Comput Biol ; 9(5): 613-30, 2011 Oct.
Artigo em Inglês | MEDLINE | ID: mdl-21976379

RESUMO

In this study we address the problem of finding a quantitative mathematical model for the genetic network regulating the stress response of the yeast Saccharomyces cerevisiae to the agricultural fungicide mancozeb. An S-system formalism was used to model the interactions of a five-gene network encoding four transcription factors (Yap1, Yrr1, Rpn4 and Pdr3) regulating the transcriptional activation of the FLR1 gene. Parameter estimation was accomplished by decoupling the resulting system of nonlinear ordinary differential equations into a larger nonlinear algebraic system, and using the Levenberg-Marquardt algorithm to fit the models predictions to experimental data. The introduction of constraints in the model, related to the putative topology of the network, was explored. The results show that forcing the network connectivity to adhere to this topology did not lead to better results than the ones obtained using an unrestricted network topology. Overall, the modeling approach obtained partial success when trained on the nonmutant datasets, although further work is required if one wishes to obtain more accurate prediction of the time courses.


Assuntos
Modelos Genéticos , Transportadores de Ânions Orgânicos/genética , Proteínas de Saccharomyces cerevisiae/genética , Saccharomyces cerevisiae/genética , Biologia Computacional , Proteínas de Ligação a DNA/genética , Fungicidas Industriais/farmacologia , Redes Reguladoras de Genes , Genes Fúngicos/efeitos dos fármacos , Maneb/farmacologia , Dinâmica não Linear , Saccharomyces cerevisiae/efeitos dos fármacos , Saccharomyces cerevisiae/metabolismo , Estresse Fisiológico , Fatores de Transcrição/genética , Ativação Transcricional/efeitos dos fármacos , Zineb/farmacologia
SELEÇÃO DE REFERÊNCIAS
DETALHE DA PESQUISA