Your browser doesn't support javascript.
loading
Mostrar: 20 | 50 | 100
Resultados 1 - 20 de 46
Filtrar
1.
Comput Struct Biotechnol J ; 23: 2152-2162, 2024 Dec.
Artículo en Inglés | MEDLINE | ID: mdl-38827234

RESUMEN

Background and objective: Systemic autoinflammatory diseases (SAIDs) are characterized by widespread inflammation, but for most of them there is a lack of specific biomarkers for accurate diagnosis. Although a number of machine learning algorithms have been used to analyze SAID datasets, aiding in the discovery of novel biomarkers, there is a growing recognition of the importance of SAID timeseries clustering, as it can capture the temporal dynamics of gene expression patterns. Methodology: This paper proposes a novel clustering methodology to efficiently associate three-dimensional data. The algorithm utilizes competitive learning to create a self-organizing neural network and adjust neuron positions in time-dependent and high dimensional feature space in order to assign them as clustering centers. The quantitative evaluation of the clustering was based on well-known clustering indices. Furthermore, a differential expression analysis and classification pipeline was employed to assess the capability of the proposed methodology to extract more accurate pathway-specific genes from its clusters. For that, a comparative analysis was also conducted against a heuristic timeseries clustering method. Results: The proposed methodology achieved better overall clustering indices scores and classification metrics using genes derived from its clusters. Notable cases include a threefold increase in the Calinski-Harabasz clustering index, a twofold improvement in the Davies-Bouldin clustering index and a ∼60% increase in the classification specificity score. Conclusion: A novel clustering methodology was developed and applied on several gene expression timeseries datasets from systemic autoinflammatory diseases, and its ability to efficiently produce well separated clusters compared to existing heuristic methods was demonstrated.

2.
Biomed Rep ; 20(3): 45, 2024 Mar.
Artículo en Inglés | MEDLINE | ID: mdl-38357244

RESUMEN

Allicin is a thiosulphate molecule produced in garlic (Allium sativum) and has a wide range of biological actions and pharmaceutical applications. Its precursor molecule is the non-proteinogenic amino acid alliin (S-allylcysteine sulphoxide). The alliin biosynthetic pathway in garlic involves a group of enzymes, members of which are the γ-glutamyl-transpeptidase isoenzymes, Allium sativum γ-glutamyl-transpeptidase AsGGT1, AsGGT2 and AsGGT3, which catalyze the removal of the γ-glutamyl group from γ-glutamyl-S-allyl-L-cysteine to produce S-allyl-L-cysteine. This removal is followed by an S-oxygenation, which leads to the biosynthesis of alliin. The aim of the present study is to annotate previously discovered genes of garlic γ-glutamyl-transpeptidases, as well as a fourth candidate gene (AsGGT4) that has yet not been described. The annotation includes identifying the loci of the genes in the garlic genome, revealing the overall structure and conserved regions of these genes, and elucidating the evolutionary history of these enzymes through their phylogenetic analysis. The genomic structure of γ-glutamyl-transpeptidase genes is conserved; each gene consists of seven exons, and these genes are located on different chromosomes. AsGGT3 and AsGGT4 enzymes contain a signal peptide. To that end, the AsGGT3 protein sequence was corrected; four indel events occurring in AsGGT3 coding regions suggested that at least in the garlic variety Ershuizao, AsGGT3 may be a pseudogene. Finally, the use of protein structure prediction tools allowed the visualization of the tertiary structure of the candidate peptide.

3.
Artículo en Inglés | MEDLINE | ID: mdl-38082601

RESUMEN

An emerging area in data science that has lately gained attention is the virtual population (VP) and synthetic data generation. This field has the potential to significantly affect the healthcare industry by providing a means to augment clinical research databases that have a shortage of subjects. The current study provides a comparative analysis of five distinct approaches for creating virtual data populations from real patient data. The data set utilized for the current analyses involved clinical data collected among patients scheduled for elective coronary artery bypass graft surgery (CABG). To that end, the five computational techniques employed to augment the given dataset were: (i) Tabular Preset, (ii) Gaussian Copula Model (iii) Generative Adversarial Network based (GAN) Deep Learning data synthesizer (CTGAN), (iv) a variation of the CTGAN Model (Copula GAN), and (v) VAE-based Deep Learning data synthesizer (TVAE). The performance of these techniques was assessed against their effectiveness in producing high-quality virtual data. For this purpose, dataset correlation matrices, cosine similarity distance, density histograms, and kernel density estimation are employed to perform a comparative analysis of each attribute and the respective synthetic equivalent. Our findings demonstrate that Gaussian Copula Model prevails in creating virtual data with consistent distributions (Kolmogorov-Smirnov (KS) and Chi-Squared (CS) tests equal to 0.9 and 0.98, respectively) and correlation patterns (average cosine similarity equals to 0.95).Clinical Relevance- It has been shown that the use of a VP can increase the predictive performance of a ML model, i.e., above using a smaller non-augmented population.


Asunto(s)
Puente de Arteria Coronaria , Corazón , Humanos , Enfermedad Crónica , Exactitud de los Datos , Ciencia de los Datos
4.
Artículo en Inglés | MEDLINE | ID: mdl-38083327

RESUMEN

A preliminary analysis was conducted on data acquired from RNA sequencing and SomaScan platforms, for the classification of patients with Inflammation of Unknown Origin. To this end, a multimodal data integration approach was designed, by combining the two platforms, in order to assess the potentiality of learning estimators, using the differentially expressed features from the independent profiling experiments of both platforms. The classification framing was the differentiation of Inflammation of Unknown Origin patients against a multitude of Systemic Autoinflammatory disease patients. Separate false discovery rate analyses were performed on each dataset to extract statistically significant features between the two designated sample groups. Genomic analysis managed higher overall classification metrics compared to proteomic analysis, averaging an ~19% increase overall metrics and classifiers, with a ~0.07% increase in standard error. The multimodal data integration approach achieved similar results to the individual platforms' analyses. More specifically, it managed the same classification accuracy, sensitivity, and specificity scores as the best individual analysis, with the simple Logistic Regression estimator.Clinical Relevance- This study highlights the advantage of exploiting RNA sequencing data to identify potential Inflammation of Unknown Origin disease specific biomarkers, even against other Systemic Autoinflammatory diseases. These findings are further emphasized given the non-apparent clinical discrepancy between Inflammation of Unknown Origin and other Systemic Autoinflammatory diseases.


Asunto(s)
Enfermedades Autoinflamatorias Hereditarias , Proteómica , Humanos , Proteómica/métodos , RNA-Seq , Genómica/métodos , Análisis de Secuencia de ARN/métodos , Síndrome
5.
Annu Int Conf IEEE Eng Med Biol Soc ; 2022: 329-332, 2022 07.
Artículo en Inglés | MEDLINE | ID: mdl-36085667

RESUMEN

Glucose prediction is used in diabetes self-management as it allows to take suitable actions for proper glycemic regulation of the patient. The aim of this work is the short-term personalized glucose prediction in patients with Type 1 diabetes mellitus (T1DM). In this scope, we compared two different models, an autoregressive moving average (ARMA) model and a long short-term memory (LSTM) model for different prediction horizons. The comparison of two models was performed using the evaluation metrics of root mean square error (RMSE) and mean absolute error (MAE). The models were trained and tested in 29 real patients. The results shown that the LSTM model had better performance than ARMA with RMSE 3.13, 6.41 and 8.81 mg/dL and MAE 1.98, 5.06 and 6.47 mg/dL for 5-, 15- and 30-minutes prediction horizon.


Asunto(s)
Diabetes Mellitus Tipo 1 , Benchmarking , Glucemia , Diabetes Mellitus Tipo 1/diagnóstico , Glucosa , Conductas Relacionadas con la Salud , Humanos
6.
Annu Int Conf IEEE Eng Med Biol Soc ; 2022: 1020-1023, 2022 07.
Artículo en Inglés | MEDLINE | ID: mdl-36086001

RESUMEN

Although several studies have utilized AI (artificial intelligence)-based solutions to enhance the decision making for mechanical ventilation, as well as, for mortality in COVID-19, the extraction of explainable predictors regarding heparin's effect in intensive care and mortality has been left unresolved. In the present study, we developed an explainable AI (XAI) workflow to shed light into predictors for admission in the intensive care unit (ICU), as well as, for mortality across those hospitalized COVID-19 patients who received heparin. AI empowered classifiers, such as, the hybrid Extreme gradient boosting (HXGBoost) with customized loss functions were trained on time-series curated clinical data to develop robust AI models. Shapley additive explanation analysis (SHAP) was conducted to determine the positive or negative impact of the predictors in the model's output. The HXGBoost predicted the risk for intensive care and mortality with 0.84 and 0.85 accuracy, respectively. SHAP analysis indicated that the low percentage of lymphocytes at day 7 along with increased FiO2 at days 1 and 5, low SatO2 at days 3 and 7 increase the probability for mortality and highlight the positive effect of heparin administration at the early days of hospitalization for reducing mortality.


Asunto(s)
COVID-19 , Respiración Artificial , Inteligencia Artificial , Heparina/uso terapéutico , Mortalidad Hospitalaria , Humanos
7.
Artículo en Inglés | MEDLINE | ID: mdl-36086666

RESUMEN

A meta-analysis study was conducted to compare high-throughput technologies in the classification of Adult-Onset Still's Disease patients, using differentially expressed genes from independent profiling experiments. We exploited two publicly available datasets from the Gene Expression Omnibus and performed a separate differential expression analysis on each dataset to extract statistically important genes. We then mapped the genes of the two datasets and subsequently we employed well-established machine learning algorithms to evaluate the denoted genes as candidate biomarkers. Using next-generation sequencing data, we managed to achieve the maximum (100%) classification accuracy, sensitivity and specificity with the Gradient Boosting and the Random Forest classifiers, compared to the 83% of the DNA microarray data. Clinical Relevance- When biomarkers derived from one study are applied to the data of another, in many cases the results may diverge significantly. Here we establish that in cross-profiling meta-analysis approaches based on differential expression analysis, next-generation sequencing data provide more accurate results than microarray experiments in the classification of Adult-Onset Still's Disease patients.


Asunto(s)
Perfilación de la Expresión Génica , Enfermedad de Still del Adulto , Biomarcadores , Perfilación de la Expresión Génica/métodos , Humanos , Aprendizaje Automático , Análisis de Secuencia por Matrices de Oligonucleótidos/métodos , Enfermedad de Still del Adulto/diagnóstico , Enfermedad de Still del Adulto/genética
8.
Comput Biol Med ; 141: 105176, 2022 02.
Artículo en Inglés | MEDLINE | ID: mdl-35007991

RESUMEN

The coronavirus disease 2019 (COVID-19) which is caused by severe acute respiratory syndrome coronavirus type 2 (SARS-CoV-2) is consistently causing profound wounds in the global healthcare system due to its increased transmissibility. Currently, there is an urgent unmet need to identify the underlying dynamic associations among COVID-19 patients and distinguish patient subgroups with common clinical profiles towards the development of robust classifiers for ICU admission and mortality. To address this need, we propose a four step pipeline which: (i) enhances the quality of multiple timeseries clinical data through an automated data curation workflow, (ii) deploys Dynamic Bayesian Networks (DBNs) for the detection of features with increased connectivity based on dynamic association analysis across multiple points, (iii) utilizes Self Organizing Maps (SOMs) and trajectory analysis for the early identification of COVID-19 patients with common clinical profiles, and (iv) trains robust multiple additive regression trees (MART) for ICU admission and mortality classification based on the extracted homogeneous clusters, to identify risk factors and biomarkers for disease progression. The contribution of the extracted clusters and the dynamically associated clinical data improved the classification performance for ICU admission to sensitivity 0.83 and specificity 0.83, and for mortality to sensitivity 0.74 and specificity 0.76. Additional information was included to enhance the performance of the classifiers yielding an increase by 4% in sensitivity and specificity for mortality. According to the risk factor analysis, the number of lymphocytes, SatO2, PO2/FiO2, and O2 supply type were highlighted as risk factors for ICU admission and the percentage of neutrophils and lymphocytes, PO2/FiO2, LDH, and ALP for mortality, among others. To our knowledge, this is the first study that combines dynamic modeling with clustering analysis to identify homogeneous groups of COVID-19 patients towards the development of robust classifiers for ICU admission and mortality.


Asunto(s)
COVID-19 , Teorema de Bayes , Hospitalización , Humanos , Unidades de Cuidados Intensivos , Estudios Retrospectivos , SARS-CoV-2
9.
Comput Struct Biotechnol J ; 19: 5546-5555, 2021.
Artículo en Inglés | MEDLINE | ID: mdl-34712399

RESUMEN

Artificial Intelligence (AI) has recently altered the landscape of cancer research and medical oncology using traditional Machine Learning (ML) algorithms and cutting-edge Deep Learning (DL) architectures. In this review article we focus on the ML aspect of AI applications in cancer research and present the most indicative studies with respect to the ML algorithms and data used. The PubMed and dblp databases were considered to obtain the most relevant research works of the last five years. Based on a comparison of the proposed studies and their research clinical outcomes concerning the medical ML application in cancer research, three main clinical scenarios were identified. We give an overview of the well-known DL and Reinforcement Learning (RL) methodologies, as well as their application in clinical practice, and we briefly discuss Systems Biology in cancer research. We also provide a thorough examination of the clinical scenarios with respect to disease diagnosis, patient classification and cancer prognosis and survival. The most relevant studies identified in the preceding year are presented along with their primary findings. Furthermore, we examine the effective implementation and the main points that need to be addressed in the direction of robustness, explainability and transparency of predictive models. Finally, we summarize the most recent advances in the field of AI/ML applications in cancer research and medical oncology, as well as some of the challenges and open issues that need to be addressed before data-driven models can be implemented in healthcare systems to assist physicians in their daily practice.

10.
Front Immunol ; 12: 700582, 2021.
Artículo en Inglés | MEDLINE | ID: mdl-34456913

RESUMEN

Multiple sclerosis (MS) is one of the most common autoimmune diseases which is commonly diagnosed and monitored using magnetic resonance imaging (MRI) with a combination of clinical manifestations. The purpose of this review is to highlight the main applications of Machine Learning (ML) models and their performance in the MS field using MRI. We reviewed the articles of the last decade and grouped them based on the applications of ML in MS using MRI data into four categories: 1) Automated diagnosis of MS, 2) Prediction of MS disease progression, 3) Differentiation of MS stages, 4) Differentiation of MS from similar disorders.


Asunto(s)
Interpretación de Imagen Asistida por Computador/métodos , Aprendizaje Automático , Imagen por Resonancia Magnética/métodos , Esclerosis Múltiple/diagnóstico por imagen , Neuroimagen/métodos , Humanos
11.
Comput Struct Biotechnol J ; 19: 3058-3068, 2021.
Artículo en Inglés | MEDLINE | ID: mdl-34136104

RESUMEN

Unlike autoimmune diseases, there is no known constitutive and disease-defining biomarker for systemic autoinflammatory diseases (SAIDs). Kawasaki disease (KD) is one of the "undiagnosed" types of SAIDs whose pathogenic mechanism and gene mutation still remain unknown. To address this issue, we have developed a sequential computational workflow which clusters KD patients with similar gene expression profiles across the three different KD phases (Acute, Subacute and Convalescent) and utilizes the resulting clustermap to detect prominent genes that can be used as diagnostic biomarkers for KD. Self-Organizing Maps (SOMs) were employed to cluster patients with similar gene expressions across the three phases through inter-phase and intra-phase clustering. Then, false discovery rate (FDR)-based feature selection was applied to detect genes that significantly deviate across the per-phase clusters. Our results revealed five genes as candidate biomarkers for KD diagnosis, namely, the HLA-DQB1, HLA-DRA, ZBTB48, TNFRSF13C, and CASD1. To our knowledge, these five genes are reported for the first time in the literature. The impact of the discovered genes for KD diagnosis against the known ones was demonstrated by training boosting ensembles (AdaBoost and XGBoost) for KD classification on common platform and cross-platform datasets. The classifiers which were trained on the proposed genes from the common platform data yielded an average increase by 4.40% in accuracy, 5.52% in sensitivity, and 3.57% in specificity than the known genes in the Acute and Subacute phases, followed by a notable increase by 2.30% in accuracy, 2.20% in sensitivity, and 4.70% in specificity in the cross-platform analysis.

12.
Diagnostics (Basel) ; 12(1)2021 Dec 28.
Artículo en Inglés | MEDLINE | ID: mdl-35054223

RESUMEN

BACKGROUND: Although several studies have been launched towards the prediction of risk factors for mortality and admission in the intensive care unit (ICU) in COVID-19, none of them focuses on the development of explainable AI models to define an ICU scoring index using dynamically associated biological markers. METHODS: We propose a multimodal approach which combines explainable AI models with dynamic modeling methods to shed light into the clinical features of COVID-19. Dynamic Bayesian networks were used to seek associations among cytokines across four time intervals after hospitalization. Explainable gradient boosting trees were trained to predict the risk for ICU admission and mortality towards the development of an ICU scoring index. RESULTS: Our results highlight LDH, IL-6, IL-8, Cr, number of monocytes, lymphocyte count, TNF as risk predictors for ICU admission and survival along with LDH, age, CRP, Cr, WBC, lymphocyte count for mortality in the ICU, with prediction accuracy 0.79 and 0.81, respectively. These risk factors were combined with dynamically associated biological markers to develop an ICU scoring index with accuracy 0.9. CONCLUSIONS: to our knowledge, this is the first multimodal and explainable AI model which quantifies the risk of intensive care with accuracy up to 0.9 across multiple timepoints.

13.
Comput Biol Med ; 116: 103577, 2020 01.
Artículo en Inglés | MEDLINE | ID: mdl-32001012

RESUMEN

Genomic profiling of cancer studies has generated comprehensive gene expression patterns for diverse phenotypes. Computational methods which employ transcriptomics datasets have been proposed to model gene expression data. Dynamic Bayesian Networks (DBNs) have been used for modeling time series datasets and for the inference of regulatory networks. Furthermore, cancer classification through DBN-based approaches could reveal the importance of exploiting knowledge from statistically significant genes and key regulatory molecules. Although microarray datasets have been employed extensively by several classification methods for decision making, the use of new knowledge from the pathway level has not been addressed adequately in the literature in terms of DBNs for cancer classification. In the present study, we identify the genes that act as regulators and mediate the activity of transcription factors that have been found in all promoters of our differentially expressed gene sets. These features serve as potential priors for distinguishing tumor from normal samples using a DBN-based classification approach. We employed three microarray datasets from the Gene Expression Omnibus (GEO) public functional repository and performed differential expression analysis. Promoter and pathway analysis of the identified genes revealed the key regulators which influence the transcription mechanisms of these genes. We applied the DBN algorithm on selected genes and identified the features that can accurately classify the samples into tumors and controls. Both accuracy and Area Under the Curve (AUC) were high for the gene sets comprising of the differentially expressed genes along with their master regulators (accuracy: 70.8%-98.5%; AUC: 0.562-0.985).


Asunto(s)
Redes Reguladoras de Genes , Neoplasias , Algoritmos , Teorema de Bayes , Biología Computacional , Perfilación de la Expresión Génica , Redes Reguladoras de Genes/genética , Neoplasias/genética , Análisis de Secuencia por Matrices de Oligonucleótidos
14.
IEEE Open J Eng Med Biol ; 1: 49-56, 2020.
Artículo en Inglés | MEDLINE | ID: mdl-35402956

RESUMEN

Lymphoma development constitutes one of the most serious clinico-pathological manifestations of patients with Sjögren's Syndrome (SS). Over the last decades the risk for lymphomagenesis in SS patients has been studied aiming to identify novel biomarkers and risk factors predicting lymphoma development in this patient population. Objective: The current study aims to explore whether genetic susceptibility profiles of SS patients along with known clinical, serological and histological risk factors enhance the accuracy of predicting lymphoma development in this patient population. Methods: The potential predicting role of both genetic variants, clinical and laboratory risk factors were investigated through a Machine Learning-based (ML) framework which encapsulates ensemble classifiers. Results: Ensemble methods empower the classification accuracy with approaches which are sensitive to minor perturbations in the training phase. The evaluation of the proposed methodology based on a 10-fold stratified cross validation procedure yielded considerable results in terms of balanced accuracy (GB: 0.7780 ± 0.1514, RF Gini: 0.7626 ± 0.1787, RF Entropy: 0.7590 ± 0.1837). Conclusions: The initial clinical, serological, histological and genetic findings at an early diagnosis have been exploited in an attempt to establish predictive tools in clinical practice and further enhance our understanding towards lymphoma development in SS.

15.
Annu Int Conf IEEE Eng Med Biol Soc ; 2018: 1307-1310, 2018 Jul.
Artículo en Inglés | MEDLINE | ID: mdl-30440631

RESUMEN

Pancreatic Cancer (PC) can be characterized as one of the most lethal cancers considering its poor diagnosis and symptoms in early stages. To assess the predictive value of regulatory molecules in terms of differentially expressed genes, we first performed a thorough search of gene expression profiling studies in pancreatic cohorts. We obtained the genes that have been identified and validated experimentally to be associated with patient outcome and also differentially expressed in tumors compared with adjacent non-tumor tissues. A two-step upstream analysis on the derived set of the genes under study was performed. The subsequent promoter and pathway analysis unveiled candidate transcription factors and regulatory molecules that potentially have regulated the detected differentially expressed genes. Predictive analysis was applied in the identified regulators and classification algorithms were implemented to model accurately patient outcome. In view of our findings, Gaussian Naïve Bayes model exhibited the highest classification accuracy and f-score concerning the predictive value of regulatory molecules in PC (accuracy =0.85, f-score =0.84).


Asunto(s)
Neoplasias Pancreáticas , Algoritmos , Teorema de Bayes , Perfilación de la Expresión Génica , Regulación Neoplásica de la Expresión Génica , Humanos , Páncreas
16.
Annu Int Conf IEEE Eng Med Biol Soc ; 2017: 3876-3879, 2017 Jul.
Artículo en Inglés | MEDLINE | ID: mdl-29060744

RESUMEN

We propose a meta-analysis scheme for identifying differentially expressed genes in Oral Squamous Cell Carcinoma (OSCC) from different microarray studies. We detect a subset of relevant features and further classify samples under two experimental conditions (i.e healthy and cancer samples) for better patient stratification. A well-established meta-analysis method is adopted and gene expression data sets are derived from a public functional genomics data repository. Our primary aim is the accurate identification of up- and down-regulated genes in order to extract valuable biological information concerning the changes in expression between healthy and cancer samples. According to our results and the extracted informative gene list, a high classification accuracy of healthy and OSCC tumors is achieved with as few genes as possible. Furthermore, the proposed scheme implies that the combination of datasets from different origins may reduce the estimated percentage of false predictions, while the power of gene identification and disease classification is increased.


Asunto(s)
Neoplasias de la Boca , Carcinoma de Células Escamosas , Perfilación de la Expresión Génica , Regulación Neoplásica de la Expresión Génica , Humanos , Análisis de Secuencia por Matrices de Oligonucleótidos
17.
Methods Mol Biol ; 1552: 13-27, 2017.
Artículo en Inglés | MEDLINE | ID: mdl-28224488

RESUMEN

The limitation of most HMMs is their inherent high dimensionality. Therefore we developed several variations of low complexity models that can be applied even to protein families with a few members. In this chapter we present these variations. All of them include the use of a hidden Markov model (HMM), with a small number of states (called reduced state-space HMM), which is trained with both amino acid sequence and secondary structure of proteins whose 3D structure is known and it is used for protein fold classification. We used data from Protein Data Bank and annotation from SCOP database for training and evaluation of the proposed HMM variations for a number of protein folds that belong to major structural classes. Results indicate that the variations have similar performance, or even better in some cases, on classifying proteins than SAM, which is a widely used HMM-based method for protein classification. The major advantage of the proposed variations is that we employed a small number of states and the algorithms used for training and scoring are of low complexity and thus relatively fast. The main variations examined include a version of the reduced state-space HMM with seven states (7-HMM), a version of the reduced state-space HMM with three states (3-HMM) and an optimized version of the reduced state-space HMM with three states, where an optimization process is applied to its scores (optimized 3-HMM).


Asunto(s)
Biología Computacional/métodos , Cadenas de Markov , Pliegue de Proteína , Proteínas/química , Proteínas/clasificación , Algoritmos , Bases de Datos de Proteínas , Humanos , Estructura Secundaria de Proteína
18.
IEEE J Biomed Health Inform ; 21(2): 320-327, 2017 03.
Artículo en Inglés | MEDLINE | ID: mdl-28114044

RESUMEN

Oral squamous cell carcinoma has been characterized as a complex disease which involves dynamic genomic changes at the molecular level. These changes indicate the worth to explore the interactions of the molecules and especially of differentially expressed genes that contribute to cancer progression. Moreover, based on this knowledge the identification of differentially expressed genes and related molecular pathways is of great importance. In the present study, we exploit differentially expressed genes in order to further perform pathway enrichment analysis. According to our results we found significant pathways in which the disease associated genes have been identified as strongly enriched. Furthermore, based on the results of the pathway enrichment analysis we propose a methodology for predicting oral cancer recurrence using dynamic Bayesian networks. The methodology takes into consideration time series gene expression data in order to predict a disease recurrence. Subsequently, we are able to conjecture about the causal interactions between genes in consecutive time intervals. Concerning the performance of the predictive models, the overall accuracy of the algorithm is 81.8% and the area under the ROC curve 89.2% regarding the knowledge from the overrepresented pre-NOTCH Expression and processing pathway.


Asunto(s)
Biología Computacional/métodos , Modelos Estadísticos , Neoplasias de la Boca/genética , Recurrencia Local de Neoplasia/genética , Transducción de Señal/genética , Algoritmos , Carcinoma de Células Escamosas/epidemiología , Carcinoma de Células Escamosas/genética , Carcinoma de Células Escamosas/metabolismo , Perfilación de la Expresión Génica , Humanos , Neoplasias de la Boca/epidemiología , Neoplasias de la Boca/metabolismo , Recurrencia Local de Neoplasia/epidemiología , Recurrencia Local de Neoplasia/metabolismo , Curva ROC , Transcriptoma/genética
19.
Annu Int Conf IEEE Eng Med Biol Soc ; 2016: 5275-5278, 2016 Aug.
Artículo en Inglés | MEDLINE | ID: mdl-28269454

RESUMEN

We propose a methodology for predicting oral cancer recurrence using Dynamic Bayesian Networks. The methodology takes into consideration time series gene expression data collected at the follow-up study of patients that had or had not suffered a disease relapse. Based on that knowledge, our aim is to infer the corresponding dynamic Bayesian networks and subsequently conjecture about the causal relationships among genes within the same time-slice and between consecutive time-slices. Moreover, the proposed methodology aims to (i) assess the prognosis of patients regarding oral cancer recurrence and at the same time, (ii) provide important information about the underlying biological processes of the disease.


Asunto(s)
Neoplasias de la Boca/patología , Recurrencia Local de Neoplasia/patología , Algoritmos , Teorema de Bayes , Bases de Datos Genéticas , Redes Reguladoras de Genes , Humanos , Neoplasias de la Boca/genética , Curva ROC
20.
Artículo en Inglés | MEDLINE | ID: mdl-26738067

RESUMEN

Oral cancer can arise in the head and neck region. Due to the aggressive nature of the disease, which often leads to poor prognosis, Oral Squamous Cell Carcinoma (OSCC) constitutes the 8(th) most common neoplasms in humans. In the present work we formulate gene interaction network from oral cancer genomic data using Dynamic Bayesian Networks (DBNs). Four modules were extracted after applying a clustering technique to the network. We consequently explore them by applying topological and functional analysis methods in order to identify significant network nodes. Our analysis revealed that these important nodes may correspond to candidate biomarkers of the disease.


Asunto(s)
Teorema de Bayes , Biomarcadores de Tumor/genética , Carcinoma de Células Escamosas/genética , Redes Reguladoras de Genes , Neoplasias de la Boca/genética , Carcinoma de Células Escamosas/patología , Bases de Datos Factuales , Regulación Neoplásica de la Expresión Génica , Genómica/métodos , Humanos , Neoplasias de la Boca/patología
SELECCIÓN DE REFERENCIAS
DETALLE DE LA BÚSQUEDA
...