Your browser doesn't support javascript.
loading
Mostrar: 20 | 50 | 100
Resultados 1 - 20 de 46
Filtrar
Más filtros

Bases de datos
País/Región como asunto
Tipo del documento
Intervalo de año de publicación
1.
Bioinformatics ; 38(11): 3078-3086, 2022 05 26.
Artículo en Inglés | MEDLINE | ID: mdl-35460238

RESUMEN

MOTIVATION: Pathway analyses have led to more insight into the underlying biological functions related to the phenotype of interest in various types of omics data. Pathway-based statistical approaches have been actively developed, but most of them do not consider correlations among pathways. Because it is well known that there are quite a few biomarkers that overlap between pathways, these approaches may provide misleading results. In addition, most pathway-based approaches tend to assume that biomarkers within a pathway have linear associations with the phenotype of interest, even though the relationships are more complex. RESULTS: To model complex effects including non-linear effects, we propose a new approach, Hierarchical structural CoMponent analysis using Kernel (HisCoM-Kernel). The proposed method models non-linear associations between biomarkers and phenotype by extending the kernel machine regression and analyzes entire pathways simultaneously by using the biomarker-pathway hierarchical structure. HisCoM-Kernel is a flexible model that can be applied to various omics data. It was successfully applied to three omics datasets generated by different technologies. Our simulation studies showed that HisCoM-Kernel provided higher statistical power than other existing pathway-based methods in all datasets. The application of HisCoM-Kernel to three types of omics dataset showed its superior performance compared to existing methods in identifying more biologically meaningful pathways, including those reported in previous studies. AVAILABILITY AND IMPLEMENTATION: The HisCoM-Kernel software is freely available at http://statgen.snu.ac.kr/software/HisCom-Kernel/. The RNA-seq data underlying this article are available at https://xena.ucsc.edu/, and the others will be shared on reasonable request to the corresponding author. SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online.


Asunto(s)
Programas Informáticos , Simulación por Computador , Fenotipo , RNA-Seq , Biomarcadores
2.
Ann Neurol ; 87(5): 739-750, 2020 05.
Artículo en Inglés | MEDLINE | ID: mdl-32078179

RESUMEN

OBJECTIVE: We aimed to determine the association between striatal dopaminergic depletion, cerebral ß-amyloid deposition, and cognitive dysfunction in Lewy body disease (LBD). METHODS: This cross-sectional study recruited 48 LBD patients (30 with dementia, 18 with mild cognitive impairment) and 15 control subjects from a university-based hospital. We measured the striatal dopamine transporter (DAT) activity and regional ß-amyloid burden using N-(3-[18 F]fluoropropyl)-2ß-carbon ethoxy-3ß-(4-iodophenyl) nortropane (FP-CIT) positron emission tomography (PET) and 18 F-florbetaben (FBB) PET, respectively. The relationship between striatal FP-CIT uptake, regional cortical FBB uptake, and cognitive function scores was evaluated using path analyses. We also investigated the effects of striatal FP-CIT uptake and cortical FBB uptake on the interval between motor symptom and dementia onset. RESULTS: Reduced striatal FP-CIT uptake was associated with increased FBB uptake in the posterior cortical regions, most prominently in the occipital cortices. Reduced FP-CIT uptake in the anterior putamen was associated with visuospatial dysfunction with mediation of increased occipital FBB uptake. Reduced FP-CIT uptake in the posterior putamen and an increased parietal FBB uptake were independently associated with memory dysfunction. Reduced striatal FP-CIT uptake was associated with attention, language, and frontal/executive dysfunction, independent of amyloid deposition. Increased FBB uptake, especially in the parietal cortex, was associated with earlier onset of dementia. INTERPRETATION: Our results suggest that occipital ß-amyloid deposition may contribute to the association between striatal dopaminergic depletion and visuospatial dysfunction in LBD patients. Although the effects of reduced DAT activity are more prominent than those of ß-amyloid burden on cognitive dysfunction, the latter affects the onset of cognitive dysfunction. ANN NEUROL 2020;87:739-750.


Asunto(s)
Péptidos beta-Amiloides/metabolismo , Disfunción Cognitiva/etiología , Proteínas de Transporte de Dopamina a través de la Membrana Plasmática/metabolismo , Enfermedad por Cuerpos de Lewy/metabolismo , Enfermedad por Cuerpos de Lewy/patología , Anciano , Encéfalo/diagnóstico por imagen , Encéfalo/metabolismo , Encéfalo/patología , Cognición/fisiología , Estudios Transversales , Femenino , Humanos , Enfermedad por Cuerpos de Lewy/complicaciones , Masculino , Tomografía de Emisión de Positrones
3.
J Med Internet Res ; 23(4): e25852, 2021 04 16.
Artículo en Inglés | MEDLINE | ID: mdl-33822738

RESUMEN

BACKGROUND: Limited information is available about the present characteristics and dynamic clinical changes that occur in patients with COVID-19 during the early phase of the illness. OBJECTIVE: This study aimed to develop and validate machine learning models based on clinical features to assess the risk of severe disease and triage for COVID-19 patients upon hospital admission. METHODS: This retrospective multicenter cohort study included patients with COVID-19 who were released from quarantine until April 30, 2020, in Korea. A total of 5628 patients were included in the training and testing cohorts to train and validate the models that predict clinical severity and the duration of hospitalization, and the clinical severity score was defined at four levels: mild, moderate, severe, and critical. RESULTS: Out of a total of 5601 patients, 4455 (79.5%), 330 (5.9%), 512 (9.1%), and 301 (5.4%) were included in the mild, moderate, severe, and critical levels, respectively. As risk factors for predicting critical patients, we selected older age, shortness of breath, a high white blood cell count, low hemoglobin levels, a low lymphocyte count, and a low platelet count. We developed 3 prediction models to classify clinical severity levels. For example, the prediction model with 6 variables yielded a predictive power of >0.93 for the area under the receiver operating characteristic curve. We developed a web-based nomogram, using these models. CONCLUSIONS: Our prediction models, along with the web-based nomogram, are expected to be useful for the assessment of the onset of severe and critical illness among patients with COVID-19 and triage patients upon hospital admission.


Asunto(s)
COVID-19/diagnóstico , COVID-19/epidemiología , Modelos Estadísticos , Adolescente , Adulto , Anciano , Anciano de 80 o más Años , Niño , Preescolar , Estudios de Cohortes , Femenino , Humanos , Lactante , Recién Nacido , Masculino , Persona de Mediana Edad , República de Corea/epidemiología , Proyectos de Investigación , Estudios Retrospectivos , SARS-CoV-2/aislamiento & purificación , Adulto Joven
4.
Int J Mol Sci ; 20(8)2019 Apr 20.
Artículo en Inglés | MEDLINE | ID: mdl-31010051

RESUMEN

Interleukin (IL)-32θ, a newly identified IL-32 isoform, has been reported to exert pro-inflammatory effects through the association with protein kinase C delta (PKCδ). In this study, we further examined the effects of IL-32θ on IL-13 and IL-13Rα2 expression and the related mechanism in THP-1 cells. Upon stimulating IL-32θ-expressing and non-expressing cells with phorbol 12-myristate 13-acetate (PMA), the previous microarray analysis showed that IL-13Rα2 and IL-13 mRNA expression were significantly decreased by IL-32θ. The protein expression of these factors was also confirmed to be down-regulated. The nuclear translocation of transcription factors STAT3 and STAT6, which are necessary for IL-13Rα2 and IL-13 promoter activities, was suppressed by IL-32θ. Additionally, a direct association was found between IL-32θ, PKCδ, and signal transducer and activator of transcription 3 (STAT3), but not STAT6, revealing that IL-32θ might act mainly through STAT3 and indirectly affect STAT6. Moreover, the interaction of IL-32θ with STAT3 requires PKCδ, since blocking PKCδ activity eliminated the interaction and consequently limited the inhibitory effect of IL-32θ on STAT3 activity. Interfering with STAT3 or STAT6 binding by decoy oligodeoxynucleotides (ODNs) identified that IL-32θ had additive effects with the STAT3 decoy ODN to suppress IL-13 and IL-13Rα2 mRNA expression. Taken together, our data demonstrate the intracellular interaction of IL-32θ, PKCδ, and STAT3 to regulate IL-13 and IL-13Rα2 synthesis, supporting the role of IL-32θ as an inflammatory modulator.


Asunto(s)
Subunidad alfa2 del Receptor de Interleucina-13/metabolismo , Interleucina-13/metabolismo , Interleucinas/farmacología , Monocitos/metabolismo , Proteína Quinasa C-delta/metabolismo , Factor de Transcripción STAT3/metabolismo , Sitios de Unión , Regulación hacia Abajo/efectos de los fármacos , Regulación hacia Abajo/genética , Humanos , Subunidad alfa2 del Receptor de Interleucina-13/genética , Interleucinas/metabolismo , Modelos Biológicos , Monocitos/efectos de los fármacos , Regiones Promotoras Genéticas/genética , Unión Proteica/efectos de los fármacos , Factor de Transcripción STAT6/metabolismo , Transducción de Señal/efectos de los fármacos
5.
BMC Bioinformatics ; 19(Suppl 9): 288, 2018 Aug 13.
Artículo en Inglés | MEDLINE | ID: mdl-30367591

RESUMEN

BACKGROUND: Component-based structural equation modeling methods are now widely used in science, business, education, and other fields. This method uses unobservable variables, i.e., "latent" variables, and structural equation model relationships between observable variables. Here, we applied this structural equation modeling method to biologically structured data. To identify candidate drug-response biomarkers, we first used proteomic peptide-level data, as measured by multiple reaction monitoring mass spectrometry (MRM-MS), for liver cancer patients. MRM-MS is a highly sensitive and selective method for proteomic targeted quantitation of peptide abundances in complex biological samples. RESULTS: We developed a component-based drug response prediction model, having the advantage that it first combines collapsed peptide-level data into protein-level information, facilitating subsequent biological interpretation. Our model also uses an alternating least squares algorithm, to efficiently estimate both coefficients of peptides and proteins. This approach also considers correlations between variables, without constraint, by a multiple testing problem. Using estimated peptide and protein coefficients, we selected significant protein biomarkers by permutation testing, resulting in our model for predicting liver cancer response to the tyrosine kinase inhibitor sorafenib. CONCLUSIONS: Using data from a cohort of liver cancer patients, we then "fine-tuned" our model to successfully predict drug responses, as demonstrated by a high area under the curve (AUC) score. Such drug response prediction models may eventually find clinical translation in identifying individual patients likely to respond to specific therapies.


Asunto(s)
Algoritmos , Biomarcadores de Tumor/metabolismo , Neoplasias Hepáticas/metabolismo , Espectrometría de Masas/métodos , Modelos Estadísticos , Proteómica/métodos , Sorafenib/farmacología , Adulto , Anciano , Anciano de 80 o más Años , Antineoplásicos/farmacología , Estudios de Cohortes , Femenino , Humanos , Neoplasias Hepáticas/tratamiento farmacológico , Masculino , Persona de Mediana Edad
6.
Bioinformatics ; 32(17): i605-i610, 2016 09 01.
Artículo en Inglés | MEDLINE | ID: mdl-27587680

RESUMEN

MOTIVATION: Gene-gene interaction (GGI) is one of the most popular approaches for finding and explaining the missing heritability of common complex traits in genome-wide association studies. The multifactor dimensionality reduction (MDR) method has been widely studied for detecting GGI effects. However, there are several disadvantages of the existing MDR-based approaches, such as the lack of an efficient way of evaluating the significance of multi-locus models and the high computational burden due to intensive permutation. Furthermore, the MDR method does not distinguish marginal effects from pure interaction effects. METHODS: We propose a two-step unified model based MDR approach (UM-MDR), in which, the significance of a multi-locus model, even a high-order model, can be easily obtained through a regression framework with a semi-parametric correction procedure for controlling Type I error rates. In comparison to the conventional permutation approach, the proposed semi-parametric correction procedure avoids heavy computation in order to achieve the significance of a multi-locus model. The proposed UM-MDR approach is flexible in the sense that it is able to incorporate different types of traits and evaluate significances of the existing MDR extensions. RESULTS: The simulation studies and the analysis of a real example are provided to demonstrate the utility of the proposed method. UM-MDR can achieve at least the same power as MDR for most scenarios, and it outperforms MDR especially when there are some single nucleotide polymorphisms that only have marginal effects, which masks the detection of causal epistasis for the existing MDR approaches. CONCLUSIONS: UM-MDR provides a very good supplement of existing MDR method due to its efficiency in achieving significance for every multi-locus model, its power and its flexibility of handling different types of traits. AVAILABILITY AND IMPLEMENTATION: A R package "umMDR" and other source codes are freely available at http://statgen.snu.ac.kr/software/umMDR/ CONTACT: tspark@stats.snu.ac.kr SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online.


Asunto(s)
Epistasis Genética , Estudio de Asociación del Genoma Completo , Reducción de Dimensionalidad Multifactorial , Simulación por Computador , Humanos , Modelos Genéticos , Polimorfismo de Nucleótido Simple
7.
Bioinformatics ; 32(17): i611-i619, 2016 09 01.
Artículo en Inglés | MEDLINE | ID: mdl-27587681

RESUMEN

MOTIVATION: Recently, many methods have been developed for conducting rare-variant association studies for sequencing data. These methods have primarily been based on gene-level associations but have not been proven to be as effective as expected. Gene-set-level tests have shown great advantages over gene-level tests in terms of power and robustness, because complex diseases are often caused by multiple genes that comprise of biological gene sets. RESULTS: Here, we propose several novel gene-set tests that employ rapid and efficient dimensionality reduction. The performance of these tests was investigated using extensive simulations and application to 1058 whole-exome sequences from a Korean population. We identified some known pathways and novel pathways whose rare or common variants are associated with elevated liver enzymes and replicated the results in an independent cohort. AVAILABILITY AND IMPLEMENTATION: Source R code for our algorithm is freely available at http://statgen.snu.ac.kr/software/QTest CONTACT: tspark@stats.snu.ac.kr SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online.


Asunto(s)
Algoritmos , Secuenciación de Nucleótidos de Alto Rendimiento , Conjuntos de Datos como Asunto , Exoma , Estudios de Asociación Genética , Pruebas Genéticas , Variación Genética , Humanos , Corea (Geográfico) , Redes y Vías Metabólicas
8.
BMC Genomics ; 16 Suppl 9: S4, 2015.
Artículo en Inglés | MEDLINE | ID: mdl-26328610

RESUMEN

BACKGROUND: microRNA (miRNA) expression plays an influential role in cancer classification and malignancy, and miRNAs are feasible as alternative diagnostic markers for pancreatic cancer, a highly aggressive neoplasm with silent early symptoms, high metastatic potential, and resistance to conventional therapies. METHODS: In this study, we evaluated the benefits of multi-omics data analysis by integrating miRNA and mRNA expression data in pancreatic cancer. Using support vector machine (SVM) modelling and leave-one-out cross validation (LOOCV), we evaluated the diagnostic performance of single- or multi-markers based on miRNA and mRNA expression profiles from 104 PDAC tissues and 17 benign pancreatic tissues. For selecting even more reliable and robust markers, we performed validation by independent datasets from the Gene Expression Omnibus (GEO) data depository. For validation, miRNA activity was estimated by miRNA-target gene interaction and mRNA expression datasets in pancreatic cancer. RESULTS: Using a comprehensive identification approach, we successfully identified 705 multi-markers having powerful diagnostic performance for PDAC. In addition, these marker candidates annotated with cancer pathways using gene ontology analysis. CONCLUSIONS: Our prediction models have strong potential for the diagnosis of pancreatic cancer.


Asunto(s)
Biomarcadores de Tumor/genética , Biología Computacional , MicroARNs/metabolismo , Neoplasias Pancreáticas/diagnóstico , Neoplasias Pancreáticas/genética , ARN Mensajero/metabolismo , Transcriptoma , Humanos , Neoplasias Pancreáticas/metabolismo
9.
Biomed Eng Online ; 13 Suppl 2: S5, 2014.
Artículo en Inglés | MEDLINE | ID: mdl-25560450

RESUMEN

BACKGROUND: Pancreatic ductal adenocarcinoma (PDAC) is one of the most lethal tumors and usually presented with locally advanced and distant metastasis disease, which prevent curative resection or treatments. In this regard, we considered identifying molecular subtypes associated with clinicopathological factor as prognosis factors to stratify PDAC for appropriate treatment of patients. RESULTS: In this study, we identified three molecular subtypes which were significant on survival time and metastasis. We also identified significant genes and enriched pathways represented for each molecular subtype. Considering R0 resection patients included in each subtype, metastasis and survival times are significantly associated with subtype 1 and subtype 2. CONCLUSIONS: We observed three PDAC molecular subtypes and demonstrated that those subtypes were significantly related with metastasis and survival time. The study may have utility in stratifying patients for cancer treatment.


Asunto(s)
Biomarcadores de Tumor/metabolismo , Carcinoma Ductal Pancreático/diagnóstico , Carcinoma Ductal Pancreático/metabolismo , Diagnóstico por Computador/métodos , Neoplasias Pancreáticas/diagnóstico , Neoplasias Pancreáticas/metabolismo , Anciano , Carcinoma Ductal Pancreático/clasificación , Femenino , Humanos , Masculino , Neoplasias Pancreáticas/clasificación , Reproducibilidad de los Resultados , Sensibilidad y Especificidad
10.
Bioinformatics ; 28(18): i582-i588, 2012 Sep 15.
Artículo en Inglés | MEDLINE | ID: mdl-22962485

RESUMEN

MOTIVATION: For the past few decades, many statistical methods in genome-wide association studies (GWAS) have been developed to identify SNP-SNP interactions for case-control studies. However, there has been less work for prospective cohort studies, involving the survival time. Recently, Gui et al. (2011) proposed a novel method, called Surv-MDR, for detecting gene-gene interactions associated with survival time. Surv-MDR is an extension of the multifactor dimensionality reduction (MDR) method to the survival phenotype by using the log-rank test for defining a binary attribute. However, the Surv-MDR method has some drawbacks in the sense that it needs more intensive computations and does not allow for a covariate adjustment. In this article, we propose a new approach, called Cox-MDR, which is an extension of the generalized multifactor dimensionality reduction (GMDR) to the survival phenotype by using a martingale residual as a score to classify multi-level genotypes as high- and low-risk groups. The advantages of Cox-MDR over Surv-MDR are to allow for the effects of discrete and quantitative covariates in the frame of Cox regression model and to require less computation than Surv-MDR. RESULTS: Through simulation studies, we compared the power of Cox-MDR with those of Surv-MDR and Cox regression model for various heritability and minor allele frequency combinations without and with adjusting for covariate. We found that Cox-MDR and Cox regression model perform better than Surv-MDR for low minor allele frequency of 0.2, but Surv-MDR has high power for minor allele frequency of 0.4. However, when the effect of covariate is adjusted for, Cox-MDR and Cox regression model perform much better than Surv-MDR. We also compared the performance of Cox-MDR and Surv-MDR for a real data of leukemia patients to detect the gene-gene interactions with the survival time. CONTACT: leesy@sejong.ac.kr; tspark@snu.ac.kr.


Asunto(s)
Reducción de Dimensionalidad Multifactorial/métodos , Modelos de Riesgos Proporcionales , Algoritmos , Femenino , Frecuencia de los Genes , Estudio de Asociación del Genoma Completo , Genotipo , Humanos , Leucemia Mieloide Aguda/genética , Leucemia Mieloide Aguda/mortalidad , Masculino , Modelos Genéticos , Fenotipo , Polimorfismo de Nucleótido Simple
11.
Front Med (Lausanne) ; 10: 1239789, 2023.
Artículo en Inglés | MEDLINE | ID: mdl-38239614

RESUMEN

Background: Understanding the clinical course and pivotal time points of COVID-19 aggravation is critical for enhancing patient monitoring. This retrospective, multi-center cohort study aims to identify these significant time points and associate them with potential risk factors, leveraging data from a sizable cohort with mild-to-moderate symptoms upon admission. Methods: This study included data from 1,696 COVID-19 patients with mild-to-moderate clinical severity upon admission across multiple hospitals in Daegu-Kyungpook Province (Daegu dataset) between February 18 and early March 2020 and 321 COVID-19 patients at Seoul Boramae Hospital (Boramae dataset) collected from February to July 2020. The approach involved: (1) identifying the optimal time point for aggravation using survival analyses with maximally selected rank statistics; (2) investigating the relationship between comorbidities and time to aggravation; and (3) developing prediction models through machine learning techniques. The models were validated internally among patients from the Daegu dataset and externally among patients from the Boramae dataset. Results: The Daegu dataset showed a mean age of 51.0 ± 19.6 years, with 8 days for aggravation and day 5 being identified as the pivotal point for survival. Contrary to previous findings, specific comorbidities had no notable impact on aggravation patterns. Prediction models utilizing factors including age and chest X-ray infiltration demonstrated promising performance, with the top model achieving an AUC of 0.827 in external validation for 5 days aggravation prediction. Conclusion: Our study highlights the crucial significance of the initial 5 days period post-admission in managing COVID-19 patients. The identification of this pivotal time frame, combined with our robust predictive models, provides valuable insights for early intervention strategies. This research underscores the potential of proactive monitoring and timely interventions in enhancing patient outcomes, particularly for those at risk of rapid aggravation. Our findings offer a meaningful contribution to understanding the COVID-19 clinical course and supporting healthcare providers in optimizing patient care and resource allocation.

12.
J Hepatobiliary Pancreat Sci ; 30(1): 122-132, 2023 Jan.
Artículo en Inglés | MEDLINE | ID: mdl-33991409

RESUMEN

BACKGROUND/PURPOSE: The current study aimed to develop a prediction model using a multi-marker panel as a diagnostic screening tool for pancreatic ductal adenocarcinoma. METHODS: Multi-center cohort of 1991 blood samples were collected from January 2011 to September 2019, of which 609 were normal, 145 were other cancer (colorectal, thyroid, and breast cancer), 314 were pancreatic benign disease, and 923 were pancreatic ductal adenocarcinoma. The automated multi-biomarker Enzyme-Linked Immunosorbent Assay kit was developed using three potential biomarkers: LRG1, TTR, and CA 19-9. Using a logistic regression model on a training data set, the predicted values for pancreatic ductal adenocarcinoma were obtained, and the result was classification into one of the three risk groups: low, intermediate, and high. The five covariates used to create the model were sex, age, and three biomarkers. RESULTS: Participants were categorized into four groups as normal (n = 609), other cancer (n = 145), pancreatic benign disease (n = 314), and pancreatic ductal adenocarcinoma (n = 923). The normal, other cancer, and pancreatic benign disease groups were clubbed into the non-pancreatic ductal adenocarcinoma group (n = 1068). The positive and negative predictive value, sensitivity, and specificity were 94.12, 90.40, 93.81, and 90.86, respectively. CONCLUSIONS: This study demonstrates a significant diagnostic performance of the multi-marker panel in distinguishing pancreatic ductal adenocarcinoma from normal and benign pancreatic disease states, as well as patients with other cancers.


Asunto(s)
Adenocarcinoma , Carcinoma Ductal Pancreático , Enfermedades Pancreáticas , Neoplasias Pancreáticas , Humanos , Biomarcadores de Tumor , Neoplasias Pancreáticas/patología , Carcinoma Ductal Pancreático/patología , Adenocarcinoma/diagnóstico , Adenocarcinoma/patología , Neoplasias Pancreáticas
14.
Breast Cancer Res Treat ; 132(2): 499-509, 2012 Apr.
Artículo en Inglés | MEDLINE | ID: mdl-21667120

RESUMEN

A model for a more precise prognosis of the risk of relapse is needed to avoid overtreatment of lymph node-negative breast cancer patients. A large derivation data set (n = 684) was generated by pooling three independent breast cancer expression microarray data sets. Two major prognostic factors, proliferation and immune response, were identified among genes showing significant differential expression levels between the good outcome and poor outcome groups. For each factor, four proliferation-related genes (p-genes) and four immunity-related genes (i-genes) were selected as prognostic genes, and a prognostic model for lymph node-negative breast cancer patients was developed using a parametric survival analysis based on the lognormal distribution. The p-genes showed a predominantly negative correlation (coefficient: -0.603) with survival time, while the i-genes showed a positive correlation (coefficient: 0.243), reflecting the beneficial effect of the immune response against deleterious proliferative activity. The prognostic model shows that approximately 54% of lymph node-negative breast cancer patients were predicted to be distant metastasis-free for more than 5 years with at least 85% survival probability. The prognostic model showed a robust and high prognostic performance (HR 2.85-3.45) through three external validation data sets. Based on the integration of proliferation and immunity, the new prognostic model is expected to improve clinical decision making by providing easily interpretable survival probabilities at any time point and functional causality of the predicted prognosis with respect to proliferation and immune response.


Asunto(s)
Neoplasias de la Mama/inmunología , Neoplasias de la Mama/patología , Proliferación Celular , Ganglios Linfáticos/patología , Modelos Estadísticos , Neoplasias de la Mama/genética , Neoplasias de la Mama/mortalidad , Neoplasias de la Mama/secundario , Neoplasias de la Mama/terapia , Análisis por Conglomerados , Análisis Discriminante , Supervivencia sin Enfermedad , Femenino , Perfilación de la Expresión Génica/métodos , Regulación Neoplásica de la Expresión Génica , Humanos , Estimación de Kaplan-Meier , Modelos Lineales , Invasividad Neoplásica , Análisis de Secuencia por Matrices de Oligonucleótidos , Reproducibilidad de los Resultados , Medición de Riesgo , Factores de Riesgo , Factores de Tiempo , Resultado del Tratamiento
15.
Genomics Inform ; 20(2): e23, 2022 Jun.
Artículo en Inglés | MEDLINE | ID: mdl-35794703

RESUMEN

A survival prediction model has recently been developed to evaluate the prognosis of resected nonmetastatic pancreatic ductal adenocarcinoma based on a Cox model using two nationwide databases: Surveillance, Epidemiology and End Results (SEER) and Korea Tumor Registry System-Biliary Pancreas (KOTUS-BP). In this study, we applied two machine learning methods-random survival forests (RSF) and support vector machines (SVM)-for survival analysis and compared their prediction performance using the SEER and KOTUS-BP datasets. Three schemes were used for model development and evaluation. First, we utilized data from SEER for model development and used data from KOTUS-BP for external evaluation. Second, these two datasets were swapped by taking data from KOTUS-BP for model development and data from SEER for external evaluation. Finally, we mixed these two datasets half and half and utilized the mixed datasets for model development and validation. We used 9,624 patients from SEER and 3,281 patients from KOTUS-BP to construct a prediction model with seven covariates: age, sex, histologic differentiation, adjuvant treatment, resection margin status, and the American Joint Committee on Cancer 8th edition T-stage and N-stage. Comparing the three schemes, the performance of the Cox model, RSF, and SVM was better when using the mixed datasets than when using the unmixed datasets. When using the mixed datasets, the C-index, 1-year, 2-year, and 3-year time-dependent areas under the curve for the Cox model were 0.644, 0.698, 0.680, and 0.687, respectively. The Cox model performed slightly better than RSF and SVM.

16.
J Gastrointest Surg ; 26(8): 1705-1712, 2022 08.
Artículo en Inglés | MEDLINE | ID: mdl-35641810

RESUMEN

BACKGROUND: Sequential extended cholecystectomy (SEC) is currently recommended for T2 and higher gallbladder cancer (GBC) diagnosed after simple cholecystectomy (SC), but the value and timing of re-resection has not been fully studied. We evaluated the long-term oncologic outcomes of T2 GBC according to the type of surgery performed and investigated the optimal timing for SEC. METHODS: Patients diagnosed with T2 GBC who underwent SC, extended cholecystectomy (EC), or SEC between 2002 and 2017 were retrospectively reviewed. Those who underwent other surgical procedures or those with incomplete medical records were excluded. Overall survival (OS) and disease-free survival (DFS) according to the types of surgeries and prognostic factors for OS and DFS were analyzed. Survival analysis was done between groups that were divided according to the optimal cutoff time interval between SC and SEC based on DFS data. RESULTS: Of the 226 T2 GBC patients, 53, 173, and 44 underwent SC, EC, and SEC, respectively. The 5-year OS rate was 50.1%, 73.2%, and 78.7%, and the DFS rate was 46.8%, 66.3%, and 65.2% in the SC, EC, and SEC groups, respectively. EC (p = 0.001 and p = 0.001) and SEC (p = 0.007 and p = 0.065) groups had better 5-year OS and DFS rates than the SC group. Preoperative CA 19-9 level > 37 U/mL (HR 1.56; 95% CI 1.87-2.79; p < 0.001) and N1 stage (HR 2.88; 95% CI 1.76-4.71; p < 0.001) were associated with poorer prognosis. The optimal cutoff interval between SC and SEC was 28 days. Patients who underwent SEC ≤ 28 days after the initial cholecystectomy had better 5-year DFS rates than patients who underwent SEC after > 28 days (75.0% vs. 52.8%, p = 0.023). CONCLUSIONS: SEC is recommended for T2 GBC diagnosed after SC, because SEC provides better survival outcomes than SC alone. A time interval of less than 28 days to SEC is associated with an improved DFS.


Asunto(s)
Neoplasias de la Vesícula Biliar , Colecistectomía/métodos , Neoplasias de la Vesícula Biliar/patología , Humanos , Escisión del Ganglio Linfático , Estadificación de Neoplasias , Estudios Retrospectivos
17.
Front Public Health ; 10: 1007205, 2022.
Artículo en Inglés | MEDLINE | ID: mdl-36518574

RESUMEN

Background: As the worldwide spread of coronavirus disease 2019 (COVID-19) continues for a long time, early prediction of the maximum severity is required for effective treatment of each patient. Objective: This study aimed to develop predictive models for the maximum severity of hospitalized COVID-19 patients using artificial intelligence (AI)/machine learning (ML) algorithms. Methods: The medical records of 2,263 COVID-19 patients admitted to 10 hospitals in Daegu, Korea, from February 18, 2020, to May 19, 2020, were comprehensively reviewed. The maximum severity during hospitalization was divided into four groups according to the severity level: mild, moderate, severe, and critical. The patient's initial hospitalization records were used as predictors. The total dataset was randomly split into a training set and a testing set in a 2:1 ratio, taking into account the four maximum severity groups. Predictive models were developed using the training set and were evaluated using the testing set. Two approaches were performed: using four groups based on original severity levels groups (i.e., 4-group classification) and using two groups after regrouping the four severity level into two (i.e., binary classification). Three variable selection methods including randomForestSRC were performed. As AI/ML algorithms for 4-group classification, GUIDE and proportional odds model were used. For binary classification, we used five AI/ML algorithms, including deep neural network and GUIDE. Results: Of the four maximum severity groups, the moderate group had the highest percentage (1,115 patients; 49.5%). As factors contributing to exacerbation of maximum severity, there were 25 statistically significant predictors through simple analysis of linear trends. As a result of model development, the following three models based on binary classification showed high predictive performance: (1) Mild vs. Above Moderate, (2) Below Moderate vs. Above Severe, and (3) Below Severe vs. Critical. The performance of these three binary models was evaluated using AUC values 0.883, 0.879, and, 0.887, respectively. Based on results for each of the three predictive models, we developed web-based nomograms for clinical use (http://statgen.snu.ac.kr/software/nomogramDaeguCovid/). Conclusions: We successfully developed web-based nomograms predicting the maximum severity. These nomograms are expected to help plan an effective treatment for each patient in the clinical field.


Asunto(s)
COVID-19 , Humanos , COVID-19/epidemiología , Inteligencia Artificial , Hospitalización , Aprendizaje Automático , Redes Neurales de la Computación
18.
BMC Bioinformatics ; 12: 377, 2011 Sep 26.
Artículo en Inglés | MEDLINE | ID: mdl-21943316

RESUMEN

BACKGROUND: Many gene-set analysis methods have been previously proposed and compared through simulation studies and analysis of real datasets for binary phenotypes. We focused on the survival phenotype and compared the performances of Gene Set Enrichment Analysis (GSEA), Global Test (GT), Wald-type Test (WT) and Global Boost Test (GBST) methods in a simulation study and on two ovarian cancer data sets. We considered two versions of GSEA by allowing different weights: GSEA1 uses equal weights, yielding results similar to the Kolmogorov-Smirnov test; while GSEA2's weights are based on the correlation between genes and the phenotype. RESULTS: We compared GSEA1, GSEA2, GT, WT and GBST in a simulation study with various settings for the correlation structure of the genes and the association parameter between the survival outcome and the genes. Simulation results indicated that GT, WT and GBST consistently have higher power than GSEA1 and GSEA2 across all scenarios. However, the power of the five tests depends on the combination of correlation structure and association parameter. For the ovarian cancer data set, using the FDR threshold of q < 0.1, the GT, WT and GBST detected 12, 6 and 8 significant pathways, respectively, whereas neither GSEA1 nor GSEA2 detected any significant pathways. In addition, among the pathways found significant by GT, WT, and GBST, three pathways--Purine metabolism, Leukocyte transendothelial migration and Jak-STAT signaling pathway--overlapped with those reported in previous ovarian cancer microarray studies. CONCLUSION: Simulation studies and a real data example indicate that GT, WT and GBST tend to have high power, whereas GSEA1 and GSEA2 have lower power. We also found that the power of the five tests is much higher when genes are correlated than when genes are independent, when survival is positively associated with genes. It seems that there is a synergistic effect in detecting significant gene sets when significant genes have within-class correlation and the association between survival and genes is positive or negative (i.e., one-direction correlation).


Asunto(s)
Perfilación de la Expresión Génica/métodos , Neoplasias Ováricas/genética , Neoplasias Ováricas/mortalidad , Simulación por Computador , Femenino , Humanos , Análisis de Secuencia por Matrices de Oligonucleótidos/métodos , Fenotipo , Proyectos de Investigación , Análisis de Supervivencia
19.
BioData Min ; 14(1): 17, 2021 Mar 01.
Artículo en Inglés | MEDLINE | ID: mdl-33648540

RESUMEN

BACKGROUND: For gene-gene interaction analysis, the multifactor dimensionality reduction (MDR) method has been widely employed to reduce multi-levels of gene-gene interactions into high- or low-risk groups using a binary attribute. For the survival phenotype, the Cox-MDR method has been proposed using a martingale residual of a Cox model since Surv-MDR was first proposed using a log-rank test statistic. Recently, the KM-MDR method was proposed using the Kaplan-Meier median survival time as a classifier. All three methods used the cross-validation procedure to identify single nucleotide polymorphism (SNP) using SNP interactions among all possible SNP pairs. Furthermore, these methods require the permutation test to verify the significance of the selected SNP pairs. However, the unified model-based multifactor dimensionality reduction method (UM-MDR) overcomes this shortcoming of MDR by unifying the significance testing with the MDR algorithm within the framework of the regression model. Neither cross-validation nor permutation testing is required to identify SNP by SNP interactions in the UM-MDR method. The UM-MDR method comprises two steps: in the first step, multi-level genotypes are classified into high- or low-risk groups, and an indicator variable for the high-risk group is defined. In the second step, the significance of the indicator variable of the high-risk group is tested in the regression model included with other adjusting covariates. The Cox-UMMDR method was recently proposed by combining Cox-MDR with UM-MDR to identify gene-gene interactions associated with the survival phenotype. In this study, we propose two simple methods either by combining KM-MDR with UM-MDR, called KM-UMMDR or by modifying Cox-UMMDR by adjusting for the covariate effect in step 1, rather than in step 2, a process called Cox2-UMMDR. The KM-UMMDR method allows the covariate effect to be adjusted for in the regression model of step 2, although KM-MDR cannot adjust for the covariate effect in the classification procedure of step 1. In contrast, Cox2-UMMDR differs from Cox-UMMDR in the sense that the martingale residuals are obtained from a Cox model by adjusting for the covariate effect in step 1 of Cox2-UMMDR whereas Cox-UMMDR adjusts for the covariate effect in the regression model in step 2. We performed simulation studies to compare the power of several methods such as KM-UMMDR, Cox-UMMDR, Cox2-UMMDR, Cox-MDR, and KM-MDR by considering the effect of covariates and the marginal effect of SNPs. We also analyzed a real example of Korean leukemia patient data for illustration and a short discussion is provided. RESULTS: In the simulation study, two different scenarios are considered: the first scenario compares the power of the cases with and without the covariate effect. The second scenario is to compare the power of cases with the main effect of SNPs versus without the main effect of SNPs. From the simulation results, Cox-UMMDR performs the best across all scenarios among KM-UMMDR, Cox2-UMMDR, Cox-MDR and KM-MDR. As expected, both Cox-UMMDR and Cox-MDR perform better than KM-UMMDR and KM-MDR when a covariate effect exists because the former adjusts for the covariate effect but the latter cannot. However, Cox2-UMMDR behaves similarly to KM-UMMDR and KM-MDR even though there is a covariate effect. This implies that the covariate effect would be more efficiently adjusted for in the regression model of the second step rather than under the classification procedure of the first step. When there is a main effect of any SNP, Cox-UMMDR, Cox2-UMMDR and KM-UMMDR perform better than Cox-MDR and KM-MDR if the main effects of SNPs are properly adjusted for in the regression model. From the simulation results of two different scenarios, Cox-UMMDR seems to be the most robust when there is either any covariate effect adjusting for or any SNP that has a main effect on the survival phenotype. In addition, the power of all methods decreased as the censoring fraction increased from 0.1 to 0.3, as heritability increased. The power of all methods seems to be greater under MAF = 0.2 than under MAF = 0.4. For illustration, both KM-UMMDR and Cox2-UMMDR were applied to identify SNP by SNP interactions with the survival phenotype to a real dataset of Korean leukemia patients. CONCLUSION: Both KM-UMMDR and Cox2-UMMDR were easily implemented by combining KM-MDR and Cox-MDR with UM-MDR, respectively, to detect significant gene-gene interactions associated with survival time without cross-validation and permutation testing. The simulation results demonstrate the utility of KM-UMMDR, Cox2-UMMDR and Cox-UMMDR, which outperforms Cox-MDR and KM-MDR when some SNPs with only marginal effects might mask the detection of causal epistasis. In addition, Cox-UMMDR, Cox2-UMMDR and Cox-MDR performed better than KM-UMMDR and KM-MDR when there were potentially confounding covariate effects.

20.
Front Genet ; 12: 634922, 2021.
Artículo en Inglés | MEDLINE | ID: mdl-34267778

RESUMEN

In the "personalized medicine" era, one of the most difficult problems is identification of combined markers from different omics platforms. Many methods have been developed to identify candidate markers for each type of omics data, but few methods facilitate the identification of multiple markers on multi-omics platforms. microRNAs (miRNAs) is well known to affect only indirectly phenotypes by regulating mRNA expression and/or protein translation. To take into account this knowledge into practice, we suggest a miRNA-mRNA integration model for survival time analysis, called mimi-surv, which accounts for the biological relationship, to identify such integrated markers more efficiently. Through simulation studies, we found that the statistical power of mimi-surv be better than other models. Application to real datasets from Seoul National University Hospital and The Cancer Genome Atlas demonstrated that mimi-surv successfully identified miRNA-mRNA integrations sets associated with progression-free survival of pancreatic ductal adenocarcinoma (PDAC) patients. Only mimi-surv found miR-96, a previously unidentified PDAC-related miRNA in these two real datasets. Furthermore, mimi-surv was shown to identify more PDAC related miRNAs than other methods because it used the known structure for miRNA-mRNA regularization. An implementation of mimi-surv is available at http://statgen.snu.ac.kr/software/mimi-surv.

SELECCIÓN DE REFERENCIAS
DETALLE DE LA BÚSQUEDA