Your browser doesn't support javascript.
loading
Mostrar: 20 | 50 | 100
Resultados 1 - 20 de 22
Filtrar
1.
JMIR Med Inform ; 12: e53400, 2024 Mar 21.
Artigo em Inglês | MEDLINE | ID: mdl-38513229

RESUMO

BACKGROUND: Predicting the bed occupancy rate (BOR) is essential for efficient hospital resource management, long-term budget planning, and patient care planning. Although macro-level BOR prediction for the entire hospital is crucial, predicting occupancy at a detailed level, such as specific wards and rooms, is more practical and useful for hospital scheduling. OBJECTIVE: The aim of this study was to develop a web-based support tool that allows hospital administrators to grasp the BOR for each ward and room according to different time periods. METHODS: We trained time-series models based on long short-term memory (LSTM) using individual bed data aggregated hourly each day to predict the BOR for each ward and room in the hospital. Ward training involved 2 models with 7- and 30-day time windows, and room training involved models with 3- and 7-day time windows for shorter-term planning. To further improve prediction performance, we added 2 models trained by concatenating dynamic data with static data representing room-specific details. RESULTS: We confirmed the results of a total of 12 models using bidirectional long short-term memory (Bi-LSTM) and LSTM, and the model based on Bi-LSTM showed better performance. The ward-level prediction model had a mean absolute error (MAE) of 0.067, mean square error (MSE) of 0.009, root mean square error (RMSE) of 0.094, and R2 score of 0.544. Among the room-level prediction models, the model that combined static data exhibited superior performance, with a MAE of 0.129, MSE of 0.050, RMSE of 0.227, and R2 score of 0.600. Model results can be displayed on an electronic dashboard for easy access via the web. CONCLUSIONS: We have proposed predictive BOR models for individual wards and rooms that demonstrate high performance. The results can be visualized through a web-based dashboard, aiding hospital administrators in bed operation planning. This contributes to resource optimization and the reduction of hospital resource use.

2.
Heliyon ; 10(2): e24620, 2024 Jan 30.
Artigo em Inglês | MEDLINE | ID: mdl-38304832

RESUMO

Background and Objective: Although interest in predicting drug-drug interactions is growing, many predictions are not verified by real-world data. This study aimed to confirm whether predicted polypharmacy side effects using public data also occur in data from actual patients. Methods: We utilized a deep learning-based polypharmacy side effects prediction model to identify cefpodoxime-chlorpheniramine-lung edema combination with a high prediction score and a significant patient population. The retrospective study analyzed patients over 18 years old who were admitted to the Asan medical center between January 2000 and December 2020 and took cefpodoxime or chlorpheniramine orally. The three groups, cefpodoxime-treated, chlorpheniramine-treated, and cefpodoxime & chlorpheniramine-treated were compared using inverse probability of treatment weighting (IPTW) to balance them. Differences between the three groups were analyzed using the Kaplan-Meier method and Cox proportional hazards model. Results: The study population comprised 54,043 patients with a history of taking cefpodoxime, 203,897 patients with a history of taking chlorpheniramine, and 1,628 patients with a history of taking cefpodoxime and chlorpheniramine simultaneously. After adjustment, the 1-year cumulative incidence of lung edema in the patient group that took cefpodoxime and chlorpheniramine simultaneously was significantly higher than in the patient groups that took cefpodoxime or chlorpheniramine only (p=0.001). Patients taking cefpodoxime and chlorpheniramine together had an increased risk of lung edema compared to those taking cefpodoxime alone [hazard ratio (HR) 2.10, 95% CI 1.26-3.52, p<0.005] and those taking chlorpheniramine alone, which also increased the risk of lung edema (HR 1.64, 95% CI 0.99-2.69, p=0.05). Conclusions: Validation of polypharmacy side effect predictions with real-world data can aid patient and clinician decision-making before conducting randomized controlled trials. Simultaneous use of cefpodoxime and chlorpheniramine was associated with a higher long-term risk of lung edema compared to the use of cefpodoxime or chlorpheniramine alone.

4.
Eur J Clin Invest ; 54(5): e14161, 2024 May.
Artigo em Inglês | MEDLINE | ID: mdl-38239087

RESUMO

BACKGROUND: The metabolically healthy obese (MHO) phenotype is associated with an increased risk of coronary heart disease (CHD) in the general population. However, association of metabolic health and obesity phenotypes with CHD risk in adult cancer survivors remains unclear. We aimed to investigate the associations between different metabolic health and obesity phenotypes with incident CHD in adult cancer survivors. METHODS: We used National Health Insurance Service (NHIS) to identify a cohort of 173,951 adult cancer survivors aged more than 20 years free of cardiovascular complications. Metabolically healthy nonobese (MHN), MHO, metabolically unhealthy nonobese (MUN), metabolically unhealthy obese (MUO) phenotypes were created using as at least three out of five metabolic health criteria along with obesity (body mass index ≥ 25.0 kg/m2). We used Cox proportional hazards model to assess CHD risk in each metabolic health and obesity phenotypes. RESULTS: During 1,376,050 person-years of follow-up, adult cancer survivors with MHO phenotype had a significantly higher risk of CHD (hazard ratio [HR] = 1.52; 95% confidence intervals [CI]: 1.41 to 1.65) as compared to those without obesity and metabolic abnormalities. MUN (HR = 1.81; 95% CI: 1.59 to 2.06) and MUO (HR = 1.92; 95% CI: 1.72 to 2.15) phenotypes were also associated with an increased risk of CHD among adult cancer survivors. CONCLUSIONS: Adult cancer survivors with MHO phenotype had a higher risk of CHD than those who are MHN. Metabolic health status and obesity were jointly associated with CHD risk in adult cancer survivors.


Assuntos
Sobreviventes de Câncer , Doenças Cardiovasculares , Doença das Coronárias , Síndrome Metabólica , Neoplasias , Obesidade Metabolicamente Benigna , Adulto , Humanos , Fatores de Risco , Doenças Cardiovasculares/epidemiologia , Neoplasias/epidemiologia , Neoplasias/complicações , Obesidade/complicações , Obesidade/epidemiologia , Índice de Massa Corporal , Doença das Coronárias/epidemiologia , Doença das Coronárias/complicações , Fenótipo , Obesidade Metabolicamente Benigna/epidemiologia , Síndrome Metabólica/epidemiologia , Síndrome Metabólica/complicações
5.
Comput Biol Med ; 168: 107738, 2024 01.
Artigo em Inglês | MEDLINE | ID: mdl-37995536

RESUMO

Electronic medical records(EMR) have considerable potential to advance healthcare technologies, including medical AI. Nevertheless, due to the privacy issues associated with the sharing of patient's personal information, it is difficult to sufficiently utilize them. Generative models based on deep learning can solve this problem by creating synthetic data similar to real patient data. However, the data used for training these deep learning models run into the risk of getting leaked because of malicious attacks. This means that traditional deep learning-based generative models cannot completely solve the privacy issues. Therefore, we suggested a method to prevent the leakage of training data by protecting the model from malicious attacks using local differential privacy(LDP). Our method was evaluated in terms of utility and privacy. Experimental results demonstrated that the proposed method can generate medical data with reasonable performance while protecting training data from malicious attacks.


Assuntos
Registros Eletrônicos de Saúde , Privacidade , Humanos , Instalações de Saúde
6.
Health Care Manag Sci ; 27(1): 114-129, 2024 Mar.
Artigo em Inglês | MEDLINE | ID: mdl-37921927

RESUMO

Overcrowding of emergency departments is a global concern, leading to numerous negative consequences. This study aimed to develop a useful and inexpensive tool derived from electronic medical records that supports clinical decision-making and can be easily utilized by emergency department physicians. We presented machine learning models that predicted the likelihood of hospitalizations within 24 hours and estimated waiting times. Moreover, we revealed the enhanced performance of these machine learning models compared to existing models by incorporating unstructured text data. Among several evaluated models, the extreme gradient boosting model that incorporated text data yielded the best performance. This model achieved an area under the receiver operating characteristic curve score of 0.922 and an area under the precision-recall curve score of 0.687. The mean absolute error revealed a difference of approximately 3 hours. Using this model, we classified the probability of patients not being admitted within 24 hours as Low, Medium, or High and identified important variables influencing this classification through explainable artificial intelligence. The model results are readily displayed on an electronic dashboard to support the decision-making of emergency department physicians and alleviate overcrowding, thereby resulting in socioeconomic benefits for medical facilities.


Assuntos
Inteligência Artificial , Listas de Espera , Humanos , Hospitalização , Serviço Hospitalar de Emergência , Aprendizado de Máquina , Estudos Retrospectivos
7.
Sci Rep ; 13(1): 22461, 2023 12 18.
Artigo em Inglês | MEDLINE | ID: mdl-38105280

RESUMO

As warfarin has a narrow therapeutic window and obvious response variability among individuals, it is difficult to rapidly determine personalized warfarin dosage. Adverse drug events(ADE) resulting from warfarin overdose can be critical, so that typically physicians adjust the warfarin dosage through the INR monitoring twice a week when starting warfarin. Our study aimed to develop machine learning (ML) models that predicts the discharge dosage of warfarin as the initial warfarin dosage using clinical data derived from electronic medical records within 2 days of hospitalization. During this retrospective study, adult patients who were prescribed warfarin at Asan Medical Center (AMC) between January 1, 2018, and October 31, 2020, were recruited as a model development cohort (n = 3168). Additionally, we created an external validation dataset (n = 891) from a Medical Information Mart for Intensive Care III (MIMIC-III). Variables for a model prediction were selected based on the clinical rationale that turned out to be associated with warfarin dosage, such as bleeding. The discharge dosage of warfarin was used the study outcome, because we assumed that patients achieved target INR at discharge. In this study, four ML models that predicted the warfarin discharge dosage were developed. We evaluated the model performance using the mean absolute error (MAE) and prediction accuracy. Finally, we compared the accuracy of the predictions of our models and the predictions of physicians for 40 data point to verify a clinical relevance of the models. The MAEs obtained using the internal validation set were as follows: XGBoost, 0.9; artificial neural network, 0.9; random forest, 1.0; linear regression, 1.0; and physicians, 1.3. As a result, our models had better prediction accuracy than the physicians, who have difficulty determining the warfarin discharge dosage using clinical information obtained within 2 days of hospitalization. We not only conducted the internal validation but also external validation. In conclusion, our ML model could help physicians predict the warfarin discharge dosage as the initial warfarin dosage from Korean population. However, conducting a successfully external validation in a further work is required for the application of the models.


Assuntos
Alta do Paciente , Varfarina , Adulto , Humanos , Varfarina/efeitos adversos , Estudos Retrospectivos , Pacientes Internados , Anticoagulantes/efeitos adversos , Aprendizado de Máquina
8.
Sci Rep ; 13(1): 16837, 2023 10 06.
Artigo em Inglês | MEDLINE | ID: mdl-37803039

RESUMO

Adult cancer survivors may have an increased risk of developing ischemic stroke, potentially influenced by cancer treatment-related factors and shared risk factors with stroke. However, the association between gamma-glutamyl transferase (GGT) levels and the risk of ischemic stroke in this population remains understudied. Therefore, our study aimed to examine the relationship between GGT levels and the risk of ischemic stroke using a population-based cohort of adult cancer survivors. A population-based cohort of adult cancer survivors was derived from the National Health Insurance Service-Health Screening Cohort between 2003 and 2005 who survived after diagnosis of primary cancer and participated in the biennial national health screening program between 2009 and 2010. Cox proportional hazards model adjusted for sociodemographic factors, health status and behavior, and clinical characteristics was used to investigate the association between GGT level and ischemic stroke in adult cancer survivors. Among 3095 adult cancer survivors, 80 (2.58%) incident cases of ischemic stroke occurred over a mean follow-up of 8.2 years. Compared to the lowest GGT quartile, the hazard ratios (HRs) for ischemic stroke were 1.56 (95% CI 0.75-3.26), 2.36 (95% CI 1.12-4.99), and 2.40 (95% CI 1.05-5.46) for the second, third, and fourth sex-specific quartiles, respectively (Ptrend = 0.013). No significant effect modification was observed by sex, insurance premium, and alcohol consumption. High GGT level is associated with an increased risk of ischemic stroke in adult cancer survivors independent of sex, insurance premium, and alcohol consumption.


Assuntos
Sobreviventes de Câncer , AVC Isquêmico , Neoplasias , Acidente Vascular Cerebral , Masculino , Feminino , Humanos , Adulto , AVC Isquêmico/complicações , Estudos de Coortes , gama-Glutamiltransferase , Fatores de Risco , Acidente Vascular Cerebral/epidemiologia , Acidente Vascular Cerebral/etiologia , Neoplasias/complicações
9.
PLoS One ; 18(5): e0286346, 2023.
Artigo em Inglês | MEDLINE | ID: mdl-37228155

RESUMO

BACKGROUND: Dietary sodium intake is a crucial lifestyle factor that should be assessed in adult cancer survivors due to their increased risk of adverse health outcomes compared to the general population. However, its with impaired fasting glucose (IFG) in adult cancer survivors remains unclear. This study aimed to investigate the association of dietary sodium intake categorized by the American Heart Association (AHA) recommendation with IFG in the community-dwelling adult cancer survivors. METHODS: A total of 1,052 adult cancer survivors without diabetes were identified from the sixth and seventh Korea National Health and Nutrition Examination Survey (KNHANES), 2013-2018. Data on dietary sodium intake was categorized as <1,500 mg/day, 1,500-2,999 mg/day, 2,300-3,999 mg/day, and ≥4,000 mg/day according to the AHA recommendation. A multiple logistic regression model adjusted for demographic, lifestyle, and health status was used to compute odds ratios (OR) and 95% confidence intervals (95% CI) for IFG according to dietary sodium intake categories. RESULTS: After adjusting for confounding variables identified in the KNHANES, the adjusted OR among the adult cancer survivors who consumed 1,500-2,999 mg/day, 2,300-3,999 mg/day, and ≥4,000 mg/day of dietary sodium were 1.16 (95% CI: 0.25-5.27), 1.93 (95% CI: 0.40-9.37), and 2.67 (95% CI: 0.59-12.18), respectively, as compared to those who consumed <1,500 mg/day (P value for trend = 0.036). CONCLUSION: Among community-dwelling adult cancer survivors, high dietary sodium intake was marginally associated with increased odds of IFG. Well-designed cohort studies or randomized clinical trials are needed to establish more epidemiologic evidence on this association in adult cancer survivors.


Assuntos
Sobreviventes de Câncer , Neoplasias , Estado Pré-Diabético , Sódio na Dieta , Humanos , Adulto , Estudos Transversais , Inquéritos Nutricionais , Estado Pré-Diabético/epidemiologia , Jejum , Glucose , Neoplasias/epidemiologia
11.
Cardiovasc Drugs Ther ; 37(1): 129-140, 2023 02.
Artigo em Inglês | MEDLINE | ID: mdl-34622354

RESUMO

PURPOSE: To estimate the risk of recurrent cardiovascular events in a real-world population of very high-risk Korean patients with prior myocardial infarction (MI), ischemic stroke (IS), or symptomatic peripheral artery disease (sPAD), similar to the Further cardiovascular OUtcomes Research with proprotein convertase subtilisin-kexin type 9 Inhibition in subjects with Elevated Risk (FOURIER) trial population. METHODS: This retrospective study used the Asan Medical Center Heart Registry database built on electronic medical records (EMR) from 2000 to 2016. Patients with a history of clinically evident atherosclerotic cardiovascular disease (ASCVD) with multiple risk factors were followed up for 3 years. The primary endpoint was a composite of MI, stroke, hospitalization for unstable angina, coronary revascularization, and all-cause mortality. RESULTS: Among 15,820 patients, the 3-year cumulative incidence of the composite primary endpoint was 15.3% and the 3-year incidence rate was 5.7 (95% CI 5.5-5.9) per 100 person-years. At individual endpoints, the rates of deaths, MI, and IS were 0.4 (0.3-0.4), 0.9 (0.8-0.9), and 0.8 (0.7-0.9), respectively. The risk of the primary endpoint did not differ significantly between recipients of different intensities of statin therapy. Low-density lipoprotein cholesterol (LDL-C) goals were only achieved in 24.4% of patients during the first year of follow-up. CONCLUSION: By analyzing EMR data representing routine practice in Korea, we found that patients with very high-risk ASCVD were at substantial risk of further cardiovascular events in 3 years. Given the observed risk of recurrent events with suboptimal lipid management by statin, additional treatment to control LDL-C might be necessary to reduce the burden of further cardiovascular events for very high-risk ASCVD patients.


Assuntos
Anticolesterolemiantes , Aterosclerose , Doenças Cardiovasculares , Inibidores de Hidroximetilglutaril-CoA Redutases , Humanos , Inibidores de Hidroximetilglutaril-CoA Redutases/efeitos adversos , Doenças Cardiovasculares/diagnóstico , Doenças Cardiovasculares/tratamento farmacológico , Doenças Cardiovasculares/epidemiologia , LDL-Colesterol , Anticolesterolemiantes/efeitos adversos , Registros Eletrônicos de Saúde , Estudos Retrospectivos , Pró-Proteína Convertase 9 , República da Coreia/epidemiologia
12.
Sci Rep ; 12(1): 21152, 2022 12 07.
Artigo em Inglês | MEDLINE | ID: mdl-36477457

RESUMO

Graph representation learning is a method for introducing how to effectively construct and learn patient embeddings using electronic medical records. Adapting the integration will support and advance the previous methods to predict the prognosis of patients in network models. This study aims to address the challenge of implementing a complex and highly heterogeneous dataset, including the following: (1) demonstrating how to build a multi-attributed and multi-relational graph model (2) and applying a downstream disease prediction task of a patient's prognosis using the HinSAGE algorithm. We present a bipartite graph schema and a graph database construction in detail. The first constructed graph database illustrates a query of a predictive network that provides analytical insights using a graph representation of a patient's journey. Moreover, we demonstrate an alternative bipartite model where we apply the model to the HinSAGE to perform the link prediction task for predicting the event occurrence. Consequently, the performance evaluation indicated that our heterogeneous graph model was successfully predicted as a baseline model. Overall, our graph database successfully demonstrated efficient real-time query performance and showed HinSAGE implementation to predict cardiovascular disease event outcomes on supervised link prediction learning.


Assuntos
Registros Eletrônicos de Saúde , Humanos
13.
JMIR Med Inform ; 10(5): e26801, 2022 May 11.
Artigo em Inglês | MEDLINE | ID: mdl-35544292

RESUMO

BACKGROUND: Although there is a growing interest in prediction models based on electronic medical records (EMRs) to identify patients at risk of adverse cardiac events following invasive coronary treatment, robust models fully utilizing EMR data are limited. OBJECTIVE: We aimed to develop and validate machine learning (ML) models by using diverse fields of EMR to predict the risk of 30-day adverse cardiac events after percutaneous intervention or bypass surgery. METHODS: EMR data of 5,184,565 records of 16,793 patients at a quaternary hospital between 2006 and 2016 were categorized into static basic (eg, demographics), dynamic time-series (eg, laboratory values), and cardiac-specific data (eg, coronary angiography). The data were randomly split into training, tuning, and testing sets in a ratio of 3:1:1. Each model was evaluated with 5-fold cross-validation and with an external EMR-based cohort at a tertiary hospital. Logistic regression (LR), random forest (RF), gradient boosting machine (GBM), and feedforward neural network (FNN) algorithms were applied. The primary outcome was 30-day mortality following invasive treatment. RESULTS: GBM showed the best performance with area under the receiver operating characteristic curve (AUROC) of 0.99; RF had a similar AUROC of 0.98. AUROCs of FNN and LR were 0.96 and 0.93, respectively. GBM had the highest area under the precision-recall curve (AUPRC) of 0.80, and the AUPRCs of RF, LR, and FNN were 0.73, 0.68, and 0.63, respectively. All models showed low Brier scores of <0.1 as well as highly fitted calibration plots, indicating a good fit of the ML-based models. On external validation, the GBM model demonstrated maximal performance with an AUROC of 0.90, while FNN had an AUROC of 0.85. The AUROCs of LR and RF were slightly lower at 0.80 and 0.79, respectively. The AUPRCs of GBM, LR, and FNN were similar at 0.47, 0.43, and 0.41, respectively, while that of RF was lower at 0.33. Among the categories in the GBM model, time-series dynamic data demonstrated a high AUROC of >0.95, contributing majorly to the excellent results. CONCLUSIONS: Exploiting the diverse fields of the EMR data set, the ML-based 30-day adverse cardiac event prediction models demonstrated outstanding results, and the applied framework could be generalized for various health care prediction models.

14.
Comput Methods Programs Biomed ; 221: 106866, 2022 Jun.
Artigo em Inglês | MEDLINE | ID: mdl-35594580

RESUMO

BACKGROUND AND OBJECTIVE: With the advent of bioinformatics, biological databases have been constructed to computerize data. Biological systems can be described as interactions and relationships between elements constituting the systems, and they are organized in various biomedical open databases. These open databases have been used in approaches to predict functional interactions such as protein-protein interactions (PPI), drug-drug interactions (DDI) and disease-disease relationships (DDR). However, just combining interaction data has limited effectiveness in predicting the complex relationships occurring in a whole context. Each contributing source contains information on each element in a specific field of knowledge but there is a lack of inter-disciplinary insight in combining them. METHODS: In this study, we propose the RWD Integrated platform for Discovering Associations in Biomedical research (RIDAB) to predict interactions between biomedical entities. RIDAB is established as a graph network to construct a platform that predicts the interactions of target entities. Biomedical open database is combined with EMRs each representing a biomedical network and a real-world data. To integrate databases from different domains to build the platform, mapping of the vocabularies was required. In addition, the appropriate structure of the network and the graph embedding method to be used were needed to be selected to fit the tasks. RESULTS: The feasibility of the platform was evaluated using node similarity and link prediction for drug repositioning task, a commonly used task for biomedical network. In addition, we compared the US Food and Drug Administration (FDA)-approved repositioned drugs with the predicted result. By integrating EMR database with biomedical networks, the platform showed increased f1 score in predicting repositioned drugs, from 45.62% to 57.26%, compared to platforms based on biomedical networks alone. CONCLUSIONS: This study demonstrates that the elements of biomedical research findings can be reflected by integrating EMR data with open-source biomedical networks. In addition, showed the feasibility of using the established platform to represent the integration of biomedical networks and reflected the relationship between real world networks.


Assuntos
Pesquisa Biomédica , Registros Eletrônicos de Saúde , Bases de Dados Factuais
15.
JMIR Med Inform ; 10(3): e32313, 2022 Mar 07.
Artigo em Inglês | MEDLINE | ID: mdl-35254275

RESUMO

BACKGROUND: Scoring systems developed for predicting survival after allogeneic hematopoietic cell transplantation (HCT) show suboptimal prediction power, and various factors affect posttransplantation outcomes. OBJECTIVE: A prediction model using a machine learning-based algorithm can be an alternative for concurrently applying multiple variables and can reduce potential biases. In this regard, the aim of this study is to establish and validate a machine learning-based predictive model for survival after allogeneic HCT in patients with hematologic malignancies. METHODS: Data from 1470 patients with hematologic malignancies who underwent allogeneic HCT between December 1993 and June 2020 at Asan Medical Center, Seoul, South Korea, were retrospectively analyzed. Using the gradient boosting machine algorithm, we evaluated a model predicting the 5-year posttransplantation survival through 10-fold cross-validation. RESULTS: The prediction model showed good performance with a mean area under the receiver operating characteristic curve of 0.788 (SD 0.03). Furthermore, we developed a risk score predicting probabilities of posttransplantation survival in 294 randomly selected patients, and an agreement between the estimated predicted and observed risks of overall death, nonrelapse mortality, and relapse incidence was observed according to the risk score. Additionally, the calculated score demonstrated the possibility of predicting survival according to the different transplantation-related factors, with the visualization of the importance of each variable. CONCLUSIONS: We developed a machine learning-based model for predicting long-term survival after allogeneic HCT in patients with hematologic malignancies. Our model provides a method for making decisions regarding patient and donor candidates or selecting transplantation-related resources, such as conditioning regimens.

16.
JMIR Med Inform ; 9(11): e32662, 2021 Nov 17.
Artigo em Inglês | MEDLINE | ID: mdl-34787584

RESUMO

BACKGROUND: Effective resource management in hospitals can improve the quality of medical services by reducing labor-intensive burdens on staff, decreasing inpatient waiting time, and securing the optimal treatment time. The use of hospital processes requires effective bed management; a stay in the hospital that is longer than the optimal treatment time hinders bed management. Therefore, predicting a patient's hospitalization period may support the making of judicious decisions regarding bed management. OBJECTIVE: First, this study aims to develop a machine learning (ML)-based predictive model for predicting the discharge probability of inpatients with cardiovascular diseases (CVDs). Second, we aim to assess the outcome of the predictive model and explain the primary risk factors of inpatients for patient-specific care. Finally, we aim to evaluate whether our ML-based predictive model helps manage bed scheduling efficiently and detects long-term inpatients in advance to improve the use of hospital processes and enhance the quality of medical services. METHODS: We set up the cohort criteria and extracted the data from CardioNet, a manually curated database that specializes in CVDs. We processed the data to create a suitable data set by reindexing the date-index, integrating the present features with past features from the previous 3 years, and imputing missing values. Subsequently, we trained the ML-based predictive models and evaluated them to find an elaborate model. Finally, we predicted the discharge probability within 3 days and explained the outcomes of the model by identifying, quantifying, and visualizing its features. RESULTS: We experimented with 5 ML-based models using 5 cross-validations. Extreme gradient boosting, which was selected as the final model, accomplished an average area under the receiver operating characteristic curve score that was 0.865 higher than that of the other models (ie, logistic regression, random forest, support vector machine, and multilayer perceptron). Furthermore, we performed feature reduction, represented the feature importance, and assessed prediction outcomes. One of the outcomes, the individual explainer, provides a discharge score during hospitalization and a daily feature influence score to the medical team and patients. Finally, we visualized simulated bed management to use the outcomes. CONCLUSIONS: In this study, we propose an individual explainer based on an ML-based predictive model, which provides the discharge probability and relative contributions of individual features. Our model can assist medical teams and patients in identifying individual and common risk factors in CVDs and can support hospital administrators in improving the management of hospital beds and other resources.

17.
JMIR Public Health Surveill ; 7(10): e30824, 2021 10 13.
Artigo em Inglês | MEDLINE | ID: mdl-34643539

RESUMO

BACKGROUND: When using machine learning in the real world, the missing value problem is the first problem encountered. Methods to impute this missing value include statistical methods such as mean, expectation-maximization, and multiple imputations by chained equations (MICE) as well as machine learning methods such as multilayer perceptron, k-nearest neighbor, and decision tree. OBJECTIVE: The objective of this study was to impute numeric medical data such as physical data and laboratory data. We aimed to effectively impute data using a progressive method called self-training in the medical field where training data are scarce. METHODS: In this paper, we propose a self-training method that gradually increases the available data. Models trained with complete data predict the missing values in incomplete data. Among the incomplete data, the data in which the missing value is validly predicted are incorporated into the complete data. Using the predicted value as the actual value is called pseudolabeling. This process is repeated until the condition is satisfied. The most important part of this process is how to evaluate the accuracy of pseudolabels. They can be evaluated by observing the effect of the pseudolabeled data on the performance of the model. RESULTS: In self-training using random forest (RF), mean squared error was up to 12% lower than pure RF, and the Pearson correlation coefficient was 0.1% higher. This difference was confirmed statistically. In the Friedman test performed on MICE and RF, self-training showed a P value between .003 and .02. A Wilcoxon signed-rank test performed on the mean imputation showed the lowest possible P value, 3.05e-5, in all situations. CONCLUSIONS: Self-training showed significant results in comparing the predicted values and actual values, but it needs to be verified in an actual machine learning system. And self-training has the potential to improve performance according to the pseudolabel evaluation method, which will be the main subject of our future research.


Assuntos
Algoritmos , Registros Eletrônicos de Saúde , Humanos , Aprendizado de Máquina , Projetos de Pesquisa
18.
Comput Methods Programs Biomed ; 208: 106281, 2021 Sep.
Artigo em Inglês | MEDLINE | ID: mdl-34333207

RESUMO

Background and objectiveDetecting abnormal patterns within an electrocardiogram (ECG) is crucial for diagnosing cardiovascular diseases. We start from two unresolved problems in applying deep-learning-based ECG classification models to clinical practice: first, although multiple cardiac arrhythmia (CA) types may co-occur in real life, the majority of previous detection methods have focused on one-to-one relationships between ECG and CA type, and second, it has been difficult to explain how neural-network-based CA classifiers make decisions. We hypothesize that fine-tuning attention maps with regard to all possible combinations of ground-truth (GT) labels will improve both the detection and interpretability of co-occurring CAs. Methods To test our hypothesis, we propose an end-to-end convolutional neural network (CNN), xECGNet, that fine-tunes the attention map to resemble the averaged response maps of GT labels. Fine-tuning is achieved by adding to the objective function a regularization loss between the attention map and the reference (averaged) map. Performance is assessed by F1 score and subset accuracy. Results The main experiment demonstrates that fine-tuning alone significantly improves a model's multilabel subset accuracy from 75.8% to 84.5% when compared with the baseline model. Also, xECGNet shows the highest F1 score of 0.812 and yields a more explainable map that encompasses multiple CA types, when compared to other baseline methods. Conclusions xECGNet has implications in that it tackles the two obstacles for the clinical application of CNN-based CA detection models with a simple solution of adding one additional term to the objective function.


Assuntos
Algoritmos , Redes Neurais de Computação , Arritmias Cardíacas/diagnóstico , Atenção , Eletrocardiografia , Humanos
19.
BMC Med Inform Decis Mak ; 21(1): 29, 2021 01 28.
Artigo em Inglês | MEDLINE | ID: mdl-33509180

RESUMO

BACKGROUND: Cardiovascular diseases (CVDs) are difficult to diagnose early and have risk factors that are easy to overlook. Early prediction and personalization of treatment through the use of artificial intelligence (AI) may help clinicians and patients manage CVDs more effectively. However, to apply AI approaches to CVDs data, it is necessary to establish and curate a specialized database based on electronic health records (EHRs) and include pre-processed unstructured data. METHODS: To build a suitable database (CardioNet) for CVDs that can utilize AI technology, contributing to the overall care of patients with CVDs. First, we collected the anonymized records of 748,474 patients who had visited the Asan Medical Center (AMC) or Ulsan University Hospital (UUH) because of CVDs. Second, we set clinically plausible criteria to remove errors and duplication. Third, we integrated unstructured data such as readings of medical examinations with structured data sourced from EHRs to create the CardioNet. We subsequently performed natural language processing to structuralize the significant variables associated with CVDs because most results of the principal CVD-related medical examinations are free-text readings. Additionally, to ensure interoperability for convergent multi-center research, we standardized the data using several codes that correspond to the common data model. Finally, we created the descriptive table (i.e., dictionary of the CardioNet) to simplify access and utilization of data for clinicians and engineers and continuously validated the data to ensure reliability. RESULTS: CardioNet is a comprehensive database that can serve as a training set for AI models and assist in all aspects of clinical management of CVDs. It comprises information extracted from EHRs and results of readings of CVD-related digital tests. It consists of 27 tables, a code-master table, and a descriptive table. CONCLUSIONS: CardioNet database specialized in CVDs was established, with continuing data collection. We are actively supporting multi-center research, which may require further data processing, depending on the subject of the study. CardioNet will serve as the fundamental database for future CVD-related research projects.


Assuntos
Inteligência Artificial , Doenças Cardiovasculares , Doenças Cardiovasculares/diagnóstico , Doenças Cardiovasculares/epidemiologia , Bases de Dados Factuais , Humanos , Processamento de Linguagem Natural , Reprodutibilidade dos Testes
20.
Neural Netw ; 128: 216-233, 2020 Aug.
Artigo em Inglês | MEDLINE | ID: mdl-32447265

RESUMO

In this paper, we proposed nested encoder-decoder architecture named T-Net. T-Net consists of several small encoder-decoders for each block constituting convolutional network. T-Net overcomes the limitation that U-Net can only have a single set of the concatenate layer between encoder and decoder block. To be more precise, the U-Net symmetrically forms the concatenate layers, so the low-level feature of the encoder is connected to the latter part of the decoder, and the high-level feature is connected to the beginning of the decoder. T-Net arranges the pooling and up-sampling appropriately during the encoding process, and likewise during the decoding process so that feature-maps of various sizes are obtained in a single block. As a result, all features from the low-level to the high-level extracted from the encoder are delivered from the beginning of the decoder to predict a more accurate mask. We evaluated T-Net for the problem of segmenting three main vessels in coronary angiography images. The experiment consisted of a comparison of U-Net and T-Nets under the same conditions, and an optimized T-Net for the main vessel segmentation. As a result, T-Net recorded a Dice Similarity Coefficient score (DSC) of 83.77%, 10.69% higher than that of U-Net, and the optimized T-Net recorded a DSC of 88.97% which was 15.89% higher than that of U-Net. In addition, we visualized the weight activation of the convolutional layer of T-Net and U-Net to show that T-Net actually predicts the mask from earlier decoders. Therefore, we expect that T-Net can be effectively applied to other similar medical image segmentation problems.


Assuntos
Angiografia Coronária/métodos , Processamento de Imagem Assistida por Computador/métodos , Redes Neurais de Computação , Humanos
SELEÇÃO DE REFERÊNCIAS
DETALHE DA PESQUISA