RESUMEN
Rapid and accurate prediction of drug-target affinity can accelerate and improve the drug discovery process. Recent studies show that deep learning models may have the potential to provide fast and accurate drug-target affinity prediction. However, the existing deep learning models still have their own disadvantages that make it difficult to complete the task satisfactorily. Complex-based models rely heavily on the time-consuming docking process, and complex-free models lacks interpretability. In this study, we introduced a novel knowledge-distillation insights drug-target affinity prediction model with feature fusion inputs to make fast, accurate and explainable predictions. We benchmarked the model on public affinity prediction and virtual screening dataset. The results show that it outperformed previous state-of-the-art models and achieved comparable performance to previous complex-based models. Finally, we study the interpretability of this model through visualization and find it can provide meaningful explanations for pairwise interaction. We believe this model can further improve the drug-target affinity prediction for its higher accuracy and reliable interpretability.
Asunto(s)
Benchmarking , Descubrimiento de Drogas , Sistemas de Liberación de MedicamentosRESUMEN
MOTIVATION: Natural language is poised to become a key medium for human-machine interactions in the era of large language models. In the field of biochemistry, tasks such as property prediction and molecule mining are critically important yet technically challenging. Bridging molecular expressions in natural language and chemical language can significantly enhance the interpretability and ease of these tasks. Moreover, it can integrate chemical knowledge from various sources, leading to a deeper understanding of molecules. RESULTS: Recognizing these advantages, we introduce the concept of conversational molecular design, a novel task that utilizes natural language to describe and edit target molecules. To better accomplish this task, we develop ChatMol, a knowledgeable and versatile generative pretrained model. This model is enhanced by incorporating experimental property information, molecular spatial knowledge, and the associations between natural and chemical languages. Several typical solutions including large language models (e.g. ChatGPT) are evaluated, proving the challenge of conversational molecular design and the effectiveness of our knowledge enhancement approach. Case observations and analysis offer insights and directions for further exploration of natural-language interaction in molecular discovery. AVAILABILITY AND IMPLEMENTATION: Codes and data are provided in https://github.com/Ellenzzn/ChatMol/tree/main.
Asunto(s)
Procesamiento de Lenguaje Natural , Humanos , Programas Informáticos , Biología Computacional/métodosRESUMEN
BACKGROUND: Glomerular lesions are the main injuries of diabetic nephropathy (DN) and are used as a crucial index for pathologic classification. Manual quantification of these morphologic features currently used is semi-quantitative and time-consuming. Automatically quantifying glomerular morphologic features is urgently needed. METHODS: A series of convolutional neural networks (CNN) were designed to identify and classify glomerular morphologic features in DN patients. Associations of these digital features with pathologic classification and prognosis were further analyzed. RESULTS: Our CNN-based model achieved a 0.928 F1-score for global glomerulosclerosis and 0.953 F1-score for Kimmelstiel-Wilson lesion, further obtained a dice of 0.870 for the mesangial area and F1-score beyond 0.839 for three glomerular intrinsic cells. As the pathologic classes increased, mesangial cell numbers and mesangial area increased, and podocyte numbers decreased (p for all < 0.001), while endothelial cell numbers remained stable (p = 0.431). Glomeruli with Kimmelstiel-Wilson lesion showed more severe podocyte deletion compared to those without (p < 0.001). Furthermore, CNN-based classifications showed moderate agreement with pathologists-based classification, the kappa value between the CNN model 3 and pathologists reached 0.624 (ranging from 0.529 to 0.688, p < 0.001). Notably, CNN-based classifications obtained equivalent performance to pathologists-based classifications on predicting baseline and long-term renal function. CONCLUSION: Our CNN-based model is promising in assisting the identification and pathologic classification of glomerular lesions in DN patients.
Asunto(s)
Inteligencia Artificial , Nefropatías Diabéticas , Glomérulos Renales , Humanos , Nefropatías Diabéticas/patología , Nefropatías Diabéticas/clasificación , Glomérulos Renales/patología , Masculino , Femenino , Persona de Mediana Edad , Redes Neurales de la ComputaciónRESUMEN
BACKGROUND: The current literature shows a strong relationship between retinal neuronal and vascular alterations in dementia. The purpose of the study was to use NFN+ deep learning models to analyze retinal vessel characteristics for cognitive impairment (CI) recognition. METHODS: We included 908 participants from a community-based cohort followed for over 15 years (the prospective KaiLuan Study) who underwent brain magnetic resonance imaging (MRI) and fundus photography between 2021 and 2022. The cohort consisted of both cognitively healthy individuals (N = 417) and those with cognitive impairment (N = 491). We employed the NFN+ deep learning framework for retinal vessel segmentation and measurement. Associations between Retinal microvascular parameters (RMPs: central retinal arteriolar / venular equivalents, arteriole to venular ratio, fractal dimension) and CI were assessed by Pearson correlation. P < 0.05 was considered statistically significant. The correlation between the CI and RMPs were explored, then the correlation coefficients between CI and RMPs were analyzed. Random Forest nonlinear classification model was used to predict whether one having cognitive decline or not. The assessment criterion was the AUC value derived from the working characteristic curve. RESULTS: The fractal dimension (FD) and global vein width were significantly correlated with the CI (P < 0.05). Age (0.193), BMI (0.154), global vein width (0.106), retinal vessel FD (0.099), and CRAE (0.098) were the variables in this model that were ranked in order of feature importance. The AUC values of the model were 0.799. CONCLUSIONS: Establishment of a predictive model based on the extraction of vascular features from fundus images has a high recognizability and predictive power for cognitive function and can be used as a screening method for CI.
Asunto(s)
Disfunción Cognitiva , Aprendizaje Profundo , Humanos , Estudios Prospectivos , Disfunción Cognitiva/diagnóstico por imagen , Retina , Vasos Retinianos/diagnóstico por imagen , BiomarcadoresRESUMEN
INTRODUCTION: This study aimed to compare retinal vascular parameters and density in patients with moyamoya disease using the optical coherence tomography angiography. METHODS: This clinical trial totally enrolls 78 eyes from 39 participants, and all these patients with moyamoya disease (N = 13) are set as experimental group and participants with health who matched with age and gender are considered as the control group (N = 26). Then all these participants receive optical coherence tomography angiography detection. Participants' general data are collected and analyzed. Skeleton density (SD) value, vessel density (VD) value, fractal dimension (FD) value, vessel diameter index (VDI) value, foveal avascular zone (FAZ) value are analyzed. RESULTS: A total of 39 participants are included in this study. The SD value in the experimental group was significantly lower than that in control group (0.175 [0.166, 0.181] vs. 0.184 [0.175, 0.188], p = 0.017). Similarly, the VD value in the experimental group was significantly lower than that in the control group (0.333 [0.320, 0.350] vs. 0.354 [0.337, 0.364], p = 0.024). Additionally, the FD value in the experimental group was significantly lower than that in the control group (2.088 [2.083, 2.094] vs. 2.096 [2.090, 2.101], p = 0.022). As for the VDI and FAZ, VDI and FAZ values in the experimental group were lower than those in the control group, there was no significant difference in VDI and FAZ values between the two groups. CONCLUSIONS: Our study, using non-invasive and rapid OCTA imaging, confirmed decreased retinal vascular parameters and density in patients with moyamoya disease.
Asunto(s)
Angiografía con Fluoresceína , Fondo de Ojo , Enfermedad de Moyamoya , Vasos Retinianos , Tomografía de Coherencia Óptica , Humanos , Enfermedad de Moyamoya/diagnóstico , Enfermedad de Moyamoya/fisiopatología , Enfermedad de Moyamoya/diagnóstico por imagen , Femenino , Masculino , Vasos Retinianos/diagnóstico por imagen , Vasos Retinianos/patología , Tomografía de Coherencia Óptica/métodos , Estudios Retrospectivos , Angiografía con Fluoresceína/métodos , Adulto , Persona de Mediana Edad , Agudeza Visual , Adulto Joven , Adolescente , Estudios de SeguimientoRESUMEN
BACKGROUND: One of the significant changes in intensive care medicine over the past 2 decades is the acknowledgment that improper mechanical ventilation settings substantially contribute to pulmonary injury in critically ill patients. Artificial intelligence (AI) solutions can optimize mechanical ventilation settings in intensive care units (ICUs) and improve patient outcomes. Specifically, machine learning algorithms can be trained on large datasets of patient information and mechanical ventilation settings. These algorithms can then predict patient responses to different ventilation strategies and suggest personalized ventilation settings for individual patients. OBJECTIVE: In this study, we aimed to design and evaluate an AI solution that could tailor an optimal ventilator strategy for each critically ill patient who requires mechanical ventilation. METHODS: We proposed a reinforcement learning-based AI solution using observational data from multiple ICUs in the United States. The primary outcome was hospital mortality. Secondary outcomes were the proportion of optimal oxygen saturation and the proportion of optimal mean arterial blood pressure. We trained our AI agent to recommend low, medium, and high levels of 3 ventilator settings-positive end-expiratory pressure, fraction of inspired oxygen, and ideal body weight-adjusted tidal volume-according to patients' health conditions. We defined a policy as rules guiding ventilator setting changes given specific clinical scenarios. Off-policy evaluation metrics were applied to evaluate the AI policy. RESULTS: We studied 21,595 and 5105 patients' ICU stays from the e-Intensive Care Unit Collaborative Research (eICU) and Medical Information Mart for Intensive Care IV (MIMIC-IV) databases, respectively. Using the learned AI policy, we estimated the hospital mortality rate (eICU 12.1%, SD 3.1%; MIMIC-IV 29.1%, SD 0.9%), the proportion of optimal oxygen saturation (eICU 58.7%, SD 4.7%; MIMIC-IV 49%, SD 1%), and the proportion of optimal mean arterial blood pressure (eICU 31.1%, SD 4.5%; MIMIC-IV 41.2%, SD 1%). Based on multiple quantitative and qualitative evaluation metrics, our proposed AI solution outperformed observed clinical practice. CONCLUSIONS: Our study found that customizing ventilation settings for individual patients led to lower estimated hospital mortality rates compared to actual rates. This highlights the potential effectiveness of using reinforcement learning methodology to develop AI models that analyze complex clinical data for optimizing treatment parameters. Additionally, our findings suggest the integration of this model into a clinical decision support system for refining ventilation settings, supporting the need for prospective validation trials.
Asunto(s)
Respiración Artificial , Humanos , Respiración Artificial/métodos , Estudios Retrospectivos , Masculino , Persona de Mediana Edad , Femenino , Anciano , Unidades de Cuidados Intensivos , Mortalidad Hospitalaria , Enfermedad Crítica , Inteligencia Artificial , Ventiladores Mecánicos/estadística & datos numéricos , Aprendizaje AutomáticoRESUMEN
BACKGROUND: Machine learning (ML) risk prediction models, although much more accurate than traditional statistical methods, are inconvenient to use in clinical practice due to their nontransparency and requirement of a large number of input variables. OBJECTIVE: We aimed to develop a precise, explainable, and flexible ML model to predict the risk of in-hospital mortality in patients with ST-segment elevation myocardial infarction (STEMI). METHODS: This study recruited 18,744 patients enrolled in the 2013 China Acute Myocardial Infarction (CAMI) registry and 12,018 patients from the China Patient-Centered Evaluative Assessment of Cardiac Events (PEACE)-Retrospective Acute Myocardial Infarction Study. The Extreme Gradient Boosting (XGBoost) model was derived from 9616 patients in the CAMI registry (2014, 89 variables) with 5-fold cross-validation and validated on both the 9125 patients in the CAMI registry (89 variables) and the independent China PEACE cohort (10 variables). The Shapley Additive Explanations (SHAP) approach was employed to interpret the complex relationships embedded in the proposed model. RESULTS: In the XGBoost model for predicting all-cause in-hospital mortality, the variables with the top 8 most important scores were age, left ventricular ejection fraction, Killip class, heart rate, creatinine, blood glucose, white blood cell count, and use of angiotensin-converting enzyme inhibitors (ACEIs) and angiotensin II receptor blockers (ARBs). The area under the curve (AUC) on the CAMI validation set was 0.896 (95% CI 0.884-0.909), significantly higher than the previous models. The AUC for the Global Registry of Acute Coronary Events (GRACE) model was 0.809 (95% CI 0.790-0.828), and for the TIMI model, it was 0.782 (95% CI 0.763-0.800). Despite the China PEACE validation set only having 10 available variables, the AUC reached 0.840 (0.829-0.852), showing a substantial improvement to the GRACE (0.762, 95% CI 0.748-0.776) and TIMI (0.789, 95% CI 0.776-0.803) scores. Several novel and nonlinear relationships were discovered between patients' characteristics and in-hospital mortality, including a U-shape pattern of high-density lipoprotein cholesterol (HDL-C). CONCLUSIONS: The proposed ML risk prediction model was highly accurate in predicting in-hospital mortality. Its flexible and explainable characteristics make the model convenient to use in clinical practice and could help guide patient management. TRIAL REGISTRATION: ClinicalTrials.gov NCT01874691; https://clinicaltrials.gov/study/NCT01874691.
Asunto(s)
Mortalidad Hospitalaria , Aprendizaje Automático , Sistema de Registros , Infarto del Miocardio con Elevación del ST , Humanos , China , Masculino , Femenino , Persona de Mediana Edad , Anciano , Infarto del Miocardio con Elevación del ST/mortalidad , Estudios Retrospectivos , Infarto del Miocardio/mortalidad , Pueblos del Este de AsiaRESUMEN
BACKGROUND: The analytical renal pathology system (ARPS) based on convolutional neural networks has been used successfully in native IgA nephropathy (IgAN) patients. Considering the similarity of pathologic features, we aim to evaluate the performance of the ARPS in allograft IgAN patients and broaden its implementation. METHODS: Biopsy-proven allograft IgAN patients from two different centers were enrolled for internal and external validation. We implemented the ARPS to identify glomerular lesions and intrinsic glomerular cells, and then evaluated its performance. Consistency between the ARPS and pathologists was assessed using intraclass correlation coefficients. The association of digital pathological features with clinical and pathological data was measured. Kaplan-Meier survival curve and cox proportional hazards model were applied to investigate prognosis prediction. RESULTS: A total of 56 biopsy-proven allograft IgAN patients from the internal center and 17 biopsy-proven allograft IgAN patients from the external center were enrolled in this study. The ARPS was successfully applied to identify the glomerular lesions (F1-score, 0.696-0.959) and quantify intrinsic glomerular cells (F1-score, 0.888-0.968) in allograft IgAN patients rapidly and precisely. Furthermore, the mesangial hypercellularity score was positively correlated with all mesangial metrics provided by ARPS [Spearman's correlation coefficient (r), 0.439-0.472, and all p values < 0.001]. Besides, a higher allograft survival was noticed among patients in the high-level groups of the maximum and ratio of endothelial cells, as well as the maximum and density of podocytes. CONCLUSION: We propose that the ARPS could be implemented in future clinical practice with outstanding capability.
Asunto(s)
Glomerulonefritis por IGA , Humanos , Glomerulonefritis por IGA/cirugía , Glomerulonefritis por IGA/patología , Células Endoteliales/patología , Glomérulos Renales/patología , Trasplante Homólogo , Pronóstico , Aloinjertos/patología , Estudios RetrospectivosRESUMEN
How to produce expressive molecular representations is a fundamental challenge in artificial intelligence-driven drug discovery. Graph neural network (GNN) has emerged as a powerful technique for modeling molecular data. However, previous supervised approaches usually suffer from the scarcity of labeled data and poor generalization capability. Here, we propose a novel molecular pre-training graph-based deep learning framework, named MPG, that learns molecular representations from large-scale unlabeled molecules. In MPG, we proposed a powerful GNN for modelling molecular graph named MolGNet, and designed an effective self-supervised strategy for pre-training the model at both the node and graph-level. After pre-training on 11 million unlabeled molecules, we revealed that MolGNet can capture valuable chemical insights to produce interpretable representation. The pre-trained MolGNet can be fine-tuned with just one additional output layer to create state-of-the-art models for a wide range of drug discovery tasks, including molecular properties prediction, drug-drug interaction and drug-target interaction, on 14 benchmark datasets. The pre-trained MolGNet in MPG has the potential to become an advanced molecular encoder in the drug discovery pipeline.
Asunto(s)
Bases de Datos de Compuestos Químicos , Sistemas de Liberación de Medicamentos , Descubrimiento de Drogas , Modelos Moleculares , Redes Neurales de la ComputaciónRESUMEN
PURPOSE: The goal of this study was to propose a knowledge-based planning system which could automatically design plans for lung cancer patients treated with intensity-modulated radiotherapy (IMRT). METHODS AND MATERIALS: From May 2018 to June 2020, 612 IMRT treatment plans of lung cancer patients were retrospectively selected to construct a planning database. Knowledge-based planning (KBP) architecture named αDiar was proposed in this study. It consisted of two parts separated by a firewall. One was the in-hospital workstation, and the other was the search engine in the cloud. Based on our previous study, ANet in the in-hospital workstation was used to generate predicted virtual dose images. A search engine including a three-dimensional convolutional neural network (3D CNN) was constructed to derive the feature vectors of dose images. By comparing the similarity of the features between virtual dose images and the clinical dose images in the database, the most similar feature was found. The optimization parameters (OPs) of the treatment plan corresponding to the most similar feature were assigned to the new plan, and the design of a new treatment plan was automatically completed. After αDiar was developed, we performed two studies. The first retrospective study was conducted to validate whether this architecture was qualified for clinical practice and involved 96 patients. The second comparative study was performed to investigate whether αDiar could assist dosimetrists in improving the quality of planning for the patients. Two dosimetrists were involved and designed plans for only one trial with and without αDiar; 26 patients were involved in this study. RESULTS: The first study showed that about 54% (52/96) of the automatically generated plans would achieve the dosimetric constraints of the Radiation Therapy Oncology Group (RTOG) and about 93% (89/96) of the automatically generated plans would achieve the dosimetric constraints of the National Comprehensive Cancer Network (NCCN). The second study showed that the quality of treatment planning designed by junior dosimetrists was improved with the help of αDiar. CONCLUSIONS: Our results showed that αDiar was an effective tool to improve planning quality. Over half of the patients' plans could be designed automatically. For the remaining patients, although the automatically designed plans did not fully meet the clinical requirements, their quality was also better than that of manual plans.
RESUMEN
OBJECTIVE: Patients with atrial fibrillation (AF) are highly heterogeneous, and current risk stratification scores are only modestly good at predicting an individual's stroke risk. We aim to identify distinct AF clinical phenotypes with cluster analysis to optimize stroke prevention practices. METHODS: From the prospective Chinese Atrial Fibrillation Registry cohort study, we included 4337 AF patients with CHA2 DS2 -VASc≥2 for males and 3 for females who were not treated with oral anticoagulation. We randomly split the patients into derivation and validation sets by a ratio of 7:3. In the derivation set, we used outcome-driven patient clustering with metric learning to group patients into clusters with different risk levels of ischemic stroke and systemic embolism, and identify clusters of patients with low risks. Then we tested the results in the validation set, using the clustering rules generated from the derivation set. Finally, the survival decision tree was applied as a sensitivity analysis to confirm the results. RESULTS: Up to the follow-up of 1 year, 140 thromboembolic events (ischemic stroke or systemic embolism) occurred. After supervised metric learning from six variables involved in CHA2 DS2 -VASc scheme, we identified a cluster of patients (255/3035, 8.4%) at an annual thromboembolism risk of 0.8% in the derivation set. None of the patients in the low-risk cluster had prior thromboembolism, heart failure, diabetes, or age older than 70 years. After applying the regularities from metric learning on the validation set, we also identified a cluster of patients (137/1302, 10.5%) with an incident thromboembolism rate of 0.7%. Sensitivity analysis based on the survival decision tree approach selected a subgroup of patients with the same phenotypes as the metric-learning algorithm. CONCLUSIONS: Cluster analysis identified a distinct clinical phenotype at low risk of stroke among high-risk [CHA2 DS2 -VASc≥2 (3 for females)] patients with AF. The use of the novel analytic approach has the potential to prevent a subset of AF patients from unnecessary anticoagulation and avoid the associated risk of major bleeding.
RESUMEN
BACKGROUND: Dysregulation of lipid metabolism is closely associated with cancer progression. The study aimed to establish a prognostic model to predict distant metastasis-free survival (DMFS) in patients with nasopharyngeal carcinoma (NPC), based on lipidomics. METHODS: The plasma lipid profiles of 179 patients with locoregionally advanced NPC (LANPC) were measured and quantified using widely targeted quantitative lipidomics. Then, patients were randomly split into the training (125 patients, 69.8%) and validation (54 patients, 30.2%) sets. To identify distant metastasis-associated lipids, univariate Cox regression was applied to the training set (P < 0.05). A deep survival method called DeepSurv was employed to develop a proposed model based on significant lipid species (P < 0.01) and clinical biomarkers to predict DMFS. Concordance index and receiver operating curve analyses were performed to assess model effectiveness. The study also explored the potential role of lipid alterations in the prognosis of NPC. RESULTS: Forty lipids were recognized as distant metastasis-associated (P < 0.05) by univariate Cox regression. The concordance indices of the proposed model were 0.764 (95% confidence interval (CI), 0.682-0.846) and 0.760 (95% CI, 0.649-0.871) in the training and validation sets, respectively. High-risk patients had poorer 5-year DMFS compared with low-risk patients (Hazard ratio, 26.18; 95% CI, 3.52-194.80; P < 0.0001). Moreover, the six lipids were significantly correlated with immunity- and inflammation-associated biomarkers and were mainly enriched in metabolic pathways. CONCLUSIONS: Widely targeted quantitative lipidomics reveals plasma lipid predictors for LANPC, the prognostic model based on that demonstrated superior performance in predicting metastasis in LANPC patients.
Asunto(s)
Carcinoma , Neoplasias Nasofaríngeas , Humanos , Carcinoma Nasofaríngeo/patología , Pronóstico , Carcinoma/patología , Lipidómica , LípidosRESUMEN
PURPOSE: To develop and validate an artificial intelligence framework for identifying multiple retinal lesions at image level and performing an explainable macular disease diagnosis at eye level in optical coherence tomography images. METHODS: A total of 26,815 optical coherence tomography images were collected from 865 eyes, and 9 retinal lesions and 3 macular diseases were labeled by ophthalmologists, including diabetic macular edema and dry/wet age-related macular degeneration. We applied deep learning to classify retinal lesions at image level and random forests to achieve an explainable disease diagnosis at eye level. The performance of the integrated two-stage framework was evaluated and compared with human experts. RESULTS: On testing data set of 2,480 optical coherence tomography images from 80 eyes, the deep learning model achieved an average area under curve of 0.978 (95% confidence interval, 0.971-0.983) for lesion classification. In addition, random forests performed accurate disease diagnosis with a 0% error rate, which achieved the same accuracy as one of the human experts and was better than the other three experts. It also revealed that the detection of specific lesions in the center of macular region had more contribution to macular disease diagnosis. CONCLUSION: The integrated method achieved high accuracy and interpretability in retinal lesion classification and macular disease diagnosis in optical coherence tomography images and could have the potential to facilitate the clinical diagnosis.
Asunto(s)
Inteligencia Artificial , Retinopatía Diabética/diagnóstico por imagen , Atrofia Geográfica/diagnóstico por imagen , Edema Macular/diagnóstico por imagen , Tomografía de Coherencia Óptica/métodos , Degeneración Macular Húmeda/diagnóstico por imagen , Adulto , Anciano , Retinopatía Diabética/clasificación , Femenino , Atrofia Geográfica/clasificación , Humanos , Edema Macular/clasificación , Masculino , Persona de Mediana Edad , Curva ROC , Estudios Retrospectivos , Degeneración Macular Húmeda/clasificaciónRESUMEN
PURPOSE: To develop a deep learning-based framework to improve the image quality of optical coherence tomography (OCT) and evaluate its image enhancement effect with the traditional image averaging method from a clinical perspective. METHODS: 359 normal eyes and 456 eyes with various retinal conditions were included. A deep learning framework with high-resolution representation was developed to achieve image quality enhancement for OCT images. The quantitative comparisons, including expert subjective scores from ophthalmologists and three objective metrics of image quality (structural similarity index measure (SSIM), peak signal-to-noise ratio (PSNR) and contrast-to-noise ratio (CNR)), were performed between deep learning method and traditional image averaging. RESULTS: With the increase of frame count from 1 to 20, our deep learning method always obtained higher SSIM and PSNR values than the image averaging method while importing the same number of frames. When we selected 5 frames as inputs, the local objective assessment with CNR illustrated that the deep learning method had more obvious tissue contrast enhancement than averaging method. The subjective scores of image quality were all highest in our deep learning method, both for normal retinal structure and various retinal lesions. All the objective and subjective indicators had significant statistical differences (P < 0.05). CONCLUSION: Compared to traditional image averaging methods, our proposed deep learning enhancement framework can achieve a reasonable trade-off between image quality and scanning times, reducing the number of repeated scans.
Asunto(s)
Aprendizaje Profundo , Enfermedades de la Retina , Humanos , Aumento de la Imagen/métodos , Enfermedades de la Retina/diagnóstico por imagen , Tomografía de Coherencia Óptica/métodosRESUMEN
BACKGROUND: Several studies have investigated the correlation between physiological parameters and the risk of acute respiratory distress syndrome (ARDS), in addition, etiology-associated heterogeneity in ARDS has become an emerging topic quite recently; however, the intersection between the two, which is early prediction of target conditions in etiology-specific ARDS, has not been well-studied. We aimed to develop and validate a machine-learning model for the early prediction of moderate-to-severe condition of inhalation-induced ARDS. METHODS: Clinical expertise was applied with data-driven analysis. Using data from electronic intensive care units (retrospective derivation cohort) and the three most accessible vital signs (i.e. heart rate, temperature, and respiratory rate) together with feature engineering, we applied a random forest approach during the time window of 90 h that ended 6 h prior to the onset of moderate-to-severe respiratory failure (the ratio of partial pressure of arterial oxygen to fraction of inspired oxygen ≤ 200 mmHg). RESULTS: The trained random forest classifier was validated using two independent validation cohorts, with an area under the curve of 0.9127 (95% confidence interval 0.8713-0.9542) and 0.9026 (95% confidence interval 0.8075-1), respectively. A Stable and Interpretable RUle Set (SIRUS) was used to extract rules from the RF to provide guidelines for clinicians. We identified several predictive factors, including resp_96h_6h_min < 9, resp_96h_6h_mean ≥ 16.1, HR_96h_6h_mean ≥ 102, and temp_96h_6h_max > 100, that could be used for predicting inhalation-induced ARDS (moderate-to-severe condition) 6 h prior to onset in critical care units. ('xxx_96h_6h_min/mean/max': the minimum/mean/maximum values of the xxx vital sign collected during a 90 h time window beginning 96 h prior to the onset of ARDS and ending 6 h prior to the onset from every recorded blood gas test). CONCLUSIONS: This newly established random forestbased interpretable model shows good predictive ability for moderate-to-severe inhalation-induced ARDS and may assist clinicians in decision-making, as well as facilitate the enrolment of patients in prevention programmes to improve their outcomes.
Asunto(s)
Síndrome de Dificultad Respiratoria , Humanos , Aprendizaje Automático , Oxígeno , Presión Parcial , Estudios RetrospectivosRESUMEN
BACKGROUND: Renal flare of lupus nephritis (LN) is strongly associated with poor kidney outcomes, and predicting renal flare and stratifying its risk are important for clinical decision-making and individualized management to reduce LN flare. METHODS: We randomly divided 1,694 patients with biopsy-proven LN, who had achieved remission after treatment, into a derivation cohort (n = 1,186) and an internal validation cohort (n = 508), at a ratio of 7:3. The risk of renal flare 5 years after remission was predicted using an eXtreme Gradient Boosting (XGBoost) method model, developed from 59 variables, including demographic, clinical, immunological, pathological, and therapeutic characteristics. A simplified risk score prediction model (SRSPM) was developed from important variables selected by XGBoost model using stepwise Cox regression for practical convenience. RESULTS: The 5-year relapse rates were 39.5% and 38.2% in the derivation and internal validation cohorts, respectively. Both the XGBoost model and the SRSPM had good predictive performance, with a C-index of 0.819 (95% confidence interval [CI]: 0.774-0.857) and 0.746 (95% CI: 0.697-0.795), respectively, in the validation cohort. The SRSPM comprised 6 variables, including partial remission and endocapillary hypercellularity at baseline, age, serum Alb, anti-dsDNA, and serum complement C3 at the point of remission. Using Kaplan-Meier analysis, the SRSPM identified significant risk stratification for renal flares (p < 0.001). CONCLUSIONS: Renal flare of LN can be readily predicted using the XGBoost model and the SRSPM, and the SRSPM can also stratify flare risk. Both models are useful for clinical decision-making and individualized management in LN.
Asunto(s)
Nefritis Lúpica/fisiopatología , Aprendizaje Automático , Modelos Estadísticos , Brote de los Síntomas , Adulto , Factores de Edad , Anticuerpos Antinucleares/sangre , Capilares/patología , Toma de Decisiones Clínicas , Complemento C3/metabolismo , Femenino , Humanos , Estimación de Kaplan-Meier , Nefritis Lúpica/tratamiento farmacológico , Nefritis Lúpica/patología , Masculino , Modelos de Riesgos Proporcionales , Recurrencia , Medición de Riesgo/métodos , Factores de Riesgo , Albúmina Sérica/metabolismo , Adulto JovenRESUMEN
OBJECTIVES: To evaluate for the first time the performance of a deep learning method based on no-new-Net for fully automated segmentation and volumetric measurements of intracerebral hemorrhage (ICH), intraventricular extension of intracerebral hemorrhage (IVH), and perihematomal edema (PHE) in primary ICH on CT. METHODS: Three hundred and eighty primary ICH patients who underwent CT at hospital arrival were divided into a training cohort (n = 300) and a validation cohort (n = 80). An independent cohort with 80 patients was used for testing. Ground truth (segmentation masks) was manually generated by radiologists. Model performance on lesion segmentation and volumetric measurement of ICH, IVH, and PHE were evaluated by comparing the model results with the segmentations performed by radiologists. RESULTS: In the test cohort, the Dice scores of lesion segmentation were 0.92, 0.79, and 0.71 for ICH, IVH, and PHE, respectively. The sensitivities were 0.93 for ICH, 0.88 for IVH, and 0.81 for PHE. The positive predictive values were 0.92, 0.76, and 0.69 for ICH, IVH, and PHE, respectively. Excellent concordance (concordance correlation coefficients [CCCs] ≥ 0.98) of ICH and IVH and good concordance of PHE (CCCs ≥ 0.92) were demonstrated between manually and automatically measured volumes. The model took approximately 15 s to provide automatic segmentation and volume analysis for each patient. CONCLUSION: Our model demonstrates good reliability for automatic segmentation and volume measurement of ICH, IVH, and PHE in primary ICH, which can be useful to reduce the effort and time of doctors to calculate volumes of ICH, IVH, and PHE. KEY POINTS: ⢠Deep learning algorithms can provide automatic and reliable assessment of intracerebral hemorrhage, intraventricular hemorrhage, and perihematomal edema on CT. ⢠Non-contrast CT-based deep learning method can be helpful to provide efficient and accurate measurements of ICH, IVH, and PHE in primary ICH patients, thereby reducing the effort and time of doctors to segment and calculate volumes of ICH, IVH, and PHE in primary ICH patients.
Asunto(s)
Edema Encefálico , Aprendizaje Profundo , Hemorragia Cerebral/diagnóstico por imagen , Edema , Humanos , Hemorragias Intracraneales , Reproducibilidad de los ResultadosRESUMEN
Identification of glomerular lesions and structures is a key point for pathological diagnosis, treatment instructions, and prognosis evaluation in kidney diseases. These time-consuming tasks require a more accurate and reproducible quantitative analysis method. We established derivation and validation cohorts composed of 400 Chinese patients with immunoglobulin A nephropathy (IgAN) retrospectively. Deep convolutional neural networks and biomedical image processing algorithms were implemented to locate glomeruli, identify glomerular lesions (global and segmental glomerular sclerosis, crescent, and none of the above), identify and quantify different intrinsic glomerular cells, and assess a network-based mesangial hypercellularity score in periodic acid-Schiff (PAS)-stained slides. Our framework achieved 93.1% average precision and 94.9% average recall for location of glomeruli, and a total Cohen's kappa of 0.912 [95% confidence interval (CI), 0.892-0.932] for glomerular lesion classification. The evaluation of global, segmental glomerular sclerosis, and crescents achieved Cohen's kappa values of 1.0, 0.776, 0.861, and 95% CI of (1.0, 1.0), (0.727, 0.825), (0.824, 0.898), respectively. The well-designed neural network can identify three kinds of intrinsic glomerular cells with 92.2% accuracy, surpassing the about 5-11% average accuracy of junior pathologists. Statistical interpretation shows that there was a significant difference (P value < 0.0001) between this analytic renal pathology system (ARPS) and four junior pathologists for identifying mesangial and endothelial cells, while that for podocytes was similar, with P value = 0.0602. In addition, this study indicated that the ratio of mesangial cells, endothelial cells, and podocytes within glomeruli from IgAN was 0.41:0.36:0.23, and the performance of mesangial score assessment reached a Cohen's kappa of 0.42 and 95% CI (0.18, 0.69). The proposed computer-aided diagnosis system has feasibility for quantitative analysis and auxiliary recognition of glomerular pathological features. © 2020 The Authors. The Journal of Pathology published by John Wiley & Sons Ltd on behalf of Pathological Society of Great Britain and Ireland.
Asunto(s)
Aprendizaje Profundo , Glomerulonefritis por IGA/patología , Enfermedades Renales/diagnóstico , Glomérulos Renales/patología , Células Mesangiales/patología , Podocitos/patología , Adulto , Diagnóstico por Computador , Femenino , Humanos , Enfermedades Renales/patología , Masculino , Redes Neurales de la ComputaciónRESUMEN
OBJECTIVE: As the potential spread of COVID-19 sparked by imported cases from overseas will pose continuous challenges, it is essential to estimate the effects of control measures on reducing the importation risk of COVID-19. Our objective is to provide a framework of methodology for quantifying the combined effects of entry restrictions and travel quarantine on managing the importation risk of COVID-19 and other pandemics by leveraging different sets of parameters. METHODS: Three major categories of control measures on controlling importation risk were parameterized and modelled by the framework: 1) entry restrictions, 2) travel quarantine, and 3) domestic containment measures. Integrating the parameterized intensity of control measures, a modified SEIR model was developed to simulate the case importation and local epidemic under different scenarios of global epidemic dynamics. A web-based tool was also provided to enable interactive visualization of epidemic simulation. RESULTS: The simulated number of case importation and local spread modelled by the proposed framework of methods fitted well to the historical epidemic curve of China and Singapore. Based on the simulation results, the total numbers of infected cases when reducing 30% of visitor arrivals would be 88·4 (IQR 87·5-89·6) and 58·8 (IQR 58·3-59·5) times more than those when reducing 99% of visitor arrivals in mainland China and Singapore respectively, assuming actual time-varying Rt and travel quarantine policy. If the number of global daily new infections reached 100,000, 85%-91% of inbound travels should be reduced to keep the daily new infected number below 100 for a country with a similar travel volume as Singapore (daily 52,000 tourist arrivals in 2019). Whereas if the number was lower than 10,000, the daily new infected case would be less than 100 even with no entry restrictions. DISCUSSIONS: We proposed a framework that first estimated the intensity of travel restrictions and local containment measures for countries since the first overseas imported case. Our approach then quantified the combined effects of entry restrictions and travel quarantine using a modified SEIR model to simulate the potential epidemic spread under hypothetical intensities of these control measures. We also developed a web-based system that enables interactive simulation, which could serve as a valuable tool for health system administrators to assess policy effects on managing the importation risk. By leveraging different sets of parameters, it could adapt to any specific country and specific type of epidemic. CONCLUSIONS: This framework has provided a valuable tool to parameterize the intensity of control measures, simulate both the case importation and local epidemic, and quantify the combined effects of entry restrictions and travel quarantine on managing the importation risk.
Asunto(s)
COVID-19/prevención & control , Cuarentena , Viaje , China/epidemiología , Humanos , Singapur/epidemiologíaRESUMEN
PURPOSE: To predict short-term anti-vascular endothelial growth factor (anti-VEGF) treatment responder/non-responder for neovascular age-related macular degeneration (nAMD) patients based on optical coherence tomography (OCT) images. METHODS: A total of 4944 OCT scans from 206 patients with nAMD were involved to develop and evaluate a responder/non-responder prediction method for the short-term effect of anti-VEGF therapy. A deep learning architecture named sensitive structure guided network (SSG-Net) was proposed to make the prediction leveraging a sensitive structure guidance module trained from pre- and post-treatment images. To verify its clinical efficiency, other 2 deep learning methods and 4 experienced ophthalmologists were involved to evaluate the performance of the developed model. RESULTS: For the testing dataset, SSG-Net could predict the response by an accuracy of 84.6% and an area under the receiver curve (AUC) of 0.83, with a sensitivity of 0.692 and specificity of 1. In contrast, the 2 compared deep learning methods achieved an accuracy of 65.4% with a sensitivity of 0.461 and specificity of 0.846, and an accuracy of 73.1% with a sensitivity of 0.692 and specificity of 0.846, respectively. The predicted accuracy for 4 experienced ophthalmologists was 53.8 to 76.9%, with sensitivity of 0.538 to 0.923 and specificity of 0.385 to 0.846, respectively. CONCLUSION: Our proposed SSG-Net shows effective prediction on the short-term efficacy of anti-VEGF treatment for nAMD patients. This technique could potentially help clinicians explain the necessity of anti-VEGF treatment to the potential responder and avoid unnecessary treatment for the non-responder.