Your browser doesn't support javascript.
loading
Show: 20 | 50 | 100
Results 1 - 20 de 419
Filter
1.
Article in English | MEDLINE | ID: mdl-38972630

ABSTRACT

OBJECTIVE: Challenging infrarenal aortic neck characteristics have been associated with increased risk of a type Ia endoleak after endovascular aneurysm repair (EVAR). Short apposition (< 10 mm circumferential shortest apposition length [SAL]) on the first post-operative computerised tomography angiography (CTA) has been associated with type Ia endoleak. Therefore, this study aimed to develop a model to predict post-operative SAL in patients with an abdominal aortic aneurysm based on the pre-operative shape. METHODS: A statistical shape model was developed to obtain principal component scores. The dataset comprised patients treated with standard EVAR without complications (n = 93) enriched with patients with a late type Ia endoleak (n = 54). The infrarenal SAL was obtained from the first post-operative CTA and subsequently binarised (< 10 mm and ≥ 10 mm). The principal component scores that were statistically different between the SAL groups were used as input for five classification models, and evaluated by means of leave one out cross validation. Area under the receiver operating characteristics curves (AUC), accuracy, sensitivity, and specificity were determined for each classification model. RESULTS: Of the 147 patients, 24 patients had an infrarenal SAL < 10 mm and 123 patients had a SAL ≥ 10 mm. The gradient boosting model resulted in the highest AUC of 0.77. Using this model, 114 (78.0%) patients were correctly classified; sensitivity (< 10 mm apposition was correctly predicted) and specificity (≥ 10 mm apposition was correctly predicted) were 0.70 and 0.79, and were based on a threshold of 0.21, respectively. CONCLUSION: A model was developed to predict which patients undergoing EVAR will achieve sufficient graft apposition (≥ 10 mm) in the infrarenal aortic neck based on a statistical shape model of pre-operative CTA data. This model can help vascular specialists during the planning phase to accurately identify patients who are unlikely to achieve sufficient apposition after standard EVAR.

2.
BMC Womens Health ; 24(1): 393, 2024 Jul 08.
Article in English | MEDLINE | ID: mdl-38978015

ABSTRACT

BACKGROUND: Cervical cancer (CC) is among the most prevalent cancer types among women with the highest prevalence in low- and middle-income countries (LMICs). It is a curable disease if detected early. Machine learning (ML) techniques can aid in early detection and prediction thus reducing screening and treatment costs. This study focused on women living with HIV (WLHIV) in Uganda. Its aim was to identify the best predictors of CC and the supervised ML model that best predicts CC among WLHIV. METHODS: Secondary data that included 3025 women from three health facilities in central Uganda was used. A multivariate binary logistic regression and recursive feature elimination with random forest (RFERF) were used to identify the best predictors. Five models; logistic regression (LR), random forest (RF), K-Nearest neighbor (KNN), support vector machine (SVM), and multi-layer perceptron (MLP) were applied to identify the out-performer. The confusion matrix and the area under the receiver operating characteristic curve (AUC/ROC) were used to evaluate the models. RESULTS: The results revealed that duration on antiretroviral therapy (ART), WHO clinical stage, TPT status, Viral load status, and family planning were commonly selected by the two techniques and thus highly significant in CC prediction. The RF from the RFERF-selected features outperformed other models with the highest scores of 90% accuracy and 0.901 AUC. CONCLUSION: Early identification of CC and knowledge of the risk factors could help control the disease. The RF outperformed other models applied regardless of the selection technique used. Future research can be expanded to include ART-naïve women in predicting CC.


Subject(s)
HIV Infections , Uterine Cervical Neoplasms , Humans , Female , Uganda/epidemiology , Uterine Cervical Neoplasms/diagnosis , HIV Infections/drug therapy , Adult , Supervised Machine Learning , Middle Aged , Precancerous Conditions/diagnosis , Logistic Models , Algorithms , Support Vector Machine
3.
Hypertension ; 2024 Jul 16.
Article in English | MEDLINE | ID: mdl-39011632

ABSTRACT

Recent breakthroughs in artificial intelligence (AI) have caught the attention of many fields, including health care. The vision for AI is that a computer model can process information and provide output that is indistinguishable from that of a human and, in specific repetitive tasks, outperform a human's capability. The 2 critical underlying technologies in AI are used for supervised and unsupervised machine learning. Machine learning uses neural networks and deep learning modeled after the human brain from structured or unstructured data sets to learn, make decisions, and continuously improve the model. Natural language processing, used for supervised learning, is understanding, interpreting, and generating information using human language in chatbots and generative and conversational AI. These breakthroughs result from increased computing power and access to large data sets, setting the stage for releasing large language models, such as ChatGPT and others, and new imaging models using computer vision. Hypertension management involves using blood pressure and other biometric data from connected devices and generative AI to communicate with patients and health care professionals. AI can potentially improve hypertension diagnosis and treatment through remote patient monitoring and digital therapeutics.

4.
Nat Sci Sleep ; 16: 699-710, 2024.
Article in English | MEDLINE | ID: mdl-38863481

ABSTRACT

Purpose: Body-worn accelerometers are commonly used to estimate sleep duration in population-based studies. However, since accelerometry-based sleep/wake-scoring relies on detecting body movements, the prediction of sleep duration remains a challenge. The aim was to develop and evaluate the performance of a machine learning (ML) model to predict accelerometry-based sleep duration and to explore if this prediction can be improved by adding skin temperature data, circadian rhythm based on the estimated midpoint of sleep, and cyclic time features to the model. Patients and Methods: Twenty-nine adults (17 females), mean (SD) age 40.2 (15.0) years (range 17-70) participated in the study. Overnight polysomnography (PSG) was recorded in a sleep laboratory or at home along with body movement by two accelerometers with an embedded skin temperature sensor (AX3, Axivity, UK) positioned at the low back and thigh. The PSG scoring of sleep/wake was used as ground truth for training the ML model. Results: Based on pure accelerometer data input to the ML model, the specificity and sensitivity for predicting sleep/wake was 0.52 (SD 0.24) and 0.95 (SD 0.03), respectively. Adding skin temperature data and contextual information to the ML model improved the specificity to 0.72 (SD 0.20), while sensitivity remained unchanged at 0.95 (SD 0.05). Correspondingly, sleep overestimation was reduced from 54 min (228 min, limits of agreement range [LoAR]) to 19 min (154 min LoAR). Conclusion: An ML model can predict sleep/wake periods with excellent sensitivity and moderate specificity based on a dual-accelerometer set-up when adding skin temperature data and contextual information to the model.

5.
Front Mol Neurosci ; 17: 1398447, 2024.
Article in English | MEDLINE | ID: mdl-38854587

ABSTRACT

The functionality of photoreceptors, rods, and cones is highly dependent on their outer segments (POS), a cellular compartment containing highly organized membranous structures that generate biochemical signals from incident light. While POS formation and degeneration are qualitatively assessed on microscopy images, reliable methodology for quantitative analyses is still limited. Here, we developed methods to quantify POS (QuaPOS) maturation and quality on retinal sections using automated image analyses. POS formation was examined during the development and in adulthood of wild-type mice via light microscopy (LM) and transmission electron microscopy (TEM). To quantify the number, size, shape, and fluorescence intensity of POS, retinal cryosections were immunostained for the cone POS marker S-opsin. Fluorescence images were used to train the robust classifier QuaPOS-LM based on supervised machine learning for automated image segmentation. Characteristic features of segmentation results were extracted to quantify the maturation of cone POS. Subsequently, this quantification method was applied to characterize POS degeneration in "cone photoreceptor function loss 1" mice. TEM images were used to establish the ultrastructural quantification method QuaPOS-TEM for the alignment of POS membranes. Images were analyzed using a custom-written MATLAB code to extract the orientation of membranes from the image gradient and their alignment (coherency). This analysis was used to quantify the POS morphology of wild-type and two inherited retinal degeneration ("retinal degeneration 19" and "rhodopsin knock-out") mouse lines. Both automated analysis technologies provided robust characterization and quantification of POS based on LM or TEM images. Automated image segmentation by the classifier QuaPOS-LM and analysis of the orientation of membrane stacks by QuaPOS-TEM using fluorescent or TEM images allowed quantitative evaluation of POS formation and quality. The assessments showed an increase in POS number, volume, and membrane coherency during wild-type postnatal development, while a decrease in all three observables was detected in different retinal degeneration mouse models. All the code used for the presented analysis is open source, including example datasets to reproduce the findings. Hence, the QuaPOS quantification methods are useful for in-depth characterization of POS on retinal sections in developmental studies, for disease modeling, or after therapeutic interventions affecting photoreceptors.

6.
J Neural Eng ; 21(3)2024 Jun 26.
Article in English | MEDLINE | ID: mdl-38885675

ABSTRACT

Objective. To demonstrate the capability of utilizing graph feature-based supervised machine learning (ML) algorithm on intracranial electroencephalogram recordings for the identification of seizure onset zones (SOZs) in individuals with drug-resistant epilepsy.Approach. Utilizing three model-free measures of effective connectivity (EC)-directed information, mutual information-guided Granger causality index (MI-GCI), and frequency-domain convergent cross-mapping (FD-CCM) - directed graphs are generated. Graph centrality measures at different sparsity are used as the classifier's features.Main results. The centrality features achieve high accuracies exceeding 90% in distinguishing SOZ electrodes from non-SOZ electrodes. Notably, a sparse graph representation with just ten features and simple ML models effectively achieves such performance. The study identifies FD-CCM centrality measures as particularly significant, with a mean AUC of 0.93, outperforming prior literature. The FD-CCM-based graph modeling also highlights elevated centrality measures among SOZ electrodes, emphasizing heightened activity relative to non-SOZ electrodes during ictogenesis.Significance. This research not only underscores the efficacy of automated SOZ identification but also illuminates the potential of specific EC measures in enhancing discriminative power within the context of epilepsy research.


Subject(s)
Brain , Electrocorticography , Seizures , Humans , Seizures/physiopathology , Seizures/diagnosis , Electrocorticography/methods , Brain/physiopathology , Brain/physiology , Nerve Net/physiopathology , Drug Resistant Epilepsy/physiopathology , Male , Female , Electroencephalography/methods , Adult , Supervised Machine Learning , Young Adult , Algorithms , Adolescent
7.
BMC Bioinformatics ; 25(1): 218, 2024 Jun 19.
Article in English | MEDLINE | ID: mdl-38898392

ABSTRACT

BACKGROUND: Compared to traditional supervised machine learning approaches employing fully labeled samples, positive-unlabeled (PU) learning techniques aim to classify "unlabeled" samples based on a smaller proportion of known positive examples. This more challenging modeling goal reflects many real-world scenarios in which negative examples are not available-posing direct challenges to defining prediction accuracy and robustness. While several studies have evaluated predictions learned from only definitive positive examples, few have investigated whether correct classification of a high proportion of known positives (KP) samples from among unlabeled samples can act as a surrogate to indicate model quality. RESULTS: In this study, we report a novel methodology combining multiple established PU learning-based strategies with permutation testing to evaluate the potential of KP samples to accurately classify unlabeled samples without using "ground truth" positive and negative labels for validation. Multivariate synthetic and real-world high-dimensional benchmark datasets were employed to demonstrate the suitability of the proposed pipeline to provide evidence of model robustness across varied underlying ground truth class label compositions among the unlabeled set and with different proportions of KP examples. Comparisons between model performance with actual and permuted labels could be used to distinguish reliable from unreliable models. CONCLUSIONS: As in fully supervised machine learning, permutation testing offers a means to set a baseline "no-information rate" benchmark in the context of semi-supervised PU learning inference tasks-providing a standard against which model performance can be compared.


Subject(s)
Machine Learning , Supervised Machine Learning , Humans , Computational Biology/methods , Algorithms
8.
Autism Res ; 2024 Jun 24.
Article in English | MEDLINE | ID: mdl-38925611

ABSTRACT

Autism spectrum disorder (ASD) is a heterogeneous, early-onset neurodevelopmental condition characterized by persistent impairments in social interaction and communication. This study aims to delineate ASD subtypes based on individual gray matter brain networks and provide new insights from a graph theory perspective. In this study, we extracted and normalized single-subject gray matter networks and calculated each network's topological properties. The heterogeneity through discriminative analysis (HYDRA) method was utilized to subtype all patients based on network properties. Next, we explored the differences among ASD subtypes in terms of network properties and clinical measures. Our investigation identified three distinct ASD subtypes. In the case-control study, these subtypes exhibited significant differences, particularly in the precentral gyrus, lingual gyrus, and middle frontal gyrus. In the case analysis, significant differences in global and nodal properties were observed between any two subtypes. Clinically, subtype 1 showed lower VIQ and PIQ compared to subtype 3, but exhibited higher scores in ADOS-Communication and ADOS-Total compared to subtype 2. The results highlight the distinct brain network properties and behaviors among different subtypes of male patients with ASD, providing valuable insights into the neural mechanisms underlying ASD heterogeneity.

9.
BMC Microbiol ; 24(1): 162, 2024 May 10.
Article in English | MEDLINE | ID: mdl-38730339

ABSTRACT

BACKGROUND: Coastal areas are subject to various anthropogenic and natural influences. In this study, we investigated and compared the characteristics of two coastal regions, Andhra Pradesh (AP) and Goa (GA), focusing on pollution, anthropogenic activities, and recreational impacts. We explored three main factors influencing the differences between these coastlines: The Bay of Bengal's shallower depth and lower salinity; upwelling phenomena due to the thermocline in the Arabian Sea; and high tides that can cause strong currents that transport pollutants and debris. RESULTS: The microbial diversity in GA was significantly higher than that in AP, which might be attributed to differences in temperature, soil type, and vegetation cover. 16S rRNA amplicon sequencing and bioinformatics analysis indicated the presence of diverse microbial phyla, including candidate phyla radiation (CPR). Statistical analysis, random forest regression, and supervised machine learning models classification confirm the diversity of the microbiome accurately. Furthermore, we have identified 450 cultures of heterotrophic, biotechnologically important bacteria. Some strains were identified as novel taxa based on 16S rRNA gene sequencing, showing promising potential for further study. CONCLUSION: Thus, our study provides valuable insights into the microbial diversity and pollution levels of coastal areas in AP and GA. These findings contribute to a better understanding of the impact of anthropogenic activities and climate variations on biology of coastal ecosystems and biodiversity.


Subject(s)
Bacteria , Bays , Microbiota , Phylogeny , RNA, Ribosomal, 16S , Seawater , Supervised Machine Learning , RNA, Ribosomal, 16S/genetics , Bacteria/classification , Bacteria/genetics , Bacteria/isolation & purification , Microbiota/genetics , Seawater/microbiology , India , Bays/microbiology , Biodiversity , DNA, Bacterial/genetics , Salinity , Sequence Analysis, DNA/methods
10.
J Forensic Odontostomatol ; 42(1): 22-29, 2024 Apr 30.
Article in English | MEDLINE | ID: mdl-38742569

ABSTRACT

BACKGROUND: The utilization of segmentation method using volumetric data in adults dental age estimation (DAE) from cone-beam computed tomography (CBCT) was further expanded by using current 5-Part Tooth Segmentation (SG) method. Additionally, supervised machine learning modelling -namely support vector regression (SVR) with linear and polynomial kernel, and regression tree - was tested and compared with the multiple linear regression model. MATERIAL AND METHODS: CBCT scans from 99 patients aged between 20 to 59.99 was collected. Eighty eligible teeth including maxillary canine, lateral incisor, and central incisor were used in this study. Enamel to dentine volume ratio, pulp to dentine volume ratio, lower tooth volume ratio, and sex was utilized as independent variable to predict chronological age. RESULTS: No multicollinearity was detected in the models. The best performing model comes from maxillary lateral incisor using SVR with polynomial kernel ( = 0.73). The lowest error rate achieved by the model was given also by maxillary lateral incisor, with 4.86 years of mean average error and 6.05 years of root means squared error. However, demands a complex approach to segment the enamel volume in the crown section and a lengthier labour time of 45 minutes per tooth.


Subject(s)
Age Determination by Teeth , Cone-Beam Computed Tomography , Machine Learning , Humans , Adult , Age Determination by Teeth/methods , Male , Female , Young Adult , Middle Aged , Dental Enamel/diagnostic imaging , Dentin/diagnostic imaging , Linear Models , Dental Pulp/diagnostic imaging , Support Vector Machine
11.
Int J Nurs Stud ; 156: 104797, 2024 Aug.
Article in English | MEDLINE | ID: mdl-38788263

ABSTRACT

BACKGROUND: ICU readmissions and post-discharge mortality pose significant challenges. Previous studies used EHRs and machine learning models, but mostly focused on structured data. Nursing records contain crucial unstructured information, but their utilization is challenging. Natural language processing (NLP) can extract structured features from clinical text. This study proposes the Crucial Nursing Description Extractor (CNDE) to predict post-ICU discharge mortality rates and identify high-risk patients for unplanned readmission by analyzing electronic nursing records. OBJECTIVE: Developed a deep neural network (NurnaNet) with the ability to perceive nursing records, combined with a bio-clinical medicine pre-trained language model (BioClinicalBERT) to analyze the electronic health records (EHRs) in the MIMIC III dataset to predict the death of patients within six month and two year risk. DESIGN: A cohort and system development design was used. SETTING(S): Based on data extracted from MIMIC-III, a database of critically ill in the US between 2001 and 2012, the results were analyzed. PARTICIPANTS: We calculated patients' age using admission time and date of birth information from the MIMIC dataset. Patients under 18 or over 89 years old, or who died in the hospital, were excluded. We analyzed 16,973 nursing records from patients' ICU stays. METHODS: We have developed a technology called the Crucial Nursing Description Extractor (CNDE), which extracts key content from text. We use the logarithmic likelihood ratio to extract keywords and combine BioClinicalBERT. We predict the survival of discharged patients after six months and two years and evaluate the performance of the model using precision, recall, the F1-score, the receiver operating characteristic curve (ROC curve), the area under the curve (AUC), and the precision-recall curve (PR curve). RESULTS: The research findings indicate that NurnaNet achieved good F1-scores (0.67030, 0.70874) within six months and two years. Compared to using BioClinicalBERT alone, there was an improvement in performance of 2.05 % and 1.08 % for predictions within six months and two years, respectively. CONCLUSIONS: CNDE can effectively reduce long-form records and extract key content. NurnaNet has a good F1-score in analyzing the data of nursing records, which helps to identify the risk of death of patients after leaving the hospital and adjust the regular follow-up and treatment plan of relevant medical care as soon as possible.


Subject(s)
Neural Networks, Computer , Patient Discharge , Humans , Patient Discharge/statistics & numerical data , Nursing Records , Electronic Health Records , Middle Aged , Female , Aged , Male , Risk Assessment/methods , Natural Language Processing , Cohort Studies
12.
J Thromb Haemost ; 22(7): 1997-2008, 2024 Jul.
Article in English | MEDLINE | ID: mdl-38642704

ABSTRACT

BACKGROUND: Thus far, all the clinical models developed to predict major bleeding in patients on extended anticoagulation therapy use the baseline predictors to stratify patients into different risk groups. Therefore, these models do not account for the clinical changes and events that occur after the baseline visit, which can modify risk of bleeding. However, it is difficult to develop predictive models from the routine follow-up clinical interviews, which are irregular sequences of multivariate time series data. OBJECTIVES: To demonstrate that deep learning can incorporate patient time series follow-up data to improve prediction of major bleeding. METHODS: We used the baseline and follow-up data that were collected over 8 years in a longitudinal cohort study of 2542 patients, of whom 118 had major bleeding. Four supervised neural network-based machine-learning models were trained on the baseline, follow-up, or both datasets using 70% of the data. The performance of these models was evaluated, along with modified versions of 6 previously developed clinical models, on the remaining 30% of the data. RESULTS: An ensemble of feedforward and recurrent neural networks that used the baseline and follow-up data was the best-performing model, achieving a sensitivity and a specificity of 61% and 82%, respectively, in identifying major bleeding, and it outperformed the previously developed clinical models in terms of area under the receiver operating characteristic curve (82%) and area under the precision-recall curve (14%). CONCLUSION: Time series follow-up data can improve major bleeding prediction in patients on extended anticoagulation therapy.


Subject(s)
Anticoagulants , Deep Learning , Hemorrhage , Humans , Anticoagulants/adverse effects , Anticoagulants/administration & dosage , Hemorrhage/chemically induced , Male , Female , Aged , Risk Assessment , Time Factors , Risk Factors , Middle Aged , Longitudinal Studies , Predictive Value of Tests , Drug Administration Schedule , Treatment Outcome , Neural Networks, Computer , Aged, 80 and over
14.
Pragmat Obs Res ; 15: 65-78, 2024.
Article in English | MEDLINE | ID: mdl-38559704

ABSTRACT

Background: Lack of body mass index (BMI) measurements limits the utility of claims data for bariatric surgery research, but pre-operative BMI may be imputed due to existence of weight-related diagnosis codes and BMI-related reimbursement requirements. We used a machine learning pipeline to create a claims-based scoring system to predict pre-operative BMI, as documented in the electronic health record (EHR), among patients undergoing a new bariatric surgery. Methods: Using the Optum Labs Data Warehouse, containing linked de-identified claims and EHR data for commercial or Medicare Advantage enrollees, we identified adults undergoing a new bariatric surgery between January 2011 and June 2018 with a BMI measurement in linked EHR data ≤30 days before the index surgery (n=3226). We constructed predictors from claims data and applied a machine learning pipeline to create a scoring system for pre-operative BMI, the B3S3. We evaluated the B3S3 and a simple linear regression model (benchmark) in test patients whose index surgery occurred concurrent (2011-2017) or prospective (2018) to the training data. Results: The machine learning pipeline yielded a final scoring system that included weight-related diagnosis codes, age, and number of days hospitalized and distinct drugs dispensed in the past 6 months. In concurrent test data, the B3S3 had excellent performance (R2 0.862, 95% confidence interval [CI] 0.815-0.898) and calibration. The benchmark algorithm had good performance (R2 0.750, 95% CI 0.686-0.799) and calibration but both aspects were inferior to the B3S3. Findings in prospective test data were similar. Conclusion: The B3S3 is an accessible tool that researchers can use with claims data to obtain granular and accurate predicted values of pre-operative BMI, which may enhance confounding control and investigation of effect modification by baseline obesity levels in bariatric surgery studies utilizing claims data.


Pre-operative BMI is an important potential confounder in comparative effectiveness studies of bariatric surgeries.Claims data lack clinical measurements, but insurance reimbursement requirements for bariatric surgery often result in pre-operative BMI being coded in claims data.We used a machine learning pipeline to create a model, the B3S3, to predict pre-operative BMI, as documented in the EHR, among bariatric surgery patients based on the presence of certain weight-related diagnosis codes and other patient characteristics derived from claims data.Researchers can easily use the B3S3 with claims data to obtain granular and accurate predicted values of pre-operative BMI among bariatric surgery patients.

15.
Proc (Bayl Univ Med Cent) ; 37(3): 437-447, 2024.
Article in English | MEDLINE | ID: mdl-38628340

ABSTRACT

Background: Acute pancreatitis (AP) is a complex and life-threatening disease. Early recognition of factors predicting morbidity and mortality is crucial. We aimed to develop and validate a pragmatic model to predict the individualized risk of early intensive care unit (ICU) admission for patients with AP. Methods: The 2019 Nationwide Readmission Database was used to identify patients hospitalized with a primary diagnosis of AP without ICU admission. A matched comparison cohort of AP patients with ICU admission within 7 days of hospitalization was identified from the National Inpatient Sample after 1:N propensity score matching. The least absolute shrinkage and selection operator (LASSO) regression was used to select predictors and develop an ICU acute pancreatitis risk (IAPR) score validated by 10-fold cross-validation. Results: A total of 1513 patients hospitalized for AP were included. The median age was 50.0 years (interquartile range: 39.0-63.0). The three predictors that were selected included hypoxia (area under the curve [AUC] 0.78), acute kidney injury (AUC 0.72), and cardiac arrhythmia (AUC 0.61). These variables were used to develop a nomogram that displayed excellent discrimination (AUC 0.874) (bootstrap bias-corrected 95% confidence interval 0.824-0.876). There was no evidence of miscalibration (test statistic = 2.88; P = 0.09). For high-risk patients (total score >6 points), the sensitivity was 68.94% and the specificity was 92.66%. Conclusions: This supervised machine learning-based model can help recognize high-risk AP hospitalizations. Clinicians may use the IAPR score to identify patients with AP at high risk of ICU admission within the first week of hospitalization.

16.
Appl Spectrosc ; 78(5): 456-476, 2024 May.
Article in English | MEDLINE | ID: mdl-38439705

ABSTRACT

Here, Raman spectroscopy is used to develop a univariate partial least squares (PLS) calibration capable of quantifying geochemistry in synthetic and natural silicate glass samples. The calibration yields eight oxide-specific models that allow predictions of silicon dioxide (SiO2), sodium oxide (Na2O), potassium oxide (K2O), calcium oxide (CaO), titanium dioxide (TiO2), aluminum oxide (Al2O3), ferrous oxide (FeOT), and magnesium oxide (MgO) (wt%) in glasses spanning a wide range of compositions, while also providing correlation-coefficient matrices that highlight the importance of specific Raman channels in the regression of a particular oxide. The PLS suite is trained on 48 of the 69 total glasses, and tested against 21 validation samples (i.e., held out of training). Trends in root mean square error of calibration (RMSEC), root mean square error of cross-validation (RMSECV), and root mean square error of prediction (RMSEP) model accuracy metrics are investigated to uncover the efficacy of utilizing multivariate analysis for such Raman data and are contextualized against recently produced strategies. The technique yields an average root mean of calibration (∼2.4 wt%), cross-validation (∼ 2.9 wt%), prediction (∼ 2.6 wt%), and normalized variance (∼ 28%). Raman band positional shifts are also mapped against underlying chemical variations; with major influences arising primarily as a function of overall oxidation state and silica concentration: via ferric cation (Fe3+)/ferrous cation (Fe2+) ratios and SiO2 (wt%). The algorithm is further validated preliminarily against a separate external set of 11 natural basaltic glasses to unravel the limitations of the synthetic models on natural samples, and to determine the suitability of "universal" Raman-model applications in scenarios where prior chemical contextualization of the target sample is possible. This study represents the first time Raman spectra of amorphous silicates have been paired with PLS, offering a foundation for future improvements utilizing these systems.

17.
J Dent ; 144: 104921, 2024 05.
Article in English | MEDLINE | ID: mdl-38437976

ABSTRACT

OBJECTIVES: This study aimed to identify predictors associated with the tooth loss phenotype in a large periodontitis patient cohort in the university setting. METHODS: Information on periodontitis patients and nineteen factors identified at the initial visit was extracted from electronic health records. The primary outcome is tooth loss phenotype (presence or absence of tooth loss). Prediction models were built on significant factors (single or combinatory) selected by the RuleFit algorithm, and these factors were further adopted by regression models. Model performance was evaluated by Area Under the Receiver Operating Characteristic Curve (AUROC) and Area Under the Precision-Recall Curve (AUPRC). Associations between predictors and the tooth loss phenotype were also evaluated by classical statistical approaches to validate the performance of machine learning models. RESULTS: In total, 7840 patients were included. The machine learning model predicting the tooth loss phenotype achieved AUROC of 0.71 and AUPRC of 0.66. Age, periodontal diagnosis, number of missing teeth at baseline, furcation involvement, and tooth mobility were associated with the tooth loss phenotype in both machine learning and classical statistical models. CONCLUSIONS: The rule-based machine learning approach improves model explainability compared to classical statistical methods. However, the model's generalizability needs to be further validated by external datasets. CLINICAL SIGNIFICANCE: Predictors identified by the current machine learning approach using the RuleFit algorithm had clinically relevant thresholds in predicting the tooth loss phenotype in a large and diverse periodontitis patient cohort. The results of this study will assist clinicians in performing risk assessment for periodontitis at the initial visit.


Subject(s)
Machine Learning , Periodontitis , Phenotype , Tooth Loss , Humans , Male , Female , Periodontitis/complications , Middle Aged , Adult , ROC Curve , Tooth Mobility , Risk Factors , Algorithms , Electronic Health Records , Cohort Studies , Area Under Curve , Furcation Defects , Aged
18.
Front Psychiatry ; 15: 1356843, 2024.
Article in English | MEDLINE | ID: mdl-38516261

ABSTRACT

Introduction: Comorbid substance use disorder (SUD) is linked to a higher risk of violence in patients with schizophrenia spectrum disorder (SSD). The objective of this study is to explore the most distinguishing factors between offending and non-offending patients diagnosed with SSD and comorbid SUD using supervised machine learning. Methods: A total of 269 offender patients and 184 non-offender patients, all diagnosed with SSD and SUD, were assessed using supervised machine learning algorithms. Results: Failures during opening, referring to rule violations during a permitted temporary leave from an inpatient ward or during the opening of an otherwise closed ward, was found to be the most influential distinguishing factor, closely followed by non-compliance with medication (in the psychiatric history). Following in succession were social isolation in the past, no antipsychotics prescribed (in the psychiatric history), and no outpatient psychiatric treatments before the current hospitalization. Discussion: This research identifies critical factors distinguishing offending patients from non-offending patients with SSD and SUD. Among various risk factors considered in prior research, this study emphasizes treatment-related differences between the groups, indicating the potential for improvement regarding access and maintenance of treatment in this particular population. Further research is warranted to explore the relationship between social isolation and delinquency in this patient population.

19.
PeerJ Comput Sci ; 10: e1896, 2024.
Article in English | MEDLINE | ID: mdl-38435625

ABSTRACT

Diabetes is a metabolic disorder that affects more than 420 million of people worldwide, and it is caused by the presence of a high level of sugar in blood for a long period. Diabetes can have serious long-term health consequences, such as cardiovascular diseases, strokes, chronic kidney diseases, foot ulcers, retinopathy, and others. Even if common, this disease is uneasy to spot, because it often comes with no symptoms. Especially for diabetes type 2, that happens mainly in the adults, knowing how long the diabetes has been present for a patient can have a strong impact on the treatment they can receive. This information, although pivotal, might be absent: for some patients, in fact, the year when they received the diabetes diagnosis might be well-known, but the year of the disease unset might be unknown. In this context, machine learning applied to electronic health records can be an effective tool to predict the past duration of diabetes for a patient. In this study, we applied a regression analysis based on several computational intelligence methods to a dataset of electronic health records of 73 patients with diabetes type 1 with 20 variables and another dataset of records of 400 patients of diabetes type 2 with 49 variables. Among the algorithms applied, Random Forests was able to outperform the other ones and to efficiently predict diabetes duration for both the cohorts, with the regression performances measured through the coefficient of determination R2. Afterwards, we applied the same method for feature ranking, and we detected the most relevant factors of the clinical records correlated with past diabetes duration: age, insulin intake, and body-mass index. Our study discoveries can have profound impact on clinical practice: when the information about the duration of diabetes of patient is missing, medical doctors can use our tool and focus on age, insulin intake, and body-mass index to infer this important aspect. Regarding limitations, unfortunately we were unable to find additional dataset of EHRs of patients with diabetes having the same variables of the two analyzed here, so we could not verify our findings on a validation cohort.

20.
Biotechnol Prog ; 40(3): e3436, 2024.
Article in English | MEDLINE | ID: mdl-38357841

ABSTRACT

Although the contributions of individual components of cell culture media are largely known, their combinatorial effects are far less understood. Experiments varying one component at a time cannot identify combinatorial effects, and analysis of the large number of experiments required to decipher such effects is challenging. Machine learning algorithms can help in the analysis of such datasets to identify multi-component interactions. Zinc toxicity in vitro is known to change depending on amino acid concentration in the extracellular medium. Multiple amino acids are known to be involved in this protection. Thirty-two amino acid compositions were formulated to evaluate their effect on the growth of CHO cells under high zinc conditions. A sequential machine learning analysis methodology was used, which led to the identification of a set of amino acids (threonine, proline, glutamate, aspartate, asparagine, and tryptophan) contributing to protection from zinc. Our results suggest that a decrease in availability of these set of amino acids due to consumption may affect cell growth in media formulated with high zinc concentrations, and in contrast, normal levels of these amino acids are associated with better tolerance to high zinc concentration. Our sequential analysis method may be similarly employed for high throughput medium design and optimization experiments to identify interactions among a large number of cell culture medium components.


Subject(s)
Amino Acids , Cell Proliferation , Cricetulus , Machine Learning , Zinc , CHO Cells , Amino Acids/pharmacology , Amino Acids/chemistry , Animals , Zinc/pharmacology , Zinc/chemistry , Cell Proliferation/drug effects , Culture Media/chemistry
SELECTION OF CITATIONS
SEARCH DETAIL