Your browser doesn't support javascript.
loading
Show: 20 | 50 | 100
Results 1 - 20 de 11.012
Filter
1.
Food Chem ; 462: 140931, 2025 Jan 01.
Article in English | MEDLINE | ID: mdl-39217752

ABSTRACT

This research focused on distinguishing distinct matrix assisted laser desorption/ionization time-of-flight mass spectrometry (MALDI-TOF MS) spectral signatures of three Enterococcus species. We evaluated and compared the predictive performance of four supervised machine learning algorithms, K-nearest neighbor (KNN), support vector machine (SVM), and random forest (RF), to accurately classify Enterococcus species. This study involved a comprehensive dataset of 410 strains, generating 1640 individual spectra through on-plate and off-plate protein extraction methods. Although the commercial database correctly identified 76.9% of the strains, machine learning classifiers demonstrated superior performance (accuracy 0.991). In the RF model, top informative peaks played a significant role in the classification. Whole-genome sequencing showed that the most informative peaks are biomarkers connected to proteins, which are essential for understanding bacterial classification and evolution. The integration of MALDI-TOF MS and machine learning provides a rapid and accurate method for identifying Enterococcus species, improving healthcare and food safety.


Subject(s)
Enterococcus , Spectrometry, Mass, Matrix-Assisted Laser Desorption-Ionization , Supervised Machine Learning , Spectrometry, Mass, Matrix-Assisted Laser Desorption-Ionization/methods , Enterococcus/classification , Enterococcus/chemistry , Enterococcus/isolation & purification , Enterococcus/genetics , Algorithms , Support Vector Machine , Bacterial Typing Techniques/methods , Machine Learning
2.
Langenbecks Arch Surg ; 409(1): 288, 2024 Sep 24.
Article in English | MEDLINE | ID: mdl-39316140

ABSTRACT

OBJECTIVES: This study aimed to assess the diagnostic performance of a support vector machine (SVM) algorithm for acute cholecystitis and evaluate its effectiveness in accurately diagnosing this condition. METHODS: Using a retrospective analysis of patient data from a single center, individuals with abdominal pain lasting one week or less were included. The SVM model was trained and optimized using standard procedures. Model performance was assessed through sensitivity, specificity, accuracy, and AUC-ROC, with probability calibration evaluated using the Brier score. RESULTS: Among 534 patients, 198 (37.07%) were diagnosed with acute cholecystitis. The SVM model showed balanced performance, with a sensitivity of 83.08% (95% CI: 71.73-91.24%), a specificity of 80.21% (95% CI: 70.83-87.64%), and an accuracy of 81.37% (95% CI: 74.48-87.06%). The positive predictive value (PPV) was 73.97% (95% CI: 65.18-81.18%), the negative predictive value (NPV) was 87.50% (95% CI: 80.19-92.37%), and the AUC-ROC was 0.89 (95% CI: 0.85 to 0.93). The Brier score indicated well-calibrated probability estimates. CONCLUSION: The SVM algorithm demonstrated promising potential for accurately diagnosing acute cholecystitis. Further refinement and validation are needed to enhance its reliability in clinical practice.


Subject(s)
Artificial Intelligence , Cholecystitis, Acute , Emergency Service, Hospital , Support Vector Machine , Humans , Cholecystitis, Acute/diagnosis , Female , Retrospective Studies , Male , Middle Aged , Aged , Adult , Sensitivity and Specificity , Algorithms , Predictive Value of Tests , Aged, 80 and over
3.
Chem Biol Drug Des ; 104(3): e14627, 2024 Sep.
Article in English | MEDLINE | ID: mdl-39317691

ABSTRACT

Breast cancer (BC) is one of the leading causes of high mortality rates in women worldwide. Although advancements have been made in the design of therapeutic strategies and drug discovery, drug resistance remains one of the key challenges. One of the ways to overcome drug resistance is finding potential drug combinations since the efficacy of combined drugs is higher than their individual efficacies if the combination is a synergistic pair. Therefore, the current study uses a BC patient-derived xenograft (PDX) dataset to evaluate the effects of various cancer drugs on breast cancer in vivo models. The drug effects are further validated by four machine learning models, namely Elastic Net, Least Absolute Shrinkage and Selection (LASSO), Support Vector Machine (SVM), Random Forests (RF), as well as exploring the shortlisted drugs in combination with paclitaxel, a baseline drug for enhanced efficacy on tumor volume reduction. Additionally, the study also shortlists the top 50 in vivo biomarkers correlated with the effects of the drugs. The outcomes could be significantly important for the design of an effective anti-breast cancer therapy.


Subject(s)
Biomarkers, Tumor , Breast Neoplasms , Paclitaxel , Paclitaxel/pharmacology , Paclitaxel/therapeutic use , Breast Neoplasms/drug therapy , Breast Neoplasms/pathology , Humans , Female , Animals , Biomarkers, Tumor/metabolism , Mice , Support Vector Machine , Xenograft Model Antitumor Assays , Machine Learning , Antineoplastic Combined Chemotherapy Protocols/pharmacology , Antineoplastic Combined Chemotherapy Protocols/therapeutic use
4.
Sci Rep ; 14(1): 22450, 2024 Sep 28.
Article in English | MEDLINE | ID: mdl-39341981

ABSTRACT

Hepatic ischemia-reperfusion injury (HIRI) may cause severe hepatic impairment, acute hepatic insufficiency, and multiorgan system collapse. Exosomes can alleviate HIRI. Therefore, this study explored the role of exosomal-related genes (ERGs) in HIRI using bioinformatics to determine the underlying molecular mechanisms and novel diagnostic markers for HIRI. We merged the GSE12720, GSE14951, and GSE15480 datasets obtained from the Gene Expression Omnibus (GEO) database into a combined gene dataset (CGD). CGD was used to identify differentially expressed genes (DEGs) based on a comparison of the HIRI and healthy control cohorts. The impact of these DEGs on HIRI was assessed through gene set enrichment analysis (GSEA) and gene set variation analysis (GSVA). ERGs were retrieved from the GeneCards database and prior studies, and overlapped with the identified DEGs to yield the set of exosome-related differentially expressed genes (ERDEGs). Functional annotations and enrichment pathways of these genes were determined using Gene Ontology (GO) and Kyoto Encyclopedia of Genes and Genomes (KEGG) analyses. Diagnostic models for HIRI were developed using least absolute shrinkage and selection operator (LASSO) regression and support vector machine (SVM) algorithms. Key genes with diagnostic value were identified from the overlap, and single-sample gene-set enrichment analysis (ssGSEA) was conducted to evaluate the immune infiltration characteristics. A molecular regulatory interaction network was established using Cytoscape software to elucidate the intricate regulatory mechanisms of key genes in HIRI. Finally, exosome score (Es) was obtained using ssGSEA and the HIRI group was divided into the Es_High and Es_Low groups based on the median Es. Gene expression was analyzed to understand the impact of all genes in the CGD on HIRI. Finally, the relative expression levels of the five key genes in the hypoxia-reoxygenation (H/R) model were determined using quantitative real-time PCR (qRT-PCR). A total of 3810 DEGs were identified through differential expression analysis of the CGD, and 61 of these ERDEGs were screened. Based on GO and KEGG enrichment analyses, the ERDEGs were mainly enriched in wound healing, MAPK, protein kinase B signaling, and other pathways. GSEA and GSVA revealed that these genes were mainly enriched in the TP53, MAPK, TGF[Formula: see text], JAK-STAT, MAPK, and NFKB pathways. Five key genes (ANXA1, HNRNPA2B1, ICAM1, PTEN, and THBS1) with diagnostic value were screened using the LASSO regression and SVM algorithms and their molecular interaction network was established using Cytoscape software. Based on ssGSEA, substantial variations were found in the expression of 18 immune cell types among the groups (p < 0.05). Finally, the Es of each HIRI patient was calculated. ERDEGs in the Es_High and Es_Low groups were enriched in the IL18, TP53, MAPK, TGF[Formula: see text], and JAK-STAT pathways. The differential expression of these five key genes in the H/R model was verified using qRT-PCR. Herein, five key genes were identified as potential diagnostic markers. Moreover, the potential impact of these genes on pathways and the regulatory mechanisms of their interaction network in HIRI were revealed. Altogether, our findings may serve as a theoretical foundation for enhancing clinical diagnosis and elucidating underlying pathogeneses.


Subject(s)
Computational Biology , Exosomes , Gene Expression Profiling , Liver , Reperfusion Injury , Exosomes/genetics , Exosomes/metabolism , Humans , Reperfusion Injury/genetics , Liver/metabolism , Liver/pathology , Computational Biology/methods , Gene Ontology , Databases, Genetic , Gene Regulatory Networks , Support Vector Machine , Biomarkers/metabolism
5.
Cardiovasc Diabetol ; 23(1): 351, 2024 Sep 28.
Article in English | MEDLINE | ID: mdl-39342281

ABSTRACT

BACKGROUND: Cardiovascular disease, also known as circulation system disease, remains the leading cause of morbidity and mortality worldwide. Traditional methods for diagnosing cardiovascular disease are often expensive and time-consuming. So the purpose of this study is to construct machine learning models for the diagnosis of cardiovascular diseases using easily accessible blood routine and biochemical detection data and explore the unique hematologic features of cardiovascular diseases, including some metabolic indicators. METHODS: After the data preprocessing, 25,794 healthy people and 32,822 circulation system disease patients with the blood routine and biochemical detection data were utilized for our study. We selected logistic regression, random forest, support vector machine, eXtreme Gradient Boosting (XGBoost), and deep neural network to construct models. Finally, the SHAP algorithm was used to interpret models. RESULTS: The circulation system disease prediction model constructed by XGBoost possessed the best performance (AUC: 0.9921 (0.9911-0.9930); Acc: 0.9618 (0.9588-0.9645); Sn: 0.9690 (0.9655-0.9723); Sp: 0.9526 (0.9477-0.9572); PPV: 0.9631 (0.9592-0.9668); NPV: 0.9600 (0.9556-0.9644); MCC: 0.9224 (0.9165-0.9279); F1 score: 0.9661 (0.9634-0.9686)). Most models of distinguishing various circulation system diseases also had good performance, the model performance of distinguishing dilated cardiomyopathy from other circulation system diseases was the best (AUC: 0.9267 (0.8663-0.9752)). The model interpretation by the SHAP algorithm indicated features from biochemical detection made major contributions to predicting circulation system disease, such as potassium (K), total protein (TP), albumin (ALB), and indirect bilirubin (NBIL). But for models of distinguishing various circulation system diseases, we found that red blood cell count (RBC), K, direct bilirubin (DBIL), and glucose (GLU) were the top 4 features subdividing various circulation system diseases. CONCLUSIONS: The present study constructed multiple models using 50 features from the blood routine and biochemical detection data for the diagnosis of various circulation system diseases. At the same time, the unique hematologic features of various circulation system diseases, including some metabolic-related indicators, were also explored. This cost-effective work will benefit more people and help diagnose and prevent circulation system diseases.


Subject(s)
Biomarkers , Cardiovascular Diseases , Predictive Value of Tests , Humans , Cardiovascular Diseases/blood , Cardiovascular Diseases/diagnosis , Biomarkers/blood , Male , Female , Middle Aged , Case-Control Studies , Diagnosis, Computer-Assisted , Aged , Prognosis , Decision Support Techniques , Databases, Factual , Support Vector Machine , Reproducibility of Results , Machine Learning , Deep Learning , Data Mining , Adult , Risk Assessment
6.
Eur J Med Res ; 29(1): 476, 2024 Sep 30.
Article in English | MEDLINE | ID: mdl-39343945

ABSTRACT

Osteoporosis (OP) is a chronic disease characterized by diminished bone mass and structural deterioration, ultimately leading to compromised bone strength and an increased risk of fractures. Diagnosis primarily relies on medical imaging findings and clinical symptoms. This study aims to explore an adjunctive diagnostic technique for OP based on surface-enhanced Raman scattering (SERS). Serum SERS spectra from the normal, low bone density, and osteoporosis groups were analyzed to discern OP-related expression profiles. This study utilized partial least squares (PLS) and support vector machine (SVM) algorithms to establish an OP diagnostic model. The combination of Raman peak assignments and spectral difference analysis reflected biochemical changes associated with OP, including amino acids, carbohydrates, and collagen. Using the PLS-SVM approach, sensitivity, specificity, and accuracy for screening OP were determined to be 77.78%, 100%, and 88.24%, respectively. This study demonstrates the substantial potential of SERS as an adjunctive diagnostic technology for OP.


Subject(s)
Osteoporosis , Spectrum Analysis, Raman , Spectrum Analysis, Raman/methods , Humans , Osteoporosis/diagnosis , Osteoporosis/diagnostic imaging , Female , Middle Aged , Support Vector Machine , Aged , Least-Squares Analysis , Male , Adult , Bone Density
7.
Int J Mol Sci ; 25(18)2024 Sep 18.
Article in English | MEDLINE | ID: mdl-39337508

ABSTRACT

Diabetes mellitus (DM) presents a critical global health challenge, characterized by persistent hyperglycemia and associated with substantial economic and health-related burdens. This study employs advanced machine-learning techniques to improve the prediction and classification of antidiabetic peptides, with a particular focus on differentiating those effective against T1DM from those targeting T2DM. We integrate feature selection with analysis methods, including logistic regression, support vector machines (SVM), and adaptive boosting (AdaBoost), to classify antidiabetic peptides based on key features. Feature selection through the Lasso-penalized method identifies critical peptide characteristics that significantly influence antidiabetic activity, thereby establishing a robust foundation for future peptide design. A comprehensive evaluation of logistic regression, SVM, and AdaBoost shows that AdaBoost consistently outperforms the other methods, making it the most effective approach for classifying antidiabetic peptides. This research underscores the potential of machine learning in the systematic evaluation of bioactive peptides, contributing to the advancement of peptide-based therapies for diabetes management.


Subject(s)
Diabetes Mellitus, Type 1 , Diabetes Mellitus, Type 2 , Hypoglycemic Agents , Machine Learning , Peptides , Support Vector Machine , Diabetes Mellitus, Type 2/drug therapy , Hypoglycemic Agents/therapeutic use , Hypoglycemic Agents/pharmacology , Humans , Peptides/therapeutic use , Peptides/pharmacology , Peptides/chemistry , Diabetes Mellitus, Type 1/drug therapy
8.
Sensors (Basel) ; 24(18)2024 Sep 13.
Article in English | MEDLINE | ID: mdl-39338685

ABSTRACT

This study investigates the efficacy of machine learning models for intrusion detection in the Internet of Medical Things, aiming to enhance cybersecurity defenses and protect sensitive healthcare data. The analysis focuses on evaluating the performance of ensemble learning algorithms, specifically Stacking, Bagging, and Boosting, using Random Forest and Support Vector Machines as base models on the WUSTL-EHMS-2020 dataset. Through a comprehensive examination of performance metrics such as accuracy, precision, recall, and F1-score, Stacking demonstrates exceptional accuracy and reliability in detecting and classifying cyber attack incidents with an accuracy rate of 98.88%. Bagging is ranked second, with an accuracy rate of 97.83%, while Boosting yielded the lowest accuracy rate of 88.68%.


Subject(s)
Algorithms , Computer Security , Internet of Things , Machine Learning , Humans , Support Vector Machine , Delivery of Health Care
9.
Sensors (Basel) ; 24(18)2024 Sep 17.
Article in English | MEDLINE | ID: mdl-39338748

ABSTRACT

Movement intentions of motor impaired individuals can be detected in laboratory settings via electroencephalography Brain-Computer Interfaces (EEG-BCIs) and used for motor rehabilitation and external system control. The real-world BCI use is limited by the costly, time-consuming, obtrusive, and uncomfortable setup of scalp EEG. Ear-EEG offers a faster, more convenient, and more aesthetic setup for recording EEG, but previous work using expensive amplifiers detected motor intentions at chance level. This study investigates the feasibility of a low-cost ear-EEG BCI for the detection of tongue and hand movements for rehabilitation and control purposes. In this study, ten able-bodied participants performed 100 right wrist extensions and 100 tongue-palate movements while three channels of EEG were recorded around the left ear. Offline movement vs. idle activity classification of ear-EEG was performed using temporal and spectral features classified with Random Forest, Support Vector Machine, K-Nearest Neighbours, and Linear Discriminant Analysis in three scenarios: Hand (rehabilitation purpose), hand (control purpose), and tongue (control purpose). The classification accuracies reached 70%, 73%, and 83%, respectively, which was significantly higher than chance level. These results suggest that a low-cost ear-EEG BCI can detect movement intentions for rehabilitation and control purposes. Future studies should include online BCI use with the intended user group in real-life settings.


Subject(s)
Brain-Computer Interfaces , Electroencephalography , Hand , Movement , Tongue , Humans , Tongue/physiology , Electroencephalography/methods , Movement/physiology , Hand/physiology , Male , Adult , Female , Brain/physiology , Support Vector Machine , Young Adult , Ear/physiology
10.
Sensors (Basel) ; 24(18)2024 Sep 22.
Article in English | MEDLINE | ID: mdl-39338869

ABSTRACT

Brain-computer interfaces (BCIs) are promising tools for motor neurorehabilitation. Achieving a balance between classification accuracy and system responsiveness is crucial for real-time applications. This study aimed to assess how the duration of time windows affects performance, specifically classification accuracy and the false positive rate, to optimize the temporal parameters of MI-BCI systems. We investigated the impact of time window duration on classification accuracy and false positive rate, employing Linear Discriminant Analysis (LDA), Multilayer Perceptron (MLP), and Support Vector Machine (SVM) on data acquired from six post-stroke patients and on the external BCI IVa dataset. EEG signals were recorded and processed using the Common Spatial Patterns (CSP) algorithm for feature extraction. Our results indicate that longer time windows generally enhance classification accuracy and reduce false positives across all classifiers, with LDA performing the best. However, to maintain the real-time responsiveness, crucial for practical applications, a balance must be struck. The results suggest an optimal time window of 1-2 s, offering a trade-off between classification performance and excessive delay to guarantee the system responsiveness. These findings underscore the importance of temporal optimization in MI-BCI systems to improve usability in real rehabilitation scenarios.


Subject(s)
Brain-Computer Interfaces , Electroencephalography , Stroke , Support Vector Machine , Humans , Electroencephalography/methods , Stroke/physiopathology , Male , Female , Algorithms , Middle Aged , Stroke Rehabilitation/methods , Aged , Discriminant Analysis , Time Factors
11.
Sci Rep ; 14(1): 22281, 2024 Sep 27.
Article in English | MEDLINE | ID: mdl-39333659

ABSTRACT

This study aimed to investigate the advantages and applications of machine learning models in predicting the risk of allergic rhinitis (AR) in children aged 2-8, compared to traditional logistic regression. The study analyzed questionnaire data from 7131 children aged 2-8, which was randomly divided into training, validation, and testing sets in a ratio of 55:15:30, repeated 100 times. Predictor variables included parental allergy, medical history during the child's first year (cfy), and early life environmental factors. The time of first onset of AR was restricted to after the age of 1 year to establish a clear temporal relationship between the predictor variables and the outcome. Feature engineering utilized the chi-square test and the Boruta algorithm, refining the dataset for analysis. The construction utilized Logistic Regression (LR), Support Vector Machine (SVM), Random Forest (RF), and Extreme Gradient Boosting Tree (XGBoost) as the models. Model performance was evaluated using the area under the receiver operating characteristic curve (AUROC), and the optimal decision threshold was determined by weighing multiple metrics on the validation sets and reporting results on the testing set. Additionally, the strengths and limitations of the different models were comprehensively analyzed by stratifying gender, mode of birth, and age subgroups, as well as by varying the number of predictor variables. Furthermore, methods such as Shapley additive explanations (SHAP) and purity of node partition in Random Forest were employed to assess feature importance, along with exploring model stability through alterations in the number of features. In this study, 7131 children aged 2-8 were analyzed, with 524 (7.35%) diagnosed with AR, with an onset age ranging from 2 to 8 years. Optimal parameters were refined using the validation set, and a rigorous process of 100 random divisions and repeated training ensured robust evaluation of the models on the testing set. The model construction involved incorporating fourteen variables, including the history of allergy-related diseases during the child's first year, familial genetic factors, and early-life indoor environmental factors. The performance of LR, SVM, RF, and XGBoost on the unstratified data test set was 0.715 (standard deviation = 0.023), 0.723 (0.022), 0.747 (0.015), and 0.733 (0.019), respectively; the performance of each model was stable on the stratified data, and the RF performance was significantly better than that of LR (paired samples t-test: p < 0.001). Different techniques for evaluating the importance of features showed that the top5 variables were father or mother with AR, having older siblings, history of food allergy and father's educational level. Utilizing strategies like stratification and adjusting the number of features, this study constructed a random forest model that outperforms traditional logistic regression. Specifically designed to detect the occurrence of allergic rhinitis (AR) in children aged 2-8, the model incorporates parental allergic history and early life environmental factors. The selection of the optimal cut-off value was determined through a comprehensive evaluation strategy. Additionally, we identified the top 5 crucial features that greatly influence the model's performance. This study serves as a valuable reference for implementing machine learning-based AR prediction in pediatric populations.


Subject(s)
Machine Learning , Rhinitis, Allergic , Humans , Rhinitis, Allergic/epidemiology , Rhinitis, Allergic/diagnosis , China/epidemiology , Male , Female , Child, Preschool , Child , Logistic Models , ROC Curve , Support Vector Machine , Algorithms
12.
Artif Intell Med ; 156: 102953, 2024 Oct.
Article in English | MEDLINE | ID: mdl-39222579

ABSTRACT

BACKGROUND: Chronic obstructive pulmonary disease (COPD) is a severe condition affecting millions worldwide, leading to numerous annual deaths. The absence of significant symptoms in its early stages promotes high underdiagnosis rates for the affected people. Besides pulmonary function failure, another harmful problem of COPD is the systemic effects, e.g., heart failure or voice distortion. However, the systemic effects of COPD might provide valuable information for early detection. In other words, symptoms caused by systemic effects could be helpful to detect the condition in its early stages. OBJECTIVE: The proposed study aims to explore whether the voice features extracted from the vowel "a" utterance carry any information that can be predictive of COPD by employing Machine Learning (ML) on a newly collected voice dataset. METHODS: Forty-eight participants were recruited from the pool of research clinic visitors at Blekinge Institute of Technology (BTH) in Sweden between January 2022 and May 2023. A dataset consisting of 1246 recordings from 48 participants was gathered. The collection of voice recordings containing the vowel "a" utterance commenced following an information and consent meeting with each participant using the VoiceDiagnostic application. The collected voice data was subjected to silence segment removal, feature extraction of baseline acoustic features, and Mel Frequency Cepstrum Coefficients (MFCC). Sociodemographic data was also collected from the participants. Three ML models were investigated for the binary classification of COPD and healthy controls: Random Forest (RF), Support Vector Machine (SVM), and CatBoost (CB). A nested k-fold cross-validation approach was employed. Additionally, the hyperparameters were optimized using grid-search on each ML model. For best performance assessment, accuracy, F1-score, precision, and recall metrics were computed. Afterward, we further examined the best classifier by utilizing the Area Under the Curve (AUC), Average Precision (AP), and SHapley Additive exPlanations (SHAP) feature-importance measures. RESULTS: The classifiers RF, SVM, and CB achieved a maximum accuracy of 77 %, 69 %, and 78 % on the test set and 93 %, 78 % and 97 % on the validation set, respectively. The CB classifier outperformed RF and SVM. After further investigation of the best-performing classifier, CB demonstrated the highest performance, producing an AUC of 82 % and AP of 76 %. In addition to age and gender, the mean values of baseline acoustic and MFCC features demonstrate high importance and deterministic characteristics for classification performance in both test and validation sets, though in varied order. CONCLUSION: This study concludes that the utterance of vowel "a" recordings contain information that can be captured by the CatBoost classifier with high accuracy for the classification of COPD. Additionally, baseline acoustic and MFCC features, in conjunction with age and gender information, can be employed for classification purposes and benefit healthcare for decision support in COPD diagnosis. CLINICAL TRIAL REGISTRATION NUMBER: NCT05897944.


Subject(s)
Machine Learning , Pulmonary Disease, Chronic Obstructive , Pulmonary Disease, Chronic Obstructive/classification , Pulmonary Disease, Chronic Obstructive/physiopathology , Pulmonary Disease, Chronic Obstructive/diagnosis , Humans , Male , Female , Aged , Middle Aged , Voice/physiology , Support Vector Machine
13.
Sci Rep ; 14(1): 20369, 2024 09 02.
Article in English | MEDLINE | ID: mdl-39223223

ABSTRACT

This study aims to evaluate the applicability of the high-resolution WaveFront Phase Imaging Sensor (WFPI) in eyes with Fuchs' Endothelial Corneal Dystrophy (FECD) through qualitative and quantitative analysis using a custom-designed Automatic Guttae Detection Method (AGDM). The ocular phase was measured using the t · eyede aberrometer and then was processed to obtain its High-Pass Filter Map (HPFM). The subjects were pathological and healthy patients from the Fundación Jiménez-Díaz Hospital (Madrid, Spain). The AGDM was developed and applied in pupils with 3 and 5 mm of diameter. A set of metrics were extracted and evaluated like the Root-Mean-Square error (RMS), Number of guttae, Guttae Area, and Area of Delaunay Triangulation (DT). Finally, a Support Vector Machine (SVM) model was trained to classify between pathological and healthy eyes. Quantitatively, the HPFM reveals a dark spots pattern according to the ophthalmologist's description of the slit-lamp examination of guttae distribution. There were significant statistical differences in all the metrics when FECD and Healthy groups were compared using the same pupil size; but comparing both pupil sizes for the same group there were significant differences in most of the variables. This sensor is a value tool to objectively diagnose and monitor this pathology through wavefront phase changes.


Subject(s)
Fuchs' Endothelial Dystrophy , Humans , Fuchs' Endothelial Dystrophy/diagnosis , Female , Male , Middle Aged , Aged , Support Vector Machine , Aberrometry/methods , Aberrometry/instrumentation , Adult
14.
Sci Rep ; 14(1): 20305, 2024 09 01.
Article in English | MEDLINE | ID: mdl-39218940

ABSTRACT

Approximately 15% of patients with colorectal cancer (CRC) exhibit a distinct molecular phenotype known as microsatellite instability (MSI). Accurate and non-invasive prediction of MSI status is crucial for cost savings and guiding clinical treatment strategies. The retrospective study enrolled 307 CRC patients between January 2020 and October 2022. Preoperative images of computed tomography and postoperative status of MSI information were available for analysis. The stratified fivefold cross-validation was used to avoid sample bias in grouping. Feature extraction and model construction were performed as follows: first, inter-/intra-correlation coefficients and the least absolute shrinkage and selection operator algorithm were used to identify the most predictive feature subset. Subsequently, multiple discriminant models were constructed to explore and optimize the combination of six feature preprocessors (Box-Cox, Yeo-Johnson, Max-Abs, Min-Max, Z-score, and Quantile) and three classifiers (logistic regression, support vector machine, and random forest). Selecting the one with the highest average value of the area under the curve (AUC) in the test set as the radiomics model, and the clinical screening model and combined model were also established using the same processing steps as the radiomics model. Finally, the performances of the three models were evaluated and analyzed using decision and correction curves.We observed that the logistic regression model based on the quantile preprocessor had the highest average AUC value in the discriminant models. Additionally, tumor location, the clinical of N stage, and hypertension were identified as independent clinical predictors of MSI status. In the test set, the clinical screening model demonstrated good predictive performance, with the average AUC of 0.762 (95% confidence interval, 0.635-0.890). Furthermore, the combined model showed excellent predictive performance (AUC, 0.958; accuracy, 0.899; sensitivity, 0.929) and favorable clinical applicability and correction effects. The logistic regression model based on the quantile preprocessor exhibited excellent performance and repeatability, which may further reduce the variability of input data and improve the model performance for predicting MSI status in CRC.


Subject(s)
Colorectal Neoplasms , Microsatellite Instability , Humans , Colorectal Neoplasms/genetics , Colorectal Neoplasms/pathology , Female , Male , Middle Aged , Retrospective Studies , Aged , Tomography, X-Ray Computed/methods , Adult , Algorithms , Support Vector Machine , Logistic Models
15.
Article in English | MEDLINE | ID: mdl-39259640

ABSTRACT

Functional near-infrared spectroscopy (fNIRS) is an increasingly popular tool for cross-cultural neuroimaging studies. However, the reproducibility and comparability of fNIRS studies is still an open issue in the scientific community. The paucity of experimental practices and the lack of clear guidelines regarding fNIRS use contribute to undermining the reproducibility of results. For this reason, much effort is now directed at assessing the impact of heterogeneous experimental practices in creating divergent fNIRS results. The current work aims to assess differences in fNIRS signal quality in data collected by two different labs in two different cohorts: Singapore (N=74) and Italy (N=84). Random segments of 20s were extracted from each channel in each participant's NIRScap and 1280 deep features were obtained using a deep learning model trained to classify the quality of fNIRS data. Two datasets were generated: ALL dataset (segments with bad and good data quality) and GOOD dataset (segments with good quality). Each dataset was divided into train and test partitions, which were used to train and evaluate the performance of a Support Vector Machine (SVM) model in classifying the cohorts from signal quality features. Results showed that the SG cohort had significantly higher occurrences of bad signal quality in the majority of the fNIRS channels. Moreover, the SVM correctly classified the cohorts when using the ALL dataset. However, the performance dropped almost completely (except for five channels) when the SVM had to classify the cohorts using data from the GOOD dataset. These results suggest that fNIRS raw data obtained by different labs might possess different levels of quality as well as different latent characteristics beyond quality per se. The current study highlights the importance of defining clear guidelines in the conduction of fNIRS experiments in the reporting of data quality in fNIRS manuscripts.


Subject(s)
Spectroscopy, Near-Infrared , Support Vector Machine , Humans , Spectroscopy, Near-Infrared/methods , Male , Female , Adult , Reproducibility of Results , Young Adult , Deep Learning , Cohort Studies , Italy , Algorithms , Functional Neuroimaging , Data Accuracy
16.
Sensors (Basel) ; 24(17)2024 Aug 28.
Article in English | MEDLINE | ID: mdl-39275482

ABSTRACT

Gait analysis systems are critical for assessing motor function in rehabilitation and elderly care. This study aimed to develop and optimize an abnormal gait classification algorithm considering joint impairments using inertial measurement units (IMUs) and walkway systems. Ten healthy male participants simulated normal walking, walking with knee impairment, and walking with ankle impairment under three conditions: without joint braces, with a knee brace, and with an ankle brace. Based on these simulated gaits, we developed classification models: distinguishing abnormal gait due to joint impairments, identifying specific joint disorders, and a combined model for both tasks. Recursive Feature Elimination with Cross-Validation (RFECV) was used for feature extraction, and models were fine-tuned using support vector machine (SVM), random forest (RF), and extreme gradient boosting (XGB). The IMU-based system achieved over 91% accuracy in classifying the three types of gait. In contrast, the walkway system achieved less than 77% accuracy in classifying the three types of gait, primarily due to high misclassification rates between knee and ankle joint impairments. The IMU-based system shows promise for accurate gait assessment in patients with joint impairments, suggesting future research for clinical application improvements in rehabilitation and patient management.


Subject(s)
Gait , Machine Learning , Humans , Male , Gait/physiology , Adult , Support Vector Machine , Algorithms , Walking/physiology , Ankle Joint/physiopathology , Knee Joint/physiopathology , Gait Analysis/methods , Young Adult
17.
Sensors (Basel) ; 24(17)2024 Aug 29.
Article in English | MEDLINE | ID: mdl-39275501

ABSTRACT

This study used an odor sensing system with a 16-channel electrochemical sensor array to measure beef odors, aiming to distinguish odors under different storage days and processing temperatures for quality monitoring. Six storage days ranged from purchase (D0) to eight days (D8), with three temperature conditions: no heat (RT), boiling (100 °C), and frying (180 °C). Gas chromatography-mass spectrometry (GC-MS) analysis showed that odorants in the beef varied under different conditions. Compounds like acetoin and 1-hexanol changed significantly with the storage days, while pyrazines and furans were more detectable at higher temperatures. The odor sensing system data were visualized using principal component analysis (PCA) and uniform manifold approximation and projection (UMAP). PCA and unsupervised UMAP clustered beef odors by storage days but struggled with the processing temperatures. Supervised UMAP accurately clustered different temperatures and dates. Machine learning analysis using six classifiers, including support vector machine, achieved 57% accuracy for PCA-reduced data, while unsupervised UMAP reached 49.1% accuracy. Supervised UMAP significantly enhanced the classification accuracy, achieving over 99.5% with the dimensionality reduced to three or above. Results suggest that the odor sensing system can sufficiently enhance non-destructive beef quality and safety monitoring. This research advances electronic nose applications and explores data downscaling techniques, providing valuable insights for future studies.


Subject(s)
Gas Chromatography-Mass Spectrometry , Odorants , Principal Component Analysis , Temperature , Odorants/analysis , Cattle , Animals , Gas Chromatography-Mass Spectrometry/methods , Food Storage/methods , Electronic Nose , Red Meat/analysis , Support Vector Machine
18.
Sensors (Basel) ; 24(17)2024 Sep 08.
Article in English | MEDLINE | ID: mdl-39275739

ABSTRACT

Gait phase recognition systems based on surface electromyographic signals (EMGs) are crucial for developing advanced myoelectric control schemes that enhance the interaction between humans and lower limb assistive devices. However, machine learning models used in this context, such as Linear Discriminant Analysis (LDA) and Support Vector Machine (SVM), typically experience performance degradation when modeling the gait cycle with more than just stance and swing phases. This study introduces a generalized phasor-based feature extraction approach (PHASOR) that captures spatial myoelectric features to improve the performance of LDA and SVM in gait phase recognition. A publicly available dataset of 40 subjects was used to evaluate PHASOR against state-of-the-art feature sets in a five-phase gait recognition problem. Additionally, fully data-driven deep learning architectures, such as Rocket and Mini-Rocket, were included for comparison. The separability index (SI) and mean semi-principal axis (MSA) analyses showed mean SI and MSA metrics of 7.7 and 0.5, respectively, indicating the proposed approach's ability to effectively decode gait phases through EMG activity. The SVM classifier demonstrated the highest accuracy of 82% using a five-fold leave-one-trial-out testing approach, outperforming Rocket and Mini-Rocket. This study confirms that in gait phase recognition based on EMG signals, novel and efficient muscle synergy information feature extraction schemes, such as PHASOR, can compete with deep learning approaches that require greater processing time for feature extraction and classification.


Subject(s)
Electromyography , Gait , Support Vector Machine , Humans , Electromyography/methods , Gait/physiology , Discriminant Analysis , Signal Processing, Computer-Assisted , Male , Female , Algorithms , Adult , Deep Learning
19.
Sci Justice ; 64(5): 485-497, 2024 Sep.
Article in English | MEDLINE | ID: mdl-39277331

ABSTRACT

Verifying the speaker of a speech fragment can be crucial in attributing a crime to a suspect. The question can be addressed given disputed and reference speech material, adopting the recommended and scientifically accepted likelihood ratio framework for reporting evidential strength in court. In forensic practice, usually, auditory and acoustic analyses are performed to carry out such a verification task considering a diversity of features, such as language competence, pronunciation, or other linguistic features. Automated speaker comparison systems can also be used alongside those manual analyses. State-of-the-art automatic speaker comparison systems are based on deep neural networks that take acoustic features as input. Additional information, though, may be obtained from linguistic analysis. In this paper, we aim to answer if, when and how modern acoustic-based systems can be complemented by an authorship technique based on frequent words, within the likelihood ratio framework. We consider three different approaches to derive a combined likelihood ratio: using a support vector machine algorithm, fitting bivariate normal distributions, and passing the score of the acoustic system as additional input to the frequent-word analysis. We apply our method to the forensically relevant dataset FRIDA and the FISHER corpus, and we explore under which conditions fusion is valuable. We evaluate our results in terms of log likelihood ratio cost (Cllr) and equal error rate (EER). We show that fusion can be beneficial, especially in the case of intercepted phone calls with noise in the background.


Subject(s)
Forensic Sciences , Humans , Forensic Sciences/methods , Likelihood Functions , Linguistics , Support Vector Machine , Speech Acoustics , Algorithms , Speech
20.
Int J Mol Sci ; 25(17)2024 Aug 30.
Article in English | MEDLINE | ID: mdl-39273410

ABSTRACT

Amelogenesis imperfecta (AI) is a genetic disease characterized by poor formation of tooth enamel. AI occurs due to mutations, especially in AMEL, ENAM, KLK4, MMP20, and FAM83H, associated with changes in matrix proteins, matrix proteases, cell-matrix adhesion proteins, and transport proteins of enamel. Due to the wide variety of phenotypes, the diagnosis of AI is complex, requiring a genetic test to characterize it better. Thus, there is a demand for developing low-cost, noninvasive, and accurate platforms for AI diagnostics. This case-control pilot study aimed to test salivary vibrational modes obtained in attenuated total reflection fourier-transformed infrared (ATR-FTIR) together with machine learning algorithms: linear discriminant analysis (LDA), random forest, and support vector machine (SVM) could be used to discriminate AI from control subjects due to changes in salivary components. The best-performing SVM algorithm discriminates AI better than matched-control subjects with a sensitivity of 100%, specificity of 79%, and accuracy of 88%. The five main vibrational modes with higher feature importance in the Shapley Additive Explanations (SHAP) were 1010 cm-1, 1013 cm-1, 1002 cm-1, 1004 cm-1, and 1011 cm-1 in these best-performing SVM algorithms, suggesting these vibrational modes as a pre-validated salivary infrared spectral area as a potential biomarker for AI screening. In summary, ATR-FTIR spectroscopy and machine learning algorithms can be used on saliva samples to discriminate AI and are further explored as a screening tool.


Subject(s)
Amelogenesis Imperfecta , Machine Learning , Saliva , Humans , Amelogenesis Imperfecta/diagnosis , Amelogenesis Imperfecta/genetics , Amelogenesis Imperfecta/metabolism , Saliva/metabolism , Saliva/chemistry , Spectroscopy, Fourier Transform Infrared/methods , Female , Case-Control Studies , Male , Algorithms , Adult , Support Vector Machine , Pilot Projects , Discriminant Analysis , Biomarkers , Triage/methods , Adolescent , Young Adult
SELECTION OF CITATIONS
SEARCH DETAIL