Your browser doesn't support javascript.
loading
Show: 20 | 50 | 100
Results 1 - 13 de 13
Filter
Add more filters










Publication year range
1.
Health Inf Sci Syst ; 12(1): 20, 2024 Dec.
Article in English | MEDLINE | ID: mdl-38455725

ABSTRACT

Purpose: The main aim of our study was to explore the utility of artificial intelligence (AI) in diagnosing autism spectrum disorder (ASD). The study primarily focused on using machine learning (ML) and deep learning (DL) models to detect ASD potential cases by analyzing text inputs, especially from social media platforms like Twitter. This is to overcome the ongoing challenges in ASD diagnosis, such as the requirement for specialized professionals and extensive resources. Timely identification, particularly in children, is essential to provide immediate intervention and support, thereby improving the quality of life for affected individuals. Methods: We employed natural language processing (NLP) techniques along with ML models like decision trees, extreme gradient boosting (XGB), k-nearest neighbors algorithm (KNN), and DL models such as recurrent neural networks (RNN), long short-term memory (LSTM), bidirectional long short-term memory (Bi-LSTM), bidirectional encoder representations from transformers (BERT and BERTweet). We extracted a dataset of 404,627 tweets from Twitter users using the platform's API and classified them based on whether they were written by individuals claiming to have ASD (ASD users) or by those without ASD (non-ASD users). From this dataset, we used a subset of 90,000 tweets (45,000 from each classification group) for the training and testing of these models. Results: The application of our AI models yielded promising results, with the predictive model reaching an accuracy of almost 88% when classifying texts that potentially originated from individuals with ASD. Conclusion: Our research demonstrated the potential of using AI, particularly DL models, in enhancing the accuracy of ASD detection and diagnosis. This innovative approach signifies the critical role AI can play in advancing early diagnostic techniques, enabling better patient outcomes and underlining the importance of early identification of ASD, especially in children.

2.
PLoS One ; 18(8): e0289553, 2023.
Article in English | MEDLINE | ID: mdl-37582086

ABSTRACT

AIM: To study the existence of subgroups by exploring the similarities between the attributes of the nodes of the groups, in relation to diet and gender and, to analyse the connectivity between groups based on aspects of similarities between them through SNA and artificial intelligence techniques. METHODS: 235 students from 5 different educational centres participate in this study between March and December 2015. Data analysis carried out is divided into two blocks: social network analysis and unsupervised machine learning techniques. As for the social network analysis, the Girvan-Newman technique was applied to find the best number of cohesive groups within each of the friendship networks of the different classes analysed. RESULTS: After applying Girvan-Newman in the three classes, the best division into clusters was respectively 2 for classroom A, 7 for classroom B and 6 for classroom C. There are significant differences between the groups and the gender and diet variables. After applying K-means using population diet as an input variable, a K-means clustering of 2 clusters for class A, 3 clusters for class B and 3 clusters for class C is obtained. CONCLUSION: Adolescents form subgroups within their classrooms. Subgroup cohesion is defined by the fact that nodes share similarities in aspects that influence obesity, they share attributes related to food quality and gender. The concept of homophily, related to SNA, justifies our results. Artificial intelligence techniques together with the application of the Girvan-Newman provide robustness to the structural analysis of similarities and cohesion between subgroups.


Subject(s)
Artificial Intelligence , Pandemics , Humans , Adolescent , Social Network Analysis , Obesity/epidemiology , Machine Learning
3.
PeerJ Comput Sci ; 9: e1723, 2023.
Article in English | MEDLINE | ID: mdl-38192446

ABSTRACT

Glioblastoma is a highly malignant brain tumor with a life expectancy of only 3-6 months without treatment. Detecting and predicting its survival and grade accurately are crucial. This study introduces a novel approach using transfer learning techniques. Various pre-trained networks, including EfficientNet, ResNet, VGG16, and Inception, were tested through exhaustive optimization to identify the most suitable architecture. Transfer learning was applied to fine-tune these models on a glioblastoma image dataset, aiming to achieve two objectives: survival and tumor grade prediction.The experimental results show 65% accuracy in survival prediction, classifying patients into short, medium, or long survival categories. Additionally, the prediction of tumor grade achieved an accuracy of 97%, accurately differentiating low-grade gliomas (LGG) and high-grade gliomas (HGG). The success of the approach is attributed to the effectiveness of transfer learning, surpassing the current state-of-the-art methods. In conclusion, this study presents a promising method for predicting the survival and grade of glioblastoma. Transfer learning demonstrates its potential in enhancing prediction models, particularly in scenarios with limited large datasets. These findings hold promise for improving diagnostic and treatment approaches for glioblastoma patients.

4.
Digit Health ; 8: 20552076221111289, 2022.
Article in English | MEDLINE | ID: mdl-35832475

ABSTRACT

Background: Postpartum urinary incontinence is a fairly widespread health problem in today's society among women who have given birth. Recent studies analysing the different variables that may be related to Postpartum urinary incontinence have brought to light some variables that may be related to Postpartum urinary incontinence in order to try to prevent it. However, no studies have been found that analyse some of the intrinsic and extrinsic variables of patients during pregnancy that could give rise to this pathology. Objective: The objective of this study is to assess the most influential variables in Postpartum urinary incontinence by means of machine learning techniques, starting from a group of intrinsic variables, another group of extrinsic variables and a mixed group that combines both types. Methods: Information was collected on 93 patients, pregnant women who gave birth. Experiments were conducted using different machine learning classification techniques combined with oversampling techniques to predict four variables: urinary incontinence, urinary incontinence frequency, urinary incontinence intensity and stress urinary incontinence. Results: The results showed that the most accurate predictive models were those trained with extrinsic variables, obtaining accuracy values of 70% for urinary incontinence, 77% for urinary incontinence frequency, 71% for urinary incontinence intensity and 93% for stress urinary incontinence. Conclusions: This research has shown that extrinsic variables are more important than intrinsic variables in predicting problems related to postpartum urinary incontinence. Therefore, although not conclusive, it opens a line of research that could confirm that the prevention of Postpartum urinary incontinence could be achieved by following healthy habits in pregnant women.

5.
PeerJ Comput Sci ; 8: e906, 2022.
Article in English | MEDLINE | ID: mdl-35494847

ABSTRACT

With the growth that social networks have experienced in recent years, it is entirely impossible to moderate content manually. Thanks to the different existing techniques in natural language processing, it is possible to generate predictive models that automatically classify texts into different categories. However, a weakness has been detected concerning the language used to train such models. This work aimed to develop a predictive model based on BERT, capable of detecting racist and xenophobic messages in tweets written in Spanish. A comparison was made with different Deep Learning models. A total of five predictive models were developed, two based on BERT and three using other deep learning techniques, CNN, LSTM and a model combining CNN + LSTM techniques. After exhaustively analyzing the results obtained by the different models, it was found that the one that got the best metrics was BETO, a BERT-based model trained only with texts written in Spanish. The results of our study show that the BETO model achieves a precision of 85.22% compared to the 82.00% precision of the mBERT model. The rest of the models obtained between 79.34% and 80.48% precision. On this basis, it has been possible to justify the vital importance of developing native transfer learning models for solving Natural Language Processing (NLP) problems in Spanish. Our main contribution is the achievement of promising results in the field of racism and hate speech in Spanish by applying different deep learning techniques.

6.
JMIR Med Inform ; 10(2): e34492, 2022 Feb 24.
Article in English | MEDLINE | ID: mdl-35200156

ABSTRACT

BACKGROUND: Eating disorders affect an increasing number of people. Social networks provide information that can help. OBJECTIVE: We aimed to find machine learning models capable of efficiently categorizing tweets about eating disorders domain. METHODS: We collected tweets related to eating disorders, for 3 consecutive months. After preprocessing, a subset of 2000 tweets was labeled: (1) messages written by people suffering from eating disorders or not, (2) messages promoting suffering from eating disorders or not, (3) informative messages or not, and (4) scientific or nonscientific messages. Traditional machine learning and deep learning models were used to classify tweets. We evaluated accuracy, F1 score, and computational time for each model. RESULTS: A total of 1,058,957 tweets related to eating disorders were collected. were obtained in the 4 categorizations, with The bidirectional encoder representations from transformer-based models had the best score among the machine learning and deep learning techniques applied to the 4 categorization tasks (F1 scores 71.1%-86.4%). CONCLUSIONS: Bidirectional encoder representations from transformer-based models have better performance, although their computational cost is significantly higher than those of traditional techniques, in classifying eating disorder-related tweets.

7.
Article in English | MEDLINE | ID: mdl-34682416

ABSTRACT

This study evaluates several feature ranking techniques together with some classifiers based on machine learning to identify relevant factors regarding the probability of contracting breast cancer and improve the performance of risk prediction models for breast cancer in a healthy population. The dataset with 919 cases and 946 controls comes from the MCC-Spain study and includes only environmental and genetic features. Breast cancer is a major public health problem. Our aim is to analyze which factors in the cancer risk prediction model are the most important for breast cancer prediction. Likewise, quantifying the stability of feature selection methods becomes essential before trying to gain insight into the data. This paper assesses several feature selection algorithms in terms of performance for a set of predictive models. Furthermore, their robustness is quantified to analyze both the similarity between the feature selection rankings and their own stability. The ranking provided by the SVM-RFE approach leads to the best performance in terms of the area under the ROC curve (AUC) metric. Top-47 ranked features obtained with this approach fed to the Logistic Regression classifier achieve an AUC = 0.616. This means an improvement of 5.8% in comparison with the full feature set. Furthermore, the SVM-RFE ranking technique turned out to be highly stable (as well as Random Forest), whereas relief and the wrapper approaches are quite unstable. This study demonstrates that the stability and performance of the model should be studied together as Random Forest and SVM-RFE turned out to be the most stable algorithms, but in terms of model performance SVM-RFE outperforms Random Forest.


Subject(s)
Breast Neoplasms , Algorithms , Area Under Curve , Breast Neoplasms/epidemiology , Female , Humans , Logistic Models , Machine Learning , Support Vector Machine
8.
Comput Methods Programs Biomed ; 202: 105968, 2021 Apr.
Article in English | MEDLINE | ID: mdl-33631638

ABSTRACT

BACKGROUND AND OBJECTIVE: Diabetes is a chronic pathology which is affecting more and more people over the years. It gives rise to a large number of deaths each year. Furthermore, many people living with the disease do not realize the seriousness of their health status early enough. Late diagnosis brings about numerous health problems and a large number of deaths each year so the development of methods for the early diagnosis of this pathology is essential. METHODS: In this paper, a pipeline based on deep learning techniques is proposed to predict diabetic people. It includes data augmentation using a variational autoencoder (VAE), feature augmentation using an sparse autoencoder (SAE) and a convolutional neural network for classification. Pima Indians Diabetes Database, which takes into account information on the patients such as the number of pregnancies, glucose or insulin level, blood pressure or age, has been evaluated. RESULTS: A 92.31% of accuracy was obtained when CNN classifier is trained jointly the SAE for featuring augmentation over a well balanced dataset. This means an increment of 3.17% of accuracy with respect the state-of-the-art. CONCLUSIONS: Using a full deep learning pipeline for data preprocessing and classification has demonstrate to be very promising in the diabetes detection field outperforming the state-of-the-art proposals.


Subject(s)
Deep Learning , Diabetes Mellitus , Databases, Factual , Diabetes Mellitus/diagnosis , Humans , Neural Networks, Computer
9.
Healthcare (Basel) ; 8(4)2020 Sep 29.
Article in English | MEDLINE | ID: mdl-33003439

ABSTRACT

COVID-19 disease has affected almost every country in the world. The large number of infected people and the different mortality rates between countries has given rise to many hypotheses about the key points that make the virus so lethal in some places. In this study, the eating habits of 170 countries were evaluated in order to find correlations between these habits and mortality rates caused by COVID-19 using machine learning techniques that group the countries together according to the different distribution of fat, energy, and protein across 23 different types of food, as well as the amount ingested in kilograms. Results shown how obesity and the high consumption of fats appear in countries with the highest death rates, whereas countries with a lower rate have a higher level of cereal consumption accompanied by a lower total average intake of kilocalories.

10.
Sensors (Basel) ; 20(5)2020 Mar 04.
Article in English | MEDLINE | ID: mdl-32143458

ABSTRACT

This study shows the feasibility of an eHealth solution for tackling eating habits and physical activity in the adolescent population. The participants were children from 11 to 15 years old. An intervention was carried out on 139 students in the intervention group and 91 students in the control group, in two schools during 14 weeks. The intervention group had access to the web through a user account and a password. They were able to create friendship relationships, post comments, give likes and interact with other users, as well as receive notifications and information about nutrition and physical activity on a daily basis and get (virtual) rewards for improving their habits. The control group did not have access to any of these features. The homogeneity of the samples in terms of gender, age, body mass index and initial health-related habits was demonstrated. Pre- and post-measurements were collected through self-reports on the application website. After applying multivariate analysis of variance, a significant alteration in the age-adjusted body mass index percentile was observed in the intervention group versus the control group, as well as in the PAQ-A score and the KIDMED score. It can be concluded that eHealth interventions can help to obtain healthy habits. More research is needed to examine the effectiveness in achieving adherence to these new habits.


Subject(s)
Habits , Social Networking , Telemedicine , Adolescent , Body Mass Index , Case-Control Studies , Child , Feasibility Studies , Feeding Behavior , Female , Humans , Male , Surveys and Questionnaires
11.
Sensors (Basel) ; 20(4)2020 Feb 22.
Article in English | MEDLINE | ID: mdl-32098446

ABSTRACT

The aim of this paper was the detection of pathologies through respiratory sounds. The ICBHI (International Conference on Biomedical and Health Informatics) Benchmark was used. This dataset is composed of 920 sounds of which 810 are of chronic diseases, 75 of non-chronic diseases and only 35 of healthy individuals. As more than 88% of the samples of the dataset are from the same class (Chronic), the use of a Variational Convolutional Autoencoder was proposed to generate new labeled data and other well known oversampling techniques after determining that the dataset classes are unbalanced. Once the preprocessing step was carried out, a Convolutional Neural Network (CNN) was used to classify the respiratory sounds into healthy, chronic, and non-chronic disease. In addition, we carried out a more challenging classification trying to distinguish between the different types of pathologies or healthy: URTI, COPD, Bronchiectasis, Pneumonia, and Bronchiolitis. We achieved results up to 0.993 F-Score in the three-label classification and 0.990 F-Score in the more challenging six-class classification.


Subject(s)
Neural Networks, Computer , Algorithms , Humans
12.
Comput Methods Programs Biomed ; 177: 219-229, 2019 Aug.
Article in English | MEDLINE | ID: mdl-31319951

ABSTRACT

BACKGROUND AND OBJECTIVE: Risk prediction models aim at identifying people at higher risk of developing a target disease. Feature selection is particularly important to improve the prediction model performance avoiding overfitting and to identify the leading cancer risk (and protective) factors. Assessing the stability of feature selection/ranking algorithms becomes an important issue when the aim is to analyze the features with more prediction power. METHODS: This work is focused on colorectal cancer, assessing several feature ranking algorithms in terms of performance for a set of risk prediction models (Neural Networks, Support Vector Machines (SVM), Logistic Regression, k-Nearest Neighbors and Boosted Trees). Additionally, their robustness is evaluated following a conventional approach with scalar stability metrics and a visual approach proposed in this work to study both similarity among feature ranking techniques as well as their individual stability. A comparative analysis is carried out between the most relevant features found out in this study and features provided by the experts according to the state-of-the-art knowledge. RESULTS: The two best performance results in terms of Area Under the ROC Curve (AUC) are achieved with a SVM classifier using the top-41 features selected by the SVM wrapper approach (AUC=0.693) and Logistic Regression with the top-40 features selected by the Pearson (AUC=0.689). Experiments showed that performing feature selection contributes to classification performance with a 3.9% and 1.9% improvement in AUC for the SVM and Logistic Regression classifier, respectively, with respect to the results using the full feature set. The visual approach proposed in this work allows to see that the Neural Network-based wrapper ranking is the most unstable while the Random Forest is the most stable. CONCLUSIONS: This study demonstrates that stability and model performance should be studied jointly as Random Forest turned out to be the most stable algorithm but outperformed by others in terms of model performance while SVM wrapper and the Pearson correlation coefficient are moderately stable while achieving good model performance.


Subject(s)
Colorectal Neoplasms/diagnostic imaging , Neural Networks, Computer , Risk Assessment/methods , Support Vector Machine , Area Under Curve , Computer Graphics , Computer Simulation , Databases, Factual , Humans , ROC Curve , Software , Spain/epidemiology
13.
Comput Methods Programs Biomed ; 108(2): 873-81, 2012 Nov.
Article in English | MEDLINE | ID: mdl-22382003

ABSTRACT

The automated assessment of the sperm quality is an important challenge in the veterinary field. In this paper, we explore how to describe the acrosomes of boar spermatozoa using image analysis so that they can be automatically categorized as intact or damaged. Our proposal aims at characterizing the acrosomes by means of texture features. The texture is described using first order statistics and features derived from the co-occurrence matrix of the image, both computed from the original image and from the coefficients yielded by the Discrete Wavelet Transform. Texture descriptors are evaluated and compared with moments-based descriptors in terms of the classification accuracy they provide. Experimental results with a Multilayer Perceptron and the k-Nearest Neighbours classifiers show that texture descriptors outperform moment-based descriptors, reaching an accuracy of 94.93%, which makes this approach very attractive for the veterinarian community.


Subject(s)
Acrosome , Spermatozoa , Animals , Male , Swine
SELECTION OF CITATIONS
SEARCH DETAIL
...