RESUMO
Specific language impairment (SLI) is one of the most common diseases in children, and early diagnosis can help to obtain better timely therapy economically. It is difficult and time-consuming for clinicians to accurately detect SLI through standard clinical assessments. Hence, machine learning algorithms have been developed to assist in the accurate diagnosis of SLI. This work aims to investigate the graph of the favipiravir molecule-based feature extraction function and propose an accurate SLI detection model using vowels. We proposed a novel handcrafted machine learning framework. This architecture comprises the favipiravir molecular structure pattern, statistical feature extractor, wavelet packet decomposition (WPD), iterative neighborhood component analysis (INCA), and support vector machine (SVM) classifier. Two feature extraction models, statistical and textural, are employed in the handcrafted feature generation methodology. A new nature-inspired graph-based feature extractor that uses the chemical depiction of the favipiravir (favipiravir became popular with the COVID-19 pandemic) is employed for feature extraction. Finally, the proposed favipiravir pattern, statistical feature extractor, and wavelet packet decomposition are used to create a feature vector. Moreover, a statistical feature extractor is used in this work. The WPD generates multilevel features, and the most meaningful features are selected using the NCA feature selector. Finally, these chosen features are fed to SVM classifier for automated classification. Two validation methods, (i) leave one subject out (LOSO) and (ii) tenfold cross-validations (CV), are used to obtain robust classification results. Our proposed favipiravir pattern-based model developed using a vowel dataset can detect SLI children with an accuracy of 99.87% and 98.86% using tenfold and LOSO CV strategies, respectively. These results demonstrated the high vowel classification ability of the proposed favipiravir pattern-based model.
RESUMO
Myocardial infarction (MI) is detected using electrocardiography (ECG) signals. Machine learning (ML) models have been used for automated MI detection on ECG signals. Deep learning models generally yield high classification performance but are computationally intensive. We have developed a novel multilevel hybrid feature extraction-based classification model with low time complexity for MI classification. The study dataset comprising 12-lead ECGs belonging to one healthy and 10 MI classes were downloaded from a public ECG signal databank. The model architecture comprised multilevel hybrid feature extraction, iterative feature selection, classification, and iterative majority voting (IMV). In the hybrid handcrafted feature (HHF) generation phase, both textural and statistical feature extraction functions were used to extract features from ECG beats but only at a low level. A new pooling-based multilevel decomposition model was presented to enable them to create features at a high level. This model used average and maximum pooling to create decomposed signals. Using these pooling functions, an unbalanced tree was obtained. Therefore, this model was named multilevel unbalanced pooling tree transformation (MUPTT). On the feature extraction side, two extractors (functions) were used to generate both statistical and textural features. To generate statistical features, 20 commonly used moments were used. A new, improved symmetric binary pattern function was proposed to generate textural features. Both feature extractors were applied to the original MI signal and the decomposed signals generated by the MUPTT. The most valuable features from among the extracted feature vectors were selected using iterative neighborhood component analysis (INCA). In the classification phase, a one-dimensional nearest neighbor classifier with ten-fold cross-validation was used to obtain lead-wise results. The computed lead-wise results derived from all 12 leads of the same beat were input to the IMV algorithm to generate ten voted results. The most representative was chosen using a greedy technique to calculate the overall classification performance of the model. The HHF-MUPTT-based ECG beat classification model attained excellent performance, with the best lead-wise accuracy of 99.85% observed in Lead III and 99.94% classification accuracy using the IMV algorithm. The results confirmed the high MI classification ability of the presented computationally lightweight HHF-MUPTT-based model.
RESUMO
BACKGROUND: Sleep stage classification is a crucial process for the diagnosis of sleep or sleep-related diseases. Currently, this process is based on manual electroencephalogram (EEG) analysis, which is resource-intensive and error-prone. Various machine learning models have been recommended to standardize and automate the analysis process to address these problems. MATERIALS AND METHODS: The well-known cyclic alternating pattern (CAP) sleep dataset is used to train and test an L-tetrolet pattern-based sleep stage classification model in this research. By using this dataset, the following three cases are created, and they are: Insomnia, Normal, and Fused cases. For each of these cases, the machine learning model is tasked with identifying six sleep stages. The model is structured in terms of feature generation, feature selection, and classification. Feature generation is established with a new L-tetrolet (Tetris letter) function and multiple pooling decomposition for level creation. We fuse ReliefF and iterative neighborhood component analysis (INCA) feature selection using a threshold value. The hybrid and iterative feature selectors are named threshold selection-based ReliefF and INCA (TSRFINCA). The selected features are classified using a cubic support vector machine. RESULTS: The presented L-tetrolet pattern and TSRFINCA-based sleep stage classification model yield 95.43%, 91.05%, and 92.31% accuracies for Insomnia, Normal dataset, and Fused cases, respectively. CONCLUSION: The recommended L-tetrolet pattern and TSRFINCA-based model push the envelope of current knowledge engineering by accurately classifying sleep stages even in the presence of sleep disorders.
RESUMO
BACKGROUND AND PURPOSE: Machine learning models have been used to diagnose schizophrenia. The main purpose of this research is to introduce an effective schizophrenia hand-modeled classification method. METHOD: A public electroencephalogram (EEG) signal data set was used in this work, and an automated schizophrenia detection model is presented using a cyclic group of prime order with a modulo 17 operator. Therefore, the presented feature extractor was named as the cyclic group of prime order pattern, CGP17Pat. Using the proposed CGP17Pat, a new multilevel feature extraction model is presented. To choose a highly distinctive feature, iterative neighborhood component analysis (INCA) was used, and these features were classified using k-nearest neighbors (kNN) with the 10-fold cross-validation and leave-one-subject-out (LOSO) validation techniques. Finally, iterative hard majority voting was employed in the last phase to obtain channel-wise results, and the general results were calculated. RESULTS: The presented CGP17Pat-based EEG classification model attained 99.91% accuracy employing 10-fold cross-validation and 84.33% accuracy using the LOSO strategy. CONCLUSIONS: The findings and results depicted the high classification ability of the presented cryptologic pattern for the data set used.
RESUMO
Mask usage is one of the most important precautions to limit the spread of COVID-19. Therefore, hygiene rules enforce the correct use of face coverings. Automated mask usage classification might be used to improve compliance monitoring. This study deals with the problem of inappropriate mask use. To address that problem, 2075 face mask usage images were collected. The individual images were labeled as either mask, no masked, or improper mask. Based on these labels, the following three cases were created: Case 1: mask versus no mask versus improper mask, Case 2: mask versus no mask + improper mask, and Case 3: mask versus no mask. This data was used to train and test a hybrid deep feature-based masked face classification model. The presented method comprises of three primary stages: (i) pre-trained ResNet101 and DenseNet201 were used as feature generators; each of these generators extracted 1000 features from an image; (ii) the most discriminative features were selected using an improved RelieF selector; and (iii) the chosen features were used to train and test a support vector machine classifier. That resulting model attained 95.95%, 97.49%, and 100.0% classification accuracy rates on Case 1, Case 2, and Case 3, respectively. Having achieved these high accuracy values indicates that the proposed model is fit for a practical trial to detect appropriate face mask use in real time.
Assuntos
COVID-19 , Máscaras , COVID-19/prevenção & controle , Humanos , SARS-CoV-2 , Máquina de Vetores de SuporteRESUMO
SARS-CoV-2 and Influenza-A can present similar symptoms. Computer-aided diagnosis can help facilitate screening for the two conditions, and may be especially relevant and useful in the current COVID-19 pandemic because seasonal Influenza-A infection can still occur. We have developed a novel text-based classification model for discriminating between the two conditions using protein sequences of varying lengths. We downloaded viral protein sequences of SARS-CoV-2 and Influenza-A with varying lengths (all 100 or greater) from the NCBI database and randomly selected 16,901 SARS-CoV-2 and 19,523 Influenza-A sequences to form a two-class study dataset. We used a new feature extraction function based on a unique pattern, HamletPat, generated from the text of Shakespeare's Hamlet, and a signum function to extract local binary pattern-like bits from overlapping fixed-length (27) blocks of the protein sequences. The bits were converted to decimal map signals from which histograms were extracted and concatenated to form a final feature vector of length 1280. The iterative Chi-square function selected the 340 most discriminative features to feed to an SVM with a Gaussian kernel for classification. The model attained 99.92% and 99.87% classification accuracy rates using hold-out (75:25 split ratio) and five-fold cross-validations, respectively. The excellent performance of the lightweight, handcrafted HamletPat-based classification model suggests that it can be a valuable tool for screening protein sequences to discriminate between SARS-CoV-2 and Influenza-A infections.
RESUMO
Electroencephalography (EEG) signals have been widely used to diagnose brain diseases for instance epilepsy, Parkinson's Disease (PD), Multiple Skleroz (MS), and many machine learning methods have been proposed to develop automated disease diagnosis methods using EEG signals. In this method, a multilevel machine learning method is presented to diagnose epilepsy disease. The proposed multilevel EEG classification method consists of pre-processing, feature extraction, feature concatenation, feature selection and classification phases. In order to create levels, Tunable-Q wavelet transform (TQWT) is chosen and 25 frequency coefficients sub-bands are calculated by using TQWT in the pre-processing. In the feature extraction phase, quadruple symmetric pattern (QSP) is chosen as feature extractor and extracts 256 features from the raw EEG signal and the extracted 25 sub-bands. In the feature selection phase, neighborhood component analysis (NCA) is used. The 128, 256, 512 and 1024 most significant features are selected in this phase. In the classification phase, k nearest neighbors (kNN) classifier is utilized as classifier. The proposed method is tested on seven cases using Bonn EEG dataset. The proposed method achieved 98.4% success rate for 5 classes case. Therefore, our proposed method can be used in bigger datasets for more validation.