Intra-subject enveloped multilayer fuzzy sample compression for speech diagnosis of Parkinson's disease.
Med Biol Eng Comput
; 62(2): 371-388, 2024 Feb.
Article
in En
| MEDLINE
| ID: mdl-37874453
Machine learning-based Parkinson's disease (PD) speech diagnosis is a current research hotspot. However, existing methods use each corpus sample as the base unit for modeling. Since different corpus samples within the same subject have different sensitive speech features, it is difficult to obtain unified and stable sensitive speech features (diagnostic markers) that reflect the pathology of the whole subject. Therefore, this study aims at compressing the corpus samples within the subject to facilitate the search for diagnostic markers with high diagnostic accuracy. A two-step sample compression module (TSCM) can solve the problem above. It includes two major parts: sample pruning module (SPM) and sample fuzzy clustering mechanism (SFCMD). Based on stacking multiple TSCMs, a multilayer sample compression module (MSCM) is formed to obtain multilayer compression samples. After that, simultaneous sample/feature selection mechanism (SS/FSM) is designed for feature selection. Based on the multilayer compression samples processed by MSCM and SS/FSM, a novel ensemble learning algorithm (EMSFE) is designed with sparse fusion ensemble learning mechanism (SFELM). The proposed EMSFE is validated by visualization of extracted features and performance comparison with related algorithms. The experimental results show that the proposed algorithm can effectively extract the stable diagnostic markers by compressing the corpus samples within the subject. Furthermore, based on LOSO cross validation, the proposed algorithm with extreme learning machine (ELM) classifier can achieve the accuracy of 92.5%, 93.75% and 91.67% on three datasets, respectively. The proposed EMSFE can extract unified and stable sensitive features that accurately reflect the overall pathology of the subject, which can better meet the requirements of clinical applications.The code and datasets can be found in: https://github.com/wywwwww/EMSFE-supplementary-material.git.
Key words
Full text:
1
Collection:
01-internacional
Database:
MEDLINE
Main subject:
Parkinson Disease
/
Data Compression
Limits:
Humans
Language:
En
Journal:
Med Biol Eng Comput
Year:
2024
Document type:
Article
Affiliation country:
China
Country of publication:
United States