Your browser doesn't support javascript.
loading
Comparison of ischemic stroke diagnosis models based on machine learning.
Yang, Wan-Xia; Wang, Fang-Fang; Pan, Yun-Yan; Xie, Jian-Qin; Lu, Ming-Hua; You, Chong-Ge.
Afiliação
  • Yang WX; Laboratory Medicine Center, Lanzhou University Second Hospital, Lanzhou, China.
  • Wang FF; Laboratory Medicine Center, Lanzhou University Second Hospital, Lanzhou, China.
  • Pan YY; Laboratory Medicine Center, Lanzhou University Second Hospital, Lanzhou, China.
  • Xie JQ; Anesthesiology Department, Lanzhou University Second Hospital, Lanzhou, China.
  • Lu MH; Laboratory Medicine Center, Lanzhou University Second Hospital, Lanzhou, China.
  • You CG; Laboratory Medicine Center, Lanzhou University Second Hospital, Lanzhou, China.
Front Neurol ; 13: 1014346, 2022.
Article em En | MEDLINE | ID: mdl-36545400
Background: The incidence, prevalence, and mortality of ischemic stroke (IS) continue to rise, resulting in a serious global disease burden. The prediction models have a great value in the early prediction and diagnosis of IS. Methods: The R software was used to screen the differentially expressed genes (DEGs) of IS and control samples in the datasets GSE16561, GSE58294, and GSE37587 and analyze DEGs for enrichment analysis. The feature genes of IS were obtained by several machine learning algorithms, including the least absolute shrinkage and selector operation (LASSO) logistic regression, the support vector machine-recursive feature elimination (SVM-RFE), and the Random Forest (RF). The IS diagnostic models were constructed based on transcriptomics by machine learning and artificial neural network (ANN). Results: A total of 69 DEGs, mainly involved in immune and inflammatory responses, were identified. The pathways enriched in the IS group were complement and coagulation cascades, lysosome, PPAR signaling pathway, regulation of autophagy, and toll-like receptor signaling pathway. The feature genes selected by LASSO, SVM-RFE, and RF were 17, 10, and 12, respectively. The area under the curve (AUC) of the LASSO model in the training dataset, GSE22255, and GSE195442 was 0.969, 0.890, and 1.000. The AUC of the SVM-RFE model was 0.957, 0.805, and 1.000, respectively. The AUC of the RF model was 0.947, 0.935, and 1.000, respectively. The models have good sensitivity, specificity, and accuracy. The AUC of the LASSO+ANN, SVM-RFE+ANN, and RF+ANN models was 1.000, 0.995, and 0.997, respectively, in the training dataset. However, the AUC of LASSO+ANN, SVM-RFE+ANN, and RF+ANN models was 0.688, 0.605, and 0.619, respectively, in the GSE22255 dataset. The AUC of the LASSO+ANN and RF+ANN models was 0.740 and 0.630, respectively, in the GSE195442 dataset. In the training dataset, the sensitivity, specificity, and accuracy of the LASSO+ANN model were 1.000, 1.000, and 1.000, respectively; of the SVM-RFE+ANN model were 0.946, 0.982, and 0.964, respectively; and of the RF+ANN model were 0.964, 1.000, and 0.982, respectively. In the test datasets, the sensitivity was very satisfactory; however, the specificity and accuracy were not good. Conclusion: The LASSO, SVM-RFE, and RF models have good prediction abilities. However, the ANN model is efficient at classifying positive samples and is unsuitable at classifying negative samples.
Palavras-chave

Texto completo: 1 Coleções: 01-internacional Base de dados: MEDLINE Tipo de estudo: Diagnostic_studies / Prognostic_studies / Risk_factors_studies Idioma: En Revista: Front Neurol Ano de publicação: 2022 Tipo de documento: Article País de afiliação: China País de publicação: Suíça

Texto completo: 1 Coleções: 01-internacional Base de dados: MEDLINE Tipo de estudo: Diagnostic_studies / Prognostic_studies / Risk_factors_studies Idioma: En Revista: Front Neurol Ano de publicação: 2022 Tipo de documento: Article País de afiliação: China País de publicação: Suíça