Your browser doesn't support javascript.
loading
Combination of Feature Selection and Resampling Methods to Predict Preterm Birth Based on Electrohysterographic Signals from Imbalance Data.
Nieto-Del-Amor, Félix; Prats-Boluda, Gema; Garcia-Casado, Javier; Diaz-Martinez, Alba; Diago-Almela, Vicente Jose; Monfort-Ortiz, Rogelio; Hao, Dongmei; Ye-Lin, Yiyao.
Afiliação
  • Nieto-Del-Amor F; Centro de Investigación e Innovación en Bioingeniería, Universitat Politècnica de València, 46022 Valencia, Spain.
  • Prats-Boluda G; Centro de Investigación e Innovación en Bioingeniería, Universitat Politècnica de València, 46022 Valencia, Spain.
  • Garcia-Casado J; Centro de Investigación e Innovación en Bioingeniería, Universitat Politècnica de València, 46022 Valencia, Spain.
  • Diaz-Martinez A; Centro de Investigación e Innovación en Bioingeniería, Universitat Politècnica de València, 46022 Valencia, Spain.
  • Diago-Almela VJ; Servicio de Obstetricia, H.U.P. La Fe, 46026 Valencia, Spain.
  • Monfort-Ortiz R; Servicio de Obstetricia, H.U.P. La Fe, 46026 Valencia, Spain.
  • Hao D; Faculty of Environment and Life, Beijing University of Technology, Beijing International Science and Technology Cooperation Base for Intelligent Physiological Measurement and Clinical Transformation, Beijing 100124, China.
  • Ye-Lin Y; Centro de Investigación e Innovación en Bioingeniería, Universitat Politècnica de València, 46022 Valencia, Spain.
Sensors (Basel) ; 22(14)2022 Jul 07.
Article em En | MEDLINE | ID: mdl-35890778
ABSTRACT
Due to its high sensitivity, electrohysterography (EHG) has emerged as an alternative technique for predicting preterm labor. The main obstacle in designing preterm labor prediction models is the inherent preterm/term imbalance ratio, which can give rise to relatively low performance. Numerous studies obtained promising preterm labor prediction results using the synthetic minority oversampling technique. However, these studies generally overestimate mathematical models' real generalization capacity by generating synthetic data before splitting the dataset, leaking information between the training and testing partitions and thus reducing the complexity of the classification task. In this work, we analyzed the effect of combining feature selection and resampling methods to overcome the class imbalance problem for predicting preterm labor by EHG. We assessed undersampling, oversampling, and hybrid methods applied to the training and validation dataset during feature selection by genetic algorithm, and analyzed the resampling effect on training data after obtaining the optimized feature subset. The best strategy consisted of undersampling the majority class of the validation dataset to 11 during feature selection, without subsequent resampling of the training data, achieving an AUC of 94.5 ± 4.6%, average precision of 84.5 ± 11.7%, maximum F1-score of 79.6 ± 13.8%, and recall of 89.8 ± 12.1%. Our results outperformed the techniques currently used in clinical practice, suggesting the EHG could be used to predict preterm labor in clinics.
Assuntos
Palavras-chave

Texto completo: 1 Base de dados: MEDLINE Assunto principal: Nascimento Prematuro / Trabalho de Parto Prematuro Idioma: En Ano de publicação: 2022 Tipo de documento: Article

Texto completo: 1 Base de dados: MEDLINE Assunto principal: Nascimento Prematuro / Trabalho de Parto Prematuro Idioma: En Ano de publicação: 2022 Tipo de documento: Article