Your browser doesn't support javascript.
loading
Analyzing the Effect of Imputation on Classification Performance under MCAR and MAR Missing Mechanisms.
Buczak, Philip; Chen, Jian-Jia; Pauly, Markus.
Affiliation
  • Buczak P; Department of Statistics, TU Dortmund University, 44227 Dortmund, Germany.
  • Chen JJ; Department of Computer Science, TU Dortmund University, 44227 Dortmund, Germany.
  • Pauly M; Department of Statistics, TU Dortmund University, 44227 Dortmund, Germany.
Entropy (Basel) ; 25(3)2023 Mar 17.
Article in En | MEDLINE | ID: mdl-36981409
ABSTRACT
Many datasets in statistical analyses contain missing values. As omitting observations containing missing entries may lead to information loss or greatly reduce the sample size, imputation is usually preferable. However, imputation can also introduce bias and impact the quality and validity of subsequent analysis. Focusing on binary classification problems, we analyzed how missing value imputation under MCAR as well as MAR missingness with different missing patterns affects the predictive performance of subsequent classification. To this end, we compared imputation methods such as several MICE variants, missForest, Hot Deck as well as mean imputation with regard to the classification performance achieved with commonly used classifiers such as Random Forest, Extreme Gradient Boosting, Support Vector Machine and regularized logistic regression. Our simulation results showed that Random Forest based imputation (i.e., MICE Random Forest and missForest) performed particularly well in most scenarios studied. In addition to these two methods, simple mean imputation also proved to be useful, especially when many features (covariates) contained missing values.
Key words

Full text: 1 Collection: 01-internacional Database: MEDLINE Language: En Journal: Entropy (Basel) Year: 2023 Document type: Article Affiliation country: Germany

Full text: 1 Collection: 01-internacional Database: MEDLINE Language: En Journal: Entropy (Basel) Year: 2023 Document type: Article Affiliation country: Germany