A comparison of machine learning algorithms and covariate balance measures for propensity score matching and weighting.

Cannas, Massimo; Arpino, Bruno

Cannas, Massimo; Arpino, Bruno.

Afiliación

Cannas M; Department of Economic and Business Sciences, University of Cagliari, Cagliari, Italy.
Arpino B; Department of Statistics, Computer Science, Applications, University of Firenze, Firenze, Italy.

Biom J ; 61(4): 1049-1072, 2019 07.

Article en En | MEDLINE | ID: mdl-31090108

ABSTRACT

ABSTRACT

Propensity score matching (PSM) and propensity score weighting (PSW) are popular tools to estimate causal effects in observational studies. We address two open issues how to estimate propensity scores and assess covariate balance. Using simulations, we compare the performance of PSM and PSW based on logistic regression and machine learning algorithms (CART; Bagging; Boosting; Random Forest; Neural Networks; naive Bayes). Additionally, we consider several measures of covariate balance (Absolute Standardized Average Mean (ASAM) with and without interactions; measures based on the quantile-quantile plots; ratio between variances of propensity scores; area under the curve (AUC)) and assess their ability in predicting the bias of PSM and PSW estimators. We also investigate the importance of tuning of machine learning parameters in the context of propensity score methods. Two simulation designs are employed. In the first, the generating processes are inspired to birth register data used to assess the effect of labor induction on the occurrence of caesarean section. The second exploits more general generating mechanisms. Overall, among the different techniques, random forests performed the best, especially in PSW. Logistic regression and neural networks also showed an excellent performance similar to that of random forests. As for covariate balance, the simplest and commonly used metric, the ASAM, showed a strong correlation with the bias of causal effects estimators. Our findings suggest that researchers should aim at obtaining an ASAM lower than 10% for as many variables as possible. In the empirical study we found that labor induction had a small and not statistically significant impact on caesarean section.

Asunto(s)

Biometría/métodos; Aprendizaje Automático; Puntaje de Propensión; Cesárea/estadística & datos numéricos; Femenino; Humanos; Trabajo de Parto Inducido/estadística & datos numéricos; Método de Montecarlo; Análisis Multivariante; Estudios Observacionales como Asunto; Embarazo

Palabras clave

caesarean section; covariate balance; labor induction; machine learning algorithms; propensity score methods

Texto completo

Imprimir

XML

PubMed Links

Buscar en Google

Texto completo: 1 Bases de datos: MEDLINE Asunto principal: Biometría / Puntaje de Propensión / Aprendizaje Automático Tipo de estudio: Health_economic_evaluation / Observational_studies / Prognostic_studies Límite: Female / Humans / Pregnancy Idioma: En Revista: Biom J Año: 2019 Tipo del documento: Article País de afiliación: Italia

Texto completo

Imprimir

XML

PubMed Links

Buscar en Google