Your browser doesn't support javascript.
loading
Pooling individual participant data from randomized controlled trials: Exploring potential loss of information.
van Wanrooij, Lennard L; Hoevenaar-Blom, Marieke P; Coley, Nicola; Ngandu, Tiia; Meiller, Yannick; Guillemont, Juliette; Rosenberg, Anna; Beishuizen, Cathrien R L; Moll van Charante, Eric P; Soininen, Hilkka; Brayne, Carol; Andrieu, Sandrine; Kivipelto, Miia; Richard, Edo.
  • van Wanrooij LL; Department of Neurology, Amsterdam UMC, University of Amsterdam, Amsterdam, The Netherlands.
  • Hoevenaar-Blom MP; Department of Neurology, Amsterdam UMC, University of Amsterdam, Amsterdam, The Netherlands.
  • Coley N; Department of Neurology, Donders Institute for Brain, Cognition and Behaviour, Radboud University Medical Center, Nijmegen, The Netherlands.
  • Ngandu T; Department of Epidemiology and Public Health, Toulouse University Hospital, Toulouse, France.
  • Meiller Y; INSERM, University of Toulouse UMR1027, Toulouse, France.
  • Guillemont J; Chronic Disease Prevention Unit, National Institute for Health and Welfare, Helsinki, Finland.
  • Rosenberg A; Department of Information and Operations Management, ESCP Europe, Paris, France.
  • Beishuizen CRL; INSERM, University of Toulouse, Toulouse, France.
  • Moll van Charante EP; Department of Neurology, Institute of Clinical Medicine, University of Eastern Finland, Kuopio, Finland.
  • Soininen H; Department of Neurology, Amsterdam UMC, University of Amsterdam, Amsterdam, The Netherlands.
  • Brayne C; Department of General Practice, Amsterdam UMC, University of Amsterdam, Amsterdam, The Netherlands.
  • Andrieu S; Department of Neurology, Institute of Clinical Medicine, University of Eastern Finland, Kuopio, Finland.
  • Kivipelto M; Neurocenter, Neurology, Kuopio University Hospital, Kuopio, Finland.
  • Richard E; Department of Public Health and Primary Care, Cambridge Institute of Public Health, University of Cambridge, Cambridge, United Kingdom.
PLoS One ; 15(5): e0232970, 2020.
Article en En | MEDLINE | ID: mdl-32396543
ABSTRACT

BACKGROUND:

Pooling individual participant data to enable pooled analyses is often complicated by diversity in variables across available datasets. Therefore, recoding original variables is often necessary to build a pooled dataset. We aimed to quantify how much information is lost in this process and to what extent this jeopardizes validity of analyses results.

METHODS:

Data were derived from a platform that was developed to pool data from three randomized controlled trials on the effect of treatment of cardiovascular risk factors on cognitive decline or dementia. We quantified loss of information using the R-squared of linear regression models with pooled variables as a function of their original variable(s). In case the R-squared was below 0.8, we additionally explored the potential impact of loss of information for future analyses. We did this second step by comparing whether the Beta coefficient of the predictor differed more than 10% when adding original or recoded variables as a confounder in a linear regression model. In a simulation we randomly sampled numbers, recoded those < = 1000 to 0 and those >1000 to 1 and varied the range of the continuous variable, the ratio of recoded zeroes to recoded ones, or both, and again extracted the R-squared from linear models to quantify information loss.

RESULTS:

The R-squared was below 0.8 for 8 out of 91 recoded variables. In 4 cases this had a substantial impact on the regression models, particularly when a continuous variable was recoded into a discrete variable. Our simulation showed that the least information is lost when the ratio of recoded zeroes to ones is 11.

CONCLUSIONS:

Large, pooled datasets provide great opportunities, justifying the efforts for data harmonization. Still, caution is warranted when using recoded variables which variance is explained limitedly by their original variables as this may jeopardize the validity of study results.
Asunto(s)

Texto completo: 1 Banco de datos: MEDLINE Asunto principal: Ensayos Clínicos Controlados Aleatorios como Asunto / Metaanálisis como Asunto Tipo de estudio: Clinical_trials / Prognostic_studies / Risk_factors_studies / Systematic_reviews Límite: Humans Idioma: En Año: 2020 Tipo del documento: Article

Texto completo: 1 Banco de datos: MEDLINE Asunto principal: Ensayos Clínicos Controlados Aleatorios como Asunto / Metaanálisis como Asunto Tipo de estudio: Clinical_trials / Prognostic_studies / Risk_factors_studies / Systematic_reviews Límite: Humans Idioma: En Año: 2020 Tipo del documento: Article