Your browser doesn't support javascript.
loading
Studying missingness in spinal cord injury data: challenges and impact of data imputation.
Bourguignon, Lucie; Lukas, Louis P; Guest, James D; Geisler, Fred H; Noonan, Vanessa; Curt, Armin; Brüningk, Sarah C; Jutzeler, Catherine R.
Afiliação
  • Bourguignon L; Department of Health Sciences and Technology (D-HEST), ETH Zurich, Universitätstrasse 2, 8092, Zürich, Switzerland. lucie.bourguignon@hest.ethz.ch.
  • Lukas LP; Schulthess Klinik, Lengghalde 2, 8008, Zürich, Switzerland. lucie.bourguignon@hest.ethz.ch.
  • Guest JD; SIB Swiss Institute of Bioinformatics, Lausanne, Switzerland. lucie.bourguignon@hest.ethz.ch.
  • Geisler FH; Department of Health Sciences and Technology (D-HEST), ETH Zurich, Universitätstrasse 2, 8092, Zürich, Switzerland.
  • Noonan V; Schulthess Klinik, Lengghalde 2, 8008, Zürich, Switzerland.
  • Curt A; SIB Swiss Institute of Bioinformatics, Lausanne, Switzerland.
  • Brüningk SC; Neurological Surgery and the Miami Project to Cure Paralysis, U Miami, Miami, FL, 33136, USA.
  • Jutzeler CR; Department of Medical Imaging, College of Medicine, University of Saskatchewan, Saskatoon, Saskatchewan, Canada.
BMC Med Res Methodol ; 24(1): 5, 2024 01 06.
Article em En | MEDLINE | ID: mdl-38184529
ABSTRACT

BACKGROUND:

In the last decades, medical research fields studying rare conditions such as spinal cord injury (SCI) have made extensive efforts to collect large-scale data. However, most analysis methods rely on complete data. This is particularly troublesome when studying clinical data as they are prone to missingness. Often, researchers mitigate this problem by removing patients with missing data from the analyses. Less commonly, imputation methods to infer likely values are applied.

OBJECTIVE:

Our objective was to study how handling missing data influences the results reported, taking the example of SCI registries. We aimed to raise awareness on the effects of missing data and provide guidelines to be applied for future research projects, in SCI research and beyond.

METHODS:

Using the Sygen clinical trial data (n = 797), we analyzed the impact of the type of variable in which data is missing, the pattern according to which data is missing, and the imputation strategy (e.g. mean imputation, last observation carried forward, multiple imputation).

RESULTS:

Our simulations show that mean imputation may lead to results strongly deviating from the underlying expected results. For repeated measures missing at late stages (> = 6 months after injury in this simulation study), carrying the last observation forward seems the preferable option for the imputation. This simulation study could show that a one-size-fit-all imputation strategy falls short in SCI data sets.

CONCLUSIONS:

Data-tailored imputation strategies are required (e.g., characterisation of the missingness pattern, last observation carried forward for repeated measures evolving to a plateau over time). Therefore, systematically reporting the extent, kind and decisions made regarding missing data will be essential to improve the interpretation, transparency, and reproducibility of the research presented.
Assuntos
Palavras-chave

Texto completo: 1 Coleções: 01-internacional Base de dados: MEDLINE Assunto principal: Traumatismos da Medula Espinal / Pesquisa Biomédica Tipo de estudo: Guideline / Prognostic_studies Limite: Humans Idioma: En Revista: BMC Med Res Methodol Ano de publicação: 2024 Tipo de documento: Article

Texto completo: 1 Coleções: 01-internacional Base de dados: MEDLINE Assunto principal: Traumatismos da Medula Espinal / Pesquisa Biomédica Tipo de estudo: Guideline / Prognostic_studies Limite: Humans Idioma: En Revista: BMC Med Res Methodol Ano de publicação: 2024 Tipo de documento: Article