Your browser doesn't support javascript.
loading
Initial data analysis for longitudinal studies to build a solid foundation for reproducible analysis.
Lusa, Lara; Proust-Lima, Cécile; Schmidt, Carsten O; Lee, Katherine J; le Cessie, Saskia; Baillie, Mark; Lawrence, Frank; Huebner, Marianne.
Afiliação
  • Lusa L; Department of Mathematics, Faculty of Mathematics, Natural Sciences and Information Technologies, University of Primorska, Koper, Capodistria, Slovenia.
  • Proust-Lima C; Institute for Biostatistics and Medical Informatics, Faculty of Medicine, University of Ljubljana, Ljubljana, Slovenia.
  • Schmidt CO; Univ. Bordeaux, Inserm, Bordeaux Population Health Research Center, UMR1219, Bordeaux, France.
  • Lee KJ; Institute for community Medicine, SHIP-KEF University Medicine of Greifswald, Greifswald, Germany.
  • le Cessie S; Clinical Epidemiology and Biostatistics Unit, Murdoch Children's Research Institute, Melbourne, Australia.
  • Baillie M; University of Melbourne, Melbourne, Australia.
  • Lawrence F; Department of Clinical Epidemiology and Department of Biomedical Data Sciences, Leiden University Medical Center, Leiden, The Netherlands.
  • Huebner M; Department of Biomedical Data Sciences, Leiden University Medical Center, Leiden, The Netherlands.
PLoS One ; 19(5): e0295726, 2024.
Article em En | MEDLINE | ID: mdl-38809844
ABSTRACT
Initial data analysis (IDA) is the part of the data pipeline that takes place between the end of data retrieval and the beginning of data analysis that addresses the research question. Systematic IDA and clear reporting of the IDA findings is an important step towards reproducible research. A general framework of IDA for observational studies includes data cleaning, data screening, and possible updates of pre-planned statistical analyses. Longitudinal studies, where participants are observed repeatedly over time, pose additional challenges, as they have special features that should be taken into account in the IDA steps before addressing the research question. We propose a systematic approach in longitudinal studies to examine data properties prior to conducting planned statistical analyses. In this paper we focus on the data screening element of IDA, assuming that the research aims are accompanied by an analysis plan, meta-data are well documented, and data cleaning has already been performed. IDA data screening comprises five types of explorations, covering the analysis of participation profiles over time, evaluation of missing data, presentation of univariate and multivariate descriptions, and the depiction of longitudinal aspects. Executing the IDA plan will result in an IDA report to inform data analysts about data properties and possible implications for the analysis plan-another element of the IDA framework. Our framework is illustrated focusing on hand grip strength outcome data from a data collection across several waves in a complex survey. We provide reproducible R code on a public repository, presenting a detailed data screening plan for the investigation of the average rate of age-associated decline of grip strength. With our checklist and reproducible R code we provide data analysts a framework to work with longitudinal data in an informed way, enhancing the reproducibility and validity of their work.
Assuntos

Texto completo: 1 Base de dados: MEDLINE Assunto principal: Análise de Dados Limite: Female / Humans / Male Idioma: En Ano de publicação: 2024 Tipo de documento: Article

Texto completo: 1 Base de dados: MEDLINE Assunto principal: Análise de Dados Limite: Female / Humans / Male Idioma: En Ano de publicação: 2024 Tipo de documento: Article