Your browser doesn't support javascript.
loading
Reproducible variability: assessing investigator discordance across 9 research teams attempting to reproduce the same observational study.
Ostropolets, Anna; Albogami, Yasser; Conover, Mitchell; Banda, Juan M; Baumgartner, William A; Blacketer, Clair; Desai, Priyamvada; DuVall, Scott L; Fortin, Stephen; Gilbert, James P; Golozar, Asieh; Ide, Joshua; Kanter, Andrew S; Kern, David M; Kim, Chungsoo; Lai, Lana Y H; Li, Chenyu; Liu, Feifan; Lynch, Kristine E; Minty, Evan; Neves, Maria Inês; Ng, Ding Quan; Obene, Tontel; Pera, Victor; Pratt, Nicole; Rao, Gowtham; Rappoport, Nadav; Reinecke, Ines; Saroufim, Paola; Shoaibi, Azza; Simon, Katherine; Suchard, Marc A; Swerdel, Joel N; Voss, Erica A; Weaver, James; Zhang, Linying; Hripcsak, George; Ryan, Patrick B.
Affiliation
  • Ostropolets A; Department of Biomedical Informatics, Columbia University Irving Medical Center, New York, New York, USA.
  • Albogami Y; Department of Clinical Pharmacy, College of Pharmacy, King Saud University, Riyadh, Saudi Arabia.
  • Conover M; Observational Health Data Analytics, Janssen Research & Development, Titusville, New Jersey, USA.
  • Banda JM; Department of Computer Science, Georgia State University, Atlanta, Georgia, USA.
  • Baumgartner WA; Division of General Internal Medicine, Department of Medicine, University of Colorado Anschutz Medical Campus, Aurora, Colorado, USA.
  • Blacketer C; Observational Health Data Analytics, Janssen Research & Development, Titusville, New Jersey, USA.
  • Desai P; Research IT, Technology and Digital Solutions, Stanford Medicine, Stanford, California, USA.
  • DuVall SL; VA Salt Lake City Health Care System, Salt Lake City, Utah, USA.
  • Fortin S; Department of Internal Medicine, University of Utah School of Medicine, Salt Lake City, Utah, USA.
  • Gilbert JP; Observational Health Data Analytics, Janssen Research & Development, Titusville, New Jersey, USA.
  • Golozar A; Observational Health Data Analytics, Janssen Research & Development, Titusville, New Jersey, USA.
  • Ide J; Odysseus Data Services, New York, New York, USA.
  • Kanter AS; Johnson & Johnson, Titusville, New Jersey, USA.
  • Kern DM; Department of Biomedical Informatics, Columbia University Irving Medical Center, New York, New York, USA.
  • Kim C; Observational Health Data Analytics, Janssen Research & Development, Titusville, New Jersey, USA.
  • Lai LYH; Department of Biomedical Sciences, Ajou University Graduate School of Medicine, Suwon, South Korea.
  • Li C; Department of Informatics, Imaging & Data Sciences, University of Manchester, Manchester, UK.
  • Liu F; Department of Biomedical Informatics, University of Pittsburgh, Pittsburgh, Pennsylvania, USA.
  • Lynch KE; Department of Population and Quantitative Health Sciences, University of Massachusetts Chan Medical School, Worcester, Massachusetts, USA.
  • Minty E; VA Salt Lake City Health Care System, Salt Lake City, Utah, USA.
  • Neves MI; Department of Internal Medicine, University of Utah School of Medicine, Salt Lake City, Utah, USA.
  • Ng DQ; O'Brien Institute for Public Health, Faculty of Medicine, University of Calgary, Calgary, Canada.
  • Obene T; Real World Solutions, IQVIA, Durham, North Carolina, USA.
  • Pera V; Department of Pharmaceutical Sciences, School of Pharmacy & Pharmaceutical Sciences, University of California, Irvine, California, USA.
  • Pratt N; Mississippi Urban Research Center, Jackson State University, Jackson, Mississippi, USA.
  • Rao G; Department of Medical Informatics, Erasmus University Medical Center, Rotterdam, The Netherlands.
  • Rappoport N; Quality Use of Medicines and Pharmacy Research Centre, University of South Australia, Adelaide, Australia.
  • Reinecke I; Observational Health Data Analytics, Janssen Research & Development, Titusville, New Jersey, USA.
  • Saroufim P; Department of Software and Information Systems Engineering, Ben-Gurion University of the Negev, Israel.
  • Shoaibi A; Institute for Medical Informatics and Biometry, Carl Gustav Carus Faculty of Medicine, Technische Universität Dresden, Dresden, Germany.
  • Simon K; Cleveland Institute for Computational Biology, Case Western Reserve University, Cleveland, Ohio, USA.
  • Suchard MA; Observational Health Data Analytics, Janssen Research & Development, Titusville, New Jersey, USA.
  • Swerdel JN; VA Tennessee Valley Health Care System, Vanderbilt University Medical Center, Nashville, Tennessee, USA.
  • Voss EA; Department of Biostatistics, University of California, Los Angeles, California, USA.
  • Weaver J; Department of Human Genetics, University of California, Los Angeles, California, USA.
  • Zhang L; Observational Health Data Analytics, Janssen Research & Development, Titusville, New Jersey, USA.
  • Hripcsak G; Observational Health Data Analytics, Janssen Research & Development, Titusville, New Jersey, USA.
  • Ryan PB; Observational Health Data Analytics, Janssen Research & Development, Titusville, New Jersey, USA.
J Am Med Inform Assoc ; 30(5): 859-868, 2023 04 19.
Article in En | MEDLINE | ID: mdl-36826399
ABSTRACT

OBJECTIVE:

Observational studies can impact patient care but must be robust and reproducible. Nonreproducibility is primarily caused by unclear reporting of design choices and analytic procedures. This study aimed to (1) assess how the study logic described in an observational study could be interpreted by independent researchers and (2) quantify the impact of interpretations' variability on patient characteristics. MATERIALS AND

METHODS:

Nine teams of highly qualified researchers reproduced a cohort from a study by Albogami et al. The teams were provided the clinical codes and access to the tools to create cohort definitions such that the only variable part was their logic choices. We executed teams' cohort definitions against the database and compared the number of subjects, patient overlap, and patient characteristics.

RESULTS:

On average, the teams' interpretations fully aligned with the master implementation in 4 out of 10 inclusion criteria with at least 4 deviations per team. Cohorts' size varied from one-third of the master cohort size to 10 times the cohort size (2159-63 619 subjects compared to 6196 subjects). Median agreement was 9.4% (interquartile range 15.3-16.2%). The teams' cohorts significantly differed from the master implementation by at least 2 baseline characteristics, and most of the teams differed by at least 5.

CONCLUSIONS:

Independent research teams attempting to reproduce the study based on its free-text description alone produce different implementations that vary in the population size and composition. Sharing analytical code supported by a common data model and open-source tools allows reproducing a study unambiguously thereby preserving initial design choices.
Subject(s)
Key words

Full text: 1 Collection: 01-internacional Database: MEDLINE Main subject: Research Personnel Type of study: Observational_studies / Prognostic_studies Limits: Humans Language: En Journal: J Am Med Inform Assoc Journal subject: INFORMATICA MEDICA Year: 2023 Document type: Article Affiliation country: United States

Full text: 1 Collection: 01-internacional Database: MEDLINE Main subject: Research Personnel Type of study: Observational_studies / Prognostic_studies Limits: Humans Language: En Journal: J Am Med Inform Assoc Journal subject: INFORMATICA MEDICA Year: 2023 Document type: Article Affiliation country: United States