Your browser doesn't support javascript.
loading
Leveraging Artificial Intelligence and Synthetic Data Derivatives for Spine Surgery Research.
Greenberg, Jacob K; Landman, Joshua M; Kelly, Michael P; Pennicooke, Brenton H; Molina, Camilo A; Foraker, Randi E; Ray, Wilson Z.
Afiliación
  • Greenberg JK; Departments of Neurological Surgery, Medicine and Orthopaedic Surgery, Washington University School of Medicine in St Louis, St Louis, MO, USA.
  • Landman JM; Departments of Neurological Surgery, Medicine and Orthopaedic Surgery, Washington University School of Medicine in St Louis, St Louis, MO, USA.
  • Kelly MP; Orthopaedic Surgery, Washington University, St Louis, MO, USA.
  • Pennicooke BH; Departments of Neurological Surgery, Medicine and Orthopaedic Surgery, Washington University School of Medicine in St Louis, St Louis, MO, USA.
  • Molina CA; Departments of Neurological Surgery, Medicine and Orthopaedic Surgery, Washington University School of Medicine in St Louis, St Louis, MO, USA.
  • Foraker RE; Orthopaedic Surgery, Washington University, St Louis, MO, USA.
  • Ray WZ; Departments of Neurological Surgery, Medicine and Orthopaedic Surgery, Washington University School of Medicine in St Louis, St Louis, MO, USA.
Global Spine J ; 13(8): 2409-2421, 2023 Oct.
Article en En | MEDLINE | ID: mdl-35373623
ABSTRACT
STUDY

DESIGN:

Retrospective cohort study.

OBJECTIVES:

Leveraging electronic health records (EHRs) for spine surgery research is impeded by concerns regarding patient privacy and data ownership. Synthetic data derivatives may help overcome these limitations. This study's objective was to validate the use of synthetic data for spine surgery research.

METHODS:

Data came from the EHR from 15 hospitals. Patients that underwent anterior cervical or posterior lumbar fusion (2010-2020) were included. Real data were obtained from the EHR. Synthetic data was generated to simulate the properties of the real data, without maintaining a one-to-one correspondence with real patients. Within each cohort, ability to predict 30-day readmissions and 30-day complications was evaluated using logistic regression and extreme gradient boosting machines (XGBoost).

RESULTS:

We identified 9,072 real and 9,088 synthetic cervical fusion patients. Descriptive characteristics were nearly identical between the 2 datasets. When predicting readmission, models built using real and synthetic data both had c-statistics of .69-.71 using logistic regression and XGBoost. Among 12,111 real and 12,126 synthetic lumbar fusion patients, descriptive characteristics were nearly the same for most variables. Using logistic regression and XGBoost to predict readmission, discrimination was similar with models built using real and synthetic data (c-statistics .66-.69). When predicting complications, models derived using real and synthetic data showed similar discrimination in both cohorts. Despite some differences, the most influential predictors were similar in the real and synthetic datasets.

CONCLUSION:

Synthetic data replicate most descriptive and predictive properties of real data, and therefore may expand EHR research in spine surgery.
Palabras clave

Texto completo: 1 Colección: 01-internacional Base de datos: MEDLINE Tipo de estudio: Observational_studies / Prognostic_studies / Risk_factors_studies Idioma: En Revista: Global Spine J Año: 2023 Tipo del documento: Article País de afiliación: Estados Unidos

Texto completo: 1 Colección: 01-internacional Base de datos: MEDLINE Tipo de estudio: Observational_studies / Prognostic_studies / Risk_factors_studies Idioma: En Revista: Global Spine J Año: 2023 Tipo del documento: Article País de afiliación: Estados Unidos