Your browser doesn't support javascript.
loading
Mimicking clinical trials with synthetic acute myeloid leukemia patients using generative artificial intelligence.
Eckardt, Jan-Niklas; Hahn, Waldemar; Röllig, Christoph; Stasik, Sebastian; Platzbecker, Uwe; Müller-Tidow, Carsten; Serve, Hubert; Baldus, Claudia D; Schliemann, Christoph; Schäfer-Eckart, Kerstin; Hanoun, Maher; Kaufmann, Martin; Burchert, Andreas; Thiede, Christian; Schetelig, Johannes; Sedlmayr, Martin; Bornhäuser, Martin; Wolfien, Markus; Middeke, Jan Moritz.
Afiliação
  • Eckardt JN; Department of Internal Medicine I, University Hospital Carl Gustav Carus, Technical University Dresden, Dresden, Germany. jan-niklas.eckardt@uniklinikum-dresden.de.
  • Hahn W; Else Kröner Fresenius Center for Digital Health, Technical University Dresden, Dresden, Germany. jan-niklas.eckardt@uniklinikum-dresden.de.
  • Röllig C; Center for Scalable Data Analytics and Artificial Intelligence (ScaDS.AI) Dresden/Leipzig, Leipzig, Germany.
  • Stasik S; Institute for Medical Informatics and Biometry, Technical University Dresden, Dresden, Germany.
  • Platzbecker U; Department of Internal Medicine I, University Hospital Carl Gustav Carus, Technical University Dresden, Dresden, Germany.
  • Müller-Tidow C; Department of Internal Medicine I, University Hospital Carl Gustav Carus, Technical University Dresden, Dresden, Germany.
  • Serve H; Medical Clinic and Policlinic I Hematology and Cell Therapy, University Hospital, Leipzig, Germany.
  • Baldus CD; Department of Medicine V, University Hospital Heidelberg, Heidelberg, Germany.
  • Schliemann C; Department of Medicine 2, Hematology and Oncology, Goethe University Frankfurt, Frankfurt, Germany.
  • Schäfer-Eckart K; Department of Hematology and Oncology, University Hospital Schleswig Holstein, Kiel, Germany.
  • Hanoun M; Department of Medicine A, University Hospital Münster, Münster, Germany.
  • Kaufmann M; Department of Internal Medicine V, Paracelsus Medizinische Privatuniversität and University Hospital Nürnberg, Nürnberg, Germany.
  • Burchert A; Department of Hematology, University Hospital Essen, Essen, Germany.
  • Thiede C; Department of Hematology, Oncology and Palliative Care, Robert-Bosch-Hospital, Stuttgart, Germany.
  • Schetelig J; Department of Hematology, Oncology and Immunology, Philipps-University-Marburg, Marburg, Germany.
  • Sedlmayr M; Department of Internal Medicine I, University Hospital Carl Gustav Carus, Technical University Dresden, Dresden, Germany.
  • Bornhäuser M; Department of Internal Medicine I, University Hospital Carl Gustav Carus, Technical University Dresden, Dresden, Germany.
  • Wolfien M; Institute for Medical Informatics and Biometry, Technical University Dresden, Dresden, Germany.
  • Middeke JM; Department of Internal Medicine I, University Hospital Carl Gustav Carus, Technical University Dresden, Dresden, Germany.
NPJ Digit Med ; 7(1): 76, 2024 Mar 20.
Article em En | MEDLINE | ID: mdl-38509224
ABSTRACT
Clinical research relies on high-quality patient data, however, obtaining big data sets is costly and access to existing data is often hindered by privacy and regulatory concerns. Synthetic data generation holds the promise of effectively bypassing these boundaries allowing for simplified data accessibility and the prospect of synthetic control cohorts. We employed two different methodologies of generative artificial intelligence - CTAB-GAN+ and normalizing flows (NFlow) - to synthesize patient data derived from 1606 patients with acute myeloid leukemia, a heterogeneous hematological malignancy, that were treated within four multicenter clinical trials. Both generative models accurately captured distributions of demographic, laboratory, molecular and cytogenetic variables, as well as patient outcomes yielding high performance scores regarding fidelity and usability of both synthetic cohorts (n = 1606 each). Survival analysis demonstrated close resemblance of survival curves between original and synthetic cohorts. Inter-variable relationships were preserved in univariable outcome analysis enabling explorative analysis in our synthetic data. Additionally, training sample privacy is safeguarded mitigating possible patient re-identification, which we quantified using Hamming distances. We provide not only a proof-of-concept for synthetic data generation in multimodal clinical data for rare diseases, but also full public access to synthetic data sets to foster further research.

Texto completo: 1 Base de dados: MEDLINE Idioma: En Ano de publicação: 2024 Tipo de documento: Article

Texto completo: 1 Base de dados: MEDLINE Idioma: En Ano de publicação: 2024 Tipo de documento: Article