Your browser doesn't support javascript.
loading
Enriching Data Science and Health Care Education: Application and Impact of Synthetic Data Sets Through the Health Gym Project.
Kuo, Nicholas I-Hsien; Perez-Concha, Oscar; Hanly, Mark; Mnatzaganian, Emmanuel; Hao, Brandon; Di Sipio, Marcus; Yu, Guolin; Vanjara, Jash; Valerie, Ivy Cerelia; de Oliveira Costa, Juliana; Churches, Timothy; Lujic, Sanja; Hegarty, Jo; Jorm, Louisa; Barbieri, Sebastiano.
Afiliação
  • Kuo NI; Centre for Big Data Research in Health, The University of New South Wales, Sydney, Australia.
  • Perez-Concha O; Centre for Big Data Research in Health, The University of New South Wales, Sydney, Australia.
  • Hanly M; Centre for Big Data Research in Health, The University of New South Wales, Sydney, Australia.
  • Mnatzaganian E; The University of New South Wales, Sydney, Australia.
  • Hao B; The University of New South Wales, Sydney, Australia.
  • Di Sipio M; The University of New South Wales, Sydney, Australia.
  • Yu G; The University of New South Wales, Sydney, Australia.
  • Vanjara J; The University of New South Wales, Sydney, Australia.
  • Valerie IC; The University of New South Wales, Sydney, Australia.
  • de Oliveira Costa J; Medicines Intelligence Research Program, School of Population Health, The University of New South Wales, Sydney, Australia.
  • Churches T; School of Clinical Medicine, University of New South Wales, Sydney, Australia.
  • Lujic S; Ingham Institute of Applied Medical Research, Liverpool, Sydney, Australia.
  • Hegarty J; Centre for Big Data Research in Health, The University of New South Wales, Sydney, Australia.
  • Jorm L; Sydney Local Health District, Sydney, Australia.
  • Barbieri S; Centre for Big Data Research in Health, The University of New South Wales, Sydney, Australia.
JMIR Med Educ ; 10: e51388, 2024 Jan 16.
Article em En | MEDLINE | ID: mdl-38227356
ABSTRACT
Large-scale medical data sets are vital for hands-on education in health data science but are often inaccessible due to privacy concerns. Addressing this gap, we developed the Health Gym project, a free and open-source platform designed to generate synthetic health data sets applicable to various areas of data science education, including machine learning, data visualization, and traditional statistical models. Initially, we generated 3 synthetic data sets for sepsis, acute hypotension, and antiretroviral therapy for HIV infection. This paper discusses the educational applications of Health Gym's synthetic data sets. We illustrate this through their use in postgraduate health data science courses delivered by the University of New South Wales, Australia, and a Datathon event, involving academics, students, clinicians, and local health district professionals. We also include adaptable worked examples using our synthetic data sets, designed to enrich hands-on tutorial and workshop experiences. Although we highlight the potential of these data sets in advancing data science education and health care artificial intelligence, we also emphasize the need for continued research into the inherent limitations of synthetic data.
Assuntos
Palavras-chave

Texto completo: 1 Base de dados: MEDLINE Assunto principal: Inteligência Artificial / Infecções por HIV Tipo de estudo: Risk_factors_studies Limite: Humans Idioma: En Ano de publicação: 2024 Tipo de documento: Article

Texto completo: 1 Base de dados: MEDLINE Assunto principal: Inteligência Artificial / Infecções por HIV Tipo de estudo: Risk_factors_studies Limite: Humans Idioma: En Ano de publicação: 2024 Tipo de documento: Article