Your browser doesn't support javascript.
loading
Development of a Machine Learning Modelling Tool for Predicting HIV Incidence Using Public Health Data from a County in the Southern United States.
Saldana, Carlos S; Burkhardt, Elizabeth; Pennisi, Alfred; Oliver, Kirsten; Olmstead, John; Holland, David P; Gettings, Jenna; Mauck, Daniel; Austin, David; Wortley, Pascale; Ochoa, Karla V Saldana.
Afiliación
  • Saldana CS; Division of Infectious Diseases, Department of Medicine, Emory University School of Medicine, Atlanta - GA, United States.
  • Burkhardt E; Georgia Department of Public Health, Atlanta - GA, United States.
  • Pennisi A; Georgia Department of Public Health, Atlanta - GA, United States.
  • Oliver K; Georgia Department of Public Health, Atlanta - GA, United States.
  • Olmstead J; Georgia Department of Public Health, Atlanta - GA, United States.
  • Holland DP; Mercy Care Health Systems, Atlanta - GA, United States.
  • Gettings J; Fulton County Board of Health, Atlanta GA, United States.
  • Mauck D; Georgia Department of Public Health, Atlanta - GA, United States.
  • Austin D; Georgia Department of Public Health, Atlanta - GA, United States.
  • Wortley P; Georgia Department of Public Health, Atlanta - GA, United States.
  • Ochoa KVS; Georgia Department of Public Health, Atlanta - GA, United States.
Clin Infect Dis ; 2024 Feb 23.
Article en En | MEDLINE | ID: mdl-38393832
ABSTRACT

BACKGROUND:

Recent advancements in Machine Learning (ML) have significantly improved the accuracy of models predicting HIV incidence. These models typically utilize electronic medical records and patient registries. This study aims to broaden the application of these tools by utilizing de-identified public health datasets for notifiable sexually transmitted infections (STIs) from a southern U.S. County known for high HIV incidence rates. The goal is to assess the feasibility and accuracy of ML in predicting HIV incidence, which could potentially inform and enhance public health interventions.

METHODS:

We analyzed two de-identified public health datasets, spanning January 2010 to December 2021, focusing on notifiable STIs. Our process involved data processing and feature extraction, including sociodemographic factors, STI cases, and social vulnerability index (SVI) metrics. Various ML algorithms were trained and evaluated for predicting HIV incidence, using metrics such as accuracy, precision, recall, and F1 score.

RESULTS:

The study included 85,224 individuals, with 2,027 (2.37%) newly diagnosed with HIV during the study period. The ML models demonstrated high performance in predicting HIV incidence among males and females. Influential predictive features for males included age at STI diagnosis, previous STI information, provider type, and SVI. For females, they included age, ethnicity, previous STIs information, overall SVI, and race.

CONCLUSIONS:

The high accuracy of our ML models in predicting HIV incidence highlights the potential of using public health datasets for public health interventions such as tailored HIV testing and prevention. While these findings are promising, further research is needed to translate these models into practical public health applications.
Palabras clave

Texto completo: 1 Colección: 01-internacional Base de datos: MEDLINE Idioma: En Revista: Clin Infect Dis Asunto de la revista: DOENCAS TRANSMISSIVEIS Año: 2024 Tipo del documento: Article País de afiliación: Estados Unidos

Texto completo: 1 Colección: 01-internacional Base de datos: MEDLINE Idioma: En Revista: Clin Infect Dis Asunto de la revista: DOENCAS TRANSMISSIVEIS Año: 2024 Tipo del documento: Article País de afiliación: Estados Unidos
...