Your browser doesn't support javascript.
loading
Development of a Machine Learning Modeling Tool for Predicting HIV Incidence Using Public Health Data From a County in the Southern United States.
Saldana, Carlos S; Burkhardt, Elizabeth; Pennisi, Alfred; Oliver, Kirsten; Olmstead, John; Holland, David P; Gettings, Jenna; Mauck, Daniel; Austin, David; Wortley, Pascale; Ochoa, Karla V Saldana.
Affiliation
  • Saldana CS; Division of Infectious Diseases, Department of Medicine, Emory University School of Medicine, Atlanta, Georgia, USA.
  • Burkhardt E; Epidemiology Division, Georgia Department of Public Health, Atlanta, Georgia, USA.
  • Pennisi A; Epidemiology Division, Georgia Department of Public Health, Atlanta, Georgia, USA.
  • Oliver K; Epidemiology Division, Georgia Department of Public Health, Atlanta, Georgia, USA.
  • Olmstead J; Epidemiology Division, Georgia Department of Public Health, Atlanta, Georgia, USA.
  • Holland DP; Division of Primary Care, Mercy Care Health Systems, Atlanta, Georgia, USA.
  • Gettings J; Fulton County Board of Health, Communicable Disease Prevention Branch, Atlanta, Georgia, USA.
  • Mauck D; Epidemiology Division, Georgia Department of Public Health, Atlanta, Georgia, USA.
  • Austin D; Epidemiology Division, Georgia Department of Public Health, Atlanta, Georgia, USA.
  • Wortley P; Epidemiology Division, Georgia Department of Public Health, Atlanta, Georgia, USA.
  • Ochoa KVS; Epidemiology Division, Georgia Department of Public Health, Atlanta, Georgia, USA.
Clin Infect Dis ; 79(3): 717-726, 2024 Sep 26.
Article in En | MEDLINE | ID: mdl-38393832
ABSTRACT

BACKGROUND:

Advancements in machine learning (ML) have improved the accuracy of models that predict human immunodeficiency virus (HIV) incidence. These models have used electronic medical records and registries. We aim to broaden the application of these tools by using deidentified public health datasets for notifiable sexually transmitted infections (STIs) from a southern US county known for high HIV incidence. The goal is to assess the feasibility and accuracy of ML in predicting HIV incidence, which could inform and enhance public health interventions.

METHODS:

We analyzed 2 deidentified public health datasets from January 2010 to December 2021, focusing on notifiable STIs. Our process involved data processing and feature extraction, including sociodemographic factors, STI cases, and social vulnerability index (SVI) metrics. Various ML models were trained and evaluated for predicting HIV incidence using metrics such as accuracy, precision, recall, and F1 score.

RESULTS:

We included 85 224 individuals; 2027 (2.37%) were newly diagnosed with HIV during the study period. The ML models demonstrated high performance in predicting HIV incidence among males and females. Influential features for males included age at STI diagnosis, previous STI information, provider type, and SVI. For females, predictive features included age, ethnicity, previous STI information, overall SVI, and race.

CONCLUSIONS:

The high accuracy of our ML models in predicting HIV incidence highlights the potential of using public health datasets for public health interventions such as tailored HIV testing and prevention. While these findings are promising, further research is needed to translate these models into practical public health applications.
Subject(s)
Key words

Full text: 1 Collection: 01-internacional Database: MEDLINE Main subject: HIV Infections / Machine Learning Limits: Adolescent / Adult / Female / Humans / Male / Middle aged Country/Region as subject: America do norte Language: En Journal: Clin Infect Dis Journal subject: DOENCAS TRANSMISSIVEIS Year: 2024 Type: Article Affiliation country: United States

Full text: 1 Collection: 01-internacional Database: MEDLINE Main subject: HIV Infections / Machine Learning Limits: Adolescent / Adult / Female / Humans / Male / Middle aged Country/Region as subject: America do norte Language: En Journal: Clin Infect Dis Journal subject: DOENCAS TRANSMISSIVEIS Year: 2024 Type: Article Affiliation country: United States