Your browser doesn't support javascript.
loading
COVID-19 outbreaks surveillance through text mining applied to electronic health records.
Rocha, Hermano Alexandre Lima; Solha, Erik Zarko Macêdo; Furtado, Vasco; Justino, Francion Linhares; Barreto, Lucas Arêa Leão; da Silva, Ronaldo Guedes; de Oliveira, Ítalo Martins; Bates, David Westfall; de Góes Cavalcanti, Luciano Pamplona; Lima Neto, Antônio Silva; de Oliveira, Erneson Alves.
Afiliação
  • Rocha HAL; Department of Community Health, Federal University of Ceará, Street Papi Júnior, 1223, 5th. Floor, Fortaleza, CE, Brazil. hermano@ufc.br.
  • Solha EZM; Postgraduate Program in Applied Informatics, University of Fortaleza, Fortaleza, CE, 60811-905, Brazil.
  • Furtado V; Postgraduate Program in Applied Informatics, University of Fortaleza, Fortaleza, CE, 60811-905, Brazil.
  • Justino FL; Postgraduate Program in Applied Informatics, University of Fortaleza, Fortaleza, CE, 60811-905, Brazil.
  • Barreto LAL; Department of Community Health, Federal University of Ceará, Street Papi Júnior, 1223, 5th. Floor, Fortaleza, CE, Brazil.
  • da Silva RG; Department of Community Health, Federal University of Ceará, Street Papi Júnior, 1223, 5th. Floor, Fortaleza, CE, Brazil.
  • de Oliveira ÍM; Health Secretariat, Ceará State Government, Fortaleza, CE, Brazil.
  • Bates DW; Harvard Medical School, Boston, MA, USA.
  • de Góes Cavalcanti LP; Department of Community Health, Federal University of Ceará, Street Papi Júnior, 1223, 5th. Floor, Fortaleza, CE, Brazil.
  • Lima Neto AS; School of Public Health of Ceará, Fortaleza, CE, Brazil.
  • de Oliveira EA; Faculty of Medicine, Christus University Center, Fortaleza, CE, Brazil.
BMC Infect Dis ; 24(1): 359, 2024 Mar 28.
Article em En | MEDLINE | ID: mdl-38549109
ABSTRACT

BACKGROUND:

The COVID-19 pandemic has caused significant disruptions to everyday life and has had social, political, and financial consequences that will persist for years. Several initiatives with intensive use of technology were quickly developed in this scenario. However, technologies that enhance epidemiological surveillance in contexts with low testing capacity and healthcare resources are scarce. Therefore, this study aims to address this gap by developing a data science model that uses routinely generated healthcare encounter records to detect possible new outbreaks early in real-time.

METHODS:

We defined an epidemiological indicator that is a proxy for suspected cases of COVID-19 using the health records of Emergency Care Unit (ECU) patients and text mining techniques. The open-field dataset comprises 2,760,862 medical records from nine ECUs, where each record has information about the patient's age, reported symptoms, and the time and date of admission. We also used a dataset where 1,026,804 cases of COVID-19 were officially confirmed. The records range from January 2020 to May 2022. Sample cross-correlation between two finite stochastic time series was used to evaluate the models.

RESULTS:

For patients with age 18 years, we find time-lag () = 72 days and cross-correlation () ~ 0.82, = 25 days and ~ 0.93, and = 17 days and ~ 0.88 for the first, second, and third waves, respectively.

CONCLUSIONS:

In conclusion, the developed model can aid in the early detection of signs of possible new COVID-19 outbreaks, weeks before traditional surveillance systems, thereby anticipating in initiating preventive and control actions in public health with a higher likelihood of success.
Assuntos
Palavras-chave

Texto completo: 1 Base de dados: MEDLINE Assunto principal: COVID-19 Idioma: En Ano de publicação: 2024 Tipo de documento: Article

Texto completo: 1 Base de dados: MEDLINE Assunto principal: COVID-19 Idioma: En Ano de publicação: 2024 Tipo de documento: Article