RESUMO
BACKGROUND: Literature reviews (LRs) identify, evaluate, and synthesize relevant papers to a particular research question to advance understanding and support decision-making. However, LRs, especially traditional systematic reviews, are slow, resource-intensive, and become outdated quickly. OBJECTIVE: LiteRev is an advanced and enhanced version of an existing automation tool designed to assist researchers in conducting LRs through the implementation of cutting-edge technologies such as natural language processing and machine learning techniques. In this paper, we present a comprehensive explanation of LiteRev's capabilities, its methodology, and an evaluation of its accuracy and efficiency to a manual LR, highlighting the benefits of using LiteRev. METHODS: Based on the user's query, LiteRev performs an automated search on a wide range of open-access databases and retrieves relevant metadata on the resulting papers, including abstracts or full texts when available. These abstracts (or full texts) are text processed and represented as a term frequency-inverse document frequency matrix. Using dimensionality reduction (pairwise controlled manifold approximation) and clustering (hierarchical density-based spatial clustering of applications with noise) techniques, the corpus is divided into different topics described by a list of the most important keywords. The user can then select one or several topics of interest, enter additional keywords to refine its search, or provide key papers to the research question. Based on these inputs, LiteRev performs a k-nearest neighbor (k-NN) search and suggests a list of potentially interesting papers. By tagging the relevant ones, the user triggers new k-NN searches until no additional paper is suggested for screening. To assess the performance of LiteRev, we ran it in parallel to a manual LR on the burden and care for acute and early HIV infection in sub-Saharan Africa. We assessed the performance of LiteRev using true and false predictive values, recall, and work saved over sampling. RESULTS: LiteRev extracted, processed, and transformed text into a term frequency-inverse document frequency matrix of 631 unique papers from PubMed. The topic modeling module identified 16 topics and highlighted 2 topics of interest to the research question. Based on 18 key papers, the k-NNs module suggested 193 papers for screening out of 613 papers in total (31.5% of the whole corpus) and correctly identified 64 relevant papers out of the 87 papers found by the manual abstract screening (recall rate of 73.6%). Compared to the manual full text screening, LiteRev identified 42 relevant papers out of the 48 papers found manually (recall rate of 87.5%). This represents a total work saved over sampling of 56%. CONCLUSIONS: We presented the features and functionalities of LiteRev, an automation tool that uses natural language processing and machine learning methods to streamline and accelerate LRs and support researchers in getting quick and in-depth overviews on any topic of interest.
Assuntos
Infecções por HIV , Processamento de Linguagem Natural , Humanos , Análise por Conglomerados , Bases de Dados Factuais , Aprendizado de Máquina , Literatura de Revisão como AssuntoRESUMO
OBJECTIVE: To describe the prevalence of HIV viral suppression and assess the factors associated with HIV viral suppression among persons receiving antiretroviral therapy (ART) in Malawi in 2021. METHODS: Implementation study using routinely collected patient-level HIV RNA-PCR test result data extracted from the national Laboratory Management Information System (LIMS) database managed by the Department of HIV/AIDS in 2021. We calculated frequencies, proportions and odds ratios (OR) of HIV viral suppression with their associated 95% confidence intervals (95%CIs). We performed a random-effects logistic regression to determine the risk factors associated with HIV viral suppression among ART patients, controlling for the spatial autocorrelation between districts and adjusting for other variables. RESULTS: We evaluated 515,797 adults and children receiving ART and having a viral load test in 2021. Of these, 92.8% had HIV viral suppression. ART patients living in urban areas had lower likelihood of HIV viral suppression than those living in rural areas (adjusted OR [aOR] = 0.95, 95%CI: 0.92-0.99, p = 0.01). There was an increasing trend in HIV viral suppression with increasing ART duration. Routine VL monitoring samples were 39% more likely to have suppressed VL values than confirmatory HIV VL monitoring samples (aOR = 1.39; 95%CI: 1.34-1.43, p < 0.001). CONCLUSION: This is the first national analysis of Malawi HIV VL data from LIMS. Our findings show the need to particularly consider the urban residents, those below 20 years, males, those on ART for less than a year as well as those on specific ARV regimens in order to persistently suppress HIV VL and consequently achieve the goal of achieving HIV VL suppression by 2030.
Assuntos
Fármacos Anti-HIV , Infecções por HIV , Sistemas de Informação Administrativa , Adulto , Fármacos Anti-HIV/uso terapêutico , Criança , Infecções por HIV/epidemiologia , Humanos , Malaui/epidemiologia , Masculino , Razão de Chances , Carga ViralRESUMO
Emerging infectious diseases are a growing threat in sub-Saharan African countries, but the human and technical capacity to quickly respond to outbreaks remains limited. Here, we describe the experience and lessons learned from a joint project with the WHO Regional Office for Africa (WHO AFRO) to support the sub-Saharan African COVID-19 response.In June 2020, WHO AFRO contracted a number of consultants to reinforce the COVID-19 response in member states by providing actionable epidemiological analysis. Given the urgency of the situation and the magnitude of work required, we recruited a worldwide network of field experts, academics and students in the areas of public health, data science and social science to support the effort. Most analyses were performed on a merged line list of COVID-19 cases using a reverse engineering model (line listing built using data extracted from national situation reports shared by countries with the Regional Office for Africa as per the IHR (2005) obligations). The data analysis platform The Renku Project ( https://renkulab.io ) provided secure data storage and permitted collaborative coding.Over a period of 6 months, 63 contributors from 32 nations (including 17 African countries) participated in the project. A total of 45 in-depth country-specific epidemiological reports and data quality reports were prepared for 28 countries. Spatial transmission and mortality risk indices were developed for 23 countries. Text and video-based training modules were developed to integrate and mentor new members. The team also began to develop EpiGraph Hub, a web application that automates the generation of reports similar to those we created, and includes more advanced data analyses features (e.g. mathematical models, geospatial analyses) to deliver real-time, actionable results to decision-makers.Within a short period, we implemented a global collaborative approach to health data management and analyses to advance national responses to health emergencies and outbreaks. The interdisciplinary team, the hands-on training and mentoring, and the participation of local researchers were key to the success of this initiative.
Assuntos
COVID-19 , África Subsaariana/epidemiologia , COVID-19/epidemiologia , Surtos de Doenças/prevenção & controle , Humanos , Saúde Pública , Recursos HumanosRESUMO
This study analysed the reported incidence of COVID-19 and associated epidemiological and socio-economic factors in the WHO African region. Data from COVID-19 confirmed cases and SARS-CoV-2 tests reported to the WHO by Member States between 25 February and 31 December 2020 and publicly available health and socio-economic data were analysed using univariate and multivariate binomial regression models. The overall cumulative incidence was 1846 cases per million population. Cape Verde (21 350 per million), South Africa (18 060 per million), Namibia (9840 per million), Eswatini (8151 per million) and Botswana (6044 per million) recorded the highest cumulative incidence, while Benin (260 per million), Democratic Republic of Congo (203 per million), Niger (141 cases per million), Chad (133 per million) and Burundi (62 per million) recorded the lowest. Increasing percentage of urban population (ß = -0.011, P = 0.04) was associated with low cumulative incidence, while increasing number of cumulative SARS-CoV-2 tests performed per 10 000 population (ß = 0.0006, P = 0.006) and the proportion of population aged 15-64 years (adjusted ß = 0.174, P < 0.0001) were associated with high COVID-19 cumulative incidence. With limited testing capacities and overwhelmed health systems, these findings highlight the need for countries to increase and decentralise testing capacities and adjust testing strategies to target most at-risk populations.
Assuntos
COVID-19/epidemiologia , SARS-CoV-2 , Organização Mundial da Saúde , Adolescente , Adulto , África/epidemiologia , Humanos , Incidência , Modelos Logísticos , Pessoa de Meia-Idade , Análise Multivariada , Estudos Retrospectivos , Fatores de Tempo , Adulto JovemRESUMO
BACKGROUND: Demographic and sociobehavioral factors are strong drivers of HIV infection rates in sub-Saharan Africa. These factors are often studied in qualitative research but ignored in quantitative analyses. However, they provide in-depth insight into the local behavior and may help to improve HIV prevention. OBJECTIVE: To obtain a comprehensive overview of the sociobehavioral factors influencing HIV prevalence and incidence in Malawi, we systematically reviewed the literature using a newly programmed tool for automatizing part of the systematic review process. METHODS: Due to the choice of broad search terms ("HIV AND Malawi"), our preliminary search revealed many thousands of articles. We, therefore, developed a Python tool to automatically extract, process, and categorize open-access articles published from January 1, 1987 to October 1, 2019 in the PubMed, PubMed Central, JSTOR, Paperity, and arXiV databases. We then used a topic modelling algorithm to classify and identify publications of interest. RESULTS: Our tool extracted 22,709 unique articles; 16,942 could be further processed. After topic modelling, 519 of these were clustered into relevant topics, of which 20 were kept after manual screening. We retrieved 7 more publications after examining the references so that 27 publications were finally included in the review. Reducing the 16,942 articles to 519 potentially relevant articles using the software took 5 days. Several factors contributing to the risk of HIV infection were identified, including religion, gender and relationship dynamics, beliefs, and sociobehavioral attitudes. CONCLUSIONS: Our software does not replace traditional systematic reviews, but it returns useful results to broad queries of open-access literature in under a week, without a priori knowledge. This produces a "seed dataset" of relevance that could be further developed. It identified known factors and factors that may be specific to Malawi. In the future, we aim to expand the tool by adding more social science databases and applying it to other sub-Saharan African countries.
Assuntos
Infecções por HIV/epidemiologia , Programas de Rastreamento/métodos , Humanos , Malaui , Pesquisa QualitativaRESUMO
INTRODUCTION: Machine learning algorithms are increasingly being used to inform HIV prevention and detection strategies. We validated and extended a previously developed machine learning model for patient retention on antiretroviral therapy in a new geographic catchment area in South Africa. METHODS: We compared the ability of an adaptive boosting algorithm to predict interruption in treatment (IIT) in 2 South African cohorts from the Free State and Mpumalanga and Gauteng and North West (GA/NW) provinces. We developed a novel set of predictive features for the GA/NW cohort using a categorical boosting model. We evaluated the ability of the model to predict IIT over all visits and across different periods within a patient's treatment trajectory. RESULTS: When predicting IIT, the GA/NW and Free State and Mpumalanga models demonstrated a sensitivity of 60% and 61%, respectively, able to correctly predict nearly two-thirds of all missed visits with a positive predictive value of 18% and 19%. Using predictive features generated from the GA/NW cohort, the categorical boosting model correctly predicted 22,119 of a total of 35,985 missed next visits, yielding a sensitivity of 62%, specificity of 67%, and positive predictive value of 20%. Model performance was highest when tested on visits within the first 6 months. CONCLUSIONS: Machine learning algorithms may be useful in informing tools to increase antiretroviral therapy patient retention and efficiency of HIV care interventions. This is particularly relevant in developing countries where health data systems are being strengthened to collect data on a scale that is large enough to apply novel analytical methods.
Assuntos
Infecções por HIV , Humanos , Infecções por HIV/tratamento farmacológico , África do Sul , Aprendizado de MáquinaRESUMO
INTRODUCTION: High yield HIV testing strategies are critical to reach epidemic control in high prevalence and low-resource settings such as East and Southern Africa. In this study, we aimed to predict the HIV status of individuals living in Angola, Burundi, Ethiopia, Lesotho, Malawi, Mozambique, Namibia, Rwanda, Zambia and Zimbabwe with the highest precision and sensitivity for different policy targets and constraints based on a minimal set of socio-behavioural characteristics. METHODS: We analysed the most recent Demographic and Health Survey from these 10 countries to predict individual's HIV status using four different algorithms (a penalized logistic regression, a generalized additive model, a support vector machine, and a gradient boosting trees). The algorithms were trained and validated on 80% of the data, and tested on the remaining 20%. We compared the predictions based on the F1 score, the harmonic mean of sensitivity and positive predictive value (PPV), and we assessed the generalization of our models by testing them against an independent left-out country. The best performing algorithm was trained on a minimal subset of variables which were identified as the most predictive, and used to 1) identify 95% of people living with HIV (PLHIV) while maximising precision and 2) identify groups of individuals by adjusting the probability threshold of being HIV positive (90% in our scenario) for achieving specific testing strategies. RESULTS: Overall 55,151 males and 69,626 females were included in the analysis. The gradient boosting trees algorithm performed best in predicting HIV status with a mean F1 score of 76.8% [95% confidence interval (CI) 76.0%-77.6%] for males (vs [CI 67.8%-70.6%] for SVM) and 78.8% [CI 78.2%-79.4%] for females (vs [CI 73.4%-75.8%] for SVM). Among the ten most predictive variables for each sex, nine were identical: longitude, latitude and, altitude of place of residence, current age, age of most recent partner, total lifetime number of sexual partners, years lived in current place of residence, condom use during last intercourse and, wealth index. Only age at first sex for male (ranked 10th) and Rohrer's index for female (ranked 6th) were not similar for both sexes. Our large-scale scenario, which consisted in identifying 95% of all PLHIV, would have required testing 49.4% of males and 48.1% of females while achieving a precision of 15.4% for males and 22.7% for females. For the second scenario, only 4.6% of males and 6.0% of females would have had to be tested to find 55.7% of all males and 50.5% of all females living with HIV. CONCLUSIONS: We trained a gradient boosting trees algorithm to find 95% of PLHIV with a precision twice higher than with general population testing by using only a limited number of socio-behavioural characteristics. We also successfully identified people at high risk of infection who may be offered pre-exposure prophylaxis or voluntary medical male circumcision. These findings can inform the implementation of new high-yield HIV tests and help develop very precise strategies based on low-resource settings constraints.
Assuntos
Circuncisão Masculina , Infecções por HIV , Profilaxia Pré-Exposição , África Austral/epidemiologia , Feminino , Infecções por HIV/diagnóstico , Infecções por HIV/tratamento farmacológico , Infecções por HIV/epidemiologia , Teste de HIV , Humanos , MasculinoRESUMO
BACKGROUND: Despite the availability of individual-level data of infants accessing HIV DNA-PCR testing service, there has been little in-depth analysis of such data. Therefore, we describe trends in risk of HIV infection among Malawi's HIV-exposed infants (HEI) with DNA-PCR HIV test result from 2013 to 2020. METHODS: This is an implementation study using routinely collected patient-level HIV DNA-PCR test result data extracted from the national Laboratory Management Information System database managed by the Department of HIV/AIDS between 1 January 2013 and 30 June 2020. We calculated frequencies, proportions and odds ratio (OR) with their associated 95% CI. We performed a random-effects logistic regression to determine the risk factors associated with HIV infection in infants, controlling for the spatial autocorrelation between districts and adjusting for other variables. RESULTS: We evaluated 255 229 HEI across 750 facilities in 28 districts. The HIV DNA-PCR test was performed within 2 months in 57% of the children. The overall HIV prevalence among all tested HEI between 2013 and 2020 was 7.2% (95% CI 7.1% to 7.3%). We observed a decreasing trend in the proportion of HEI that tested HIV positive from 7.0% (95% CI 6.6% to 7.4%) in 2013 to 5.7% (95% CI 5.4% to 5.9%) in 2015 followed by an increase to 9.9% (95% CI 9.6% to 10.2%) in 2017 and thereafter a decreasing trend between 2017 (i.e. 9.72% (95%CI: 9.43-10.01)) and 2020 (i.e. 3.86% (95%CI: 3.34-4.37)). The HIV prevalence increased by age of the HEI. There was spatial heterogeneity of HIV prevalence between districts of Malawi. The prevalence of HIV was higher among the HEI from the Northern region of Malawi. CONCLUSION: The main findings of the study are that the DNA test is performed within 2 months only in 57% of the children, that the decreasing trend of HIV prevalence among HEI observed up to 2015 was followed by an increase up to 2017 and a decrease afterwards, and that the risk of HIV infection increased with age at HIV testing. We summarised spatial and temporal trends of risk of HIV infection among HEI in Malawi between 2013 and 2020. There is need to ensure that all the HEI are enrolled in HIV care by 8 weeks of age in order to further reduce the risk of HIV in this population.
Assuntos
Síndrome da Imunodeficiência Adquirida , Infecções por HIV , Criança , Infecções por HIV/diagnóstico , Humanos , Lactente , Malaui/epidemiologia , Prevalência , Fatores de RiscoRESUMO
INTRODUCTION: HIV incidence varies widely between sub-Saharan African (SSA) countries. This variation coincides with a substantial sociobehavioural heterogeneity, which complicates the design of effective interventions. In this study, we investigated how sociobehavioural heterogeneity in sub-Saharan Africa could account for the variance of HIV incidence between countries. METHODS: We analysed aggregated data, at the national-level, from the most recent Demographic and Health Surveys of 29 SSA countries (2010-2017), which included 594,644 persons (183,310 men and 411,334 women). We preselected 48 demographic, socio-economic, behavioural and HIV-related attributes to describe each country. We used Principal Component Analysis to visualize sociobehavioural similarity between countries, and to identify the variables that accounted for most sociobehavioural variance in SSA. We used hierarchical clustering to identify groups of countries with similar sociobehavioural profiles, and we compared the distribution of HIV incidence (estimates from UNAIDS) and sociobehavioural variables within each cluster. RESULTS: The most important characteristics, which explained 69% of sociobehavioural variance across SSA among the variables we assessed were: religion; male circumcision; number of sexual partners; literacy; uptake of HIV testing; women's empowerment; accepting attitude toward people living with HIV/AIDS; rurality; ART coverage; and, knowledge about AIDS. Our model revealed three groups of countries, each with characteristic sociobehavioural profiles. HIV incidence was mostly similar within each cluster and different between clusters (median (IQR); 0.5/1000 (0.6/1000), 1.8/1000 (1.3/1000) and 5.0/1000 (4.2/1000)). CONCLUSIONS: Our findings suggest that the combination of sociobehavioural factors play a key role in determining the course of the HIV epidemic, and that similar techniques can help to predict the effects of behavioural change on the HIV epidemic and to design targeted interventions to impede HIV transmission in SSA.
RESUMO
The extent of SARS-CoV-2 circulation in many African countries remains unclear, underlining the need for antibody sero-surveys to assess the cumulative attack rate. Here, we present the results of a cross-sectional sero-survey of a random sample of residents of a health district in Yaounde, Cameroon, conducted from October 14 to November 26, 2020. Among the 971 participants, the test-adjusted seroprevalence of anti-SARS-CoV-2 IgG antibodies was 29·2% (95% CI 24·3-34·1). This is about 322 times greater than the 0.09% nationwide attack rate implied by COVID-19 case counts at the time. Men, obese individuals and those living in large households were significantly more likely to be seropositive, and the majority (64·2% [58·7-69·4]) of seropositive individuals reported no symptoms. Despite the high seroprevalence, most of the population had not been infected with SARS-CoV-2, highlighting the importance of continued measures to control viral spread and quick vaccine deployment to protect the vulnerable.
Assuntos
Anticorpos Antivirais/sangue , COVID-19/epidemiologia , COVID-19/virologia , SARS-CoV-2/imunologia , Estudos Soroepidemiológicos , População Urbana , Adolescente , Adulto , Fatores Etários , Idoso , Camarões/epidemiologia , Criança , Pré-Escolar , Feminino , Geografia , Humanos , Imunoglobulina G/sangue , Imunoglobulina M/sangue , Masculino , Pessoa de Meia-Idade , Aceitação pelo Paciente de Cuidados de Saúde , Fatores de Risco , Fatores Sexuais , Adulto JovemRESUMO
BACKGROUND: We aimed to estimate the disease burden of Tuberculosis (TB) and return on investment of TB care in selected high-burden countries of the Western Pacific Region (WPR) until 2030. METHODS: We projected the TB epidemic in Viet Nam and Lao People's Democratic Republic (PDR) 2020-2030 using a mathematical model under various scenarios: counterfactual (no TB care); baseline (TB care continues at current levels); and 12 different diagnosis and treatment interventions. We retrieved previous modeling results for China and the Philippines. We pooled the new and existing information on incidence and deaths in the four countries, covering >80% of the TB burden in WPR. We estimated the return on investment of TB care and interventions in Viet Nam and Lao PDR using a Solow model. FINDINGS: In the baseline scenario, TB incidence in the four countries decreased from 97â¢0/100,000/year (2019) to 90â¢1/100,000/year (2030), and TB deaths from 83,300/year (2019) to 71,100/year (2030). Active case finding (ACF) strategies (screening people not seeking care for respiratory symptoms) were the most effective single interventions. Return on investment (2020-2030) for TB care in Viet Nam and Lao PDR ranged US$4-US$49/dollar spent; additional interventions brought up to US$2â¢7/dollar spent. INTERPRETATION: In the modeled countries, TB incidence will only modestly decrease without additional interventions. Interventions that include ACF can reduce TB burden but achieving the End TB incidence and mortality targets will be difficult without new transformational tools (e.g. vaccine, new diagnostic tools, shorter treatment). However, TB care, even at its current level, can bring a multiple-fold return on investment. FUNDING: World Health Organization Western Pacific Regional Office; Swiss National Science Foundation Grant 163878.
RESUMO
INTRODUCTION: Within many sub-Saharan African countries including Malawi, HIV prevalence varies widely between regions. This variability may be related to the distribution of population groups with specific sociobehavioural characteristics that influence the transmission of HIV and the uptake of prevention. In this study, we intended to identify groups of people in Malawi with similar risk profiles. METHODS: We used data from the Demographic and Health Survey in Malawi (2015 to 2016), and stratified the analysis by sex. We considered demographic, socio-behavioural and HIV-related variables. Using Latent Class Analysis (LCA), we identified groups of people sharing common sociobehavioural characteristics. The optimal number of classes (groups) was selected based on the Bayesian information criterion. We compared the proportions of individuals belonging to the different groups across the three regions and 28 districts of Malawi. RESULTS: We found nine groups of women and six groups of men. Most women in the groups with highest risk of being HIV positive were living in female-headed households and were formerly married or in a union. Among men, older men had the highest risk of being HIV positive, followed by young (20 to 25) single men. Generally, low HIV testing uptake correlated with lower risk of having HIV. However, rural adolescent girls had a low probability of being tested (48.7%) despite a relatively high HIV prevalence. Urban districts and the Southern region had a higher percentage of high-prevalence and less tested groups of individuals than other areas. CONCLUSIONS: LCA is an efficient method to find groups of people sharing common HIV risk profiles, identify particularly vulnerable sub-populations, and plan targeted interventions focusing on these groups. Tailored support, prevention and HIV testing programmes should focus particularly on female household heads, adolescent girls living in rural areas, older married men and young men who have never been married.
Assuntos
Infecções por HIV/epidemiologia , Infecções por HIV/psicologia , Comportamento Social , Adolescente , Adulto , Teorema de Bayes , Criança , Feminino , Humanos , Análise de Classes Latentes , Alfabetização , Malaui/epidemiologia , Masculino , Pessoa de Meia-Idade , Prevalência , População Rural/estatística & dados numéricos , Fatores Sexuais , Adulto JovemRESUMO
The recent lifting of COVID-19 related restrictions in Switzerland causes uncertainty about the future of the epidemic. We developed a compartmental model for SARS-CoV-2 transmission in Switzerland and projected the course of the epidemic until the end of year 2020 under various scenarios. The model was age-structured with three categories: children (0-17), adults (18-64) and seniors (65- years). Lifting all restrictions according to the plans disclosed by the Swiss federal authorities by mid-May resulted in a rapid rebound in the epidemic, with the peak expected in July. Measures equivalent to at least 90% reduction in all contacts were able to eradicate the epidemic; a 56% reduction in contacts could keep the intensive care unit occupancy under the critical level and delay the next wave until October. In scenarios where strong contact reductions were only applied in selected age groups, the epidemic could not be suppressed, resulting in an increased risk of a rebound in July, and another stronger wave in September. Future interventions need to cover all age groups to keep the SARS-CoV-2 epidemic under control.