RESUMO
Considerable spatial heterogeneity has been observed in COVID-19 transmission across administrative areas of England throughout the pandemic. This study investigates what drives these differences. We constructed a probabilistic case count model for 306 administrative areas of England across 95 weeks, fit using a Bayesian evidence synthesis framework. We incorporate the impact of acquired immunity, of spatial exportation of cases, and 16 spatially-varying socio-economic, socio-demographic, health, and mobility variables. Model comparison assesses the relative contributions of these respective mechanisms. We find that spatially-varying and time-varying differences in week-to-week transmission were definitively associated with differences in: time spent at home, variant-of-concern proportion, and adult social care funding. However, model comparison demonstrates that the impact of these terms is negligible compared to the role of spatial exportation between administrative areas. While these results confirm the impact of some, but not all, static measures of spatially-varying inequity in England, our work corroborates the finding that observed differences in disease transmission during the pandemic were predominantly driven by underlying epidemiological factors rather than aggregated metrics of demography and health inequity between areas. Further work is required to assess how health inequity more broadly contributes to these epidemiological factors.
Assuntos
Teorema de Bayes , COVID-19 , SARS-CoV-2 , Humanos , COVID-19/transmissão , COVID-19/epidemiologia , Inglaterra/epidemiologia , Pandemias/estatística & dados numéricos , Fatores Socioeconômicos , Disparidades nos Níveis de Saúde , Modelos EstatísticosRESUMO
Epidemiological delays are key quantities that inform public health policy and clinical practice. They are used as inputs for mathematical and statistical models, which in turn can guide control strategies. In recent work, we found that censoring, right truncation, and dynamical bias were rarely addressed correctly when estimating delays and that these biases were large enough to have knock-on impacts across a large number of use cases. Here, we formulate a checklist of best practices for estimating and reporting epidemiological delays. We also provide a flowchart to guide practitioners based on their data. Our examples are focused on the incubation period and serial interval due to their importance in outbreak response and modeling, but our recommendations are applicable to other delays. The recommendations, which are based on the literature and our experience estimating epidemiological delay distributions during outbreak responses, can help improve the robustness and utility of reported estimates and provide guidance for the evaluation of estimates for downstream use in transmission models or other analyses.
Assuntos
Doenças Transmissíveis , Surtos de Doenças , Humanos , Doenças Transmissíveis/epidemiologia , Surtos de Doenças/estatística & dados numéricos , Modelos Estatísticos , Biologia Computacional/métodos , Modelos EpidemiológicosRESUMO
BACKGROUND: Syndromic management is widely used to treat symptomatic sexually transmitted infections in settings without aetiologic diagnostics. However, underlying aetiologies and consequent treatment suitability are uncertain without regular assessment. This systematic review estimated the distribution, trends, and determinants of aetiologies for vaginal discharge, urethral discharge, and genital ulcer in sub-Saharan Africa (SSA). METHODS AND FINDINGS: We searched Embase, MEDLINE, Global Health, Web of Science, and grey literature from inception until December 20, 2023, for observational studies reporting aetiologic diagnoses among symptomatic populations in SSA. We adjusted observations for diagnostic test performance, used generalised linear mixed-effects meta-regressions to generate estimates, and critically appraised studies using an adapted Joanna Briggs Institute checklist. Of 4,418 identified records, 206 reports were included from 190 studies in 32 countries conducted between 1969 and 2022. In 2015, estimated primary aetiologies for vaginal discharge were candidiasis (69.4% [95% confidence interval (CI): 44.3% to 86.6%], n = 50), bacterial vaginosis (50.0% [95% CI: 32.3% to 67.8%], n = 39), chlamydia (16.2% [95% CI: 8.6% to 28.5%], n = 50), and trichomoniasis (12.9% [95% CI: 7.7% to 20.7%], n = 80); for urethral discharge were gonorrhoea (77.1% [95% CI: 68.1% to 84.1%], n = 68) and chlamydia (21.9% [95% CI: 15.4% to 30.3%], n = 48); and for genital ulcer were herpes simplex virus type 2 (HSV-2) (48.3% [95% CI: 32.9% to 64.1%], n = 47) and syphilis (9.3% [95% CI: 6.4% to 13.4%], n = 117). Temporal variation was substantial, particularly for genital ulcer where HSV-2 replaced chancroid as the primary cause. Aetiologic distributions for each symptom were largely the same across regions and population strata, despite HIV status and age being significantly associated with several infection diagnoses. Limitations of the review include the absence of studies in 16 of 48 SSA countries, substantial heterogeneity in study observations, and impeded assessment of this variability due to incomplete or inconsistent reporting across studies. CONCLUSIONS: In our study, syndrome aetiologies in SSA aligned with World Health Organization guidelines without strong evidence of geographic or demographic variation, supporting broad guideline applicability. Temporal changes underscore the importance of regular aetiologic re-assessment for effective syndromic management. PROSPERO NUMBER: CRD42022348045.
Assuntos
Úlcera , Descarga Vaginal , Humanos , África Subsaariana/epidemiologia , Feminino , Descarga Vaginal/epidemiologia , Descarga Vaginal/etiologia , Úlcera/epidemiologia , Infecções Sexualmente Transmissíveis/epidemiologia , Infecções Sexualmente Transmissíveis/diagnóstico , Vaginose Bacteriana/epidemiologia , Vaginose Bacteriana/diagnóstico , Vaginose Bacteriana/complicações , Infecções por Chlamydia/epidemiologia , Infecções por Chlamydia/complicações , Infecções por Chlamydia/diagnóstico , Doenças Uretrais/epidemiologia , Doenças Uretrais/etiologia , Doenças dos Genitais Femininos/epidemiologiaRESUMO
BACKGROUND: In settings without etiologic testing for sexually transmitted infections (STIs), programs rely on STI symptom data to inform priorities. To evaluate whether self-reported STI symptoms in household surveys consistently represent the STI burden, we compared symptomatic infection rates between survey self-reporting and health facility case reporting in Malawi. METHODS: We analyzed self-reported symptoms and treatment seeking in the past year among sexually active adults from 4 Malawi Demographic and Health Surveys between 2000 and 2015. Bayesian mixed-effects models were used to estimate temporal trends, spatial variation, and sociodemographic determinants. Survey reporting was compared with health facility syndromic diagnoses between 2014 and 2021. RESULTS: In surveys, 11.0% (95% confidence interval, 10.7%-11.4%) of adults reported STI or STI-related symptoms in the last year, of whom 54.2% (52.8%-55.7%) sought treatment. In facilities, the mean annual symptomatic case diagnosis rate was 3.3%. Survey-reported treatment in the last year was 3.8% (95% credible interval, 2.3%-6.1%) for genital ulcer, 3.8% (2.0%-6.7%) for vaginal discharge, and 2.6% (1.2%-4.7%) for urethral discharge. Mean annual diagnosis rates at facilities were 0.5% for genital ulcer, 2.2% for vaginal discharge, and 2.0% for urethral discharge. Both data sources indicated a higher burden of symptoms among women, individuals older than 25 years, and those in Southern Malawi. CONCLUSIONS: Survey and facility case reports indicated similar spatial and demographic patterns of STI symptom burden and care seeking, but implied large differences in the magnitude and relative burden of symptoms, particularly genital ulcer, which could affect program priorities. Targeted etiologic surveillance would improve interpretation of these data to enable more comprehensive STI surveillance.
Assuntos
Infecções por HIV , Infecções Sexualmente Transmissíveis , Descarga Vaginal , Adulto , Feminino , Humanos , Úlcera , Teorema de Bayes , Malaui/epidemiologia , Infecções Sexualmente Transmissíveis/diagnóstico , Infecções Sexualmente Transmissíveis/epidemiologia , Infecções por HIV/epidemiologiaRESUMO
The time-varying reproduction number (Rt) is an important measure of epidemic transmissibility that directly informs policy decisions and the optimisation of control measures. EpiEstim is a widely used opensource software tool that uses case incidence and the serial interval (SI, time between symptoms in a case and their infector) to estimate Rt in real-time. The incidence and the SI distribution must be provided at the same temporal resolution, which can limit the applicability of EpiEstim and other similar methods, e.g. for contexts where the time window of incidence reporting is longer than the mean SI. In the EpiEstim R package, we implement an expectation-maximisation algorithm to reconstruct daily incidence from temporally aggregated data, from which Rt can then be estimated. We assess the validity of our method using an extensive simulation study and apply it to COVID-19 and influenza data. For all datasets, the influence of intra-weekly variability in reported data was mitigated by using aggregated weekly data. Rt estimated on weekly sliding windows using incidence reconstructed from weekly data was strongly correlated with estimates from the original daily data. The simulation study revealed that Rt was well estimated in all scenarios and regardless of the temporal aggregation of the data. In the presence of weekend effects, Rt estimates from reconstructed data were more successful at recovering the true value of Rt than those obtained from reported daily data. These results show that this novel method allows Rt to be successfully recovered from aggregated data using a simple approach with very few data requirements. Additionally, by removing administrative noise when daily incidence data are reconstructed, the accuracy of Rt estimates can be improved.
Assuntos
COVID-19 , Humanos , Incidência , Software , Simulação por Computador , ReproduçãoRESUMO
Forecast evaluation is essential for the development of predictive epidemic models and can inform their use for public health decision-making. Common scores to evaluate epidemiological forecasts are the Continuous Ranked Probability Score (CRPS) and the Weighted Interval Score (WIS), which can be seen as measures of the absolute distance between the forecast distribution and the observation. However, applying these scores directly to predicted and observed incidence counts may not be the most appropriate due to the exponential nature of epidemic processes and the varying magnitudes of observed values across space and time. In this paper, we argue that transforming counts before applying scores such as the CRPS or WIS can effectively mitigate these difficulties and yield epidemiologically meaningful and easily interpretable results. Using the CRPS on log-transformed values as an example, we list three attractive properties: Firstly, it can be interpreted as a probabilistic version of a relative error. Secondly, it reflects how well models predicted the time-varying epidemic growth rate. And lastly, using arguments on variance-stabilizing transformations, it can be shown that under the assumption of a quadratic mean-variance relationship, the logarithmic transformation leads to expected CRPS values which are independent of the order of magnitude of the predicted quantity. Applying a transformation of log(x + 1) to data and forecasts from the European COVID-19 Forecast Hub, we find that it changes model rankings regardless of stratification by forecast date, location or target types. Situations in which models missed the beginning of upward swings are more strongly emphasised while failing to predict a downturn following a peak is less severely penalised when scoring transformed forecasts as opposed to untransformed ones. We conclude that appropriate transformations, of which the natural logarithm is only one particularly attractive option, should be considered when assessing the performance of different models in the context of infectious disease incidence.
Assuntos
COVID-19 , Epidemias , Humanos , COVID-19/epidemiologia , Saúde Pública , Probabilidade , RegistrosRESUMO
BACKGROUND: The serial interval is a key epidemiological measure that quantifies the time between an infector's and an infectee's onset of symptoms. This measure helps investigate epidemiological links between cases, and is an important parameter in transmission models used to estimate transmissibility and inform control strategies. The emergence of multiple variants of concern (VOC) during the SARS-CoV-2 pandemic has led to uncertainties about potential changes in the serial interval of COVID-19. We estimated the household serial interval of multiple VOC using data collected by the Virus Watch study. This online, prospective, community cohort study followed-up entire households in England and Wales since mid-June 2020. METHODS: This analysis included 5842 symptomatic individuals with confirmed SARS-CoV-2 infection among 2579 households from Sept 1, 2020, to Aug 10, 2022. SARS-CoV-2 variant designation was based upon national surveillance data of variant prevalence by date and geographical region. We used a Bayesian framework to infer who infected whom by exploring all transmission trees compatible with the observed dates of symptoms, given assumptions on the incubation period and generation time distributions using the R package outbreaker2. FINDINGS: We characterised the serial interval of COVID-19 by VOC. The mean serial interval was shortest for omicron BA5 (2·02 days; 95% credible interval [CrI] 1·26-2·84) and longest for alpha (3·37 days; 2·52-4·04). The mean serial interval before alpha (wild-type) was 2·29 days (95% CrI 1·39-2·94), 3·11 days (2·28-3·90) for delta, 2·72 days (2·01-3·47) for omicron BA1, and 2·67 days (1·90-3·46) for omicron BA2. We estimated that 17% (95% CrI 5-26) of serial interval values are negative across all variants. INTERPRETATION: Most methods estimating the reproduction number from incidence time series do not allow for a negative serial interval by construction. Further research is needed to extend these methods and assess biases introduced by not accounting for negative serial intervals. To our knowledge, this study is the first to use a Bayesian framework to estimate the serial interval of all major SARS-CoV-2 VOC from thousands of confirmed household cases. FUNDING: UK Medical Research Council and Wellcome Trust.
Assuntos
COVID-19 , Humanos , COVID-19/epidemiologia , SARS-CoV-2 , Teorema de Bayes , Estudos de Coortes , Estudos ProspectivosRESUMO
Forecasts based on epidemiological modelling have played an important role in shaping public policy throughout the COVID-19 pandemic. This modelling combines knowledge about infectious disease dynamics with the subjective opinion of the researcher who develops and refines the model and often also adjusts model outputs. Developing a forecast model is difficult, resource- and time-consuming. It is therefore worth asking what modelling is able to add beyond the subjective opinion of the researcher alone. To investigate this, we analysed different real-time forecasts of cases of and deaths from COVID-19 in Germany and Poland over a 1-4 week horizon submitted to the German and Polish Forecast Hub. We compared crowd forecasts elicited from researchers and volunteers, against a) forecasts from two semi-mechanistic models based on common epidemiological assumptions and b) the ensemble of all other models submitted to the Forecast Hub. We found crowd forecasts, despite being overconfident, to outperform all other methods across all forecast horizons when forecasting cases (weighted interval score relative to the Hub ensemble 2 weeks ahead: 0.89). Forecasts based on computational models performed comparably better when predicting deaths (rel. WIS 1.26), suggesting that epidemiological modelling and human judgement can complement each other in important ways.
Assuntos
COVID-19 , Doenças Transmissíveis , COVID-19/epidemiologia , Previsões , Humanos , Pandemias , Polônia/epidemiologiaRESUMO
BACKGROUND: England's COVID-19 roadmap out of lockdown policy set out the timeline and conditions for the stepwise lifting of non-pharmaceutical interventions (NPIs) as vaccination roll-out continued, with step one starting on March 8, 2021. In this study, we assess the roadmap, the impact of the delta (B.1.617.2) variant of SARS-CoV-2, and potential future epidemic trajectories. METHODS: This mathematical modelling study was done to assess the UK Government's four-step process to easing lockdown restrictions in England, UK. We extended a previously described model of SARS-CoV-2 transmission to incorporate vaccination and multi-strain dynamics to explicitly capture the emergence of the delta variant. We calibrated the model to English surveillance data, including hospital admissions, hospital occupancy, seroprevalence data, and population-level PCR testing data using a Bayesian evidence synthesis framework, then modelled the potential trajectory of the epidemic for a range of different schedules for relaxing NPIs. We estimated the resulting number of daily infections and hospital admissions, and daily and cumulative deaths. Three scenarios spanning a range of optimistic to pessimistic vaccine effectiveness, waning natural immunity, and cross-protection from previous infections were investigated. We also considered three levels of mixing after the lifting of restrictions. FINDINGS: The roadmap policy was successful in offsetting the increased transmission resulting from lifting NPIs starting on March 8, 2021, with increasing population immunity through vaccination. However, because of the emergence of the delta variant, with an estimated transmission advantage of 76% (95% credible interval [95% CrI] 69-83) over alpha, fully lifting NPIs on June 21, 2021, as originally planned might have led to 3900 (95% CrI 1500-5700) peak daily hospital admissions under our central parameter scenario. Delaying until July 19, 2021, reduced peak hospital admissions by three fold to 1400 (95% CrI 700-1700) per day. There was substantial uncertainty in the epidemic trajectory, with particular sensitivity to the transmissibility of delta, level of mixing, and estimates of vaccine effectiveness. INTERPRETATION: Our findings show that the risk of a large wave of COVID-19 hospital admissions resulting from lifting NPIs can be substantially mitigated if the timing of NPI relaxation is carefully balanced against vaccination coverage. However, with the delta variant, it might not be possible to fully lift NPIs without a third wave of hospital admissions and deaths, even if vaccination coverage is high. Variants of concern, their transmissibility, vaccine uptake, and vaccine effectiveness must be carefully monitored as countries relax pandemic control measures. FUNDING: National Institute for Health Research, UK Medical Research Council, Wellcome Trust, and UK Foreign, Commonwealth and Development Office.
Assuntos
Vacinas contra COVID-19/administração & dosagem , COVID-19/prevenção & controle , COVID-19/transmissão , Controle de Doenças Transmissíveis/organização & administração , SARS-CoV-2 , Cobertura Vacinal/organização & administração , COVID-19/epidemiologia , COVID-19/mortalidade , Inglaterra/epidemiologia , Mortalidade Hospitalar/tendências , Hospitalização/estatística & dados numéricos , Humanos , Modelos Teóricos , Admissão do Paciente/estatística & dados numéricosRESUMO
Mathematical models are powerful tools in HIV epidemiology, producing quantitative projections of key indicators such as HIV incidence and prevalence. In order to improve the accuracy of predictions, such models need to incorporate a number of behavioural and biological heterogeneities, especially those related to the sexual network within which HIV transmission occurs. An individual-based model, which explicitly models sexual partnerships, is thus often the most natural type of model to choose. In this paper we present PopART-IBM, a computationally efficient individual-based model capable of simulating 50 years of an HIV epidemic in a large, high-prevalence community in under a minute. We show how the model calibrates within a Bayesian inference framework to detailed age- and sex-stratified data from multiple sources on HIV prevalence, awareness of HIV status, ART status, and viral suppression for an HPTN 071 (PopART) study community in Zambia, and present future projections of HIV prevalence and incidence for this community in the absence of trial intervention.
Assuntos
Simulação por Computador , Infecções por HIV/epidemiologia , Modelos Estatísticos , Processos Estocásticos , Adolescente , Adulto , Idoso , Algoritmos , Terapia Antirretroviral de Alta Atividade , Progressão da Doença , Feminino , Infecções por HIV/tratamento farmacológico , Infecções por HIV/transmissão , Humanos , Incidência , Masculino , Pessoa de Meia-Idade , Prevalência , Reprodutibilidade dos Testes , Adulto Jovem , Zâmbia/epidemiologiaRESUMO
BACKGROUND: As the severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2) pandemic continues its rapid global spread, quantification of local transmission patterns has been, and will continue to be, critical for guiding the pandemic response. Understanding the accuracy and limitations of statistical methods to estimate the basic reproduction number, R0, in the context of emerging epidemics is therefore vital to ensure appropriate interpretation of results and the subsequent implications for control efforts. METHODS: Using simulated epidemic data, we assess the performance of 7 commonly used statistical methods to estimate R0 as they would be applied in a real-time outbreak analysis scenario: fitting to an increasing number of data points over time and with varying levels of random noise in the data. Method comparison was also conducted on empirical outbreak data, using Zika surveillance data from the 2015-2016 epidemic in Latin America and the Caribbean. RESULTS: We find that most methods considered here frequently overestimate R0 in the early stages of epidemic growth on simulated data, the magnitude of which decreases when fitted to an increasing number of time points. This trend of decreasing bias over time can easily lead to incorrect conclusions about the course of the epidemic or the need for control efforts. CONCLUSIONS: We show that true changes in pathogen transmissibility can be difficult to disentangle from changes in methodological accuracy and precision in the early stages of epidemic growth, particularly for data with significant over-dispersion. As localized epidemics of SARS-CoV-2 take hold around the globe, awareness of this trend will be important for appropriately cautious interpretation of results and subsequent guidance for control efforts.
Assuntos
COVID-19 , Epidemias , Infecção por Zika virus , Zika virus , Número Básico de Reprodução , Região do Caribe , Humanos , Pandemias , Reprodução , SARS-CoV-2RESUMO
The end-of-outbreak declaration is an important step in controlling infectious disease outbreaks. Objective estimation of the confidence level that an outbreak is over is important to reduce the risk of postdeclaration flare-ups. We developed a simulation-based model with which to quantify that confidence and tested it on simulated Ebola virus disease data. We found that these confidence estimates were most sensitive to the instantaneous reproduction number, the reporting rate, and the time between the symptom onset and death or recovery of the last detected case. For Ebola virus disease, our results suggested that the current World Health Organization criterion of 42 days since the recovery or death of the last detected case is too short and too sensitive to underreporting. Therefore, we suggest a shift to a preliminary end-of-outbreak declaration after 63 days from the symptom onset day of the last detected case. This preliminary declaration should still be followed by 90 days of enhanced surveillance to capture potential flare-ups of cases, after which the official end of the outbreak can be declared. This sequence corresponds to more than 95% confidence that an outbreak is over in most of the scenarios examined. Our framework is generic and therefore could be adapted to estimate end-of-outbreak confidence for other infectious diseases.
Assuntos
Surtos de Doenças/estatística & dados numéricos , Doença pelo Vírus Ebola/epidemiologia , Controle de Infecções/estatística & dados numéricos , Saúde Global , HumanosRESUMO
Ebola emerged in West Africa around December 2013 and swept through Guinea, Sierra Leone and Liberia, giving rise to 27,748 confirmed, probable and suspected cases reported by 29 July 2015. Case diagnoses during the epidemic have relied on polymerase chain reaction-based tests. Owing to limited laboratory capacity and local transport infrastructure, the delays from sample collection to test results being available have often been 2 days or more. Point-of-care rapid diagnostic tests offer the potential to substantially reduce these delays. We review Ebola rapid diagnostic tests approved by the World Health Organization and those currently in development. Such rapid diagnostic tests could allow early triaging of patients, thereby reducing the potential for nosocomial transmission. In addition, despite the lower test accuracy, rapid diagnostic test-based diagnosis may be beneficial in some contexts because of the reduced time spent by uninfected individuals in health-care settings where they may be at increased risk of infection; this also frees up hospital beds. We use mathematical modelling to explore the potential benefits of diagnostic testing strategies involving rapid diagnostic tests alone and in combination with polymerase chain reaction testing. Our analysis indicates that the use of rapid diagnostic tests with sensitivity and specificity comparable with those currently under development always enhances control, whether evaluated at a health-care-unit or population level. If such tests had been available throughout the recent epidemic, we estimate, for Sierra Leone, that their use in combination with confirmatory polymerase chain-reaction testing might have reduced the scale of the epidemic by over a third.
Assuntos
Testes Diagnósticos de Rotina , Doença pelo Vírus Ebola , África Ocidental/epidemiologia , Doença pelo Vírus Ebola/diagnóstico , Doença pelo Vírus Ebola/epidemiologia , Doença pelo Vírus Ebola/prevenção & controle , Doença pelo Vírus Ebola/transmissão , Humanos , Fatores de Tempo , TriagemRESUMO
Recent years have seen the development of numerous methodologies for reconstructing transmission trees in infectious disease outbreaks from densely sampled whole genome sequence data. However, a fundamental and as of yet poorly addressed limitation of such approaches is the requirement for genetic diversity to arise on epidemiological timescales. Specifically, the position of infected individuals in a transmission tree can only be resolved by genetic data if mutations have accumulated between the sampled pathogen genomes. To quantify and compare the useful genetic diversity expected from genetic data in different pathogen outbreaks, we introduce here the concept of 'transmission divergence', defined as the number of mutations separating whole genome sequences sampled from transmission pairs. Using parameter values obtained by literature review, we simulate outbreak scenarios alongside sequence evolution using two models described in the literature to describe transmission divergence of ten major outbreak-causing pathogens. We find that while mean values vary significantly between the pathogens considered, their transmission divergence is generally very low, with many outbreaks characterised by large numbers of genetically identical transmission pairs. We describe the impact of transmission divergence on our ability to reconstruct outbreaks using two outbreak reconstruction tools, the R packages outbreaker and phybreak, and demonstrate that, in agreement with previous observations, genetic sequence data of rapidly evolving pathogens such as RNA viruses can provide valuable information on individual transmission events. Conversely, sequence data of pathogens with lower mean transmission divergence, including Streptococcus pneumoniae, Shigella sonnei and Clostridium difficile, provide little to no information about individual transmission events. Our results highlight the informational limitations of genetic sequence data in certain outbreak scenarios, and demonstrate the need to expand the toolkit of outbreak reconstruction tools to integrate other types of epidemiological data.
Assuntos
Bactérias/genética , Mapeamento Cromossômico , Doenças Transmissíveis/genética , Doenças Transmissíveis/transmissão , Transmissão de Doença Infecciosa , Vírus/genética , Bactérias/patogenicidade , Sequência de Bases , Doenças Transmissíveis/epidemiologia , Surtos de Doenças , Transmissão de Doença Infecciosa/estatística & dados numéricos , Predisposição Genética para Doença , Variação Genética , Genoma Bacteriano , Genoma Viral , Humanos , Filogenia , Análise de Sequência de DNA , Vírus/patogenicidade , Sequenciamento Completo do GenomaRESUMO
There exists significant interest in developing statistical and computational tools for inferring 'who infected whom' in an infectious disease outbreak from densely sampled case data, with most recent studies focusing on the analysis of whole genome sequence data. However, genomic data can be poorly informative of transmission events if mutations accumulate too slowly to resolve individual transmission pairs or if there exist multiple pathogens lineages within-host, and there has been little focus on incorporating other types of outbreak data. We present here a methodology that uses contact data for the inference of transmission trees in a statistically rigorous manner, alongside genomic data and temporal data. Contact data is frequently collected in outbreaks of pathogens spread by close contact, including Ebola virus (EBOV), severe acute respiratory syndrome coronavirus (SARS-CoV) and Mycobacterium tuberculosis (TB), and routinely used to reconstruct transmission chains. As an improvement over previous, ad-hoc approaches, we developed a probabilistic model that relates a set of contact data to an underlying transmission tree and integrated this in the outbreaker2 inference framework. By analyzing simulated outbreaks under various contact tracing scenarios, we demonstrate that contact data significantly improves our ability to reconstruct transmission trees, even under realistic limitations on the coverage of the contact tracing effort and the amount of non-infectious mixing between cases. Indeed, contact data is equally or more informative than fully sampled whole genome sequence data in certain scenarios. We then use our method to analyze the early stages of the 2003 SARS outbreak in Singapore and describe the range of transmission scenarios consistent with contact data and genetic sequence in a probabilistic manner for the first time. This simple yet flexible model can easily be incorporated into existing tools for outbreak reconstruction and should permit a better integration of genomic and epidemiological data for inferring transmission chains.
Assuntos
Teorema de Bayes , Doenças Transmissíveis/transmissão , Biologia Computacional/métodos , Busca de Comunicante , Surtos de Doenças/estatística & dados numéricos , Genoma Viral/genética , Algoritmos , Doenças Transmissíveis/virologia , Humanos , Modelos Biológicos , Coronavírus Relacionado à Síndrome Respiratória Aguda Grave/genética , Síndrome Respiratória Aguda Grave/transmissão , Síndrome Respiratória Aguda Grave/virologia , Singapura , SoftwareRESUMO
Early assessment of infectious disease outbreaks is key to implementing timely and effective control measures. In particular, rapidly recognising whether infected individuals stem from a single outbreak sustained by local transmission, or from repeated introductions, is crucial to adopt effective interventions. In this study, we introduce a new framework for combining several data streams, e.g. temporal, spatial and genetic data, to identify clusters of related cases of an infectious disease. Our method explicitly accounts for underreporting, and allows incorporating preexisting information about the disease, such as its serial interval, spatial kernel, and mutation rate. We define, for each data stream, a graph connecting all cases, with edges weighted by the corresponding pairwise distance between cases. Each graph is then pruned by removing distances greater than a given cutoff, defined based on preexisting information on the disease and assumptions on the reporting rate. The pruned graphs corresponding to different data streams are then merged by intersection to combine all data types; connected components define clusters of cases related for all types of data. Estimates of the reproduction number (the average number of secondary cases infected by an infectious individual in a large population), and the rate of importation of the disease into the population, are also derived. We test our approach on simulated data and illustrate it using data on dog rabies in Central African Republic. We show that the outbreak clusters identified using our method are consistent with structures previously identified by more complex, computationally intensive approaches.
Assuntos
Doenças Transmissíveis/epidemiologia , Raiva/epidemiologia , Animais , Análise por Conglomerados , Surtos de Doenças/classificação , Surtos de Doenças/veterinária , Cães , TempoRESUMO
With more than 1,700 laboratory-confirmed infections, Middle East respiratory syndrome coronavirus (MERS-CoV) remains a significant threat for public health. However, the lack of detailed data on modes of transmission from the animal reservoir and between humans means that the drivers of MERS-CoV epidemics remain poorly characterized. Here, we develop a statistical framework to provide a comprehensive analysis of the transmission patterns underlying the 681 MERS-CoV cases detected in the Kingdom of Saudi Arabia (KSA) between January 2013 and July 2014. We assess how infections from the animal reservoir, the different levels of mixing, and heterogeneities in transmission have contributed to the buildup of MERS-CoV epidemics in KSA. We estimate that 12% [95% credible interval (CI): 9%, 15%] of cases were infected from the reservoir, the rest via human-to-human transmission in clusters (60%; CI: 57%, 63%), within (23%; CI: 20%, 27%), or between (5%; CI: 2%, 8%) regions. The reproduction number at the start of a cluster was 0.45 (CI: 0.33, 0.58) on average, but with large SD (0.53; CI: 0.35, 0.78). It was >1 in 12% (CI: 6%, 18%) of clusters but fell by approximately one-half (47% CI: 34%, 63%) its original value after 10 cases on average. The ongoing exposure of humans to MERS-CoV from the reservoir is of major concern, given the continued risk of substantial outbreaks in health care systems. The approach we present allows the study of infectious disease transmission when data linking cases to each other remain limited and uncertain.
Assuntos
Infecções por Coronavirus/transmissão , Animais , Reservatórios de Doenças , Humanos , Zoonoses/transmissãoRESUMO
BACKGROUND: Reconstructing individual transmission events in an infectious disease outbreak can provide valuable information and help inform infection control policy. Recent years have seen considerable progress in the development of methodologies for reconstructing transmission chains using both epidemiological and genetic data. However, only a few of these methods have been implemented in software packages, and with little consideration for customisability and interoperability. Users are therefore limited to a small number of alternatives, incompatible tools with fixed functionality, or forced to develop their own algorithms at considerable personal effort. RESULTS: Here we present outbreaker2, a flexible framework for outbreak reconstruction. This R package re-implements and extends the original model introduced with outbreaker, but most importantly also provides a modular platform allowing users to specify custom models within an optimised inferential framework. As a proof of concept, we implement the within-host evolutionary model introduced with TransPhylo, which is very distinct from the original genetic model in outbreaker, and demonstrate how even complex model results can be successfully included with minimal effort. CONCLUSIONS: outbreaker2 provides a valuable starting point for future outbreak reconstruction tools, and represents a unifying platform that promotes customisability and interoperability. Implemented in the R software, outbreaker2 joins a growing body of tools for outbreak analysis.
Assuntos
Surtos de Doenças , Software , Algoritmos , Evolução Biológica , Ebolavirus/fisiologia , Doença pelo Vírus Ebola/epidemiologia , Doença pelo Vírus Ebola/virologia , Humanos , Cadeias de Markov , Modelos Teóricos , Método de Monte CarloRESUMO
Viral phylogenetic methods contribute to understanding how HIV spreads in populations, and thereby help guide the design of prevention interventions. So far, most analyses have been applied to well-sampled concentrated HIV-1 epidemics in wealthy countries. To direct the use of phylogenetic tools to where the impact of HIV-1 is greatest, the Phylogenetics And Networks for Generalized HIV Epidemics in Africa (PANGEA-HIV) consortium generates full-genome viral sequences from across sub-Saharan Africa. Analyzing these data presents new challenges, since epidemics are principally driven by heterosexual transmission and a smaller fraction of cases is sampled. Here, we show that viral phylogenetic tools can be adapted and used to estimate epidemiological quantities of central importance to HIV-1 prevention in sub-Saharan Africa. We used a community-wide methods comparison exercise on simulated data, where participants were blinded to the true dynamics they were inferring. Two distinct simulations captured generalized HIV-1 epidemics, before and after a large community-level intervention that reduced infection levels. Five research groups participated. Structured coalescent modeling approaches were most successful: phylogenetic estimates of HIV-1 incidence, incidence reductions, and the proportion of transmissions from individuals in their first 3 months of infection correlated with the true values (Pearson correlation > 90%), with small bias. However, on some simulations, true values were markedly outside reported confidence or credibility intervals. The blinded comparison revealed current limits and strengths in using HIV phylogenetics in challenging settings, provided benchmarks for future methods' development, and supports using the latest generation of phylogenetic tools to advance HIV surveillance and prevention.
Assuntos
Infecções por HIV/epidemiologia , Infecções por HIV/virologia , HIV-1/genética , África Subsaariana/epidemiologia , Simulação por Computador , Epidemias , Feminino , Infecções por HIV/prevenção & controle , Infecções por HIV/transmissão , Humanos , Incidência , Masculino , FilogeniaRESUMO
Methicillin-resistant Staphylococcus aureus (MRSA) is a major cause of nosocomial infection. Whole-genome sequencing of MRSA has been used to define phylogeny and transmission in well-resourced healthcare settings, yet the greatest burden of nosocomial infection occurs in resource-restricted settings where barriers to transmission are lower. Here, we study the flux and genetic diversity of MRSA on ward and individual patient levels in a hospital where transmission was common. We repeatedly screened all patients on two intensive care units for MRSA carriage over a 3-mo period. All MRSA belonged to multilocus sequence type 239 (ST 239). We defined the population structure and charted the spread of MRSA by sequencing 79 isolates from 46 patients and five members of staff, including the first MRSA-positive screen isolates and up to two repeat isolates where available. Phylogenetic analysis identified a flux of distinct ST 239 clades over time in each intensive care unit. In total, five main clades were identified, which varied in the carriage of plasmids encoding antiseptic and antimicrobial resistance determinants. Sequence data confirmed intra- and interwards transmission events and identified individual patients who were colonized by more than one clade. One patient on each unit was the source of numerous transmission events, and deep sampling of one of these cases demonstrated colonization with a "cloud" of related MRSA variants. The application of whole-genome sequencing and analysis provides novel insights into the transmission of MRSA in under-resourced healthcare settings and has relevance to wider global health.