RESUMO
Sequence variation observed in populations of pathogens can be used for important public health and evolutionary genomic analyses, especially outbreak analysis and transmission reconstruction. Identifying this variation is typically achieved by aligning sequence reads to a reference genome, but this approach is susceptible to reference biases and requires careful filtering of called genotypes. There is a need for tools that can process this growing volume of bacterial genome data, providing rapid results, but that remain simple so they can be used without highly trained bioinformaticians, expensive data analysis, and long-term storage and processing of large files. Here we describe split k-mer analysis (SKA2), a method that supports both reference-free and reference-based mapping to quickly and accurately genotype populations of bacteria using sequencing reads or genome assemblies. SKA2 is highly accurate for closely related samples, and in outbreak simulations, we show superior variant recall compared with reference-based methods, with no false positives. SKA2 can also accurately map variants to a reference and be used with recombination detection methods to rapidly reconstruct vertical evolutionary history. SKA2 is many times faster than comparable methods and can be used to add new genomes to an existing call set, allowing sequential use without the need to reanalyze entire collections. With an inherent absence of reference bias, high accuracy, and a robust implementation, SKA2 has the potential to become the tool of choice for genotyping bacteria. SKA2 is implemented in Rust and is freely available as open-source software.
Assuntos
Surtos de Doenças , Genoma Bacteriano , Genômica/métodos , Humanos , Software , Genótipo , Bactérias/genética , Bactérias/classificaçãoRESUMO
The emergence of successive Severe Acute Respiratory Syndrome Coronavirus 2 (SARS-CoV-2) variants of concern (VOCs) during 2020 to 2022, each exhibiting increased epidemic growth relative to earlier circulating variants, has created a need to understand the drivers of such growth. However, both pathogen biology and changing host characteristics-such as varying levels of immunity-can combine to influence replication and transmission of SARS-CoV-2 within and between hosts. Disentangling the role of variant and host in individual-level viral shedding of VOCs is essential to inform Coronavirus Disease 2019 (COVID-19) planning and response and interpret past epidemic trends. Using data from a prospective observational cohort study of healthy adult volunteers undergoing weekly occupational health PCR screening, we developed a Bayesian hierarchical model to reconstruct individual-level viral kinetics and estimate how different factors shaped viral dynamics, measured by PCR cycle threshold (Ct) values over time. Jointly accounting for both interindividual variation in Ct values and complex host characteristics-such as vaccination status, exposure history, and age-we found that age and number of prior exposures had a strong influence on peak viral replication. Older individuals and those who had at least 5 prior antigen exposures to vaccination and/or infection typically had much lower levels of shedding. Moreover, we found evidence of a correlation between the speed of early shedding and duration of incubation period when comparing different VOCs and age groups. Our findings illustrate the value of linking information on participant characteristics, symptom profile and infecting variant with prospective PCR sampling, and the importance of accounting for increasingly complex population exposure landscapes when analysing the viral kinetics of VOCs. Trial Registration: The Legacy study is a prospective observational cohort study of healthy adult volunteers undergoing weekly occupational health PCR screening for SARS-CoV-2 at University College London Hospitals or at the Francis Crick Institute (NCT04750356) (22,23). The Legacy study was approved by London Camden and Kings Cross Health Research Authority Research and Ethics committee (IRAS number 286469). The Legacy study was approved by London Camden and Kings Cross Health Research Authority Research and Ethics committee (IRAS number 286469) and is sponsored by University College London Hospitals. Written consent was given by all participants.
Assuntos
COVID-19 , SARS-CoV-2 , Adulto , Humanos , SARS-CoV-2/genética , Teorema de Bayes , COVID-19/epidemiologia , Estudos ProspectivosRESUMO
The evolution of the severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2) virus leads to new variants that warrant timely epidemiological characterization. Here we use the dense genomic surveillance data generated by the COVID-19 Genomics UK Consortium to reconstruct the dynamics of 71 different lineages in each of 315 English local authorities between September 2020 and June 2021. This analysis reveals a series of subepidemics that peaked in early autumn 2020, followed by a jump in transmissibility of the B.1.1.7/Alpha lineage. The Alpha variant grew when other lineages declined during the second national lockdown and regionally tiered restrictions between November and December 2020. A third more stringent national lockdown suppressed the Alpha variant and eliminated nearly all other lineages in early 2021. Yet a series of variants (most of which contained the spike E484K mutation) defied these trends and persisted at moderately increasing proportions. However, by accounting for sustained introductions, we found that the transmissibility of these variants is unlikely to have exceeded the transmissibility of the Alpha variant. Finally, B.1.617.2/Delta was repeatedly introduced in England and grew rapidly in early summer 2021, constituting approximately 98% of sampled SARS-CoV-2 genomes on 26 June 2021.
Assuntos
COVID-19/epidemiologia , COVID-19/virologia , Genoma Viral/genética , Genômica , SARS-CoV-2/genética , Substituição de Aminoácidos , COVID-19/transmissão , Inglaterra/epidemiologia , Monitoramento Epidemiológico , Humanos , Epidemiologia Molecular , Mutação , Quarentena/estatística & dados numéricos , SARS-CoV-2/classificação , Análise Espaço-Temporal , Glicoproteína da Espícula de Coronavírus/genéticaRESUMO
MOTIVATION: Metagenome-Assembled Genomes (MAGs) or Single-cell Amplified Genomes (SAGs) are often incomplete, with sequences missing due to errors in assembly or low coverage. This presents a particular challenge for the identification of true gene frequencies within a microbial population, as core genes missing in only a few assemblies will be mischaracterized by current pangenome approaches. RESULTS: Here, we present CELEBRIMBOR, a Snakemake pangenome analysis pipeline which uses a measure of genome completeness to automatically adjust the frequency threshold at which core genes are identified, enabling accurate core gene identification in MAGs and SAGs. AVAILABILITY AND IMPLEMENTATION: CELEBRIMBOR is published under open source Apache 2.0 licence at https://github.com/bacpop/CELEBRIMBOR and is available as a Docker container from this repository. Supplementary material is available in the online version of the article.
Assuntos
Metagenoma , Software , Metagenômica/métodosRESUMO
During 2020-2022, players and staff in the English Premier League in the United Kingdom were tested regularly for SARS-CoV-2 with the aim of creating a biosecure bubble for each team. We found that prevalence and reinfection estimates were consistent with those from other studies and with community infection trends.
Assuntos
COVID-19 , SARS-CoV-2 , Humanos , COVID-19/epidemiologia , Reino Unido/epidemiologia , Prevalência , Teste para COVID-19/métodos , Reinfecção/epidemiologia , Reinfecção/virologia , Masculino , AdultoRESUMO
INTRODUCTION: Cluster randomised trials (CRTs) are the gold standard for measuring the community-wide impacts of malaria control tools. CRTs rely on well-defined sample size estimations to detect statistically significant effects of trialled interventions, however these are often predicted poorly by triallists. Here, we review the accuracy of predicted parameters used in sample size calculations for malaria CRTs with epidemiological outcomes. METHODS: We searched for published malaria CRTs using four online databases in March 2022. Eligible trials included those with malaria-specific epidemiological outcomes which randomised at least six geographical clusters to study arms. Predicted and observed sample size parameters were extracted by reviewers for each trial. Pair-wise Spearman's correlation coefficients (rs) were calculated to assess the correlation between predicted and observed control-arm outcome measures and effect sizes (relative percentage reductions) between arms. Among trials which retrospectively calculated an estimate of heterogeneity in cluster outcomes, we recalculated study power according to observed trial estimates. RESULTS: Of the 1889 records identified and screened, 108 articles were eligible and comprised of 71 malaria CRTs. Among 91.5% (65/71) of trials that included sample size calculations, most estimated cluster heterogeneity using the coefficient of variation (k) (80%, 52/65) which were often predicted without using prior data (67.7%, 44/65). Predicted control-arm prevalence moderately correlated with observed control-arm prevalence (rs: 0.44, [95%CI: 0.12,0.68], p-value < 0.05], with 61.2% (19/31) of prevalence estimates overestimated. Among the minority of trials that retrospectively calculated cluster heterogeneity (20%, 13/65), empirical values contrasted with those used in sample size estimations and often compromised study power. Observed effect sizes were often smaller than had been predicted at the sample size stage (72.9%, 51/70) and were typically higher in the first, compared to the second, year of trials. Overall, effect sizes achieved by malaria interventions tested in trials decreased between 1995 and 2021. CONCLUSIONS: Study findings reveal sample size parameters in malaria CRTs were often inaccurate and resulted in underpowered studies. Future trials must strive to obtain more representative epidemiological sample size inputs to ensure interventions against malaria are adequately evaluated. REGISTRATION: This review is registered with PROSPERO (CRD42022315741).
Assuntos
Malária , Ensaios Clínicos Controlados Aleatórios como Assunto , Humanos , Tamanho da Amostra , Malária/epidemiologia , Ensaios Clínicos Controlados Aleatórios como Assunto/métodos , Ensaios Clínicos Controlados Aleatórios como Assunto/estatística & dados numéricos , Análise por ConglomeradosRESUMO
Substantial progress has been made globally to control malaria, however there is a growing need for innovative new tools to ensure continued progress. One approach is to harness genetic sequencing and accompanying methodological approaches as have been used in the control of other infectious diseases. However, to utilize these methodologies for malaria, we first need to extend the methods to capture the complex interactions between parasites, human and vector hosts, and environment, which all impact the level of genetic diversity and relatedness of malaria parasites. We develop an individual-based transmission model to simulate malaria parasite genetics parameterized using estimated relationships between complexity of infection and age from five regions in Uganda and Kenya. We predict that cotransmission and superinfection contribute equally to within-host parasite genetic diversity at 11.5% PCR prevalence, above which superinfections dominate. Finally, we characterize the predictive power of six metrics of parasite genetics for detecting changes in transmission intensity, before grouping them in an ensemble statistical model. The model predicted malaria prevalence with a mean absolute error of 0.055. Different assumptions about the availability of sample metadata were considered, with the most accurate predictions of malaria prevalence made when the clinical status and age of sampled individuals is known. Parasite genetics may provide a novel surveillance tool for estimating the prevalence of malaria in areas in which prevalence surveys are not feasible. However, the findings presented here reinforce the need for patient metadata to be recorded and made available within all future attempts to use parasite genetics for surveillance.
Assuntos
Malária/transmissão , Modelos Estatísticos , Plasmodium/genética , Adolescente , Criança , Pré-Escolar , Variação Genética , Humanos , Quênia/epidemiologia , Malária/epidemiologia , Malária/parasitologia , Mosquitos Vetores/parasitologia , Prevalência , Superinfecção , Uganda/epidemiologiaRESUMO
BACKGROUND: Forecasting healthcare demand is essential in epidemic settings, both to inform situational awareness and facilitate resource planning. Ideally, forecasts should be robust across time and locations. During the COVID-19 pandemic in England, it is an ongoing concern that demand for hospital care for COVID-19 patients in England will exceed available resources. METHODS: We made weekly forecasts of daily COVID-19 hospital admissions for National Health Service (NHS) Trusts in England between August 2020 and April 2021 using three disease-agnostic forecasting models: a mean ensemble of autoregressive time series models, a linear regression model with 7-day-lagged local cases as a predictor, and a scaled convolution of local cases and a delay distribution. We compared their point and probabilistic accuracy to a mean-ensemble of them all and to a simple baseline model of no change from the last day of admissions. We measured predictive performance using the weighted interval score (WIS) and considered how this changed in different scenarios (the length of the predictive horizon, the date on which the forecast was made, and by location), as well as how much admissions forecasts improved when future cases were known. RESULTS: All models outperformed the baseline in the majority of scenarios. Forecasting accuracy varied by forecast date and location, depending on the trajectory of the outbreak, and all individual models had instances where they were the top- or bottom-ranked model. Forecasts produced by the mean-ensemble were both the most accurate and most consistently accurate forecasts amongst all the models considered. Forecasting accuracy was improved when using future observed, rather than forecast, cases, especially at longer forecast horizons. CONCLUSIONS: Assuming no change in current admissions is rarely better than including at least a trend. Using confirmed COVID-19 cases as a predictor can improve admissions forecasts in some scenarios, but this is variable and depends on the ability to make consistently good case forecasts. However, ensemble forecasts can make forecasts that make consistently more accurate forecasts across time and locations. Given minimal requirements on data and computation, our admissions forecasting ensemble could be used to anticipate healthcare needs in future epidemic or pandemic settings.
Assuntos
COVID-19 , Previsões , Hospitais , Humanos , Pandemias , SARS-CoV-2 , Medicina EstatalRESUMO
[This corrects the article DOI: 10.1371/journal.pcbi.1008409.].
RESUMO
BACKGROUND: Routine asymptomatic testing using RT-PCR of people who interact with vulnerable populations, such as medical staff in hospitals or care workers in care homes, has been employed to help prevent outbreaks among vulnerable populations. Although the peak sensitivity of RT-PCR can be high, the probability of detecting an infection will vary throughout the course of an infection. The effectiveness of routine asymptomatic testing will therefore depend on testing frequency and how PCR detection varies over time. METHODS: We fitted a Bayesian statistical model to a dataset of twice weekly PCR tests of UK healthcare workers performed by self-administered nasopharyngeal swab, regardless of symptoms. We jointly estimated times of infection and the probability of a positive PCR test over time following infection; we then compared asymptomatic testing strategies by calculating the probability that a symptomatic infection is detected before symptom onset and the probability that an asymptomatic infection is detected within 7 days of infection. RESULTS: We estimated that the probability that the PCR test detected infection peaked at 77% (54-88%) 4 days after infection, decreasing to 50% (38-65%) by 10 days after infection. Our results suggest a substantially higher probability of detecting infections 1-3 days after infection than previously published estimates. We estimated that testing every other day would detect 57% (33-76%) of symptomatic cases prior to onset and 94% (75-99%) of asymptomatic cases within 7 days if test results were returned within a day. CONCLUSIONS: Our results suggest that routine asymptomatic testing can enable detection of a high proportion of infected individuals early in their infection, provided that the testing is frequent and the time from testing to notification of results is sufficiently fast.
Assuntos
Teste de Ácido Nucleico para COVID-19/métodos , COVID-19/diagnóstico , Reação em Cadeia da Polimerase/métodos , Teorema de Bayes , COVID-19/patologia , Feminino , Humanos , MasculinoRESUMO
Estimation of the effective reproductive number Rt is important for detecting changes in disease transmission over time. During the Coronavirus Disease 2019 (COVID-19) pandemic, policy makers and public health officials are using Rt to assess the effectiveness of interventions and to inform policy. However, estimation of Rt from available data presents several challenges, with critical implications for the interpretation of the course of the pandemic. The purpose of this document is to summarize these challenges, illustrate them with examples from synthetic data, and, where possible, make recommendations. For near real-time estimation of Rt, we recommend the approach of Cori and colleagues, which uses data from before time t and empirical estimates of the distribution of time between infections. Methods that require data from after time t, such as Wallinga and Teunis, are conceptually and methodologically less suited for near real-time estimation, but may be appropriate for retrospective analyses of how individuals infected at different time points contributed to the spread. We advise caution when using methods derived from the approach of Bettencourt and Ribeiro, as the resulting Rt estimates may be biased if the underlying structural assumptions are not met. Two key challenges common to all approaches are accurate specification of the generation interval and reconstruction of the time series of new infections from observations occurring long after the moment of transmission. Naive approaches for dealing with observation delays, such as subtracting delays sampled from a distribution, can introduce bias. We provide suggestions for how to mitigate this and other technical challenges and highlight open problems in Rt estimation.
Assuntos
Número Básico de Reprodução , COVID-19 , COVID-19/epidemiologia , COVID-19/transmissão , Biologia Computacional , Humanos , Modelos Estatísticos , SARS-CoV-2RESUMO
Widespread, repeated testing using rapid antigen tests to proactively detect asymptomatic SARS-CoV-2 infections has been a promising yet controversial topic during the COVID-19 pandemic. Concerns have been raised over whether currently authorized lateral flow tests are sufficiently sensitive and specific to detect enough infections to impact transmission whilst minimizing unnecessary isolation of false positives. These concerns have often been illustrated using simple, textbook calculations of positivity rates and positive predictive value assuming fixed values for sensitivity, specificity and prevalence. However, we argue that evaluating repeated testing strategies requires the consideration of three additional factors: new infections continue to arise depending on the incidence rate, isolating positive individuals reduces prevalence in the tested population, and each infected individual is tested multiple times during their infection course. We provide a simple mathematical model with an online interface to illustrate how these three factors impact test positivity rates and the number of isolating individuals over time. These results highlight the potential pitfalls of using inappropriate textbook-style calculations to evaluate statistics arising from repeated testing strategies during an epidemic.
Assuntos
Teste para COVID-19/estatística & dados numéricos , Adolescente , Criança , Inglaterra , Feminino , Humanos , Masculino , Modelos Estatísticos , Pandemias , Valor Preditivo dos Testes , SARS-CoV-2 , Instituições Acadêmicas , Sensibilidade e EspecificidadeRESUMO
BACKGROUND: Asymptomatic or subclinical SARS-CoV-2 infections are often unreported, which means that confirmed case counts may not accurately reflect underlying epidemic dynamics. Understanding the level of ascertainment (the ratio of confirmed symptomatic cases to the true number of symptomatic individuals) and undetected epidemic progression is crucial to informing COVID-19 response planning, including the introduction and relaxation of control measures. Estimating case ascertainment over time allows for accurate estimates of specific outcomes such as seroprevalence, which is essential for planning control measures. METHODS: Using reported data on COVID-19 cases and fatalities globally, we estimated the proportion of symptomatic cases (i.e. any person with any of fever ≥ 37.5 °C, cough, shortness of breath, sudden onset of anosmia, ageusia or dysgeusia illness) that were reported in 210 countries and territories, given those countries had experienced more than ten deaths. We used published estimates of the baseline case fatality ratio (CFR), which was adjusted for delays and under-ascertainment, then calculated the ratio of this baseline CFR to an estimated local delay-adjusted CFR to estimate the level of under-ascertainment in a particular location. We then fit a Bayesian Gaussian process model to estimate the temporal pattern of under-ascertainment. RESULTS: Based on reported cases and deaths, we estimated that, during March 2020, the median percentage of symptomatic cases detected across the 84 countries which experienced more than ten deaths ranged from 2.4% (Bangladesh) to 100% (Chile). Across the ten countries with the highest number of total confirmed cases as of 6 July 2020, we estimated that the peak number of symptomatic cases ranged from 1.4 times (Chile) to 18 times (France) larger than reported. Comparing our model with national and regional seroprevalence data where available, we find that our estimates are consistent with observed values. Finally, we estimated seroprevalence for each country. As of 7 June, our seroprevalence estimates range from 0% (many countries) to 13% (95% CrI 5.6-24%) (Belgium). CONCLUSIONS: We found substantial under-ascertainment of symptomatic cases, particularly at the peak of the first wave of the SARS-CoV-2 pandemic, in many countries. Reported case counts will therefore likely underestimate the rate of outbreak growth initially and underestimate the decline in the later stages of an epidemic. Although there was considerable under-reporting in many locations, our estimates were consistent with emerging serological data, suggesting that the proportion of each country's population infected with SARS-CoV-2 worldwide is generally low.
Assuntos
Infecções por Coronavirus/epidemiologia , Pandemias , Pneumonia Viral/epidemiologia , Teorema de Bayes , Betacoronavirus , COVID-19 , Humanos , SARS-CoV-2 , Estudos SoroepidemiológicosRESUMO
Adjusting for delay from confirmation to death, we estimated case and infection fatality ratios (CFR, IFR) for coronavirus disease (COVID-19) on the Diamond Princess ship as 2.6% (95% confidence interval (CI): 0.89-6.7) and 1.3% (95% CI: 0.38-3.6), respectively. Comparing deaths on board with expected deaths based on naive CFR estimates from China, we estimated CFR and IFR in China to be 1.2% (95% CI: 0.3-2.7) and 0.6% (95% CI: 0.2-1.3), respectively.
Assuntos
Técnicas de Laboratório Clínico/métodos , Infecções por Coronavirus/mortalidade , Coronavirus/isolamento & purificação , Surtos de Doenças/prevenção & controle , Pneumonia Viral/mortalidade , Navios , Viagem , Adolescente , Adulto , Idoso , Betacoronavirus , COVID-19 , Teste para COVID-19 , Criança , Pré-Escolar , Busca de Comunicante , Coronavirus/genética , Infecções por Coronavirus/diagnóstico , Infecções por Coronavirus/transmissão , Feminino , Humanos , Lactente , Recém-Nascido , Masculino , Pessoa de Meia-Idade , Mortalidade , Pandemias , Pneumonia Viral/diagnóstico , Pneumonia Viral/transmissão , Sistemas Automatizados de Assistência Junto ao Leito , Reação em Cadeia da Polimerase , Fatores de Risco , SARS-CoV-2 , Adulto JovemRESUMO
BACKGROUND: The number of clinical cases of malaria is often recorded in resource constrained or conflict settings as a proxy for disease burden. Interpreting case count data in areas of humanitarian need is challenging due to uncertainties in population size caused by security concerns, resource constraints and population movement. Malaria prevalence in women visiting ante-natal care (ANC) clinics has the potential to be an easier and more accurate metric for malaria surveillance that is unbiased by population size if malaria testing is routinely conducted irrespective of symptoms. METHODS: A suite of distributed lag non-linear models was fitted to clinical incidence time-series data in children under 5 years and ANC prevalence data from health centres run by Médecins Sans Frontières in the Democratic Republic of Congo, which implement routine intermittent screening and treatment alongside intermittent preventative treatment in pregnancy. These statistical models enable the temporal relationship between the two metrics to be disentangled. RESULTS: There was a strong relationship between the ANC prevalence and clinical incidence suggesting that both can be used to describe current malaria endemicity. There was no evidence that ANC prevalence could predict future clinical incidence, though a change in clinical incidence was shown to influence ANC prevalence up to 3 months into the future. CONCLUSIONS: The results indicate that ANC prevalence may be a suitable metric for retrospective evaluations of the impact of malaria interventions and is a useful method for evaluating long-term malaria trends in resource constrained settings.
Assuntos
Malária/epidemiologia , Cuidado Pré-Natal/estatística & dados numéricos , Pré-Escolar , República Democrática do Congo/epidemiologia , Humanos , Incidência , Lactente , Recém-Nascido , Modelos Estatísticos , Prevalência , Socorro em DesastresRESUMO
In this review, we assess the status of computational modelling of pathogens. We focus on three disparate but interlinked research areas that produce models with very different spatial and temporal scope. First, we examine antimicrobial resistance (AMR). Many mechanisms of AMR are not well understood. As a result, it is hard to measure the current incidence of AMR, predict the future incidence, and design strategies to preserve existing antibiotic effectiveness. Next, we look at how to choose the finite number of bacterial strains that can be included in a vaccine. To do this, we need to understand what happens to vaccine and non-vaccine strains after vaccination programmes. Finally, we look at within-host modelling of antibody dynamics. The SARS-CoV-2 pandemic produced huge amounts of antibody data, prompting improvements in this area of modelling. We finish by discussing the challenges that persist in understanding these complex biological systems.
Assuntos
COVID-19 , Simulação por Computador , SARS-CoV-2 , Humanos , COVID-19/prevenção & controle , COVID-19/epidemiologia , COVID-19/imunologia , COVID-19/virologia , SARS-CoV-2/imunologia , Antibacterianos/farmacologia , Pandemias/prevenção & controle , Resistência Microbiana a MedicamentosRESUMO
Background: The COVID-19 pandemic both relied and placed significant burdens on the experts involved from research and public health sectors. The sustained high pressure of a pandemic on responders, such as healthcare workers, can lead to lasting psychological impacts including acute stress disorder, post-traumatic stress disorder, burnout, and moral injury, which can impact individual wellbeing and productivity. Methods: As members of the infectious disease modelling community, we convened a reflective workshop to understand the professional and personal impacts of response work on our community and to propose recommendations for future epidemic responses. The attendees represented a range of career stages, institutions, and disciplines. This piece was collectively produced by those present at the session based on our collective experiences. Results: Key issues we identified at the workshop were lack of institutional support, insecure contracts, unequal credit and recognition, and mental health impacts. Our recommendations include rewarding impactful work, fostering academia-public health collaboration, decreasing dependence on key individuals by developing teams, increasing transparency in decision-making, and implementing sustainable work practices. Conclusions: Despite limitations in representation, this workshop provided valuable insights into the UK COVID-19 modelling experience and guidance for future public health crises. Recognising and addressing the issues highlighted is crucial, in our view, for ensuring the effectiveness of epidemic response work in the future.
RESUMO
BACKGROUND: The emergence of SARS-CoV-2 variants and COVID-19 vaccination have resulted in complex exposure histories. Rapid assessment of the effects of these exposures on neutralising antibodies against SARS-CoV-2 infection is crucial for informing vaccine strategy and epidemic management. We aimed to investigate heterogeneity in individual-level and population-level antibody kinetics to emerging variants by previous SARS-CoV-2 exposure history, to examine implications for real-time estimation, and to examine the effects of vaccine-campaign timing. METHODS: Our Bayesian hierarchical model of antibody kinetics estimated neutralising-antibody trajectories against a panel of SARS-CoV-2 variants quantified with a live virus microneutralisation assay and informed by individual-level COVID-19 vaccination and SARS-CoV-2 infection histories. Antibody titre trajectories were modelled with a piecewise linear function that depended on the key biological quantities of an initial titre value, time the peak titre is reached, set-point time, and corresponding rates of increase and decrease for gradients between two timing parameters. All process parameters were estimated at both the individual level and the population level. We analysed data from participants in the University College London Hospitals-Francis Crick Institute Legacy study cohort (NCT04750356) who underwent surveillance for SARS-CoV-2 either through asymptomatic mandatory occupational health screening once per week between April 1, 2020, and May 31, 2022, or symptom-based testing between April 1, 2020, and Feb 1, 2023. People included in the Legacy study were either Crick employees or health-care workers at three London hospitals, older than 18 years, and gave written informed consent. Legacy excluded people who were unable or unwilling to give informed consent and those not employed by a qualifying institution. We segmented data to include vaccination events occurring up to 150 days before the emergence of three variants of concern: delta, BA.2, and XBB 1.5. We split the data for each wave into two categories: real-time and retrospective. The real-time dataset contained neutralising-antibody titres collected up to the date of emergence in each wave; the retrospective dataset contained all samples until the next SARS-CoV-2 exposure of each individual, whether vaccination or infection. FINDINGS: We included data from 335 participants in the delta wave analysis, 223 (67%) of whom were female and 112 (33%) of whom were male (median age 40 years, IQR 22-58); data from 385 participants in the BA.2 wave analysis, 271 (70%) of whom were female and 114 (30%) of whom were male (41 years, 22-60); and data from 248 participants in the XBB 1.5 wave analysis, 191 (77%) of whom were female, 56 (23%) of whom were male, and one (<1%) of whom preferred not to say (40 years, 21-59). Overall, we included 968 exposures (vaccinations) across 1895 serum samples in the model. For the delta wave, we estimated peak titre values as 490·0 IC50 (95% credible interval 224·3-1515·9) for people with no previous infection and as 702·4 IC50 (300·8-2322·7) for people with a previous infection before omicron; the delta wave did not include people with a previous omicron infection. For the BA.2 wave, we estimated peak titre values as 858·1 IC50 (689·8-1363·2) for people with no previous infection, 1020·7 IC50 (725·9-1722·6) for people with a previous infection before omicron, and 1422·0 IC50 (679·2-3027·3) for people with a previous omicron infection. For the XBB 1.5 wave, we estimated peak titre values as 703·2 IC50 (415·0-3197·8) for people with no previous infection, 1215·9 IC50 (511·6-7338·7) for people with a previous infection before omicron, and 1556·3 IC50 (757·2-7907·9) for people with a previous omicron infection. INTERPRETATION: Our study shows the feasibility of real-time estimation of antibody kinetics before SARS-CoV-2 variant emergence. This estimation is valuable for understanding how specific combinations of SARS-CoV-2 exposures influence antibody kinetics and for examining how COVID-19 vaccination-campaign timing could affect population-level immunity to emerging variants. FUNDING: Wellcome Trust, National Institute for Health Research University College London Hospitals Biomedical Research Centre, UK Research and Innovation, UK Medical Research Council, Francis Crick Institute, and Genotype-to-Phenotype National Virology Consortium.
RESUMO
The emergence of successive SARS-CoV-2 variants of concern (VOC) during 2020-22, each exhibiting increased epidemic growth relative to earlier circulating variants, has created a need to understand the drivers of such growth. However, both pathogen biology and changing host characteristics - such as varying levels of immunity - can combine to influence replication and transmission of SARS-CoV-2 within and between hosts. Disentangling the role of variant and host in individual-level viral shedding of VOCs is essential to inform COVID-19 planning and response, and interpret past epidemic trends. Using data from a prospective observational cohort study of healthy adult volunteers undergoing weekly occupational health PCR screening, we developed a Bayesian hierarchical model to reconstruct individual-level viral kinetics and estimate how different factors shaped viral dynamics, measured by PCR cycle threshold (Ct) values over time. Jointly accounting for both inter-individual variation in Ct values and complex host characteristics - such as vaccination status, exposure history and age - we found that age and number of prior exposures had a strong influence on peak viral replication. Older individuals and those who had at least five prior antigen exposures to vaccination and/or infection typically had much lower levels of shedding. Moreover, we found evidence of a correlation between the speed of early shedding and duration of incubation period when comparing different VOCs and age groups. Our findings illustrate the value of linking information on participant characteristics, symptom profile and infecting variant with prospective PCR sampling, and the importance of accounting for increasingly complex population exposure landscapes when analysing the viral kinetics of VOCs.