Your browser doesn't support javascript.
loading
Mostrar: 20 | 50 | 100
Resultados 1 - 20 de 342
Filtrar
1.
Cell ; 187(19): 5468-5482.e11, 2024 Sep 19.
Artigo em Inglês | MEDLINE | ID: mdl-39303692

RESUMO

Zoonotic spillovers of viruses have occurred through the animal trade worldwide. The start of the COVID-19 pandemic was traced epidemiologically to the Huanan Seafood Wholesale Market. Here, we analyze environmental qPCR and sequencing data collected in the Huanan market in early 2020. We demonstrate that market-linked severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2) genetic diversity is consistent with market emergence and find increased SARS-CoV-2 positivity near and within a wildlife stall. We identify wildlife DNA in all SARS-CoV-2-positive samples from this stall, including species such as civets, bamboo rats, and raccoon dogs, previously identified as possible intermediate hosts. We also detect animal viruses that infect raccoon dogs, civets, and bamboo rats. Combining metagenomic and phylogenetic approaches, we recover genotypes of market animals and compare them with those from farms and other markets. This analysis provides the genetic basis for a shortlist of potential intermediate hosts of SARS-CoV-2 to prioritize for serological and viral sampling.


Assuntos
Animais Selvagens , COVID-19 , Filogenia , SARS-CoV-2 , Animais , COVID-19/epidemiologia , COVID-19/virologia , SARS-CoV-2/genética , SARS-CoV-2/isolamento & purificação , Animais Selvagens/virologia , Humanos , Pandemias
2.
Cell ; 185(7): 1117-1129.e8, 2022 03 31.
Artigo em Inglês | MEDLINE | ID: mdl-35298912

RESUMO

Game animals are wildlife species traded and consumed as food and are potential reservoirs for SARS-CoV and SARS-CoV-2. We performed a meta-transcriptomic analysis of 1,941 game animals, representing 18 species and five mammalian orders, sampled across China. From this, we identified 102 mammalian-infecting viruses, with 65 described for the first time. Twenty-one viruses were considered as potentially high risk to humans and domestic animals. Civets (Paguma larvata) carried the highest number of potentially high-risk viruses. We inferred the transmission of bat-associated coronavirus from bats to civets, as well as cross-species jumps of coronaviruses from bats to hedgehogs, from birds to porcupines, and from dogs to raccoon dogs. Of note, we identified avian Influenza A virus H9N2 in civets and Asian badgers, with the latter displaying respiratory symptoms, as well as cases of likely human-to-wildlife virus transmission. These data highlight the importance of game animals as potential drivers of disease emergence.


Assuntos
Animais Selvagens/virologia , Doenças Transmissíveis Emergentes/virologia , Reservatórios de Doenças , Mamíferos/virologia , Viroma , Animais , China , Filogenia , Zoonoses
3.
Nature ; 634(8032): 228-233, 2024 Oct.
Artigo em Inglês | MEDLINE | ID: mdl-39232170

RESUMO

Animals such as raccoon dogs, mink and muskrats are farmed for fur and are sometimes used as food or medicinal products1,2, yet they are also potential reservoirs of emerging pathogens3. Here we performed single-sample metatranscriptomic sequencing of internal tissues from 461 individual fur animals that were found dead due to disease. We characterized 125 virus species, including 36 that were novel and 39 at potentially high risk of cross-species transmission, including zoonotic spillover. Notably, we identified seven species of coronaviruses, expanding their known host range, and documented the cross-species transmission of a novel canine respiratory coronavirus to raccoon dogs and of bat HKU5-like coronaviruses to mink, present at a high abundance in lung tissues. Three subtypes of influenza A virus-H1N2, H5N6 and H6N2-were detected in the lungs of guinea pig, mink and muskrat, respectively. Multiple known zoonotic viruses, such as Japanese encephalitis virus and mammalian orthoreovirus4,5, were detected in guinea pigs. Raccoon dogs and mink carried the highest number of potentially high-risk viruses, while viruses from the Coronaviridae, Paramyxoviridae and Sedoreoviridae families commonly infected multiple hosts. These data also reveal potential virus transmission between farmed animals and wild animals, and from humans to farmed animals, indicating that fur farming represents an important transmission hub for viral zoonoses.


Assuntos
Pelo Animal , Animais Domésticos , Animais Selvagens , Reservatórios de Doenças , Especificidade de Hospedeiro , Zoonoses Virais , Animais , Cães , Cobaias , Humanos , Animais Domésticos/virologia , Animais Selvagens/virologia , Arvicolinae/virologia , Quirópteros/virologia , Coronavirus/isolamento & purificação , Coronavirus/genética , Coronavirus/classificação , Reservatórios de Doenças/virologia , Reservatórios de Doenças/veterinária , Vírus da Encefalite Japonesa (Espécie)/genética , Vírus da Encefalite Japonesa (Espécie)/isolamento & purificação , Vírus da Influenza A/classificação , Vírus da Influenza A/genética , Vírus da Influenza A/isolamento & purificação , Pulmão/virologia , Vison/virologia , Orthoreovirus/genética , Orthoreovirus/isolamento & purificação , Filogenia , Cães Guaxinins/virologia , Zoonoses Virais/transmissão , Zoonoses Virais/virologia
4.
Nature ; 610(7930): 154-160, 2022 10.
Artigo em Inglês | MEDLINE | ID: mdl-35952712

RESUMO

The SARS-CoV-2 Delta (Pango lineage B.1.617.2) variant of concern spread globally, causing resurgences of COVID-19 worldwide1,2. The emergence of the Delta variant in the UK occurred on the background of a heterogeneous landscape of immunity and relaxation of non-pharmaceutical interventions. Here we analyse 52,992 SARS-CoV-2 genomes from England together with 93,649 genomes from the rest of the world to reconstruct the emergence of Delta and quantify its introduction to and regional dissemination across England in the context of changing travel and social restrictions. Using analysis of human movement, contact tracing and virus genomic data, we find that the geographic focus of the expansion of Delta shifted from India to a more global pattern in early May 2021. In England, Delta lineages were introduced more than 1,000 times and spread nationally as non-pharmaceutical interventions were relaxed. We find that hotel quarantine for travellers reduced onward transmission from importations; however, the transmission chains that later dominated the Delta wave in England were seeded before travel restrictions were introduced. Increasing inter-regional travel within England drove the nationwide dissemination of Delta, with some cities receiving more than 2,000 observable lineage introductions from elsewhere. Subsequently, increased levels of local population mixing-and not the number of importations-were associated with the faster relative spread of Delta. The invasion dynamics of Delta depended on spatial heterogeneity in contact patterns, and our findings will inform optimal spatial interventions to reduce the transmission of current and future variants of concern, such as Omicron (Pango lineage B.1.1.529).


Assuntos
COVID-19 , SARS-CoV-2 , COVID-19/epidemiologia , COVID-19/prevenção & controle , COVID-19/transmissão , COVID-19/virologia , Cidades/epidemiologia , Busca de Comunicante , Inglaterra/epidemiologia , Genoma Viral/genética , Humanos , Quarentena/legislação & jurisprudência , SARS-CoV-2/genética , SARS-CoV-2/crescimento & desenvolvimento , SARS-CoV-2/isolamento & purificação , Viagem/legislação & jurisprudência
5.
Nature ; 595(7869): 713-717, 2021 07.
Artigo em Inglês | MEDLINE | ID: mdl-34192736

RESUMO

After the first wave of SARS-CoV-2 infections in spring 2020, Europe experienced a resurgence of the virus starting in late summer 2020 that was deadlier and more difficult to contain1. Relaxed intervention measures and summer travel have been implicated as drivers of the second wave2. Here we build a phylogeographical model to evaluate how newly introduced lineages, as opposed to the rekindling of persistent lineages, contributed to the resurgence of COVID-19 in Europe. We inform this model using genomic, mobility and epidemiological data from 10 European countries and estimate that in many countries more than half of the lineages circulating in late summer resulted from new introductions since 15 June 2020. The success in onward transmission of newly introduced lineages was negatively associated with the local incidence of COVID-19 during this period. The pervasive spread of variants in summer 2020 highlights the threat of viral dissemination when restrictions are lifted, and this needs to be carefully considered in strategies to control the current spread of variants that are more transmissible and/or evade immunity. Our findings indicate that more effective and coordinated measures are required to contain the spread through cross-border travel even as vaccination is reducing disease burden.


Assuntos
COVID-19/transmissão , COVID-19/virologia , SARS-CoV-2/isolamento & purificação , COVID-19/epidemiologia , COVID-19/prevenção & controle , Europa (Continente)/epidemiologia , Genoma Viral/genética , Humanos , Incidência , Locomoção , Filogenia , Filogeografia , SARS-CoV-2/classificação , SARS-CoV-2/genética , SARS-CoV-2/patogenicidade , Fatores de Tempo , Viagem/estatística & dados numéricos
6.
Proc Natl Acad Sci U S A ; 121(3): e2318989121, 2024 Jan 16.
Artigo em Inglês | MEDLINE | ID: mdl-38215186

RESUMO

The continuous-time Markov chain (CTMC) is the mathematical workhorse of evolutionary biology. Learning CTMC model parameters using modern, gradient-based methods requires the derivative of the matrix exponential evaluated at the CTMC's infinitesimal generator (rate) matrix. Motivated by the derivative's extreme computational complexity as a function of state space cardinality, recent work demonstrates the surprising effectiveness of a naive, first-order approximation for a host of problems in computational biology. In response to this empirical success, we obtain rigorous deterministic and probabilistic bounds for the error accrued by the naive approximation and establish a "blessing of dimensionality" result that is universal for a large class of rate matrices with random entries. Finally, we apply the first-order approximation within surrogate-trajectory Hamiltonian Monte Carlo for the analysis of the early spread of Severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2) across 44 geographic regions that comprise a state space of unprecedented dimensionality for unstructured (flexible) CTMC models within evolutionary biology.


Assuntos
COVID-19 , SARS-CoV-2 , Humanos , Algoritmos , COVID-19/epidemiologia , Cadeias de Markov
7.
Nat Methods ; 20(4): 512-522, 2023 04.
Artigo em Inglês | MEDLINE | ID: mdl-36823332

RESUMO

In response to the emergence of SARS-CoV-2 variants of concern, the global scientific community, through unprecedented effort, has sequenced and shared over 11 million genomes through GISAID, as of May 2022. This extraordinarily high sampling rate provides a unique opportunity to track the evolution of the virus in near real-time. Here, we present outbreak.info , a platform that currently tracks over 40 million combinations of Pango lineages and individual mutations, across over 7,000 locations, to provide insights for researchers, public health officials and the general public. We describe the interpretable visualizations available in our web application, the pipelines that enable the scalable ingestion of heterogeneous sources of SARS-CoV-2 variant data and the server infrastructure that enables widespread data dissemination via a high-performance API that can be accessed using an R package. We show how outbreak.info can be used for genomic surveillance and as a hypothesis-generation tool to understand the ongoing pandemic at varying geographic and temporal scales.


Assuntos
COVID-19 , SARS-CoV-2 , Humanos , Genômica , Surtos de Doenças , Mutação
8.
Bioinformatics ; 40(2)2024 02 01.
Artigo em Inglês | MEDLINE | ID: mdl-38243701

RESUMO

MOTIVATION: Advancements in high-throughput genomic sequencing are delivering genomic pathogen data at an unprecedented rate, positioning statistical phylogenetics as a critical tool to monitor infectious diseases globally. This rapid growth spurs the need for efficient inference techniques, such as Hamiltonian Monte Carlo (HMC) in a Bayesian framework, to estimate parameters of these phylogenetic models where the dimensions of the parameters increase with the number of sequences N. HMC requires repeated calculation of the gradient of the data log-likelihood with respect to (wrt) all branch-length-specific (BLS) parameters that traditionally takes O(N2) operations using the standard pruning algorithm. A recent study proposes an approach to calculate this gradient in O(N), enabling researchers to take advantage of gradient-based samplers such as HMC. The CPU implementation of this approach makes the calculation of the gradient computationally tractable for nucleotide-based models but falls short in performance for larger state-space size models, such as Markov-modulated and codon models. Here, we describe novel massively parallel algorithms to calculate the gradient of the log-likelihood wrt all BLS parameters that take advantage of graphics processing units (GPUs) and result in many fold higher speedups over previous CPU implementations. RESULTS: We benchmark these GPU algorithms on three computing systems using three evolutionary inference examples exploring complete genomes from 997 dengue viruses, 62 carnivore mitochondria and 49 yeasts, and observe a >128-fold speedup over the CPU implementation for codon-based models and >8-fold speedup for nucleotide-based models. As a practical demonstration, we also estimate the timing of the first introduction of West Nile virus into the continental Unites States under a codon model with a relaxed molecular clock from 104 full viral genomes, an inference task previously intractable. AVAILABILITY AND IMPLEMENTATION: We provide an implementation of our GPU algorithms in BEAGLE v4.0.0 (https://github.com/beagle-dev/beagle-lib), an open-source library for statistical phylogenetics that enables parallel calculations on multi-core CPUs and GPUs. We employ a BEAGLE-implementation using the Bayesian phylogenetics framework BEAST (https://github.com/beast-dev/beast-mcmc).


Assuntos
Algoritmos , Software , Filogenia , Teorema de Bayes , Códon , Nucleotídeos
9.
Syst Biol ; 2024 Oct 25.
Artigo em Inglês | MEDLINE | ID: mdl-39453463

RESUMO

Online phylogenetic inference methods add sequentially arriving sequences to an inferred phylogeny without the need to recompute the entire tree from scratch. Some online method implementations exist already, but there remains concern that additional sequences may change the topological relationship among the original set of taxa. We call such a change in tree topology a lack of stability for the inferred tree. In this paper, we analyze the stability of single taxon addition in a Maximum Likelihood framework across 1, 000 empirical datasets. We find that instability occurs in almost 90% of our examples, although observed topological differences do not always reach significance under the AU-test. Changes in tree topology after addition of a taxon rarely occur close to its attachment location, and are more frequently observed in more distant tree locations carrying low bootstrap support. To investigate whether instability is predictable, we hypothesize sources of instability and design summary statistics addressing these hypotheses. Using these summary statistics as input features for machine learning under random forests, we are able to predict instability and can identify the most influential features. In summary, it does not appear that a strict insertion-only online inference method will deliver globally optimal trees, although relaxing insertion strictness by allowing for a small number of final tree rearrangements or accepting slightly suboptimal solutions appears feasible.

10.
Syst Biol ; 73(3): 562-578, 2024 Sep 05.
Artigo em Inglês | MEDLINE | ID: mdl-38712512

RESUMO

Phylogenetic and discrete-trait evolutionary inference depend heavily on an appropriate characterization of the underlying character substitution process. In this paper, we present random-effects substitution models that extend common continuous-time Markov chain models into a richer class of processes capable of capturing a wider variety of substitution dynamics. As these random-effects substitution models often require many more parameters than their usual counterparts, inference can be both statistically and computationally challenging. Thus, we also propose an efficient approach to compute an approximation to the gradient of the data likelihood with respect to all unknown substitution model parameters. We demonstrate that this approximate gradient enables scaling of sampling-based inference, namely Bayesian inference via Hamiltonian Monte Carlo, under random-effects substitution models across large trees and state-spaces. Applied to a dataset of 583 SARS-CoV-2 sequences, an HKY model with random-effects shows strong signals of nonreversibility in the substitution process, and posterior predictive model checks clearly show that it is a more adequate model than a reversible model. When analyzing the pattern of phylogeographic spread of 1441 influenza A virus (H3N2) sequences between 14 regions, a random-effects phylogeographic substitution model infers that air travel volume adequately predicts almost all dispersal rates. A random-effects state-dependent substitution model reveals no evidence for an effect of arboreality on the swimming mode in the tree frog subfamily Hylinae. Simulations reveal that random-effects substitution models can accommodate both negligible and radical departures from the underlying base substitution model. We show that our gradient-based inference approach is over an order of magnitude more time efficient than conventional approaches.


Assuntos
Classificação , Filogenia , Classificação/métodos , SARS-CoV-2/genética , SARS-CoV-2/classificação , Vírus da Influenza A Subtipo H3N2/genética , Vírus da Influenza A Subtipo H3N2/classificação , Modelos Genéticos , Cadeias de Markov , Teorema de Bayes
11.
PLoS Comput Biol ; 20(3): e1011640, 2024 Mar.
Artigo em Inglês | MEDLINE | ID: mdl-38551979

RESUMO

Birth-death models play a key role in phylodynamic analysis for their interpretation in terms of key epidemiological parameters. In particular, models with piecewise-constant rates varying at different epochs in time, to which we refer as episodic birth-death-sampling (EBDS) models, are valuable for their reflection of changing transmission dynamics over time. A challenge, however, that persists with current time-varying model inference procedures is their lack of computational efficiency. This limitation hinders the full utilization of these models in large-scale phylodynamic analyses, especially when dealing with high-dimensional parameter vectors that exhibit strong correlations. We present here a linear-time algorithm to compute the gradient of the birth-death model sampling density with respect to all time-varying parameters, and we implement this algorithm within a gradient-based Hamiltonian Monte Carlo (HMC) sampler to alleviate the computational burden of conducting inference under a wide variety of structures of, as well as priors for, EBDS processes. We assess this approach using three different real world data examples, including the HIV epidemic in Odesa, Ukraine, seasonal influenza A/H3N2 virus dynamics in New York state, America, and Ebola outbreak in West Africa. HMC sampling exhibits a substantial efficiency boost, delivering a 10- to 200-fold increase in minimum effective sample size per unit-time, in comparison to a Metropolis-Hastings-based approach. Additionally, we show the robustness of our implementation in both allowing for flexible prior choices and in modeling the transmission dynamics of various pathogens by accurately capturing the changing trend of viral effective reproductive number.


Assuntos
Epidemias , Doença pelo Vírus Ebola , Influenza Humana , Humanos , Vírus da Influenza A Subtipo H3N2 , Algoritmos , Influenza Humana/epidemiologia , Doença pelo Vírus Ebola/epidemiologia , Método de Monte Carlo
12.
Mol Biol Evol ; 40(11)2023 Nov 03.
Artigo em Inglês | MEDLINE | ID: mdl-37950885

RESUMO

Molecular clock models undergird modern methods of divergence-time estimation. Local clock models propose that the rate of molecular evolution is constant within phylogenetic subtrees. Current local clock inference procedures exhibit one or more weaknesses, namely they achieve limited scalability to trees with large numbers of taxa, impose model misspecification, or require a priori knowledge of the existence and location of clocks. To overcome these challenges, we present an autocorrelated, Bayesian model of heritable clock rate evolution that leverages heavy-tailed priors with mean zero to shrink increments of change between branch-specific clocks. We further develop an efficient Hamiltonian Monte Carlo sampler that exploits closed form gradient computations to scale our model to large trees. Inference under our shrinkage clock exhibits a speed-up compared to the popular random local clock when estimating branch-specific clock rates on a variety of simulated datasets. This speed-up increases with the size of the problem. We further show our shrinkage clock recovers known local clocks within a rodent and mammalian phylogeny. Finally, in a problem that once appeared computationally impractical, we investigate the heritable clock structure of various surface glycoproteins of influenza A virus in the absence of prior knowledge about clock placement. We implement our shrinkage clock and make it publicly available in the BEAST software package.


Assuntos
Evolução Molecular , Mamíferos , Animais , Filogenia , Teorema de Bayes , Fatores de Tempo , Modelos Genéticos
13.
N Engl J Med ; 384(13): 1240-1247, 2021 04 01.
Artigo em Inglês | MEDLINE | ID: mdl-33789012

RESUMO

During the 2018-2020 Ebola virus disease (EVD) outbreak in North Kivu province in the Democratic Republic of Congo, EVD was diagnosed in a patient who had received the recombinant vesicular stomatitis virus-based vaccine expressing a ZEBOV glycoprotein (rVSV-ZEBOV) (Merck). His treatment included an Ebola virus (EBOV)-specific monoclonal antibody (mAb114), and he recovered within 14 days. However, 6 months later, he presented again with severe EVD-like illness and EBOV viremia, and he died. We initiated epidemiologic and genomic investigations that showed that the patient had had a relapse of acute EVD that led to a transmission chain resulting in 91 cases across six health zones over 4 months. (Funded by the Bill and Melinda Gates Foundation and others.).


Assuntos
Ebolavirus/genética , Doença pelo Vírus Ebola/transmissão , Adulto , Teorema de Bayes , República Democrática do Congo/epidemiologia , Vacinas contra Ebola/imunologia , Ebolavirus/isolamento & purificação , Evolução Fatal , Genoma Viral , Doença pelo Vírus Ebola/diagnóstico , Doença pelo Vírus Ebola/epidemiologia , Doença pelo Vírus Ebola/terapia , Humanos , Masculino , Mutação , Filogenia , RNA Viral/sangue , Recidiva
14.
J Virol ; 97(1): e0109122, 2023 01 31.
Artigo em Inglês | MEDLINE | ID: mdl-36475767

RESUMO

Getah virus (GETV) mainly causes disease in livestock and may pose an epidemic risk due to its expanding host range and the potential of long-distance dispersal through animal trade. Here, we used metagenomic next-generation sequencing (mNGS) to identify GETV as the pathogen responsible for reemerging swine disease in China and subsequently estimated key epidemiological parameters using phylodynamic and spatially-explicit phylogeographic approaches. The GETV isolates were able to replicate in a variety of cell lines, including human cells, and showed high pathogenicity in a mouse model, suggesting the potential for more mammal hosts. We obtained 16 complete genomes and 79 E2 gene sequences from viral strains collected in China from 2016 to 2021 through large-scale surveillance among livestock, pets, and mosquitoes. Our phylogenetic analysis revealed that three major GETV lineages are responsible for the current epidemic in livestock in China. We identified three potential positively selected sites and mutations of interest in E2, which may impact the transmissibility and pathogenicity of the virus. Phylodynamic inference of the GETV demographic dynamics identified an association between livestock meat consumption and the evolution of viral genetic diversity. Finally, phylogeographic reconstruction of GETV dispersal indicated that the sampled lineages have preferentially circulated within areas associated with relatively higher mean annual temperature and pig population density. Our results highlight the importance of continuous surveillance of GETV among livestock in southern Chinese regions associated with relatively high temperatures. IMPORTANCE Although livestock is known to be the primary reservoir of Getah virus (GETV) in Asian countries, where identification is largely based on serology, the evolutionary history and spatial epidemiology of GETV in these regions remain largely unknown. Through our sequencing efforts, we provided robust support for lineage delineation of GETV and identified three major lineages that are responsible for the current epidemic in livestock in China. We further analyzed genomic and epidemiological data to reconstruct the recent demographic and dispersal history of GETV in domestic animals in China and to explore the impact of environmental factors on its genetic diversity and its diffusion. Notably, except for livestock meat consumption, other pig-related factors such as the evolution of live pig transport and pork production do not show a significant association with the evolution of viral genetic diversity, pointing out that further studies should investigate the potential contribution of other host species to the GETV outbreak. Our analysis of GETV demonstrates the need for wider animal species surveillance and provides a baseline for future studies of the molecular epidemiology and early warning of emerging arboviruses in China.


Assuntos
Arbovírus , Genoma Viral , Filogenia , Animais , Humanos , Camundongos , Arbovírus/genética , China/epidemiologia , Genômica , Gado/virologia
15.
J Med Virol ; 96(7): e29773, 2024 Jul.
Artigo em Inglês | MEDLINE | ID: mdl-38940448

RESUMO

The dynamics of severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2) transmission are influenced by a variety of factors, including social restrictions and the emergence of distinct variants. In this study, we delve into the origins and dissemination of the Alpha, Delta, and Omicron-BA.1 variants of concern in Galicia, northwest Spain. For this, we leveraged genomic data collected by the EPICOVIGAL Consortium and from the GISAID database, along with mobility information from other Spanish regions and foreign countries. Our analysis indicates that initial introductions during the Alpha phase were predominantly from other Spanish regions and France. However, as the pandemic progressed, introductions from Portugal and the United States became increasingly significant. The number of detected introductions varied from 96 and 101 for Alpha and Delta to 39 for Omicron-BA.1. Most of these introductions left a low number of descendants (<10), suggesting a limited impact on the evolution of the pandemic in Galicia. Notably, Galicia's major coastal cities emerged as critical hubs for viral transmission, highlighting their role in sustaining and spreading the virus. This research emphasizes the critical role of regional connectivity in the spread of SARS-CoV-2 and offers essential insights for enhancing public health strategies and surveillance measures.


Assuntos
COVID-19 , SARS-CoV-2 , Espanha/epidemiologia , COVID-19/epidemiologia , COVID-19/transmissão , COVID-19/virologia , Humanos , SARS-CoV-2/genética , Genoma Viral , Filogenia , Pandemias
16.
Syst Biol ; 72(5): 1136-1153, 2023 11 01.
Artigo em Inglês | MEDLINE | ID: mdl-37458991

RESUMO

Divergence time estimation is crucial to provide temporal signals for dating biologically important events from species divergence to viral transmissions in space and time. With the advent of high-throughput sequencing, recent Bayesian phylogenetic studies have analyzed hundreds to thousands of sequences. Such large-scale analyses challenge divergence time reconstruction by requiring inference on highly correlated internal node heights that often become computationally infeasible. To overcome this limitation, we explore a ratio transformation that maps the original $N-1$ internal node heights into a space of one height parameter and $N-2$ ratio parameters. To make the analyses scalable, we develop a collection of linear-time algorithms to compute the gradient and Jacobian-associated terms of the log-likelihood with respect to these ratios. We then apply Hamiltonian Monte Carlo sampling with the ratio transform in a Bayesian framework to learn the divergence times in 4 pathogenic viruses (West Nile virus, rabies virus, Lassa virus, and Ebola virus) and the coralline red algae. Our method both resolves a mixing issue in the West Nile virus example and improves inference efficiency by at least 5-fold for the Lassa and rabies virus examples as well as for the algae example. Our method now also makes it computationally feasible to incorporate mixed-effects molecular clock models for the Ebola virus example, confirms the findings from the original study, and reveals clearer multimodal distributions of the divergence times of some clades of interest.


Assuntos
Algoritmos , Filogenia , Teorema de Bayes , Fatores de Tempo , Método de Monte Carlo
17.
PLoS Comput Biol ; 19(8): e1011419, 2023 08.
Artigo em Inglês | MEDLINE | ID: mdl-37639445

RESUMO

Inferring dependencies between mixed-type biological traits while accounting for evolutionary relationships between specimens is of great scientific interest yet remains infeasible when trait and specimen counts grow large. The state-of-the-art approach uses a phylogenetic multivariate probit model to accommodate binary and continuous traits via a latent variable framework, and utilizes an efficient bouncy particle sampler (BPS) to tackle the computational bottleneck-integrating many latent variables from a high-dimensional truncated normal distribution. This approach breaks down as the number of specimens grows and fails to reliably characterize conditional dependencies between traits. Here, we propose an inference pipeline for phylogenetic probit models that greatly outperforms BPS. The novelty lies in 1) a combination of the recent Zigzag Hamiltonian Monte Carlo (Zigzag-HMC) with linear-time gradient evaluations and 2) a joint sampling scheme for highly correlated latent variables and correlation matrix elements. In an application exploring HIV-1 evolution from 535 viruses, the inference requires joint sampling from an 11,235-dimensional truncated normal and a 24-dimensional covariance matrix. Our method yields a 5-fold speedup compared to BPS and makes it possible to learn partial correlations between candidate viral mutations and virulence. Computational speedup now enables us to tackle even larger problems: we study the evolution of influenza H1N1 glycosylations on around 900 viruses. For broader applicability, we extend the phylogenetic probit model to incorporate categorical traits, and demonstrate its use to study Aquilegia flower and pollinator co-evolution.


Assuntos
Vírus da Influenza A Subtipo H1N1 , Teorema de Bayes , Vírus da Influenza A Subtipo H1N1/genética , Filogenia , Flores , Glicosilação
18.
Stat Med ; 43(2): 395-418, 2024 01 30.
Artigo em Inglês | MEDLINE | ID: mdl-38010062

RESUMO

Postmarket safety surveillance is an integral part of mass vaccination programs. Typically relying on sequential analysis of real-world health data as they accrue, safety surveillance is challenged by sequential multiple testing and by biases induced by residual confounding in observational data. The current standard approach based on the maximized sequential probability ratio test (MaxSPRT) fails to satisfactorily address these practical challenges and it remains a rigid framework that requires prespecification of the surveillance schedule. We develop an alternative Bayesian surveillance procedure that addresses both aforementioned challenges using a more flexible framework. To mitigate bias, we jointly analyze a large set of negative control outcomes that are adverse events with no known association with the vaccines in order to inform an empirical bias distribution, which we then incorporate into estimating the effect of vaccine exposure on the adverse event of interest through a Bayesian hierarchical model. To address multiple testing and improve on flexibility, at each analysis timepoint, we update a posterior probability in favor of the alternative hypothesis that vaccination induces higher risks of adverse events, and then use it for sequential detection of safety signals. Through an empirical evaluation using six US observational healthcare databases covering more than 360 million patients, we benchmark the proposed procedure against MaxSPRT on testing errors and estimation accuracy, under two epidemiological designs, the historical comparator and the self-controlled case series. We demonstrate that our procedure substantially reduces Type 1 error rates, maintains high statistical power and fast signal detection, and provides considerably more accurate estimation than MaxSPRT. Given the extensiveness of the empirical study which yields more than 7 million sets of results, we present all results in a public R ShinyApp. As an effort to promote open science, we provide full implementation of our method in the open-source R package EvidenceSynthesis.


Assuntos
Sistemas de Notificação de Reações Adversas a Medicamentos , Vigilância de Produtos Comercializados , Vacinas , Humanos , Teorema de Bayes , Viés , Probabilidade , Vacinas/efeitos adversos
19.
BMC Infect Dis ; 24(1): 1139, 2024 Oct 10.
Artigo em Inglês | MEDLINE | ID: mdl-39390446

RESUMO

We investigate the emergence, mutation profile, and dissemination of SARS-CoV-2 lineage B.1.214.2, first identified in Belgium in January 2021. This variant, featuring a 3-amino acid insertion in the spike protein similar to the Omicron variant, was speculated to enhance transmissibility or immune evasion. Initially detected in international travelers, it substantially transmitted in Central Africa, Belgium, Switzerland, and France, peaking in April 2021. Our travel-aware phylogeographic analysis, incorporating travel history, estimated the origin to the Republic of the Congo, with primary European entry through France and Belgium, and multiple smaller introductions during the epidemic. We correlate its spread with human travel patterns and air passenger data. Further, upon reviewing national reports of SARS-CoV-2 outbreaks in Belgian nursing homes, we found this strain caused moderately severe outcomes (8.7% case fatality ratio). A distinct nasopharyngeal immune response was observed in elderly patients, characterized by 80% unique signatures, higher B- and T-cell activation, increased type I IFN signaling, and reduced NK, Th17, and complement system activation, compared to similar outbreaks. This unique immune response may explain the variant's epidemiological behavior and underscores the need for nasal vaccine strategies against emerging variants.


Assuntos
COVID-19 , SARS-CoV-2 , Glicoproteína da Espícula de Coronavírus , Humanos , SARS-CoV-2/genética , SARS-CoV-2/imunologia , COVID-19/imunologia , COVID-19/virologia , COVID-19/epidemiologia , Glicoproteína da Espícula de Coronavírus/genética , Glicoproteína da Espícula de Coronavírus/imunologia , Idoso , Masculino , Viagem , Bélgica/epidemiologia , Pessoa de Meia-Idade , Feminino , Adulto , Filogeografia , Nasofaringe/virologia
20.
Mol Biol Evol ; 39(2)2022 02 03.
Artigo em Inglês | MEDLINE | ID: mdl-34951645

RESUMO

The ongoing SARS (severe acute respiratory syndrome)-CoV (coronavirus)-2 pandemic has exposed major gaps in our knowledge on the origin, ecology, evolution, and spread of animal coronaviruses. Porcine epidemic diarrhea virus (PEDV) is a member of the genus Alphacoronavirus in the family Coronaviridae that may have originated from bats and leads to significant hazards and widespread epidemics in the swine population. The role of local and global trade of live swine and swine-related products in disseminating PEDV remains unclear, especially in developing countries with complex swine production systems. Here, we undertake an in-depth phylogeographic analysis of PEDV sequence data (including 247 newly sequenced samples) and employ an extension of this inference framework that enables formally testing the contribution of a range of predictor variables to the geographic spread of PEDV. Within China, the provinces of Guangdong and Henan were identified as primary hubs for the spread of PEDV, for which we estimate live swine trade to play a very important role. On a global scale, the United States and China maintain the highest number of PEDV lineages. We estimate that, after an initial introduction out of China, the United States acted as an important source of PEDV introductions into Japan, Korea, China, and Mexico. Live swine trade also explains the dispersal of PEDV on a global scale. Given the increasingly global trade of live swine, our findings have important implications for designing prevention and containment measures to combat a wide range of livestock coronaviruses.


Assuntos
Coronavirus , Vírus da Diarreia Epidêmica Suína , Doenças dos Suínos , Animais , China , Pandemias , Filogenia , Filogeografia , Vírus da Diarreia Epidêmica Suína/genética , Suínos , Doenças dos Suínos/epidemiologia , Estados Unidos
SELEÇÃO DE REFERÊNCIAS
DETALHE DA PESQUISA