RESUMO
Invasive rodents are a major cause of environmental damage and biodiversity loss, particularly on islands. Unlike insects, genetic biocontrol strategies including population-suppressing gene drives with biased inheritance have not been developed in mice. Here, we demonstrate a gene drive strategy (tCRISPR) that leverages super-Mendelian transmission of the t haplotype to spread inactivating mutations in a haplosufficient female fertility gene (Prl). Using spatially explicit individual-based in silico modeling, we show that tCRISPR can eradicate island populations under a range of realistic field-based parameter values. We also engineer transgenic tCRISPR mice that, crucially, exhibit biased transmission of the modified t haplotype and Prl mutations at levels our modeling predicts would be sufficient for eradication. This is an example of a feasible gene drive system for invasive alien rodent population control.
Assuntos
Biodiversidade , Tecnologia de Impulso Genético , Camundongos , Feminino , Animais , Roedores , Genética Populacional , Repetições Palindrômicas Curtas Agrupadas e Regularmente EspaçadasRESUMO
Invasive alien species continue to threaten global biodiversity. CRISPR-based gene drives, which can theoretically spread through populations despite imparting a fitness cost, could be used to suppress or eradicate pest populations. We develop an individual-based, spatially explicit, stochastic model to simulate the ability of CRISPR-based homing and X chromosome shredding drives to eradicate populations of invasive house mice (Mus muculus) from islands. Using the model, we explore the interactive effect of the efficiency of the drive constructs and the spatial ecology of the target population on the outcome of a gene-drive release. We also consider the impact of polyandrous mating and sperm competition, which could compromise the efficacy of some gene-drive strategies. Our results show that both drive strategies could be used to eradicate large populations of mice. Whereas parameters related to drive efficiency and demography strongly influence drive performance, we find that sperm competition following polyandrous mating is unlikely to impact the outcome of an eradication effort substantially. Assumptions regarding the spatial ecology of mice influenced the probability of and time required for eradication, with short-range dispersal capacities and limited mate-search areas producing 'chase' dynamics across the island characterized by cycles of local extinction and recolonization by mice. We also show that highly efficient drives are not always optimal, when dispersal and mate-search capabilities are low. Rapid local population suppression around the introduction sites can cause loss of the gene drive before it can spread to the entire island. We conclude that, although the design of efficient gene drives is undoubtedly critical, accurate data on the spatial ecology of target species are critical for predicting the result of a gene-drive release.
Assuntos
Tecnologia de Impulso Genético , Animais , Biodiversidade , Tecnologia de Impulso Genético/métodos , Espécies Introduzidas , Camundongos , Probabilidade , VertebradosRESUMO
The unrivaled growth in e-commerce of animals and plants presents an unprecedented opportunity to monitor wildlife trade to inform conservation, biosecurity, and law enforcement. Using the internet to quantify the scale of the wildlife trade (volume and frequency) is a relatively recent and rapidly developing approach that lacks an accessible framework for locating relevant websites and collecting data. We produced an accessible guide for internet-based wildlife trade surveillance. We detailed a repeatable method involving a systematic internet search, with search engines, to locate relevant websites and content. For data collection, we highlight web-scraping technology as an efficient way to collect data in an automated fashion at regularly timed intervals. Our guide is applicable to the multitude of trade-based contexts because researchers can tailor search keywords for specific taxa or derived products and locations of interest. We provide information for working with the diversity of websites used in wildlife trade. For example, to locate relevant content on social media (e.g., posts or groups), each social media platform should be examined individually via the site's internal search engine. A key advantage of using the internet to study wildlife trade is the relative ease of access to an increasing amount of trade-related data. However, not all wildlife trade occurs online and it may occur on unobservable sections of the internet.
Resumen Una Guía para Usar el Internet para Monitorear y Cuantificar el Mercado de Fauna El crecimiento incomparable del comercio en línea de animales y plantas representa una oportunidad sin precedentes para monitorear el mercado de fauna y así orientar a la conservación, la bioseguridad y la aplicación de la ley. El uso del internet para cuantificar la escala del mercado de fauna (volumen y frecuencia) es una estrategia relativamente reciente y de rápido desarrollo que carece de un marco de trabajo accesible para la localización de sitios web relevantes y para la recolección de datos. Realizamos una guía accesible para la vigilancia del mercado de fauna en internet. Detallamos un método repetible que involucra una búsqueda sistemática por internet, por medio de buscadores, para localizar sitios web y contenidos relevantes. Para la recolección de datos, resaltamos la tecnología de web scraping como una manera eficiente de obtener datos de manera automatizada a intervalos regulares de tiempo. Nuestra guía puede aplicarse a la multitud de contextos basados en el mercado porque los investigadores pueden adaptar las palabras de búsqueda a taxones específicos o productos derivados y a localidades de interés. Proporcionamos información para poder trabajar con la diversidad de sitios web que se usan para el mercado de fauna. Por ejemplo, para localizar contenido relevante en las redes sociales (p. ej.: publicaciones o grupos), cada plataforma social debería ser examinada individualmente por medio del buscador interno del sitio. Una ventaja importante de usar el internet para estudiar el mercado de fauna es el acceso relativamente sencillo a una creciente cantidad de datos relacionados con el mercado. Sin embargo, no todo el mercado de fauna ocurre en línea y puede que suceda en secciones inobservables del internet.
Assuntos
Animais Selvagens , Mídias Sociais , Animais , Comércio , Conservação dos Recursos Naturais , Coleta de Dados , Humanos , InternetRESUMO
Freya Shearer and co-authors discuss the use of decision analysis in planning for infectious disease pandemics.
Assuntos
Doenças Transmissíveis/epidemiologia , Técnicas de Apoio para a Decisão , Planejamento em Desastres/métodos , Pandemias/prevenção & controle , Antivirais/administração & dosagem , Planejamento em Desastres/tendências , Humanos , Influenza Humana/epidemiologia , Influenza Humana/prevenção & controleRESUMO
Combinations of intense non-pharmaceutical interventions (lockdowns) were introduced worldwide to reduce SARS-CoV-2 transmission. Many governments have begun to implement exit strategies that relax restrictions while attempting to control the risk of a surge in cases. Mathematical modelling has played a central role in guiding interventions, but the challenge of designing optimal exit strategies in the face of ongoing transmission is unprecedented. Here, we report discussions from the Isaac Newton Institute 'Models for an exit strategy' workshop (11-15 May 2020). A diverse community of modellers who are providing evidence to governments worldwide were asked to identify the main questions that, if answered, would allow for more accurate predictions of the effects of different exit strategies. Based on these questions, we propose a roadmap to facilitate the development of reliable models to guide exit strategies. This roadmap requires a global collaborative effort from the scientific community and policymakers, and has three parts: (i) improve estimation of key epidemiological parameters; (ii) understand sources of heterogeneity in populations; and (iii) focus on requirements for data collection, particularly in low-to-middle-income countries. This will provide important information for planning exit strategies that balance socio-economic benefits with public health.
Assuntos
Infecções por Coronavirus/epidemiologia , Infecções por Coronavirus/transmissão , Imunidade Coletiva , Modelos Teóricos , Pneumonia Viral/epidemiologia , Pneumonia Viral/transmissão , COVID-19 , Criança , Infecções por Coronavirus/imunologia , Infecções por Coronavirus/prevenção & controle , Erradicação de Doenças , Características da Família , Humanos , Pandemias/prevenção & controle , Pneumonia Viral/imunologia , Pneumonia Viral/prevenção & controle , Instituições Acadêmicas , Estudos SoroepidemiológicosRESUMO
In an outbreak of an emerging disease the epidemiological characteristics of the pathogen may be largely unknown. A key determinant of ability to control the outbreak is the relative timing of infectiousness and symptom onset. We provide a method for identifying this relationship with high accuracy based on data from simulated household-stratified symptom-onset data. Further, this can be achieved with observations taken on only a few specific days, chosen optimally, within each household. The information provided by this method may inform decision making processes for outbreak response. An accurate and computationally-efficient heuristic for determining the optimal surveillance scheme is introduced. This heuristic provides a novel approach to optimal design for Bayesian model discrimination.
Assuntos
Surtos de Doenças , Teorema de Bayes , Surtos de Doenças/prevenção & controleRESUMO
Understanding the epidemiology of seasonal influenza is critical for healthcare resource allocation and early detection of anomalous seasons. It can be challenging to obtain high-quality data of influenza cases specifically, as clinical presentations with influenza-like symptoms may instead be cases of one of a number of alternate respiratory viruses. We use a new dataset of confirmed influenza virological data from 2011-2016, along with high-quality denominators informing a hierarchical observation process, to model seasonal influenza dynamics in New South Wales, Australia. We use approximate Bayesian computation to estimate parameters in a climate-driven stochastic epidemic model, including the basic reproduction number R0, the proportion of the population susceptible to the circulating strain at the beginning of the season, and the probability an infected individual seeks treatment. We conclude that R0 and initial population susceptibility were strongly related, emphasising the challenges of identifying these parameters. Relatively high R0 values alongside low initial population susceptibility were among the results most consistent with these data. Our results reinforce the importance of distinguishing between R0 and the effective reproduction number (Re) in modelling studies.
Assuntos
Número Básico de Reprodução/estatística & dados numéricos , Influenza Humana/epidemiologia , Atenção Primária à Saúde/estatística & dados numéricos , Imunidade Adaptativa , Austrália/epidemiologia , Teorema de Bayes , Surtos de Doenças , Humanos , Modelos Teóricos , Vigilância da População/métodos , Atenção Primária à Saúde/tendências , Estações do AnoRESUMO
Understanding the risk of biological invasions associated with particular transport pathways and source regions is critical for implementing effective biosecurity management. This may require both a model for physical connectedness between regions, and a measure of environmental similarity, so as to quantify the potential for a species to be transported from a given region and to survive at a destination region. We present an analysis of integrated biosecurity risk into Australia, based on flights and shipping data from each global geopolitical region, and an adaptation of the "range bagging" method to determine environmental matching between regions. Here, we describe global patterns of environmental matching and highlight those regions with many physical connections. We classify patterns of global invasion risk (high to low) into Australian states and territories. We validate our analysis by comparison with global presence data for 844 phytophagous insect pest species, and produce a list of high-risk species not previously known to be present in Australia. We determined that, of the insect pest species used for validation, the species most likely to be present in Australia were those also present in geopolitical regions with high transport connectivity to Australia, and those regions that were geographically close, and had similar environments.
Assuntos
Insetos , Medição de Risco/métodos , Viagem Aérea , Animais , Austrália , Meio Ambiente , Geografia , Espécies Introduzidas , Modelos Biológicos , Reprodutibilidade dos Testes , NaviosRESUMO
Self-replicating gene drives that can spread deleterious alleles through animal populations have been promoted as a much needed but controversial 'silver bullet' for controlling invasive alien species. Homing-based drives comprise an endonuclease and a guide RNA (gRNA) that are replicated during meiosis via homologous recombination. However, their efficacy for controlling wild populations is threatened by inherent polymorphic resistance and the creation of resistance alleles via non-homologous end-joining (NHEJ)-mediated DNA repair. We used stochastic individual-based models to identify realistic gene-drive strategies capable of eradicating vertebrate pest populations (mice, rats and rabbits) on islands. One popular strategy, a sex-reversing drive that converts heterozygous females into sterile males, failed to spread and required the ongoing deployment of gene-drive carriers to achieve eradication. Under alternative strategies, multiplexed gRNAs could overcome inherent polymorphic resistance and were required for eradication success even when the probability of NHEJ was low. Strategies causing homozygotic embryonic non-viability or homozygotic female sterility produced high probabilities of eradication and were robust to NHEJ-mediated deletion of the DNA sequence between multiplexed endonuclease recognition sites. The latter two strategies also purged the gene drive when eradication failed, therefore posing lower long-term risk should animals escape beyond target islands. Multiplexing gRNAs will be necessary if this technology is to be useful for insular extirpation attempts; however, precise knowledge of homing rates will be required to design low-risk gene drives with high probabilities of eradication success.
Assuntos
Repetições Palindrômicas Curtas Agrupadas e Regularmente Espaçadas , Espécies Introduzidas , Controle de Pragas/métodos , Animais , Feminino , Ilhas , Masculino , Camundongos , Coelhos , RatosRESUMO
Scratch assays are often used to investigate potential drug treatments for chronic wounds and cancer. Interpreting these experiments with a mathematical model allows us to estimate the cell diffusivity, D, and the cell proliferation rate, λ. However, the influence of the experimental design on the estimates of D and λ is unclear. Here we apply an approximate Bayesian computation (ABC) parameter inference method, which produces a posterior distribution of D and λ, to new sets of synthetic data, generated from an idealised mathematical model, and experimental data for a non-adhesive mesenchymal population of fibroblast cells. The posterior distribution allows us to quantify the amount of information obtained about D and λ. We investigate two types of scratch assay, as well as varying the number and timing of the experimental observations captured. Our results show that a scrape assay, involving one cell front, provides more precise estimates of D and λ, and is more computationally efficient to interpret than a wound assay, with two opposingly directed cell fronts. We find that recording two observations, after making the initial observation, is sufficient to estimate D and λ, and that the final observation time should correspond to the time taken for the cell front to move across the field of view. These results provide guidance for estimating D and λ, while simultaneously minimising the time and cost associated with performing and interpreting the experiment.
Assuntos
Algoritmos , Movimento Celular , Proliferação de Células , Fibroblastos/citologia , Modelos Biológicos , Células 3T3 , Animais , Teorema de Bayes , Biologia Computacional/métodos , Camundongos , Reprodutibilidade dos Testes , Projetos de PesquisaRESUMO
Recently, pandemic response has involved the use of antivirals. These antivirals are often allocated to households dynamically throughout the pandemic with the aim being to retard the spread of infection. A drawback of this is that there is a delay until infection is confirmed and antivirals are delivered. Here an alternative allocation scheme is considered, where antivirals are instead preallocated to households at the start of a pandemic, thus reducing this delay. To compare these two schemes, a deterministic approximation to a novel stochastic household model is derived, which allows efficient computation of key quantities such as the expected epidemic final size, expected early growth rate, expected peak size and expected peak time of the epidemic. It is found that the theoretical best choice of allocation scheme depends on strain transmissibility, the delay in delivering antivirals under a dynamic allocation scheme and the stockpile size. A broad summary is that for realistic stockpile sizes, a dynamic allocation scheme is superior with the important exception of the epidemic final size under a severe pandemic scenario. Our results, viewed in conjunction with the practical considerations of implementing a preallocation scheme, support a focus on attempting to reduce the delay in delivering antivirals under a dynamic allocation scheme during a future pandemic.
Assuntos
Antivirais/administração & dosagem , Antivirais/provisão & distribuição , Influenza Humana/tratamento farmacológico , Influenza Humana/epidemiologia , Pandemias , Humanos , Influenza Humana/transmissão , Conceitos Matemáticos , Modelos Biológicos , Pandemias/estatística & dados numéricos , Processos Estocásticos , Estoque EstratégicoRESUMO
Biological invasions are a key component of human-induced global change. The continuing increase in global wildlife trade has raised concerns about the parallel increase in the number of new invasive species. However, the factors that link the wildlife trade to the biological invasion process are still poorly understood. Moreover, there are analytical challenges in researching the role of global wildlife trade in biological invasions, particularly issues related to the under-reporting of introduced and established populations in areas with reduced sampling effort. In this work, we use high-quality data on the international trade in Nearctic turtles (1999-2009) coupled with a statistical modelling framework, which explicitly accounts for detection, to investigate the factors that influence the introduction (release, or escape into the wild) of globally traded Nearctic turtles and the establishment success (self-sustaining exotic populations) of slider turtles (Trachemys scripta), the most frequently traded turtle species. We found that the introduction of a species was influenced by the total number of turtles exported to a jurisdiction and the age at maturity of the species, while the establishment success of slider turtles was best associated with the propagule number (number of release events), and the number of native turtles in the jurisdiction of introduction. These results indicate both a direct and indirect association between the wildlife trade and the introduction of turtles and establishment success of slider turtles, respectively. Our results highlight the existence of gaps in the number of globally recorded introduction events and established populations of slider turtles, although the expected bias is low. We emphasize the importance of researching independently the factors that affect the different stages of the invasion pathway. Critically, we observe that the number of traded individuals might not always be an adequate proxy for propagule pressure and establishment success.
Assuntos
Ecossistema , Espécies Introduzidas , Tartarugas/fisiologia , Animais , Comércio , Espécies Introduzidas/economia , Modelos Biológicos , América do Norte , Risco , Especificidade da EspécieRESUMO
Cell colonization during embryonic development involves cells migrating and proliferating over growing tissues. Unsuccessful colonization, resulting from genetic causes, can result in various birth defects. However not all individuals with the same mutation show the disease. This is termed incomplete penetrance, and it even extends to discordancy in monozygotic (identical) twins. A one-dimensional agent-based model of cell migration and proliferation within a growing tissue is presented, where the position of every cell is recorded at any time. We develop a new model that approximates this agent-based process - rather than requiring the precise configuration of cells within the tissue, the new model records the total number of cells, the position of the most advanced cell, and then invokes an approximation for how the cells are distributed. The probability mass function (PMF) for the most advanced cell is obtained for both the agent-based model and its approximation. The two PMFs compare extremely well, but using the approximation is computationally faster. Success or failure of colonization is probabilistic. For example for sufficiently high proliferation rate the colonization is assured. However, if the proliferation rate is sufficiently low, there will be a lower, say 50%, chance of success. These results provide insights into the puzzle of incomplete penetrance of a disease phenotype, especially in monozygotic twins. Indeed, stochastic cell behavior (amplified by disease-causing mutations) within the colonization process may play a key role in incomplete penetrance, rather than differences in genes, their expression or environmental conditions.
Assuntos
Desenvolvimento Embrionário , Processos Estocásticos , Doença de Hirschsprung/genética , Doença de Hirschsprung/patologia , Humanos , Cadeias de Markov , Probabilidade , Gêmeos MonozigóticosRESUMO
Processes that spread through local contact, including outbreaks of infectious diseases, are inherently noisy, and are frequently observed to be far noisier than predicted by standard stochastic models that assume homogeneous mixing. One way to reproduce the observed levels of noise is to introduce significant individual-level heterogeneity with respect to infection processes, such that some individuals are expected to generate more secondary cases than others. Here we consider a population where individuals can be naturally aggregated into clumps (subpopulations) with stronger interaction within clumps than between them. This clumped structure induces significant increases in the noisiness of a spreading process, such as the transmission of infection, despite complete homogeneity at the individual level. Given the ubiquity of such clumped aggregations (such as homes, schools and workplaces for humans or farms for livestock) we suggest this as a plausible explanation for noisiness of many epidemic time series.
Assuntos
Doenças Transmissíveis/epidemiologia , Dinâmica Populacional , Doenças Transmissíveis/transmissão , Surtos de Doenças/estatística & dados numéricos , Suscetibilidade a Doenças/epidemiologia , Epidemias/estatística & dados numéricos , Humanos , Dinâmica Populacional/estatística & dados numéricos , Processos EstocásticosRESUMO
BACKGROUND: When an outbreak of a novel pathogen occurs, some of the most pressing questions from a public-health point of view relate to its transmissibility, and the probabilities of different clinical outcomes following infection, to allow an informed response. Estimates of these quantities are often based on household data due to the high potential for transmission in this setting, but typically a rich spectrum of individual-level outcomes (from uninfected to serious illness) are simplified to binary data (infected or not). We address the added benefit from retaining the heterogeneous outcome information in the case of the 2009-10 influenza pandemic, which posed particular problems for estimation of key epidemiological characteristics due to its relatively mild nature and hence low case ascertainment rates. METHODS: We use mathematical models of within-household transmission and case ascertainment, together with Bayesian statistics to estimate transmission probabilities stratified by household size, the variability of infectiousness of cases, and a set of probabilities describing case ascertainment. This novel approach was applied to data we collected from the early "containment phase" stage of the epidemic in Birmingham, England. We also conducted a comprehensive review of studies of household transmission of influenza A(H1N1)pdm09. RESULTS: We find large variability in the published estimates of within-household transmissibility of influenza A(H1N1)pdm09 in both model-based studies and those reporting secondary attack rates, finding that these estimates are very sensitive to how an infected case is defined. In particular, we find that reliance on laboratory confirmation alone underestimates the true number of cases, while utilising the heterogeneous range of outcomes (based on case definitions) for household infections allows a far more comprehensive pattern of transmission to be elucidated. CONCLUSIONS: Differences in household sizes and how cases are defined could account for an appreciable proportion of the reported variability of within-household transmissibility of influenza A(H1N1)pdm09. Retaining and statistically analysing the full spectrum of individual-level outcomes (based on case definitions) rather than taking a potentially arbitrary threshold for infection, provides much-needed additional information. In a future pandemic, our approach could be used as a real-time analysis tool to infer the true number of cases, within-household transmission rates and levels of case ascertainment.
Assuntos
Surtos de Doenças , Métodos Epidemiológicos , Características da Família , Saúde da Família , Vírus da Influenza A Subtipo H1N1/isolamento & purificação , Influenza Humana/epidemiologia , Influenza Humana/transmissão , Bioestatística/métodos , Inglaterra/epidemiologia , Humanos , Vírus da Influenza A Subtipo H1N1/patogenicidade , Influenza Humana/virologia , Modelos TeóricosRESUMO
We investigate the spread of an exotic herb, Hieracium lepidulum, into a New Zealand Nothofagus forest with the aim of understanding how stand-development of tree populations, propagule pressure and invader persistence, affect invasion across the landscape and within communities. Using data repeatedly collected over 35 years, from 250 locations, we parametrize continuous-time Markov chain models and use these models to examine future projections of the invasion under a range of hypothetical scenarios. We found that the probability of invasion into a stand was relatively high following canopy disturbance and that local abundance of Hieracium was promoted by minor disturbances. However, model predictions extrapolated 45 years into the future show that neither the rate of landscape-level invasion, nor local population growth of Hieracium, was affected much by changing the frequency of canopy disturbance events. Instead, invasion levels were strongly affected by the ability of Hieracium to persist in the understorey following forest canopy closure, and by propagule supply from streams, forest edges and plants already established within the stand. Our results show that disturbance frequency has surprisingly little influence on the long-term trajectory of invasion, while invader persistence strongly determines invasion patterns.
Assuntos
Asteraceae/fisiologia , Espécies Introduzidas , Árvores , Cadeias de Markov , Modelos Biológicos , Nova Zelândia , Dinâmica PopulacionalRESUMO
A highly effective method for controlling the spread of an infectious disease is vaccination. However, there are many situations where vaccines are in limited supply. The ability to determine, under this constraint, a vaccination strategy which minimises the number of people that become infected over the course of a potential epidemic is essential. Two questions naturally arise: when is it best to allocate vaccines, and to whom should they be allocated? We address these questions in the context of metapopulation models of disease spread. We discover that in practice it is generally optimal to distribute all vaccines prophylactically, rather than withholding until infection is introduced. For small metapopulations, we provide a method for determining the optimal prophylactic allocation. As the optimal strategy becomes computationally intensive to obtain when the population size increases, we detail an approximation method to determine an approximately optimal vaccination scheme. We find that our approximate strategy is consistently at least as good as three strategies reported in the literature across a wide range of parameter values.
Assuntos
Epidemias , Vacinas , Epidemias/prevenção & controle , Humanos , VacinaçãoRESUMO
National influenza pandemic plans have evolved substantially over recent decades, as has the scientific research that underpins the advice contained within them. While the knowledge generated by many research activities has been directly incorporated into the current generation of pandemic plans, scientists and policymakers are yet to capitalise fully on the potential for near real-time analytics to formally contribute to epidemic decision-making. Theoretical studies demonstrate that it is now possible to make robust estimates of pandemic impact in the earliest stages of a pandemic using first few hundred household cohort (FFX) studies and algorithms designed specifically for analysing FFX data. Pandemic plans already recognise the importance of both situational awareness i.e., knowing pandemic impact and its key drivers, and the need for pandemic special studies and related analytic methods for estimating these drivers. An important next step is considering how information from these situational assessment activities can be integrated into the decision-making processes articulated in pandemic planning documents. Here we introduce a decision support tool that directly uses outputs from FFX algorithms to present recommendations on response options, including a quantification of uncertainty, to decision makers. We illustrate this approach using response information from within the Australian influenza pandemic plan.
Assuntos
Influenza Humana , Austrália , Humanos , Influenza Humana/epidemiologia , Pandemias/prevenção & controle , PolíticasRESUMO
Automated monitoring of websites that trade wildlife is increasingly necessary to inform conservation and biosecurity efforts. However, e-commerce and wildlife trading websites can contain a vast number of advertisements, an unknown proportion of which may be irrelevant to researchers and practitioners. Given that many wildlife-trade advertisements have an unstructured text format, automated identification of relevant listings has not traditionally been possible, nor attempted. Other scientific disciplines have solved similar problems using machine learning and natural language processing models, such as text classifiers. Here, we test the ability of a suite of text classifiers to extract relevant advertisements from wildlife trade occurring on the Internet. We collected data from an Australian classifieds website where people can post advertisements of their pet birds (n = 16.5k advertisements). We found that text classifiers can predict, with a high degree of accuracy, which listings are relevant (ROC AUC ≥ 0.98, F1 score ≥ 0.77). Furthermore, in an attempt to answer the question 'how much data is required to have an adequately performing model?', we conducted a sensitivity analysis by simulating decreases in sample sizes to measure the subsequent change in model performance. From our sensitivity analysis, we found that text classifiers required a minimum sample size of 33% (c. 5.5k listings) to accurately identify relevant listings (for our dataset), providing a reference point for future applications of this sort. Our results suggest that text classification is a viable tool that can be applied to the online trade of wildlife to reduce time dedicated to data cleaning. However, the success of text classifiers will vary depending on the advertisements and websites, and will therefore be context dependent. Further work to integrate other machine learning tools, such as image classification, may provide better predictive abilities in the context of streamlining data processing for wildlife trade related online data.