Your browser doesn't support javascript.
loading
Mostrar: 20 | 50 | 100
Resultados 1 - 20 de 44
Filtrar
Mais filtros

Base de dados
Tipo de documento
Intervalo de ano de publicação
1.
J Clin Microbiol ; 57(5)2019 05.
Artigo em Inglês | MEDLINE | ID: mdl-30728194

RESUMO

Foodborne pathogen surveillance in the United States is transitioning from strain identification using restriction digest technology (pulsed-field gel electrophoresis [PFGE]) to shotgun sequencing of the entire genome (whole-genome sequencing [WGS]). WGS requires a new suite of analysis tools, some of which have long histories in academia but are new to the field of public health and regulatory decision making. Although the general workflow is fairly standard for collecting and analyzing WGS data for disease surveillance, there are a number of differences in how the data are collected and analyzed across public health agencies, both nationally and internationally. This impedes collaborative public health efforts, so national and international efforts are underway to enable direct comparison of these different analysis methods. Ultimately, the harmonization efforts will allow the (mutually trusted and understood) production and analysis of WGS data by labs and agencies worldwide, thus improving outbreak response capabilities globally. This review provides a historical perspective on the use of WGS for pathogen tracking and summarizes the efforts underway to ensure the major steps in phylogenomic pipelines used for pathogen disease surveillance can be readily validated. The tools for doing this will ensure that the results produced are sound, reproducible, and comparable across different analytic approaches.


Assuntos
Bactérias/genética , Análise de Dados , Doenças Transmitidas por Alimentos/diagnóstico , Filogenia , Bactérias/patogenicidade , Biologia Computacional/métodos , Biologia Computacional/normas , Surtos de Doenças/prevenção & controle , Eletroforese em Gel de Campo Pulsado , Monitoramento Epidemiológico , Genoma Bacteriano , Humanos , Saúde Pública , Estados Unidos , Sequenciamento Completo do Genoma
2.
BMC Bioinformatics ; 18(1): 178, 2017 Mar 20.
Artigo em Inglês | MEDLINE | ID: mdl-28320310

RESUMO

BACKGROUND: Using phylogenomic analysis tools for tracking pathogens has become standard practice in academia, public health agencies, and large industries. Using the same raw read genomic data as input, there are several different approaches being used to infer phylogenetic tree. These include many different SNP pipelines, wgMLST approaches, k-mer algorithms, whole genome alignment and others; each of these has advantages and disadvantages, some have been extensively validated, some are faster, some have higher resolution. A few of these analysis approaches are well-integrated into the regulatory process of US Federal agencies (e.g. the FDA's SNP pipeline for tracking foodborne pathogens). However, despite extensive validation on benchmark datasets and comparison with other pipelines, we lack methods for fully exploring the effects of multiple parameter values in each pipeline that can potentially have an effect on whether the correct phylogenetic tree is recovered. RESULTS: To resolve this problem, we offer a program, TreeToReads, which can generate raw read data from mutated genomes simulated under a known phylogeny. This simulation pipeline allows direct comparisons of simulated and observed data in a controlled environment. At each step of these simulations, researchers can vary parameters of interest (e.g., input tree topology, amount of sequence divergence, rate of indels, read coverage, distance of reference genome, etc) to assess the effects of various parameter values on correctly calling SNPs and reconstructing an accurate tree. CONCLUSIONS: Such critical assessments of the accuracy and robustness of analytical pipelines are essential to progress in both research and applied settings.


Assuntos
Genômica/métodos , Filogenia
3.
Planta Med ; 83(18): 1420-1430, 2017 Dec.
Artigo em Inglês | MEDLINE | ID: mdl-28651291

RESUMO

Precise, species-level identification of plants in foods and dietary supplements is difficult. While the use of DNA barcoding regions (short regions of DNA with diagnostic utility) has been effective for many inquiries, it is not always a robust approach for closely related species, especially in highly processed products. The use of fully sequenced chloroplast genomes, as an alternative to short diagnostic barcoding regions, has demonstrated utility for closely related species. The U. S. Food and Drug Administration (FDA) has also developed species-specific DNA-based assays targeting plant species of interest by utilizing chloroplast genome sequences. Here, we introduce a repository of complete chloroplast genome sequences called GenomeTrakrCP, which will be publicly available at the National Center for Biotechnology Information (NCBI). Target species for inclusion are plants found in foods and dietary supplements, toxin producers, common contaminants and adulterants, and their close relatives. Publicly available data will include annotated assemblies, raw sequencing data, and voucher information with each NCBI accession associated with an authenticated reference herbarium specimen. To date, 40 complete chloroplast genomes have been deposited in GenomeTrakrCP (https://www.ncbi.nlm.nih.gov/bioproject/PRJNA325670/), and this will be expanded in the future.


Assuntos
Bases de Dados de Ácidos Nucleicos/normas , Genoma de Cloroplastos/genética , Plantas/classificação , Código de Barras de DNA Taxonômico , DNA de Cloroplastos/química , DNA de Cloroplastos/genética , Anotação de Sequência Molecular , Folhas de Planta/classificação , Folhas de Planta/genética , Plantas/genética , Padrões de Referência , Especificidade da Espécie , Estados Unidos , United States Food and Drug Administration
4.
Microbiology (Reading) ; 161(2): 374-386, 2015 Feb.
Artigo em Inglês | MEDLINE | ID: mdl-28206902

RESUMO

Prokaryotic CRISPR-Cas (clustered regularly interspaced short palindromic repeats and CRISPR-associated genes) systems provide adaptive immunity from invasive genetic elements and encompass three essential features: (i) cas genes, (ii) a CRISPR array composed of spacers and direct repeats and (iii) an AT-rich leader sequence upstream of the array. We performed in-depth sequence analysis of the CRISPR-Cas systems in >600 Salmonella, representing four clinically prevalent serovars. Each CRISPR-Cas feature is extremely conserved in the Salmonella, and the CRISPR1 locus is more highly conserved than CRISPR2. Array composition is serovar-specific, although no convincing evidence of recent spacer acquisition against exogenous nucleic acids exists. Only 12 % of spacers match phage and plasmid sequences and self-targeting spacers are associated with direct repeat variants. High nucleotide identity (>99.9 %) exists across the cas operon among isolates of a single serovar and in some cases this conservation extends across divergent serovars. These observations reflect historical CRISPR-Cas immune activity, showing that this locus has ceased undergoing adaptive events. Intriguingly, the high level of conservation across divergent serovars shows that the genetic integrity of these inactive loci is maintained over time, contrasting with the canonical view that inactive CRISPR loci degenerate over time. This thorough characterization of Salmonella CRISPR-Cas systems presents new insights into Salmonella CRISPR evolution, particularly with respect to cas gene conservation, leader sequences, organization of direct repeats and protospacer matches. Collectively, our data suggest that Salmonella CRISPR-Cas systems are no longer immunogenic; rather, their impressive conservation indicates they may have an alternative function in Salmonella.

5.
Microbiology (Reading) ; 161(Pt 2): 374-86, 2015 Feb.
Artigo em Inglês | MEDLINE | ID: mdl-25479838

RESUMO

Prokaryotic CRISPR-Cas (clustered regularly interspaced short palindromic repeats and CRISPR-associated genes) systems provide adaptive immunity from invasive genetic elements and encompass three essential features: (i) cas genes, (ii) a CRISPR array composed of spacers and direct repeats and (iii) an AT-rich leader sequence upstream of the array. We performed in-depth sequence analysis of the CRISPR-Cas systems in >600 Salmonella, representing four clinically prevalent serovars. Each CRISPR-Cas feature is extremely conserved in the Salmonella, and the CRISPR1 locus is more highly conserved than CRISPR2. Array composition is serovar-specific, although no convincing evidence of recent spacer acquisition against exogenous nucleic acids exists. Only 12% of spacers match phage and plasmid sequences and self-targeting spacers are associated with direct repeat variants. High nucleotide identity (>99.9%) exists across the cas operon among isolates of a single serovar and in some cases this conservation extends across divergent serovars. These observations reflect historical CRISPR-Cas immune activity, showing that this locus has ceased undergoing adaptive events. Intriguingly, the high level of conservation across divergent serovars shows that the genetic integrity of these inactive loci is maintained over time, contrasting with the canonical view that inactive CRISPR loci degenerate over time. This thorough characterization of Salmonella CRISPR-Cas systems presents new insights into Salmonella CRISPR evolution, particularly with respect to cas gene conservation, leader sequences, organization of direct repeats and protospacer matches. Collectively, our data suggest that Salmonella CRISPR-Cas systems are no longer immunogenic; rather, their impressive conservation indicates they may have an alternative function in Salmonella.


Assuntos
Sistemas CRISPR-Cas , Evolução Molecular , Salmonella/genética , Técnicas de Tipagem Bacteriana , Sequência de Bases , DNA Bacteriano/genética , Variação Genética , Humanos , Dados de Sequência Molecular , Polimorfismo de Nucleotídeo Único , Salmonella/classificação , Salmonella/isolamento & purificação , Infecções por Salmonella/microbiologia
6.
mSystems ; 9(6): e0141523, 2024 Jun 18.
Artigo em Inglês | MEDLINE | ID: mdl-38819130

RESUMO

Wastewater surveillance has emerged as a crucial public health tool for population-level pathogen surveillance. Supported by funding from the American Rescue Plan Act of 2021, the FDA's genomic epidemiology program, GenomeTrakr, was leveraged to sequence SARS-CoV-2 from wastewater sites across the United States. This initiative required the evaluation, optimization, development, and publication of new methods and analytical tools spanning sample collection through variant analyses. Version-controlled protocols for each step of the process were developed and published on protocols.io. A custom data analysis tool and a publicly accessible dashboard were built to facilitate real-time visualization of the collected data, focusing on the relative abundance of SARS-CoV-2 variants and sub-lineages across different samples and sites throughout the project. From September 2021 through June 2023, a total of 3,389 wastewater samples were collected, with 2,517 undergoing sequencing and submission to NCBI under the umbrella BioProject, PRJNA757291. Sequence data were released with explicit quality control (QC) tags on all sequence records, communicating our confidence in the quality of data. Variant analysis revealed wide circulation of Delta in the fall of 2021 and captured the sweep of Omicron and subsequent diversification of this lineage through the end of the sampling period. This project successfully achieved two important goals for the FDA's GenomeTrakr program: first, contributing timely genomic data for the SARS-CoV-2 pandemic response, and second, establishing both capacity and best practices for culture-independent, population-level environmental surveillance for other pathogens of interest to the FDA. IMPORTANCE: This paper serves two primary objectives. First, it summarizes the genomic and contextual data collected during a Covid-19 pandemic response project, which utilized the FDA's laboratory network, traditionally employed for sequencing foodborne pathogens, for sequencing SARS-CoV-2 from wastewater samples. Second, it outlines best practices for gathering and organizing population-level next generation sequencing (NGS) data collected for culture-free, surveillance of pathogens sourced from environmental samples.


Assuntos
COVID-19 , SARS-CoV-2 , United States Food and Drug Administration , Águas Residuárias , SARS-CoV-2/genética , Estados Unidos/epidemiologia , Águas Residuárias/virologia , COVID-19/epidemiologia , COVID-19/transmissão , COVID-19/prevenção & controle , COVID-19/virologia , Humanos , Pandemias/prevenção & controle , Genoma Viral/genética , Vigilância Epidemiológica Baseada em Águas Residuárias
7.
Microb Genom ; 10(6)2024 Jun.
Artigo em Inglês | MEDLINE | ID: mdl-38860884

RESUMO

As public health laboratories expand their genomic sequencing and bioinformatics capacity for the surveillance of different pathogens, labs must carry out robust validation, training, and optimization of wet- and dry-lab procedures. Achieving these goals for algorithms, pipelines and instruments often requires that lower quality datasets be made available for analysis and comparison alongside those of higher quality. This range of data quality in reference sets can complicate the sharing of sub-optimal datasets that are vital for the community and for the reproducibility of assays. Sharing of useful, but sub-optimal datasets requires careful annotation and documentation of known issues to enable appropriate interpretation, avoid being mistaken for better quality information, and for these data (and their derivatives) to be easily identifiable in repositories. Unfortunately, there are currently no standardized attributes or mechanisms for tagging poor-quality datasets, or datasets generated for a specific purpose, to maximize their utility, searchability, accessibility and reuse. The Public Health Alliance for Genomic Epidemiology (PHA4GE) is an international community of scientists from public health, industry and academia focused on improving the reproducibility, interoperability, portability, and openness of public health bioinformatic software, skills, tools and data. To address the challenges of sharing lower quality datasets, PHA4GE has developed a set of standardized contextual data tags, namely fields and terms, that can be included in public repository submissions as a means of flagging pathogen sequence data with known quality issues, increasing their discoverability. The contextual data tags were developed through consultations with the community including input from the International Nucleotide Sequence Data Collaboration (INSDC), and have been standardized using ontologies - community-based resources for defining the tag properties and the relationships between them. The standardized tags are agnostic to the organism and the sequencing technique used and thus can be applied to data generated from any pathogen using an array of sequencing techniques. The tags can also be applied to synthetic (lab created) data. The list of standardized tags is maintained by PHA4GE and can be found at https://github.com/pha4ge/contextual_data_QC_tags. Definitions, ontology IDs, examples of use, as well as a JSON representation, are provided. The PHA4GE QC tags were tested, and are now implemented, by the FDA's GenomeTrakr laboratory network as part of its routine submission process for SARS-CoV-2 wastewater surveillance. We hope that these simple, standardized tags will help improve communication regarding quality control in public repositories, in addition to making datasets of variable quality more easily identifiable. Suggestions for additional tags can be submitted to PHA4GE via the New Term Request Form in the GitHub repository. By providing a mechanism for feedback and suggestions, we also expect that the tags will evolve with the needs of the community.


Assuntos
Biologia Computacional , Saúde Pública , Controle de Qualidade , Humanos , Biologia Computacional/métodos , Disseminação de Informação/métodos , Reprodutibilidade dos Testes , Anotação de Sequência Molecular/métodos , Genômica/métodos , Software
8.
PeerJ ; 11: e14596, 2023.
Artigo em Inglês | MEDLINE | ID: mdl-36721781

RESUMO

Background: The accurate identification of SARS-CoV-2 (SC2) variants and estimation of their abundance in mixed population samples (e.g., air or wastewater) is imperative for successful surveillance of community level trends. Assessing the performance of SC2 variant composition estimators (VCEs) should improve our confidence in public health decision making. Here, we introduce a linear regression based VCE and compare its performance to four other VCEs: two re-purposed DNA sequence read classifiers (Kallisto and Kraken2), a maximum-likelihood based method (Lineage deComposition for Sars-Cov-2 pooled samples (LCS)), and a regression based method (Freyja). Methods: We simulated DNA sequence datasets of known variant composition from both Illumina and Oxford Nanopore Technologies (ONT) platforms and assessed the performance of each VCE. We also evaluated VCEs performance using publicly available empirical wastewater samples collected for SC2 surveillance efforts. Bioinformatic analyses were performed with a custom NextFlow workflow (C-WAP, CFSAN Wastewater Analysis Pipeline). Relative root mean squared error (RRMSE) was used as a measure of performance with respect to the known abundance and concordance correlation coefficient (CCC) was used to measure agreement between pairs of estimators. Results: Based on our results from simulated data, Kallisto was the most accurate estimator as it had the lowest RRMSE, followed by Freyja. Kallisto and Freyja had the most similar predictions, reflected by the highest CCC metrics. We also found that accuracy was platform and amplicon panel dependent. For example, the accuracy of Freyja was significantly higher with Illumina data compared to ONT data; performance of Kallisto was best with ARTICv4. However, when analyzing empirical data there was poor agreement among methods and variations in the number of variants detected (e.g., Freyja ARTICv4 had a mean of 2.2 variants while Kallisto ARTICv4 had a mean of 10.1 variants). Conclusion: This work provides an understanding of the differences in performance of a number of VCEs and how accurate they are in capturing the relative abundance of SC2 variants within a mixed sample (e.g., wastewater). Such information should help officials gauge the confidence they can have in such data for informing public health decisions.


Assuntos
COVID-19 , Humanos , COVID-19/diagnóstico , Funções Verossimilhança , SARS-CoV-2/genética , Águas Residuárias
9.
Microbiol Resour Announc ; 12(9): e0016323, 2023 Sep 19.
Artigo em Inglês | MEDLINE | ID: mdl-37504519

RESUMO

The continued emergence and spread of antimicrobial resistance among pathogenic bacteria are ever-growing threats to health and economy. Here, we report the draft genomes for 45 Enterobacterales clinical isolates, including historical and contemporary drug-resistant organisms, obtained in Pakistan between 1998 and 2016: 5 Serratia, 3 Salmonella, 3 Enterobacter, and 34 Klebsiella.

10.
Microb Genom ; 9(12)2023 Dec.
Artigo em Inglês | MEDLINE | ID: mdl-38085797

RESUMO

Fast, efficient public health actions require well-organized and coordinated systems that can supply timely and accurate knowledge. Public databases of pathogen genomic data, such as the International Nucleotide Sequence Database Collaboration (INSDC), have become essential tools for efficient public health decisions. However, these international resources began primarily for academic purposes, rather than for surveillance or interventions. Now, queries need to access not only the whole genomes of multiple pathogens but also make connections using robust contextual metadata to identify issues of public health relevance. Databases that over time developed a patchwork of submission formats and requirements need to be consistently organized and coordinated internationally to allow effective searches.To help resolve these issues, we propose a common pathogen data structure called the Pathogen Data Object Model (DOM) that will formalize the minimum pieces of sequence data and contextual data necessary for general public health uses, while recognizing that submitters will likely withhold a wide range of non-public contextual data. Further, we propose contributors use the Pathogen DOM for all pathogen submissions (bacterial, viral, fungal, and parasites), which will simplify data submissions and provide a consistent and transparent data structure for downstream data analyses. We also highlight how improved submission tools can support the Pathogen DOM, offering users additional easy-to-use methods to ensure this structure is followed.


Assuntos
Nucleotídeos , Saúde Pública , Sequência de Bases , Genômica/métodos , Bases de Dados de Ácidos Nucleicos
11.
J Bacteriol ; 194(21): 5994-5, 2012 Nov.
Artigo em Inglês | MEDLINE | ID: mdl-23045502

RESUMO

Salmonella enterica subsp. enterica serovar Enteritidis is a common food-borne pathogen, often associated with shell eggs and poultry. Here, we report draft genomes of 21 S. Enteritidis strains associated with or related to the U.S.-wide 2010 shell egg recall. Eleven of these genomes were from environmental isolates associated with the egg outbreak, and 10 were reference isolates from previous years, unrelated to the outbreak. The whole-genome sequence data for these 21 human pathogen strains are being released in conjunction with the newly formed 100K Genome Project.


Assuntos
DNA Bacteriano/química , DNA Bacteriano/genética , Genoma Bacteriano , Salmonella enteritidis/genética , Análise de Sequência de DNA , Ovos/microbiologia , Microbiologia Ambiental , Humanos , Dados de Sequência Molecular , Salmonella enteritidis/isolamento & purificação
12.
New Phytol ; 195(4): 857-871, 2012 Sep.
Artigo em Inglês | MEDLINE | ID: mdl-22738134

RESUMO

The aims of this study were to investigate the appearance of strigolactones in the green lineage and to determine the primitive function of these molecules. We measured the strigolactone content of several isolated liverworts, mosses, charophyte and chlorophyte green algae using a sensitive biological assay and LC-MS/MS analyses. In parallel, sequence comparison of strigolactone-related genes and phylogenetic analyses were performed using available genomic data and newly sequenced expressed sequence tags. The primitive function of strigolactones was determined by exogenous application of the synthetic strigolactone analog, GR24, and by mutant phenotyping. Liverworts, the most basal Embryophytes and Charales, one of the closest green algal relatives to Embryophytes, produce strigolactones, whereas several other species of green algae do not. We showed that GR24 stimulates rhizoid elongation of Charales, liverworts and mosses, and rescues the phenotype of the strigolactone-deficient Ppccd8 mutant of Physcomitrella patens. These findings demonstrate that the first function of strigolactones was not to promote arbuscular mycorrhizal symbiosis. Rather, they suggest that the strigolactones appeared earlier in the streptophyte lineage to control rhizoid elongation. They may have been conserved in basal Embryophytes for this role and then recruited for the stimulation of colonization by glomeromycotan fungi.


Assuntos
Bryopsida/genética , Evolução Molecular , Hepatófitas/genética , Lactonas/metabolismo , Filogenia , Proteínas de Algas/química , Proteínas de Algas/genética , Proteínas de Algas/metabolismo , Sequência de Aminoácidos , Bryopsida/efeitos dos fármacos , Bryopsida/crescimento & desenvolvimento , Clorófitas/efeitos dos fármacos , Clorófitas/genética , Clorófitas/metabolismo , Cromatografia Líquida , Genes de Plantas/genética , Hepatófitas/efeitos dos fármacos , Hepatófitas/metabolismo , Lactonas/farmacologia , Espectrometria de Massas , Dados de Sequência Molecular , Alinhamento de Sequência
13.
Plant Physiol ; 155(1): 209-21, 2011 Jan.
Artigo em Inglês | MEDLINE | ID: mdl-21081694

RESUMO

Auxin signaling is central to plant growth and development, yet hardly anything is known about its evolutionary origin. While the presence of key players in auxin signaling has been analyzed in various land plant species, similar analyses in the green algal lineages are lacking. Here, we survey the key players in auxin biology in the available genomes of Chlorophyta species. We found that the genetic potential for auxin biosynthesis and AUXIN1 (AUX1)/LIKE AUX1- and P-GLYCOPROTEIN/ATP-BINDING CASSETTE subfamily B-dependent transport is already present in several single-celled and colony-forming Chlorophyta species. In addition, our analysis of expressed sequence tag libraries from Coleochaete orbicularis and Spirogyra pratensis, green algae of the Streptophyta clade that are evolutionarily closer to the land plants than those of the Chlorophyta clade, revealed the presence of partial AUXIN RESPONSE FACTORs and/or AUXIN/INDOLE-3-ACETIC ACID proteins (the key factors in auxin signaling) and PIN-FORMED-like proteins (the best-characterized auxin-efflux carriers). While the identification of these possible AUXIN RESPONSE FACTOR- and AUXIN/INDOLE-3-ACETIC ACID precursors and putative PIN-FORMED orthologs calls for a deeper investigation of their evolution after sequencing more intermediate genomes, it emphasizes that the canonical auxin response machinery and auxin transport mechanisms were, at least in part, already present before plants "moved" to land habitats.


Assuntos
Clorófitas/genética , Evolução Molecular , Ácidos Indolacéticos/metabolismo , Transdução de Sinais/genética , Sequência de Aminoácidos , Arabidopsis/genética , Transporte Biológico , Chlamydomonas/genética , Clorófitas/metabolismo , Dados de Sequência Molecular , Filogenia , Regiões Promotoras Genéticas/genética , Proteínas/química , Proteínas/genética , Proteínas/metabolismo
14.
Gigascience ; 112022 02 16.
Artigo em Inglês | MEDLINE | ID: mdl-35169842

RESUMO

BACKGROUND: The Public Health Alliance for Genomic Epidemiology (PHA4GE) (https://pha4ge.org) is a global coalition that is actively working to establish consensus standards, document and share best practices, improve the availability of critical bioinformatics tools and resources, and advocate for greater openness, interoperability, accessibility, and reproducibility in public health microbial bioinformatics. In the face of the current pandemic, PHA4GE has identified a need for a fit-for-purpose, open-source SARS-CoV-2 contextual data standard. RESULTS: As such, we have developed a SARS-CoV-2 contextual data specification package based on harmonizable, publicly available community standards. The specification can be implemented via a collection template, as well as an array of protocols and tools to support both the harmonization and submission of sequence data and contextual information to public biorepositories. CONCLUSIONS: Well-structured, rich contextual data add value, promote reuse, and enable aggregation and integration of disparate datasets. Adoption of the proposed standard and practices will better enable interoperability between datasets and systems, improve the consistency and utility of generated data, and ultimately facilitate novel insights and discoveries in SARS-CoV-2 and COVID-19. The package is now supported by the NCBI's BioSample database.


Assuntos
COVID-19 , SARS-CoV-2 , Genômica , Humanos , Metadados , Saúde Pública , Reprodutibilidade dos Testes
15.
J Food Prot ; 85(5): 755-772, 2022 05 01.
Artigo em Inglês | MEDLINE | ID: mdl-35259246

RESUMO

ABSTRACT: This multiagency report developed by the Interagency Collaboration for Genomics for Food and Feed Safety provides an overview of the use of and transition to whole genome sequencing (WGS) technology for detection and characterization of pathogens transmitted commonly by food and for identification of their sources. We describe foodborne pathogen analysis, investigation, and harmonization efforts among the following federal agencies: National Institutes of Health; Department of Health and Human Services, Centers for Disease Control and Prevention (CDC) and U.S. Food and Drug Administration (FDA); and the U.S. Department of Agriculture, Food Safety and Inspection Service, Agricultural Research Service, and Animal and Plant Health Inspection Service. We describe single nucleotide polymorphism, core-genome, and whole genome multilocus sequence typing data analysis methods as used in the PulseNet (CDC) and GenomeTrakr (FDA) networks, underscoring the complementary nature of the results for linking genetically related foodborne pathogens during outbreak investigations while allowing flexibility to meet the specific needs of Interagency Collaboration partners. We highlight how we apply WGS to pathogen characterization (virulence and antimicrobial resistance profiles) and source attribution efforts and increase transparency by making the sequences and other data publicly available through the National Center for Biotechnology Information. We also highlight the impact of current trends in the use of culture-independent diagnostic tests for human diagnostic testing on analytical approaches related to food safety and what is next for the use of WGS in the area of food safety.


Assuntos
Doenças Transmitidas por Alimentos , Animais , Surtos de Doenças/prevenção & controle , Inocuidade dos Alimentos , Doenças Transmitidas por Alimentos/epidemiologia , Doenças Transmitidas por Alimentos/prevenção & controle , Genômica , Estados Unidos , Sequenciamento Completo do Genoma
16.
Microb Genom ; 7(2)2021 02.
Artigo em Inglês | MEDLINE | ID: mdl-33539276

RESUMO

Salmonella enterica subspecies arizonae is frequently associated with animal reservoirs, particularly reptiles, and can cause illness in some mammals, including humans. Using whole-genome sequencing data, core genome phylogenetic analyses were performed using 112 S. enterica subsp. arizonae isolates, representing 46 of 102 described serovars. Nearly one-third of these are polyphyletic, including two serovars that appear in four and five distinct evolutionary lineages. Subspecies arizonae has a monophasic H antigen. Among the 46 serovars investigated, only 8 phase 1 H antigens were identified, demonstrating high conservation for this antigen. Prophages and plasmids were found throughout this subspecies, including five novel prophages. Polyphyly was also reflected in prophage content, although some clade-specific enrichment for some phages was observed. IncFII(S) was the most frequent plasmid replicon identified and was found in a quarter of S. enterica subsp. arizonae genomes. Salmonella pathogenicity islands (SPIs) 1 and 2 are present across all Salmonella, including this subspecies, although effectors sipA, sptP and arvA in SPI-1 and sseG and ssaI in SPI-2 appear to be lost in this lineage. SPI-20, encoding a type VI secretion system, is exclusive to this subspecies and is well maintained in all genomes sampled. A number of fimbral operons were identified, including the sas operon that appears to be a synapomorphy for this subspecies, while others exhibited more clade-specific patterns. This work reveals evolutionary patterns in S. enterica subsp. arizonae that make this subspecies a unique lineage within this very diverse species.


Assuntos
Antígenos de Bactérias/genética , Salmonella enterica/classificação , Sequenciamento Completo do Genoma/métodos , Antígenos de Bactérias/imunologia , Fímbrias Bacterianas/genética , Genoma Bacteriano , Ilhas Genômicas , Sequenciamento de Nucleotídeos em Larga Escala , Filogenia , Plasmídeos/genética , Prófagos/genética , Salmonella enterica/genética , Salmonella enterica/imunologia , Sorogrupo
17.
J Infect Dev Ctries ; 15(12): 1899-1909, 2021 12 31.
Artigo em Inglês | MEDLINE | ID: mdl-35044949

RESUMO

INTRODUCTION: Non-typhoidal Salmonella are major foodborne pathogens causing serious challenges to public health and food safety worldwide. This study aimed to determine the resistance, virulence genes, sequence type, using multi-locus sequence typing, plasmids and Single Nucleotide Polymorphisms (SNPs) of Salmonella enterica subsp. enterica serovar Nigeria (S. Nigeria) from livestock in Ilorin, North central Nigeria. METHODOLOGY: A total of 1,500 samples from pig (feces; n = 600) and poultry (feces, postmortem samples; n = 900) were collected and analyzed between 2014 to 2017. Presumptive Salmonella isolates were characterized by Whole Genome Sequencing (WGS). RESULTS: We recovered nine S. Nigeria serovars. All the isolates harbored a single point mutation parC(T57S) in addition to qnrB19 and the tetA gene. Furthermore, two plasmids, Col(pHAD28) and IncQ1 predicted to encode qnrB19 and tetA genes, respectively, were detected in all the strains. All the isolates belonged to a single sequence type (ST) 4911, the SNP-based phylogeny showed all the isolates to be highly related, in addition two clinical isolates from the United Kingdom (UK) and Canada, collected outside of this study, also fell into this cluster. Twenty virulence genes were identified from Salmonella Pathogenicity Islands (SPI), chromosomal and fimbriae loci. CONCLUSIONS: This study highlights the roles of pig and poultry in the emergence and spread of S. Nigeria serovar in Nigeria, sub-Sahara Africa. It also highlighted the importance of WGS in clinical and epidemiological surveillance. There is the need for collaborative research studies to investigate the public health importance of Salmonella enterica serovar Nigeria.


Assuntos
Contaminação de Alimentos/análise , Testes de Sensibilidade Microbiana/métodos , Aves Domésticas , Suínos , Animais , Anti-Infecciosos , Farmacorresistência Bacteriana Múltipla/genética , Nigéria , Salmonella enterica/genética , Salmonella enterica/isolamento & purificação , Sequenciamento Completo do Genoma
18.
mSystems ; 6(1)2021 02 23.
Artigo em Inglês | MEDLINE | ID: mdl-33622857

RESUMO

Microbiome samples are inherently defined by the environment in which they are found. Therefore, data that provide context and enable interpretation of measurements produced from biological samples, often referred to as metadata, are critical. Important contributions have been made in the development of community-driven metadata standards; however, these standards have not been uniformly embraced by the microbiome research community. To understand how these standards are being adopted, or the barriers to adoption, across research domains, institutions, and funding agencies, the National Microbiome Data Collaborative (NMDC) hosted a workshop in October 2019. This report provides a summary of discussions that took place throughout the workshop, as well as outcomes of the working groups initiated at the workshop.

19.
BMC Plant Biol ; 10: 96, 2010 May 25.
Artigo em Inglês | MEDLINE | ID: mdl-20500869

RESUMO

UNLABELLED: The large and diverse land plant lineage is nested within a clade of fresh water green algae, the charophytes. Collection of genome-scale data for land plants and other organisms over the past decade has invigorated the field of evolutionary biology. One of the core questions in the field asks: how did a colonization event by a green algae over 450 mya lead to one of the most successful lineages on the tree of life? This question can best be answered using the comparative method, the first step of which is to gather genome-scale data across closely related lineages to land plants. Before sequencing an entire genome it is useful to first gather transcriptome data: it is less expensive, it targets the protein coding regions of the genome, and provides support for gene models for future genome sequencing. We built Expressed Sequence Tag (EST) libraries for two charophyte species, Coleochaete orbicularis (Coleochaetales) and Spirogyra pratensis (Zygnematales). We used both Sanger sequencing and next generation 454 sequencing to cover as much of the transcriptome as possible. RESULTS: Our sequencing effort for Spirogyra pratensis yielded 9,984 5' Sanger reads plus 598,460 GS FLX Standard 454 sequences; Coleochaete orbicularis yielded 4,992 5' Sanger reads plus 673,811 GS FLX Titanium 454 sequences. After clustering S. pratensis yielded 12,000 unique transcripts, or unigenes, and C. orbicularis yielded 19,000. Both transcriptomes were very plant-like, i.e. most of the transcripts were more similar to streptophytes (land plants + charophyte green algae) than to other green algae in the sister group chlorophytes. BLAST results of several land plant genes hypothesized to be important in early land plant evolution resulted in high quality hits in both transcriptomes revealing putative orthologs ripe for follow-up studies. CONCLUSIONS: Two main conclusions were drawn from this study. One illustrates the utility of next generation sequencing for transcriptome studies: larger scale data collection at a lower cost enabled us to cover a considerable portion of the transcriptome for both species. And, two, that the charophyte green algal transcriptoms are remarkably plant-like, which gives them the unique capacity to be major players for future evolutionary genomic studies addressing origin of land plant questions.


Assuntos
Clorófitas/genética , Evolução Molecular , Perfilação da Expressão Gênica , Análise por Conglomerados , Hibridização Genômica Comparativa , DNA de Algas/genética , Etiquetas de Sequências Expressas , Biblioteca Genômica , Análise de Sequência de DNA/métodos
20.
One Health Outlook ; 2(1): 20, 2020.
Artigo em Inglês | MEDLINE | ID: mdl-33103064

RESUMO

The holistic approach of One Health, which sees human, animal, plant, and environmental health as a unit, rather than discrete parts, requires not only interdisciplinary cooperation, but standardized methods for communicating and archiving data, enabling participants to easily share what they have learned and allow others to build upon their findings. Ongoing work by NCBI and the GenomeTrakr project illustrates how open data platforms can help meet the needs of federal and state regulators, public health laboratories, departments of agriculture, and universities. Here we describe how microbial pathogen surveillance can be transformed by having an open access database along with Best Practices for contributors to follow. First, we describe the open pathogen surveillance framework, hosted on the NCBI platform. We cover the current community standards for WGS quality, provide an SOP for assessing your own sequence quality and recommend QC thresholds for all submitters to follow. We then provide an overview of NCBI data submission along with step by step details. And finally, we provide curation guidance and an SOP for keeping your public data current within the database. These Best Practices can be models for other open data projects, thereby advancing the One Health goals of Findable, Accessible, Interoperable and Re-usable (FAIR) data.

SELEÇÃO DE REFERÊNCIAS
DETALHE DA PESQUISA