ABSTRACT
The National Center for Biotechnology Information (NCBI) provides online information resources for biology, including the GenBank® nucleic acid sequence database and the PubMed® database of citations and abstracts published in life science journals. NCBI provides search and retrieval operations for most of these data from 35 distinct databases. The E-utilities serve as the programming interface for most of these databases. Resources receiving significant updates in the past year include PubMed, PMC, Bookshelf, SciENcv, the NIH Comparative Genomics Resource (CGR), NCBI Virus, SRA, RefSeq, foreign contamination screening tools, Taxonomy, iCn3D, ClinVar, GTR, MedGen, dbSNP, ALFA, ClinicalTrials.gov, Pathogen Detection, antimicrobial resistance resources, and PubChem. These resources can be accessed through the NCBI home page at https://www.ncbi.nlm.nih.gov.
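As a rough illustration of the E-utilities programming interface mentioned above, the sketch below only assembles an ESearch request URL and does not contact NCBI. The helper name `build_esearch_url` and the example query are assumptions made for illustration; the `esearch.fcgi` endpoint and its `db`, `term`, `retmax` and `retmode` parameters are part of the public E-utilities service.

```python
from urllib.parse import urlencode

# Base URL of the NCBI E-utilities service.
EUTILS_BASE = "https://eutils.ncbi.nlm.nih.gov/entrez/eutils"


def build_esearch_url(db: str, term: str, retmax: int = 20) -> str:
    """Return an ESearch URL for one Entrez database and one query term.

    Hypothetical helper: it only builds the URL, so it can be inspected
    or passed to any HTTP client the reader prefers.
    """
    params = {"db": db, "term": term, "retmax": retmax, "retmode": "json"}
    return f"{EUTILS_BASE}/esearch.fcgi?{urlencode(params)}"


if __name__ == "__main__":
    # Example query (assumed, not taken from the abstract).
    print(build_esearch_url("pubmed", "beta-lactamase nomenclature"))
```

Keeping URL construction separate from the network call makes the request easy to check before it is sent. Real use should also respect NCBI's usage guidelines on request rate.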
Subjects
Databases, Genetic; National Library of Medicine (U.S.); Biotechnology/instrumentation; Databases, Nucleic Acid; Internet; United States
ABSTRACT
The National Center for Biotechnology Information (NCBI) provides online information resources for biology, including the GenBank® nucleic acid sequence database and the PubMed® database of citations and abstracts published in life science journals. NCBI provides search and retrieval operations for most of these data from 35 distinct databases. The E-utilities serve as the programming interface for most of these databases. New resources include the Comparative Genome Resource (CGR) and the BLAST ClusteredNR database. Resources receiving significant updates in the past year include PubMed, PMC, Bookshelf, IgBLAST, GDV, RefSeq, NCBI Virus, GenBank type assemblies, iCn3D, ClinVar, GTR, dbGaP, ALFA, ClinicalTrials.gov, Pathogen Detection, antimicrobial resistance resources, and PubChem. These resources can be accessed through the NCBI home page at https://www.ncbi.nlm.nih.gov.
Subjects
Databases, Genetic; Databases, Nucleic Acid; United States; National Library of Medicine (U.S.); Sequence Alignment; Biotechnology; Internet
ABSTRACT
Assigning names to β-lactamase variants has been inconsistent and has led to confusion in the published literature. The common availability of whole genome sequencing has resulted in an exponential growth in the number of new β-lactamase genes. In November 2021 an international group of β-lactamase experts met virtually to develop a consensus for the way naturally-occurring β-lactamase genes should be named. This document formalizes the process for naming novel β-lactamases, followed by their subsequent publication.
Subjects
beta-Lactamase Inhibitors; beta-Lactamases; Consensus; beta-Lactamases/genetics
ABSTRACT
Unlike for classes A and B, a standardized amino acid numbering scheme has not been proposed for the class C (AmpC) β-lactamases, which complicates communication in the field. Here, we propose a scheme developed through a collaborative approach that considers both sequence and structure, preserves traditional numbering of catalytically important residues (Ser64, Lys67, Tyr150, and Lys315), is adaptable to new variants or enzymes yet to be discovered and includes a variation for genetic and epidemiological applications.
Subjects
Bacterial Proteins/classification; Gram-Negative Bacteria/genetics; Gram-Positive Bacteria/genetics; Mutation; Terminology as Topic; beta-Lactam Resistance/genetics; beta-Lactamases/classification; Amino Acid Sequence; Anti-Bacterial Agents/chemistry; Anti-Bacterial Agents/pharmacology; Bacterial Proteins/antagonists & inhibitors; Bacterial Proteins/genetics; Bacterial Proteins/metabolism; Gene Expression; Gram-Negative Bacteria/drug effects; Gram-Negative Bacteria/enzymology; Gram-Positive Bacteria/drug effects; Gram-Positive Bacteria/enzymology; International Cooperation; Protein Structure, Secondary; Sequence Alignment; Sequence Homology, Amino Acid; beta-Lactamase Inhibitors/chemistry; beta-Lactamase Inhibitors/pharmacology; beta-Lactamases/genetics; beta-Lactamases/metabolism; beta-Lactams/chemistry; beta-Lactams/pharmacology
ABSTRACT
Carbapenem-resistant Enterobacteriaceae (CRE) are among the most severe threats to the antibiotic era. Multiple different species can exhibit resistance due to many different mechanisms, and many different mobile elements are capable of transferring resistance between lineages. We prospectively sampled CRE from hospitalized patients from three Boston-area hospitals, together with a collection of CRE from a single California hospital, to define the frequency and characteristics of outbreaks and determine whether there is evidence for transfer of strains within and between hospitals and the frequency with which resistance is transferred between lineages or species. We found eight species exhibiting resistance, with the majority of our sample being the sequence type 258 (ST258) lineage of Klebsiella pneumoniae. There was very little evidence of extensive hospital outbreaks, but a great deal of variation in resistance mechanisms and the genomic backgrounds carrying these mechanisms. Local transmission was evident in clear phylogeographic structure between the samples from the two coasts. The most common resistance mechanisms were KPC (K. pneumoniae carbapenemases) beta-lactamases encoded by blaKPC2, blaKPC3, and blaKPC4, which were transferred between strains and species by seven distinct subgroups of the Tn4401 element. We also found evidence for previously unrecognized resistance mechanisms that produced resistance when transformed into a susceptible genomic background. The extensive variation, together with evidence of transmission beyond limited clonal outbreaks, points to multiple unsampled transmission chains throughout the continuum of care, including asymptomatic carriage and transmission of CRE. This finding suggests that to control this threat, we need an aggressive approach to surveillance and isolation.
Subjects
Carbapenems/pharmacology; DNA Transposable Elements/genetics; Disease Outbreaks; Enterobacteriaceae Infections/microbiology; Enterobacteriaceae/drug effects; R Factors/genetics; beta-Lactam Resistance/genetics; Bacterial Proteins/genetics; Boston/epidemiology; Clone Cells; Cross Infection/epidemiology; Cross Infection/microbiology; Cross Infection/transmission; Enterobacteriaceae/enzymology; Enterobacteriaceae/genetics; Enterobacteriaceae Infections/epidemiology; Enterobacteriaceae Infections/transmission; Genetic Variation; Genome, Bacterial; Humans; Prospective Studies; Sequence Alignment; Transformation, Bacterial; beta-Lactam Resistance/physiology; beta-Lactamases/genetics
ABSTRACT
Antimicrobial resistance (AMR) is a major public health problem that requires publicly available tools for rapid analysis. To identify AMR genes in whole-genome sequences, the National Center for Biotechnology Information (NCBI) has produced AMRFinder, a tool that identifies AMR genes using a high-quality curated AMR gene reference database. The Bacterial Antimicrobial Resistance Reference Gene Database consists of up-to-date gene nomenclature, a set of hidden Markov models (HMMs), and a curated protein family hierarchy. Currently, it contains 4,579 antimicrobial resistance proteins and more than 560 HMMs. Here, we describe AMRFinder and its associated database. To assess the predictive ability of AMRFinder, we measured the consistency between predicted AMR genotypes from AMRFinder and resistance phenotypes of 6,242 isolates from the National Antimicrobial Resistance Monitoring System (NARMS). This included 5,425 Salmonella enterica, 770 Campylobacter spp., and 47 Escherichia coli isolates phenotypically tested against various antimicrobial agents. Of 87,679 susceptibility tests performed, 98.4% were consistent with predictions. To assess the accuracy of AMRFinder, we compared its gene symbol output with that of a 2017 version of ResFinder, another publicly available resistance gene detection system. Most gene calls were identical, but there were 1,229 gene symbol differences (8.8%) between them, with differences due to both algorithmic differences and database composition. AMRFinder missed 16 loci that ResFinder found, while ResFinder missed 216 loci that AMRFinder identified. Based on these results, AMRFinder appears to be a highly accurate AMR gene detection system.
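The genotype-phenotype comparison described above can be sketched as a simple agreement rate between predicted and observed resistance. This is an illustrative assumption about how such a consistency figure might be computed, not AMRFinder code. The function name and the toy test results are hypothetical; only the isolate counts are taken from the abstract.

```python
from typing import Iterable, Tuple


def consistency(calls: Iterable[Tuple[bool, bool]]) -> float:
    """Fraction of tests where predicted and observed resistance agree.

    Each item is (predicted_resistant, observed_resistant) for one
    isolate/drug susceptibility test. Simple agreement is an assumed
    scoring rule; the abstract does not spell out the exact criteria.
    """
    calls = list(calls)
    if not calls:
        raise ValueError("no susceptibility tests supplied")
    agree = sum(1 for predicted, observed in calls if predicted == observed)
    return agree / len(calls)


# Isolate counts quoted in the abstract; they add up to the 6,242 isolates.
ISOLATES = {
    "Salmonella enterica": 5425,
    "Campylobacter spp.": 770,
    "Escherichia coli": 47,
}

if __name__ == "__main__":
    # Made-up results for four tests: three agree, one does not.
    toy = [(True, True), (False, False), (True, False), (False, False)]
    print(f"consistency: {consistency(toy):.1%}")
    print(f"isolates: {sum(ISOLATES.values())}")
```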
ABSTRACT
The initial report of the mcr-1 (mobile colistin resistance) gene has led to many reports of mcr-1 variants and other mcr genes from different bacterial species originating from human, animal and environmental samples in different geographical locations. Resistance gene nomenclature is complex and unfortunately problems such as different names being used for the same gene/protein or the same name being used for different genes/proteins are not uncommon. Registries exist for some families, such as bla (β-lactamase) genes, but there is as yet no agreed nomenclature scheme for mcr genes. The National Center for Biotechnology Information (NCBI) recently took over assigning bla allele numbers from the longstanding Lahey β-lactamase website and has agreed to do the same for mcr genes. Here, we propose a nomenclature scheme that we hope will be acceptable to researchers in this area and that will reduce future confusion.
Subjects
Alleles; Anti-Bacterial Agents/pharmacology; Bacteria/genetics; Colistin/pharmacology; Drug Resistance, Bacterial/genetics; Genes, MDR; Bacteria/drug effects; Escherichia coli/drug effects; Escherichia coli Proteins/genetics; Microbial Sensitivity Tests; Terminology as Topic; Whole Genome Sequencing; beta-Lactamases/genetics
ABSTRACT
The faecal flora is a common reservoir for urinary tract infection (UTI), and Escherichia coli (E. coli) is frequently found in this reservoir without causing extraintestinal infection. We investigated these E. coli reservoirs by whole-genome sequencing a large collection of E. coli from healthy controls (faecal), who had never previously had UTI, and from UTI patients (faecal and urinary) sampled from the same geographical area. We compared MLST types, phylogenetic relationship, accessory genome content and FimH type between patient and control faecal isolates as well as between UTI and faecal-only isolates, respectively. Comparison of the accessory genome of UTI isolates to faecal isolates revealed 35 gene families which were significantly more prevalent in the UTI isolates compared to the faecal isolates, although none of these were unique to one of the two groups. Of these 35, 22 belonged to a genomic island and three putatively belonged to a type VI secretion system (T6SS). MLST types and SNP phylogeny indicated no clustering of the UTI or faecal E. coli from patients distinct from the control faecal isolates, although there was an overrepresentation of UTI isolates belonging to clonal lineages CC73 and CC12. One combination of mutations in FimH, N70S/S78N, was significantly associated with UTI, while phylogenetic analysis of FimH and fimH identified no signs of distinct adaptation of UTI isolates compared to faecal-only isolates not causing UTI. In summary, the results showed that (i) healthy women who had never previously had UTI carried faecal E. coli which were overall closely related to UTI and faecal isolates from UTI patients; (ii) UTI isolates do not cluster separately from faecal-only isolates based on SNP analysis; (iii) 22 gene families of a genomic island, putative T6SS proteins as well as specific metabolism and virulence associated proteins were significantly more common in UTI isolates compared to faecal-only isolates; and (iv) evolution of fimH for these isolates was not linked to the clinical source of the isolates, apart from the mutation combination N70S/S78N, which was correlated with UTI isolates of phylogroup B2. Combined, these findings illustrate that faecal and UTI isolates, as well as faecal-only and faecal-UTI isolates, are closely related and can only be distinguished, if at all, by their accessory genome.
Subjects
Escherichia coli Infections/microbiology; Escherichia coli/classification; Escherichia coli/isolation & purification; Gastrointestinal Tract/microbiology; Genome, Bacterial; Genotype; Urinary Tract Infections/microbiology; Adhesins, Escherichia coli/genetics; Cluster Analysis; Escherichia coli/genetics; Female; Fimbriae Proteins/genetics; Genetic Variation; Humans; Multilocus Sequence Typing; Phylogeny; Whole Genome Sequencing
ABSTRACT
The majority of extraintestinal pathogenic Escherichia coli (ExPEC) causing urinary tract infections (UTI) are found in the patient's own gut flora, but only limited knowledge is available on the potential adaptation that may occur in the bacteria in order to traverse the perineum and successfully infect the urinary tract. Here, matching pairs of faecal and UTI isolates from 42 patients were compared pairwise using in-depth whole-genome sequencing to investigate whether genetic changes were evident for successful colonization in these two different environments. The identified non-synonymous mutations (0-12 substitutions in each pair) were primarily associated with genes encoding virulence factors and nutrient metabolism, and indications of parallel evolution were observed in genes encoding the major phase-variable protein antigen 43, a toxin/antitoxin locus and haemolysin B. No differences in virulence potential were observed in a mouse UTI model for five matching faecal and UTI isolates with or without mutations in antigen 43 and haemolysin B. Variations in plasmid content were observed in only four of the 42 pairs. Although we observed mutations in known UTI virulence genes for a few pairs, the majority showed no detectable differences with respect to mutations or mobilome when compared to their faecal counterpart. The results show that UPECs are successful in colonizing both the bladder and gut without adaptation.
Subjects
Escherichia coli/isolation & purification; Escherichia coli/physiology; Feces/microbiology; Urinary Tract/microbiology; Adaptation, Biological; Adaptation, Physiological; Adult; Animals; Escherichia coli/genetics; Female; Genetic Variation; Genome, Bacterial; Genotype; Humans; Mice; Sequence Analysis, DNA
ABSTRACT
Hospital-associated infections caused by methicillin-resistant Staphylococcus aureus (MRSA) are a global health burden dominated by a small number of bacterial clones. The pandemic EMRSA-16 clone (ST36-II) has been widespread in UK hospitals for 20 y, but its evolutionary origin and the molecular basis for its hospital association are unclear. We carried out a Bayesian phylogenetic reconstruction on the basis of the genome sequences of 87 S. aureus isolates including 60 EMRSA-16 and 27 additional clonal complex 30 (CC30) isolates, collected from patients in three continents over a 53-y period. The three major pandemic clones to originate from the CC30 lineage, including phage type 80/81, Southwest Pacific, and EMRSA-16, shared a most recent common ancestor that existed over 100 y ago, whereas the hospital-associated EMRSA-16 clone is estimated to have emerged about 35 y ago. Our CC30 genome-wide analysis revealed striking molecular correlates of hospital- or community-associated pandemics represented by mobile genetic elements and nonsynonymous mutations affecting antibiotic resistance and virulence. Importantly, phylogeographic analysis indicates that EMRSA-16 spread within the United Kingdom by transmission from hospitals in large population centers in London and Glasgow to regional health-care settings, implicating patient referrals as an important cause of nationwide transmission. Taken together, the high-resolution phylogenomic approach used resulted in a unique understanding of the emergence and transmission of a major MRSA clone and provided molecular correlates of its hospital adaptation. Similar approaches for hospital-associated clones of other bacterial pathogens may inform appropriate measures for controlling their intra- and interhospital spread.
Subjects
Cross Infection/transmission; Genome, Bacterial/genetics; Methicillin-Resistant Staphylococcus aureus/genetics; Phylogeny; Staphylococcal Infections/epidemiology; Staphylococcal Infections/transmission; Base Sequence; Bayes Theorem; Humans; Methicillin-Resistant Staphylococcus aureus/pathogenicity; Models, Genetic; Molecular Sequence Data; Phylogeography; Sequence Alignment; Sequence Analysis, DNA; Species Specificity; United Kingdom/epidemiology; Virulence
ABSTRACT
The degree to which molecular epidemiology reveals information about the sources and transmission patterns of an outbreak depends on the resolution of the technology used and the samples studied. Isolates of Escherichia coli O104:H4 from the outbreak centered in Germany in May-July 2011, and the much smaller outbreak in southwest France in June 2011, were indistinguishable by standard tests. We report a molecular epidemiological analysis using multiplatform whole-genome sequencing and analysis of multiple isolates from the German and French outbreaks. Isolates from the German outbreak showed remarkably little diversity, with only two single nucleotide polymorphisms (SNPs) found in isolates from four individuals. Surprisingly, we found much greater diversity (19 SNPs) in isolates from seven individuals infected in the French outbreak. The German isolates form a clade within the more diverse French outbreak strains. Moreover, five isolates derived from a single infected individual from the French outbreak had extremely limited diversity. The striking difference in diversity between the German and French outbreak samples is consistent with several hypotheses, including a bottleneck that purged diversity in the German isolates, variation in mutation rates in the two E. coli outbreak populations, or uneven distribution of diversity in the seed populations that led to each outbreak.
Subjects
Disease Outbreaks/statistics & numerical data; Escherichia coli Infections/epidemiology; Escherichia coli Infections/microbiology; Escherichia coli/genetics; Escherichia coli/isolation & purification; Escherichia coli Infections/genetics; Europe/epidemiology; Humans; Models, Genetic; Phylogeny; Polymorphism, Single Nucleotide/genetics
ABSTRACT
BACKGROUND: Methicillin-resistant Staphylococcus aureus (MRSA) colonization predicts later infection, with both host and pathogen determinants of invasive disease. METHODS: This nested case-control study evaluates predictors of MRSA bacteremia in an 8-intensive care unit (ICU) prospective adult cohort from 1 September 2003 through 30 April 2005 with active MRSA surveillance and collection of ICU, post-ICU, and readmission MRSA isolates. We selected MRSA carriers who did (cases) and those who did not (controls) develop MRSA bacteremia. Generating assembled genome sequences, we evaluated 30 MRSA genes potentially associated with virulence and invasion. Using multivariable Cox proportional hazards regression, we assessed the association of these genes with MRSA bacteremia, controlling for host risk factors. RESULTS: We collected 1578 MRSA isolates from 520 patients. We analyzed host and pathogen factors for 33 cases and 121 controls. Predictors of MRSA bacteremia included a diagnosis of cancer, presence of a central venous catheter, hyperglycemia (glucose level, >200 mg/dL), and infection with a MRSA strain carrying the gene for staphylococcal enterotoxin P (sep). Receipt of an anti-MRSA medication had a significant protective effect. CONCLUSIONS: In an analysis controlling for host factors, colonization with MRSA carrying sep increased the risk of MRSA bacteremia. Identification of risk-adjusted genetic determinants of virulence may help to improve prediction of invasive disease and suggest new targets for therapeutic intervention.
Subjects
Bacteremia/microbiology; Methicillin-Resistant Staphylococcus aureus/pathogenicity; Staphylococcal Infections/microbiology; Aged; Aged, 80 and over; Bacterial Toxins/genetics; Case-Control Studies; Enterotoxins/genetics; Female; Hospitalization; Humans; Intensive Care Units; Male; Methicillin-Resistant Staphylococcus aureus/genetics; Methicillin-Resistant Staphylococcus aureus/isolation & purification; Middle Aged; Risk Factors
ABSTRACT
Whole-genome sequencing of a collection of 103 Acinetobacter strains belonging to 22 validly named species and another 16 putative species allowed detection of genes for 50 new class D β-lactamases and 65 new Acinetobacter-derived cephalosporinases (ADC). All oxacillinases (OXA) contained the three typical motifs of class D β-lactamases, STFK, (F/Y)GN, and K(S/T)G. The phylogenetic tree drawn from the OXA sequences led to an increase in the number of OXA groups from 7 to 18. The topologies of the OXA and RpoB phylogenetic trees were similar, supporting the ancient acquisition of blaOXA genes by Acinetobacter species. The class D β-lactamase genes appeared to be intrinsic to several species, such as Acinetobacter baumannii, Acinetobacter pittii, Acinetobacter calcoaceticus, and Acinetobacter lwoffii. Neither blaOXA-40/143- nor blaOXA-58-like genes were detected, and their origin remains therefore unknown. The phylogenetic tree analysis based on the alignment of the sequences deduced from blaADC revealed five main clusters, one containing ADC belonging to species closely related to A. baumannii and the others composed of cephalosporinases from the remaining species. No indication of blaOXA or blaADC transfer was observed between distantly related species, except for blaOXA-279, possibly transferred from Acinetobacter genomic species 6 to Acinetobacter parvus. Analysis of β-lactam susceptibility of seven strains harboring new oxacillinases and cloning of the corresponding genes in Escherichia coli and in a susceptible A. baumannii strain indicated very weak hydrolysis of carbapenems. Overall, this study reveals a large pool of β-lactamases in different Acinetobacter spp., potentially transferable to pathogenic strains of the genus.
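The three class D motifs named above, STFK, (F/Y)GN and K(S/T)G, translate directly into regular expressions. The sketch below is a minimal illustration and not the authors' pipeline. The function names are hypothetical and the example peptide is made up, not a real OXA sequence. It also does not check motif order or spacing, which a real classifier would need to consider.

```python
import re
from typing import Dict, List

# Regular expressions for the three motifs quoted in the abstract.
MOTIFS = {
    "STFK": re.compile(r"STFK"),
    "(F/Y)GN": re.compile(r"[FY]GN"),
    "K(S/T)G": re.compile(r"K[ST]G"),
}


def find_motifs(protein: str) -> Dict[str, List[int]]:
    """Return 0-based start positions of each motif in a protein sequence."""
    seq = protein.upper()
    return {
        name: [match.start() for match in pattern.finditer(seq)]
        for name, pattern in MOTIFS.items()
    }


def has_all_motifs(protein: str) -> bool:
    """True if every motif occurs at least once (order is not checked)."""
    return all(find_motifs(protein).values())


if __name__ == "__main__":
    toy_peptide = "MASTFKAAYGNLLKTGW"  # invented sequence for illustration
    print(find_motifs(toy_peptide))
    print(has_all_motifs(toy_peptide))
```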
Subjects
Acinetobacter/genetics; Cephalosporinase/classification; Cephalosporinase/genetics; Phylogeny; Acinetobacter/classification; Acinetobacter/drug effects; Acinetobacter/enzymology; Amino Acid Motifs; Anti-Bacterial Agents/metabolism; Anti-Bacterial Agents/pharmacology; Cephalosporinase/metabolism; Escherichia coli/genetics; Escherichia coli/metabolism; Gene Expression; Molecular Sequence Data; Plasmids; Recombinant Proteins/classification; Recombinant Proteins/genetics; Recombinant Proteins/metabolism; Sequence Alignment; beta-Lactams/metabolism; beta-Lactams/pharmacology
ABSTRACT
Defining bacterial species remains a challenging problem even for the model bacterium Escherichia coli and has major practical consequences for reliable diagnosis of infectious disease agents and regulations for transport and possession of organisms of economic importance. E. coli traditionally is thought to live within the gastrointestinal tract of humans and other warm-blooded animals and not to survive for extended periods outside its host; this understanding is the basis for its widespread use as a fecal contamination indicator. Here, we report the genome sequences of nine environmentally adapted strains that are phenotypically and taxonomically indistinguishable from typical E. coli (commensal or pathogenic). We find, however, that the commensal genomes encode for more functions that are important for fitness in the human gut, do not exchange genetic material with their environmental counterparts, and hence do not evolve according to the recently proposed fragmented speciation model. These findings are consistent with a more stringent and ecologic definition for bacterial species than the current definition and provide means to start replacing traditional approaches of defining distinctive phenotypes for new species with omics-based procedures. They also have important implications for reliable diagnosis and regulation of pathogenic E. coli and for the coliform cell-counting test.
Subjects
Escherichia coli/genetics; Evolution, Molecular; Genome, Bacterial/genetics; Models, Biological; Escherichia coli Infections/genetics; Humans; Sequence Analysis, DNA/methods
ABSTRACT
Signatures of balancing selection can highlight polymorphisms and functions that are important to the long-term fitness of a species. We performed a first genome-wide scan for balancing selection in a bacterial species, Staphylococcus aureus, which is a common cause of serious antimicrobial-resistant infections of humans. Using a sliding window approach, the genomes of 16 strains of S. aureus, including 5 new genome sequences presented here, and 1 outgroup strain of S. epidermidis were scanned for signatures of balancing selection. A total of 195 short windows were investigated based on their extreme values of both Tajima's D (>2.03) and π/K ratios (>0.12) relative to the rest of the genome. To test the unusualness of these windows, an Approximate Bayesian Computation framework was used to select a null demographic model that better accounted for the observed data than did the standard neutral model. A total of 186 windows were demonstrated to be unusual under the null model and, thus, represented candidate loci under balancing selection. These 186 candidate windows were located within 99 candidate genes that were spread across 62 different loci. Nearly all the signal (97.2%) was located within coding sequences; balancing selection on gene regulation apparently occurs through the targeting of global regulators such as agr and gra/aps. The agr locus had some of the strongest signatures of balancing selection, which provides new insight into the causes of diversity at this locus. The list of candidate genes included multiple virulence-associated genes and was significantly enriched for functions in amino acid and inorganic ion transport and metabolism and in defense mechanisms against innate immunity and antimicrobials, highlighting these particular functions as important to the fitness of this pathogen.
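The sliding-window scan described above relies on Tajima's D, which compares the mean number of pairwise differences with the number of segregating sites in a window. The sketch below uses the standard Tajima (1989) formula on a toy alignment. It is an illustration under simplifying assumptions, not the study's pipeline: gaps and ambiguous bases are treated as ordinary alleles, the π/K ratio is not computed, and the function names and sequences are hypothetical.

```python
from collections import Counter
from math import sqrt
from typing import Iterator, List, Optional, Tuple


def tajimas_d(seqs: List[str]) -> Optional[float]:
    """Tajima's D for aligned sequences; None if it is undefined."""
    n = len(seqs)
    if n < 4:  # the variance term is zero for n = 2 or 3
        return None
    if any(len(s) != len(seqs[0]) for s in seqs):
        raise ValueError("sequences must be aligned to equal length")
    total_pairs = n * (n - 1) // 2
    seg_sites = 0   # S: number of polymorphic columns
    pair_diffs = 0  # differing pairs summed over all columns
    for column in zip(*seqs):
        counts = Counter(column)
        if len(counts) > 1:
            seg_sites += 1
            same = sum(c * (c - 1) // 2 for c in counts.values())
            pair_diffs += total_pairs - same
    if seg_sites == 0:
        return None
    pi = pair_diffs / total_pairs  # mean pairwise differences
    a1 = sum(1 / i for i in range(1, n))
    a2 = sum(1 / i**2 for i in range(1, n))
    b1 = (n + 1) / (3 * (n - 1))
    b2 = 2 * (n * n + n + 3) / (9 * n * (n - 1))
    c1 = b1 - 1 / a1
    c2 = b2 - (n + 2) / (a1 * n) + a2 / a1**2
    e1 = c1 / a1
    e2 = c2 / (a1**2 + a2)
    variance = e1 * seg_sites + e2 * seg_sites * (seg_sites - 1)
    return (pi - seg_sites / a1) / sqrt(variance)


def scan(seqs: List[str], window: int, step: int) -> Iterator[Tuple[int, Optional[float]]]:
    """Yield (window start, Tajima's D) along the alignment."""
    for start in range(0, len(seqs[0]) - window + 1, step):
        yield start, tajimas_d([s[start:start + window] for s in seqs])


if __name__ == "__main__":
    # Toy alignment with intermediate-frequency variants at every site.
    balanced = ["AAAA", "AAAA", "TTTT", "TTTT"]
    for start, d in scan(balanced, window=2, step=2):
        print(start, d)
```

Positive values indicate an excess of intermediate-frequency variants, the pattern expected under balancing selection. In a scan of this kind, windows would then be compared against a cutoff such as the D > 2.03 quoted in the abstract.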
Subjects
Genes, Bacterial; Selection, Genetic; Staphylococcus aureus/genetics; Bayes Theorem; Evolution, Molecular; Genetic Fitness; Genetic Loci; Metagenomics; Multigene Family; Phylogeny; Sequence Alignment; Staphylococcus epidermidis/genetics
ABSTRACT
Understanding the fine-structure molecular architecture of bacterial epidemics has been a long-sought goal of infectious disease research. We used short-read-length DNA sequencing coupled with mass spectroscopy analysis of SNPs to study the molecular pathogenomics of three successive epidemics of invasive infections involving 344 serotype M3 group A Streptococcus in Ontario, Canada. Sequencing the genome of 95 strains from the three epidemics, coupled with analysis of 280 biallelic SNPs in all 344 strains, revealed an unexpectedly complex population structure composed of a dynamic mixture of distinct clonally related complexes. We discovered that each epidemic is dominated by micro- and macrobursts of multiple emergent clones, some with distinct strain genotype-patient phenotype relationships. On average, strains were differentiated from one another by only 49 SNPs and 11 insertion-deletion events (indels) in the core genome. Ten percent of SNPs are strain specific; that is, each strain has a unique genome sequence. We identified nonrandom temporal-spatial patterns of strain distribution within and between the epidemic peaks. The extensive full-genome data permitted us to identify genes with significantly increased rates of nonsynonymous (amino acid-altering) nucleotide polymorphisms, thereby providing clues about selective forces operative in the host. Comparative expression microarray analysis revealed that closely related strains differentiated by seemingly modest genetic changes can have significantly divergent transcriptomes. We conclude that enhanced understanding of bacterial epidemics requires a deep-sequencing, geographically centric, comparative pathogenomics strategy.
Subjects
Disease Outbreaks; Genome, Bacterial; Streptococcal Infections/epidemiology; Streptococcus pyogenes/isolation & purification; Biological Evolution; Codon, Terminator; Genotype; Humans; Mass Spectrometry; Oligonucleotide Array Sequence Analysis; Ontario/epidemiology; Phenotype; Phylogeny; Polymorphism, Single Nucleotide; Streptococcus pyogenes/pathogenicity; Virulence
ABSTRACT
The sharing of genome sequences in online data repositories allows for large-scale analyses of specific genes or gene families. This can result in the detection of novel gene subtypes as well as the development of improved detection methods. Here, we used publicly available WGS data to detect a novel Stx subtype, Stx2n, in two clinical E. coli strains isolated in the USA. During this process, additional Stx2 subtypes were detected; six Stx2j strains, one Stx2m strain, and one Stx2o strain were all analyzed for variability from the originally described subtypes. Complete genome sequences were assembled from short- or long-read sequencing and analyzed for serotype and ST type. The WGS data from Stx2n- and Stx2o-producing STEC strains were further analyzed for virulence genes, pro-phages, and phage insertion sites. Nucleotide and amino acid maximum parsimony trees showed expected clustering of the previously described subtypes and a clear separation of the novel Stx2n subtype. WGS data were used to design OMNI PCR primers for the detection of all known stx1 (283 bp amplicon), stx2 (400 bp amplicon), intimin encoded by eae (221 bp amplicon), and stx2f (438 bp amplicon) subtypes. These primers were tested in three different laboratories, using standard reference strains. An analysis of the complete genome sequence showed variability in serogroup, virulence genes, and ST type, and Stx2 pro-phages showed variability in size, gene composition, and phage insertion sites. The strains with Stx2j, Stx2m, Stx2n, and Stx2o showed toxicity to Vero cells. The Stx2j-carrying strain 2012C-4221 was induced when grown with sub-inhibitory concentrations of ciprofloxacin, and toxicity was detected. Taken together, these data highlight the need to reinforce genomic surveillance to identify the emergence of potential new Stx2 or Stx1 variants. Such surveillance is of paramount importance to public health. Per our description in this study, we suggest that 2017C-4317 be designated as the Stx2n type-strain.
ABSTRACT
Fast, efficient public health actions require well-organized and coordinated systems that can supply timely and accurate knowledge. Public databases of pathogen genomic data, such as the International Nucleotide Sequence Database Collaboration (INSDC), have become essential tools for efficient public health decisions. However, these international resources began primarily for academic purposes, rather than for surveillance or interventions. Now, queries need to access not only the whole genomes of multiple pathogens but also make connections using robust contextual metadata to identify issues of public health relevance. Databases that over time developed a patchwork of submission formats and requirements need to be consistently organized and coordinated internationally to allow effective searches. To help resolve these issues, we propose a common pathogen data structure called the Pathogen Data Object Model (DOM) that will formalize the minimum pieces of sequence data and contextual data necessary for general public health uses, while recognizing that submitters will likely withhold a wide range of non-public contextual data. Further, we propose contributors use the Pathogen DOM for all pathogen submissions (bacterial, viral, fungal, and parasites), which will simplify data submissions and provide a consistent and transparent data structure for downstream data analyses. We also highlight how improved submission tools can support the Pathogen DOM, offering users additional easy-to-use methods to ensure this structure is followed.
Subjects
Nucleotides; Public Health; Base Sequence; Genomics/methods; Databases, Nucleic Acid
ABSTRACT
Staphylococcus aureus is a major cause of antimicrobial-resistant infections of humans. Hybrids of S. aureus, which originate from large-scale chromosomal recombinations between parents of distinct genetic backgrounds, are of interest from clinical and evolutionary perspectives. We present draft genome sequences of two S. aureus hybrids of sequence type 34 (ST34) and ST42.