Your browser doesn't support javascript.
loading
Show: 20 | 50 | 100
Results 1 - 14 de 14
Filter
Add more filters










Publication year range
1.
PLoS Pathog ; 7(8): e1002191, 2011 Aug.
Article in English | MEDLINE | ID: mdl-21876672

ABSTRACT

The genus Salmonella contains two species, S. bongori and S. enterica. Compared to the well-studied S. enterica there is a marked lack of information regarding the genetic makeup and diversity of S. bongori. S. bongori has been found predominantly associated with cold-blooded animals, but it can infect humans. To define the phylogeny of this species, and compare it to S. enterica, we have sequenced 28 isolates representing most of the known diversity of S. bongori. This cross-species analysis allowed us to confidently differentiate ancestral functions from those acquired following speciation, which include both metabolic and virulence-associated capacities. We show that, although S. bongori inherited a basic set of Salmonella common virulence functions, it has subsequently elaborated on this in a different direction to S. enterica. It is an established feature of S. enterica evolution that the acquisition of the type III secretion systems (T3SS-1 and T3SS-2) has been followed by the sequential acquisition of genes encoding secreted targets, termed effectors proteins. We show that this is also true of S. bongori, which has acquired an array of novel effector proteins (sboA-L). All but two of these effectors have no significant S. enterica homologues and instead are highly similar to those found in enteropathogenic Escherichia coli (EPEC). Remarkably, SboH is found to be a chimeric effector protein, encoded by a fusion of the T3SS-1 effector gene sopA and a gene highly similar to the EPEC effector nleH from enteropathogenic E. coli. We demonstrate that representatives of these new effectors are translocated and that SboH, similarly to NleH, blocks intrinsic apoptotic pathways while being targeted to the mitochondria by the SopA part of the fusion. This work suggests that S. bongori has inherited the ancestral Salmonella virulence gene set, but has adapted by incorporating virulence determinants that resemble those employed by EPEC.


Subject(s)
Biological Evolution , Salmonella/genetics , Animals , Enteropathogenic Escherichia coli/genetics , Genes, Bacterial , Genomic Islands/genetics , Humans , Phylogeny , Salmonella enterica/genetics , Sequence Analysis, DNA , Translocation, Genetic , Virulence/genetics , Virulence Factors/genetics
2.
BMC Genomics ; 12: 120, 2011 Feb 18.
Article in English | MEDLINE | ID: mdl-21333003

ABSTRACT

BACKGROUND: Small interspersed repeats are commonly found in many bacterial chromosomes. Two families of repeats (BOX and RUP) have previously been identified in the genome of Streptococcus pneumoniae, a nasopharyngeal commensal and respiratory pathogen of humans. However, little is known about the role they play in pneumococcal genetics. RESULTS: Analysis of the genome of S. pneumoniae ATCC 700669 revealed the presence of a third repeat family, which we have named SPRITE. All three repeats are present at a reduced density in the genome of the closely related species S. mitis. However, they are almost entirely absent from all other streptococci, although a set of elements related to the pneumococcal BOX repeat was identified in the zoonotic pathogen S. suis. In conjunction with information regarding their distribution within the pneumococcal chromosome, this suggests that it is unlikely that these repeats are specialised sequences performing a particular role for the host, but rather that they constitute parasitic elements. However, comparing insertion sites between pneumococcal sequences indicates that they appear to transpose at a much lower rate than IS elements. Some large BOX elements in S. pneumoniae were found to encode open reading frames on both strands of the genome, whilst another was found to form a composite RNA structure with two T box riboswitches. In multiple cases, such BOX elements were demonstrated as being expressed using directional RNA-seq and RT-PCR. CONCLUSIONS: BOX, RUP and SPRITE repeats appear to have proliferated extensively throughout the pneumococcal chromosome during the species' past, but novel insertions are currently occurring at a relatively slow rate. Through their extensive secondary structures, they seem likely to affect the expression of genes with which they are co-transcribed. Software for annotation of these repeats is freely available from ftp://ftp.sanger.ac.uk/pub/pathogens/strep_repeats/.


Subject(s)
Genome, Bacterial , Repetitive Sequences, Nucleic Acid , Streptococcus pneumoniae/genetics , Base Sequence , DNA, Bacterial/genetics , Molecular Sequence Annotation , Molecular Sequence Data , Nucleic Acid Conformation , Open Reading Frames , Sequence Analysis, DNA
3.
BMC Genomics ; 11: 652, 2010 Nov 23.
Article in English | MEDLINE | ID: mdl-21092259

ABSTRACT

BACKGROUND: The genus Neisseria contains two important yet very different pathogens, N. meningitidis and N. gonorrhoeae, in addition to non-pathogenic species, of which N. lactamica is the best characterized. Genomic comparisons of these three bacteria will provide insights into the mechanisms and evolution of pathogenesis in this group of organisms, which are applicable to understanding these processes more generally. RESULTS: Non-pathogenic N. lactamica exhibits very similar population structure and levels of diversity to the meningococcus, whilst gonococci are essentially recent descendents of a single clone. All three species share a common core gene set estimated to comprise around 1190 CDSs, corresponding to about 60% of the genome. However, some of the nucleotide sequence diversity within this core genome is particular to each group, indicating that cross-species recombination is rare in this shared core gene set. Other than the meningococcal cps region, which encodes the polysaccharide capsule, relatively few members of the large accessory gene pool are exclusive to one species group, and cross-species recombination within this accessory genome is frequent. CONCLUSION: The three Neisseria species groups represent coherent biological and genetic groupings which appear to be maintained by low rates of inter-species horizontal genetic exchange within the core genome. There is extensive evidence for exchange among positively selected genes and the accessory genome and some evidence of hitch-hiking of housekeeping genes with other loci. It is not possible to define a 'pathogenome' for this group of organisms and the disease causing phenotypes are therefore likely to be complex, polygenic, and different among the various disease-associated phenotypes observed.


Subject(s)
Evolution, Molecular , Genes, Bacterial/genetics , Neisseria lactamica/genetics , Neisseria lactamica/isolation & purification , Bacterial Proteins/genetics , Bacterial Proteins/metabolism , Base Composition/genetics , Base Sequence , Gene Order/genetics , Molecular Sequence Data , Neisseria lactamica/pathogenicity , Phylogeny , Sequence Homology, Nucleic Acid , Virulence/genetics
6.
Genome Biol ; 10(5): R51, 2009.
Article in English | MEDLINE | ID: mdl-19432983

ABSTRACT

BACKGROUND: Pseudomonas fluorescens are common soil bacteria that can improve plant health through nutrient cycling, pathogen antagonism and induction of plant defenses. The genome sequences of strains SBW25 and Pf0-1 were determined and compared to each other and with P. fluorescens Pf-5. A functional genomic in vivo expression technology (IVET) screen provided insight into genes used by P. fluorescens in its natural environment and an improved understanding of the ecological significance of diversity within this species. RESULTS: Comparisons of three P. fluorescens genomes (SBW25, Pf0-1, Pf-5) revealed considerable divergence: 61% of genes are shared, the majority located near the replication origin. Phylogenetic and average amino acid identity analyses showed a low overall relationship. A functional screen of SBW25 defined 125 plant-induced genes including a range of functions specific to the plant environment. Orthologues of 83 of these exist in Pf0-1 and Pf-5, with 73 shared by both strains. The P. fluorescens genomes carry numerous complex repetitive DNA sequences, some resembling Miniature Inverted-repeat Transposable Elements (MITEs). In SBW25, repeat density and distribution revealed 'repeat deserts' lacking repeats, covering approximately 40% of the genome. CONCLUSIONS: P. fluorescens genomes are highly diverse. Strain-specific regions around the replication terminus suggest genome compartmentalization. The genomic heterogeneity among the three strains is reminiscent of a species complex rather than a single species. That 42% of plant-inducible genes were not shared by all strains reinforces this conclusion and shows that ecological success requires specialized and core functions. The diversity also indicates the significant size of genetic information within the Pseudomonas pan genome.


Subject(s)
Ecosystem , Genome, Bacterial , Plants/microbiology , Pseudomonas fluorescens/genetics , Plants/metabolism , Pseudomonas fluorescens/classification , Pseudomonas fluorescens/metabolism
7.
PLoS One ; 3(7): e2567, 2008 Jul 02.
Article in English | MEDLINE | ID: mdl-18596979

ABSTRACT

This work centres on the genomic comparisons of two closely-related nitrogen-fixing symbiotic bacteria, Rhizobium leguminosarum biovar viciae 3841 and Rhizobium etli CFN42. These strains maintain a stable genomic core that is also common to other rhizobia species plus a very variable and significant accessory component. The chromosomes are highly syntenic, whereas plasmids are related by fewer syntenic blocks and have mosaic structures. The pairs of plasmids p42f-pRL12, p42e-pRL11 and p42b-pRL9 as well large parts of p42c with pRL10 are shown to be similar, whereas the symbiotic plasmids (p42d and pRL10) are structurally unrelated and seem to follow distinct evolutionary paths. Even though purifying selection is acting on the whole genome, the accessory component is evolving more rapidly. This component is constituted largely for proteins for transport of diverse metabolites and elements of external origin. The present analysis allows us to conclude that a heterogeneous and quickly diversifying group of plasmids co-exists in a common genomic framework.


Subject(s)
Genome, Bacterial , Nitrogen Fixation/genetics , Plasmids/genetics , Rhizobium etli/genetics , Rhizobium leguminosarum/genetics , Symbiosis/genetics , Evolution, Molecular , Models, Genetic , Phylogeny , Plasmids/metabolism , Synteny
8.
Genome Res ; 18(10): 1624-37, 2008 Oct.
Article in English | MEDLINE | ID: mdl-18583645

ABSTRACT

We have determined the complete genome sequences of a host-promiscuous Salmonella enterica serovar Enteritidis PT4 isolate P125109 and a chicken-restricted Salmonella enterica serovar Gallinarum isolate 287/91. Genome comparisons between these and other Salmonella isolates indicate that S. Gallinarum 287/91 is a recently evolved descendent of S. Enteritidis. Significantly, the genome of S. Gallinarum has undergone extensive degradation through deletion and pseudogene formation. Comparison of the pseudogenes in S. Gallinarum with those identified previously in other host-adapted bacteria reveals the loss of many common functional traits and provides insights into possible mechanisms of host and tissue adaptation. We propose that experimental analysis in chickens and mice of S. Enteritidis-harboring mutations in functional homologs of the pseudogenes present in S. Gallinarum could provide an experimentally tractable route toward unraveling the genetic basis of host adaptation in S. enterica.


Subject(s)
Evolution, Molecular , Genome, Bacterial , Salmonella enteritidis/genetics , Salmonella/genetics , Adaptation, Physiological/genetics , Animals , Chickens/microbiology , Mice , Molecular Sequence Data , Salmonella Infections, Animal/genetics , Salmonella Infections, Animal/microbiology
9.
Genome Biol ; 9(4): R74, 2008 Apr 17.
Article in English | MEDLINE | ID: mdl-18419807

ABSTRACT

BACKGROUND: Stenotrophomonas maltophilia is a nosocomial opportunistic pathogen of the Xanthomonadaceae. The organism has been isolated from both clinical and soil environments in addition to the sputum of cystic fibrosis patients and the immunocompromised. Whilst relatively distant phylogenetically, the closest sequenced relatives of S. maltophilia are the plant pathogenic xanthomonads. RESULTS: The genome of the bacteremia-associated isolate S. maltophilia K279a is 4,851,126 bp and of high G+C content. The sequence reveals an organism with a remarkable capacity for drug and heavy metal resistance. In addition to a number of genes conferring resistance to antimicrobial drugs of different classes via alternative mechanisms, nine resistance-nodulation-division (RND)-type putative antimicrobial efflux systems are present. Functional genomic analysis confirms a role in drug resistance for several of the novel RND efflux pumps. S. maltophilia possesses potentially mobile regions of DNA and encodes a number of pili and fimbriae likely to be involved in adhesion and biofilm formation that may also contribute to increased antimicrobial drug resistance. CONCLUSION: The panoply of antimicrobial drug resistance genes and mobile genetic elements found suggests that the organism can act as a reservoir of antimicrobial drug resistance determinants in a clinical environment, which is an issue of considerable concern.


Subject(s)
Drug Resistance, Microbial/genetics , Genome, Bacterial , Stenotrophomonas maltophilia/genetics , Metals, Heavy/toxicity , Stenotrophomonas maltophilia/physiology
10.
Genome Res ; 18(2): 331-42, 2008 Feb.
Article in English | MEDLINE | ID: mdl-18071028

ABSTRACT

Large inserts of horizontally acquired DNA that contain functionally related genes with limited phylogenetic distribution are often referred to as genomic islands (GIs), and structural definitions of these islands, based on common features, have been proposed. Although a large number of mobile elements fall well within the GI definition, there are several concerns about the structural consensus for GIs: The current GI definition was put forward 10 yr ago when only 12 complete bacterial genomes were available, a large number of GIs deviate from that definition, and in silico predictions assuming a full/partial GI structural model bias the sampling of the GI structural space toward "well-structured" GIs. In this study, the structural features of genomic regions are sampled by a hypothesis-free, bottom-up search, and these are exploited in a machine learning approach with the aim of explicitly quantifying and modeling the contribution of each feature to the GI structure. Performing a whole-genome-based comparative analysis between 37 strains of three different genera and 12 outgroup genomes, 668 genomic regions were sampled and used to train structural GI models. The data show that, overall, GIs from the three different genera fall into distinct, genus-specific structural families. However, decreasing the taxa resolution, by studying GI structures across different genus boundaries, provides models that converge on a fairly similar GI structure, further suggesting that GIs can be seen as a superfamily of mobile elements, with core and variable structural features, rather than a well-defined family.


Subject(s)
Algorithms , Artificial Intelligence , Computational Biology/methods , Genomic Islands/genetics , Models, Genetic , Phylogeny , Bacteria/genetics , Cluster Analysis , Likelihood Functions , Species Specificity
12.
Genome Biol ; 8(6): R100, 2007.
Article in English | MEDLINE | ID: mdl-17547764

ABSTRACT

BACKGROUND: DNA sequences that are shared between closely related organisms while being absent from their common ancestor and from sister lineages of that ancestor are likely to have been acquired by horizontal gene transfer. Over time, the composition of those sequences tends to become more similar to the compositional signature of their host (amelioration). RESULTS: From a whole-genome comparative analysis of eleven Salmonella, three Escherichia coli and one Shigella strain, we inferred the relative time of insertion of putative horizontally acquired (PHA) genes in three Salmonella strains on different branches of the S. enterica phylogenetic tree. Compositional analysis suggests that most of the PHA genes are still undergoing an amelioration process and shows a clear correlation between time of insertion and the level of amelioration. CONCLUSION: The results show that older insertions include almost all functional classes. However, very recent horizontal transfer events in the Salmonella lineage involve primarily prophage elements that are shared only between very recently diverged lineages; despite this, the prophage sequence composition is close to that of the host, indicating that host adaptation, rather than amelioration, is likely to be the source of the compositional similarity. Almost half of the PHA genes were acquired at the base of the Salmonella lineage, whereas nearly three-quarters are shared between most S. enterica subspecies. The numerical distribution of PHA genes in the Salmonella tree topology correlates well with the divergence of the major Salmonella species, highlighting the major impact of horizontal transfer on the evolution of the salmonellae.


Subject(s)
Salmonella/genetics , Escherichia coli/classification , Escherichia coli/genetics , Gene Transfer, Horizontal , Genes, Bacterial , Phylogeny , Prophages/genetics , Salmonella/classification
13.
Bioinformatics ; 22(18): 2196-203, 2006 Sep 15.
Article in English | MEDLINE | ID: mdl-16837528

ABSTRACT

MOTIVATION: There is a growing literature on the detection of Horizontal Gene Transfer (HGT) events by means of parametric, non-comparative methods. Such approaches rely only on sequence information and utilize different low and high order indices to capture compositional deviation from the genome backbone; the superiority of the latter over the former has been shown elsewhere. However even high order k-mers may be poor estimators of HGT, when insufficient information is available, e.g. in short sliding windows. Most of the current HGT prediction methods require pre-existing annotation, which may restrict their application on newly sequenced genomes. RESULTS: We introduce a novel computational method, Interpolated Variable Order Motifs (IVOMs), which exploits compositional biases using variable order motif distributions and captures more reliably the local composition of a sequence compared with fixed-order methods. For optimal localization of the boundaries of each predicted region, a second order, two-state hidden Markov model (HMM) is implemented in a change-point detection framework. We applied the IVOM approach to the genome of Salmonella enterica serovar Typhi CT18, a well-studied prokaryote in terms of HGT events, and we show that the IVOMs outperform state-of-the-art low and high order motif methods predicting not only the already characterized Salmonella Pathogenicity Islands (SPI-1 to SPI-10) but also three novel SPIs (SPI-15, SPI-16, SPI-17) and other HGT events. AVAILABILITY: The software is available under a GPL license as a standalone application at http://www.sanger.ac.uk/Software/analysis/alien_hunter CONTACT: gsv@sanger.ac.uk SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online.


Subject(s)
Algorithms , DNA, Bacterial/genetics , Gene Transfer, Horizontal/genetics , Genomic Islands/genetics , Salmonella/genetics , Salmonella/pathogenicity , Virulence/genetics , Biological Evolution , Genome, Bacterial/genetics , Virulence Factors/genetics
14.
BMC Bioinformatics ; 4: 53, 2003 Oct 31.
Article in English | MEDLINE | ID: mdl-14594459

ABSTRACT

BACKGROUND: The availability of increasing amounts of sequence data from completely sequenced genomes boosts the development of new computational methods for automated genome annotation and comparative genomics. Therefore, there is a need for tools that facilitate the visualization of raw data and results produced by bioinformatics analysis, providing new means for interactive genome exploration. Visual inspection can be used as a basis to assess the quality of various analysis algorithms and to aid in-depth genomic studies. RESULTS: GeneViTo is a JAVA-based computer application that serves as a workbench for genome-wide analysis through visual interaction. The application deals with various experimental information concerning both DNA and protein sequences (derived from public sequence databases or proprietary data sources) and meta-data obtained by various prediction algorithms, classification schemes or user-defined features. Interaction with a Graphical User Interface (GUI) allows easy extraction of genomic and proteomic data referring to the sequence itself, sequence features, or general structural and functional features. Emphasis is laid on the potential comparison between annotation and prediction data in order to offer a supplement to the provided information, especially in cases of "poor" annotation, or an evaluation of available predictions. Moreover, desired information can be output in high quality JPEG image files for further elaboration and scientific use. A compilation of properly formatted GeneViTo input data for demonstration is available to interested readers for two completely sequenced prokaryotes, Chlamydia trachomatis and Methanococcus jannaschii. CONCLUSIONS: GeneViTo offers an inspectional view of genomic functional elements, concerning data stemming both from database annotation and analysis tools for an overall analysis of existing genomes. The application is compatible with Linux or Windows ME-2000-XP operating systems, provided that the appropriate Java Runtime Environment is already installed in the system.


Subject(s)
Bacterial Proton-Translocating ATPases/chemistry , Bacterial Proton-Translocating ATPases/physiology , Computer Graphics/trends , Databases, Genetic , Software , Base Sequence/genetics , Binding Sites/genetics , Chlamydia trachomatis/enzymology , Chlamydia trachomatis/genetics , Chromosome Mapping/methods , Chromosomes, Bacterial/genetics , Computational Biology/methods , DNA Restriction Enzymes/genetics , Genes, Bacterial/genetics , Genes, Bacterial/physiology , Genes, rRNA/genetics , Genome, Bacterial , Methanococcus/enzymology , Methanococcus/genetics , Molecular Sequence Data , Phylogeny , Protein Subunits/genetics , RNA, Bacterial/genetics , RNA, Transfer/genetics , User-Computer Interface
SELECTION OF CITATIONS
SEARCH DETAIL
...