Your browser doesn't support javascript.
loading
Mostrar: 20 | 50 | 100
Resultados 1 - 20 de 46
Filtrar
1.
PLoS Genet ; 16(10): e1009042, 2020 10.
Artículo en Inglés | MEDLINE | ID: mdl-33022009

RESUMEN

A ~10-11 bp periodicity in dinucleotides imparting DNA bending, with shorter periods found in organisms with positively-supercoiled DNA and longer periods found in organisms with negatively-supercoiled DNA, was previously suggested to assist in DNA compaction. However, when measured with more robust methods, variation in the observed periods between organisms with different growth temperatures is not consistent with that hypothesis. We demonstrate that dinucleotide periodicity does not arise solely by mutational biases but is under selection. We found variation between genomes in both the period and the suite of dinucleotides that are periodic. Whereas organisms with similar growth temperatures have highly variable periods, differences in periods increase with phylogenetic distance between organisms. In addition, while the suites of dinucleotides under selection for periodicity become more dissimilar among more distantly-related organisms, there is a core set of dinucleotides that are strongly periodic among genomes in all domains of life. Notably, this core set of periodic motifs are not involved in DNA bending. These data indicate that dinucleotide periodicity is an ancient genomic architecture which may play a role in shaping the evolution of genes and genomes.


Asunto(s)
ADN/genética , Repeticiones de Dinucleótido/genética , Evolución Molecular , Motivos de Nucleótidos/genética , Archaea/genética , Genoma Bacteriano/genética , Genómica , Mutación/genética , Nucleosomas/genética , Paleontología , Filogenia , Selección Genética/genética
2.
PLoS Genet ; 14(5): e1007421, 2018 05.
Artículo en Inglés | MEDLINE | ID: mdl-29813058

RESUMEN

Despite significant frequencies of lateral gene transfer between species, higher taxonomic groups of bacteria show ecological and phenotypic cohesion. This suggests that barriers prevent panmictic dissemination of genes via lateral gene transfer. We have proposed that most bacterial genomes have a functional architecture imposed by Architecture IMparting Sequences (AIMS). AIMS are defined as 8 base pair sequences preferentially abundant on leading strands, whose abundance and strand-bias are positively correlated with proximity to the replication terminus. We determined that inversions whose endpoints lie within a single chromosome arm, which would reverse the polarity of AIMS in the inverted region, are both shorter and less frequent near the replication terminus. This distribution is consistent with the increased selection on AIMS function in this region, thus constraining DNA rearrangement. To test the hypothesis that AIMS also constrain DNA transfer between genomes, AIMS were identified in genomes while ignoring atypical, potentially laterally-transferred genes. The strand-bias of AIMS within recently acquired genes was negatively correlated with the distance of those genes from their genome's replication terminus. This suggests that selection for AIMS function prevents the acquisition of genes whose AIMS are not found predominantly in the permissive orientation. This constraint has led to the loss of at least 18% of genes acquired by transfer in the terminus-proximal region. We used completely sequenced genomes to produce a predictive road map of paths of expected horizontal gene transfer between species based on AIMS compatibility between donor and recipient genomes. These results support a model whereby organisms retain introgressed genes only if the benefits conferred by their encoded functions outweigh the detriments incurred by the presence of foreign DNA lacking genome-wide architectural information.


Asunto(s)
Bacterias/genética , Cromosomas/genética , Reordenamiento Génico/genética , Transferencia de Gen Horizontal , Selección Genética , Inversión Cromosómica , Replicación del ADN , Genoma Bacteriano , Filogenia
3.
Nucleic Acids Res ; 46(5): 2265-2278, 2018 03 16.
Artículo en Inglés | MEDLINE | ID: mdl-29432573

RESUMEN

Highly Iterated Palindrome 1 (HIP1, GCGATCGC) is hyper-abundant in most cyanobacterial genomes. In some cyanobacteria, average HIP1 abundance exceeds one motif per gene. Such high abundance suggests a significant role in cyanobacterial biology. However, 20 years of study have not revealed whether HIP1 has a function, much less what that function might be. We show that HIP1 is 15- to 300-fold over-represented in genomes analyzed. More importantly, HIP1 sites are conserved both within and between open reading frames, suggesting that their overabundance is maintained by selection rather than by continual replenishment by neutral processes, such as biased DNA repair. This evidence for selection suggests a functional role for HIP1. No evidence was found to support a functional role as a peptide or RNA motif or a role in the regulation of gene expression. Rather, we demonstrate that the distribution of HIP1 along cyanobacterial chromosomes is significantly periodic, with periods ranging from 10 to 90 kb, consistent in scale with periodicities reported for co-regulated, co-expressed and evolutionarily correlated genes. The periodicity we observe is also comparable in scale to chromosomal interaction domains previously described in other bacteria. In this context, our findings imply HIP1 functions associated with chromosome and nucleoid structure.


Asunto(s)
Proteínas Bacterianas/genética , Cianobacterias/genética , Genoma Bacteriano/genética , Selección Genética , Proteínas Bacterianas/metabolismo , Secuencia de Bases , Cromosomas Bacterianos/genética , Cianobacterias/clasificación , Cianobacterias/metabolismo , ADN Bacteriano/genética , Regulación Bacteriana de la Expresión Génica , Periodicidad , Filogenia
4.
Microbiology (Reading) ; 162(4): 610-621, 2016 Apr.
Artículo en Inglés | MEDLINE | ID: mdl-26888189

RESUMEN

Antigenically distinct members of bacterial species can be differentially distributed in the environment. Predators known to consume antigenically distinct prey with different efficiencies are also differentially distributed. Here we show that antigenically distinct, but otherwise isogenic and physiologically indistinct, strains of Salmonella enterica show differential survival in natural soil, sediment and intestinal environments, where they would face a community of predators. Decline in overall cell numbers is attenuated by factors that inhibit the action of predators, including heat and antiprotozoal and antihelminthic drugs. Moreover, the fitness of strains facing these predators - calculated by comparing survival with and without treatments attenuating predator activity - varies between environments. These results suggest that relative survival in natural environments is arbitrated by communities of natural predators whose feeding preferences, if not species composition, vary between environments. These data support the hypothesis that survival against natural predators may drive the differential distribution of bacteria among microenvironments.

5.
Proc Biol Sci ; 283(1829)2016 04 27.
Artículo en Inglés | MEDLINE | ID: mdl-27097926

RESUMEN

Despite the importance of host attributes for the likelihood of associated microbial transmission, individual variation is seldom considered in studies of wildlife disease. Here, we test the influence of host phenotypes on social network structure and the likelihood of cuticular bacterial transmission from exposed individuals to susceptible group-mates using female social spiders (Stegodyphus dumicola). Based on the interactions of resting individuals of known behavioural types, we assessed whether individuals assorted according to their behavioural traits. We found that individuals preferentially interacted with individuals of unlike behavioural phenotypes. We next applied a green fluorescent protein-transformed cuticular bacterium,Pantoeasp., to individuals and allowed them to interact with an unexposed colony-mate for 24 h. We found evidence for transmission of bacteria in 55% of cases. The likelihood of transmission was influenced jointly by the behavioural phenotypes of both the exposed and susceptible individuals: transmission was more likely when exposed spiders exhibited higher 'boldness' relative to their colony-mate, and when unexposed individuals were in better body condition. Indirect transmission via shared silk took place in only 15% of cases. Thus, bodily contact appears key to transmission in this system. These data represent a fundamental step towards understanding how individual traits influence larger-scale social and epidemiological dynamics.


Asunto(s)
Arañas/microbiología , Arañas/fisiología , Animales , Femenino , Pantoea/aislamiento & purificación , Fenotipo , Seda , Conducta Social
6.
Mol Biol Evol ; 29(12): 3669-83, 2012 Dec.
Artículo en Inglés | MEDLINE | ID: mdl-22740635

RESUMEN

In bacteria, physiological change may be effected by a single gene acquisition, producing ecological differentiation without genetic isolation. Natural selection acting on such differences can reduce the frequency of genotypes that arise from recombination at these loci. However, gene acquisition can only account for recombination interference in the fraction of the genome that is tightly linked to the integration site. To identify additional loci that contribute to adaptive differences, we examined orthologous genes in species of Enterobacteriaceae to identify significant differences in the degree of codon selection. Significance was assessed using the Adaptive Codon Enrichment metric, which accounts for the variation in codon usage bias that is expected to arise from mutation and drift; large differences in codon usage bias were identified in more genes than would be expected to arise from stochastic processes alone. Genes in the same operon showed parallel differences in codon usage bias, suggesting that changes in the overall levels of gene expression led to changes in the degree of adaptive codon usage. Most significant differences between orthologous operons were found among those involved with specific environmental adaptations, whereas "housekeeping" genes rarely showed significant changes. When considered together, the loci experiencing significant changes in codon selection outnumber potentially adaptive gene acquisition events. The identity of genes under strong codon selection seems to be influenced by the habitat from which the bacteria were isolated. We propose a two-stage model for how adaptation to different selective regimes can drive bacterial speciation. Initially, gene acquisitions catalyze rapid ecological differentiation, which modifies the utilization of genes, thereby changing the strength of codon selection on them. Alleles develop fitness variation by substitution, producing recombination interference at these loci in addition to those flanking acquired genes, allowing sequences to diverge across the entire genome and establishing genetic isolation (i.e., protection from frequent homologous recombination).


Asunto(s)
Adaptación Biológica/genética , Bacterias/genética , Codón/genética , Ambiente , Transferencia de Gen Horizontal/genética , Filogenia , Selección Genética/genética , Análisis de Varianza , Funciones de Verosimilitud , Modelos Genéticos , Operón/genética , Análisis de Componente Principal , Especificidad de la Especie
7.
Nucleic Acids Res ; 39(9): e56, 2011 May.
Artículo en Inglés | MEDLINE | ID: mdl-21297116

RESUMEN

Because the properties of horizontally-transferred genes will reflect the mutational proclivities of their donor genomes, they often show atypical compositional properties relative to native genes. Parametric methods use these discrepancies to identify bacterial genes recently acquired by horizontal transfer. However, compositional patterns of native genes vary stochastically, leaving no clear boundary between typical and atypical genes. As a result, while strongly atypical genes are readily identified as alien, genes of ambiguous character are poorly classified when a single threshold separates typical and atypical genes. This limitation affects all parametric methods that examine genes independently, and escaping it requires the use of additional genomic information. We propose that the performance of all parametric methods can be improved by using a multiple-threshold approach. First, strongly atypical alien genes and strongly typical native genes would be identified using conservative thresholds. Genes with ambiguous compositional features would then be classified by examining gene context, including the class (native or alien) of flanking genes. By including additional genomic information in a multiple-threshold framework, we observed a remarkable improvement in the performance of several popular, but algorithmically distinct, methods for alien gene detection.


Asunto(s)
Genes Bacterianos , Genómica/métodos , Algoritmos , Análisis por Conglomerados , Transferencia de Gen Horizontal , Genes Arqueales , Genoma Bacteriano
8.
Proc Natl Acad Sci U S A ; 107(25): 11453-8, 2010 Jun 22.
Artículo en Inglés | MEDLINE | ID: mdl-20534528

RESUMEN

Evolutionary relationships among species are often assumed to be fundamentally unambiguous, where genes within a genome are thought to evolve in concert and phylogenetic incongruence between individual orthologs is attributed to idiosyncrasies in their evolution. We have identified substantial incongruence between the phylogenies of orthologous genes in Escherichia, Salmonella, and Citrobacter, or E. coli, E. fergusonii, and E. albertii. The source of incongruence was inferred to be recombination, because individual genes support conflicting topology more robustly than expected from stochastic sequence homoplasies. Clustering of phylogenetically informative sites on the genome indicated that the regions of recombination extended over several kilobases. Analysis of phylogenetically distant taxa resulted in consensus among individual gene phylogenies, suggesting that recombination is not ongoing; instead, conflicting relationships among genes in descendent taxa reflect recombination among their ancestors. Incongruence could have resulted from random assortment of ancestral polymorphisms if species were instantly created from the division of a recombining population. However, the estimated branch lengths in alternative phylogenies would require ancestral populations with far more diversity than is found in extant populations. Rather, these and previous data collectively suggest that genome-wide recombination rates decreased gradually, with variation in rate among loci, leading to pluralistic relationships among their descendent taxa.


Asunto(s)
Bacterias/genética , Linaje de la Célula , Codón , ADN Bacteriano/genética , Enterobacteriaceae/genética , Escherichia coli/genética , Evolución Molecular , Variación Genética , Genoma , Modelos Genéticos , Familia de Multigenes , Filogenia , Polimorfismo Genético , Recombinación Genética , Salmonella enterica/genética
9.
J Bacteriol ; 193(12): 2941-7, 2011 Jun.
Artículo en Inglés | MEDLINE | ID: mdl-21515774

RESUMEN

Haemophilus ducreyi, the etiologic agent of chancroid, expresses variants of several key virulence factors. While previous reports suggested that H. ducreyi strains formed two clonal populations, the differences between, and diversity within, these populations were unclear. To assess their variability, we examined sequence diversity at 11 H. ducreyi loci, including virulence and housekeeping genes, augmenting published data sets with PCR-amplified genes to acquire data for at least 10 strains at each locus. While sequences from all 11 loci place strains into two distinct groups, there was very little variation within each group. The difference between alleles of the two groups was variable and large at 3 loci encoding surface-exposed proteins (0.4 < K(S) < 1.3, where K(S) is divergence at synonymous sites) but consistently small at genes encoding cytoplasmic or periplasmic proteins (K(S) < 0.09). The data suggest that the two classes have recently diverged, that recombination has introduced variant alleles into at least 3 distinct loci, and that these alleles have been confined to one of the two classes. In addition, recombination is evident among alleles within, but not between, classes. Rather than clones of the same species, these properties indicate that the two classes may form distinct species.


Asunto(s)
Variación Genética , Haemophilus ducreyi/clasificación , Haemophilus ducreyi/genética , Antígenos Bacterianos/genética , Antígenos Bacterianos/metabolismo , Proteínas Bacterianas/genética , Proteínas Bacterianas/metabolismo , Secuencia de Bases , Regulación Bacteriana de la Expresión Génica , Infecciones por Haemophilus/prevención & control , Vacunas contra Haemophilus/inmunología , Haemophilus ducreyi/inmunología , Humanos , Datos de Secuencia Molecular , Filogenia , Factores de Tiempo
10.
BMC Genomics ; 12: 374, 2011 Jul 25.
Artículo en Inglés | MEDLINE | ID: mdl-21787402

RESUMEN

BACKGROUND: Statistics measuring codon selection seek to compare genes by their sensitivity to selection for translational efficiency, but existing statistics lack a model for testing the significance of differences between genes. Here, we introduce a new statistic for measuring codon selection, the Adaptive Codon Enrichment (ACE). RESULTS: This statistic represents codon usage bias in terms of a probabilistic distribution, quantifying the extent that preferred codons are over-represented in the gene of interest relative to the mean and variance that would result from stochastic sampling of codons. Expected codon frequencies are derived from the observed codon usage frequencies of a broad set of genes, such that they are likely to reflect nonselective, genome wide influences on codon usage (e.g. mutational biases). The relative adaptiveness of synonymous codons is deduced from the frequency of codon usage in a pre-selected set of genes relative to the expected frequency. The ACE can predict both transcript abundance during rapid growth and the rate of synonymous substitutions, with accuracy comparable to or greater than existing metrics. We further examine how the composition of reference gene sets affects the accuracy of the statistic, and suggest methods for selecting appropriate reference sets for any genome, including bacteriophages. Finally, we demonstrate that the ACE may naturally be extended to quantify the genome-wide influence of codon selection in a manner that is sensitive to a large fraction of codons in the genome. This reveals substantial variation among genomes, correlated with the tRNA gene number, even among groups of bacteria where previously proposed whole-genome measures show little variation. CONCLUSIONS: The statistical framework of the ACE allows rigorous comparison of the level of codon selection acting on genes, both within a genome and between genomes.


Asunto(s)
Bacterias/genética , Codón/genética , Genómica , Algoritmos , Sustitución de Aminoácidos , Perfilación de la Expresión Génica , Genes Bacterianos/genética , Genoma Bacteriano/genética , Modelos Genéticos , Sistemas de Lectura Abierta/genética , Probabilidad , Procesos Estocásticos
11.
Nucleic Acids Res ; 37(16): 5255-66, 2009 Sep.
Artículo en Inglés | MEDLINE | ID: mdl-19589805

RESUMEN

While the recognition of genomic islands can be a powerful mechanism for identifying genes that distinguish related bacteria, few methods have been developed to identify them specifically. Rather, identification of islands often begins with cataloging individual genes likely to have been recently introduced into the genome; regions with many putative alien genes are then examined for other features suggestive of recent acquisition of a large genomic region. When few phylogenetic relatives are available, the identification of alien genes relies on their atypical features relative to the bulk of the genes in the genome. The weakness of these 'bottom-up' approaches lies in the difficulty in identifying robustly those genes which are atypical, or phylogenetically restricted, due to recent foreign ancestry. Herein, we apply an alternative 'top-down' approach where bacterial genomes are recursively divided into progressively smaller regions, each with uniform composition. In this way, large chromosomal regions with atypical features are identified with high confidence due to the simultaneous analysis of multiple genes. This approach is based on a generalized divergence measure to quantify the compositional difference between segments in a hypothesis-testing framework. We tested the proposed genome island prediction algorithm on both artificial chimeric genomes and genuine bacterial genomes.


Asunto(s)
Algoritmos , Genoma Bacteriano , Islas Genómicas , Genómica/métodos , Genes Bacterianos , Heterogeneidad Genética , Salmonella typhi/genética
12.
Curr Opin Genet Dev ; 15(6): 584-8, 2005 Dec.
Artículo en Inglés | MEDLINE | ID: mdl-16188434

RESUMEN

Genomes of pathogenic bacteria evolve by large-scale changes in gene inventory. The continual acquisition of genomic islands, which refines their metabolic arsenal, is offset by gene loss. Far from this being a passive deletion of genes no longer useful to pathogens, the removal of genes encoding problematic metabolic process and immunogenic surface antigens might be strongly beneficial. Genomes of virulent eukaryotes show the footprint of similar genomic alterations, including acquisition of genes by lateral transfer, and genome degradation in obligate pathogens. These common features suggest that unicellular pathogens share common strategies for adaptation.


Asunto(s)
Bacterias/genética , Bacterias/patogenicidad , Genómica , Evolución Molecular , Genoma Bacteriano , Humanos
13.
Methods Mol Biol ; 532: 29-53, 2009.
Artículo en Inglés | MEDLINE | ID: mdl-19271178

RESUMEN

Bacteria experience recombination in two ways. In the context of the Biological Species concept, allelic exchange purges genic variability within bacterial populations as gene exchange mediates selective sweeps. In contrast, horizontal gene transfer (HGT) increases the size of the population's pan-genome by providing an influx of novel genetic material. Here we discuss the interplay of these two processes, with an emphasis on how they allow for the maintenance of genotypically cohesive bacterial populations, yet allow for the separation of these populations upon bacterial speciation. In populations that maintain genotypic similarity by frequent allelic exchange, horizontally transferred genes may initiate ecological barriers to genetic exchange. The resulting recombination interference allows for the accumulation of neutral mutations and, consequently, the imposition of a pre-mating barrier to gene transfer.


Asunto(s)
Bacterias/genética , Transferencia de Gen Horizontal , Especiación Genética , Recombinación Genética , Bacterias/clasificación , Evolución Biológica , Ecosistema , Genoma Bacteriano , Modelos Genéticos , Mutación , Selección Genética , Factores de Tiempo
14.
Nucleic Acids Res ; 35(14): 4629-39, 2007.
Artículo en Inglés | MEDLINE | ID: mdl-17591616

RESUMEN

Most parametric methods for detecting foreign genes in bacterial genomes use a scoring function that measures the atypicality of a gene with respect to the bulk of the genome. Genes whose features are sufficiently atypical-lying beyond a threshold value-are deemed foreign. Yet these methods fail when the range of features of donor genomes overlaps with that of the recipient genome, leading to misclassification of foreign and native genes; existing parametric methods choose threshold parameters to balance these error rates. To circumvent this problem, we have developed a two-pronged approach to minimize the misclassification of genes. First, beyond classifying genes as merely atypical, a gene clustering method based on Jensen-Shannon entropic divergence identifies classes of foreign genes that are also similar to each other. Second, genome position is used to reassign genes among classes whose composition features overlap. This process minimizes the misclassification of either native or foreign genes that are weakly atypical. The performance of this approach was assessed using artificial chimeric genomes and then applied to the well-characterized Escherichia coli K12 genome. Not only were foreign genes identified with a high degree of accuracy, but genes originating from the same donor organism were effectively grouped.


Asunto(s)
Transferencia de Gen Horizontal , Genómica/métodos , Análisis por Conglomerados , Escherichia coli K12/genética , Genoma Bacteriano , Filogenia
15.
PLoS Genet ; 2(6): e92, 2006 Jun.
Artículo en Inglés | MEDLINE | ID: mdl-16789831

RESUMEN

Bacteriophages are the most abundant forms of life in the biosphere and carry genomes characterized by high genetic diversity and mosaic architectures. The complete sequences of 30 mycobacteriophage genomes show them collectively to encode 101 tRNAs, three tmRNAs, and 3,357 proteins belonging to 1,536 "phamilies" of related sequences, and a statistical analysis predicts that these represent approximately 50% of the total number of phamilies in the mycobacteriophage population. These phamilies contain 2.19 proteins on average; more than half (774) of them contain just a single protein sequence. Only six phamilies have representatives in more than half of the 30 genomes, and only three-encoding tape-measure proteins, lysins, and minor tail proteins-are present in all 30 phages, although these phamilies are themselves highly modular, such that no single amino acid sequence element is present in all 30 mycobacteriophage genomes. Of the 1,536 phamilies, only 230 (15%) have amino acid sequence similarity to previously reported proteins, reflecting the enormous genetic diversity of the entire phage population. The abundance and diversity of phages, the simplicity of phage isolation, and the relatively small size of phage genomes support bacteriophage isolation and comparative genomic analysis as a highly suitable platform for discovery-based education.


Asunto(s)
Micobacteriófagos/genética , Proteoma , Virología/educación , ADN Viral/genética , Genes Virales , Genoma Viral , Genómica , Datos de Secuencia Molecular , ARN Mensajero/genética , ARN de Transferencia/genética , ARN Viral/genética , Proteínas Virales/genética
16.
Nat Rev Microbiol ; 3(9): 733-9, 2005 Sep.
Artículo en Inglés | MEDLINE | ID: mdl-16138101

RESUMEN

There is no widely accepted concept of species for prokaryotes, and assignment of isolates to species is based on measures of phenotypic or genome similarity. The current methods for defining prokaryotic species are inadequate and incapable of keeping pace with the levels of diversity that are being uncovered in nature. Prokaryotic taxonomy is being influenced by advances in microbial population genetics, ecology and genomics, and by the ease with which sequence data can be obtained. Here, we review the classical approaches to prokaryotic species definition and discuss the current and future impact of multilocus nucleotide-sequence-based approaches to prokaryotic systematics. We also consider the potential, and difficulties, of assigning species status to biologically or ecologically meaningful sequence clusters.


Asunto(s)
Bacterias/genética , Bacterias/clasificación , ADN Bacteriano/genética , Modelos Genéticos , Filogenia
17.
Curr Opin Microbiol ; 8(5): 572-8, 2005 Oct.
Artículo en Inglés | MEDLINE | ID: mdl-16122972

RESUMEN

Bacterial genomes have been viewed as collections of genes, with each gene and genome evolving more-or-less independently through the acquisition of mutational changes. This historical view has been overturned by the finding that genomes of even closely-related taxa differ widely in gene content. Yet, genomes are more than ever-shuffling collections of genes. Some genes within a genome are more transient than others, conferring a layer of phenotypic lability over a core of genotypic stability; this core decreases in size as the taxa included become increasingly diverse. In addition, some lineages no longer experience high rates of gene turnover, and gene content alters primarily through slow rates of gene loss. More importantly, the cell and molecular biology of the bacterial cell imposes constraints on chromosome composition, maintaining a stable architecture in the face of gene turnover. As a result, genomes reflect the sum of processes that introduce variability, which is then arbitrated by processes that maintain stability.


Asunto(s)
Evolución Molecular , Genoma Bacteriano , Bacterias/clasificación , Bacterias/genética , Transferencia de Gen Horizontal , Recombinación Genética , Eliminación de Secuencia
18.
PLoS Comput Biol ; 1(6): e56, 2005 Nov.
Artículo en Inglés | MEDLINE | ID: mdl-16292353

RESUMEN

Parametric methods for identifying laterally transferred genes exploit the directional mutational biases unique to each genome. Yet the development of new, more robust methods--as well as the evaluation and proper implementation of existing methods--relies on an arbitrary assessment of performance using real genomes, where the evolutionary histories of genes are not known. We have used the framework of a generalized hidden Markov model to create artificial genomes modeled after genuine genomes. To model a genome, "core" genes--those displaying patterns of mutational biases shared among large numbers of genes--are identified by a novel gene clustering approach based on the Akaike information criterion. Gene models derived from multiple "core" gene clusters are used to generate an artificial genome that models the properties of a genuine genome. Chimeric artificial genomes--representing those having experienced lateral gene transfer--were created by combining genes from multiple artificial genomes, and the performance of the parametric methods for identifying "atypical" genes was assessed directly. We found that a hidden Markov model that included multiple gene models, each trained on sets of genes representing the range of genotypic variability within a genome, could produce artificial genomes that mimicked the properties of genuine genomes. Moreover, different methods for detecting foreign genes performed differently--i.e., they had different sets of strengths and weaknesses--when identifying atypical genes within chimeric artificial genomes.


Asunto(s)
Escherichia coli/genética , Genes Bacterianos/genética , Genoma/genética , Modelos Genéticos , Algoritmos , Composición de Base , Secuencia de Bases , Quimerismo , Variación Genética/genética , Genoma Bacteriano/genética , Cadenas de Markov , Sistemas de Lectura Abierta/genética
19.
Genome Biol Evol ; 8(6): 2065-75, 2016 07 03.
Artículo en Inglés | MEDLINE | ID: mdl-27289093

RESUMEN

Neisseria meningitidis is an important cause of meningococcal disease globally. Sequence type (ST)-11 clonal complex (cc11) is a hypervirulent meningococcal lineage historically associated with serogroup C capsule and is believed to have acquired the W capsule through a C to W capsular switching event. We studied the sequence of capsule gene cluster (cps) and adjoining genomic regions of 524 invasive W cc11 strains isolated globally. We identified recombination breakpoints corresponding to two distinct recombination events within W cc11: A 8.4-kb recombinant region likely acquired from W cc22 including the sialic acid/glycosyl-transferase gene, csw resulted in a C→W change in capsular phenotype and a 13.7-kb recombinant segment likely acquired from Y cc23 lineage includes 4.5 kb of cps genes and 8.2 kb downstream of the cps cluster resulting in allelic changes in capsule translocation genes. A vast majority of W cc11 strains (497/524, 94.8%) retain both recombination events as evidenced by sharing identical or very closely related capsular allelic profiles. These data suggest that the W cc11 capsular switch involved two separate recombination events and that current global W cc11 meningococcal disease is caused by strains bearing this mosaic capsular switch.


Asunto(s)
Infecciones Meningocócicas/genética , Neisseria meningitidis/genética , Filogenia , Recombinación Genética , Genoma Bacteriano , Genómica , Humanos , Infecciones Meningocócicas/microbiología , Familia de Multigenes , Neisseria meningitidis/patogenicidad , Serogrupo
20.
Elife ; 4: e06416, 2015 Apr 28.
Artículo en Inglés | MEDLINE | ID: mdl-25919952

RESUMEN

The bacteriophage population is large, dynamic, ancient, and genetically diverse. Limited genomic information shows that phage genomes are mosaic, and the genetic architecture of phage populations remains ill-defined. To understand the population structure of phages infecting a single host strain, we isolated, sequenced, and compared 627 phages of Mycobacterium smegmatis. Their genetic diversity is considerable, and there are 28 distinct genomic types (clusters) with related nucleotide sequences. However, amino acid sequence comparisons show pervasive genomic mosaicism, and quantification of inter-cluster and intra-cluster relatedness reveals a continuum of genetic diversity, albeit with uneven representation of different phages. Furthermore, rarefaction analysis shows that the mycobacteriophage population is not closed, and there is a constant influx of genes from other sources. Phage isolation and analysis was performed by a large consortium of academic institutions, illustrating the substantial benefits of a disseminated, structured program involving large numbers of freshman undergraduates in scientific discovery.


Asunto(s)
ADN Viral/genética , Variación Genética , Genoma Viral , Micobacteriófagos/genética , Filogenia , Investigación Biomédica/ética , Conducta Cooperativa , Flujo Génico , Secuenciación de Nucleótidos de Alto Rendimiento , Difusión de la Información , Mosaicismo , Micobacteriófagos/clasificación , Mycobacterium smegmatis/virología , Filogeografía , Recursos Humanos
SELECCIÓN DE REFERENCIAS
DETALLE DE LA BÚSQUEDA