RESUMEN
Plasmodium knowlesi is an intracellular malaria parasite whose natural vertebrate host is Macaca fascicularis (the 'kra' monkey); however, it is now increasingly recognized as a significant cause of human malaria, particularly in southeast Asia. Plasmodium knowlesi was the first malaria parasite species in which antigenic variation was demonstrated, and it has a close phylogenetic relationship to Plasmodium vivax, the second most important species of human malaria parasite (reviewed in ref. 4). Despite their relatedness, there are important phenotypic differences between them, such as host blood cell preference, absence of a dormant liver stage or 'hypnozoite' in P. knowlesi, and length of the asexual cycle (reviewed in ref. 4). Here we present an analysis of the P. knowlesi (H strain, Pk1(A+) clone) nuclear genome sequence. This is the first monkey malaria parasite genome to be described, and it provides an opportunity for comparison with the recently completed P. vivax genome and other sequenced Plasmodium genomes. In contrast to other Plasmodium genomes, putative variant antigen families are dispersed throughout the genome and are associated with intrachromosomal telomere repeats. One of these families, the KIRs, contains sequences that collectively match over one-half of the host CD99 extracellular domain, which may represent an unusual form of molecular mimicry.
Asunto(s)
Genoma de Protozoos/genética , Genómica , Macaca mulatta/parasitología , Malaria/parasitología , Plasmodium knowlesi/genética , Secuencia de Aminoácidos , Animales , Antígenos CD/química , Antígenos CD/genética , Cromosomas/genética , Secuencia Conservada , Genes Protozoarios/genética , Humanos , Datos de Secuencia Molecular , Plasmodium knowlesi/clasificación , Plasmodium knowlesi/fisiología , Estructura Terciaria de Proteína , Proteínas Protozoarias/química , Proteínas Protozoarias/genética , Análisis de Secuencia de ADN , Telómero/genéticaRESUMEN
Corynebacterium diphtheriae is a Gram-positive, non-spore forming, non-motile, pleomorphic rod belonging to the genus Corynebacterium and the actinomycete group of organisms. The organism produces a potent bacteriophage-encoded protein exotoxin, diphtheria toxin (DT), which causes the symptoms of diphtheria. This potentially fatal infectious disease is controlled in many developed countries by an effective immunisation programme. However, the disease has made a dramatic return in recent years, in particular within the Eastern European region. The largest, and still on-going, outbreak since the advent of mass immunisation started within Russia and the newly independent states of the former Soviet Union in the 1990s. We have sequenced the genome of a UK clinical isolate (biotype gravis strain NCTC13129), representative of the clone responsible for this outbreak. The genome consists of a single circular chromosome of 2 488 635 bp, with no plasmids. It provides evidence that recent acquisition of pathogenicity factors goes beyond the toxin itself, and includes iron-uptake systems, adhesins and fimbrial proteins. This is in contrast to Corynebacterium's nearest sequenced pathogenic relative, Mycobacterium tuberculosis, where there is little evidence of recent horizontal DNA acquisition. The genome itself shows an unusually extreme large-scale compositional bias, being noticeably higher in G+C near the origin than at the terminus.
Asunto(s)
Corynebacterium diphtheriae/genética , Genoma Bacteriano , Anciano , Composición de Base , Cromosomas Bacterianos/genética , Corynebacterium diphtheriae/metabolismo , Corynebacterium diphtheriae/patogenicidad , ADN Bacteriano/química , ADN Bacteriano/genética , Toxina Diftérica/metabolismo , Femenino , Fimbrias Bacterianas/genética , Humanos , Glicoproteínas de Membrana/genética , Datos de Secuencia Molecular , Análisis de Secuencia de ADN , Virulencia/genéticaRESUMEN
Campylobacter jejuni, from the delta-epsilon group of proteobacteria, is a microaerophilic, Gram-negative, flagellate, spiral bacterium-properties it shares with the related gastric pathogen Helicobacter pylori. It is the leading cause of bacterial food-borne diarrhoeal disease throughout the world. In addition, infection with C. jejuni is the most frequent antecedent to a form of neuromuscular paralysis known as Guillain-Barré syndrome. Here we report the genome sequence of C. jejuni NCTC11168. C. jejuni has a circular chromosome of 1,641,481 base pairs (30.6% G+C) which is predicted to encode 1,654 proteins and 54 stable RNA species. The genome is unusual in that there are virtually no insertion sequences or phage-associated sequences and very few repeat sequences. One of the most striking findings in the genome was the presence of hypervariable sequences. These short homopolymeric runs of nucleotides were commonly found in genes encoding the biosynthesis or modification of surface structures, or in closely linked genes of unknown function. The apparently high rate of variation of these homopolymeric tracts may be important in the survival strategy of C. jejuni.
Asunto(s)
Proteínas Bacterianas , Campylobacter jejuni/genética , Variación Genética , Genoma Bacteriano , Secuencia de Aminoácidos , Toxinas Bacterianas/genética , Campylobacter jejuni/clasificación , Campylobacter jejuni/metabolismo , Quimiotaxis , Contaminación de Alimentos , Humanos , Lipopolisacáridos/biosíntesis , Proteínas de la Membrana/metabolismo , Proteínas Quimiotácticas Aceptoras de Metilo , Datos de Secuencia Molecular , FilogeniaRESUMEN
Large-scale systematic sequencing has generally depended on the availability of an ordered library of large-insert bacterial or viral genomic clones for the organism under study. The generation of these large insert libraries, and the location of each clone on a genome map, is a laborious and time-consuming process. In an effort to overcome these problems, several groups have successfully demonstrated the viability of the whole-genome random 'shotgun' method in large-scale sequencing of both viruses and prokaryotes. Here we report the sequence of Saccharomyces cerevisiae chromosome IX, determined in part by a whole-chromosome 'shotgun', and describe the particular difficulties encountered in the random 'shotgun' sequencing of an entire eukaryotic chromosome. Analysis of this sequence shows that chromosome IX contains 221 open reading frames (ORFs), of which approximately 30% have been sequenced previously. This chromosome shows features typical of a small Saccharomyces cerevisiae chromosome.
Asunto(s)
Cromosomas Fúngicos , Saccharomyces cerevisiae/genética , Composición de Base , Secuencia de Bases , ADN de Hongos , Sistemas de Lectura AbiertaRESUMEN
Systematic sequencing of the genome of Saccharomyces cerevisiae has revealed thousands of new predicted genes and allowed analysis of long-range features of chromosomal organization. Generally, genes and predicted genes seem to be distributed evenly throughout the genome, having no overall preference for DNA strand. Apart from the smaller chromosomes, which can have substantially lower gene density in their telomeric regions, there is a consistent average of one open reading frame (ORF) approximately every two kilobases. However, one of the most surprising findings for a eukaryote with approximately 6,000 genes was the amount of apparent redundancy in its genome. This redundancy occurs both between individual ORFs and over more extensive chromosome regions, which have been duplicated preserving gene order and orientation. Here we report the entire nucleotide sequence of chromosome XIII, the sixth-largest S. cerevisiae chromosome, and demonstrate that its features and organization are consistent with those observed for other S. cerevisiae chromosomes. Analysis revealed 459 ORFs, 284 have not been identified previously. Both intra- and interchromosomal duplications of regions of this chromosome have occurred.
Asunto(s)
Cromosomas Fúngicos , Saccharomyces cerevisiae/genética , Composición de Base , Secuencia de Bases , ADN de Hongos , Datos de Secuencia Molecular , Sistemas de Lectura AbiertaRESUMEN
Neisseria meningitidis causes bacterial meningitis and is therefore responsible for considerable morbidity and mortality in both the developed and the developing world. Meningococci are opportunistic pathogens that colonize the nasopharynges and oropharynges of asymptomatic carriers. For reasons that are still mostly unknown, they occasionally gain access to the blood, and subsequently to the cerebrospinal fluid, to cause septicaemia and meningitis. N. meningitidis strains are divided into a number of serogroups on the basis of the immunochemistry of their capsular polysaccharides; serogroup A strains are responsible for major epidemics and pandemics of meningococcal disease, and therefore most of the morbidity and mortality associated with this disease. Here we have determined the complete genome sequence of a serogroup A strain of Neisseria meningitidis, Z2491. The sequence is 2,184,406 base pairs in length, with an overall G+C content of 51.8%, and contains 2,121 predicted coding sequences. The most notable feature of the genome is the presence of many hundreds of repetitive elements, ranging from short repeats, positioned either singly or in large multiple arrays, to insertion sequences and gene duplications of one kilobase or more. Many of these repeats appear to be involved in genome fluidity and antigenic variation in this important human pathogen.
Asunto(s)
ADN Bacteriano , Genoma Bacteriano , Neisseria meningitidis/genética , Variación Antigénica/genética , Proteínas Bacterianas/genética , Reordenamiento Génico , Datos de Secuencia Molecular , Neisseria meningitidis/clasificación , Secuencias Repetitivas de Ácidos Nucleicos , Análisis de Secuencia de ADN , SerotipificaciónRESUMEN
Salmonella enterica serovar Typhi (S. typhi) is the aetiological agent of typhoid fever, a serious invasive bacterial disease of humans with an annual global burden of approximately 16 million cases, leading to 600,000 fatalities. Many S. enterica serovars actively invade the mucosal surface of the intestine but are normally contained in healthy individuals by the local immune defence mechanisms. However, S. typhi has evolved the ability to spread to the deeper tissues of humans, including liver, spleen and bone marrow. Here we have sequenced the 4,809,037-base pair (bp) genome of a S. typhi (CT18) that is resistant to multiple drugs, revealing the presence of hundreds of insertions and deletions compared with the Escherichia coli genome, ranging in size from single genes to large islands. Notably, the genome sequence identifies over two hundred pseudogenes, several corresponding to genes that are known to contribute to virulence in Salmonella typhimurium. This genetic degradation may contribute to the human-restricted host range for S. typhi. CT18 harbours a 218,150-bp multiple-drug-resistance incH1 plasmid (pHCM1), and a 106,516-bp cryptic plasmid (pHCM2), which shows recent common ancestry with a virulence plasmid of Yersinia pestis.
Asunto(s)
Genoma Bacteriano , Salmonella typhi/genética , Mapeo Cromosómico , Cromosomas Bacterianos , ADN Bacteriano , Farmacorresistencia Bacteriana Múltiple/genética , Escherichia coli/genética , Eliminación de Gen , Humanos , Datos de Secuencia Molecular , Mutagénesis Insercional , Plásmidos/genética , Recombinación Genética , Salmonella typhimurium/genética , Análisis de Secuencia de ADN , SerotipificaciónRESUMEN
The bacterial family Enterobacteriaceae is notable for its well studied human pathogens, including Salmonella, Yersinia, Shigella, and Escherichia spp. However, it also contains several plant pathogens. We report the genome sequence of a plant pathogenic enterobacterium, Erwinia carotovora subsp. atroseptica (Eca) strain SCRI1043, the causative agent of soft rot and blackleg potato diseases. Approximately 33% of Eca genes are not shared with sequenced enterobacterial human pathogens, including some predicted to facilitate unexpected metabolic traits, such as nitrogen fixation and opine catabolism. This proportion of genes also contains an overrepresentation of pathogenicity determinants, including possible horizontally acquired gene clusters for putative type IV secretion and polyketide phytotoxin synthesis. To investigate whether these gene clusters play a role in the disease process, an arrayed set of insertional mutants was generated, and mutations were identified. Plant bioassays showed that these mutants were significantly reduced in virulence, demonstrating both the presence of novel pathogenicity determinants in Eca, and the impact of functional genomics in expanding our understanding of phytopathogenicity in the Enterobacteriaceae.
Asunto(s)
Genoma Bacteriano , Pectobacterium carotovorum/genética , Pectobacterium carotovorum/patogenicidad , Enfermedades de las Plantas/microbiología , Solanum tuberosum/microbiología , Virulencia/genética , Secuencia de Bases , Evolución Biológica , Cartilla de ADN , Ambiente , Datos de Secuencia Molecular , Reacción en Cadena de la PolimerasaRESUMEN
Leprosy, a chronic human neurological disease, results from infection with the obligate intracellular pathogen Mycobacterium leprae, a close relative of the tubercle bacillus. Mycobacterium leprae has the longest doubling time of all known bacteria and has thwarted every effort at culture in the laboratory. Comparing the 3.27-megabase (Mb) genome sequence of an armadillo-derived Indian isolate of the leprosy bacillus with that of Mycobacterium tuberculosis (4.41 Mb) provides clear explanations for these properties and reveals an extreme case of reductive evolution. Less than half of the genome contains functional genes but pseudogenes, with intact counterparts in M. tuberculosis, abound. Genome downsizing and the current mosaic arrangement appear to have resulted from extensive recombination events between dispersed repetitive sequences. Gene deletion and decay have eliminated many important metabolic activities including siderophore production, part of the oxidative and most of the microaerophilic and anaerobic respiratory chains, and numerous catabolic systems and their regulatory circuits.
Asunto(s)
Genoma Bacteriano , Mycobacterium leprae/genética , Animales , Armadillos , ADN Bacteriano , Metabolismo Energético , Evolución Molecular , Transferencia de Gen Horizontal , Humanos , Lepra/microbiología , Datos de Secuencia Molecular , Familia de Multigenes , Mycobacterium leprae/metabolismo , Análisis de Secuencia de ADNRESUMEN
Countless millions of people have died from tuberculosis, a chronic infectious disease caused by the tubercle bacillus. The complete genome sequence of the best-characterized strain of Mycobacterium tuberculosis, H37Rv, has been determined and analysed in order to improve our understanding of the biology of this slow-growing pathogen and to help the conception of new prophylactic and therapeutic interventions. The genome comprises 4,411,529 base pairs, contains around 4,000 genes, and has a very high guanine + cytosine content that is reflected in the biased amino-acid content of the proteins. M. tuberculosis differs radically from other bacteria in that a very large portion of its coding capacity is devoted to the production of enzymes involved in lipogenesis and lipolysis, and to two new families of glycine-rich proteins with a repetitive structure that may represent a source of antigenic variation.
Asunto(s)
Genoma Bacteriano , Mycobacterium tuberculosis/genética , Mapeo Cromosómico , Cromosomas Bacterianos , Farmacorresistencia Microbiana , Humanos , Metabolismo de los Lípidos , Datos de Secuencia Molecular , Mycobacterium tuberculosis/inmunología , Mycobacterium tuberculosis/metabolismo , Mycobacterium tuberculosis/patogenicidad , Análisis de Secuencia de ADN , Tuberculosis/microbiologíaRESUMEN
The Gram-negative bacterium Yersinia pestis is the causative agent of the systemic invasive infectious disease classically referred to as plague, and has been responsible for three human pandemics: the Justinian plague (sixth to eighth centuries), the Black Death (fourteenth to nineteenth centuries) and modern plague (nineteenth century to the present day). The recent identification of strains resistant to multiple drugs and the potential use of Y. pestis as an agent of biological warfare mean that plague still poses a threat to human health. Here we report the complete genome sequence of Y. pestis strain CO92, consisting of a 4.65-megabase (Mb) chromosome and three plasmids of 96.2 kilobases (kb), 70.3 kb and 9.6 kb. The genome is unusually rich in insertion sequences and displays anomalies in GC base-composition bias, indicating frequent intragenomic recombination. Many genes seem to have been acquired from other bacteria and viruses (including adhesins, secretion systems and insecticidal toxins). The genome contains around 150 pseudogenes, many of which are remnants of a redundant enteropathogenic lifestyle. The evidence of ongoing genome fluidity, expansion and decay suggests Y. pestis is a pathogen that has undergone large-scale genetic flux and provides a unique insight into the ways in which new and highly virulent pathogens evolve.
Asunto(s)
Genoma Bacteriano , Yersinia pestis/genética , Animales , Antígenos Bacterianos/genética , Proteínas Bacterianas/genética , Proteínas Bacterianas/metabolismo , Cromosomas Bacterianos , ADN Bacteriano , Metabolismo Energético , Evolución Molecular , Transferencia de Gen Horizontal , Humanos , Insectos/microbiología , Lipopolisacáridos , Datos de Secuencia Molecular , Mutación , Peste/microbiología , Seudogenes , Análisis de Secuencia de ADN , Virulencia/genética , Yersinia pestis/inmunología , Yersinia pestis/patogenicidad , Yersinia pseudotuberculosis/genéticaRESUMEN
Analysis of Plasmodium falciparum chromosome 3, and comparison with chromosome 2, highlights novel features of chromosome organization and gene structure. The sub-telomeric regions of chromosome 3 show a conserved order of features, including repetitive DNA sequences, members of multigene families involved in pathogenesis and antigenic variation, a number of conserved pseudogenes, and several genes of unknown function. A putative centromere has been identified that has a core region of about 2 kilobases with an extremely high (adenine + thymidine) composition and arrays of tandem repeats. We have predicted 215 protein-coding genes and two transfer RNA genes in the 1,060,106-base-pair chromosome sequence. The predicted protein-coding genes can be divided into three main classes: 52.6% are not spliced, 45.1% have a large exon with short additional 5' or 3' exons, and 2.3% have a multiple exon structure more typical of higher eukaryotes.
Asunto(s)
Genoma de Protozoos , Plasmodium falciparum/genética , Animales , Secuencia de Bases , Centrómero , Mapeo Cromosómico , Cromosomas , ADN Protozoario , Datos de Secuencia Molecular , Proteínas Protozoarias/genética , Análisis de Secuencia de ADN , TelómeroRESUMEN
Since the sequencing of the first two chromosomes of the malaria parasite, Plasmodium falciparum, there has been a concerted effort to sequence and assemble the entire genome of this organism. Here we report the sequence of chromosomes 1, 3-9 and 13 of P. falciparum clone 3D7--these chromosomes account for approximately 55% of the total genome. We describe the methods used to map, sequence and annotate these chromosomes. By comparing our assemblies with the optical map, we indicate the completeness of the resulting sequence. During annotation, we assign Gene Ontology terms to the predicted gene products, and observe clustering of some malaria-specific terms to specific chromosomes. We identify a highly conserved sequence element found in the intergenic region of internal var genes that is not associated with their telomeric counterparts.
Asunto(s)
ADN Protozoario , Plasmodium falciparum/genética , Animales , Secuencia de Bases , Cromosomas , Genes Protozoarios , Genoma de Protozoos , Datos de Secuencia Molecular , Familia de Multigenes , Proteoma , Proteínas Protozoarias/genética , Análisis de Secuencia de ADNRESUMEN
We have sequenced and annotated the genome of fission yeast (Schizosaccharomyces pombe), which contains the smallest number of protein-coding genes yet recorded for a eukaryote: 4,824. The centromeres are between 35 and 110 kilobases (kb) and contain related repeats including a highly conserved 1.8-kb element. Regions upstream of genes are longer than in budding yeast (Saccharomyces cerevisiae), possibly reflecting more-extended control regions. Some 43% of the genes contain introns, of which there are 4,730. Fifty genes have significant similarity with human disease genes; half of these are cancer related. We identify highly conserved genes important for eukaryotic cell organization including those required for the cytoskeleton, compartmentation, cell-cycle control, proteolysis, protein phosphorylation and RNA splicing. These genes may have originated with the appearance of eukaryotic life. Few similarly conserved genes that are important for multicellular organization were identified, suggesting that the transition from prokaryotes to eukaryotes required more new genes than did the transition from unicellular to multicellular organization.