Your browser doesn't support javascript.
loading
Show: 20 | 50 | 100
Results 1 - 20 de 37
Filter
1.
Mol Microbiol ; 109(3): 327-344, 2018 08.
Article in English | MEDLINE | ID: mdl-29802740

ABSTRACT

We found that mutations that increased the transcription of the mgtCBR (Mg2+ transport-related) operon conferred increased thermotolerance on this organism. The 5' leader of the mgtCBR mRNA contains two short open reading frames (ORFs), mgtM and mgtP, whose translation regulates the expression of the mgtCBR operon by a mechanism that is similar to attenuation in amino acid biosynthetic operons. We obtained two types of mutations that resulted in elevated transcription of the operon: defects in the mgtM ribosome-binding site, impairing the translation of this ORF and deletions encompassing the stop codon of mgtM that extend the translation of this ORF across a downstream Rho termination site. These mgtM mutations give further insights into the mechanism of the transcriptional control of the mgtCBR operon that we discuss in this work. We show that the increased thermotolerance requires elevated expression of the mgtC gene, but functional mgtB and mgtR, which respectively encode an Mg2+ transporter and a regulatory protein, are dispensable for this response. MgtC has been shown to have complex functions, including a requirement for virulence, flagella-independent motility and synthesis of cellulose and we now found that it has a role in the regulation of thermotolerance.


Subject(s)
Bacterial Proteins/metabolism , Base Sequence , Cation Transport Proteins/metabolism , Salmonella typhimurium/genetics , Sequence Deletion , Thermotolerance/genetics , Adenosine Triphosphatases/genetics , Adenosine Triphosphatases/metabolism , Bacterial Proteins/genetics , Cation Transport Proteins/genetics , Heat-Shock Response/genetics , Open Reading Frames/genetics , Operon/genetics , RNA, Messenger/genetics , RNA, Messenger/metabolism , Serogroup , Virulence/genetics
2.
BMC Genomics ; 19(1): 233, 2018 Apr 04.
Article in English | MEDLINE | ID: mdl-29618317

ABSTRACT

BACKGROUND: Management requires a robust understanding of between- and within-species genetic variability, however such data are still lacking in many species. For example, although multiple population genetics studies of the peregrine falcon (Falco peregrinus) have been conducted, no similar studies have been done of the closely-related prairie falcon (F. mexicanus) and it is unclear how much genetic variation and population structure exists across the species' range. Furthermore, the phylogenetic relationship of F. mexicanus relative to other falcon species is contested. We utilized a genomics approach (i.e., genome sequencing and assembly followed by single nucleotide polymorphism genotyping) to rapidly address these gaps in knowledge. RESULTS: We sequenced the genome of a single female prairie falcon and generated a 1.17 Gb (gigabases) draft genome assembly. We generated maximum likelihood phylogenetic trees using complete mitochondrial genomes as well as nuclear protein-coding genes. This process provided evidence that F. mexicanus is an outgroup to the clade that includes the peregrine falcon and members of the subgenus Hierofalco. We annotated > 16,000 genes and almost 600,000 high-quality single nucleotide polymorphisms (SNPs) in the nuclear genome, providing the raw material for a SNP assay design featuring > 140 gene-associated markers and a molecular-sexing marker. We subsequently genotyped ~ 100 individuals from California (including the San Francisco East Bay Area, Pinnacles National Park and the Mojave Desert) and Idaho (Snake River Birds of Prey National Conservation Area). We tested for population structure and found evidence that individuals sampled in California and Idaho represent a single panmictic population. CONCLUSIONS: Our study illustrates how genomic resources can rapidly shed light on genetic variability in understudied species and resolve phylogenetic relationships. Furthermore, we found evidence of a single, randomly mating population of prairie falcons across our sampling locations. Prairie falcons are highly mobile and relatively rare long-distance dispersal events may promote gene flow throughout the range. As such, California's prairie falcons might be managed as a single population, indicating that management actions undertaken to benefit the species at the local level have the potential to influence the species as a whole.


Subject(s)
Avian Proteins/genetics , Falconiformes/classification , Polymorphism, Single Nucleotide , Whole Genome Sequencing/veterinary , Animals , California , Falconiformes/genetics , Female , Genetics, Population , Idaho , Mitochondria/genetics , Phylogeny , Phylogeography
3.
Plant Physiol ; 171(3): 1905-20, 2016 07.
Article in English | MEDLINE | ID: mdl-27217494

ABSTRACT

Traditional marker-based mapping and next-generation sequencing was used to determine that the Arabidopsis (Arabidopsis thaliana) low cell wall arabinose mutant murus5 (mur5) encodes a defective allele of REVERSIBLY GLYCOSYLATED POLYPEPTIDE2 (RGP2). Marker analysis of 13 F2 confirmed mutant progeny from a recombinant mapping population gave a rough map position on the upper arm of chromosome 5, and deep sequencing of DNA from these 13 lines gave five candidate genes with GĆ¢Ā†Ā’A (CĆ¢Ā†Ā’T) transitions predicted to result in amino acid changes. Of these five, only insertional mutant alleles of RGP2, a gene that encodes a UDP-arabinose mutase that interconverts UDP-arabinopyranose and UDP-arabinofuranose, exhibited the low cell wall arabinose phenotype. The identities of mur5 and two SALK insertional alleles were confirmed by allelism tests and overexpression of wild-type RGP2 complementary DNA placed under the control of the 35S promoter in the three alleles. The mur5 mutation results in the conversion of cysteine-257 to tyrosine-257 within a conserved hydrophobic cluster predicted to be distal to the active site and essential for protein stability and possible heterodimerization with other isoforms of RGP.


Subject(s)
Arabidopsis Proteins/genetics , Arabidopsis Proteins/metabolism , Arabidopsis/cytology , Arabinose/metabolism , Cell Wall/metabolism , Glucosyltransferases/genetics , Glucosyltransferases/metabolism , Alleles , Arabidopsis/genetics , Arabidopsis Proteins/chemistry , Arabinose/genetics , Cell Wall/genetics , Chromosome Mapping , Chromosomes, Plant , Gene Expression Regulation, Plant , Genetic Complementation Test , Glucosyltransferases/chemistry , High-Throughput Nucleotide Sequencing , Mutation , Plants, Genetically Modified , Protein Domains , Protein Folding , Protein Stability , Sequence Homology, Amino Acid
4.
Nat Rev Genet ; 8(12): 973-82, 2007 Dec.
Article in English | MEDLINE | ID: mdl-17984973

ABSTRACT

Our knowledge of the structure and composition of genomes is rapidly progressing in pace with their sequencing. The emerging data show that a significant portion of eukaryotic genomes is composed of transposable elements (TEs). Given the abundance and diversity of TEs and the speed at which large quantities of sequence data are emerging, identification and annotation of TEs presents a significant challenge. Here we propose the first unified hierarchical classification system, designed on the basis of the transposition mechanism, sequence similarities and structural relationships, that can be easily applied by non-experts. The system and nomenclature is kept up to date at the WikiPoson web site.


Subject(s)
DNA Transposable Elements/genetics , Eukaryotic Cells/physiology , Terminology as Topic , Animals
5.
Sci Rep ; 13(1): 1486, 2023 01 27.
Article in English | MEDLINE | ID: mdl-36707633

ABSTRACT

Gram-negative bacterial septicemia is mediated through binding of lipopolysaccharide (LPS) to mammalian toll-like receptor protein 4 (TLR4). TLR4 and its cognate protein, myeloid differentiation factor 2 (MD2) form a heterodimeric complex after binding LPS. This complex induces a cascade of reactions that results in increased proinflammatory cytokine gene expression, including TNFα, which leads to activation of innate immunity. In horses, the immune response to LPS varies widely. To determine if this variation is due to differences in TLR4 or MD2, DNA from 15 healthy adult horses with different TNFα dynamics after experimental intravenous LPS infusion was sequenced across exons of TLR4 and MD2. Haplotypes were constructed for both genes using all identified variants. Four haplotypes were observed for each gene. No significant associations were found between either TNFα baseline concentrations or response to LPS and haplotype; however, there was a significant association (P value = 0.0460) between the baseline TNFα concentration and one MD2 missense variant. Three-dimensional structures of the equine TLR4-MD2-LPS complex were built according to haplotype combinations observed in the study horses, and the implications of missense variants on LPS binding were modeled. Although the sample size was small, there was no evidence that variation in TLR4 or MD2 explains the variability in TNFα response observed after LPS exposure in horses.


Subject(s)
Lipopolysaccharides , Toll-Like Receptor 4 , Animals , Horses , Toll-Like Receptor 4/genetics , Toll-Like Receptor 4/metabolism , Tumor Necrosis Factor-alpha/genetics , Tumor Necrosis Factor-alpha/metabolism , Lymphocyte Antigen 96/metabolism , Toll-Like Receptors/metabolism , Lipopolysaccharide Receptors/metabolism , Mammals/metabolism
6.
J Bacteriol ; 194(6): 1605-6, 2012 Mar.
Article in English | MEDLINE | ID: mdl-22374945

ABSTRACT

Mycoplasma haemocanis is a blood pathogen that may cause acute disease in immunosuppressed or splenectomized dogs. The genome of the strain Illinois is a single circular chromosome with 919,992 bp and a GC content of 35%. Analyses of the M. haemocanis genome will provide insights into its biology and in vitro cultivation requirements.


Subject(s)
DNA, Bacterial/chemistry , DNA, Bacterial/genetics , Genome, Bacterial , Mycoplasma/genetics , Animals , Bacteremia/microbiology , Bacteremia/veterinary , Base Composition , DNA, Circular/chemistry , DNA, Circular/genetics , Dog Diseases/microbiology , Dogs , Molecular Sequence Data , Mycoplasma/isolation & purification , Sequence Analysis, DNA
7.
J Bacteriol ; 194(19): 5458-9, 2012 Oct.
Article in English | MEDLINE | ID: mdl-22965086

ABSTRACT

Mycoplasma wenyonii is a hemotrophic mycoplasma that causes acute and chronic infections in cattle. Here, we announce the first complete genome sequence of this organism. The genome is a single circular chromosome with 650,228 bp and G+C% of 33.9. Analyses of M. wenyonii genome will provide insights into its biology.


Subject(s)
Mycoplasma/genetics , Genome, Bacterial , Molecular Sequence Data , Mycoplasma/classification
8.
Vet Res ; 43: 66, 2012 Sep 28.
Article in English | MEDLINE | ID: mdl-23020168

ABSTRACT

Mycoplasma haemocanis is a hemotrophic mycoplasma (hemoplasma), blood pathogen that may cause acute disease in immunosuppressed or splenectomized dogs. The genome of the strain Illinois, isolated from blood of a naturally infected dog, has been entirely sequenced and annotated to gain a better understanding of the biology of M. haemocanis. Its single circular chromosome has 919 992 bp and a low G + C content (35%), representing a typical mycoplasmal genome. A gene-by-gene comparison against its feline counterpart, M. haemofelis, reveals a very similar composition and architecture with most of the genes having conserved synteny extending over their entire chromosomes and differing only by a small set of unique protein coding sequences. As in M. haemofelis, M. haemocanis metabolic pathways are reduced and apparently rely heavily on the nutrients afforded by its host environment. The presence of a major percentage of its genome dedicated to paralogous genes (63.7%) suggests that this bacterium might use antigenic variation as a mechanism to evade the host's immune system as also observed in M. haemofelis genome. Phylogenomic comparisons based on average nucleotide identity (ANI) and tetranucleotide signature suggest that these two pathogens are different species of mycoplasmas, with M. haemocanis infecting dogs and M. haemofelis infecting cats.


Subject(s)
Cat Diseases/microbiology , Dog Diseases/microbiology , Genome, Bacterial , Mycoplasma Infections/veterinary , Mycoplasma/classification , Mycoplasma/genetics , Animals , Cat Diseases/epidemiology , Cats , Dog Diseases/epidemiology , Dogs , Molecular Sequence Data , Mycoplasma Infections/epidemiology , Mycoplasma Infections/microbiology , Sequence Analysis, DNA/veterinary
9.
PLoS Genet ; 5(11): e1000732, 2009 Nov.
Article in English | MEDLINE | ID: mdl-19936065

ABSTRACT

Recent comprehensive sequence analysis of the maize genome now permits detailed discovery and description of all transposable elements (TEs) in this complex nuclear environment. Reiteratively optimized structural and homology criteria were used in the computer-assisted search for retroelements, TEs that transpose by reverse transcription of an RNA intermediate, with the final results verified by manual inspection. Retroelements were found to occupy the majority (>75%) of the nuclear genome in maize inbred B73. Unprecedented genetic diversity was discovered in the long terminal repeat (LTR) retrotransposon class of retroelements, with >400 families (>350 newly discovered) contributing >31,000 intact elements. The two other classes of retroelements, SINEs (four families) and LINEs (at least 30 families), were observed to contribute 1,991 and approximately 35,000 copies, respectively, or a combined approximately 1% of the B73 nuclear genome. With regard to fully intact elements, median copy numbers for all retroelement families in maize was 2 because >250 LTR retrotransposon families contained only one or two intact members that could be detected in the B73 draft sequence. The majority, perhaps all, of the investigated retroelement families exhibited non-random dispersal across the maize genome, with LINEs, SINEs, and many low-copy-number LTR retrotransposons exhibiting a bias for accumulation in gene-rich regions. In contrast, most (but not all) medium- and high-copy-number LTR retrotransposons were found to preferentially accumulate in gene-poor regions like pericentromeric heterochromatin, while a few high-copy-number families exhibited the opposite bias. Regions of the genome with the highest LTR retrotransposon density contained the lowest LTR retrotransposon diversity. These results indicate that the maize genome provides a great number of different niches for the survival and procreation of a great variety of retroelements that have evolved to differentially occupy and exploit this genomic diversity.


Subject(s)
Evolution, Molecular , Genetic Variation , Genome, Plant/genetics , Retroelements/genetics , Zea mays/genetics , Analysis of Variance , Base Sequence , Centromere/genetics , Chromosomes, Plant/genetics , Gene Dosage/genetics , Molecular Sequence Data , Mutagenesis, Insertional/genetics , Short Interspersed Nucleotide Elements/genetics , Terminal Repeat Sequences/genetics
10.
BMC Genomics ; 12: 142, 2011 Mar 08.
Article in English | MEDLINE | ID: mdl-21385395

ABSTRACT

BACKGROUND: Extensive DNA rearrangement of genic colinearity, as revealed by comparison of orthologous genomic regions, has been shown to be a general concept describing evolutionary dynamics of plant genomes. However, the nature, timing, lineages and adaptation of local genomic rearrangement in closely related species (e.g., within a genus) and haplotype variation of genomic rearrangement within populations have not been well documented. RESULTS: We previously identified a hotspot for genic rearrangement and transposon accumulation in the Orp region of Asian rice (Oryza sativa, AA) by comparison with its orthologous region in sorghum. Here, we report the comparative analysis of this region with its orthologous regions in the wild progenitor species (O. nivara, AA) of Asian rice and African rice (O. glaberrima) using the BB genome Oryza species (O. punctata) as an outgroup, and investigation of transposon insertion sites and a segmental inversion event in the AA genomes at the population level. We found that Orp region was primarily and recently expanded in the Asian rice species O. sativa and O. nivara. LTR-retrotransposons shared by the three AA-genomic regions have been fixed in all the 94 varieties that represent different populations of the AA-genome species/subspecies, indicating their adaptive role in genome differentiation. However, LTR-retrotransposons unique to either O. nivara or O. sativa regions exhibited dramatic haplotype variation regarding their presence or absence between or within populations/subpopulations. CONCLUSIONS: The LTR-retrotransposon insertion hotspot in the Orp region was formed recently, independently and concurrently in different AA-genome species, and that the genic rearrangements detected in different species appear to be differentially triggered by transposable elements. This region is located near the end of the short arm of chromosome 8 and contains a high proportion of LTR-retrotransposons similar to observed in the centromeric region of this same chromosome, and thus may represent a genomic region that has recently switched from euchromatic to heterochromatic states. The haplotype variation of LTR-retrotransposon insertions within this region reveals substantial admixture among various subpopulations as established by molecular markers at the whole genome level, and can be used to develop retrotransposon junction markers for simple and rapid classification of O. sativa germplasm.


Subject(s)
Gene Rearrangement , Genetics, Population , Genome, Plant , Oryza/genetics , Retroelements , Chromosomes, Artificial, Bacterial , Comparative Genomic Hybridization , DNA Transposable Elements , DNA, Plant/genetics , Evolution, Molecular , Haplotypes , Molecular Sequence Annotation , Molecular Sequence Data , Phylogeny , Sequence Analysis, DNA , Sequence Inversion , Species Specificity
11.
Vet Res ; 42: 102, 2011 Sep 21.
Article in English | MEDLINE | ID: mdl-21936946

ABSTRACT

Mycoplasma haemofelis is a mycoplasmal pathogen (hemoplasma) that attaches to the host's erythrocytes. Distributed worldwide, it has a significant impact on the health of cats causing acute disease and, despite treatment, establishing chronic infection. It might also have a role as a zoonotic agent, especially in immunocompromised patients. Whole genome sequencing and analyses of M. haemofelis strain Ohio2 was undertaken as a step toward understanding its survival and persistence. Metabolic pathways are reduced, relying on the host to supply many of the nutrients and metabolites needed for survival. M. haemofelis must import glucose for ATP generation and ribose derivates for RNA/DNA synthesis. Hypoxanthine, adenine, guanine, uracil and CMP are scavenged from the environment to support purine and pyrimidine synthesis. In addition, nicotinamide, amino acids and any vitamins needed for growth, must be acquired from its environment. The core proteome of M. haemofelis contains an abundance of paralogous gene families, corresponding to 70.6% of all the CDSs. This "paralog pool" is a rich source of different antigenic epitopes that can be varied to elude the host's immune system and establish chronic infection. M. haemofelis also appears to be capable of phase variation, which is particularly relevant to the cyclic bacteremia and persistence, characteristics of the infection in the cat. The data generated herein should be of great use for understanding the mechanisms of M. haemofelis infection. Further, it will provide new insights into its pathogenicity and clues needed to formulate media to support the in vitro cultivation of M. haemofelis.


Subject(s)
Cat Diseases/microbiology , Genome, Bacterial , Mycoplasma Infections/microbiology , Mycoplasma/genetics , Proteome , Adaptation, Biological , Animals , Cats , Chromosome Mapping/veterinary , Electrophoresis, Agar Gel/veterinary , Erythrocytes/microbiology , Molecular Sequence Annotation , Molecular Sequence Data , Mycoplasma/physiology , Sequence Analysis, DNA/veterinary , Spectrophotometry, Ultraviolet/veterinary
12.
Plant J ; 60(5): 820-31, 2009 Dec.
Article in English | MEDLINE | ID: mdl-19702667

ABSTRACT

Most eukaryotic centromeres contain large quantities of repetitive DNA, such as satellite repeats and retrotransposons. Unlike most transposons in plant genomes, the centromeric retrotransposon (CR) family is conserved over long evolutionary periods among a majority of the grass species. CR elements are highly concentrated in centromeres, and are likely to play a role in centromere function. In order to study centromere evolution in the Oryza (rice) genus, we sequenced the orthologous region to centromere 8 of Oryza sativa from a related species, Oryza brachyantha. We found that O. brachyantha does not have the canonical CRR (CR of rice) found in the centromeres of all other Oryza species. Instead, a new Ty3-gypsy (Metaviridae) retroelement (FRetro3) was found to colonize the centromeres of this species. This retroelement is found in high copy numbers in the O. brachyantha genome, but not in other Oryza genomes, and based on the dating of long terminal repeats (LTRs) of FRetro3 it was amplified in the genome in the last few million years. Interestingly, there is a high level of removal of FRetro3 based on solo-LTRs to full-length elements, and this rapid turnover may have played a role in the replacement of the canonical CRR with the new element by active deletion. Comparison with previously described ChIP cloning data revealed that FRetro3 is found in CENH3-associated chromatin sequences. Thus, within a single lineage of the Oryza genus, the canonical component of grass centromeres has been replaced with a new retrotransposon that has all the hallmarks of a centromeric retroelement.


Subject(s)
Centromere/chemistry , Oryza/genetics , Retroelements , Base Sequence , Conserved Sequence , Genome, Plant , Phylogeny , Sequence Analysis, DNA
13.
BMC Plant Biol ; 10: 81, 2010 May 04.
Article in English | MEDLINE | ID: mdl-20441596

ABSTRACT

BACKGROUND: Strawberry (Fragaria spp.) is the familiar name of a group of economically important crop plants and wild relatives that also represent an emerging system for the study of gene and genome evolution. Its small stature, rapid seed-to-seed cycle, transformability and miniscule basic genome make strawberry an attractive system to study processes related to plant physiology, development and crop production; yet it lacks substantial genomics-level resources. This report addresses this deficiency by characterizing 0.71 Mbp of gene space from a diploid species (F. vesca). The twenty large genomic tracks (30-52 kb) captured as fosmid inserts comprise gene regions with roles in flowering, disease resistance, and metabolism. RESULTS: A detailed description of the studied regions reveals 131 Blastx-supported gene sites and eight additional EST-supported gene sites. Only 15 genes have complete EST coverage, enabling gene modelling, while 76 lack EST support. Instances of microcolinearity with Arabidopsis thaliana were identified in twelve inserts. A relatively high portion (25%) of targeted genes were found in unanticipated tandem duplications. The effectiveness of six FGENESH training models was assessed via comparisons among ab initio predictions and homology-based gene and start/stop codon identifications. Fourteen transposable-element-related sequences and 158 simple sequence repeat loci were delineated. CONCLUSIONS: This report details the structure and content of targeted regions of the strawberry genome. The data indicate that the strawberry genome is gene-dense, with an average of one protein-encoding gene or pseudogene per 5.9 kb. Current overall EST coverage is sparse. The unexpected gene duplications and their differential patterns of EST support suggest possible subfunctionalization or pseudogenization of these sequences. This report provides a high-resolution depiction of targeted gene neighborhoods that will aid whole-genome sequence assembly, provide valuable tools for plant breeders and advance the understanding of strawberry genome evolution.


Subject(s)
Fragaria/genetics , Genome, Plant , Computational Biology , DNA, Plant/genetics , Expressed Sequence Tags , Genes, Plant , Genomic Library , Pseudogenes , Repetitive Sequences, Nucleic Acid , Sequence Analysis, DNA
14.
BMC Genomics ; 9: 621, 2008 Dec 19.
Article in English | MEDLINE | ID: mdl-19099592

ABSTRACT

BACKGROUND: Many plant genomes are resistant to whole-genome assembly due to an abundance of repetitive sequence, leading to the development of gene-rich sequencing techniques. Two such techniques are hypomethylated partial restriction (HMPR) and methylation spanning linker libraries (MSLL). These libraries differ from other gene-rich datasets in having larger insert sizes, and the MSLL clones are designed to provide reads localized to "epigenetic boundaries" where methylation begins or ends. RESULTS: A large-scale study in maize generated 40,299 HMPR sequences and 80,723 MSLL sequences, including MSLL clones exceeding 100 kb. The paired end reads of MSLL and HMPR clones were shown to be effective in linking existing gene-rich sequences into scaffolds. In addition, it was shown that the MSLL clones can be used for anchoring these scaffolds to a BAC-based physical map. The MSLL end reads effectively identified epigenetic boundaries, as indicated by their preferential alignment to regions upstream and downstream from annotated genes. The ability to precisely map long stretches of fully methylated DNA sequence is a unique outcome of MSLL analysis, and was also shown to provide evidence for errors in gene identification. MSLL clones were observed to be significantly more repeat-rich in their interiors than in their end reads, confirming the correlation between methylation and retroelement content. Both MSLL and HMPR reads were found to be substantially gene-enriched, with the SalI MSLL libraries being the most highly enriched (31% align to an EST contig), while the HMPR clones exhibited exceptional depletion of repetitive DNA (to approximately 11%). These two techniques were compared with other gene-enrichment methods, and shown to be complementary. CONCLUSION: MSLL technology provides an unparalleled approach for mapping the epigenetic status of repetitive blocks and for identifying sequences mis-identified as genes. Although the types and natures of epigenetic boundaries are barely understood at this time, MSLL technology flags both approximate boundaries and methylated genes that deserve additional investigation. MSLL and HMPR sequences provide a valuable resource for maize genome annotation, and are a uniquely valuable complement to any plant genome sequencing project. In order to make these results fully accessible to the community, a web display was developed that shows the alignment of MSLL, HMPR, and other gene-rich sequences to the BACs; this display is continually updated with the latest ESTs and BAC sequences.


Subject(s)
Chromosome Mapping/methods , DNA Methylation , Genome, Plant , Zea mays/genetics , Chromosomes, Artificial, Bacterial , DNA, Plant/genetics , Epigenesis, Genetic , Gene Library , Genomics/methods , Sequence Alignment , Sequence Analysis, DNA/methods
15.
Nucleic Acids Res ; 34(Database issue): D771-6, 2006 Jan 01.
Article in English | MEDLINE | ID: mdl-16381977

ABSTRACT

Maize is a staple crop of the grass family and also an excellent model for plant genetics. Owing to the large size and repetitiveness of its genome, we previously investigated two approaches to accelerate gene discovery and genome analysis in maize: methylation filtration and high C(0)t selection. These techniques allow the construction of gene-enriched genomic libraries by minimizing repeat sequences due to either their methylation status or their copy number, yielding a 7-fold enrichment in genic sequences relative to a random genomic library. Approximately 900,000 gene-enriched reads from maize were generated and clustered into Assembled Zea mays (AZM) sequences. Here we report the current AZM release, which consists of approximately 298 Mb representing 243,807 sequence assemblies and singletons. In order to provide a repository of publicly available maize genomic sequences, we have created the TIGR Maize Database (http://maize.tigr.org). In this resource, we have assembled and annotated the AZMs and used available sequenced markers to anchor AZMs to maize chromosomes. We have constructed a maize repeat database and generated draft sequence assemblies of 287 maize bacterial artificial chromosome (BAC) clone sequences, which we annotated along with 172 additional publicly available BAC clones. All sequences, assemblies and annotations are available at the project website via web interfaces and FTP downloads.


Subject(s)
Databases, Nucleic Acid , Genome, Plant , Zea mays/genetics , Chromosome Mapping , Chromosomes, Artificial, Bacterial , Genes, Plant , Genomic Library , Genomics , Internet , Repetitive Sequences, Nucleic Acid , User-Computer Interface
16.
Genetics ; 170(3): 1209-20, 2005 Jul.
Article in English | MEDLINE | ID: mdl-15834137

ABSTRACT

The homeologous Orp1 and Orp2 regions of maize and the orthologous regions in sorghum and rice were compared by generating sequence data for >486 kb of genomic DNA. At least three genic rearrangements differentiate the maize Orp1 and Orp2 segments, including an insertion of a single gene and two deletions that removed one gene each, while no genic rearrangements were detected in the maize Orp2 region relative to sorghum. Extended comparison of the orthologous Orp regions of sorghum and japonica rice uncovered numerous genic rearrangements and the presence of a transposon-rich region in rice. Only 11 of 27 genes (40%) are arranged in the same order and orientation between sorghum and rice. Of the 8 genes that are uniquely present in the sorghum region, 4 were found to have single-copy homologs in both rice and Arabidopsis, but none of these genes are located near each other, indicating frequent gene movement. Further comparison of the Orp segments from two rice subspecies, japonica and indica, revealed that the transposon-rich region is both an ancient and current hotspot for retrotransposon accumulation and genic rearrangement. We also identify unequal gene conversion as a mechanism for maize retrotransposon rearrangement.


Subject(s)
Gene Rearrangement/genetics , Genetic Variation , Genome, Plant , Oryza/genetics , Sorghum/genetics , Tryptophan Synthase/genetics , Zea mays/genetics , Base Sequence , Chromosome Mapping , Chromosomes, Artificial, Bacterial , Gene Components , Genomics/methods , Molecular Sequence Data , Sequence Analysis, DNA , Species Specificity
17.
Genetics ; 170(3): 1221-30, 2005 Jul.
Article in English | MEDLINE | ID: mdl-15879505

ABSTRACT

Little is known about the physical makeup of heterochromatin in the soybean (Glycine max L. Merr.) genome. Using DNA sequencing and molecular cytogenetics, an initial analysis of the repetitive fraction of the soybean genome is presented. BAC 076J21, derived from linkage group L, has sequences conserved in the pericentromeric heterochromatin of all 20 chromosomes. FISH analysis of this BAC and three subclones on pachytene chromosomes revealed relatively strict partitioning of the heterochromatic and euchromatic regions. Sequence analysis showed that this BAC consists primarily of repetitive sequences such as a 102-bp tandem repeat with sequence identity to a previously characterized approximately 120-bp repeat (STR120). Fragments of Calypso-like retroelements, a recently inserted SIRE1 element, and a SIRE1 solo LTR were present within this BAC. Some of these sequences are methylated and are not conserved outside of G. max and G. soja, a close relative of soybean, except for STR102, which hybridized to a restriction fragment from G. latifolia. These data present a picture of the repetitive fraction of the soybean genome that is highly concentrated in the pericentromeric regions, consisting of rapidly evolving tandem repeats with interspersed retroelements.


Subject(s)
Chromosomes, Plant/genetics , Evolution, Molecular , Glycine max/genetics , Heterochromatin/genetics , Retroelements/genetics , Tandem Repeat Sequences/genetics , Base Sequence , Blotting, Southern , Chromosomes, Artificial, Bacterial , In Situ Hybridization, Fluorescence , Molecular Sequence Data , Sequence Analysis, DNA
18.
Genetics ; 162(3): 1389-400, 2002 Nov.
Article in English | MEDLINE | ID: mdl-12454082

ABSTRACT

Orthologous regions in barley, rice, sorghum, and wheat were studied by bacterial artificial chromosome sequence analysis. General microcolinearity was observed for the four shared genes in this region. However, three genic rearrangements were observed. First, the rice region contains a cluster of 48 predicted small nucleolar RNA genes, but the comparable region from sorghum contains no homologous loci. Second, gene 2 was inverted in the barley lineage by an apparent unequal recombination after the ancestors of barley and wheat diverged, 11-15 million years ago (mya). Third, gene 4 underwent direct tandem duplication in a common ancestor of barley and wheat 29-41 mya. All four of the shared genes show the same synonymous substitution rate, but nonsynonymous substitution rates show significant variations between genes 4a and 4b, suggesting that gene 4b was largely released from the strong purifying selection that acts on gene 4a in both barley and wheat. Intergenic retrotransposon blocks, many of them organized as nested insertions, mostly account for the lower gene density of the barley and wheat regions. All but two of the retrotransposons were found in the regions between genes, while all but 2 of the 51 inverted repeat transposable elements were found as insertions in genic regions and outside the retrotransposon blocks.


Subject(s)
Biological Evolution , Edible Grain/genetics , Genome, Plant , Chromosomes, Artificial, Bacterial , DNA Transposable Elements , Molecular Sequence Data , Point Mutation , Sequence Analysis, DNA
19.
Methods Mol Biol ; 722: 51-60, 2011.
Article in English | MEDLINE | ID: mdl-21590412

ABSTRACT

Since the first fungal genome was sequenced in 1996, sequencing technologies have advanced dramatically. In recent years, it has become possible to cost-effectively generate vast amounts of DNA sequence data using a number of cell- and electrophoresis-free sequencing technologies, commonly known as "next" or "second" generation. In this chapter, we present a brief overview of next-generation sequencers that are commercially available now. Their potential applications in fungal genomics studies are discussed.


Subject(s)
Fungi/genetics , Gene Expression Profiling/methods , Genomics/methods , High-Throughput Nucleotide Sequencing/methods , Sequence Analysis, DNA/methods , Gene Expression Profiling/instrumentation , Gene Expression Profiling/trends , Genome, Fungal/genetics , High-Throughput Nucleotide Sequencing/instrumentation , High-Throughput Nucleotide Sequencing/trends , Sequence Analysis, DNA/instrumentation , Sequence Analysis, DNA/trends
20.
PLoS One ; 6(5): e19574, 2011 May 10.
Article in English | MEDLINE | ID: mdl-21573007

ABSTRACT

Mycoplasma suis, the causative agent of porcine infectious anemia, has never been cultured in vitro and mechanisms by which it causes disease are poorly understood. Thus, the objective herein was to use whole genome sequencing and analysis of M. suis to define pathogenicity mechanisms and biochemical pathways. M. suis was harvested from the blood of an experimentally infected pig. Following DNA extraction and construction of a paired end library, whole-genome sequencing was performed using GS-FLX (454) and Titanium chemistry. Reads on paired-end constructs were assembled using GS De Novo Assembler and gaps closed by primer walking; assembly was validated by PFGE. Glimmer and Manatee Annotation Engine were used to predict and annotate protein-coding sequences (CDS). The M. suis genome consists of a single, 742,431 bp chromosome with low G+C content of 31.1%. A total of 844 CDS, 3 single copies, unlinked rRNA genes and 32 tRNAs were identified. Gene homologies and GC skew graph show that M. suis has a typical Mollicutes oriC. The predicted metabolic pathway is concise, showing evidence of adaptation to blood environment. M. suis is a glycolytic species, obtaining energy through sugars fermentation and ATP-synthase. The pentose-phosphate pathway, metabolism of cofactors and vitamins, pyruvate dehydrogenase and NAD(+) kinase are missing. Thus, ribose, NADH, NADPH and coenzyme A are possibly essential for its growth. M. suis can generate purines from hypoxanthine, which is secreted by RBCs, and cytidine nucleotides from uracil. Toxins orthologs were not identified. We suggest that M. suis may cause disease by scavenging and competing for host' nutrients, leading to decreased life-span of RBCs. In summary, genome analysis shows that M. suis is dependent on host cell metabolism and this characteristic is likely to be linked to its pathogenicity. The prediction of essential nutrients will aid the development of in vitro cultivation systems.


Subject(s)
Adaptation, Physiological/genetics , Erythrocytes/microbiology , Genome, Bacterial/genetics , Mycoplasma/genetics , Sequence Analysis, DNA , Animals , Base Sequence , Female , Genes, Bacterial/genetics , Metabolic Networks and Pathways/genetics , Molecular Sequence Data , Multigene Family/genetics , Mycoplasma/pathogenicity , Open Reading Frames/genetics , Replication Origin/genetics , Sequence Homology, Nucleic Acid , Sus scrofa , Virulence/genetics
SELECTION OF CITATIONS
SEARCH DETAIL