Your browser doesn't support javascript.
loading
Show: 20 | 50 | 100
Results 1 - 20 de 26
Filter
Add more filters










Publication year range
1.
PLoS One ; 16(7): e0253267, 2021.
Article in English | MEDLINE | ID: mdl-34228724

ABSTRACT

We report a new subgroup of Type III Restriction-Modification systems that use m4C methylation for host protection. Recognition specificities for six such systems, each recognizing a novel motif, have been determined using single molecule real-time DNA sequencing. In contrast to all previously characterized Type III systems which modify adenine to m6A, protective methylation of the host genome in these new systems is achieved by the N4-methylation of a cytosine base in one strand of an asymmetric 4 to 6 base pair recognition motif. Type III systems are heterotrimeric enzyme complexes containing a single copy of an ATP-dependent restriction endonuclease-helicase (Res) and a dimeric DNA methyltransferase (Mod). The Type III Mods are beta-class amino-methyltransferases, examples of which form either N6-methyl adenine or N4-methyl cytosine in Type II RM systems. The Type III m4C Mod and Res proteins are diverged, suggesting ancient origin or that m4C modification has arisen from m6A MTases multiple times in diverged lineages. Two of the systems, from thermophilic organisms, required expression of both Mod and Res to efficiently methylate an E. coli host, unlike previous findings that Mod alone is proficient at modification, suggesting that the division of labor between protective methylation and restriction activities is atypical in these systems. Two of the characterized systems, and many homologous putative systems, appear to include a third protein; a conserved putative helicase/ATPase subunit of unknown function and located 5' of the mod gene. The function of this additional ATPase is not yet known, but close homologs co-localize with the typical Mod and Res genes in hundreds of putative Type III systems. Our findings demonstrate a rich diversity within Type III RM systems.


Subject(s)
Cytosine , DNA Methylation , DNA Restriction-Modification Enzymes/genetics , DNA/metabolism , Cytosine/metabolism , DNA Modification Methylases/chemistry , DNA Modification Methylases/genetics , DNA Modification Methylases/metabolism , DNA Restriction Enzymes/chemistry , DNA Restriction Enzymes/genetics , DNA Restriction Enzymes/metabolism , DNA Restriction-Modification Enzymes/chemistry , DNA Restriction-Modification Enzymes/metabolism , Escherichia coli/genetics , Escherichia coli Proteins/genetics , Gas Chromatography-Mass Spectrometry , Sequence Alignment , Sequence Analysis, DNA
2.
J Clin Invest ; 127(11): 3970-3986, 2017 Nov 01.
Article in English | MEDLINE | ID: mdl-28945202

ABSTRACT

Consumption of human breast milk (HBM) attenuates the incidence of necrotizing enterocolitis (NEC), which remains a leading and intractable cause of mortality in preterm infants. Here, we report that this diminution correlates with alterations in the gut microbiota, particularly enrichment of Propionibacterium species. Transfaunation of microbiota from HBM-fed preterm infants or a newly identified and cultured Propionibacterium strain, P. UF1, to germfree mice conferred protection against pathogen infection and correlated with profound increases in intestinal Th17 cells. The induction of Th17 cells was dependent on bacterial dihydrolipoamide acetyltransferase (DlaT), a major protein expressed on the P. UF1 surface layer (S-layer). Binding of P. UF1 to its cognate receptor, SIGNR1, on dendritic cells resulted in the regulation of intestinal phagocytes. Importantly, transfer of P. UF1 profoundly mitigated induced NEC-like injury in neonatal mice. Together, these results mechanistically elucidate the protective effects of HBM and P. UF1-induced immunoregulation, which safeguard against proinflammatory diseases, including NEC.


Subject(s)
Propionibacterium/immunology , Th17 Cells/physiology , Animals , Bacterial Proteins/physiology , Cell Differentiation , Colon/immunology , Colon/microbiology , Dihydrolipoyllysine-Residue Acetyltransferase/physiology , Female , Gastrointestinal Microbiome , Genome, Bacterial , Humans , Immunomodulation , Infant, Newborn , Infant, Premature , Mice, 129 Strain , Mice, Inbred BALB C , Mice, Inbred C57BL , Molecular Sequence Annotation , Propionibacterium/enzymology , Propionibacterium/genetics , Sequence Analysis, DNA
3.
Genome Announc ; 5(10)2017 Mar 09.
Article in English | MEDLINE | ID: mdl-28280013

ABSTRACT

Porphyromonas gingivalis is an oral opportunistic pathogen. Sequenced P. gingivalis laboratory strains display limited diversity in antigens that modulate host responses. Here, we present the genome sequence of A7A1-28, a strain possessing atypical fimbrillin and capsule types, with a single contig of 2,249,024 bp and a G+C content of 48.58%.

4.
Genome Announc ; 5(2)2017 Jan 12.
Article in English | MEDLINE | ID: mdl-28082501

ABSTRACT

Porphyromonas gingivalis is associated with both oral and systemic diseases. Strain-specific P. gingivalis invasion phenotypes do not reliably predict disease presentation during in vivo studies. Here, we present the genome sequence of 381, a common laboratory strain, with a single contig of 2,378,872 bp and a G+C content of 48.36%.

5.
Genome Biol Evol ; 8(9): 2952-2963, 2016 10 05.
Article in English | MEDLINE | ID: mdl-27614234

ABSTRACT

Sigma virus (DMelSV) is ubiquitous in natural populations of Drosophila melanogaster. Host-mediated, selective RNA editing of adenosines to inosines (ADAR) may contribute to control of viral infection by preventing transcripts from being transported into the cytoplasm or being translated accurately; or by increasing the viral genomic mutation rate. Previous PCR-based studies showed that ADAR mutations occur in DMelSV at low frequency. Here we use SOLiDTM deep sequencing of flies from a single host population from Athens, GA, USA to comprehensively evaluate patterns of sequence variation in DMelSV with respect to ADAR. GA dinucleotides, which are weak targets of ADAR, are strongly overrepresented in the positive strand of the virus, consistent with selection to generate ADAR resistance on this complement of the transient, double-stranded RNA intermediate in replication and transcription. Potential ADAR sites in a worldwide sample of viruses are more likely to be "resistant" if the sites do not vary among samples. Either variable sites are less constrained and hence are subject to weaker selection than conserved sites, or the variation is driven by ADAR. We also find evidence of mutations segregating within hosts, hereafter referred to as hypervariable sites. Some of these sites were variable only in one or two flies (i.e., rare); others were shared by four or even all five of the flies (i.e., common). Rare and common hypervariable sites were indistinguishable with respect to susceptibility to ADAR; however, polymorphism in rare sites were more likely to be consistent with the action of ADAR than in common ones, again suggesting that ADAR is deleterious to the virus. Thus, in DMelSV, host mutagenesis is constraining viral evolution both within and between hosts.


Subject(s)
Drosophila melanogaster/virology , Genome, Viral , Insect Viruses/genetics , Mutation Rate , Rhabdoviridae/genetics , Adenosine/genetics , Animals , Host-Pathogen Interactions , Inosine/genetics , Mutagenesis , Polymorphism, Genetic , RNA Editing
6.
Genome Announc ; 3(6)2015 Nov 05.
Article in English | MEDLINE | ID: mdl-26543127

ABSTRACT

Porphyromonas gingivalis is associated with oral and systemic diseases. Strain-specific P. gingivalis invasion phenotypes have been correlated with disease presentation in infected laboratory animals. Here, we present the genome sequence of AJW4, a minimally invasive strain, with a single contig of 2,372,492 bp and a G+C content of 48.27%.

7.
Genome Announc ; 3(5)2015 Sep 24.
Article in English | MEDLINE | ID: mdl-26404590

ABSTRACT

Porphyromonas gingivalis is strongly associated with periodontitis. P. gingivalis strain trafficking and tissue homing differ widely, even among presumptive closely related strains, such as W83 and A7436. Here, we present the genome sequence of A7436 with a single contig of 2,367,029 bp and a G+C content of 48.33%.

8.
PLoS One ; 10(8): e0135906, 2015.
Article in English | MEDLINE | ID: mdl-26295943

ABSTRACT

A large, non-coding ATTCT repeat expansion causes the neurodegenerative disorder, spinocerebellar ataxia type 10 (SCA10). In a subset of SCA10 patients, interruption motifs are present at the 5' end of the expansion and strongly correlate with epileptic seizures. Thus, interruption motifs are a predictor of the epileptic phenotype and are hypothesized to act as a phenotypic modifier in SCA10. Yet, the exact internal sequence structure of SCA10 expansions remains unknown due to limitations in current technologies for sequencing across long extended tracts of tandem nucleotide repeats. We used the third generation sequencing technology, Single Molecule Real Time (SMRT) sequencing, to obtain full-length contiguous expansion sequences, ranging from 2.5 to 4.4 kb in length, from three SCA10 patients with different clinical presentations. We obtained sequence spanning the entire length of the expansion and identified the structure of known and novel interruption motifs within the SCA10 expansion. The exact interruption patterns in expanded SCA10 alleles will allow us to further investigate the potential contributions of these interrupting sequences to the pathogenic modification leading to the epilepsy phenotype in SCA10. Our results also demonstrate that SMRT sequencing is useful for deciphering long tandem repeats that pose as "gaps" in the human genome sequence.


Subject(s)
Ataxin-10/genetics , Epilepsy/genetics , Genome, Human , Microsatellite Repeats , Spinocerebellar Ataxias/genetics , Adult , Aged, 80 and over , Alleles , Base Sequence , Chromosome Mapping , DNA Repeat Expansion/genetics , Epilepsy/complications , Epilepsy/pathology , Female , Gene Expression , High-Throughput Nucleotide Sequencing , Humans , Introns , Male , Middle Aged , Molecular Sequence Data , Mutation , Phenotype , Spinocerebellar Ataxias/complications , Spinocerebellar Ataxias/pathology
9.
PLoS One ; 9(7): e101648, 2014.
Article in English | MEDLINE | ID: mdl-24999826

ABSTRACT

The activity of ammonia-oxidizing archaea (AOA) leads to the loss of nitrogen from soil, pollution of water sources and elevated emissions of greenhouse gas. To date, eight AOA genomes are available in the public databases, seven are from the group I.1a of the Thaumarchaeota and only one is from the group I.1b, isolated from hot springs. Many soils are dominated by AOA from the group I.1b, but the genomes of soil representatives of this group have not been sequenced and functionally characterized. The lack of knowledge of metabolic pathways of soil AOA presents a critical gap in understanding their role in biogeochemical cycles. Here, we describe the first complete genome of soil archaeon Candidatus Nitrososphaera evergladensis, which has been reconstructed from metagenomic sequencing of a highly enriched culture obtained from an agricultural soil. The AOA enrichment was sequenced with the high throughput next generation sequencing platforms from Pacific Biosciences and Ion Torrent. The de novo assembly of sequences resulted in one 2.95 Mb contig. Annotation of the reconstructed genome revealed many similarities of the basic metabolism with the rest of sequenced AOA. Ca. N. evergladensis belongs to the group I.1b and shares only 40% of whole-genome homology with the closest sequenced relative Ca. N. gargensis. Detailed analysis of the genome revealed coding sequences that were completely absent from the group I.1a. These unique sequences code for proteins involved in control of DNA integrity, transporters, two-component systems and versatile CRISPR defense system. Notably, genomes from the group I.1b have more gene duplications compared to the genomes from the group I.1a. We suggest that the presence of these unique genes and gene duplications may be associated with the environmental versatility of this group.


Subject(s)
Ammonia/metabolism , Archaea/genetics , Archaea/metabolism , Genomics , Soil Microbiology , Adaptation, Physiological/genetics , Archaea/cytology , Archaea/physiology , Biological Transport/genetics , Carbon/metabolism , Carbon Cycle/genetics , Cell Division/genetics , Chemotaxis/genetics , DNA Repair/genetics , DNA Replication/genetics , Energy Metabolism/genetics , Metals, Heavy/toxicity , Molecular Sequence Annotation , Nitrogen/metabolism , Oceans and Seas , Osmotic Pressure , Oxidation-Reduction , Phylogeny , Terpenes/metabolism
10.
Nat Biotechnol ; 32(7): 656-62, 2014 Jul.
Article in English | MEDLINE | ID: mdl-24908277

ABSTRACT

Cultivated citrus are selections from, or hybrids of, wild progenitor species whose identities and contributions to citrus domestication remain controversial. Here we sequence and compare citrus genomes--a high-quality reference haploid clementine genome and mandarin, pummelo, sweet-orange and sour-orange genomes--and show that cultivated types derive from two progenitor species. Although cultivated pummelos represent selections from one progenitor species, Citrus maxima, cultivated mandarins are introgressions of C. maxima into the ancestral mandarin species Citrus reticulata. The most widely cultivated citrus, sweet orange, is the offspring of previously admixed individuals, but sour orange is an F1 hybrid of pure C. maxima and C. reticulata parents, thus implying that wild mandarins were part of the early breeding germplasm. A Chinese wild 'mandarin' diverges substantially from C. reticulata, thus suggesting the possibility of other unrecognized wild citrus species. Understanding citrus phylogeny through genome analysis clarifies taxonomic relationships and facilitates sequence-directed genetic improvement.


Subject(s)
Breeding , Citrus/classification , Citrus/genetics , Conserved Sequence/genetics , Crops, Agricultural/genetics , Genetic Variation/genetics , Genome, Plant/genetics , Base Sequence , Evolution, Molecular , Molecular Sequence Data , Sequence Analysis, DNA , Species Specificity
11.
Genome Announc ; 1(6)2013 Nov 27.
Article in English | MEDLINE | ID: mdl-24285647

ABSTRACT

We report the complete genome sequence of Carnobacterium gilichinskyi strain WN1359, previously isolated from Siberian permafrost and capable of growth under cold (0°C), anoxic, CO2-dominated, low-pressure (0.7-kPa) conditions in a simulation of the Mars atmosphere.

12.
Genome Announc ; 1(4)2013 Aug 15.
Article in English | MEDLINE | ID: mdl-23950115

ABSTRACT

We report the complete genome sequence of Serratia liquefaciens strain ATCC 27592, which was previously identified as capable of growth under low-pressure conditions. To the best of our knowledge, this is the first announcement of the complete genome sequence of an S. liquefaciens strain.

13.
Retrovirology ; 9: 108, 2012 Dec 17.
Article in English | MEDLINE | ID: mdl-23244298

ABSTRACT

BACKGROUND: Deep sequencing provides the basis for analysis of biodiversity of taxonomically similar organisms in an environment. While extensively applied to microbiome studies, population genetics studies of viruses are limited. To define the scope of HIV-1 population biodiversity within infected individuals, a suite of phylogenetic and population genetic algorithms was applied to HIV-1 envelope hypervariable domain 3 (Env V3) within peripheral blood mononuclear cells from a group of perinatally HIV-1 subtype B infected, therapy-naïve children. RESULTS: Biodiversity of HIV-1 Env V3 quasispecies ranged from about 70 to 270 unique sequence clusters across individuals. Viral population structure was organized into a limited number of clusters that included the dominant variants combined with multiple clusters of low frequency variants. Next generation viral quasispecies evolved from low frequency variants at earlier time points through multiple non-synonymous changes in lineages within the evolutionary landscape. Minor V3 variants detected as long as four years after infection co-localized in phylogenetic reconstructions with early transmitting viruses or with subsequent plasma virus circulating two years later. CONCLUSIONS: Deep sequencing defines HIV-1 population complexity and structure, reveals the ebb and flow of dominant and rare viral variants in the host ecosystem, and identifies an evolutionary record of low-frequency cell-associated viral V3 variants that persist for years. Bioinformatics pipeline developed for HIV-1 can be applied for biodiversity studies of virome populations in human, animal, or plant ecosystems.


Subject(s)
Genetic Variation , HIV-1/genetics , Cluster Analysis , Evolution, Molecular , HIV Envelope Protein gp120/genetics , HIV Infections/virology , High-Throughput Nucleotide Sequencing , Humans , Leukocytes, Mononuclear/virology
14.
BMC Dev Biol ; 12: 33, 2012 Nov 06.
Article in English | MEDLINE | ID: mdl-23126590

ABSTRACT

BACKGROUND: The first distinct differentiation event in mammals occurs at the blastocyst stage when totipotent blastomeres differentiate into either pluripotent inner cell mass (ICM) or multipotent trophectoderm (TE). Here we determined, for the first time, global gene expression patterns in the ICM and TE isolated from bovine blastocysts. The ICM and TE were isolated from blastocysts harvested at day 8 after insemination by magnetic activated cell sorting, and cDNA sequenced using the SOLiD 4.0 system. RESULTS: A total of 870 genes were differentially expressed between ICM and TE. Several genes characteristic of ICM (for example, NANOG, SOX2, and STAT3) and TE (ELF5, GATA3, and KRT18) in mouse and human showed similar patterns in bovine. Other genes, however, showed differences in expression between ICM and TE that deviates from the expected based on mouse and human. CONCLUSION: Analysis of gene expression indicated that differentiation of blastomeres of the morula-stage embryo into the ICM and TE of the blastocyst is accompanied by differences between the two cell lineages in expression of genes controlling metabolic processes, endocytosis, hatching from the zona pellucida, paracrine and endocrine signaling with the mother, and genes supporting the changes in cellular architecture, stemness, and hematopoiesis necessary for development of the trophoblast.


Subject(s)
Blastocyst Inner Cell Mass/metabolism , Ectoderm/metabolism , Transcriptome , Trophoblasts/metabolism , Animals , Cattle , Cluster Analysis , CpG Islands , Ectoderm/cytology , Embryo Culture Techniques , Gene Expression Regulation, Developmental , Humans , Metabolic Networks and Pathways , Mice , Molecular Sequence Annotation , Promoter Regions, Genetic , Up-Regulation
15.
Astrobiology ; 12(1): 40-56, 2012 Jan.
Article in English | MEDLINE | ID: mdl-22221117

ABSTRACT

The spaceflight environment presents unique challenges to terrestrial biology, including but not limited to the direct effects of gravity. As we near the end of the Space Shuttle era, there remain fundamental questions about the response and adaptation of plants to orbital spaceflight conditions. We address a key baseline question of whether gene expression changes are induced by the orbital environment, and then we ask whether undifferentiated cells, cells presumably lacking the typical gravity response mechanisms, perceive spaceflight. Arabidopsis seedlings and undifferentiated cultured Arabidopsis cells were launched in April, 2010, as part of the BRIC-16 flight experiment on STS-131. Biologically replicated DNA microarray and averaged RNA digital transcript profiling revealed several hundred genes in seedlings and cell cultures that were significantly affected by launch and spaceflight. The response was moderate in seedlings; only a few genes were induced by more than 7-fold, and the overall intrinsic expression level for most differentially expressed genes was low. In contrast, cell cultures displayed a more dramatic response, with dozens of genes showing this level of differential expression, a list comprised primarily of heat shock-related and stress-related genes. This baseline transcriptome profiling of seedlings and cultured cells confirms the fundamental hypothesis that survival of the spaceflight environment requires adaptive changes that are both governed and displayed by alterations in gene expression. The comparison of intact plants with cultures of undifferentiated cells confirms a second hypothesis: undifferentiated cells can detect spaceflight in the absence of specialized tissue or organized developmental structures known to detect gravity.


Subject(s)
Arabidopsis/genetics , Extraterrestrial Environment , Gene Expression Regulation, Plant , Space Flight , Transcriptome/genetics , Arabidopsis/cytology , Cells, Cultured , Computers , Gene Expression Profiling , Genes, Plant/genetics , Oligonucleotide Array Sequence Analysis , Reproducibility of Results , Sample Size , Seedlings/genetics , Temperature
16.
Brief Bioinform ; 13(1): 107-21, 2012 Jan.
Article in English | MEDLINE | ID: mdl-21525143

ABSTRACT

Recent advances in massively parallel sequencing technology have created new opportunities to probe the hidden world of microbes. Taxonomy-independent clustering of the 16S rRNA gene is usually the first step in analyzing microbial communities. Dozens of algorithms have been developed in the last decade, but a comprehensive benchmark study is lacking. Here, we survey algorithms currently used by microbiologists, and compare seven representative methods in a large-scale benchmark study that addresses several issues of concern. A new experimental protocol was developed that allows different algorithms to be compared using the same platform, and several criteria were introduced to facilitate a quantitative evaluation of the clustering performance of each algorithm. We found that existing methods vary widely in their outputs, and that inappropriate use of distance levels for taxonomic assignments likely resulted in substantial overestimates of biodiversity in many studies. The benchmark study identified our recently developed ESPRIT-Tree, a fast implementation of the average linkage-based hierarchical clustering algorithm, as one of the best algorithms available in terms of computational efficiency and clustering accuracy.


Subject(s)
Algorithms , Biota , Computational Biology/methods , Cluster Analysis , Genome, Bacterial , RNA, Ribosomal, 16S/genetics , Sequence Alignment
17.
Proc Natl Acad Sci U S A ; 108(14): 5679-84, 2011 Apr 05.
Article in English | MEDLINE | ID: mdl-21282665

ABSTRACT

Ants have evolved very complex societies and are key ecosystem members. Some ants, such as the fire ant Solenopsis invicta, are also major pests. Here, we present a draft genome of S. invicta, assembled from Roche 454 and Illumina sequencing reads obtained from a focal haploid male and his brothers. We used comparative genomic methods to obtain insight into the unique features of the S. invicta genome. For example, we found that this genome harbors four adjacent copies of vitellogenin. A phylogenetic analysis revealed that an ancestral vitellogenin gene first underwent a duplication that was followed by possibly independent duplications of each of the daughter vitellogenins. The vitellogenin genes have undergone subfunctionalization with queen- and worker-specific expression, possibly reflecting differential selection acting on the queen and worker castes. Additionally, we identified more than 400 putative olfactory receptors of which at least 297 are intact. This represents the largest repertoire reported so far in insects. S. invicta also harbors an expansion of a specific family of lipid-processing genes, two putative orthologs to the transformer/feminizer sex differentiation gene, a functional DNA methylation system, and a single putative telomerase ortholog. EST data indicate that this S. invicta telomerase ortholog has at least four spliceforms that differ in their use of two sets of mutually exclusive exons. Some of these and other unique aspects of the fire ant genome are likely linked to the complex social behavior of this species.


Subject(s)
Ants/genetics , Evolution, Molecular , Genome, Insect/genetics , Genomics/methods , Phylogeny , Animals , Base Sequence , Computational Biology , DNA Methylation , Expressed Sequence Tags , Hierarchy, Social , Male , Molecular Sequence Data , Receptors, Odorant/genetics , Sequence Analysis, DNA , Vitellogenins/genetics
18.
Infect Genet Evol ; 10(2): 254-60, 2010 Mar.
Article in English | MEDLINE | ID: mdl-19931646

ABSTRACT

San Miguel Sea Lion Virus (SMSV) is a small RNA virus in the genus Vesivirus with an unusually broad host range. Three populations of SMSV were examined by PCR amplification of the capsid precursor and putative helicase genes, followed by pyrosequencing. The populations were nasal swabs from two SMSV infected California sea lions (Zalophus californianus) from two different years, and a virus isolate from the earlier swab that was passaged in cell culture five times. In the capsid precursor, extensive deletions were prevalent in the passaged virus but uncommon in the clinical samples. A greater prevalence of point mutations was seen in the capsid precursor gene than in the putative helicase gene. In culture, the minority sequence in the capsid precursor at nucleotide position 5826 rapidly shifted after five passages to become the majority sequence. Levels of diversity at individual sites showed much more similarity between the two clinical samples than between the earlier clinical sample and the passaged culture from the same sample. SMSV appears to behave as a quasispecies. Assessment of original patient samples is preferable for understanding clinical SMSV populations.


Subject(s)
Genome, Viral/genetics , Sea Lions/virology , Vesivirus/genetics , Viral Proteins/genetics , Animals , Caliciviridae Infections/veterinary , Caliciviridae Infections/virology , Capsid Proteins/genetics , DNA Mutational Analysis , Evolution, Molecular , Nasal Cavity/virology , Vesivirus/isolation & purification
19.
Plant Physiol ; 149(3): 1478-92, 2009 Mar.
Article in English | MEDLINE | ID: mdl-19109415

ABSTRACT

Plants uniquely contain large numbers of protein kinases, and for the vast majority of the 1,429 kinases predicted in the rice (Oryza sativa) genome, little is known of their functions. Genetic approaches often fail to produce observable phenotypes; thus, new strategies are needed to delineate kinase function. We previously developed a cost-effective high-throughput yeast two-hybrid system. Using this system, we have generated a protein interaction map of 116 representative rice kinases and 254 of their interacting proteins. Overall, the resulting interaction map supports a large number of known or predicted kinase-protein interactions from both plants and animals and reveals many new functional insights. Notably, we found a potential widespread role for E3 ubiquitin ligases in pathogen defense signaling mediated by receptor-like kinases, particularly by the kinases that may have evolved from recently expanded kinase subfamilies in rice. We anticipate that the data provided here will serve as a foundation for targeted functional studies in rice and other plants. The application of yeast two-hybrid and TAPtag analyses for large-scale plant protein interaction studies is also discussed.


Subject(s)
Oryza/enzymology , Plant Proteins/metabolism , Protein Interaction Mapping/methods , Protein Kinases/metabolism , Amino Acid Sequence , Casein Kinase II/metabolism , Cyclin-Dependent Kinases/metabolism , Mitogen-Activated Protein Kinases/metabolism , Models, Biological , Molecular Sequence Data , Phylogeny , Plant Proteins/chemistry , Protein Binding , Protein Kinases/chemistry , Transcription Factors , Two-Hybrid System Techniques , Ubiquitin-Protein Ligases/metabolism
20.
BMC Genomics ; 9: 312, 2008 Jun 30.
Article in English | MEDLINE | ID: mdl-18590545

ABSTRACT

BACKGROUND: Benefits from high-throughput sequencing using 454 pyrosequencing technology may be most apparent for species with high societal or economic value but few genomic resources. Rapid means of gene sequence and SNP discovery using this novel sequencing technology provide a set of baseline tools for genome-level research. However, it is questionable how effective the sequencing of large numbers of short reads for species with essentially no prior gene sequence information will support contig assemblies and sequence annotation. RESULTS: With the purpose of generating the first broad survey of gene sequences in Eucalyptus grandis, the most widely planted hardwood tree species, we used 454 technology to sequence and assemble 148 Mbp of expressed sequences (EST). EST sequences were generated from a normalized cDNA pool comprised of multiple tissues and genotypes, promoting discovery of homologues to almost half of Arabidopsis genes, and a comprehensive survey of allelic variation in the transcriptome. By aligning the sequencing reads from multiple genotypes we detected 23,742 SNPs, 83% of which were validated in a sample. Genome-wide nucleotide diversity was estimated for 2,392 contigs using a modified theta (theta) parameter, adapted for measuring genetic diversity from polymorphisms detected by randomly sequencing a multi-genotype cDNA pool. Diversity estimates in non-synonymous nucleotides were on average 4x smaller than in synonymous, suggesting purifying selection. Non-synonymous to synonymous substitutions (Ka/Ks) among 2,001 contigs averaged 0.30 and was skewed to the right, further supporting that most genes are under purifying selection. Comparison of these estimates among contigs identified major functional classes of genes under purifying and diversifying selection in agreement with previous researches. CONCLUSION: In providing an abundance of foundational transcript sequences where limited prior genomic information existed, this work created part of the foundation for the annotation of the E. grandis genome that is being sequenced by the US Department of Energy. In addition we demonstrated that SNPs sampled in large-scale with 454 pyrosequencing can be used to detect evolutionary signatures among genes, providing one of the first genome-wide assessments of nucleotide diversity and Ka/Ks for a non-model plant species.


Subject(s)
Eucalyptus/genetics , Expressed Sequence Tags , Genes, Plant , Polymorphism, Single Nucleotide/genetics , Sequence Analysis, DNA/methods , Alleles , DNA, Complementary/biosynthesis , Gene Library , Genetic Variation , Haplotypes , Reproducibility of Results , Software , Transcription, Genetic
SELECTION OF CITATIONS
SEARCH DETAIL
...