RESUMO
BACKGROUND: Despite the importance of osmoprotectants, no previous in silico evaluation of high throughput data is available for higher plants. The present approach aimed at the identification and annotation of osmoprotectant-related sequences applied to short transcripts from a soybean HT-SuperSAGE (High Throughput Super Serial Analysis of Gene Expression; 26-bp tags) database, and also its comparison with other transcriptomic and genomic data available from different sources. METHODS: A curated set of osmoprotectants related sequences was generated using text mining and selected seed sequences for identification of the respective transcripts and proteins in higher plants. To test the efficiency of the seed sequences, these were aligned against four HT-SuperSAGE contrasting libraries generated by our group using soybean tolerant and sensible plants against water deficit, considering only differentially expressed transcripts (p ≤ 0.05). Identified transcripts from soybean and their respective tags were aligned and anchored against the soybean virtual genome. RESULTS: The workflow applied resulted in a set including 1,996 seed sequences that allowed the identification of 36 differentially expressed genes related to the biosynthesis of osmoprotectants [Proline (P5CS: 4, P5CR: 2), Trehalose (TPS1: 9, TPPB: 1), Glycine betaine (BADH: 4) and Myo-inositol (MIPS: 7, INPS1: 8)], also mapped in silico in the soybean genome (25 loci). Another approach considered matches using Arabidopsis full length sequences as seed sequences, and allowed the identification of 124 osmoprotectant-related sequences, matching ~10.500 tags anchored in the soybean virtual chromosomes. Osmoprotectant-related genes appeared clustered in all soybean chromosomes, with higher density in some subterminal regions and synteny among some chromosome pairs. CONCLUSIONS: Soybean presents all searched osmoprotectant categories with some important members differentially expressed among the comparisons considered (drought tolerant or sensible vs. control; tolerant vs. sensible), allowing the identification of interesting candidates for biotechnological inferences. The identified tags aligned to corresponding genes that matched 19 soybean chromosomes. Osmoprotectant-related genes are not regularly distributed in the soybean genome, but clustered in some regions near the chromosome terminals, with some redundant clusters in different chromosomes indicating their involvement in previous duplication and rearrangements events. The seed sequences, transcripts and map represent the first transversal evaluation for osmoprotectant-related genes and may be easily applied to other plants of interest.
Assuntos
Glycine max/genética , Estresse Fisiológico/genética , Análise por Conglomerados , Etiquetas de Sequências Expressas , Perfilação da Expressão Gênica , Genes de Plantas , Genoma de Planta , Sequenciamento de Nucleotídeos em Larga Escala , Pressão Osmótica , Sementes/genética , Glycine max/enzimologia , SinteniaRESUMO
Heat shock (HS) leads to the activation of molecular mechanisms, known as HS-response, that prevent damage and enhance survival under stress. Plants have a flexible and specialized network of Heat Shock Factors (HSFs), which are transcription factors that induce the expression of heat shock proteins. The present work aimed to identify and characterize the Glycine max HSF repertory in the Soybean Genome Project (GENOSOJA platform), comparing them with other legumes (Medicago truncatula and Lotus japonicus) in view of current knowledge of Arabidopsis thaliana. The HSF characterization in leguminous plants led to the identification of 25, 19 and 21 candidate ESTs in soybean, Lotus and Medicago, respectively. A search in the SuperSAGE libraries revealed 68 tags distributed in seven HSF gene types. From the total number of obtained tags, more than 70% were related to root tissues (water deficit stress libraries vs. controls), indicating their role in abiotic stress responses, since the root is the first tissue to sense and respond to abiotic stress. Moreover, as heat stress is related to the pressure of dryness, a higher HSF expression was expected at the water deficit libraries. On the other hand, expressive HSF candidates were obtained from the library inoculated with Asian Soybean Rust, inferring crosstalk among genes associated with abiotic and biotic stresses. Evolutionary relationships among sequences were consistent with different HSF classes and subclasses. Expression profiling indicated that regulation of specific genes is associated with the stage of plant development and also with stimuli from other abiotic stresses pointing to the maintenance of HSF expression at a basal level in soybean, favoring its activation under heat-stress conditions.
RESUMO
Plants experience various environmental stresses, but tolerance to these adverse conditions is a very complex phenomenon. The present research aimed to evaluate a set of genes involved in osmotic response, comparing soybean and medicago with the well-described Arabidopsis thaliana model plant. Based on 103 Arabidopsis proteins from 27 categories of osmotic stress response, comparative analyses against Genosoja and Medicago truncatula databases allowed the identification of 1,088 soybean and 1,210 Medicago sequences. The analysis showed a high number of sequences and high diversity, comprising genes from all categories in both organisms. Genes with unknown function were among the most representative, followed by transcription factors, ion transport proteins, water channel, plant defense, protein degradation, cellular structure, organization & biogenesis and senescence. An analysis of sequences with unknown function allowed the annotation of 174 soybean and 217 Medicago sequences, most of them concerning transcription factors. However, for about 30% of the sequences no function could be attributed using in silico procedures. The establishment of a gene set involved in osmotic stress responses in soybean and barrel medic will help to better understand the survival mechanisms for this type of stress condition in legumes.
RESUMO
A major goal of plant genome research is to recognize genes responsible for important traits. Resistance genes are among the most important gene classes for plant breeding purposes being responsible for the specific immune response including pathogen recognition, and activation of plant defence mechanisms. These genes are quite abundant in higher plants, with 210 clusters found in Eucalyptus FOREST database presenting significant homology to known R-genes. All five gene classes of R-genes with their respective conserved domains are present and expressed in Eucalyptus. Most clusters identified (93) belong to the LRR-NBS-TIR (genes with three domains: Leucine-rich-repeat, Nucleotide-binding-site and Toll interleucine 1-receptor), followed by the serine-threonine-kinase class (49 clusters). Some new combinations of domains and motifs of R-genes may be present in Eucalyptus and could represent novel gene structures. Most alignments occurred with dicots (94.3%), with emphasis on Arabidopsis thaliana (Brassicaceae) sequences. All best alignments with monocots (5.2%) occurred with rice (Oryza sativa) sequences and a single cluster aligned with the gymnosperm Pinus sylvestris (0.5%). The results are discussed and compared with available data from other crops and may bring useful evidences for the understanding of defense mechanisms in Eucalyptus and other crop species