RESUMEN
BACKGROUND: Despite the importance of osmoprotectants, no previous in silico evaluation of high throughput data is available for higher plants. The present approach aimed at the identification and annotation of osmoprotectant-related sequences applied to short transcripts from a soybean HT-SuperSAGE (High Throughput Super Serial Analysis of Gene Expression; 26-bp tags) database, and also its comparison with other transcriptomic and genomic data available from different sources. METHODS: A curated set of osmoprotectants related sequences was generated using text mining and selected seed sequences for identification of the respective transcripts and proteins in higher plants. To test the efficiency of the seed sequences, these were aligned against four HT-SuperSAGE contrasting libraries generated by our group using soybean tolerant and sensible plants against water deficit, considering only differentially expressed transcripts (p ≤ 0.05). Identified transcripts from soybean and their respective tags were aligned and anchored against the soybean virtual genome. RESULTS: The workflow applied resulted in a set including 1,996 seed sequences that allowed the identification of 36 differentially expressed genes related to the biosynthesis of osmoprotectants [Proline (P5CS: 4, P5CR: 2), Trehalose (TPS1: 9, TPPB: 1), Glycine betaine (BADH: 4) and Myo-inositol (MIPS: 7, INPS1: 8)], also mapped in silico in the soybean genome (25 loci). Another approach considered matches using Arabidopsis full length sequences as seed sequences, and allowed the identification of 124 osmoprotectant-related sequences, matching ~10.500 tags anchored in the soybean virtual chromosomes. Osmoprotectant-related genes appeared clustered in all soybean chromosomes, with higher density in some subterminal regions and synteny among some chromosome pairs. CONCLUSIONS: Soybean presents all searched osmoprotectant categories with some important members differentially expressed among the comparisons considered (drought tolerant or sensible vs. control; tolerant vs. sensible), allowing the identification of interesting candidates for biotechnological inferences. The identified tags aligned to corresponding genes that matched 19 soybean chromosomes. Osmoprotectant-related genes are not regularly distributed in the soybean genome, but clustered in some regions near the chromosome terminals, with some redundant clusters in different chromosomes indicating their involvement in previous duplication and rearrangements events. The seed sequences, transcripts and map represent the first transversal evaluation for osmoprotectant-related genes and may be easily applied to other plants of interest.
Asunto(s)
Glycine max/genética , Estrés Fisiológico/genética , Análisis por Conglomerados , Etiquetas de Secuencia Expresada , Perfilación de la Expresión Génica , Genes de Plantas , Genoma de Planta , Secuenciación de Nucleótidos de Alto Rendimiento , Presión Osmótica , Semillas/genética , Glycine max/enzimología , SinteníaRESUMEN
The extent to which mammalian cells share similar transcriptomes remains unclear. Notwithstanding, such cross-species gene expression inquiries have been scarce for defined cell types and most lack the dissection of gene regulatory landscapes. Therefore, the work was aimed to determine C-MYC relative expression across mammalian fibroblasts (Ovis aries and Bos taurus) via cross-species RT-qPCR and comprehensively explore its regulatory landscape by in silico tools. The prediction of transcription factor binding sites in C-MYC and its 2.5 kb upstream sequence revealed substantial variation, thus indicating evolutionary-driven re-wiring of cis-regulatory elements. C-MYC and its downstream target TBX3 were up-regulated in Bos taurus fibroblasts. The relative expression of C-MYC regulators [RONIN (also known as THAP11), RXRß, and TCF3] and the C-MYC-associated transcript elongation factor CDK9 did not differ between species. Additional in silico analyses suggested Bos taurus-specific C-MYC exonization, alternative splicing, and binding sites for non-coding RNAs. C-MYC protein orthologs were highly conserved, while variation was in the transactivation domain and the leucine zipper motif. Altogether, mammalian fibroblasts display evolutionary-driven C-MYC relative expression that should be instructive for understanding cellular physiology, cellular reprogramming, and C-MYC-related diseases.
Asunto(s)
Bovinos/genética , Evolución Molecular , Genes myc , Oveja Doméstica/genética , Secuencia de Aminoácidos , Animales , Bovinos/metabolismo , Quinasa 9 Dependiente de la Ciclina/genética , Fibroblastos/metabolismo , Expresión Génica , Procesamiento Proteico-Postraduccional , Proteínas Proto-Oncogénicas c-myc/genética , Proteínas Proto-Oncogénicas c-myc/metabolismo , Elementos Reguladores de la Transcripción , Homología de Secuencia de Aminoácido , Oveja Doméstica/metabolismo , Especificidad de la Especie , Proteínas de Dominio T Box/genética , TranscriptomaRESUMEN
Quantitative reverse transcription PCR (RT-qPCR) remains as an accurate approach for gene expression analysis but requires labor-intensive validation of reference genes using species-specific primers. To ease such demand, the aim was to design and test a multi-species primer set to validate reference genes for inter-genus RT-qPCR gene expression analysis. Primers were designed for ten housekeeping genes using transcript sequences of various livestock species. All ten gene transcripts were detected by RT-PCR in Bos taurus (cattle), Bubalus bubalis (buffaloes), Capra hircus (goats), and Ovis aries (sheep) cDNA. Primer efficiency was attained for eight reference genes using B. taurus-O. aries fibroblast cDNA (95.54-98.39%). The RT-qPCR data normalization was carried out for B. taurus vs. O. aries relative gene expression using Bestkeeper, GeNorm, Norm-finder, Delta CT method, and RefFinder algorithms. Validation of inter-genus RT-qPCR showed up-regulation of TLR4 and ZFX gene transcripts in B. taurus fibroblasts, irrespectively of normalization conditions (two, three, or four reference genes). In silico search in mammalian transcriptomes showed that the multi-species primer set is expected to amplify transcripts of at least two distinct loci in 114 species, and 79 species would be covered by six or more primers. Hence, a multi-species primer set allows for inter-genus gene expression analysis between O. aries and B. taurus fibroblasts and further reveals species-specific gene transcript abundance of key transcription factors.