RESUMEN
The Plant Expression Omnibus (PEO) is a web application that provides biologists with access to gene expression insights across over 100 plant species, ~60 000 manually annotated RNA-seq samples, and more than 4 million genes. The tool allows users to explore the expression patterns of genes across different organs, identify organ-specific genes, and discover top co-expressed genes for any gene of interest. PEO also provides functional annotations for each gene, allowing for the identification of genetic modules and pathways. PEO is designed to facilitate comparative kingdom-wide gene expression analysis and provide a valuable resource for plant biology research. We provide two case studies to demonstrate the utility of PEO in identifying candidate genes in pollen coat biosynthesis in Arabidopsis and investigating the biosynthetic pathway components of capsaicin in Capsicum annuum. The database is freely available at https://expression.plant.tools/.
Asunto(s)
Arabidopsis , Perfilación de la Expresión Génica , Transcriptoma/genética , Plantas/genética , Plantas/metabolismo , Bases de Datos Factuales , RNA-Seq , Arabidopsis/genética , Regulación de la Expresión Génica de las Plantas/genéticaRESUMEN
Diurnal gene expression is a pervasive phenomenon occurring across all kingdoms of life, orchestrating adaptive responses to daily environmental fluctuations and thus enhancing organismal fitness. Our understanding of the plant circadian clock is primarily derived from studies in Arabidopsis and direct comparisons are difficult due to differences in gene family sizes. To this end, the identification of functional orthologs based on diurnal and tissue expression is necessary. The diurnal.plant.tools database constitutes a repository of gene expression profiles from 17 members of the Archaeplastida lineage, with built-in tools facilitating cross-species comparisons. In this database update, we expand the dataset with diurnal gene expression from 4 agriculturally significant crop species and Marchantia, a plant of evolutionary significance. Notably, the inclusion of diurnal gene expression data for Marchantia enables researchers to glean insights into the evolutionary trajectories of the circadian clock and other biological processes spanning from algae to angiosperms. Moreover, integrating diurnal gene expression data with datasets from related gene co-expression databases, such as CoNekt-Plants and CoNekt-Stress, which contain gene expression data for tissue and perturbation experiments, provides a comprehensive overview of gene functions across diverse biological contexts. This expanded database serves as a valuable resource for elucidating the intricacies of diurnal gene regulation and its evolutionary underpinnings in plant biology.
RESUMEN
BACKGROUND: As one of the world's most important beverage crops, tea plants (Camellia sinensis) are renowned for their unique flavors and numerous beneficial secondary metabolites, attracting researchers to investigate the formation of tea quality. With the increasing availability of transcriptome data on tea plants in public databases, conducting large-scale co-expression analyses has become feasible to meet the demand for functional characterization of tea plant genes. However, as the multidimensional noise increases, larger-scale co-expression analyses are not always effective. Analyzing a subset of samples generated by effectively downsampling and reorganizing the global sample set often leads to more accurate results in co-expression analysis. Meanwhile, global-based co-expression analyses are more likely to overlook condition-specific gene interactions, which may be more important and worthy of exploration and research. RESULTS: Here, we employed the k-means clustering method to organize and classify the global samples of tea plants, resulting in clustered samples. Metadata annotations were then performed on these clustered samples to determine the "conditions" represented by each cluster. Subsequently, we conducted gene co-expression network analysis (WGCNA) separately on the global samples and the clustered samples, resulting in global modules and cluster-specific modules. Comparative analyses of global modules and cluster-specific modules have demonstrated that cluster-specific modules exhibit higher accuracy in co-expression analysis. To measure the degree of condition specificity of genes within condition-specific clusters, we introduced the correlation difference value (CDV). By incorporating the CDV into co-expression analyses, we can assess the condition specificity of genes. This approach proved instrumental in identifying a series of high CDV transcription factor encoding genes upregulated during sustained cold treatment in Camellia sinensis leaves and buds, and pinpointing a pair of genes that participate in the antioxidant defense system of tea plants under sustained cold stress. CONCLUSIONS: To summarize, downsampling and reorganizing the sample set improved the accuracy of co-expression analysis. Cluster-specific modules were more accurate in capturing condition-specific gene interactions. The introduction of CDV allowed for the assessment of condition specificity in gene co-expression analyses. Using this approach, we identified a series of high CDV transcription factor encoding genes related to sustained cold stress in Camellia sinensis. This study highlights the importance of considering condition specificity in co-expression analysis and provides insights into the regulation of the cold stress in Camellia sinensis.
Asunto(s)
Camellia sinensis , Camellia sinensis/genética , Camellia sinensis/metabolismo , Análisis por Conglomerados , Genes de Plantas , Perfilación de la Expresión Génica/métodos , Minería de Datos/métodos , Transcriptoma , Regulación de la Expresión Génica de las Plantas , Redes Reguladoras de GenesRESUMEN
In an attempt to address the large inequities faced by the plant biology communities from the Global South (i.e. countries located around the tropics and the Southern Hemisphere) at international conferences, this Viewpoint is the reflexive thinking arising from the concurrent session titled 'Arabidopsis and its translational research in the Global South' organized at the International Conference of Arabidopsis Research 2023 (ICAR 2023) in Chiba, Japan in June 2023. Here, we highlight the main obstacles plant biology communities in the Global South face in terms of knowledge production, as measured by the unequal production and citation of publications, investigating and advancing local plant genomics and biodiversity, combating disparities in gender and diversity, and current initiatives to break isolation of scientists.
Asunto(s)
Investigación , Arabidopsis/genética , Arabidopsis/fisiología , Biodiversidad , Botánica , Plantas/genéticaRESUMEN
Despite the abundance of species with transcriptomic data, a significant number of species still lack sequenced genomes, making it difficult to study gene function and expression in these organisms. While de novo transcriptome assembly can be used to assemble protein-coding transcripts from RNA-sequencing (RNA-seq) data, the datasets used often only feature samples of arbitrarily selected or similar experimental conditions, which might fail to capture condition-specific transcripts. We developed the Large-Scale Transcriptome Assembly Pipeline for de novo assembled transcripts (LSTrAP-denovo) to automatically generate transcriptome atlases of eukaryotic species. Specifically, given an NCBI TaxID, LSTrAP-denovo can (1) filter undesirable RNA-seq accessions based on read data, (2) select RNA-seq accessions via unsupervised machine learning to construct a sample-balanced dataset for download, (3) assemble transcripts via over-assembly, (4) functionally annotate coding sequences (CDS) from assembled transcripts and (5) generate transcriptome atlases in the form of expression matrices for downstream transcriptomic analyses. LSTrAP-denovo is easy to implement, written in Python, and is freely available at https://github.com/pengkenlim/LSTrAP-denovo/.
Asunto(s)
Eucariontes , Transcriptoma , Transcriptoma/genética , Eucariontes/genética , Programas Informáticos , Perfilación de la Expresión Génica/métodos , Análisis de Secuencia de ARN/métodosRESUMEN
Tea, a globally popular beverage, contains various beneficial secondary metabolites. Tea plants (Camellia sinensis) exhibit diverse genetic traits across cultivars, impacting yield, adaptability, morphology, and secondary metabolite composition. Many tea cultivars have been the subject of much research interest, which have led to the accumulation of publicly available RNA-seq data. As such, it has become possible to systematically summarize the characteristics of different cultivars at the transcriptomic level, identify functional genes, and infer gene functions through co-expression analysis. Here, the transcriptomes of 9 tea cultivars were assembled, and comparative analysis was conducted on the coding sequences of 13 cultivars. To give access to this data, we present TeaNekT (https://teanekt.sbs.ntu.edu.sg/), a web resource that facilitates the prediction of gene functions of various tea cultivars. We used TeaNekT to perform a cross-cultivar comparison of co-expressed gene clusters and tissue-specific gene expression. We observed that 'Anji Baicha' possesses the highest number of cultivar-specific genes and the second-highest number of expanded genes. These genes in 'Anji Baicha' tend to be enriched in functions associated with cold stress response, chloroplast thylakoid structure, and nitrogen metabolism. Notably, we identified three significantly expanded homologous genes in 'Anji Baicha' encoding the ICE1, SIZ1, and MAPKK2, which are closely associated with the cold sensitivity of 'Anji Baicha'. Additionally, one significantly expanded homologous gene in 'Anji Baicha' encoding regulatory factor RIQ may play a crucial role in the abnormal chloroplast structure and absence of thylakoid membranes in 'Anji Baicha'.
Asunto(s)
Camellia sinensis , Regulación de la Expresión Génica de las Plantas , Transcriptoma , Camellia sinensis/genética , Transcriptoma/genética , Regulación de la Expresión Génica de las Plantas/genética , Proteínas de Plantas/genética , Proteínas de Plantas/metabolismo , Frío , Genes de Plantas/genética , Perfilación de la Expresión Génica , Respuesta al Choque por Frío/genética , Bases de Datos GenéticasRESUMEN
The importance of the alternative donation of electrons to the ubiquinol pool via the electron-transfer flavoprotein/electron-transfer flavoprotein:ubiquinone oxidoreductase (ETF/ETFQO) complex has been demonstrated. However, the functional significance of this pathway during seed development and germination remains to be elucidated. To assess the function of this pathway, we performed a detailed metabolic and transcriptomic analysis of Arabidopsis mutants to test the molecular consequences of a dysfunctional ETF/ETFQO pathway. We demonstrate that the disruption of this pathway compromises seed germination in the absence of an external carbon source and also impacts seed size and yield. Total protein and storage protein content is reduced in dry seeds, whilst sucrose levels remain invariant. Seeds of ETFQO and related mutants were also characterized by an altered fatty acid composition. During seed development, lower levels of fatty acids and proteins accumulated in the etfqo-1 mutant as well as in mutants in the alternative electron donors isovaleryl-CoA dehydrogenase (ivdh-1) and d-2-hydroxyglutarate dehydrogenase (d2hgdh1-2). Furthermore, the content of several amino acids was increased in etfqo-1 mutants during seed development, indicating that these mutants are not using such amino acids as alternative energy source for respiration. Transcriptome analysis revealed alterations in the expression levels of several genes involved in energy and hormonal metabolism. Our findings demonstrated that the alternative pathway of respiration mediated by the ETF/ETFQO complex affects seed germination and development by directly adjusting carbon storage during seed filling. These results indicate a role for the pathway in the normal plant life cycle to complement its previously defined roles in the response to abiotic stress.
Asunto(s)
Aminoácidos/metabolismo , Arabidopsis/genética , Carbono/metabolismo , Flavoproteínas Transportadoras de Electrones/metabolismo , Proteínas Hierro-Azufre/metabolismo , Oxidorreductasas actuantes sobre Donantes de Grupo CH-NH/metabolismo , Arabidopsis/enzimología , Arabidopsis/crecimiento & desarrollo , Proteínas de Arabidopsis/genética , Proteínas de Arabidopsis/metabolismo , Flavoproteínas Transportadoras de Electrones/genética , Germinación , Proteínas Hierro-Azufre/genética , Mutación , Oxidorreductasas actuantes sobre Donantes de Grupo CH-NH/genética , Semillas/enzimología , Semillas/genética , Semillas/crecimiento & desarrollo , Ubiquinona/análogos & derivados , Ubiquinona/metabolismoRESUMEN
Root hairs (RH) have become an important model system for studying plant growth and how plants modulate their growth in response to cell-intrinsic and environmental stimuli. Here, we will discuss recent advances in our understanding of the molecular mechanisms underlying the growth of Arabidopsis thaliana RH in the interface between responses to environmental cues (e.g. nutrients such as nitrates, phosphate and microorganism) and hormonal stimuli (e.g. auxin). RH growth is under the control of several transcription factors that are also under strong regulation at different levels. In this review we highlight recent new discoveries along these transcriptional pathways that may increase our capacity to enhance nutrient uptake by the roots in the context of abiotic stresses. We used text-mining capacities of the PlantConnectome database to generate the most updated view of RH growth in these complex biological contexts.
RESUMEN
Selaginella moellendorffii is a representative of the lycophyte lineage that is studied to understand the evolution of land plant traits such as the vasculature, leaves, stems, roots, and secondary metabolism. However, only a few studies have investigated the expression and transcriptional coordination of Selaginella genes, precluding us from understanding the evolution of the transcriptional programs behind these traits. We present a gene expression atlas comprising all major organs, tissue types, and the diurnal gene expression profiles for S. moellendorffii We show that the transcriptional gene module responsible for the biosynthesis of lignocellulose evolved in the ancestor of vascular plants and pinpoint the duplication and subfunctionalization events that generated multiple gene modules involved in the biosynthesis of various cell wall types. We demonstrate how secondary metabolism is transcriptionally coordinated and integrated with other cellular pathways. Finally, we identify root-specific genes and show that the evolution of roots did not coincide with an increased appearance of gene families, suggesting that the development of new organs does not coincide with increased fixation of new gene functions. Our updated database at conekt.plant.tools represents a valuable resource for studying the evolution of genes, gene families, transcriptomes, and functional gene modules in the Archaeplastida kingdom.
Asunto(s)
Evolución Biológica , Regulación de la Expresión Génica de las Plantas , Raíces de Plantas/genética , Haz Vascular de Plantas/genética , Metabolismo Secundario/genética , Selaginellaceae/genética , Vías Biosintéticas , Pared Celular/metabolismo , Celulosa/biosíntesis , Duplicación de Gen , Redes Reguladoras de Genes , Lignina/biosíntesis , Especificidad de Órganos , Filogenia , Transcriptoma/genéticaRESUMEN
Wood rot fungus Fulvifomes siamensis infects multiple urban tree species commonly planted in Singapore. A commercial e-nose (Cyranose 320) was used to differentiate some plant and fungi volatiles. The e-nose distinctly clustered the volatiles at 0.25 ppm, and this sensitivity was further increased to 0.05 ppm with the use of nitrogen gas to purge the system and set up the baseline. Nitrogen gas baseline resulted in a higher magnitude of sensor responses and a higher number of responsive sensors. The specificity of the e-nose for F. siamensis was demonstrated by distinctive clustering of its pure culture, fruiting bodies collected from different tree species, and in diseased tissues infected by F. siamensis with a 15-min incubation time. This good specificity was supported by the unique volatile profiles revealed by SPME GC-MS analysis, which also identified the signature volatile for F. siamensis-1,2,4,5-tetrachloro-3,6-dimethoxybenzene. In field conditions, the e-nose successfully identified F. siamensis fruiting bodies on different tree species. The findings of concentration-based clustering and host-tree-specific volatile profiles for fruiting bodies provide further insights into the complexity of volatile-based diagnosis that should be taken into consideration for future studies.
Asunto(s)
Nariz Electrónica , Compuestos Orgánicos Volátiles , Hongos , Cromatografía de Gases y Espectrometría de Masas/métodos , Nitrógeno , Odorantes/análisis , Microextracción en Fase Sólida/métodos , Árboles , Compuestos Orgánicos Volátiles/análisis , Madera/química , Panadizo InterdigitalRESUMEN
Plants accumulate a vast array of secondary metabolites, which constitute a natural resource for pharmaceuticals. Oldenlandia corymbosa belongs to the Rubiaceae family, and has been used in traditional medicine to treat different diseases, including cancer. However, the active metabolites of the plant, their biosynthetic pathway and mode of action in cancer are unknown. To fill these gaps, we exposed this plant to eight different stress conditions and combined different omics data capturing gene expression, metabolic profiles, and anti-cancer activity. Our results show that O. corymbosa extracts are active against breast cancer cell lines and that ursolic acid is responsible for this activity. Moreover, we assembled a high-quality genome and uncovered two genes involved in the biosynthesis of ursolic acid. Finally, we also revealed that ursolic acid causes mitotic catastrophe in cancer cells and identified three high-confidence protein binding targets by Cellular Thermal Shift Assay (CETSA) and reverse docking. Altogether, these results constitute a valuable resource to further characterize the biosynthesis of active metabolites in the Oldenlandia group, while the mode of action of ursolic acid will allow us to further develop this valuable compound.
Asunto(s)
Oldenlandia , Oldenlandia/química , Transcriptoma , Metabolómica , Genómica , Ácido UrsólicoRESUMEN
MOTIVATION: There are now more than two million RNA sequencing experiments for plants, animals, bacteria and fungi publicly available, allowing us to study gene expression within and across species and kingdoms. However, the tools allowing the download, quality control and annotation of this data for more than one species at a time are currently missing. RESULTS: To remedy this, we present the Large-Scale Transcriptomic Analysis Pipeline in Kingdom of Life (LSTrAP-Kingdom) pipeline, which we used to process 134 521 RNA-seq samples, achieving â¼12 000 processed samples per day. Our pipeline generated quality-controlled, annotated gene expression matrices that rival the manually curated gene expression data in identifying functionally related genes. AVAILABILITY AND IMPLEMENTATION: LSTrAP-Kingdom is available from: https://github.com/wirriamm/plants-pipeline and is fully implemented in Python and Bash. SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online.
Asunto(s)
Perfilación de la Expresión Génica , Programas Informáticos , Análisis de Secuencia de ARN , Bacterias , Expresión GénicaRESUMEN
Malaria is a tropical parasitic disease caused by the Plasmodium genus, which resulted in an estimated 219 million cases of malaria and 435 000 malaria-related deaths in 2017. Despite the availability of the Plasmodium falciparum genome since 2002, 74% of the genes remain uncharacterized. To remedy this paucity of functional information, we used transcriptomic data to build gene co-expression networks for two Plasmodium species (P. falciparum and P. berghei), and included genomic data of four other Plasmodium species, P. yoelii, P. knowlesi, P. vivax and P. cynomolgi, as well as two non-Plasmodium species from the Apicomplexa, Toxoplasma gondii and Theileria parva. The genomic and transcriptomic data were incorporated into the resulting database, malaria.tools, which is preloaded with tools that allow the identification and cross-species comparison of co-expressed gene neighbourhoods, clusters and life stage-specific expression, thus providing sophisticated tools to predict gene function. Moreover, we exemplify how the tools can be used to easily identify genes relevant for pathogenicity and various life stages of the malaria parasite. The database is freely available at www.malaria.tools.
Asunto(s)
Genoma de Protozoos , Genómica , Malaria/parasitología , Plasmodium/genética , Transcriptoma/genética , Animales , Apicomplexa/genética , Redes Reguladoras de Genes/genética , Humanos , Toxoplasma/genéticaRESUMEN
The avocado, Persea americana, is a fruit crop of immense importance to Mexican agriculture with an increasing demand worldwide. Avocado lies in the anciently diverged magnoliid clade of angiosperms, which has a controversial phylogenetic position relative to eudicots and monocots. We sequenced the nuclear genomes of the Mexican avocado race, P. americana var. drymifolia, and the most commercially popular hybrid cultivar, Hass, and anchored the latter to chromosomes using a genetic map. Resequencing of Guatemalan and West Indian varieties revealed that â¼39% of the Hass genome represents Guatemalan source regions introgressed into a Mexican race background. Some introgressed blocks are extremely large, consistent with the recent origin of the cultivar. The avocado lineage experienced 2 lineage-specific polyploidy events during its evolutionary history. Although gene-tree/species-tree phylogenomic results are inconclusive, syntenic ortholog distances to other species place avocado as sister to the enormous monocot and eudicot lineages combined. Duplicate genes descending from polyploidy augmented the transcription factor diversity of avocado, while tandem duplicates enhanced the secondary metabolism of the species. Phenylpropanoid biosynthesis, known to be elicited by Colletotrichum (anthracnose) pathogen infection in avocado, is one enriched function among tandems. Furthermore, transcriptome data show that tandem duplicates are significantly up- and down-regulated in response to anthracnose infection, whereas polyploid duplicates are not, supporting the general view that collections of tandem duplicates contribute evolutionarily recent "tuning knobs" in the genome adaptive landscapes of given species.
Asunto(s)
Colletotrichum/fisiología , ADN Intergénico , Introgresión Genética , Genoma de Planta , Interacciones Huésped-Patógeno/genética , Magnoliopsida , Persea , Filogenia , Enfermedades de las Plantas , Duplicación de Gen , Magnoliopsida/genética , Magnoliopsida/microbiología , Persea/genética , Persea/microbiología , Enfermedades de las Plantas/genética , Enfermedades de las Plantas/microbiologíaRESUMEN
BACKGROUND: With more than 300 million potentially infected people every year, and with the expanded habitat of mosquitoes due to climate change, Dengue virus (DENV) cannot be considered anymore only a tropical disease. The RNA secondary structure is a functional characteristic of RNA viruses, and together with the accumulated high-throughput sequencing data could provide general insights towards understanding virus biology. Here, we profiled the RNA secondary structure of > 7000 complete viral genomes from 11 different species focusing on viral hemorrhagic fevers, including DENV serotypes, EBOV, and YFV. RESULTS: In our work we demonstrated that the secondary structure and presence of protein-binding domains in the genomes can be used as intrinsic signature to further classify the viruses. With our predictive approach, we achieved high prediction scores of the secondary structure (AUC up to 0.85 with experimental data), and computed consensus secondary structure profiles using hundreds of in silico models. We observed that viruses show different structural patterns, where e.g., DENV-2 and Ebola virus tend to be less structured than the other viruses. Furthermore, we observed virus-specific correlations between secondary structure and the number of interaction sites with human proteins, reaching a correlation of 0.89 in the case of Zika virus. We also identified that helicases-encoding regions are more structured in several flaviviruses, while the regions encoding for the contact proteins exhibit virus-specific clusters in terms of RNA structure and potential protein-RNA interactions. We also used structural data to study the geographical distribution of DENV, finding a significant difference between DENV-3 from Asia and South-America, where the structure is also driving the clustering more than sequence identity, which could imply different evolutionary routes of this subtype. CONCLUSIONS: Our massive computational analysis provided novel results regarding the secondary structure and the interaction with human proteins, not only for DENV serotypes, but also for other flaviviruses and viral hemorrhagic fevers-associated viruses. We showed how the RNA secondary structure can be used to categorise viruses, and even to further classify them based on the interaction with proteins. We envision that these approaches can be used to further classify and characterise these complex viruses.
Asunto(s)
Virus del Dengue , Dengue , Fiebres Hemorrágicas Virales , Infección por el Virus Zika , Virus Zika , Animales , Asia , Virus del Dengue/genética , Humanos , Serogrupo , América del SurRESUMEN
Flowering plants (angiosperms) are characterized by pollen tubes (PTs; male gametophytes) carrying two immobile sperm cells that grow over long distances through the carpel toward the ovules, where double fertilization is executed. It is not understood how these reproductive structures evolved, which genes occur de novo in male gametophytes of angiosperms, and to which extent PT functions are conserved among angiosperms. To contribute to a deeper understanding of the evolution of gametophyte functions, we generated RNA sequencing data from seven reproductive and two vegetative control tissues of the basal angiosperm Amborella trichopoda and complemented these with proteomic data of pollen grains (PGs) and PTs. The eudicot model plant Arabidopsis (Arabidopsis thaliana) served as a reference organism for data analysis, as more than 200 genes have been associated with male gametophyte functions in this species. We describe methods to collect bicellular A. trichopoda PGs, to induce their germination in vitro, and to monitor PT growth and germ cell division. Transcriptomic and proteomic analyses indicate that A. trichopoda PGs are prepared for germination requiring lipids, energy, but likely also reactive oxygen species, while PTs are especially characterized by catabolic/biosynthetic and transport processes including cell wall biosynthesis and gene regulation. Notably, a number of pollen-specific genes were lacking in Arabidopsis, and the number of genes involved in pollen signaling is significantly reduced in A. trichopoda In conclusion, we provide insight into male gametophyte functions of the most basal angiosperm and establish a valuable resource for future studies on the evolution of flowering plants.
Asunto(s)
Arabidopsis/crecimiento & desarrollo , Arabidopsis/genética , Germinación/genética , Magnoliopsida/crecimiento & desarrollo , Magnoliopsida/genética , Polen/crecimiento & desarrollo , Polen/genética , Evolución Biológica , Regulación de la Expresión Génica de las Plantas , Genes de Plantas , Germinación/fisiología , Tubo Polínico/genética , Tubo Polínico/crecimiento & desarrollo , Proteómica , TranscriptomaRESUMEN
BACKGROUND: Bacterial resistance to antibiotics is a growing health problem that is projected to cause more deaths than cancer by 2050. Consequently, novel antibiotics are urgently needed. Since more than half of the available antibiotics target the structurally conserved bacterial ribosomes, factors involved in protein synthesis are thus prime targets for the development of novel antibiotics. However, experimental identification of these potential antibiotic target proteins can be labor-intensive and challenging, as these proteins are likely to be poorly characterized and specific to few bacteria. Here, we use a bioinformatics approach to identify novel components of protein synthesis. RESULTS: In order to identify these novel proteins, we established a Large-Scale Transcriptomic Analysis Pipeline in Crowd (LSTrAP-Crowd), where 285 individuals processed 26 terabytes of RNA-sequencing data of the 17 most notorious bacterial pathogens. In total, the crowd processed 26,269 RNA-seq experiments and used the data to construct gene co-expression networks, which were used to identify more than a hundred uncharacterized genes that were transcriptionally associated with protein synthesis. We provide the identity of these genes together with the processed gene expression data. CONCLUSIONS: We identified genes related to protein synthesis in common bacterial pathogens and thus provide a resource of potential antibiotic development targets for experimental validation. The data can be used to explore additional vulnerabilities of bacteria, while our approach demonstrates how the processing of gene expression data can be easily crowd-sourced.
Asunto(s)
Bacterias/genética , Biología Computacional/métodos , Colaboración de las Masas , Perfilación de la Expresión Génica/métodos , Expresión Génica , Ribosomas/química , Análisis de Secuencia de ARN/métodos , Redes Reguladoras de GenesRESUMEN
Almost all organisms coordinate some aspects of their biology through the diurnal cycle. Photosynthetic organisms, and plants especially, have established complex programs that coordinate physiological, metabolic and developmental processes with the changing light. The diurnal regulation of the underlying transcriptional processes is observed when groups of functionally related genes (gene modules) are expressed at a specific time of the day. However, studying the diurnal regulation of these gene modules in the plant kingdom was hampered by the large amount of data required for the analyses. To meet this need, we used gene expression data from 17 diurnal studies spanning the whole Archaeplastida kingdom (Plantae kingdom in the broad sense) to make an online diurnal database. We have equipped the database with tools that allow user-friendly cross-species comparisons of gene expression profiles, entire co-expression networks, co-expressed clusters (involved in specific biological processes), time-specific gene expression and others. We exemplify how these tools can be used by studying three important biological questions: (i) the evolution of cell division, (ii) the diurnal control of gene modules in algae and (iii) the conservation of diurnally controlled modules across species. The database is freely available at http://diurnal.plant.tools.
Asunto(s)
Regulación de la Expresión Génica de las Plantas , Redes Reguladoras de Genes , Plantas/genética , Plantas/metabolismo , Transcriptoma , División Celular/genética , Chlamydomonas reinhardtii/genética , ADN Polimerasa Dirigida por ADN , Bases de Datos Factuales , Perfilación de la Expresión Génica , Redes Reguladoras de Genes/genética , Anotación de Secuencia Molecular , Fotosíntesis/genéticaRESUMEN
The glaucophyte Cyanophora paradoxa represents the most basal member of the kingdom Archaeplastida, but the function and expression of most of its genes are unknown. This information is needed to uncover how functional gene modules, that is groups of genes performing a given function, evolved in the plant kingdom. We have generated a gene expression atlas capturing responses of Cyanophora to various abiotic stresses. The data were included in the CoNekT-Plants database, enabling comparative transcriptomic analyses across two algae and six land plants. We demonstrate how the database can be used to study gene expression, co-expression networks and gene function in Cyanophora, and how conserved transcriptional programs can be identified. We identified gene modules involved in phycobilisome biosynthesis, response to high light and cell division. While we observed no correlation between the number of differentially expressed genes and the impact on growth of Cyanophora, we found that the response to stress involves a conserved, kingdom-wide transcriptional reprogramming, which is activated upon most stresses in algae and land plants. The Cyanophora stress gene expression atlas and the tools found in the https://conekt.plant.tools/ database thus provide a useful resource to reveal functionally related genes and stress responses in the plant kingdom.
Asunto(s)
Cyanophora/metabolismo , Magnoliopsida/fisiología , Proliferación Celular , Cyanophora/genética , Bases de Datos Genéticas , Regulación hacia Abajo , Regulación de la Expresión Génica de las Plantas , Redes Reguladoras de Genes , Luz , ARN de Planta/genética , Análisis de Secuencia de ARN , Estrés Fisiológico , Temperatura , Transcriptoma , Regulación hacia ArribaRESUMEN
Thiamin pyrophosphate (TPP) is the active form of vitamin B1 and works as an essential cofactor for enzymes in key metabolic pathways, such as the tricarboxylic acid (TCA) cycle and the pentose phosphate pathway. Although its action as a coenzyme has been well documented, the roles of TPP in plant metabolism are still not fully understood. Here, we investigated the functions of TPP in the regulation of the metabolic networks during photoperiod transition using previously described Arabidopsis (Arabidopsis thaliana) riboswitch mutant plants, which accumulate thiamin vitamers. The results show that photosynthetic and metabolic phenotypes of TPP riboswitch mutants are photoperiod dependent. Additionally, the mutants are more distinct from control plants when plants are transferred from a short-day to a long-day photoperiod, suggesting that TPP also plays a role in metabolic acclimation to the photoperiod. Control plants showed changes in the amplitude of diurnal oscillation in the levels of metabolites, including glycine, maltose, and fumarate, following the photoperiod transition. Interestingly, many of these changes are not present in TPP riboswitch mutant plants, demonstrating their lack of metabolic flexibility. Our results also indicate a close relationship between photorespiration and the TCA cycle, as TPP riboswitch mutants accumulate less photorespiratory intermediates. This study shows the potential role of vitamin B1 in the diurnal regulation of central carbon metabolism in plants and the importance of maintaining appropriate cellular levels of thiamin vitamers for the plant's metabolic flexibility and ability to acclimate to an altered photoperiod.