Your browser doesn't support javascript.
loading
Mostrar: 20 | 50 | 100
Resultados 1 - 20 de 49
Filtrar
Mais filtros

Base de dados
País/Região como assunto
Tipo de documento
Intervalo de ano de publicação
1.
Development ; 149(18)2022 09 15.
Artigo em Inglês | MEDLINE | ID: mdl-35950926

RESUMO

The morphology of the flowering plant is established during early embryogenesis. In recent years, many studies have focused on transcriptional profiling in plant embryogenesis, but the dynamic landscape of the Arabidopsis thaliana proteome remains elusive. In this study, Arabidopsis embryos at 2/4-cell, 8-cell, 16-cell, 32-cell, globular and heart stages were collected for nanoproteomic analysis. In total, 5386 proteins were identified. Of these, 1051 proteins were universally identified in all developmental stages and a range of 27 to 2154 proteins was found to be stage specific. These proteins could be grouped into eight clusters according to their expression levels. Gene Ontology enrichment analysis showed that genes involved in ribosome biogenesis and auxin-activated signalling were enriched during early embryogenesis, indicating that active translation and auxin signalling are important events in Arabidopsis embryo development. Combining RNA-sequencing data with the proteomics analysis, the correlation between mRNA and protein was evaluated. An overall positive correlation was found between mRNA and protein. This work provides a comprehensive landscape of the Arabidopsis proteome in early embryogenesis. Some important proteins/transcription factors identified through network analysis may serve as potential targets for future investigation.


Assuntos
Proteínas de Arabidopsis , Arabidopsis , Arabidopsis/metabolismo , Proteínas de Arabidopsis/genética , Proteínas de Arabidopsis/metabolismo , Desenvolvimento Embrionário , Regulação da Expressão Gênica de Plantas , Ácidos Indolacéticos/metabolismo , Proteoma/metabolismo , RNA/metabolismo , RNA Mensageiro/metabolismo , Fatores de Transcrição/metabolismo
2.
Int J Mol Sci ; 24(4)2023 Feb 16.
Artigo em Inglês | MEDLINE | ID: mdl-36835386

RESUMO

With climate change and labor shortages, direct-seeding rice cultivation is becoming popular worldwide, especially in Asia. Salinity stress negatively affects rice seed germination in the direct-seeding process, and the cultivation of suitable direct-seeding rice varieties under salinity stress is necessary. However, little is known about the underlying mechanism of salt responses during seed germination under salt stress. To investigate the salt tolerance mechanism at the seed germination stage, two contrasting rice genotypes differing in salt tolerance, namely, FL478 (salt-tolerant) and IR29 (salt-sensitive), were used in this study. We observed, that compared to IR29, FL478 appeared to be more tolerant to salt stress with a higher germination rate. GD1 (germination defective 1), which was involved in seed germination by regulating alpha-amylase, was upregulated significantly in the salt-sensitive IR29 strain under salt stress during germination. Transcriptomic data showed that salt-responsive genes tended to be up/downregulated in IR29 but not in FL478. Furthermore, we investigated the epigenetic changes in FL478 and IR29 during germination under saline treatment using whole genome bisulfite DNA sequencing (BS-seq) technology. BS-seq data showed that the global CHH methylation level increased dramatically under salinity stress in both strains, and the hyper CHH differentially methylated regions (DMRs) were predominantly located within the transposable elements regions. Compared with FL478, differentially expressed genes with DMRs in IR29 were mainly related to gene ontology terms such as response to water deprivation, response to salt stress, seed germination, and response to hydrogen peroxide pathways. These results may provide valuable insights into the genetic and epigenetic basis of salt tolerance at the seed germination stage, which is important for direct-seeding rice breeding.


Assuntos
Oryza , Transcriptoma , Oryza/genética , Epigenoma , Germinação , Melhoramento Vegetal , Estresse Salino , Genótipo
3.
Int J Mol Sci ; 24(18)2023 Sep 15.
Artigo em Inglês | MEDLINE | ID: mdl-37762466

RESUMO

In flowering plants, C4 photosynthesis is superior to C3 type in carbon fixation efficiency and adaptation to extreme environmental conditions, but the mechanisms behind the assembly of C4 machinery remain elusive. This study attempts to dissect the evolutionary divergence from C3 to C4 photosynthesis in five photosynthetic model plants from the grass family, using a combined comparative transcriptomics and deep learning technology. By examining and comparing gene expression levels in bundle sheath and mesophyll cells of five model plants, we identified 16 differentially expressed signature genes showing cell-specific expression patterns in C3 and C4 plants. Among them, two showed distinctively opposite cell-specific expression patterns in C3 vs. C4 plants (named as FOGs). The in silico physicochemical analysis of the two FOGs illustrated that C3 homologous proteins of LHCA6 had low and stable pI values of ~6, while the pI values of LHCA6 homologs increased drastically in C4 plants Setaria viridis (7), Zea mays (8), and Sorghum bicolor (over 9), suggesting this protein may have different functions in C3 and C4 plants. Interestingly, based on pairwise protein sequence/structure similarities between each homologous FOG protein, one FOG PGRL1A showed local inconsistency between sequence similarity and structure similarity. To find more examples of the evolutionary characteristics of FOG proteins, we investigated the protein sequence/structure similarities of other FOGs (transcription factors) and found that FOG proteins have diversified incompatibility between sequence and structure similarities during grass family evolution. This raised an interesting question as to whether the sequence similarity is related to structure similarity during C4 photosynthesis evolution.


Assuntos
Magnoliopsida , Setaria (Planta) , Sorghum , Zea mays/genética , Fotossíntese/genética
4.
Int J Mol Sci ; 23(20)2022 Oct 21.
Artigo em Inglês | MEDLINE | ID: mdl-36293547

RESUMO

Proteins are modular functionalities regulating multiple cellular activities in prokaryotes and eukaryotes. As a consequence of higher plants adapting to arid and thermal conditions, C4 photosynthesis is the carbon fixation process involving multi-enzymes working in a coordinated fashion. However, how these enzymes interact with each other and whether they co-evolve in parallel to maintain interactions in different plants remain elusive to date. Here, we report our findings on the global protein co-evolution relationship and local dynamics of co-varying site shifts in key C4 photosynthetic enzymes. We found that in most of the selected key C4 photosynthetic enzymes, global pairwise co-evolution events exist to form functional couplings. Besides, protein-protein interactions between these enzymes may suggest their unknown functionalities in the carbon delivery process. For PEPC and PPCK regulation pairs, pocket formation at the interactive interface are not necessary for their function. This feature is distinct from another well-known regulation pair in C4 photosynthesis, namely, PPDK and PPDK-RP, where the pockets are necessary. Our findings facilitate the discovery of novel protein regulation types and contribute to expanding our knowledge about C4 photosynthesis.


Assuntos
Carbono , Fotossíntese , Carbono/metabolismo , Fotossíntese/fisiologia , Plantas/metabolismo , Ciclo do Carbono
5.
Molecules ; 26(4)2021 Feb 23.
Artigo em Inglês | MEDLINE | ID: mdl-33672342

RESUMO

Glandular trichome (GT) is the dominant site for artemisinin production in Artemisia annua. Several critical genes involved in artemisinin biosynthesis are specifically expressed in GT. However, the molecular mechanism of differential gene expression between GT and other tissue types remains elusive. Chromatin accessibility, defined as the degree to which nuclear molecules are able to interact with chromatin DNA, reflects gene expression capacity to a certain extent. Here, we investigated and compared the landscape of chromatin accessibility in Artemisia annua leaf and GT using the Assay for Transposase-Accessible Chromatin using sequencing (ATAC-seq) technique. We identified 5413 GT high accessible and 4045 GT low accessible regions, and these GT high accessible regions may contribute to GT-specific biological functions. Several GT-specific artemisinin biosynthetic genes, such as DBR2 and CYP71AV1, showed higher accessible regions in GT compared to that in leaf, implying that they might be regulated by chromatin accessibility. In addition, transcription factor binding motifs for MYB, bZIP, C2H2, and AP2 were overrepresented in the highly accessible chromatin regions associated with artemisinin biosynthetic genes in glandular trichomes. Finally, we proposed a working model illustrating the chromatin accessibility dynamics in regulating artemisinin biosynthetic gene expression. This work provided new insights into epigenetic regulation of gene expression in GT.


Assuntos
Artemisia annua/metabolismo , Artemisininas/metabolismo , Cromatina/metabolismo , Cromatina/genética , Folhas de Planta/metabolismo
6.
Int J Mol Sci ; 21(17)2020 Aug 24.
Artigo em Inglês | MEDLINE | ID: mdl-32846981

RESUMO

Long noncoding RNA (lncRNA)/microRNA(miRNA)/mRNA triplets contribute to cancer biology. However, identifying significative triplets remains a major challenge for cancer research. The dynamic changes among factors of the triplets have been less understood. Here, by integrating target information and expression datasets, we proposed a novel computational framework to identify the triplets termed as "lncRNA-perturbated triplets". We applied the framework to five cancer datasets in The Cancer Genome Atlas (TCGA) project and identified 109 triplets. We showed that the paired miRNAs and mRNAs were widely perturbated by lncRNAs in different cancer types. LncRNA perturbators and lncRNA-perturbated mRNAs showed significantly higher evolutionary conservation than other lncRNAs and mRNAs. Importantly, the lncRNA-perturbated triplets exhibited high cancer specificity. The pan-cancer perturbator OIP5-AS1 had higher expression level than that of the cancer-specific perturbators. These lncRNA perturbators were significantly enriched in known cancer-related pathways. Furthermore, among the 25 lncRNA in the 109 triplets, lncRNA SNHG7 was identified as a stable potential biomarker in lung adenocarcinoma (LUAD) by combining the TCGA dataset and two independent GEO datasets. Results from cell transfection also indicated that overexpression of lncRNA SNHG7 and TUG1 enhanced the expression of the corresponding mRNA PNMA2 and CDC7 in LUAD. Our study provides a systematic dissection of lncRNA-perturbated triplets and facilitates our understanding of the molecular roles of lncRNAs in cancers.


Assuntos
MicroRNAs/genética , Neoplasias/genética , RNA Longo não Codificante/genética , RNA Mensageiro/genética , Transcriptoma , Células A549 , Adenocarcinoma/genética , Adenocarcinoma/patologia , Biomarcadores Tumorais/genética , Conjuntos de Dados como Assunto , Perfilação da Expressão Gênica , Regulação Neoplásica da Expressão Gênica , Redes Reguladoras de Genes , Genômica/métodos , Humanos , Neoplasias Pulmonares/genética , Neoplasias Pulmonares/patologia , Neoplasias/patologia , Prognóstico , Repetições de Trinucleotídeos/genética
7.
Plant Cell Environ ; 42(4): 1302-1317, 2019 04.
Artigo em Inglês | MEDLINE | ID: mdl-30474863

RESUMO

Light is essential for the plant establishment. Arabidopsis seedlings germinated in the dark cannot grow leaf and only have closed cotyledons. However, exogenous application of H2 O2 can induce leaves (establishment) in the dark. Comparative transcriptomic analysis revealed that light-responsive genes were activated by H2 O2 treatment. These genes are functionally correlated with photosynthesis, photorespiration, and components of photosystem, such as antenna proteins and light-harvesting chlorophyll proteins. We further found that application of H2 O2 facilitates cell cycle by accelerating G2 -M checkpoint transition in shoot apical meristem. Phytochrome-mediated light signalling pathway was also involved in the H2 O2 -facilitated establishment process. The constitutive photomorphogenesis 1 and phytochrome interacting factor 3 proteins were shown to be down-regulated by H2 O2 treatment and accordingly removed their inhibitory effects on photomorphogenesis in the dark. The crosstalk between oxidation and light signal pathways explains the mechanism that H2 O2 regulates plant dark establishment. The endogenous photorespiratory H2 O2 production was mimicked by overexpression of glycolate oxidase genes and supplement of substrate glycolate. As expected, seedling establishment was also induced by the endogenously produced H2 O2 under dark condition. These findings also suggest that photorespiratory H2 O2 production is at least partially involved in postgermination establishment.


Assuntos
Arabidopsis/efeitos da radiação , Peróxido de Hidrogênio/farmacologia , Plântula/efeitos da radiação , Transdução de Sinais/efeitos da radiação , Oxirredutases do Álcool/metabolismo , Arabidopsis/efeitos dos fármacos , Arabidopsis/crescimento & desenvolvimento , Proteínas de Arabidopsis/metabolismo , Ciclo Celular/efeitos dos fármacos , Ciclo Celular/efeitos da radiação , Escuridão , Citometria de Fluxo , Luz , Microscopia Confocal , Microscopia Eletrônica de Varredura , Plantas Geneticamente Modificadas , Reação em Cadeia da Polimerase em Tempo Real , Plântula/efeitos dos fármacos , Plântula/crescimento & desenvolvimento , Transdução de Sinais/efeitos dos fármacos , Transdução de Sinais/fisiologia
8.
BMC Cancer ; 19(1): 263, 2019 Mar 22.
Artigo em Inglês | MEDLINE | ID: mdl-30902072

RESUMO

BACKGROUND: Lung adenocarcinoma is the most common type of lung cancers. Whole-genome sequencing studies disclosed the genomic landscape of lung adenocarcinomas. however, it remains unclear if the genetic alternations could guide prognosis prediction. Effective genetic markers and their based prediction models are also at a lack for prognosis evaluation. METHODS: We obtained the somatic mutation data and clinical data for 371 lung adenocarcinoma cases from The Cancer Genome Atlas. The cases were classified into two prognostic groups (3-year survival), and a comparison was performed between the groups for the somatic mutation frequencies of genes, followed by development of computational models to discrete the different prognosis. RESULTS: Genes were found with higher mutation rates in good (≥ 3-year survival) than in poor (< 3-year survival) prognosis group of lung adenocarcinoma patients. Genes participating in cell-cell adhesion and motility were significantly enriched in the top gene list with mutation rate difference between the good and poor prognosis group. Support Vector Machine models with the gene somatic mutation features could well predict prognosis, and the performance improved as feature size increased. An 85-gene model reached an average cross-validated accuracy of 81% and an Area Under the Curve (AUC) of 0.896 for the Receiver Operating Characteristic (ROC) curves. The model also exhibited good inter-stage prognosis prediction performance, with an average AUC of 0.846 for the ROC curves. CONCLUSION: The prognosis of lung adenocarcinomas is related with somatic gene mutations. The genetic markers could be used for prognosis prediction and furthermore provide guidance for personal medicine.


Assuntos
Adenocarcinoma de Pulmão/mortalidade , Biomarcadores Tumorais/genética , Neoplasias Pulmonares/mortalidade , Modelos Biológicos , Máquina de Vetores de Suporte , Adenocarcinoma de Pulmão/genética , Adenocarcinoma de Pulmão/patologia , Adenocarcinoma de Pulmão/terapia , Biologia Computacional , Conjuntos de Dados como Assunto , Estudos de Viabilidade , Genômica/métodos , Humanos , Neoplasias Pulmonares/genética , Neoplasias Pulmonares/patologia , Neoplasias Pulmonares/terapia , Mutação , Medicina de Precisão/métodos , Prognóstico , Curva ROC , Análise de Sobrevida , Taxa de Sobrevida
9.
Mol Cell Proteomics ; 16(10): 1815-1828, 2017 10.
Artigo em Inglês | MEDLINE | ID: mdl-28827280

RESUMO

Protein cysteinyl residues are the mediators of hydrogen peroxide (H2O2)-dependent redox signaling. However, site-specific mapping of the selectivity and dynamics of these redox reactions in cells poses a major analytical challenge. Here we describe a chemoproteomic platform to systematically and quantitatively analyze the reactivity of thousands of cysteines toward H2O2 in human cells. We identified >900 H2O2-sensitive cysteines, which are defined as the H2O2-dependent redoxome. Although redox sites associated with antioxidative and metabolic functions are consistent, most of the H2O2-dependent redoxome varies dramatically between different cells. Structural analyses reveal that H2O2-sensitive cysteines are less conserved than their redox-insensitive counterparts and display distinct sequence motifs, structural features, and potential for crosstalk with lysine modifications. Notably, our chemoproteomic platform also provides an opportunity to predict oxidation-triggered protein conformational changes. The data are freely accessible as a resource at http://redox.ncpsb.org/OXID/.


Assuntos
Cisteína/química , Peróxido de Hidrogênio/química , Proteoma/análise , Proteômica/métodos , Motivos de Aminoácidos , Linhagem Celular Tumoral , Simulação por Computador , Cisteína/análise , Células HEK293 , Células Hep G2 , Humanos , Peróxido de Hidrogênio/análise , Lisina/análise , Lisina/química , Oxirredução , Conformação Proteica , Proteoma/química
10.
Molecules ; 24(13)2019 Jun 30.
Artigo em Inglês | MEDLINE | ID: mdl-31262005

RESUMO

Machine learning plays an important role in ligand-based virtual screening. However, conventional machine learning approaches tend to be inefficient when dealing with such problems where the data are imbalanced and features describing the chemical characteristic of ligands are high-dimensional. We here describe a machine learning algorithm LBS (local beta screening) for ligand-based virtual screening. The unique characteristic of LBS is that it quantifies the generalization ability of screening directly by a refined loss function, and thus can assess the risk of over-fitting accurately and efficiently for imbalanced and high-dimensional data in ligand-based virtual screening without the help of resampling methods such as cross validation. The robustness of LBS was demonstrated by a simulation study and tests on real datasets, in which LBS outperformed conventional algorithms in terms of screening accuracy and model interpretation. LBS was then used for screening potential activators of HIV-1 integrase multimerization in an independent compound library, and the virtual screening result was experimentally validated. Of the 25 compounds tested, six were proved to be active. The most potent compound in experimental validation showed an EC50 value of 0.71 µM.


Assuntos
Integrase de HIV/química , HIV-1/enzimologia , Aprendizado de Máquina , Modelos Químicos , Simulação por Computador , Ligantes
11.
Entropy (Basel) ; 21(8)2019 Aug 07.
Artigo em Inglês | MEDLINE | ID: mdl-33267482

RESUMO

Analysis of high-dimensional data is a challenge in machine learning and data mining. Feature selection plays an important role in dealing with high-dimensional data for improvement of predictive accuracy, as well as better interpretation of the data. Frequently used evaluation functions for feature selection include resampling methods such as cross-validation, which show an advantage in predictive accuracy. However, these conventional methods are not only computationally expensive, but also tend to be over-optimistic. We propose a novel cross-entropy which is based on beta distribution for feature selection. In beta distribution-based cross-entropy (BetaDCE) for feature selection, the probability density is estimated by beta distribution and the cross-entropy is computed by the expected value of beta distribution, so that the generalization ability can be estimated more precisely than conventional methods where the probability density is learnt from data. Analysis of the generalization ability of BetaDCE revealed that it was a trade-off between bias and variance. The robustness of BetaDCE was demonstrated by experiments on three types of data. In the exclusive or-like (XOR-like) dataset, the false discovery rate of BetaDCE was significantly smaller than that of other methods. For the leukemia dataset, the area under the curve (AUC) of BetaDCE on the test set was 0.93 with only four selected features, which indicated that BetaDCE not only detected the irrelevant and redundant features precisely, but also more accurately predicted the class labels with a smaller number of features than the original method, whose AUC was 0.83 with 50 features. In the metabonomic dataset, the overall AUC of prediction with features selected by BetaDCE was significantly larger than that by the original reported method. Therefore, BetaDCE can be used as a general and efficient framework for feature selection.

12.
Bioinformatics ; 33(20): 3166-3172, 2017 Oct 15.
Artigo em Inglês | MEDLINE | ID: mdl-28633445

RESUMO

MOTIVATION: Although high-throughput sequencing methods have been proposed to identify splicing branch points in the human genome, these methods can only detect a small fraction of the branch points subject to the sequencing depth, experimental cost and the expression level of the mRNA. An accurate computational model for branch point prediction is therefore an ongoing objective in human genome research. RESULTS: We here propose a novel branch point prediction algorithm that utilizes information on the branch point sequence and the polypyrimidine tract. Using experimentally validated data, we demonstrate that our proposed method outperforms existing methods. Availability and implementation: https://github.com/zhqingit/BPP. CONTACT: djguo@cuhk.edu.hk. SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online.


Assuntos
Biologia Computacional/métodos , Sequenciamento de Nucleotídeos em Larga Escala/métodos , Splicing de RNA , Análise de Sequência de RNA/métodos , Software , Algoritmos , Genoma Humano , Humanos
13.
BMC Bioinformatics ; 18(1): 459, 2017 Oct 24.
Artigo em Inglês | MEDLINE | ID: mdl-29065858

RESUMO

BACKGROUND: Pre-mRNA splicing is the removal of introns from precursor mRNAs (pre-mRNAs) and the concurrent ligation of the flanking exons to generate mature mRNA. This process is catalyzed by the spliceosome, where the splicing factor 1 (SF1) specifically recognizes the seven-nucleotide branch point sequence (BPS) and the U2 snRNP later displaces the SF1 and binds to the BPS. In mammals, the degeneracy of BPS motifs together with the lack of a large set of experimentally verified BPSs complicates the task of BPS prediction in silico. RESULTS: In this paper, we develop a simple and yet efficient heuristic model for human BPS prediction based on a novel scoring scheme, which quantifies the splicing strength of putative BPSs. The candidate BPS is restricted exclusively within a defined BPS search region to avoid the influences of other elements in the intron and therefore the prediction accuracy is improved. Moreover, using two types of relative frequencies for human BPS prediction, we demonstrate our model outperformed other current implementations on experimentally verified human introns. CONCLUSION: We propose that the binding energy contributes to the molecular recognition involved in human pre-mRNA splicing. In addition, a genome-wide human BPS prediction is carried out. The characteristics of predicted BPSs are in accordance with experimentally verified human BPSs, and branch site positions relative to the 3'ss and the 5'end of the shortened AGEZ are consistent with the results of published papers. Meanwhile, a webserver for BPS predictor is freely available at http://biocomputer.bio.cuhk.edu.hk/BPS .


Assuntos
Modelos Moleculares , RNA Mensageiro/metabolismo , Éxons , Humanos , Íntrons , Ligação Proteica , Precursores de RNA/metabolismo , Splicing de RNA , Ribonucleoproteína Nuclear Pequena U2/química , Ribonucleoproteína Nuclear Pequena U2/metabolismo , Termodinâmica
14.
BMC Genomics ; 18(1): 279, 2017 04 04.
Artigo em Inglês | MEDLINE | ID: mdl-28376774

RESUMO

BACKGROUND: Disulfide bonds are traditionally considered to play only structural roles. In recent years, increasing evidence suggests that the disulfide proteome is made up of structural disulfides and reversible disulfides. Unlike structural disulfides, reversible disulfides are usually of important functional roles and may serve as redox switches. Interestingly, only specific disulfide bonds are reversible while others are not. However, whether reversible disulfides can be predicted based on structural information remains largely unknown. METHODS: In this study, two datasets with both types of disulfides were compiled using independent approaches. By comparison of various features extracted from the local structural signatures, we identified several features that differ significantly between reversible and structural disulfides, including disulfide bond length, along with the number, amino acid composition, secondary structure and physical-chemical properties of surrounding amino acids. A SVM-based classifier was developed for predicting reversible disulfides. RESULTS: By 10-fold cross-validation, the model achieved accuracy of 0.750, sensitivity of 0.352, specificity of 0.953, MCC of 0.405 and AUC of 0.751 using the RevSS_PDB dataset. The robustness was further validated by using RevSS_RedoxDB as independent testing dataset. This model was applied to proteins with known structures in the PDB database. The results show that one third of the predicted reversible disulfide containing proteins are well-known redox enzymes, while the remaining are non-enzyme proteins. Given that reversible disulfides are frequently reported from functionally important non-enzyme proteins such as transcription factors, the predictions may provide valuable candidates of novel reversible disulfides for further experimental investigation. CONCLUSIONS: This study provides the first comparative analysis between the reversible and the structural disulfides. Distinct features remarkably different between these two groups of disulfides were identified, and a SVM-based classifier for predicting reversible disulfides was developed accordingly. A web server named RevssPred can be accessed freely from: http://biocomputer.bio.cuhk.edu.hk/RevssPred .


Assuntos
Cistina/química , Proteínas/química , Software , Sequência de Aminoácidos , Simulação por Computador , Humanos , Modelos Moleculares , Estrutura Secundária de Proteína , Curva ROC , Máquina de Vetores de Suporte
15.
BMC Bioinformatics ; 17(1): 316, 2016 Aug 24.
Artigo em Inglês | MEDLINE | ID: mdl-27553667

RESUMO

BACKGROUND: Reactive oxygen species can modify the structure and function of proteins and may also act as important signaling molecules in various cellular processes. Cysteine thiol groups of proteins are particularly susceptible to oxidation. Meanwhile, their reversible oxidation is of critical roles for redox regulation and signaling. Recently, several computational tools have been developed for predicting redox-sensitive cysteines; however, those methods either only focus on catalytic redox-sensitive cysteines in thiol oxidoreductases, or heavily depend on protein structural data, thus cannot be widely used. RESULTS: In this study, we analyzed various sequence-based features potentially related to cysteine redox-sensitivity, and identified three types of features for efficient computational prediction of redox-sensitive cysteines. These features are: sequential distance to the nearby cysteines, PSSM profile and predicted secondary structure of flanking residues. After further feature selection using SVM-RFE, we developed Redox-Sensitive Cysteine Predictor (RSCP), a SVM based classifier for redox-sensitive cysteine prediction using primary sequence only. Using 10-fold cross-validation on RSC758 dataset, the accuracy, sensitivity, specificity, MCC and AUC were estimated as 0.679, 0.602, 0.756, 0.362 and 0.727, respectively. When evaluated using 10-fold cross-validation with BALOSCTdb dataset which has structure information, the model achieved performance comparable to current structure-based method. Further validation using an independent dataset indicates it is robust and of relatively better accuracy for predicting redox-sensitive cysteines from non-enzyme proteins. CONCLUSIONS: In this study, we developed a sequence-based classifier for predicting redox-sensitive cysteines. The major advantage of this method is that it does not rely on protein structure data, which ensures more extensive application compared to other current implementations. Accurate prediction of redox-sensitive cysteines not only enhances our understanding about the redox sensitivity of cysteine, it may also complement the proteomics approach and facilitate further experimental investigation of important redox-sensitive cysteines.


Assuntos
Cisteína/química , Análise de Sequência de Proteína/métodos , Oxirredução , Estrutura Secundária de Proteína , Proteínas/química , Máquina de Vetores de Suporte
16.
BMC Genomics ; 14: 851, 2013 Dec 05.
Artigo em Inglês | MEDLINE | ID: mdl-24304681

RESUMO

BACKGROUND: Cavendish, the most widely grown banana cultivar, is relatively resistant to Race 1 of Fusarium oxysporum f. sp. cubense (Foc1) which caused widespread Panama disease during the first half of the 20th century but is susceptible to Tropical Race 4 of Foc (Foc TR4) which is threatening world banana production. The genome of the diploid species Musa acuminata which is the ancestor of a majority of triploid banana cultivars has recently been sequenced. Availability of banana transcriptomes will be highly useful for improving banana genome annotation and for biological research. The knowledge of global gene expression patterns influenced by infection of different Foc races will help to understand the host responses to the infection. RESULTS: RNA samples from different organs of the Cavendish cultivar were pooled for deep sequencing using the Illumina technology. Analysis of the banana transcriptome led to identification of over 842 genes that were not annotated by the Musa genome project. A large number of simple nucleotide polymorphisms (SNPs) and short insertions and deletion (indels) were identified from the transcriptome data. GFP-expressing Foc1 and Foc TR4 were used to monitor the infection process. Both Foc1 and Foc TR4 were found to be able to invade banana roots and spread to root vascular tissues in the first two days following inoculation. Digital gene expression (DGE) profiling analysis reveal that the infection by Foc1 and Foc TR4 caused very similar changes in the global gene expression profiles in the banana roots during the first two days of infection. The Foc infection led to induction of many well-known defense-related genes. Two genes encoding the ethylene biosynthetic enzyme ACC oxidase and several ethylene-responsive transcription factors (ERF) were among the strongly induced genes by both Foc1 and Foc TR4. CONCLUSIONS: Both Foc1 and Foc TR4 are able to spread into the vascular system of banana roots during the early infection process and their infection led to similar gene expression profiles in banana roots. The transcriptome profiling analysis indicates that the ethylene synthetic and signalling pathways were activated in response to the Foc infection.


Assuntos
Perfilação da Expressão Gênica , Regulação da Expressão Gênica de Plantas , Musa/genética , Doenças das Plantas/genética , Raízes de Plantas/genética , Transcriptoma , Análise por Conglomerados , Etiquetas de Sequências Expressas , Fusarium , Expressão Gênica , Genes Reporter , Mutação INDEL , Dados de Sequência Molecular , Musa/microbiologia , Doenças das Plantas/microbiologia , Raízes de Plantas/microbiologia , Polimorfismo de Nucleotídeo Único
17.
Bioinformatics ; 28(19): 2551-2, 2012 Oct 01.
Artigo em Inglês | MEDLINE | ID: mdl-22833525

RESUMO

SUMMARY: Redox regulation and signaling, which are involved in various cellular processes, have become one of the research focuses in the past decade. Cysteine thiol groups are particularly susceptible to post-translational modification, and their reversible oxidation is of critical role in redox regulation and signaling. With the tremendous improvement of techniques, hundreds of redox proteins along with their redox-sensitive cysteines have been reported, and the number is still fast growing. However, until now there is no database to accommodate the rapid accumulation of information on protein oxidative modification. Here we present RedoxDB-a manually curated database for experimentally validated redox proteins. RedoxDB (version 1.0) consists of two datasets (A and B, for proteins with or without verified modified cysteines, respectively) and includes 2157 redox proteins containing 2203 cysteine residues with oxidative modification. For each modified cysteine, the exact position, modification type and flanking sequence are provided. Additional information, including gene name, organism, sequence, literature references and links to UniProt and PDB, is also supplied. The database supports several functions including data search, blast and browsing. Bulk download of the entire dataset is also available. We expect that RedoxDB will be useful for both experimental studies and computational analyses of protein oxidative modification. AVAILABILITY: The database is freely available at: http://biocomputer.bio.cuhk.edu.hk/RedoxDB. CONTACT: djguo@cuhk.edu.hk SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics Online.


Assuntos
Cisteína/química , Bases de Dados de Proteínas , Oxirredução , Proteínas/química , Biologia Computacional , Processamento de Proteína Pós-Traducional , Compostos de Sulfidrila/química , Interface Usuário-Computador
18.
BMC Bioinformatics ; 13: 66, 2012 Apr 30.
Artigo em Inglês | MEDLINE | ID: mdl-22545727

RESUMO

BACKGROUND: Type III Secretion System (T3SS), which plays important roles in pathogenesis or symbiosis, is widely expressed in a variety of gram negative bacteria. However, lack of unique nomenclature for T3SS genes has hindered T3SS related research. It is necessary to set up a knowledgebase integrating T3SS-related research data to facilitate the communication between different research groups interested in different bacteria. DESCRIPTION: A T3SS-related Database (T3DB) was developed. T3DB serves as an integrated platform for sequence collection, function annotation, and ortholog classification for T3SS related apparatus, effector, chaperone and regulatory genes. The collection of T3SS-containing bacteria, T3SS-related genes, function annotation, and the ortholog information were all manually curated from literature. BPBAac, a highly efficient T3SS effector prediction tool, was also implemented. CONCLUSIONS: T3DB is the first systematic platform integrating well-annotated T3SS-related gene and protein information to facilitate T3SS and bacterial pathogenecity related research. The newly constructed T3 ortholog clusters may faciliate effective communication between different research groups and will promote de novo discoveries. Besides, the manually-curated high-quality effector and chaperone data are useful for feature analysis and evolutionary studies of these important proteins.


Assuntos
Bactérias/genética , Sistemas de Secreção Bacterianos , Bases de Dados Genéticas , Bactérias/metabolismo , Genes Bacterianos , Chaperonas Moleculares/genética , Chaperonas Moleculares/metabolismo , Anotação de Sequência Molecular , Software , Interface Usuário-Computador
19.
Plant Biotechnol J ; 10(1): 31-42, 2012 Jan.
Artigo em Inglês | MEDLINE | ID: mdl-21645203

RESUMO

Brassica juncea 3-hydroxy-3-methylglutaryl-CoA synthase (HMGS) is encoded by four isogenes (BjHMGS1-BjHMGS4). In vitro enzyme assays had indicated that the recombinant BjHMGS1 H188N mutant lacked substrate inhibition by acetoacetyl-CoA (AcAc-CoA) and showed 8-fold decreased enzyme activity. The S359A mutant demonstrated 10-fold higher activity, while the H188N/S359A double mutant displayed a 10-fold increased enzyme activity and lacked inhibition by AcAc-CoA. Here, wild-type and mutant BjHMGS1 were overexpressed in Arabidopsis to examine their effects in planta. The expression of selected genes in isoprenoid biosynthesis, isoprenoid content, seed germination and stress tolerance was analysed in HMGS overexpressors (OEs). Those mRNAs encoding enzymes 3-hydroxy-3-methylglutaryl-CoA reductase (HMGR), sterol methyltransferase 2 (SMT2), delta-24 sterol reductase (DWF1), C-22 sterol desaturase (CYP710A1) and brassinosteroid-6-oxidase 2 (BR6OX2) were up-regulated in HMGS-OEs. The total sterol content in leaves and seedlings of OE-wtBjHMGS1, OE-S359A and OE-H188N/S359A was significantly higher than OE-H188N. HMGS-OE seeds germinated earlier than wild-type and vector-transformed controls. HMGS-OEs further displayed reduced hydrogen peroxide (H(2) O(2) )-induced cell death and constitutive expression of salicylic acid (SA)-dependent pathogenesis-related genes (PR1, PR2 and PR5), resulting in an increased resistance to Botrytis cinerea, with OE-S359A showing the highest and OE-H188N the lowest tolerance. These results suggest that overexpression of HMGS up-regulates HMGR, SMT2, DWF1, CYP710A1 and BR6OX2, leading to enhanced sterol content and stress tolerance in Arabidopsis.


Assuntos
Acil Coenzima A/metabolismo , Arabidopsis/genética , Mostardeira/enzimologia , Mutação/genética , Esteróis/biossíntese , Estresse Fisiológico/genética , Regulação para Cima/genética , Acil Coenzima A/genética , Adaptação Fisiológica/efeitos dos fármacos , Adaptação Fisiológica/genética , Arabidopsis/efeitos dos fármacos , Arabidopsis/imunologia , Arabidopsis/microbiologia , Proteínas de Arabidopsis/genética , Proteínas de Arabidopsis/metabolismo , Vias Biossintéticas/efeitos dos fármacos , Vias Biossintéticas/genética , Botrytis/efeitos dos fármacos , Botrytis/fisiologia , Morte Celular/efeitos dos fármacos , Resistência à Doença/efeitos dos fármacos , Resistência à Doença/genética , Regulação Enzimológica da Expressão Gênica/efeitos dos fármacos , Regulação da Expressão Gênica de Plantas/efeitos dos fármacos , Vetores Genéticos/genética , Germinação/efeitos dos fármacos , Germinação/genética , Peróxido de Hidrogênio/farmacologia , Mostardeira/efeitos dos fármacos , Folhas de Planta/efeitos dos fármacos , Folhas de Planta/enzimologia , Folhas de Planta/genética , Folhas de Planta/microbiologia , Plantas Geneticamente Modificadas , Plântula/efeitos dos fármacos , Plântula/genética , Plântula/microbiologia , Transformação Genética/efeitos dos fármacos , Regulação para Cima/efeitos dos fármacos
20.
Bioinformatics ; 27(6): 777-84, 2011 Mar 15.
Artigo em Inglês | MEDLINE | ID: mdl-21233168

RESUMO

MOTIVATION: Bacterial type III secreted (T3S) effectors are delivered into host cells specifically via type III secretion systems (T3SSs), which play important roles in the interaction between bacteria and their hosts. Previous computational methods for T3S protein prediction have only achieved limited accuracy, and distinct features for effective T3S protein prediction remain to be identified. RESULTS: In this work, a distinctive N-terminal position-specific amino acid composition (Aac) feature was identified for T3S proteins. A large portion (∼50%) of T3S proteins exhibit distinct position-specific Aac features that can tolerate position shift. A classifier, BPBAac, was developed and trained using Support Vector Machine (SVM) based on the Aac feature extracted using a Bi-profile Bayes model. We demonstrated that the BPBAac model outperformed other implementations in classification of T3S and non-T3S proteins, giving an average sensitivity of ∼90.97% and an average selectivity of ∼97.42% in a 5-fold cross-validation evaluation. The model was also robust when a small-size training dataset was used. The fact that the position-specific Aac feature is commonly found in T3S proteins across different bacterial species gives this model wide application. To demonstrate the model's application, a genome-wide prediction of T3S effector proteins was performed for Ralstonia solanacearum, an important plant pathogenic bacterium, and a number of putative candidates were identified using this model. AVAILABILITY: An R package of BPBAac tool is freely downloadable from: http://biocomputer.bio.cuhk.edu.hk/softwares/BPBAac.


Assuntos
Proteínas de Bactérias/química , Biologia Computacional/métodos , Ralstonia solanacearum/química , Análise de Sequência de Proteína/métodos , Algoritmos , Sequência de Aminoácidos , Proteínas de Bactérias/classificação , Sistemas de Secreção Bacterianos , Teorema de Bayes , Genoma Bacteriano , Modelos Estatísticos , Software
SELEÇÃO DE REFERÊNCIAS
DETALHE DA PESQUISA