Your browser doesn't support javascript.
loading
Mostrar: 20 | 50 | 100
Resultados 1 - 20 de 77
Filtrar
Más filtros

Banco de datos
Tipo del documento
País de afiliación
Intervalo de año de publicación
1.
PLoS Comput Biol ; 19(6): e1011218, 2023 06.
Artículo en Inglés | MEDLINE | ID: mdl-37289843

RESUMEN

Synthetic lethality (SL) occurs when mutations in two genes together lead to cell or organism death, while a single mutation in either gene does not have a significant impact. This concept can also be extended to three or more genes for SL. Computational and experimental methods have been developed to predict and verify SL gene pairs, especially for yeast and Escherichia coli. However, there is currently a lack of a specialized platform to collect microbial SL gene pairs. Therefore, we designed a synthetic interaction database for microbial genetics that collects 13,313 SL and 2,994 Synthetic Rescue (SR) gene pairs that are reported in the literature, as well as 86,981 putative SL pairs got through homologous transfer method in 281 bacterial genomes. Our database website provides multiple functions such as search, browse, visualization, and Blast. Based on the SL interaction data in the S. cerevisiae, we review the issue of duplications' essentiality and observed that the duplicated genes and singletons have a similar ratio of being essential when we consider both individual and SL. The Microbial Synthetic Lethal and Rescue Database (Mslar) is expected to be a useful reference resource for researchers interested in the SL and SR genes of microorganisms. Mslar is open freely to everyone and available on the web at http://guolab.whu.edu.cn/Mslar/.


Asunto(s)
Neoplasias , Saccharomyces cerevisiae , Humanos , Saccharomyces cerevisiae/genética , Mutaciones Letales Sintéticas , Mutación , Genoma Bacteriano/genética , Bases de Datos Genéticas , Neoplasias/genética
2.
Methods ; 210: 10-19, 2023 02.
Artículo en Inglés | MEDLINE | ID: mdl-36621557

RESUMEN

Proteins encoded by small open reading frames (sORFs) can serve as functional elements playing important roles in vivo. Such sORFs also constitute the potential pool for facilitating the de novo gene birth, driving evolutionary innovation and species diversity. Therefore, their theoretical and experimental identification has become a critical issue. Herein, we proposed a protein-coding sORFs prediction method merely based on integrative sequence-derived features. Our prediction performance is better or comparable compared with other nine prevalent methods, which shows that our method can provide a relatively reliable research tool for the prediction of protein-coding sORFs. Our method allows users to estimate the potential expression of a queried sORF, which has been demonstrated by the correlation analysis between our possibility estimation and codon adaption index (CAI). Based on the features that we used, we demonstrated that the sequence features of the protein-coding sORFs in the two domains have significant differences implying that it might be a relatively hard task in terms of cross-domain prediction, hence domain-specific models were developed, which allowed users to predict protein-coding sORFs both in eukaryotes and prokaryotes. Finally, a web-server was developed and provided to boost and facilitate the study of the related field, which is freely available at http://guolab.whu.edu.cn/codingCapacity/index.html.


Asunto(s)
Bosques Aleatorios , Sistemas de Lectura Abierta/genética
3.
BMC Genomics ; 24(1): 482, 2023 Aug 24.
Artículo en Inglés | MEDLINE | ID: mdl-37620754

RESUMEN

BACKGROUND: The natural products, metabolites, of gut microbes are crucial effect factors on diseases. Comprehensive identification and annotation of relationships among disease, metabolites, and microbes can provide efficient and targeted solutions towards understanding the mechanism of complex disease and development of new markers and drugs. RESULTS: We developed Gut Microbial Metabolite Association with Disease (GMMAD), a manually curated database of associations among human diseases, gut microbes, and metabolites of gut microbes. Here, this initial release (i) contains 3,836 disease-microbe associations and 879,263 microbe-metabolite associations, which were extracted from literatures and available resources and then experienced our manual curation; (ii) defines an association strength score and a confidence score. With these two scores, GMMAD predicted 220,690 disease-metabolite associations, where the metabolites all belong to the gut microbes. We think that the positive effective (with both scores higher than suggested thresholds) associations will help identify disease marker and understand the pathogenic mechanism from the sense of gut microbes. The negative effective associations would be taken as biomarkers and have the potential as drug candidates. Literature proofs supported our proposal with experimental consistence; (iii) provides a user-friendly web interface that allows users to browse, search, and download information on associations among diseases, metabolites, and microbes. The resource is freely available at http://guolab.whu.edu.cn/GMMAD . CONCLUSIONS: As the online-available unique resource for gut microbial metabolite-disease associations, GMMAD is helpful for researchers to explore mechanisms of disease- metabolite-microbe and screen the drug and marker candidates for different diseases.


Asunto(s)
Productos Biológicos , Microbioma Gastrointestinal , Humanos , Bases de Datos Factuales , Levamisol
4.
Brief Bioinform ; 21(1): 171-181, 2020 Jan 17.
Artículo en Inglés | MEDLINE | ID: mdl-30496347

RESUMEN

Essential genes have attracted increasing attention in recent years due to the important functions of these genes in organisms. Among the methods used to identify the essential genes, accurate and efficient computational methods can make up for the deficiencies of expensive and time-consuming experimental technologies. In this review, we have collected researches on essential gene predictions in prokaryotes and eukaryotes and summarized the five predominant types of features used in these studies. The five types of features include evolutionary conservation, domain information, network topology, sequence component and expression level. We have described how to implement the useful forms of these features and evaluated their performance based on the data of Escherichia coli MG1655, Bacillus subtilis 168 and human. The prerequisite and applicable range of these features is described. In addition, we have investigated the techniques used to weight features in various models. To facilitate researchers in the field, two available online tools, which are accessible for free and can be directly used to predict gene essentiality in prokaryotes and humans, were referred. This article provides a simple guide for the identification of essential genes in prokaryotes and eukaryotes.

5.
Nucleic Acids Res ; 46(D1): D393-D398, 2018 01 04.
Artículo en Inglés | MEDLINE | ID: mdl-29036676

RESUMEN

CRISPR-Cas is a tool that is widely used for gene editing. However, unexpected off-target effects may occur as a result of long-term nuclease activity. Anti-CRISPR proteins, which are powerful molecules that inhibit the CRISPR-Cas system, may have the potential to promote better utilization of the CRISPR-Cas system in gene editing, especially for gene therapy. Additionally, more in-depth research on these proteins would help researchers to better understand the co-evolution of bacteria and phages. Therefore, it is necessary to collect and integrate data on various types of anti-CRISPRs. Herein, data on these proteins were manually gathered through data screening of the literatures. Then, the first online resource, anti-CRISPRdb, was constructed for effectively organizing these proteins. It contains the available protein sequences, DNA sequences, coding regions, source organisms, taxonomy, virulence, protein interactors and their corresponding three-dimensional structures. Users can access our database at http://cefg.uestc.edu.cn/anti-CRISPRdb/ without registration. We believe that the anti-CRISPRdb can be used as a resource to facilitate research on anti-CRISPR proteins and in related fields.


Asunto(s)
Bacteriófagos/fisiología , Sistemas CRISPR-Cas , Bases de Datos de Proteínas , Proteínas Virales/química , Proteínas Virales/genética , Proteínas Virales/metabolismo
6.
Brief Bioinform ; 18(3): 357-366, 2017 05 01.
Artículo en Inglés | MEDLINE | ID: mdl-26992782

RESUMEN

Genomic islands are genomic fragments of alien origin in bacterial and archaeal genomes, usually involved in symbiosis or pathogenesis. In this work, we described Zisland Explorer, a novel tool to predict genomic islands based on the segmental cumulative GC profile. Zisland Explorer was designed with a novel strategy, as well as a combination of the homogeneity and heterogeneity of genomic sequences. While the sequence homogeneity reflects the composition consistence within each island, the heterogeneity measures the composition bias between an island and the core genome. The performance of Zisland Explorer was evaluated on the data sets of 11 different organisms. Our results suggested that the true-positive rate (TPR) of Zisland Explorer was at least 10.3% higher than that of four other widely used tools. On the other hand, the new tool did not lose overall accuracy with the improvement in the TPR and showed better equilibrium among various evaluation indexes. Also, Zisland Explorer showed better accuracy in the prediction of experimental island data. Overall, the tool provides an alternative solution over other tools, which expands the field of island prediction and offers a supplement to increase the performance of the distinct predicting strategy. We have provided a web service as well as a graphical user interface and open-source code across multiple platforms for Zisland Explorer, which is available at http://cefg.uestc.edu.cn/Zisland_Explorer/ or http://tubic.tju.edu.cn/Zisland_Explorer/.


Asunto(s)
Islas Genómicas , Genoma Arqueal , Genoma Bacteriano , Genómica , Programas Informáticos
7.
Environ Microbiol ; 20(10): 3836-3850, 2018 10.
Artículo en Inglés | MEDLINE | ID: mdl-30187624

RESUMEN

To better understand the mechanisms of bacterial adaptation in oxygen environments, we explored the aerobic living-associated genes in bacteria by comparing Clusters of Orthologous Groups of proteins' (COGs) frequencies and gene expression analyses and 38 COGs were detected at significantly higher frequencies (p-value less than 1e-6) in aerobes than in anaerobes. Differential expression analyses between two conditions further narrowed the prediction to 27 aerobe-specific COGs. Then, we annotated the enzymes associated with these COGs. Literature review revealed that 14 COGs contained enzymes catalysing oxygen-involved reactions or products involved in aerobic pathways, suggesting their important roles for survival in aerobic environments. Additionally, protein-protein interaction analyses and step length comparisons of metabolic networks suggested that the other 13 COGs may function relevantly with the 14 enzymes-corresponding COGs, indicating that these genes may be highly associated with oxygen utilization. Phylogenetic and evolutionary analyses showed that the 27 COGs did not have similar trees, and all suffered purifying selection pressures. The divergent times of species containing or lacking aerobic COGs validated that the appearing time of oxygen-utilizing gene was approximately 2.80 Gyr ago. In addition to help better understand oxygen adaption, our method may be extended to identify genes relevant to other living environments.


Asunto(s)
Bacterias/enzimología , Bacterias/metabolismo , Proteínas Bacterianas/metabolismo , Oxígeno/metabolismo , Aerobiosis , Bacterias/clasificación , Bacterias/genética , Proteínas Bacterianas/genética , Evolución Molecular , Redes y Vías Metabólicas , Filogenia
8.
Bioinformatics ; 33(12): 1758-1764, 2017 Jun 15.
Artículo en Inglés | MEDLINE | ID: mdl-28158612

RESUMEN

MOTIVATION: Previously constructed classifiers in predicting eukaryotic essential genes integrated a variety of features including experimental ones. If we can obtain satisfactory prediction using only nucleotide (sequence) information, it would be more promising. Three groups recently identified essential genes in human cancer cell lines using wet experiments and it provided wonderful opportunity to accomplish our idea. Here we improved the Z curve method into the λ-interval form to denote nucleotide composition and association information and used it to construct the SVM classifying model. RESULTS: Our model accurately predicted human gene essentiality with an AUC higher than 0.88 both for 5-fold cross-validation and jackknife tests. These results demonstrated that the essentiality of human genes could be reliably reflected by only sequence information. We re-predicted the negative dataset by our Pheg server and 118 genes were additionally predicted as essential. Among them, 20 were found to be homologues in mouse essential genes, indicating that some of the 118 genes were indeed essential, however previous experiments overlooked them. As the first available server, Pheg could predict essentiality for anonymous gene sequences of human. It is also hoped the λ-interval Z curve method could be effectively extended to classification issues of other DNA elements. AVAILABILITY AND IMPLEMENTATION: http://cefg.uestc.edu.cn/Pheg. CONTACT: fbguo@uestc.edu.cn. SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online.


Asunto(s)
Composición de Base , Genes Esenciales , Análisis de Secuencia de ADN/métodos , Programas Informáticos , Animales , Eucariontes/genética , Humanos , Ratones , Modelos Genéticos
9.
Nucleic Acids Res ; 44(W1): W550-6, 2016 Jul 08.
Artículo en Inglés | MEDLINE | ID: mdl-27150808

RESUMEN

In order to foster innovation and improve the effectiveness of drug discovery, there is a considerable interest in exploring unknown 'chemical space' to identify new bioactive compounds with novel and diverse scaffolds. Hence, fragment-based drug discovery (FBDD) was developed rapidly due to its advanced expansive search for 'chemical space', which can lead to a higher hit rate and ligand efficiency (LE). However, computational screening of fragments is always hampered by the promiscuous binding model. In this study, we developed a new web server Auto Core Fragment in silico Screening (ACFIS). It includes three computational modules, PARA_GEN, CORE_GEN and CAND_GEN. ACFIS can generate core fragment structure from the active molecule using fragment deconstruction analysis and perform in silico screening by growing fragments to the junction of core fragment structure. An integrated energy calculation rapidly identifies which fragments fit the binding site of a protein. We constructed a simple interface to enable users to view top-ranking molecules in 2D and the binding mode in 3D for further experimental exploration. This makes the ACFIS a highly valuable tool for drug discovery. The ACFIS web server is free and open to all users at http://chemyang.ccnu.edu.cn/ccb/server/ACFIS/.


Asunto(s)
Simulación por Computador , Descubrimiento de Drogas/métodos , Evaluación Preclínica de Medicamentos/métodos , Internet , Ligandos , Proteínas/química , Programas Informáticos , Sitios de Unión , Imagenología Tridimensional , Bibliotecas de Moléculas Pequeñas/química , Bibliotecas de Moléculas Pequeñas/farmacología , Interfaz Usuario-Computador
10.
Nucleic Acids Res ; 44(D1): D1127-32, 2016 Jan 04.
Artículo en Inglés | MEDLINE | ID: mdl-26503249

RESUMEN

The BDB database (http://immunet.cn/bdb) is an update of the MimoDB database, which was previously described in the 2012 Nucleic Acids Research Database issue. The rebranded name BDB is short for Biopanning Data Bank, which aims to be a portal for biopanning results of the combinatorial peptide library. Last updated in July 2015, BDB contains 2904 sets of biopanning data collected from 1322 peer-reviewed papers. It contains 25,786 peptide sequences, 1704 targets, 492 known templates, 447 peptide libraries and 310 crystal structures of target-template or target-peptide complexes. All data stored in BDB were revisited, and information on peptide affinity, measurement method and procedures was added for 2298 peptides from 411 sets of biopanning data from 246 published papers. In addition, a more professional and user-friendly web interface was implemented, a more detailed help system was designed, and a new on-the-fly data visualization tool and a series of tools for data analysis were integrated. With these new data and tools made available, we expect that the BDB database would become a major resource for scholars using phage display, with improved utility for biopanning and related scientific communities.


Asunto(s)
Bases de Datos de Compuestos Químicos , Biblioteca de Péptidos , Péptidos/química , Técnicas de Visualización de Superficie Celular , Internet , Programas Informáticos
11.
Environ Microbiol ; 19(3): 1266-1280, 2017 03.
Artículo en Inglés | MEDLINE | ID: mdl-28028888

RESUMEN

Laribacter hongkongensis is a fish-borne pathogen associated with invasive infections and gastroenteritis. Its adaptive mechanisms to oxygen-limiting conditions in various environmental niches remain unclear. In this study, we compared the transcriptional profiles of L. hongkongensis under aerobic and anaerobic conditions using RNA-sequencing. Expression of genes involved in arginine metabolism significantly increased under anoxic conditions. Arginine was exploited as the sole energy source in L. hongkongensis for anaerobic respiration via the arginine catabolism pathway: specifically via the arginine deiminase (ADI) pathway. A transcriptional regulator FNR was identified to coordinate anaerobic metabolism by tightly regulating the expression of arginine metabolism genes. FNR executed its regulatory function by binding to FNR boxes in arc operons promoters. Survival of isogenic fnr mutant in macrophages decreased significantly when compared with wild-type; and expression level of fnr increased 8 h post-infection. Remarkably, FNR directly interacted with ArgR, another regulator that influences the biological fitness and intracellular survival of L. hongkongensis by regulating arginine metabolism genes. Our results demonstrated that FNR and ArgR work in coordination to respond to oxygen changes in both extracellular and intracellular environments, by finely regulating the ADI pathway and arginine anabolism pathways, thereby optimizing bacterial fitness in various environmental niches.


Asunto(s)
Arginina/metabolismo , Proteínas Bacterianas/metabolismo , Betaproteobacteria/fisiología , Regulación Bacteriana de la Expresión Génica , Proteínas Hierro-Azufre/metabolismo , Aclimatación , Adaptación Fisiológica , Anaerobiosis , Proteínas Bacterianas/genética , Betaproteobacteria/genética , Hidrolasas/metabolismo , Proteínas Hierro-Azufre/genética , Redes y Vías Metabólicas , Operón , Regiones Promotoras Genéticas
12.
BMC Microbiol ; 17(1): 73, 2017 03 27.
Artículo en Inglés | MEDLINE | ID: mdl-28347342

RESUMEN

BACKGROUND: Genomic islands (GIs) are genomic regions that reveal evidence of horizontal DNA transfer. They can code for many functions and may augment a bacterium's adaptation to its host or environment. GIs have been identified in strain J2315 of Burkholderia cenocepacia, whereas in strain AU 1054 there has been no published works on such regions according to our text mining and keyword search in Medline. RESULTS: In this study, we identified 21 GIs in AU 1054 by combining two computational tools. Feature analyses suggested that the predictions are highly reliable and hence illustrated the advantage of joint predictions by two independent methods. Based on putative virulence factors, four GIs were further identified as pathogenicity islands (PAIs). Through experiments of gene deletion mutants in live bacteria, two putative PAIs were confirmed, and the virulence factors involved were identified as lipA and copR. The importance of the genes lipA (from PAI 1) and copR (from PAI 2) for bacterial invasion and replication indicates that they are required for the invasive properties of B. cenocepacia and may function as virulence determinants for bacterial pathogenesis and host infection. CONCLUSIONS: This approach of in silico prediction of GIs and subsequent identification of potential virulence factors in the putative island regions with final validation using wet experiments could be used as an effective strategy to rapidly discover novel virulence factors in other bacterial species and strains.


Asunto(s)
Burkholderia cenocepacia/genética , Islas Genómicas/genética , Genómica , Factores de Virulencia/genética , Factores de Virulencia/aislamiento & purificación , Células A549 , Adhesión Bacteriana , Proteínas Bacterianas/genética , Composición de Base , Infecciones por Burkholderia/microbiología , Burkholderia cenocepacia/crecimiento & desarrollo , Burkholderia cenocepacia/patogenicidad , Técnicas de Cultivo de Célula , Recuento de Colonia Microbiana , Biología Computacional/métodos , ADN Bacteriano , Eliminación de Gen , Transferencia de Gen Horizontal , Genes Bacterianos/genética , Genoma Bacteriano/genética , Humanos
13.
Nucleic Acids Res ; 43(W1): W85-90, 2015 Jul 01.
Artículo en Inglés | MEDLINE | ID: mdl-25977299

RESUMEN

In 2003, we developed an ab initio program, ZCURVE 1.0, to find genes in bacterial and archaeal genomes. In this work, we present the updated version (i.e. ZCURVE 3.0). Using 422 prokaryotic genomes, the average accuracy was 93.7% with the updated version, compared with 88.7% with the original version. Such results also demonstrate that ZCURVE 3.0 is comparable with Glimmer 3.02 and may provide complementary predictions to it. In fact, the joint application of the two programs generated better results by correctly finding more annotated genes while also containing fewer false-positive predictions. As the exclusive function, ZCURVE 3.0 contains one post-processing program that can identify essential genes with high accuracy (generally >90%). We hope ZCURVE 3.0 will receive wide use with the web-based running mode. The updated ZCURVE can be freely accessed from http://cefg.uestc.edu.cn/zcurve/ or http://tubic.tju.edu.cn/zcurveb/ without any restrictions.


Asunto(s)
Genes Arqueales , Genes Bacterianos , Programas Informáticos , Algoritmos , Genes Esenciales , Genoma Arqueal , Genoma Bacteriano , Internet
14.
Mol Biol Evol ; 31(5): 1302-8, 2014 May.
Artículo en Inglés | MEDLINE | ID: mdl-24531082

RESUMEN

Mutation is the ultimate source of genetic variation and evolution. Mutation accumulation (MA) experiments are an alternative approach to study de novo mutation events directly. We have constructed a resource of Spontaneous Mutation Accumulation Lines (SMAL; http://cefg.uestc.edu.cn/smal), which contains all the current publicly available MA lines identified by high-throughput sequencing. We have relocated and mapped the mutations based on the most recent genome annotations. A total of 5,608 single base mutations and 540 other mutations were obtained and are recorded in the current version of the SMAL database. The integrated data in SMAL provide detailed information that can be used in new theoretical analyses. We believe that the SMAL resource will help researchers better understand the processes of genetic variation and the incidence of disease.


Asunto(s)
Bases de Datos Genéticas , Mutación , Animales , Drosophila melanogaster/genética , Escherichia coli/genética , Evolución Molecular , Femenino , Flujo Genético , Aptitud Genética , Genómica , Secuenciación de Nucleótidos de Alto Rendimiento , Humanos , Masculino , Modelos Genéticos , Salmonella typhimurium/genética
15.
Int J Mol Sci ; 16(9): 23111-26, 2015 Sep 23.
Artículo en Inglés | MEDLINE | ID: mdl-26404268

RESUMEN

Composition bias from Chargaff's second parity rule (PR2) has long been found in sequenced genomes, and is believed to relate strongly with the replication process in microbial genomes. However, some disagreement on the underlying reason for strand composition bias remains. We performed an integrative analysis of various genomic features that might influence composition bias using a large-scale dataset of 1111 genomes. Our results indicate (1) the bias was stronger in obligate intracellular bacteria than in other free-living species (p-value=0.0305); (2) Fusobacteria and Firmicutes had the highest average bias among the 24 microbial phyla analyzed; (3) the strength of selected codon usage bias and generation times were not observably related to strand composition bias (p-value=0.3247); (4) significant negative relationships were found between GC content, genome size, rearrangement frequency, Clusters of Orthologous Groups (COG) functional subcategories A, C, I, Q, and composition bias (p-values<1.0×10(-8)); (5) gene density and COG functional subcategories D, F, J, L, and V were positively related with composition bias (p-value<2.2×10(-16)); and (6) gene density made the most important contribution to composition bias, indicating transcriptional bias was associated strongly with strand composition bias. Therefore, strand composition bias was found to be influenced by multiple factors with varying weights.


Asunto(s)
Bacterias/genética , Genoma Bacteriano , Composición de Base , Dosificación de Gen , Genes Bacterianos , Análisis de Componente Principal , Recombinación Genética
16.
Nucleic Acids Res ; 40(Database issue): D271-7, 2012 Jan.
Artículo en Inglés | MEDLINE | ID: mdl-22053087

RESUMEN

Mimotopes are peptides with affinities to given targets. They are readily obtained through biopanning against combinatorial peptide libraries constructed by phage display and other display technologies such as mRNA display, ribosome display, bacterial display and yeast display. Mimotopes have been used to infer the protein interaction sites and networks; they are also ideal candidates for developing new diagnostics, therapeutics and vaccines. However, such valuable peptides are not collected in the central data resources such as UniProt and NCBI GenPept due to their 'unnatural' short sequences. The MimoDB database is an information portal to biopanning results of random libraries. In version 2.0, it has 15,633 peptides collected from 849 papers and grouped into 1818 sets. Besides the core data on panning experiments and their results, broad background information on target, template, library and structure is included. An accompanied benchmark has also been compiled for bioinformaticians to develop and evaluate their new models, algorithms and programs. In addition, the MimoDB database provides tools for simple and advanced searches, structure visualization, BLAST and alignment view on the fly. The experimental biologists can easily use the database as a virtual control to exclude possible target-unrelated peptides. The MimoDB database is freely available at http://immunet.cn/mimodb.


Asunto(s)
Bases de Datos de Proteínas , Péptidos/química , Péptidos/metabolismo , Mapeo de Interacción de Proteínas , Alineación de Secuencia , Análisis de Secuencia de Proteína , Programas Informáticos , Interfaz Usuario-Computador
17.
Front Bioeng Biotechnol ; 12: 1377334, 2024.
Artículo en Inglés | MEDLINE | ID: mdl-38590605

RESUMEN

Sinorhizobium fredii CCBAU45436 is an excellent rhizobium that plays an important role in agricultural production. However, there still needs more comprehensive understanding of the metabolic system of S. fredii CCBAU45436, which hinders its application in agriculture. Therefore, based on the first-generation metabolic model iCC541 we developed a new genome-scale metabolic model iAQY970, which contains 970 genes, 1,052 reactions, 942 metabolites and is scored 89% in the MEMOTE test. Cell growth phenotype predicted by iAQY970 is 81.7% consistent with the experimental data. The results of mapping the proteome data under free-living and symbiosis conditions to the model showed that the biomass production rate in the logarithmic phase was faster than that in the stable phase, and the nitrogen fixation efficiency of rhizobia parasitized in cultivated soybean was higher than that in wild-type soybean, which was consistent with the actual situation. In the symbiotic condition, there are 184 genes that would affect growth, of which 94 are essential; In the free-living condition, there are 143 genes that influence growth, of which 78 are essential. Among them, 86 of the 94 essential genes in the symbiotic condition were consistent with the prediction of iCC541, and 44 essential genes were confirmed by literature information; meanwhile, 30 genes were identified by DEG and 33 genes were identified by Geptop. In addition, we extracted four key nitrogen fixation modules from the model and predicted that sulfite reductase (EC 1.8.7.1) and nitrogenase (EC 1.18.6.1) as the target enzymes to enhance nitrogen fixation by MOMA, which provided a potential focus for strain optimization. Through the comprehensive metabolic model, we can better understand the metabolic capabilities of S. fredii CCBAU45436 and make full use of it in the future.

18.
BMC Evol Biol ; 13: 162, 2013 Aug 02.
Artículo en Inglés | MEDLINE | ID: mdl-23914835

RESUMEN

BACKGROUND: Despite rapid progress in understanding the mechanisms that shape the evolution of proteins, the relative importance of various factors remain to be elucidated. In this study, we have assessed the effects of 16 different biological features on the evolutionary rates (ERs) of protein-coding sequences in bacterial genomes. RESULTS: Our analysis of 18 bacterial species revealed new correlations between ERs and constraining factors. Previous studies have suggested that transcriptional abundance overwhelmingly constrains the evolution of yeast protein sequences. This transcriptional abundance leads to selection against misfolding or misinteractions. In this study we found that there was no single factor in determining the evolution of bacterial proteins. Not only transcriptional abundance (codon adaptation index and expression level), but also protein-protein associations (PPAs), essentiality (ESS), subcellular localization of cytoplasmic membrane (SLM), transmembrane helices (TMH) and hydropathicity score (HS) independently and significantly affected the ERs of bacterial proteins. In some species, PPA and ESS demonstrate higher correlations with ER than transcriptional abundance. CONCLUSIONS: Different forces drive the evolution of protein sequences in yeast and bacteria. In bacteria, the constraints are involved in avoiding a build-up of toxic molecules caused by misfolding/misinteraction (transcriptional abundance), while retaining important functions (ESS, PPA) and maintaining the cell membrane (SLM, TMH and HS). Each of these independently contributes to the variation in protein evolution.


Asunto(s)
Bacterias/genética , Proteínas Bacterianas/genética , Evolución Molecular , Transcripción Genética , Secuencia de Aminoácidos , Bacterias/clasificación , Secuencia de Bases , Codón/metabolismo , Genoma Bacteriano , Datos de Secuencia Molecular
19.
BMC Genomics ; 14: 769, 2013 Nov 09.
Artículo en Inglés | MEDLINE | ID: mdl-24209780

RESUMEN

BACKGROUND: Essential genes are indispensable for the survival of living entities. They are the cornerstones of synthetic biology, and are potential candidate targets for antimicrobial and vaccine design. DESCRIPTION: Here we describe the Cluster of Essential Genes (CEG) database, which contains clusters of orthologous essential genes. Based on the size of a cluster, users can easily decide whether an essential gene is conserved in multiple bacterial species or is species-specific. It contains the similarity value of every essential gene cluster against human proteins or genes. The CEG_Match tool is based on the CEG database, and was developed for prediction of essential genes according to function. The database is available at http://cefg.uestc.edu.cn/ceg. CONCLUSIONS: Properties contained in the CEG database, such as cluster size, and the similarity of essential gene clusters against human proteins or genes, are very important for evolutionary research and drug design. An advantage of CEG is that it clusters essential genes based on function, and therefore decreases false positive results when predicting essential genes in comparison with using the similarity alignment method.


Asunto(s)
Bases de Datos Genéticas , Genes Esenciales , Internet , Algoritmos , Humanos , Análisis por Micromatrices , Programas Informáticos , Especificidad de la Especie
20.
Int J Mol Sci ; 13(3): 3134-3144, 2012.
Artículo en Inglés | MEDLINE | ID: mdl-22489145

RESUMEN

Pneumonia is one kind of common infectious disease, which is usually caused by bacteria, viruses, or fungi. In this paper, we predicted genomic islands in three bacterial pathogens of pneumonia. They are Chlamydophila pneumoniae, Mycoplasma pneumoniae and Streptococcus pneumoniae, respectively. For each pathogen, one clinical strain is involved. After implementing the cumulative GC profile combined with h and BCN index, eight genomic islands are found in three pathogens. Among them, six genomic islands are found to have mobility elements, which constitute a kind of conserved character of genomic islands, and this introduces the possibility that they are genuine genomic islands. The present results show that the cumulative GC profile when combined with h and BCN indexes is a good method for predicting genomic islands in bacteria and it has lower false positive rate than the SIGI method. Specially, three genomic islands are found to contain clusters of genes coding for production of virulence factors and this is useful for research into the pathogenicity of these pathogens and helpful for the treatment of diseases caused by them.


Asunto(s)
Genoma Bacteriano , Islas Genómicas , Neumonía Bacteriana/microbiología , Composición de Base , Neumonía por Clamidia/microbiología , Chlamydophila pneumoniae/genética , Chlamydophila pneumoniae/patogenicidad , ADN Bacteriano/genética , Transferencia de Gen Horizontal , Humanos , Mycoplasma pneumoniae/genética , Mycoplasma pneumoniae/patogenicidad , Neumonía por Mycoplasma/microbiología , Neumonía Neumocócica/microbiología , Streptococcus pneumoniae/genética , Streptococcus pneumoniae/patogenicidad
SELECCIÓN DE REFERENCIAS
DETALLE DE LA BÚSQUEDA