Your browser doesn't support javascript.
loading
Mostrar: 20 | 50 | 100
Resultados 1 - 20 de 30
Filtrar
1.
Bioinformatics ; 38(5): 1447-1449, 2022 02 07.
Artigo em Inglês | MEDLINE | ID: mdl-34904625

RESUMO

SUMMARY: Phage-Host Interaction Search Tool (PHIST) predicts prokaryotic hosts of viruses based on exact matches between viral and host genomes. It improves host prediction accuracy at species level over current alignment-based tools (on average by 3 percentage points) as well as alignment-free and CRISPR-based tools (by 14-20 percentage points). PHIST is also two orders of magnitude faster than alignment-based tools making it suitable for metagenomics studies. AVAILABILITY AND IMPLEMENTATION: GNU-licensed C++ code wrapped in Python API available at: https://github.com/refresh-bio/phist. SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online.


Assuntos
Bacteriófagos , Vírus , Bacteriófagos/genética , Metagenômica , Vírus/genética , Metagenoma , Software
2.
J Exp Bot ; 73(13): 4528-4545, 2022 07 16.
Artigo em Inglês | MEDLINE | ID: mdl-35275209

RESUMO

MicroRNAs (miRNAs) are small non-coding endogenous RNA molecules, 18-24 nucleotides long, that control multiple gene regulatory pathways via post-transcriptional gene silencing in eukaryotes. To develop a comprehensive picture of the evolutionary history of miRNA biogenesis and action in land plants, studies on bryophyte representatives are needed. Here, we review current understanding of liverwort MIR gene structure, miRNA biogenesis, and function, focusing on the simple thalloid Pellia endiviifolia and the complex thalloid Marchantia polymorpha. We review what is known about conserved and non-conserved miRNAs, their targets, and the functional implications of miRNA action in M. polymorpha and P. endiviifolia. We note that most M. polymorpha miRNAs are encoded within protein-coding genes and provide data for 23 MIR gene structures recognized as independent transcriptional units. We identify M. polymorpha genes involved in miRNA biogenesis that are homologous to those identified in higher plants, including those encoding core microprocessor components and other auxiliary and regulatory proteins that influence the stability, folding, and processing of pri-miRNAs. We analyzed miRNA biogenesis proteins and found similar domain architecture in most cases. Our data support the hypothesis that almost all miRNA biogenesis factors in higher plants are also present in liverworts, suggesting that they emerged early during land plant evolution.


Assuntos
Embriófitas , Hepatófitas , MicroRNAs , Embriófitas/genética , Embriófitas/metabolismo , Hepatófitas/genética , Hepatófitas/metabolismo , MicroRNAs/genética , MicroRNAs/metabolismo , Interferência de RNA , Processamento Pós-Transcricional do RNA
3.
BMC Biol ; 19(1): 223, 2021 10 08.
Artigo em Inglês | MEDLINE | ID: mdl-34625070

RESUMO

BACKGROUND: Characterizing phage-host interactions is critical to understanding the ecological role of both partners and effective isolation of phage therapeuticals. Unfortunately, experimental methods for studying these interactions are markedly slow, low-throughput, and unsuitable for phages or hosts difficult to maintain in laboratory conditions. Therefore, a number of in silico methods emerged to predict prokaryotic hosts based on viral sequences. One of the leading approaches is the application of the BLAST tool that searches for local similarities between viral and microbial genomes. However, this prediction method has three major limitations: (i) top-scoring sequences do not always point to the actual host; (ii) mosaic virus genomes may match to many, typically related, bacteria; and (iii) viral and host sequences may diverge beyond the point where their relationship can be detected by a BLAST alignment. RESULTS: We created an extension to BLAST, named Phirbo, that improves host prediction quality beyond what is obtainable from standard BLAST searches. The tool harnesses information concerning sequence similarity and bacteria relatedness to predict phage-host interactions. Phirbo was evaluated on three benchmark sets of known virus-host pairs, and it improved precision and recall by 11-40 percentage points over currently available, state-of-the-art, alignment-based, alignment-free, and machine-learning host prediction tools. Moreover, the discriminatory power of Phirbo for the recognition of virus-host relationships surpassed the results of other tools by at least 10 percentage points (area under the curve = 0.95), yielding a mean host prediction accuracy of 57% and 68% at the genus and family levels, respectively, and drops by 12 percentage points when using only a fraction of viral genome sequences (3 kb). Finally, we provide insights into a repertoire of protein and ncRNA genes that are shared between phages and hosts and may be prone to horizontal transfer during infection. CONCLUSIONS: Our results suggest that Phirbo is a simple and effective tool for predicting phage-host relationships.


Assuntos
Bacteriófagos , Vírus , Bactérias/genética , Bacteriófagos/genética , Genoma Viral , Aprendizado de Máquina , Vírus/genética
4.
Int J Mol Sci ; 21(18)2020 Sep 10.
Artigo em Inglês | MEDLINE | ID: mdl-32927724

RESUMO

Glycogen synthase kinase 3 (GSK3) is a highly conserved kinase present in all eukaryotes and functions as a key regulator of a wide range of physiological and developmental processes. The kinase, known in land plants as GSK3/SHAGGY-like kinase (GSK), is a key player in the brassinosteroid (BR) signaling pathway. The GSK genes, through the BRs, affect diverse developmental processes and modulate responses to environmental factors. In this work, we describe functional analysis of HvGSK1.1, which is one of the GSK3/SHAGGY-like orthologs in barley. The RNAi-mediated silencing of the target HvGSK1.1 gene was associated with modified expression of its paralogs HvGSK1.2, HvGSK2.1, HvGSK3.1, and HvGSK4.1 in plants grown in normal and in salt stress conditions. Low nucleotide similarity between the silencing fragment and barley GSK genes and the presence of BR-dependent transcription factors' binding sites in promoter regions of barley and rice GSK genes imply an innate mechanism responsible for co-regulation of the genes. The results of the leaf inclination assay indicated that silencing of HvGSK1.1 and the changes of GSK paralogs enhanced the BR-dependent signaling in the plants. The strongest phenotype of transgenic lines with downregulated HvGSK1.1 and GSK paralogs had greater biomass of the seedlings grown in normal conditions and salt stress as well as elevated kernel weight of plants grown in normal conditions. Both traits showed a strong negative correlation with the transcript level of the target gene and the paralogs. The characteristics of barley lines with silenced expression of HvGSK1.1 are compatible with the expected phenotypes of plants with enhanced BR signaling. The results show that manipulation of the GSK-encoding genes provides data to explore their biological functions and confirm it as a feasible strategy to generate plants with improved agricultural traits.


Assuntos
Quinases da Glicogênio Sintase/fisiologia , Hordeum/fisiologia , Tolerância ao Sal/genética , Sementes/crescimento & desenvolvimento , Biomassa , Brassinosteroides/metabolismo , Inativação Gênica , Proteínas de Plantas/fisiologia , Plantas Geneticamente Modificadas/enzimologia , Plantas Geneticamente Modificadas/crescimento & desenvolvimento
5.
Plant Cell Physiol ; 59(1): e1, 2018 Jan 01.
Artigo em Inglês | MEDLINE | ID: mdl-29145635

RESUMO

tRNA-derived fragments (tRFs) constitute a new class of short regulatory RNAs that are a product of nascent or mature tRNA processing. tRF sequences have been identified in all domains of life; however, most published research pertains to human, yeast and some bacterial organisms. Despite growing interest in plant tRFs and accumulating evidence of their function in plant development and stress responses, no public, web-based repository dedicated to these molecules is currently available. Here, we introduce tRex (http://combio.pl/trex)-the first comprehensive data-driven online resource specifically dedicated to tRFs in the model plant Arabidopsis thaliana. The portal is based on verified Arabidopsis tRNA annotation and includes in-house-generated and publicly available small RNA sequencing experiments from various tissues, ecotypes, genotypes and stress conditions. The provided web-based tools are designed in a user-friendly manner and allow for seamless exploration of the data that are presented in the form of dynamic tables and cumulative coverage profiles. The tRex database is connected to external genomic and citation resources, which makes it a one-stop solution for Arabidopsis tRF-related research.


Assuntos
Arabidopsis/genética , Biologia Computacional/métodos , Bases de Dados Genéticas , RNA de Plantas/genética , RNA de Transferência/genética , Sequência de Bases , Internet , Conformação de Ácido Nucleico , RNA de Plantas/química , RNA de Transferência/química , Análise de Sequência de RNA/métodos , Homologia de Sequência do Ácido Nucleico
6.
Bioinformatics ; 33(8): 1224-1226, 2017 04 15.
Artigo em Inglês | MEDLINE | ID: mdl-28057683

RESUMO

Summary: ORCAN (ORtholog sCANner) is a web-based meta-server for one-click evolutionary and functional annotation of protein sequences. The server combines information from the most popular orthology-prediction resources, including four tools and four online databases. Functional annotation utilizes five additional comparisons between the query and identified homologs, including: sequence similarity, protein domain architectures, functional motifs, Gene Ontology term assignments and a list of associated articles. Furthermore, the server uses a plurality-based rating system to evaluate the orthology relationships and to rank the reference proteins by their evolutionary and functional relevance to the query. Using a dataset of ∼1 million true yeast orthologs as a sample reference set, we show that combining multiple orthology-prediction tools in ORCAN increases the sensitivity and precision by 1-2 percent points. Availability and Implementation: The service is available for free at http://www.combio.pl/orcan/ . Contact: wmk@amu.edu.pl. Supplementary information: Supplementary data are available at Bioinformatics online.


Assuntos
Biologia Computacional/métodos , Internet , Análise de Sequência de Proteína/métodos , Homologia de Sequência de Aminoácidos , Software , Ontologia Genética , Domínios Proteicos , Leveduras/genética , Leveduras/metabolismo
7.
Nucleic Acids Res ; 44(D1): D180-3, 2016 Jan 04.
Artigo em Inglês | MEDLINE | ID: mdl-26490961

RESUMO

Ribosomal 5S RNA (5S rRNA) is the ubiquitous RNA component found in the large subunit of ribosomes in all known organisms. Due to its small size, abundance and evolutionary conservation 5S rRNA for many years now is used as a model molecule in studies on RNA structure, RNA-protein interactions and molecular phylogeny. 5SRNAdb (http://combio.pl/5srnadb/) is the first database that provides a high quality reference set of ribosomal 5S RNAs (5S rRNA) across three domains of life. Here, we give an overview of new developments in the database and associated web tools since 2002, including updates to database content, curation processes and user web interfaces.


Assuntos
Bases de Dados de Ácidos Nucleicos , RNA Ribossômico 5S/química , RNA Ribossômico 5S/genética , Eucariotos/genética , Conformação de Ácido Nucleico , RNA Arqueal/química , RNA Arqueal/genética , RNA Bacteriano/química , RNA Bacteriano/genética
8.
Drug Chem Toxicol ; 40(4): 470-483, 2017 Oct.
Artigo em Inglês | MEDLINE | ID: mdl-28049353

RESUMO

Nicotine may affect cell function by rearranging protein complexes. We aimed to determine nicotine-induced alterations of protein complexes in Caenorhabditis elegans (C. elegans) cells, thereby revealing links between nicotine exposure and protein complex modulation. We compared the proteomic alterations induced by low and high nicotine concentrations (0.01 mM and 1 mM) with the control (no nicotine) in vivo by using mass spectrometry (MS)-based techniques, specifically the cetyltrimethylammonium bromide (CTAB) discontinuous gel electrophoresis coupled with liquid chromatography (LC)-MS/MS and spectral counting. As a result, we identified dozens of C. elegans proteins that are present exclusively or in higher abundance in either nicotine-treated or untreated worms. Based on these results, we report a possible network that captures the key protein components of nicotine-induced protein complexes and speculate how the different protein modules relate to their distinct physiological roles. Using functional annotation of detected proteins, we hypothesize that the identified complexes can modulate the energy metabolism and level of oxidative stress. These proteins can also be involved in modulation of gene expression and may be crucial in Alzheimer's disease. The findings reported in our study reveal putative intracellular interactions of many proteins with the cytoskeleton and may contribute to the understanding of the mechanisms of nicotinic acetylcholine receptor (nAChR) signaling and trafficking in cells.


Assuntos
Proteínas de Caenorhabditis elegans/metabolismo , Caenorhabditis elegans/efeitos dos fármacos , Regulação da Expressão Gênica no Desenvolvimento/efeitos dos fármacos , Modelos Biológicos , Nicotina/toxicidade , Agonistas Nicotínicos/toxicidade , Estresse Oxidativo/efeitos dos fármacos , Animais , Caenorhabditis elegans/crescimento & desenvolvimento , Caenorhabditis elegans/metabolismo , Proteínas de Caenorhabditis elegans/química , Proteínas de Caenorhabditis elegans/genética , Biologia Computacional , Citoesqueleto/efeitos dos fármacos , Citoesqueleto/metabolismo , Metabolismo Energético/efeitos dos fármacos , Perfilação da Expressão Gênica , Redes Reguladoras de Genes/efeitos dos fármacos , Anotação de Sequência Molecular , Concentração Osmolar , Multimerização Proteica/efeitos dos fármacos , Transporte Proteico/efeitos dos fármacos , Proteômica/métodos , Receptores Nicotínicos/química , Receptores Nicotínicos/metabolismo , Transdução de Sinais/efeitos dos fármacos
9.
Bioinformatics ; 31(3): 332-9, 2015 Feb 01.
Artigo em Inglês | MEDLINE | ID: mdl-25304778

RESUMO

MOTIVATION: Argonaute-interacting WG/GW proteins are characterized by the presence of repeated sequence motifs containing glycine (G) and tryptophan (W). The motifs seem to be remarkably adaptive to amino acid substitutions and their sequences show non-contiguity. Our previous approach to the detection of GW domains, based on scoring their gross amino acid composition, allowed annotation of several novel proteins involved in gene silencing. The accumulation of new experimental data and more advanced applications revealed some deficiency of the algorithm in prediction selectivity. Additionally, W-motifs, though critical in gene regulation, have not yet been annotated in any available online resources. RESULTS: We present an improved set of computational tools allowing efficient management and annotation of W-based motifs involved in gene silencing. The new prediction algorithms provide novel functionalities by annotation of the W-containing domains at the local sequence motif level rather than by overall compositional properties. This approach represents a significant improvement over the previous method in terms of prediction sensitivity and selectivity. Application of the algorithm allowed annotation of a comprehensive list of putative Argonaute-interacting proteins across eukaryotes. An in-depth characterization of the domains' properties indicates its intrinsic disordered character. In addition, we created a knowledge-based portal (whub) that provides access to tools and information on RNAi-related tryptophan-containing motifs. AVAILABILITY AND IMPLEMENTATION: The web portal and tools are freely available at http://www.comgen.pl/whub. CONTACT: wmk@amu.edu.pl SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online.


Assuntos
Algoritmos , Motivos de Aminoácidos/genética , Proteínas Argonautas/química , Proteínas Argonautas/metabolismo , Glicina/química , Ligação Proteica/genética , Sequências Repetitivas de Aminoácidos/genética , Triptofano/química , Proteínas Argonautas/genética , Estrutura Terciária de Proteína , Software
10.
BMC Plant Biol ; 15: 144, 2015 Jun 16.
Artigo em Inglês | MEDLINE | ID: mdl-26141515

RESUMO

BACKGROUND: MicroRNAs are the key post-transcriptional regulators of gene expression in development and stress responses. Thus, precisely quantifying the level of each particular microRNA is of utmost importance when studying the biology of any organism. DESCRIPTION: The mirEX 2.0 web portal ( http://www.combio.pl/mirex ) provides a comprehensive platform for the exploration of microRNA expression data based on quantitative Real Time PCR and NGS sequencing experiments, covering various developmental stages, from wild-type to mutant plants. The portal includes mature and pri-miRNA expression levels detected in three plant species (Arabidopsis thaliana, Hordeum vulgare and Pellia endiviifolia), and in A. thaliana miRNA biogenesis pathway mutants. In total, the database contains information about the expression of 461 miRNAs representing 268 families. The data can be explored through the use of advanced web tools, including (i) a graphical query builder system allowing a combination of any given species, developmental stages and tissues, (ii) a modular presentation of the results in the form of thematic windows, and (iii) a number of user-friendly utilities such as a community-building discussion system and extensive tutorial documentation (e.g., tooltips, exemplary videos and presentations). All data contained within the mirEX 2.0 database can be downloaded for use in further applications in a context-based way from the result windows or from a dedicated web page. CONCLUSIONS: The mirEX 2.0 portal provides the plant research community with easily accessible data and powerful tools for application in multi-conditioned analyses of miRNA expression from important plant species in different biological and developmental backgrounds.


Assuntos
Arabidopsis/genética , Bases de Dados de Ácidos Nucleicos/organização & administração , Hepatófitas/genética , Hordeum/genética , Internet , MicroRNAs/genética , RNA de Plantas/genética , Arabidopsis/metabolismo , Perfilação da Expressão Gênica , Hepatófitas/metabolismo , Hordeum/metabolismo , MicroRNAs/metabolismo , RNA de Plantas/metabolismo , Reação em Cadeia da Polimerase em Tempo Real
11.
RNA Biol ; 12(7): 761-70, 2015.
Artigo em Inglês | MEDLINE | ID: mdl-26106978

RESUMO

The GW182 proteins are a key component of the miRNA-dependent post-transcriptional silencing pathway in animals. They function as scaffold proteins to mediate the interaction of Argonaute (AGO)-containing complexes with cytoplasmic poly(A)-binding proteins (PABP) and PAN2-PAN3 and CCR4-NOT deadenylases. The AGO-GW182 complexes mediate silencing of the target mRNA through induction of translational repression and/or mRNA degradation. Although the GW182 proteins are a subject of extensive experimental research in the recent years, very little is known about their origin and evolution. Here, based on complex functional annotation and phylogenetic analyses, we reveal 448 members of the GW182 protein family from the earliest animals to humans. Our results indicate that a single-copy GW182/TNRC6C progenitor gene arose with the emergence of multicellularity and it multiplied in the last common ancestor of vertebrates in 2 rounds of whole genome duplication (WGD) resulting in 3 genes. Before the divergence of vertebrates, both the AGO- and CCR4-NOT-binding regions of GW182s showed significant acceleration in the accumulation of amino acid changes, suggesting functional adaptation toward higher specificity to the molecules of the silencing complex. We conclude that the silencing ability of the GW182 proteins improves with higher position in the taxonomic classification and increasing complexity of the organism. The first reconstruction of the molecular journey of GW182 proteins from the ancestral metazoan protein to the current mammalian configuration provides new insight into development of the miRNA-dependent post-transcriptional silencing pathway in animals.


Assuntos
Autoantígenos/metabolismo , Evolução Molecular , Interferência de RNA , Proteínas de Ligação a RNA/metabolismo , Animais , Autoantígenos/genética , Humanos , Invertebrados/genética , Invertebrados/metabolismo , Proteínas de Ligação a RNA/genética , Vertebrados/genética , Vertebrados/metabolismo
12.
Plant Mol Biol Report ; 33: 84-101, 2015.
Artigo em Inglês | MEDLINE | ID: mdl-25620837

RESUMO

The narrow-leafed lupin (Lupinus angustifolius) was recently considered as a legume reference species. Genetic resources have been developed, including a draft genome sequence, linkage maps, nuclear DNA libraries, and cytogenetic chromosome-specific landmarks. Here, we used a complex approach, involving DNA fingerprinting, sequencing, genetic mapping, and molecular cytogenetics, to localize and analyze L. angustifolius gene-rich regions (GRRs). A L. angustifolius genomic bacterial artificial chromosome (BAC) library was screened with short sequence repeat (SSR)-based probes. Selected BACs were fingerprinted and assembled into contigs. BAC-end sequence (BES) annotation allowed us to choose clones for sequencing, targeting GRRs. Additionally, BESs were aligned to the scaffolds of the genome sequence. The genetic map was supplemented with 35 BES-derived markers, distributed in 14 linkage groups and tagging 37 scaffolds. The identified GRRs had an average gene density of 19.6 genes/100 kb and physical-to-genetic distance ratios of 11 to 109 kb/cM. Physical and genetic mapping was supported by multi-BAC-fluorescence in situ hybridization (FISH), and five new linkage groups were assigned to the chromosomes. Syntenic links to the genome sequences of five legume species (Medicago truncatula, Glycine max, Lotus japonicus, Phaseolus vulgaris, and Cajanus cajan) were identified. The comparative mapping of the two largest lupin GRRs provides novel evidence for ancient duplications in all of the studied species. These regions are conserved among representatives of the main clades of Papilionoideae. Furthermore, despite the complex evolution of legumes, some segments of the nuclear genome were not substantially modified and retained their quasi-ancestral structures. Cytogenetic markers anchored in these regions constitute a platform for heterologous mapping of legume genomes.

13.
Nucleic Acids Res ; 40(Database issue): D191-7, 2012 Jan.
Artigo em Inglês | MEDLINE | ID: mdl-22013167

RESUMO

mirEX is a comprehensive platform for comparative analysis of primary microRNA expression data. RT-qPCR-based gene expression profiles are stored in a universal and expandable database scheme and wrapped by an intuitive user-friendly interface. A new way of accessing gene expression data in mirEX includes a simple mouse operated querying system and dynamic graphs for data mining analyses. In contrast to other publicly available databases, the mirEX interface allows a simultaneous comparison of expression levels between various microRNA genes in diverse organs and developmental stages. Currently, mirEX integrates information about the expression profile of 190 Arabidopsis thaliana pri-miRNAs in seven different developmental stages: seeds, seedlings and various organs of mature plants. Additionally, by providing RNA structural models, publicly available deep sequencing results, experimental procedure details and careful selection of auxiliary data in the form of web links, mirEX can function as a one-stop solution for Arabidopsis microRNA information. A web-based mirEX interface can be accessed at http://bioinfo.amu.edu.pl/mirex.


Assuntos
Arabidopsis/genética , Bases de Dados de Ácidos Nucleicos , MicroRNAs/metabolismo , RNA de Plantas/metabolismo , Mineração de Dados , Perfilação da Expressão Gênica , Internet , Software , Interface Usuário-Computador
14.
Nat Commun ; 15(1): 495, 2024 Jan 23.
Artigo em Inglês | MEDLINE | ID: mdl-38263397

RESUMO

Bacteriophages are widely recognised as rapidly evolving biological entities. However, knowledge about ancient bacteriophages is limited. Here, we analyse DNA sequence datasets previously generated from ancient palaeofaeces and human gut-content samples, and identify an ancient phage genome nearly identical to present-day Mushuvirus mushu, a virus that infects gut commensal bacteria. The DNA damage patterns of the genome are consistent with its ancient origin and, despite 1300 years of evolution, the ancient Mushuvirus genome shares 97.7% nucleotide identity with its modern counterpart, indicating a long-term relationship between the prophage and its host. In addition, we reconstruct and authenticate 297 other phage genomes from the last 5300 years, including those belonging to unknown families. Our findings demonstrate the feasibility of reconstructing ancient phage genome sequences, thus expanding the known virosphere and offering insights into phage-bacteria interactions spanning several millennia.


Assuntos
Bacteriófagos , Humanos , Prófagos , Dano ao DNA , Conhecimento , Nucleotídeos
15.
iScience ; 26(8): 107309, 2023 Aug 18.
Artigo em Inglês | MEDLINE | ID: mdl-37539027

RESUMO

Studies in zebrafish can unravel the functions of cellular communication and thus identify novel bench-to-bedside drugs targeting cellular communication signaling molecules. Due to the incomplete annotation of zebrafish proteome, the knowledge of zebrafish receptors, ligands, and tools to explore their interactome is limited. To address this gap, we de novo predicted the cellular localization of zebrafish reference proteome using deep learning algorithm. We combined the predicted and existing annotations on cellular localization of zebrafish proteins and created repositories of zebrafish ligands, membrane receptome, and interactome as well as associated diseases and targeting drugs. Unlike other tools, our interactome atlas is based on both the physical interaction data of zebrafish proteome and existing human ligand-receptor pair databases. The resources are available as R and Python scripts. DanioTalk provides a novel resource for researchers interested in targeting cellular communication in zebrafish, as we demonstrate in applications studying synapse and axo-glial interactome. DanioTalk methodology can be applied to build and explore the ligand-receptor atlas of other non-mammalian model organisms.

16.
Genome Biol Evol ; 15(3)2023 03 03.
Artigo em Inglês | MEDLINE | ID: mdl-36790099

RESUMO

Taxonomically restricted genes (TRGs) are unique for a defined group of organisms and may act as potential genetic determinants of lineage-specific, biological properties. Here, we explore the TRGs of highly diverse and economically important Bacillus bacteria by examining commonly used TRG identification parameters and data sources. We show the significant effects of sequence similarity thresholds, composition, and the size of the reference database in the identification process. Subsequently, we applied stringent TRG search parameters and expanded the identification procedure by incorporating an analysis of noncoding and non-syntenic regions of non-Bacillus genomes. A multiplex annotation procedure minimized the number of false-positive TRG predictions and showed nearly one-third of the alleged TRGs could be mapped to genes missed in genome annotations. We traced the putative origin of TRGs by identifying homologous, noncoding genomic regions in non-Bacillus species and detected sequence changes that could transform these regions into protein-coding genes. In addition, our analysis indicated that Bacillus TRGs represent a specific group of genes mostly showing intermediate sequence properties between genes that are conserved across multiple taxa and nonannotated peptides encoded by open reading frames.


Assuntos
Bacillus , Bacillus/genética , Genoma , Genômica , Fases de Leitura Aberta
17.
Database (Oxford) ; 20232023 08 08.
Artigo em Inglês | MEDLINE | ID: mdl-37555549

RESUMO

The TRGdb database is a resource dedicated to taxonomically restricted genes (TRGs) in bacteria. It provides a comprehensive collection of genes that are specific to different genera and species, according to the latest release of bacterial taxonomy. The user interface allows for easy browsing and searching as well as sequence similarity exploration. The website also provides information on each TRG protein sequence, including its level of disorder, complexity and tendency to aggregate. TRGdb is a valuable resource for gaining a deeper understanding of the TRG-associated, unique features, and characteristics of bacterial organisms. Database URL www.combio.pl/trgdb.


Assuntos
Bactérias , Gerenciamento de Dados , Bactérias/genética , Bases de Dados Factuais , Sequência de Aminoácidos , Interface Usuário-Computador , Internet
18.
Protein Sci ; 32(6): e4647, 2023 06.
Artigo em Inglês | MEDLINE | ID: mdl-37095066

RESUMO

The Escherichia coli enzyme EcAIII catalyzes the hydrolysis of L-Asn to L-Asp and ammonia. Using a nature-inspired mutagenesis approach, we designed and produced five new EcAIII variants (M200I, M200L, M200K, M200T, M200W). The modified proteins were characterized by spectroscopic and crystallographic methods. All new variants were enzymatically active, confirming that the applied mutagenesis procedure has been successful. The determined crystal structures revealed new conformational states of the EcAIII molecule carrying the M200W mutation and allowed a high-resolution observation of an acyl-enzyme intermediate with the M200L mutant. In addition, we performed structure prediction, substrate docking, and molecular dynamics simulations for 25 selected bacterial orthologs of EcAIII, to gain insights into how mutations at the M200 residue affect the active site and substrate binding mode. This comprehensive strategy, including both experimental and computational methods, can be used to guide further enzyme engineering and can be applied to the study of other proteins of medicinal or biotechnological importance.


Assuntos
Asparaginase , Escherichia coli , Asparaginase/química , Escherichia coli/metabolismo , Substituição de Aminoácidos , Simulação de Dinâmica Molecular , Domínio Catalítico , Sítios de Ligação
19.
Bioinformatics ; 27(9): 1318-9, 2011 May 01.
Artigo em Inglês | MEDLINE | ID: mdl-21385787

RESUMO

MOTIVATION: AGO(Argonaute)-binding domains, composed of repeated motifs, in which only binary combinations of tryptophan and glycine are conserved, bind AGO proteins and are essential during RNAi-mediated gene silencing. The amino acid sequence of this domain is extremely divergent and therefore very difficult to detect. Commonly used bioinformatic tools fail to identify tryptophan-glycine and/or glycine-tryptophan motifs (WG/GW) domains and currently there is no publicly available software which can detect these weakly conserved, but functional AGO-binding segments. RESULTS: Recently, we have developed an algorithm based on compositional analysis of the amino acid content of the domain. We have demonstrated that the algorithm can be successfully applied for the identification of the new WG/GW proteins in the Arabidopsis genome. Here we introduce Agos (Argonaute-binding domain screener), a novel universal web service for de novo identification of WG/GW domains in protein sequences. The web implementation of the algorithm contains several new features and enhancements: (i) one universal scoring matrix which allows identification of AGO-binding proteins in sequences representing all organisms; (ii) reduction of false positive predictions by improved selectivity of the algorithm; (iii) graphical interface to easily browse the prediction results; and (iv) the option to submit a DNA sequence which will be automatically translated in six frames before running the prediction algorithm. AVAILABILITY: Freely available at: http://bioinfo.amu.edu.pl/agos/.


Assuntos
Algoritmos , Motivos de Aminoácidos/genética , Proteínas de Arabidopsis/genética , Arabidopsis/genética , Proteínas Argonautas/genética , Internet , Sequência de Aminoácidos , Proteínas de Arabidopsis/metabolismo , Proteínas Argonautas/metabolismo , Sítios de Ligação , Dados de Sequência Molecular , Ligação Proteica/genética , Estrutura Terciária de Proteína/genética , Software
20.
Nucleic Acids Res ; 38(13): 4231-45, 2010 Jul.
Artigo em Inglês | MEDLINE | ID: mdl-20338883

RESUMO

Domains in Arabidopsis proteins NRPE1 and SPT5-like, composed almost exclusively of repeated motifs in which only WG or GW sequences and an overall amino-acid preference are conserved, have been experimentally shown to bind multiple molecules of Argonaute (AGO) protein(s). Domain swapping between the WG/GW domains of NRPE1 and the human protein GW182 showed a conserved function. As classical sequence alignment methods are poorly-adapted to detect such weakly-conserved motifs, we have developed a tool to carry out a systematic analysis to identify genes potentially encoding AGO-binding GW/WG proteins. Here, we describe exhaustive analysis of the Arabidopsis genome for all regions potentially encoding proteins bearing WG/GW motifs and consider the possible role of some of them in AGO-dependent mechanisms. We identified 20 different candidate WG/GW genes, encoding proteins in which the predicted domains range from 92aa to 654aa. These mostly correspond to a limited number of families: RNA-binding proteins, transcription factors, glycine-rich proteins, translation initiation factors and known silencing-associated proteins such as SDE3. Recent studies have argued that the interaction between WG/GW-rich domains and AGO proteins is evolutionarily conserved. Here, we demonstrate by an in silico domain-swapping simulation between plant and mammalian WG/GW proteins that the biased amino-acid composition of the AGO-binding sites is conserved.


Assuntos
Proteínas de Arabidopsis/química , Arabidopsis/genética , Motivos de Aminoácidos , Sequência de Aminoácidos , Proteínas de Arabidopsis/genética , Proteínas de Arabidopsis/metabolismo , Sítios de Ligação , Sequência Conservada , Genoma de Planta , Genômica , Dados de Sequência Molecular , Família Multigênica , Estrutura Terciária de Proteína
SELEÇÃO DE REFERÊNCIAS
DETALHE DA PESQUISA