Your browser doesn't support javascript.
loading
Mostrar: 20 | 50 | 100
Resultados 1 - 20 de 26
Filtrar
1.
BMC Bioinformatics ; 17: 385, 2016 Sep 20.
Artigo em Inglês | MEDLINE | ID: mdl-27645252

RESUMO

BACKGROUND: Perfectly or highly conserved DNA elements were found in vertebrates, invertebrates, and plants by various methods. However, little is known about such elements in protists. The evolutionary distance between apicomplexans can be very high, in particular, due to the positive selection pressure on them. This complicates the identification of highly conserved elements in alveolates, which is overcome by the proposed algorithm. RESULTS: A novel algorithm is developed to identify highly conserved DNA elements. It is based on the identification of dense subgraphs in a specially built multipartite graph (whose parts correspond to genomes). Specifically, the algorithm does not rely on genome alignments, nor pre-identified perfectly conserved elements; instead, it performs a fast search for pairs of words (in different genomes) of maximum length with the difference below the specified edit distance. Such pair defines an edge whose weight equals the maximum (or total) length of words assigned to its ends. The graph composed of these edges is then compacted by merging some of its edges and vertices. The dense subgraphs are identified by a cellular automaton-like algorithm; each subgraph defines a cluster composed of similar inextensible words from different genomes. Almost all clusters are considered as predicted highly conserved elements. The algorithm is applied to the nuclear genomes of the superphylum Alveolata, and the corresponding phylogenetic tree is built and discussed. CONCLUSION: We proposed an algorithm for the identification of highly conserved elements. The multitude of identified elements was used to infer the phylogeny of Alveolata.


Assuntos
Algoritmos , Alveolados/classificação , Alveolados/genética , Sequência Conservada , Análise de Sequência de DNA/métodos , Sequência de Bases , DNA/química , Evolução Molecular , Genoma , Filogenia
2.
Biol Direct ; 18(1): 45, 2023 08 11.
Artigo em Inglês | MEDLINE | ID: mdl-37568147

RESUMO

BACKGROUND: It is generally accepted that most evolutionary transformations at the phenotype level are associated either with rearrangements of genomic regulatory elements, which control the activity of gene networks, or with changes in the amino acid contents of proteins. Recently, evidence has accumulated that significant evolutionary transformations could also be associated with the loss/emergence of whole genes. The targeted identification of such genes is a challenging problem for both bioinformatics and evo-devo research. RESULTS: To solve this problem we propose the WINEGRET method, named after the first letters of the title. Its main idea is to search for genes that satisfy two requirements: first, the desired genes were lost/emerged at the same evolutionary stage at which the phenotypic trait of interest was lost/emerged, and second, the expression of these genes changes significantly during the development of the trait of interest in the model organism. To verify the first requirement, we do not use existing databases of orthologs, but rely purely on gene homology and local synteny by using some novel quickly computable conditions. Genes satisfying the second requirement are found by deep RNA sequencing. As a proof of principle, we used our method to find genes absent in extant amniotes (reptiles, birds, mammals) but present in anamniotes (fish and amphibians), in which these genes are involved in the regeneration of large body appendages. As a result, 57 genes were identified. For three of them, c-c motif chemokine 4, eotaxin-like, and a previously unknown gene called here sod4, essential roles for tail regeneration were demonstrated. Noteworthy, we established that the latter gene belongs to a novel family of Cu/Zn-superoxide dismutases lost by amniotes, SOD4. CONCLUSIONS: We present a method for targeted identification of genes whose loss/emergence in evolution could be associated with the loss/emergence of a phenotypic trait of interest. In a proof-of-principle study, we identified genes absent in amniotes that participate in body appendage regeneration in anamniotes. Our method provides a wide range of opportunities for studying the relationship between the loss/emergence of phenotypic traits and the loss/emergence of specific genes in evolution.


Assuntos
Mamíferos , Animais
3.
J Genet Genomics ; 50(5): 330-340, 2023 05.
Artigo em Inglês | MEDLINE | ID: mdl-36414223

RESUMO

Multiple primary lung cancer (MPLC) is an increasingly prevalent subtype of lung cancer. According to recent genomic studies, the different lesions of a single MPLC patient exhibit functional similarities that may reflect evolutionary convergence. We perform whole-exome sequencing for a unique cohort of MPLC patients with multiple samples from each lesion found. Using our own and other relevant public data, evolutionary tree reconstruction reveals that cancer driver gene mutations occurred at the early trunk, indicating evolutionary contingency rather than adaptive convergence. Additionally, tumors from the same MPLC patient are as genetically diverse as those from different patients, while within-tumor genetic heterogeneity is significantly lower. Furthermore, the aberrant molecular functions enriched in mutated genes for a sample show a strong overlap with other samples from the same tumor, but not with samples from other tumors or other patients. Overall, there is no evidence of adaptive convergence during the evolution of MPLC. Most importantly, the similar between-tumor diversity and between-patient diversity suggest that personalized therapies may not adequately account for the genetic diversity among different tumors in an MPLC patient. To fully exploit the strategic value of precision medicine, targeted therapies should be designed and delivered on a per-lesion basis.


Assuntos
Neoplasias Pulmonares , Neoplasias Primárias Múltiplas , Humanos , Neoplasias Pulmonares/genética , Neoplasias Pulmonares/terapia , Neoplasias Pulmonares/patologia , Neoplasias Primárias Múltiplas/genética , Neoplasias Primárias Múltiplas/patologia , Neoplasias Primárias Múltiplas/cirurgia , Mutação
4.
Bioessays ; 31(7): 758-68, 2009 Jul.
Artigo em Inglês | MEDLINE | ID: mdl-19472368

RESUMO

For over a century, Haeckel's Gastraea theory remained a dominant theory to explain the origin of multicellular animals. According to this theory, the animal ancestor was a blastula-like colony of uniform cells that gradually evolved cell differentiation. Today, however, genes that typically control metazoan development, cell differentiation, cell-to-cell adhesion, and cell-to-matrix adhesion are found in various unicellular relatives of the Metazoa, which suggests the origin of the genetic programs of cell differentiation and adhesion in the root of the Opisthokonta. Multicellular stages occurring in the complex life cycles of opisthokont protists (mesomycetozoeans and choanoflagellates) never resemble a blastula. Here, we discuss a more realistic scenario of transition to multicellularity through integration of pre-existing transient cell types into the body of an early metazoon, which possessed a complex life cycle with a differentiated sedentary filter-feeding trophic stage and a non-feeding blastula-like larva, the synzoospore. Choanoflagellates are considered as forms with secondarily simplified life cycles.


Assuntos
Evolução Biológica , Diferenciação Celular , Animais , Genes , Estágios do Ciclo de Vida , Modelos Biológicos , Fatores de Tempo
5.
RNA ; 14(4): 717-35, 2008 Apr.
Artigo em Inglês | MEDLINE | ID: mdl-18359782

RESUMO

T-box antitermination is one of the main mechanisms of regulation of genes involved in amino acid metabolism in Gram-positive bacteria. T-box regulatory sites consist of conserved sequence and RNA secondary structure elements. Using a set of known T-box sites, we constructed the common pattern and used it to scan available bacterial genomes. New T-boxes were found in various Gram-positive bacteria, some Gram-negative bacteria (delta-proteobacteria), and some other bacterial groups (Deinococcales/Thermales, Chloroflexi, Dictyoglomi). The majority of T-box-regulated genes encode aminoacyl-tRNA synthetases. Two other groups of T-box-regulated genes are amino acid biosynthetic genes and transporters, as well as genes with unknown function. Analysis of candidate T-box sites resulted in new functional annotations. We assigned the amino acid specificity to a large number of candidate amino acid transporters and a possible function to amino acid biosynthesis genes. We then studied the evolution of the T-boxes. Analysis of the constructed phylogenetic trees demonstrated that in addition to the normal evolution consistent with the evolution of regulated genes, T-boxes may be duplicated, transferred to other genes, and change specificity. We observed several cases of recent T-box regulon expansion following the loss of a previously existing regulatory system, in particular, arginine regulon in Clostridium difficile and methionine regulon in Lactobacillaceae. Finally, we described a new structural class of T-boxes containing duplicated terminator-antiterminator elements and unusual reduced T-boxes regulating initiation of translation in the Actinobacteria.


Assuntos
Bactérias/genética , Bactérias/metabolismo , Proteínas de Bactérias/genética , Proteínas de Bactérias/metabolismo , Proteínas com Domínio T/genética , Proteínas com Domínio T/metabolismo , Regiões 5' não Traduzidas , Sistemas de Transporte de Aminoácidos/genética , Sistemas de Transporte de Aminoácidos/metabolismo , Aminoácidos/metabolismo , Sequência de Bases , DNA Bacteriano/genética , Evolução Molecular , Regulação Bacteriana da Expressão Gênica , Genoma Bacteriano , Genômica , Modelos Biológicos , Modelos Moleculares , Dados de Sequência Molecular , Conformação de Ácido Nucleico , Filogenia , RNA Bacteriano/química , RNA Bacteriano/genética , RNA Mensageiro/química , RNA Mensageiro/genética , Regulon , Homologia de Sequência do Ácido Nucleico
6.
In Silico Biol ; 10(3): 163-83, 2010.
Artigo em Inglês | MEDLINE | ID: mdl-22430290

RESUMO

Unlike evolution of genes and proteins, evolution of regulatory systems is a relatively new area of research. In particular, little systematic study has been done on evolution of DNA binding motifs in transcription factor families. We suggest an algorithm that reconstructs the most parsimonious scenario for changes in DNA binding motifs along an evolutionary tree of transcription factor binding sites. The algorithm was validated on several artificial datasets and then applied to reconstruct the evolutionary history of the NrdR, MntR, LacI, FNR, Irr, Fur and Rrf2 transcription factor families. The algorithm seems to be sufficiently robust to be applicable in realistic situations. In most transcription factor families the changes in binding motifs are limited to several branches. Changes in consensus nucleotides proceed via an intermediate stage when the respective position is not conserved.


Assuntos
Proteínas de Bactérias/genética , Evolução Molecular , Modelos Genéticos , Fatores de Transcrição/genética , Algoritmos , Bactérias/genética , Sequência de Bases , Sítios de Ligação , Sequência Consenso , Filogenia , Elementos Reguladores de Transcrição
7.
Biomed Res Int ; 2020: 3465380, 2020.
Artigo em Inglês | MEDLINE | ID: mdl-32025518

RESUMO

The lengths of intergenic regions between neighboring genes that are convergent, divergent, or unidirectional were calculated for plastids of the rhodophytic branch and complete archaeal and bacterial genomes. Statistically significant linear relationships between any pair of the medians of these three length types have been revealed in each genomic group. Exponential relationships between the optimal growth temperature and each of the three medians have been revealed as well. The leading coefficients of the regression equations relating all pairs of the medians as well as temperature and any of the medians have the same sign and order of magnitude. The results obtained for plastids, archaea, and bacteria are also similar at the qualitative level. For instance, the medians are always low at high temperatures. At low temperatures, the medians tend to statistically significant greater values and scattering. The original model was used to test our hypothesis that the intergenic distances are optimized in particular to decrease the competition of RNA polymerases within the locus that results in transcribing shortened RNAs. Overall, this points to an effect of temperature for both remote and close genomes.


Assuntos
Archaea/crescimento & desenvolvimento , Archaea/genética , Bactérias/crescimento & desenvolvimento , Bactérias/genética , Plastídeos/genética , Plastídeos/fisiologia , Temperatura , Archaea/metabolismo , Proteínas Arqueais/genética , Bactérias/metabolismo , Proteínas de Bactérias/genética , Técnicas de Cultura de Células , Proteínas de Cloroplastos/genética , Evolução Molecular , Genoma Arqueal , Genoma Bacteriano , Genomas de Plastídeos , Modelos Lineares , Plastídeos/metabolismo
8.
Life (Basel) ; 10(9)2020 Sep 10.
Artigo em Inglês | MEDLINE | ID: mdl-32927891

RESUMO

An original bioinformatics technique is developed to identify the protein-coding genes in rodents, lagomorphs and nonhuman primates that are pseudogenized in humans. The method is based on per-gene verification of local synteny, similarity of exon-intronic structures and orthology in a set of genomes. It is applicable to any genome set, even with the number of genomes exceeding 100, and efficiently implemented using fast computer software. Only 50 evolutionary recent human pseudogenes were predicted. Their functional homologs in model species are often associated with the immune system or digestion and mainly express in the testes. According to current evidence, knockout of most of these genes leads to an abnormal phenotype. Some genes were pseudogenized or lost independently in human and nonhuman hominoids.

9.
BioData Min ; 12: 20, 2019.
Artigo em Inglês | MEDLINE | ID: mdl-31728160

RESUMO

BACKGROUND: Gerontogenes include those that modulate life expectancy in various species and may be the actual longevity genes. We believe that a long (relative to body weight) lifespan in individual rodent and primate species can be due, among other things, to the loss of particular genes that are present in short-lived species of the same orders. These genes can also explain the widely different rates of aging among diverse species as well as why similarly sized rodents or primates sometimes have anomalous life expectancies (e.g., naked mole-rats and humans). Here, we consider the gene loss in the context of the prediction of Williams' theory that concerns the reallocation of physiological resources of an organism between active reproduction (r-strategy) and self-maintenance (K-strategy). We have identified such lost genes using an original computer-aided approach; the software considers the loss of a gene as disruptions in gene orthology, local gene synteny or both. RESULTS: A method and software identifying the genes that are absent from a predefined set of species but present in another predefined set of species are suggested. Examples of such pairs of sets include long-lived vs short-lived, homeothermic vs poikilothermic, amniotic vs anamniotic, aquatic vs terrestrial, and neotenic vs nonneotenic species, among others. Species are included in one of two sets according to the property of interest, such as longevity or homeothermy. The program is universal towards these pairs, i.e., towards the underlying property, although the sets should include species with quality genome assemblies. Here, the proposed method was applied to study the longevity of Euarchontoglires species. It largely predicted genes that are highly expressed in the testis, epididymis, uterus, mammary glands, and the vomeronasal and other reproduction-related organs. This agrees with Williams' theory that hypothesizes a species transition from r-strategy to K-strategy. For instance, the method predicts the mouse gene Smpd5, which has an expression level 20 times greater in the testis than in organs unrelated to reproduction as experimentally demonstrated elsewhere. At the same time, its paralog Smpd3 is not predicted by the program and is widely expressed in many organs not specifically related to reproduction. CONCLUSIONS: The method and program, which were applied here to screen for gene losses that can accompany increased lifespan, were also applied to study reduced regenerative capacity and development of the telencephalon, neoteny, etc. Some of these results have been carefully tested experimentally. Therefore, we assume that the method is widely applicable.

10.
Oxid Med Cell Longev ; 2019: 2901057, 2019.
Artigo em Inglês | MEDLINE | ID: mdl-31781330

RESUMO

Cardiolipin interacts with many proteins of the mitochondrial inner membrane and, together with cytochrome C and creatine kinase, activates them. It can be considered as an integrating factor for components of the mitochondrial respiratory chain, which provides for an efficient transfer of electrons and protons. The major, if not the only, factor of cardiolipin maturation is tafazzin. Variations of isoform proportions of this enzyme can cause severe diseases such as Barth syndrome. Using bioinformatic methods, we have found conserved C-terminal regions in many tafazzin isoforms and identified new mammalian species that acquired exon 5 as well as rare occasions of intron retention between exons 8 and 9. The regions in the C-terminal part arise from frameshifts relative to the full-length TAZ transcript after skipping exon 9 or retention of the intron between exons 10 and 11. These modifications demonstrate specific distribution among the orders of mammals. The dependence of the species maximum lifespan, body weight, and mitochondrial metabolic rate on the modifications has been demonstrated. Arguably, unconventional tafazzin isoforms provide for the optimal balance between the increased biochemical activity of mitochondria (resulting from specific environmental or nutritional conditions) and lifespan maintenance; and the functional role of such isoforms is linked to the modification of the primary and secondary structures at their C-termini.


Assuntos
Síndrome de Barth/metabolismo , Cardiolipinas/metabolismo , Mitocôndrias/metabolismo , Membranas Mitocondriais/metabolismo , Fatores de Transcrição/metabolismo , Aciltransferases , Animais , Síndrome de Barth/genética , Síndrome de Barth/patologia , Cardiolipinas/genética , Transporte de Elétrons/genética , Humanos , Mitocôndrias/genética , Isoformas de Proteínas/genética , Isoformas de Proteínas/metabolismo , Fatores de Transcrição/genética
11.
Front Genet ; 10: 443, 2019.
Artigo em Inglês | MEDLINE | ID: mdl-31178892

RESUMO

Two enigmatic groups of morphologically simple parasites of invertebrates, the Dicyemida (syn. Rhombozoa) and the Orthonectida, since the 19th century have been usually considered as two classes of the phylum Mesozoa. Early molecular evidence suggested their relationship within the Spiralia (=Lophotrochozoa), however, high rates of dicyemid and orthonectid sequence evolution led to contradicting phylogeny reconstructions. Genomic data for orthonectids revealed that they are highly simplified spiralians and possess a reduced set of genes involved in metazoan development and body patterning. Acquiring genomic data for dicyemids, however, remains a challenge due to complex genome rearrangements including chromatin diminution and generation of extrachromosomal circular DNAs, which are reported to occur during the development of somatic cells. We performed genomic sequencing of one species of Dicyema, and obtained transcriptomic data for two Dicyema spp. Homeodomain (homeobox) transcription factors, G-protein-coupled receptors, and many other protein families have undergone a massive reduction in dicyemids compared to other animals. There is also apparent reduction of the bilaterian gene complements encoding components of the neuromuscular systems. We constructed and analyzed a large dataset of predicted orthologous proteins from three species of Dicyema and a set of spiralian animals including the newly sequenced genome of the orthonectid Intoshia linei. Bayesian analyses recovered the orthonectid lineage within the Annelida. In contrast, dicyemids form a separate clade with weak affinity to the Rouphozoa (Platyhelminthes plus Gastrotricha) or (Entoprocta plus Cycliophora) suggesting that the historically proposed Mesozoa is a polyphyletic taxon. Thus, dramatic simplification of body plans in dicyemids and orthonectids, as well as their intricate life cycles that combine metagenesis and heterogony, evolved independently in these two lineages.

12.
Cell Rep ; 29(4): 1027-1040.e6, 2019 10 22.
Artigo em Inglês | MEDLINE | ID: mdl-31644900

RESUMO

The molecular basis of higher regenerative capacity of cold-blooded animals comparing to warm-blooded ones is poorly understood. Although this difference in regenerative capacities is commonly thought to be a result of restructuring of the same regulatory gene network, we hypothesized that it may be due to loss of some genes essential for regeneration. We describe here a bioinformatic method that allowed us to identify such genes. For investigation in depth we selected one of them encoding transmembrane protein, named "c-Answer." Using the Xenopus laevis frog as a model cold-blooded animal, we established that c-Answer regulates regeneration of body appendages and telencephalic development through binding to fibroblast growth factor receptors (FGFRs) and P2ry1 receptors and promoting MAPK/ERK and purinergic signaling. This suggests that elimination of c-answer in warm-blooded animals could lead to decreased activity of at least two signaling pathways, which in turn might contribute to changes in mechanisms regulating regeneration and telencephalic development.


Assuntos
Encéfalo/metabolismo , Regulação da Expressão Gênica no Desenvolvimento , Neurogênese , Regeneração , Animais , Encéfalo/crescimento & desenvolvimento , Encéfalo/fisiologia , Biologia Computacional , Sistema de Sinalização das MAP Quinases , Receptores de Fatores de Crescimento de Fibroblastos/genética , Receptores de Fatores de Crescimento de Fibroblastos/metabolismo , Receptores Purinérgicos P2Y1/genética , Receptores Purinérgicos P2Y1/metabolismo , Xenopus laevis
13.
J Bioinform Comput Biol ; 5(1): 155-80, 2007 Feb.
Artigo em Inglês | MEDLINE | ID: mdl-17477496

RESUMO

A model is proposed primarily for the classical RNA attenuation regulation of gene expression through premature transcription termination. The model is based on the concept of the RNA secondary structure macrostate within the regulatory region between the ribosome and RNA-polymerase, on hypothetical equation describing deceleration of RNA-polymerase by a macrostate and on views of transcription and translation initiation and elongation, under different values of the four basic model parameters which were varied. A special effort was made to select adequate model parameters. We first discuss kinetics of RNA folding and define the concept of the macrostate as a specific parentheses structure used to construct a conventional set of hairpins. The originally developed software that realizes the proposed model offers functionality to fully model RNA secondary folding kinetics. Its performance is compared to that of a public server described in Ref. 1. We then describe the delay in RNA-polymerase shifting to the next base or its premature termination caused by an RNA secondary structure or, herefrom, a macrostate. In this description, essential concepts are the basic and excited states of the polymerase first introduced in Ref. 2: the polymerase shifting to the next base can occur only in the basic state, and its detachment from DNA strand - only in excited state. As to the authors' knowledge, such a model incorporating the above-mentioned attenuation characteristics is not published elsewhere. The model was implemented in an application with command line interface for running in batch mode in Windows and Linux environments, as well as a public web server.(3) The model was tested with a conventional Monte Carlo procedure. In these simulations, the estimate of correlation between the premature transcription termination probability p and concentration c of charged amino acyl-tRNA was obtained as function p(c) for many regulatory regions in many bacterial genomes, as well as for local mutations in these regions.


Assuntos
Fenômenos Fisiológicos Bacterianos , Proteínas de Bactérias/fisiologia , Regulação Bacteriana da Expressão Gênica/fisiologia , Biossíntese de Proteínas/fisiologia , RNA Mensageiro/genética , Ativação Transcricional/fisiologia , Simulação por Computador , Modelos Genéticos
14.
Life (Basel) ; 7(1)2017 Feb 27.
Artigo em Inglês | MEDLINE | ID: mdl-28264444

RESUMO

Recent phylogenetic analyses are incorporating ultraconserved elements (UCEs) and highly conserved elements (HCEs). Models of evolution of the genome structure and HCEs initially faced considerable algorithmic challenges, which gave rise to (often unnatural) constraints on these models, even for conceptually simple tasks such as the calculation of distance between two structures or the identification of UCEs. In our recent works, these constraints have been addressed with fast and efficient solutions with no constraints on the underlying models. These approaches have led us to an unexpected result: for some organelles and taxa, the genome structure and HCE set, despite themselves containing relatively little information, still adequately resolve the evolution of species. We also used the HCE identification to search for promoters and regulatory elements that characterize the functional evolution of the genome.

15.
Biol Direct ; 11(1): 20, 2016 Apr 16.
Artigo em Inglês | MEDLINE | ID: mdl-27084079

RESUMO

Short leader genes usually do not encode stable proteins, although their importance in expression control of bacterial genomes is widely accepted. Such genes are often involved in the control of attenuation regulation. However, the abundance of leader genes suggests that their role in bacteria is not limited to regulation. Specifically, we hypothesize that leader genes increase the expression of protein-coding (structural) genes via ribosome reinitiation at the leader peptide in the case of a short distance between the stop codon of the leader gene and the start codon of the structural gene. For instance, in Actinobacteria, the frequency of leader genes at a distance of 10-11 bp is about 70 % higher than the mean frequency within the 1 to 65 bp range; and it gradually decreases as the range grows longer. A pronounced peak of this frequency-distance relationship is also observed in Proteobacteria, Bacteroidetes, Spirochaetales, Acidobacteria, the Deinococcus-Thermus group, and Planctomycetes. In contrast, this peak falls to the distance of 15-16 bp and is not very pronounced in Firmicutes; and no such peak is observed in cyanobacteria and tenericutes. Generally, this peak is typical for many bacteria. Some leader genes located close to a structural gene probably play a regulatory role as well.


Assuntos
Proteínas de Bactérias/metabolismo , Sinais Direcionadores de Proteínas/fisiologia , Ribossomos/metabolismo , Acidobacteria/metabolismo , Actinobacteria/metabolismo , Bacteroidetes/metabolismo , Spirochaetales/metabolismo
16.
BMC Microbiol ; 5: 54, 2005 Oct 03.
Artigo em Inglês | MEDLINE | ID: mdl-16202131

RESUMO

BACKGROUND: Formation of alternative structures in mRNA in response to external stimuli, either direct or mediated by proteins or other RNAs, is a major mechanism of regulation of gene expression in bacteria. This mechanism has been studied in detail using experimental and computational approaches in proteobacteria and Firmicutes, but not in other groups of bacteria. RESULTS: Comparative analysis of amino acid biosynthesis operons in Actinobacteria resulted in identification of conserved regions upstream of several operons. Classical attenuators were predicted upstream of trp operons in Corynebacterium spp. and Streptomyces spp., and trpS and leuS genes in some Streptomyces spp. Candidate leader peptides with terminators were observed upstream of ilvB genes in Corynebacterium spp., Mycobacterium spp. and Streptomyces spp. Candidate leader peptides without obvious terminators were found upstream of cys operons in Mycobacterium spp. and several other species. A conserved pseudoknot (named LEU element) was identified upstream of leuA operons in most Actinobacteria. Finally, T-boxes likely involved in the regulation of translation initiation were observed upstream of ileS genes from several Actinobacteria. CONCLUSION: The metabolism of tryptophan, cysteine and leucine in Actinobacteria seems to be regulated on the RNA level. In some cases the mechanism is classical attenuation, but in many cases some components of attenuators are missing. The most interesting case seems to be the leuA operon preceded by the LEU element that may fold into a conserved pseudoknot or an alternative structure. A LEU element has been observed in a transposase gene from Bifidobacterium longum, but it is not conserved in genes encoding closely related transposases despite a very high level of protein similarity. One possibility is that the regulatory region of the leuA has been co-opted from some element involved in transposition. Analysis of phylogenetic patterns allowed for identification of ML1624 of M. leprae and its orthologs as the candidate regulatory proteins that may bind to the LEU element. T-boxes upstream of the ileS genes are unusual, as their regulatory mechanism seems to be inhibition of translation initiation via a hairpin sequestering the Shine-Dalgarno box.


Assuntos
Actinobacteria/genética , Actinobacteria/metabolismo , Aminoácidos/metabolismo , RNA Bacteriano/genética , Regiões 5' não Traduzidas , Regulação Bacteriana da Expressão Gênica , Leucina/metabolismo , Óperon , RNA Mensageiro/genética
17.
Biomed Res Int ; 2015: 510598, 2015.
Artigo em Inglês | MEDLINE | ID: mdl-26114108

RESUMO

We report the database of plastid protein families from red algae, secondary and tertiary rhodophyte-derived plastids, and Apicomplexa constructed with the novel method to infer orthology. The families contain proteins with maximal sequence similarity and minimal paralogous content. The database contains 6509 protein entries, 513 families and 278 nonsingletons (from which 230 are paralog-free, and among the remaining 48, 46 contain at maximum two proteins per species, and 2 contain at maximum three proteins per species). The method is compared with other approaches. Expression regulation of the moeB gene is studied using this database and the model of RNA polymerase competition. An analogous database obtained for green algae and their symbiotic descendants, and applications based on it are published earlier.


Assuntos
Evolução Biológica , Proteínas de Cloroplastos/genética , Rodófitas/genética , Simbiose/genética , Apicomplexa/genética , Proteínas de Bactérias/biossíntese , Proteínas de Bactérias/genética , Proteínas de Cloroplastos/biossíntese , Bases de Dados de Proteínas , Regulação da Expressão Gênica , Filogenia , Plastídeos/genética
18.
FEMS Microbiol Lett ; 234(2): 357-70, 2004 May 15.
Artigo em Inglês | MEDLINE | ID: mdl-15135544

RESUMO

Candidate attenuators were identified that regulate operons responsible for biosynthesis of branched amino acids, histidine, threonine, tryptophan, and phenylalanine in gamma- and alpha-proteobacteria, and in some cases in low-GC Gram-positive bacteria, Thermotogales and Bacteroidetes/Chlorobi. This allowed us not only to describe the evolutionary dynamics of regulation by attenuation of transcription, but also to annotate a number of hypothetical genes. In particular, orthologs of ygeA of Escherichia coli were assigned the branched chain amino acid racemase function. Three new families of histidine transporters were predicted, orthologs of yuiF and yvsH of Bacillus subtilis, and lysQ of Lactococcus lactis. In Pasteurellales, the single bifunctional aspartate kinase/homoserine dehydrogenase gene thrA was predicted to be regulated not only by threonine and isoleucine, as in E. coli, but also by methionine. In alpha-proteobacteria, the single acetolactate synthase operon ilvIH was predicted to be regulated by branched amino acids-dependent attenuators. Histidine biosynthetic operons his were predicted to be regulated by histidine-dependent attenuators in Bacillus cereus and Clostridium difficile, and by histidine T-boxes in L. lactis and Streptococcus mutans.


Assuntos
Alphaproteobacteria/genética , Aminoácidos/biossíntese , Aminoácidos/genética , Betaproteobacteria/genética , Gammaproteobacteria/genética , Óperon , Alphaproteobacteria/classificação , Sequência de Bases , Betaproteobacteria/classificação , Gammaproteobacteria/classificação , Genoma Bacteriano , Genômica , Dados de Sequência Molecular , Alinhamento de Sequência , Homologia de Sequência do Ácido Nucleico , Especificidade da Espécie , Transcrição Gênica/genética
19.
Comput Biol Chem ; 49: 7-13, 2014 Apr.
Artigo em Inglês | MEDLINE | ID: mdl-24513779

RESUMO

The paper studies proteins with domains PF00480 or PF14340, as well as some other poorly characterized proteins, encoded by genes associated with leader peptide genes containing a tract of cysteine codons. Such proteins are hypothetically regulated with cysteine-dependent transcription attenuation, namely the Rho-dependent or classic transcription attenuation. Cysteine is an important structural amino acid in various proteins and is required for synthesis of many sulfur-containing compounds, such as methionine, thiamine, glutathione, taurine and the lipoic acid. Earlier a few species of mycobacteria were predicted by the authors to have cysteine-dependent regulation of operons containing the cysK gene. In Escherichia coli this regulation is absent, and the same operon is regulated by the CysB transcription activator. The paper also studies Rho-dependent and classic transcription regulations in all annotated genes of mycobacteria available in GenBank and their orthologs in Actinomycetales. We predict regulations for many genes involved in sulfur metabolism and transport of sulfur-containing compounds; these regulations differ considerably among species. On the basis of predictions, we assign a putative role to proteins encoded by the regulated genes with unknown function, and also describe the structure of corresponding regulons, predict the lack of such regulations for many genes. Thus, all proteins with the uncharacterized Pfam domains PF14340 and PF00480, as well as some others, are predicted to be involved in sulfur metabolism. We also surmise the affinity of some transporters to sulfur-containing compounds. The obtained results considerably extend earlier large-scale studies of Rho-dependent and classic transcription attenuations.


Assuntos
Proteínas de Bactérias/genética , Proteínas de Bactérias/metabolismo , Regulação Bacteriana da Expressão Gênica/genética , Enxofre/metabolismo , Actinomycetales/química , Algoritmos , Sequência de Aminoácidos , Proteínas de Bactérias/química , Mycobacteriaceae/química , Estrutura Terciária de Proteína , Alinhamento de Sequência
20.
Biomed Res Int ; 2013: 413450, 2013.
Artigo em Inglês | MEDLINE | ID: mdl-24073405

RESUMO

This study considers transcription regulation of plastid genes involved in sulfate transport in the parasites of invertebrate (Helicosporidium sp.) and other species of the Viridiplantae. A one-box conserved motif with the consensus TAAWATGATT is found near promoters upstream the cysT and cysA genes in many species. In certain cases, the motif is repeated two or three times.


Assuntos
Regulação da Expressão Gênica de Plantas , Genes de Plantas/genética , Plastídeos/genética , Sulfatos/metabolismo , Transcrição Gênica , Viridiplantae/genética , Viridiplantae/metabolismo , Sequência de Aminoácidos , Sequência de Bases , Transporte Biológico/genética , Sequência Conservada/genética , Dados de Sequência Molecular , Motivos de Nucleotídeos/genética , Filogenia , Proteínas de Plantas/química , Proteínas de Plantas/genética , Proteínas de Plantas/metabolismo , Regiões Promotoras Genéticas/genética , Estrutura Terciária de Proteína , Alinhamento de Sequência , Synechocystis/genética , Synechocystis/metabolismo
SELEÇÃO DE REFERÊNCIAS
Detalhe da pesquisa