ABSTRACT
The triplet nature of the genetic code is considered a universal feature of known organisms. However, frequent stop codons at internal mRNA positions in Euplotes ciliates ultimately specify ribosomal frameshifting by one or two nucleotides depending on the context, thus posing a nontriplet feature of the genetic code of these organisms. Here, we sequenced transcriptomes of eight Euplotes species and assessed evolutionary patterns arising at frameshift sites. We show that frameshift sites are currently accumulating more rapidly by genetic drift than they are removed by weak selection. The time needed to reach the mutational equilibrium is several times longer than the age of Euplotes and is expected to occur after a several-fold increase in the frequency of frameshift sites. This suggests that Euplotes are at an early stage of the spread of frameshifting in expression of their genome. In addition, we find the net fitness burden of frameshift sites to be noncritical for the survival of Euplotes. Our results suggest that fundamental genome-wide changes such as a violation of the triplet character of genetic code can be introduced and maintained solely by neutral evolution.
Subjects
Ciliophora, Euplotes, Euplotes/genetics, Euplotes/metabolism, Genetic Code, Base Sequence, Terminator Codon/genetics, Terminator Codon/metabolism, Ciliophora/genetics, Genetic Drift
ABSTRACT
Chromatin interaction assays, particularly Hi-C, enable detailed studies of genome architecture in multiple organisms and model systems, resulting in a deeper understanding of gene expression regulation mechanisms mediated by epigenetics. However, the analysis and interpretation of Hi-C data remain challenging due to technical biases, limiting direct comparisons of datasets obtained in different experiments and laboratories. As a result, removing biases from Hi-C-generated chromatin contact matrices is a critical data analysis step. Our novel approach, HiConfidence, eliminates biases from the Hi-C data by weighting chromatin contacts according to their consistency between replicates so that low-quality replicates do not substantially influence the result. The algorithm is effective for the analysis of global changes in chromatin structures such as compartments and topologically associating domains. We apply the HiConfidence approach to several Hi-C datasets with significant technical biases that could not be analyzed effectively using existing methods, and obtain meaningful biological conclusions. In particular, HiConfidence aids in the study of how changes in histone acetylation pattern affect chromatin organization in Drosophila melanogaster S2 cells. The method is freely available at GitHub: https://github.com/victorykobets/HiConfidence.
Subjects
Drosophila melanogaster, Genome, Animals, Drosophila melanogaster/genetics, Chromatin/genetics, Chromosomes, Bias
ABSTRACT
Eukaryotic chromosomes are spatially segregated into topologically associating domains (TADs). Some TADs are attached to the nuclear lamina (NL) through lamina-associated domains (LADs). Here, we identified LADs and TADs at two stages of Drosophila spermatogenesis: in bamΔ86 mutant testes, which are a commonly used model of spermatogonia (SpG), and in larval testes, which are mainly filled with spermatocytes (SpCs). We found that initiation of SpC-specific transcription correlates with promoters' detachment from the NL and with local spatial insulation of adjacent regions. However, this insulation does not result in the partitioning of inactive TADs into sub-TADs. We also revealed an increased contact frequency between SpC-specific genes in SpCs, implying their de novo gathering into transcription factories. In addition, we uncovered the specific X chromosome organization in the male germline. In SpG and SpCs, a single X chromosome is more strongly associated with the NL than autosomes are. Nevertheless, active chromatin regions in the X chromosome interact with each other more frequently than those in autosomes. Moreover, despite the absence of the dosage compensation complex in the male germline, a randomly inserted SpG-specific reporter is expressed at a higher level in the X chromosome than in autosomes, indicating that non-canonical dosage compensation operates in SpG.
Subjects
Chromatin, Drosophila, Animals, Cell Differentiation/genetics, Chromatin/genetics, Genetic Dosage Compensation, Drosophila/genetics, Germ Cells, Male
ABSTRACT
Over the past decade, genome-wide assays for chromatin interactions in single cells have enabled the study of individual nuclei at unprecedented resolution and throughput. Current chromosome conformation capture techniques survey contacts for up to tens of thousands of individual cells, improving our understanding of genome function in 3D. However, these methods recover a small fraction of all contacts in single cells, requiring specialised processing of sparse interactome data. In this review, we highlight recent advances in methods for the interpretation of single-cell genomic contacts. After discussing the strengths and limitations of these methods, we outline frontiers for future development in this rapidly moving field.
Subjects
Chromosomes, Computational Biology/methods, Molecular Models, Molecular Conformation, Single-Cell Analysis/methods, Chromatin/chemistry, Chromatin/genetics, Chromosome Mapping/methods, Computer Graphics, DNA/chemistry, DNA/genetics, Data Analysis, Genomics/methods, Humans, Workflow
ABSTRACT
BACKGROUND: The histidine metabolism and transport (his) genes are controlled by a variety of RNA-dependent regulatory systems among diverse taxonomic groups of bacteria, including T-box riboswitches in Firmicutes and Actinobacteria and RNA attenuators in Proteobacteria. Using a comparative genomic approach, we previously identified a novel DNA-binding transcription factor (named HisR) that controls the histidine metabolism genes in diverse Gram-positive bacteria from the Firmicutes phylum. RESULTS: Here we report the identification of HisR-binding sites within the regulatory regions of the histidine metabolism and transport genes in 395 genomes representing the Bacilli, Clostridia, Negativicutes, and Tissierellia classes of Firmicutes, as well as in 97 other HisR-encoding genomes from the Actinobacteria, Proteobacteria, and Synergistetes phyla. HisR belongs to the TrpR family of transcription factors, and their predicted DNA binding motifs have a similar 20-bp palindromic structure but distinct lineage-specific consensus sequences. The predicted HisR-binding motif was validated in vitro using DNA binding assays with purified protein from the human gut bacterium Ruminococcus gnavus. To fill a knowledge gap in the regulation of histidine metabolism genes in Firmicutes genomes that lack a hisR repressor gene, we systematically searched their upstream regions for potential RNA regulatory elements. As a result, we identified 158 T-box riboswitches preceding the histidine biosynthesis and/or transport genes in 129 Firmicutes genomes. Finally, novel candidate RNA attenuators were identified upstream of the histidine biosynthesis operons in six species from the Bacillus cereus group, as well as in five Eubacteriales and six Erysipelotrichales species.
CONCLUSIONS: The obtained distribution of the HisR transcription factor and two RNA-mediated regulatory mechanisms for histidine metabolism genes across over 600 species of Firmicutes is discussed from functional and evolutionary points of view.
Subjects
Actinobacteria, Riboswitch, Actinobacteria/genetics, Bacteria/genetics, DNA/metabolism, Bacterial Gene Expression Regulation, Gram-Positive Bacteria/genetics, Gram-Positive Bacteria/metabolism, Histidine/genetics, Histidine/metabolism, Humans, Phylogeny, Riboswitch/genetics, Transcription Factors/genetics, Transcription Factors/metabolism
ABSTRACT
The ribosome is an essential cellular machine performing protein biosynthesis. Its structure and composition are highly conserved in all species. However, some bacteria have been reported to have an incomplete set of ribosomal proteins. We have analyzed ribosomal protein composition in 214 small bacterial genomes (<1 Mb) and found that although the ribosome composition is fairly stable, some ribosomal proteins may be absent, especially in bacteria with dramatically reduced genomes. The protein composition of the large subunit is less conserved than that of the small subunit. We have identified the set of frequently lost ribosomal proteins and demonstrated that they tend to be positioned on the ribosome surface and have fewer contacts with other ribosome components. Moreover, some proteins are lost in an evolutionarily correlated manner. The reduction of ribosomal RNA is also common, with deletions mostly occurring in free loops. Finally, the loss of the anti-Shine-Dalgarno sequence is associated with the loss of a higher number of ribosomal proteins.
Subjects
Genome Size, Bacterial Genome, Ribosomal Proteins/genetics, Ribosomes/chemistry
ABSTRACT
Constructing 3D models of chromosomes from single-cell Hi-C data is an important challenge. We present a reconstruction approach, DPDchrom, that incorporates basic knowledge of whether the reconstructed conformation should be coil-like or globular, as well as spring relaxation at contact sites. In contrast to previously published protocols, DPDchrom can naturally form a globular conformation due to the presence of explicit solvent. Benchmarking of this and several other methods on artificial polymer models reveals similar reconstruction accuracy at high contact density and an advantage for DPDchrom at low contact density. To compare 3D structures in a way that is insensitive to spatial orientation and scale, we propose the Modified Jaccard Index. We analyzed two sources of contact dropout: contact radius change and random contact sampling. We found that the reconstruction accuracy depends exponentially on the number of contacts per genomic bin, which allows the reconstruction accuracy to be estimated in advance. We applied DPDchrom to model chromosome configurations based on single-cell Hi-C data of mouse oocytes and found that these configurations differ significantly from a random one, which is consistent with other studies.
Subjects
Chromatin/chemistry, Single-Cell Analysis/methods, Algorithms, Animals, Mice, Protein Conformation
ABSTRACT
The first triplets of an mRNA coding region affect the yield of translation. We have applied the flowseq method to analyze >30 000 variants of codons 2-11 of a fluorescent protein reporter to identify factors affecting protein synthesis. While the negative influence of mRNA secondary structure on translation has been confirmed, a positive role of rare codons at the beginning of a coding sequence for gene expression has not been observed. The identity of triplets proximal to the start codon contributes more to the protein yield than that of more distant ones. Additional in-frame start codons enhance translation, while Shine-Dalgarno-like motifs downstream of the initiation codon are inhibitory. The metabolic cost of amino acids affects the yield of protein in the poor medium. The most efficient translation was observed for variants with features resembling those of native Escherichia coli genes.
Subjects
Initiator Codon/genetics, Nucleic Acid Conformation, Protein Biosynthesis, Messenger RNA/genetics, Initiator Codon/ultrastructure, Escherichia coli/genetics, Green Fluorescent Proteins/genetics, Translational Peptide Chain Initiation, Messenger RNA/ultrastructure, Ribosomes/genetics, Ribosomes/ultrastructure
ABSTRACT
Dosage compensation equalizes gene expression in a single male X chromosome with that in the pairs of autosomes and female X chromosomes. In the fruit fly Drosophila, canonical dosage compensation is implemented by the male-specific lethal (MSL) complex functioning in all male somatic cells. This complex contains the acetyltransferase males absent on the first (MOF), which performs H4K16 hyperacetylation specifically in the male X chromosome, thus facilitating transcription of the X-linked genes. However, accumulating evidence points to the existence of additional, non-canonical dosage compensation mechanisms operating in somatic and germline cells. In this review, we discuss current advances in the understanding of both canonical and non-canonical mechanisms of dosage compensation in Drosophila.
Subjects
Drosophila Proteins, Drosophila, Acetyltransferases/genetics, Animals, Genetic Dosage Compensation, Drosophila/genetics, Drosophila/metabolism, Drosophila Proteins/genetics, Drosophila Proteins/metabolism, Drosophila melanogaster/genetics, Drosophila melanogaster/metabolism, Female, Male, Nuclear Proteins/genetics, X Chromosome/genetics
ABSTRACT
ExuR and UxuR are paralogous proteins belonging to the GntR family of transcriptional regulators. Both are known to control hexuronic acid metabolism in a variety of Gammaproteobacteria but the relative impact of each of them is still unclear. Here, we apply 2D difference electrophoresis followed by mass-spectrometry to characterise the changes in the Escherichia coli proteome in response to a uxuR or exuR deletion. Our data clearly show that the effects are different: deletion of uxuR resulted in strongly enhanced expression of D-mannonate dehydratase UxuA and flagellar protein FliC, and in a reduced amount of outer membrane porin OmpF, while the absence of ExuR did not significantly alter the spectrum of detected proteins. Consequently, the physiological roles of proteins predicted as homologs seem to be far from identical. Effects of uxuR deletion were largely dependent on the cultivation conditions: during growth with glucose, UxuA and FliC were dramatically altered, while during growth with glucuronate, activation of both was not so prominent. During the growth with glucose, maximal activation was detected for FliC. This was further confirmed by expression analysis and physiological tests, thus suggesting the involvement of UxuR in the regulation of bacterial motility and biofilm formation.
Subjects
Escherichia coli Proteins, Escherichia coli, Escherichia coli/genetics, Escherichia coli/metabolism, Escherichia coli Proteins/genetics, Escherichia coli Proteins/metabolism, Bacterial Gene Expression Regulation, Glucose/metabolism, Hexuronic Acids/metabolism, Proteome/metabolism, Transcription Factors/metabolism
ABSTRACT
Polypedilum vanderplanki is a striking and unique example of an insect that can survive almost complete desiccation. Its genome and a set of dehydration-rehydration transcriptomes, together with the genome of Polypedilum nubifer (a congeneric desiccation-sensitive midge), were recently released. Here, using published and newly generated datasets reflecting detailed transcriptome changes during anhydrobiosis, as well as a developmental series, we show that the TCTAGAA DNA motif, which closely resembles the binding motif of the Drosophila melanogaster heat shock transcription activator (Hsf), is significantly enriched in the promoter regions of desiccation-induced genes in P. vanderplanki, such as genes encoding late embryogenesis abundant (LEA) proteins, thioredoxins, or trehalose metabolism-related genes, but not in P. nubifer. Unlike P. nubifer, P. vanderplanki has double TCTAGAA sites upstream of the Hsf gene itself, which is probably responsible for the stronger activation of Hsf in P. vanderplanki during desiccation compared with P. nubifer. To confirm the role of Hsf in desiccation-induced gene activation, we used the Pv11 cell line, derived from P. vanderplanki embryo. After preincubation with trehalose, Pv11 cells can enter anhydrobiosis and survive desiccation. We showed that Hsf knockdown suppresses trehalose-induced activation of multiple predicted Hsf targets (including P. vanderplanki-specific LEA protein genes) and reduces the desiccation survival rate of Pv11 cells fivefold. Thus, cooption of the heat shock regulatory system has been an important evolutionary mechanism for adaptation to desiccation in P. vanderplanki.
Subjects
Chironomidae/physiology, Heat Shock Transcription Factors/metabolism, Insect Proteins/metabolism, Animals, Biological Evolution, Chironomidae/genetics, Dehydration, Female, Heat Shock Transcription Factors/genetics, Heat-Shock Response, Insect Proteins/genetics, Male, Physiological Stress
ABSTRACT
An amendment to this paper has been published and can be accessed via the original article.
ABSTRACT
BACKGROUND: Salivary cell secretion (SCS) plays a critical role in blood feeding by medicinal leeches, making them of use for certain medical purposes even today. RESULTS: We annotated the Hirudo medicinalis genome and performed RNA-seq on salivary cells isolated from three closely related leech species, H. medicinalis, Hirudo orientalis, and Hirudo verbana. Differential expression analysis verified by proteomics identified genes with salivary cell-specific expression, many of which encode previously unknown salivary components. However, the genes encoding known anticoagulants were found to be expressed not only in salivary cells. The function-related analysis of the unique salivary cell genes enabled an update of the concept of interactions between salivary proteins and components of haemostasis. CONCLUSIONS: Here we report a draft genome of Hirudo medicinalis and describe the identification of novel salivary proteins and new homologs of genes encoding known anticoagulants in the transcriptomes of three medicinal leech species. Our data provide new insights into the genetics of the blood-feeding lifestyle in leeches.
Subjects
Genome, Hirudo medicinalis/genetics, Salivary Proteins and Peptides/genetics, Animals, Anticoagulants/metabolism, Gene Expression Profiling, Gene Expression Regulation, Hirudo medicinalis/metabolism, Leeches/classification, Leeches/genetics, Leeches/metabolism, Proteomics, Saliva/metabolism, Salivary Proteins and Peptides/metabolism
ABSTRACT
Changes in splicing are known to affect the function and regulation of genes. We analyzed splicing events that take place during the postnatal development of the prefrontal cortex in humans, chimpanzees, and rhesus macaques based on data obtained from 168 individuals. Our study revealed that among the 38,822 quantified alternative exons, 15% are differentially spliced among species, and more than 6% splice differently at different ages. Mutations in splicing acceptor and/or donor sites might explain more than 14% of all splicing differences among species and up to 64% of high-amplitude differences. A reconstructed trans-regulatory network containing 21 RNA-binding proteins explains a further 4% of splicing variations within species. While most age-dependent splicing patterns are conserved among the three species, developmental changes in intron retention are substantially more pronounced in humans.
Subjects
Alternative Splicing/genetics, Macaca mulatta/embryology, Macaca mulatta/genetics, Pan troglodytes/embryology, Pan troglodytes/genetics, Prefrontal Cortex/embryology, Messenger RNA/genetics, Animals, Molecular Evolution, Humans, Protein Isoforms/genetics
ABSTRACT
BACKGROUND: The genus Streptococcus comprises pathogens that strongly influence the health of humans and animals. Genome sequencing of multiple Streptococcus strains demonstrated high variability in gene content and order even in closely related strains of the same species and created a newly emerged object for genomic analysis, the pan-genome. Here we analysed the genome evolution of 25 strains of Streptococcus suis, 50 strains of Streptococcus pyogenes and 28 strains of Streptococcus pneumoniae. RESULTS: Fractions of the pan-genome, unique, periphery, and universal genes differ in size, functional composition, the level of nucleotide substitutions, and predisposition to horizontal gene transfer and genomic rearrangements. The density of substitutions in intergenic regions appears to be correlated with selection acting on adjacent genes, implying that more conserved genes tend to have more conserved regulatory regions. The total pan-genome of the genus is open, but only due to strain-specific genes, whereas other pan-genome fractions reach saturation. We have identified the set of genes with phylogenies inconsistent with species and non-conserved location in the chromosome; these genes are rare in at least one species and have likely experienced recent horizontal transfer between species. The strain-specific fraction is enriched with mobile elements and hypothetical proteins, but also contains a number of candidate virulence-related genes, so it may have a strong impact on adaptability and pathogenicity. Mapping the rearrangements to the phylogenetic tree revealed large parallel inversions in all species. A parallel inversion of length 15 kb with breakpoints formed by genes encoding surface antigen proteins PhtD and PhtB in S. pneumoniae leads to replacement of gene fragments, which likely indicates the action of an antigen variation mechanism.
CONCLUSIONS: Members of the genus Streptococcus have a highly dynamic, open pan-genome that potentially gives them the ability to adapt to changing environmental conditions, e.g. through antibiotic resistance or transmission between different hosts. Hence, integrated analysis of all aspects of genome evolution is important for the identification of potential pathogens and the design of drugs and vaccines.
Subjects
Antigenic Variation/genetics, Biological Evolution, Horizontal Gene Transfer, Genetic Selection, Streptococcus/genetics, Animals, Conserved Sequence/genetics, Intergenic DNA, Gene Flow, Gene Ontology, Gene Rearrangement/genetics, Bacterial Genes, Genome Size, Humans, Hydrolases/metabolism, Nucleotides/genetics, Phylogeny, Sequence Deletion, Species Specificity, Streptococcus pneumoniae/genetics, Virulence/genetics
ABSTRACT
BACKGROUND: Chlamydia are ancient intracellular pathogens with a reduced, though strikingly conserved, genome. Despite their parasitic lifestyle and isolated intracellular environment, these bacteria have managed to avoid the accumulation of deleterious mutations and the subsequent genome degradation characteristic of many parasitic bacteria. RESULTS: We report a pan-genomic analysis of sixteen species from the genus Chlamydia, including identification and functional annotation of orthologous genes, and characterization of gene gains, losses, and rearrangements. We demonstrate the overall genome stability of these bacteria, as indicated by a large fraction of common genes with conserved genomic locations. On the other hand, extreme evolvability is confined to several paralogous gene families, such as polymorphic membrane proteins and phospholipase D, and is likely caused by pressure from the host immune system. CONCLUSIONS: This combination of a large, conserved core genome and a small, evolvable periphery likely reflects the balance between the selective pressure towards genome reduction and the need to adapt in order to escape host immunity.
Subjects
Physiological Adaptation/genetics, Chlamydia/genetics, Chlamydia/physiology, Genomics, Host-Pathogen Interactions/genetics, Genetic Selection, Molecular Evolution, Bacterial Genome/genetics, Molecular Sequence Annotation
ABSTRACT
Recent advances enabled by the Hi-C technique have unraveled many principles of chromosomal folding that were subsequently linked to disease and gene regulation. In particular, Hi-C revealed that chromosomes of animals are organized into topologically associating domains (TADs), evolutionarily conserved compact chromatin domains that influence gene expression. Mechanisms that underlie partitioning of the genome into TADs remain poorly understood. To explore principles of TAD folding in Drosophila melanogaster, we performed Hi-C and poly(A)+ RNA-seq in four cell lines of various origins (S2, Kc167, DmBG3-c2, and OSC). Contrary to previous studies, we find that regions between TADs (i.e., the inter-TADs and TAD boundaries) in Drosophila are only weakly enriched with the insulator protein dCTCF, while another insulator protein, Su(Hw), is preferentially present within TADs. However, Drosophila inter-TADs harbor active chromatin and constitutively transcribed (housekeeping) genes. Accordingly, we find that binding of insulator proteins dCTCF and Su(Hw) predicts TAD boundaries much worse than active chromatin marks do. Interestingly, inter-TADs correspond to decompacted inter-bands of polytene chromosomes, whereas TADs mostly correspond to densely packed bands. Collectively, our results suggest that TADs are condensed chromatin domains depleted in active chromatin marks, separated by regions of active chromatin. We propose a mechanism of TAD self-assembly based on the ability of nucleosomes from inactive chromatin to aggregate, and the lack of this ability in acetylated nucleosomal arrays. Finally, we test this hypothesis by polymer simulations and find that TAD partitioning may be explained by different modes of inter-nucleosomal interactions for active and inactive chromatin.
Subjects
Chromatin/genetics, Drosophila melanogaster/genetics, Insect Genome, Genetic Transcription, Animals, Cell Line, Chromatin Assembly and Disassembly, Chromosome Mapping, Computer Simulation, Molecular Models, Nucleosomes/genetics, Nucleosomes/metabolism, Polytene Chromosomes/genetics, RNA Sequence Analysis
ABSTRACT
The yield of protein per translated mRNA may vary by four orders of magnitude. Many studies have analyzed the influence of mRNA features on the translation yield. However, a detailed understanding of how an mRNA sequence determines its propensity to be translated is still missing. Here, we constructed a set of reporter plasmid libraries encoding CER fluorescent protein preceded by randomized 5′ untranslated regions (5′-UTR) and Red fluorescent protein (RFP) used as an internal control. Each library was transformed into Escherichia coli cells, separated by efficiency of CER mRNA translation by a cell sorter and subjected to next generation sequencing. We tested the efficiency of translation of the CER gene preceded by each of 48 natural 5′-UTR sequences and introduced random and designed mutations into natural and artificially selected 5′-UTRs. Several distinct properties could be ascribed to a group of 5′-UTRs most efficient in translation. In addition to known ones, several previously unrecognized features that contribute to the translation enhancement were found, such as a low proportion of cytidine residues, multiple SD sequences and AG repeats. The latter could be identified as a translation enhancer, albeit less efficient than the SD sequence, in several natural 5′-UTRs.
Subjects
5' Untranslated Regions, Escherichia coli/genetics, Protein Biosynthesis, Ribonucleic Acid Regulatory Sequences, Cell Separation, Flow Cytometry, Reporter Genes, High-Throughput Nucleotide Sequencing, Mutation, Nucleic Acid Conformation, Nucleotides/physiology
ABSTRACT
BACKGROUND: The genus Burkholderia consists of species that occupy remarkably diverse ecological niches. Its best known members are important pathogens, B. mallei and B. pseudomallei, which cause glanders and melioidosis, respectively. Burkholderia genomes are unusual due to their multichromosomal organization, generally comprising 2-3 chromosomes. RESULTS: We performed integrated genomic analysis of 127 Burkholderia strains. The pan-genome is open with the saturation to be reached between 86,000 and 88,000 genes. The reconstructed rearrangements indicate a strong avoidance of intra-replichore inversions that is likely caused by selection against the transfer of large groups of genes between the leading and the lagging strands. Translocated genes also tend to retain their position in the leading or the lagging strand, and this selection is stronger for large syntenies. Integrated reconstruction of chromosome rearrangements in the context of strain phylogeny reveals parallel rearrangements that may indicate inversion-based phase variation and integration of new genomic islands. In particular, we detected parallel inversions in the second chromosomes of B. pseudomallei with breakpoints formed by genes encoding membrane components of a multidrug resistance complex, which may be linked to a phase variation mechanism. Two genomic islands, spreading horizontally between chromosomes, were detected in the B. cepacia group. CONCLUSIONS: This study demonstrates the power of integrated analysis of pan-genomes, chromosome rearrangements, and selection regimes. Non-random inversion patterns indicate selective pressure; inversions are particularly frequent in the recent pathogen B. mallei and, together with periods of positive selection at other branches, may indicate adaptation to new niches. One such adaptation could be a possible phase variation mechanism in B. pseudomallei.
Subjects
Burkholderia/genetics, Bacterial Chromosomes, Gene Rearrangement/genetics, Burkholderia/classification, Genetic Databases, Phylogeny
ABSTRACT
Riboswitches are conserved RNA structures located in non-coding regions of mRNA that bind small molecules (e.g. metabolites) and change conformation upon binding. This feature enables them to function as regulators of gene expression. The thiamin pyrophosphate (TPP) riboswitch is the only type of riboswitch found not only in bacteria, but also in eukaryotes: in plants, green algae, protists, and fungi. Two main mechanisms of fungal TPP riboswitch action, involving alternative splicing, have been established so far. Here, we report a large-scale bioinformatic study of riboswitch structural features, action mechanisms, and distribution across fungal taxonomic groups. For each putatively regulated gene, we reconstruct the riboswitch structure, identify other components of the regulation machinery, and establish mechanisms of riboswitch-mediated regulation. In addition to three genes known to be regulated by TPP riboswitches, thiazole synthase THI4, hydroxymethylpyrimidine synthase NMT1, and putative transporter NCU01977, we identify two new genes, a putative thiamin transporter THI9 and a transporter of unknown specificity. While the riboswitch sequence and structure remain highly conserved in all species and genes, the mode of riboswitch-mediated regulation varies between regulated genes. The riboswitch usage varies strongly between fungal taxa, with the largest number of riboswitch-regulated genes found in Pezizomycotina and no riboswitch-mediated regulation established in Saccharomycotina.