Your browser doesn't support javascript.
loading
Mostrar: 20 | 50 | 100
Resultados 1 - 20 de 37
Filtrar
1.
BMC Bioinformatics ; 24(1): 391, 2023 Oct 18.
Artigo em Inglês | MEDLINE | ID: mdl-37853347

RESUMO

BACKGROUND: The rapid development of omics acquisition techniques has induced the production of a large volume of heterogeneous and multi-level omics datasets, which require specific and sometimes complex analyses to obtain relevant biological information. Here, we present ASTERICS (version 2.5), a publicly available web interface for the analyses of omics datasets. RESULTS: ASTERICS is designed to make both standard and complex exploratory and integration analysis workflows easily available to biologists and to provide high quality interactive plots. Special care has been taken to provide a comprehensive documentation of the implemented analyses and to guide users toward sound analysis choices regarding some specific omics data. Data and analyses are organized in a comprehensive graphical workflow within ASTERICS workspace to facilitate the understanding of successive data editions and analyses leading to a given result. CONCLUSION: ASTERICS provides an easy to use platform for omics data exploration and integration. The modular organization of its open source code makes it easy to incorporate new workflows and analyses by external contributors. ASTERICS is available at https://asterics.miat.inrae.fr and can also be deployed using provided docker images.


Assuntos
Software , Fluxo de Trabalho
2.
BMC Bioinformatics ; 23(1): 495, 2022 Nov 18.
Artigo em Inglês | MEDLINE | ID: mdl-36401177

RESUMO

BACKGROUND: Sequencing is the key method to study the impact of short RNAs, which include micro RNAs, tRNA-derived RNAs, and piwi-interacting RNA, among others. The first step to make use of these reads is to map them to a genome. Existing mapping tools have been developed for long RNAs in mind, and, so far, no tool has been conceived for short RNAs. However, short RNAs have several distinctive features which make them different from messenger RNAs: they are shorter, they are often redundant, they can be produced by duplicated loci, and they may be edited at their ends. RESULTS: In this work, we present a new tool, srnaMapper, that exhaustively maps these reads with all these features in mind, and is most efficient when applied to reads no longer than 50 base pairs. We show, on several datasets, that srnaMapper is very efficient considering computation time and edition error handling: it retrieves all the hits, with arbitrary number of errors, in time comparable with non-exhaustive tools.


Assuntos
Sequenciamento de Nucleotídeos em Larga Escala , MicroRNAs , Análise de Sequência de RNA/métodos , Sequenciamento de Nucleotídeos em Larga Escala/métodos , RNA Interferente Pequeno , RNA de Transferência
3.
BMC Genomics ; 18(1): 882, 2017 Nov 16.
Artigo em Inglês | MEDLINE | ID: mdl-29145803

RESUMO

BACKGROUND: Small regulatory RNAs (sRNAs) are widely found in bacteria and play key roles in many important physiological and adaptation processes. Studying their evolution and screening for events of coevolution with other genomic features is a powerful way to better understand their origin and assess a common functional or adaptive relationship between them. However, evolution and coevolution of sRNAs with coding genes have been sparsely investigated in bacterial pathogens. RESULTS: We designed a robust and generic phylogenomics approach that detects correlated evolution between sRNAs and protein-coding genes using their observed and inferred patterns of presence-absence in a set of annotated genomes. We applied this approach on 79 complete genomes of the Listeria genus and identified fifty-two accessory sRNAs, of which most were present in the Listeria common ancestor and lost during Listeria evolution. We detected significant coevolution between 23 sRNA and 52 coding genes and inferred the Listeria sRNA-coding genes coevolution network. We characterized a main hub of 12 sRNAs that coevolved with genes encoding cell wall proteins and virulence factors. Among them, an sRNA specific to L. monocytogenes species, rli133, coevolved with genes involved either in pathogenicity or in interaction with host cells, possibly acting as a direct negative post-transcriptional regulation. CONCLUSIONS: Our approach allowed the identification of candidate sRNAs potentially involved in pathogenicity and host interaction, consistent with recent findings on known pathogenicity actors. We highlight four sRNAs coevolving with seven internalin genes, some of which being important virulence factors in Listeria.


Assuntos
Proteínas de Bactérias/genética , Evolução Molecular , Listeria/genética , Pequeno RNA não Traduzido/genética , Redes Reguladoras de Genes , Genes Bacterianos , Genoma Bacteriano , Listeria/patogenicidade
4.
Brief Bioinform ; 16(5): 795-805, 2015 Sep.
Artigo em Inglês | MEDLINE | ID: mdl-25477348

RESUMO

The revolution in high-throughput sequencing technologies has enabled the acquisition of gigabytes of RNA sequences in many different conditions and has highlighted an unexpected number of small RNAs (sRNAs) in bacteria. Ongoing exploitation of these data enables numerous applications for investigating bacterial transacting sRNA-mediated regulation networks. Focusing on sRNAs that regulate mRNA translation in trans, recent works have noted several sRNA-based regulatory pathways that are essential for key cellular processes. Although the number of known bacterial sRNAs is increasing, the experimental validation of their interactions with mRNA targets remains challenging and involves expensive and time-consuming experimental strategies. Hence, bioinformatics is crucial for selecting and prioritizing candidates before designing any experimental work. However, current software for target prediction produces a prohibitive number of candidates because of the lack of biological knowledge regarding the rules governing sRNA-mRNA interactions. Therefore, there is a real need to develop new approaches to help biologists focus on the most promising predicted sRNA-mRNA interactions. In this perspective, this review aims at presenting the advantages of mixing bioinformatics and visualization approaches for analyzing predicted sRNA-mediated regulatory bacterial networks.


Assuntos
Biologia Computacional , Redes Reguladoras de Genes , RNA Bacteriano/fisiologia
5.
Bioinformatics ; 32(3): 456-8, 2016 Feb 01.
Artigo em Inglês | MEDLINE | ID: mdl-26454273

RESUMO

SUMMARY: Biologists produce large data sets and are in demand of rich and simple web portals in which they can upload and analyze their files. Providing such tools requires to mask the complexity induced by the needed High Performance Computing (HPC) environment. The connection between interface and computing infrastructure is usually specific to each portal. With Jflow, we introduce a Workflow Management System (WMS), composed of jQuery plug-ins which can easily be embedded in any web application and a Python library providing all requested features to setup, run and monitor workflows. AVAILABILITY AND IMPLEMENTATION: Jflow is available under the GNU General Public License (GPL) at http://bioinfo.genotoul.fr/jflow. The package is coming with full documentation, quick start and a running test portal. CONTACT: Jerome.Mariette@toulouse.inra.fr.


Assuntos
Biologia Computacional/métodos , Sistemas de Gerenciamento de Base de Dados , Armazenamento e Recuperação da Informação , Internet , Software , Bases de Dados Factuais , Humanos , Fluxo de Trabalho
6.
BMC Genomics ; 17: 164, 2016 Mar 01.
Artigo em Inglês | MEDLINE | ID: mdl-26931235

RESUMO

BACKGROUND: MicroRNAs (miRNAs) have emerged as important post-transcriptional regulators of gene expression in a wide variety of physiological processes. They can control both temporal and spatial gene expression and are believed to regulate 30 to 70% of the genes. Data are however limited for fish species, with only 9 out of the 30,000 fish species present in miRBase. The aim of the current study was to discover and characterize rainbow trout (Oncorhynchus mykiss) miRNAs in a large number of tissues using next-generation sequencing in order to provide an extensive repertoire of rainbow trout miRNAs. RESULTS: A total of 38 different samples corresponding to 16 different tissues or organs were individually sequenced and analyzed independently in order to identify a large number of miRNAs with high confidence. This led to the identification of 2946 miRNA loci in the rainbow trout genome, including 445 already known miRNAs. Differential expression analysis was performed in order to identify miRNAs exhibiting specific or preferential expression among the 16 analyzed tissues. In most cases, miRNAs exhibit a specific pattern of expression in only a few tissues. The expression data from sRNA sequencing were confirmed by RT-qPCR. In addition, novel miRNAs are described in rainbow trout that had not been previously reported in other species. CONCLUSION: This study represents the first characterization of rainbow trout miRNA transcriptome from a wide variety of tissue and sets an extensive repertoire of rainbow trout miRNAs. It provides a starting point for future studies aimed at understanding the roles of miRNAs in major physiological process such as growth, reproduction or adaptation to stress. These rainbow trout miRNAs repertoire provide a novel resource to advance genomic research in salmonid species.


Assuntos
MicroRNAs/genética , Oncorhynchus mykiss/genética , Transcriptoma , Animais , Sequenciamento de Nucleotídeos em Larga Escala , Análise de Sequência de RNA
7.
BMC Bioinformatics ; 16: 179, 2015 May 29.
Artigo em Inglês | MEDLINE | ID: mdl-26022464

RESUMO

BACKGROUND: Several methods exist for the prediction of precursor miRNAs (pre-miRNAs) in genomic or sRNA-seq (small RNA sequences) data produced by NGS (Next Generation Sequencing). One key information used for this task is the characteristic hairpin structure adopted by pre-miRNAs, that in general are identified using RNA folders whose complexity is cubic in the size of the input. The vast majority of pre-miRNA predictors then rely on further information learned from previously validated miRNAs from the same or a closely related genome for the final prediction of new miRNAs. With this paper, we wished to address three main issues. The first was methodological and aimed at obtaining a more time-efficient predictor, however without losing in accuracy which represented a second issue. We indeed aimed at better predicting miRNAs at a genome scale, but also from sRNAseq data where in some cases, notably of plants, the current folding methods often infer the wrong structure. The third issue is related to the fact that it is important to rely as little as possible on previously recorded examples of miRNAs. We therefore also sought a method that is less dependent on previous miRNA records. RESULTS: As concerns the first and second issues, we present a novel alternative to a classical folder based on a thermodynamic Nearest-Neighbour (NN) model for computing the free energy and predicting the classical hairpin structure of a pre-miRNA. We show that the free energies thus computed correlate well with those of RNAFOLD. This novel method, called MIRINHO, has quadratic instead of cubic complexity and is much more efficient also in practice. When applied to sRNAseq data of plants, it gives in general better results than classical folders. On the third issue, we show that MIRINHO, which uses as only knowledge the length of the loops and stem-arms and the free energy of the pre-miRNA hairpin, compares well with algorithms that require more information. The results, obtained with different datasets, are indeed similar to those of other approaches with which such a comparison was possible. These needed to be publicly available softwares that could be used on a large input. In some cases, MIRINHO is even better in terms of sensitivity or precision. CONCLUSION: We provide a simpler and much faster method with very reasonable sensitivity and precision, which can be applied without special adaptation to the prediction of both animal and plant pre-miRNAs, using as input either genomic sequences or sRNA-seq data.


Assuntos
Arabidopsis/genética , Genoma , Sequenciamento de Nucleotídeos em Larga Escala/métodos , Insetos/genética , MicroRNAs/genética , Análise de Sequência de RNA/métodos , Software , Algoritmos , Animais , Pareamento de Bases , Sequência de Bases , Genômica/métodos , Dados de Sequência Molecular , Homologia de Sequência do Ácido Nucleico
8.
BMC Genomics ; 16: 275, 2015 Apr 09.
Artigo em Inglês | MEDLINE | ID: mdl-25887031

RESUMO

BACKGROUND: Changes to mRNA lifetime adjust mRNA concentration, facilitating the adaptation of growth rate to changes in growth conditions. However, the mechanisms regulating mRNA lifetime are poorly understood at the genome-wide scale and have not been investigated in bacteria growing at different rates. RESULTS: We used linear covariance models and the best model selected according to the Akaike information criterion to identify and rank intrinsic and extrinsic general transcript parameters correlated with mRNA lifetime, using mRNA half-life datasets for E. coli, obtained at four growth rates. The principal parameter correlated with mRNA stability was mRNA concentration, the mRNAs most concentrated in the cells being the least stable. However, sequence-related features (codon adaptation index (CAI), ORF length, GC content, polycistronic mRNA), gene function and essentiality also affected mRNA lifetime at all growth rates. We also identified sequence motifs within the 5'UTRs potentially related to mRNA stability. Growth rate-dependent effects were confined to particular functional categories (e.g. carbohydrate and nucleotide metabolism). Finally, mRNA stability was less strongly correlated with the amount of protein produced than mRNA concentration and CAI. CONCLUSIONS: This study provides the most complete genome-wide analysis to date of the general factors correlated with mRNA lifetime in E. coli. We have generalized for the entire population of transcripts or excluded determinants previously defined as regulators of stability for some particular mRNAs and identified new, unexpected general indicators. These results will pave the way for discussions of the underlying mechanisms and their interaction with the growth physiology of bacteria.


Assuntos
Escherichia coli/genética , Genoma Bacteriano , RNA Mensageiro/metabolismo , Regiões 5' não Traduzidas , Composição de Bases , Sequência de Bases , Códon/metabolismo , Escherichia coli/crescimento & desenvolvimento , Proteínas de Escherichia coli/química , Proteínas de Escherichia coli/metabolismo , Meia-Vida , Modelos Biológicos , Fases de Leitura Aberta/genética , Estabilidade de RNA
9.
Genome Biol ; 25(1): 248, 2024 Sep 30.
Artigo em Inglês | MEDLINE | ID: mdl-39343954

RESUMO

BACKGROUND: Dairy cattle breeds are populations of limited effective size, subject to recurrent outbreaks of recessive defects that are commonly studied using positional cloning. However, this strategy, based on the observation of animals with characteristic features, may overlook a number of conditions, such as immune or metabolic genetic disorders, which may be confused with pathologies of environmental etiology. RESULTS: We present a data mining framework specifically designed to detect recessive defects in livestock that have been previously missed due to a lack of specific signs, incomplete penetrance, or incomplete linkage disequilibrium. This approach leverages the massive data generated by genomic selection. Its basic principle is to compare the observed and expected numbers of homozygotes for sliding haplotypes in animals with different life histories. Within three cattle breeds, we report 33 new loci responsible for increased risk of juvenile mortality and present a series of validations based on large-scale genotyping, clinical examination, and functional studies for candidate variants affecting the NOA1, RFC5, and ITGB7 genes. In particular, we describe disorders associated with NOA1 and RFC5 mutations for the first time in vertebrates. CONCLUSIONS: The discovery of these many new defects will help to characterize the genetic basis of inbreeding depression, while their management will improve animal welfare and reduce losses to the industry.


Assuntos
Genes Recessivos , Animais , Bovinos , Mineração de Dados , Doenças dos Bovinos/genética , Haplótipos
10.
RNA ; 17(11): 1947-56, 2011 Nov.
Artigo em Inglês | MEDLINE | ID: mdl-21947200

RESUMO

The annotation of noncoding RNA genes remains a major bottleneck in genome sequencing projects. Most genome sequences released today still come with sets of tRNAs and rRNAs as the only annotated RNA elements, ignoring hundreds of other RNA families. We have developed a web environment that is dedicated to noncoding RNA (ncRNA) prediction, annotation, and analysis and allows users to run a variety of tools in an integrated and flexible manner. This environment offers complementary ncRNA gene finders and a set of tools for the comparison, visualization, editing, and export of ncRNA candidates. Predictions can be filtered according to a large set of characteristics. Based on this environment, we created a public website located at http://RNAspace.org. It accepts genomic sequences up to 5 Mb, which permits for an online annotation of a complete bacterial genome or a small eukaryotic chromosome. The project is hosted as a Source Forge project (http://rnaspace.sourceforge.net/).


Assuntos
Bases de Dados de Ácidos Nucleicos , Internet , RNA não Traduzido/análise , Sequência de Bases , Genoma , Conformação de Ácido Nucleico , RNA não Traduzido/química , Software
11.
RNA Biol ; 10(5): 659-70, 2013 May.
Artigo em Inglês | MEDLINE | ID: mdl-23422322

RESUMO

Pyrococcales are members of the order Thermococcales, a group of hyperthermophilic euryarchaea that are frequently found in deep sea hydrothermal vents. Infectious genetic elements, such as plasmids and viruses, remain a threat even in this remote environment and these microorganisms have developed several ways to fight their genetic invaders. Among these are the recently discovered CRISPR systems. In this review, we have combined and condensed available information on genetic elements infecting the Thermococcales and on the multiple CRISPR systems found in the Pyrococcales to fight them. Their organization and mode of action will be presented with emphasis on the Type III-B system that is the only CRISPR system known to target RNA molecules in a process reminiscent of RNA interference. The intriguing case of Pyrococcus abyssi, which is among the rare strains to present a CRISPR system devoid of the universal cas1 and cas2 genes, is also discussed.


Assuntos
Sistemas CRISPR-Cas , Repetições Palindrômicas Curtas Agrupadas e Regularmente Espaçadas , Pyrococcus abyssi/genética , Pyrococcus abyssi/virologia , Thermococcales/genética , Thermococcales/virologia , Vírus de Archaea/genética , Vírus de Archaea/fisiologia , Regulação da Expressão Gênica em Archaea , Transferência Genética Horizontal , Genoma Arqueal , Filogenia
12.
Genes (Basel) ; 14(3)2023 03 07.
Artigo em Inglês | MEDLINE | ID: mdl-36980936

RESUMO

By pairing to messenger RNAs (mRNAs for short), microRNAs (miRNAs) regulate gene expression in animals and plants. Accurately identifying which mRNAs interact with a given miRNA and the precise location of the interaction sites is crucial to reaching a more complete view of the regulatory network of an organism. Only a few experimental approaches, however, allow the identification of both within a single experiment. Computational predictions of miRNA-mRNA interactions thus remain generally the first step used, despite their drawback of a high rate of false-positive predictions. The major computational approaches available rely on a diversity of features, among which anchoring the miRNA seed and measuring mRNA accessibility are the key ones, with the first being universally used, while the use of the second remains controversial. Revisiting the importance of each is the aim of this paper, which uses Cross-Linking, Ligation, And Sequencing of Hybrids (CLASH) datasets to achieve this goal. Contrary to what might be expected, the results are more ambiguous regarding the use of the seed match as a feature, while accessibility appears to be a feature worth considering, indicating that, at least under some conditions, it may favour anchoring by miRNAs.


Assuntos
Regulação da Expressão Gênica , MicroRNAs , RNA Mensageiro , MicroRNAs/genética , MicroRNAs/metabolismo , RNA Mensageiro/genética
13.
Sci Data ; 10(1): 369, 2023 06 08.
Artigo em Inglês | MEDLINE | ID: mdl-37291142

RESUMO

Inspired by the production of reference data sets in the Genome in a Bottle project, we sequenced one Charolais heifer with different technologies: Illumina paired-end, Oxford Nanopore, Pacific Biosciences (HiFi and CLR), 10X Genomics linked-reads, and Hi-C. In order to generate haplotypic assemblies, we also sequenced both parents with short reads. From these data, we built two haplotyped trio high quality reference genomes and a consensus assembly, using up-to-date software packages. The assemblies obtained using PacBio HiFi reaches a size of 3.2 Gb, which is significantly larger than the 2.7 Gb ARS-UCD1.2 reference. The BUSCO score of the consensus assembly reaches a completeness of 95.8%, among highly conserved mammal genes. We also identified 35,866 structural variants larger than 50 base pairs. This assembly is a contribution to the bovine pangenome for the "Charolais" breed. These datasets will prove to be useful resources enabling the community to gain additional insight on sequencing technologies for applications such as SNP, indel or structural variant calling, and de novo assembly.


Assuntos
Genômica , Sequenciamento de Nucleotídeos em Larga Escala , Animais , Bovinos , Feminino , Benchmarking , Genoma , Análise de Sequência de DNA
14.
PLoS One ; 17(9): e0273253, 2022.
Artigo em Inglês | MEDLINE | ID: mdl-36070299

RESUMO

Circular RNA (circRNA) is a noncoding RNA class with important implications for gene expression regulation, mostly by interaction with other RNA species or RNA-binding proteins. While the commonly applied short-read Illumina RNA-sequencing techniques can be used to detect circRNAs, their full sequence is not revealed. However, the complete sequence information is needed to analyze potential interactions and thus the mechanism of action of circRNAs. Here, we present an improved protocol to enrich and sequence full-length circRNAs by using the Oxford Nanopore long-read sequencing platform. The protocol involves an enrichment of lowly abundant circRNAs by exonuclease treatment and negative selection of linear RNAs. Then, a cDNA library is created and amplified by PCR. This protocol provides enough material for several sequencing runs. The library is used as input for ligation-based sequencing together with native barcoding. Stringent quality control of the libraries is ensured by a combination of Qubit, Fragment Analyzer and qRT-PCR. Multiplexing of up to 4 libraries yields in total more than 1-2 Million reads per library, of which 1-2% are circRNA-specific reads with >99% of them full-length. The protocol works well with human cancer cell lines. We further provide suggestions for the bioinformatic analysis of the created data, as well as the limitations of our approach together with recommendations for troubleshooting and interpretation. Taken together, this protocol enables reliable full-length analysis of circRNAs, a noncoding RNA type involved in a growing number of physiologic and pathologic conditions. Metadata Associated content. https://dx.doi.org/10.17504/protocols.io.rm7vzy8r4lx1/v2.


Assuntos
Nanoporos , RNA Circular , Sequenciamento de Nucleotídeos em Larga Escala/métodos , Humanos , RNA/genética , Análise de Sequência de RNA/métodos
15.
BMC Genomics ; 12: 312, 2011 Jun 13.
Artigo em Inglês | MEDLINE | ID: mdl-21668986

RESUMO

BACKGROUND: Noncoding RNA (ncRNA) has been recognized as an important regulator of gene expression networks in Bacteria and Eucaryota. Little is known about ncRNA in thermococcal archaea except for the eukaryotic-like C/D and H/ACA modification guide RNAs. RESULTS: Using a combination of in silico and experimental approaches, we identified and characterized novel P. abyssi ncRNAs transcribed from 12 intergenic regions, ten of which are conserved throughout the Thermococcales. Several of them accumulate in the late-exponential phase of growth. Analysis of the genomic context and sequence conservation amongst related thermococcal species revealed two novel P. abyssi ncRNA families. The CRISPR family is comprised of crRNAs expressed from two of the four P. abyssi CRISPR cassettes. The 5'UTR derived family includes four conserved ncRNAs, two of which have features similar to known bacterial riboswitches. Several of the novel ncRNAs have sequence similarities to orphan OrfB transposase elements. Based on RNA secondary structure predictions and experimental results, we show that three of the twelve ncRNAs include Kink-turn RNA motifs, arguing for a biological role of these ncRNAs in the cell. Furthermore, our results show that several of the ncRNAs are subjected to processing events by enzymes that remain to be identified and characterized. CONCLUSIONS: This work proposes a revised annotation of CRISPR loci in P. abyssi and expands our knowledge of ncRNAs in the Thermococcales, thus providing a starting point for studies needed to elucidate their biological function.


Assuntos
Loci Gênicos/genética , Pyrococcus abyssi/genética , RNA Arqueal/genética , Riboswitch/genética , Regiões 5' não Traduzidas/genética , Sequência de Bases , Sequência Conservada , DNA Intergênico/genética , Genoma Arqueal/genética , Reação em Cadeia da Polimerase Via Transcriptase Reversa , Especificidade da Espécie
16.
Nucleic Acids Res ; 37(21): 7239-57, 2009 Nov.
Artigo em Inglês | MEDLINE | ID: mdl-19786493

RESUMO

Bioinformatic analysis of the intergenic regions of Staphylococcus aureus predicted multiple regulatory regions. From this analysis, we characterized 11 novel noncoding RNAs (RsaA-K) that are expressed in several S. aureus strains under different experimental conditions. Many of them accumulate in the late-exponential phase of growth. All ncRNAs are stable and their expression is Hfq-independent. The transcription of several of them is regulated by the alternative sigma B factor (RsaA, D and F) while the expression of RsaE is agrA-dependent. Six of these ncRNAs are specific to S. aureus, four are conserved in other Staphylococci, and RsaE is also present in Bacillaceae. Transcriptomic and proteomic analysis indicated that RsaE regulates the synthesis of proteins involved in various metabolic pathways. Phylogenetic analysis combined with RNA structure probing, searches for RsaE-mRNA base pairing, and toeprinting assays indicate that a conserved and unpaired UCCC sequence motif of RsaE binds to target mRNAs and prevents the formation of the ribosomal initiation complex. This study unexpectedly shows that most of the novel ncRNAs carry the conserved C-rich motif, suggesting that they are members of a class of ncRNAs that target mRNAs by a shared mechanism.


Assuntos
Regulação Bacteriana da Expressão Gênica , RNA não Traduzido/química , Staphylococcus aureus/genética , Bacillus subtilis/genética , Bacillus subtilis/metabolismo , Sequência de Bases , Biologia Computacional , Sequência Conservada , Perfilação da Expressão Gênica , Dados de Sequência Molecular , Proteômica , Estabilidade de RNA , RNA Mensageiro/metabolismo , RNA não Traduzido/genética , RNA não Traduzido/metabolismo , Transcrição Gênica
17.
Virus Evol ; 7(2): veab093, 2021 Sep.
Artigo em Inglês | MEDLINE | ID: mdl-35299790

RESUMO

Highly pathogenic avian influenza viruses (HPAIVs) evolve from low pathogenic avian influenza viruses (LPAIVs) of the H5 and H7 subtypes. This evolution is characterized by the acquisition of a multi-basic cleavage site (MBCS) motif in the hemagglutinin (HA) that leads to an extended viral tropism and severe disease in poultry. One key unanswered question is whether the risk of transition to HPAIVs is similar for all LPAIVs H5 or H7 strains, or whether specific determinants in the HA sequence of some H5 or H7 LPAIV strains correlate with a higher risk of transition to HPAIVs. Here, we determined if specific features of the conserved RNA stem-loop located at the HA cleavage site-encoding region could be detected along the LPAIV to HPAIV evolutionary pathway. Analysis of the thermodynamic stability of the predicted RNA structures showed no specific patterns common to HA sequences leading to HPAIVs and distinct from those remaining LPAIVs. However, RNA structure clustering analysis revealed that most of the American lineage ancestors leading to H7 emergences via recombination shared the same viral RNA (vRNA) structure topology at the HA1/HA2 boundary region. Our study thus identified predicted secondary RNA structures present in the HA of H7 viruses, which could promote genetic recombination and acquisition of a multibasic cleavage site motif (MBCS).

18.
BMC Plant Biol ; 10: 283, 2010 Dec 20.
Artigo em Inglês | MEDLINE | ID: mdl-21171996

RESUMO

BACKGROUND: Short interstitial telomere motifs (telo boxes) are short sequences identical to plant telomere repeat units. They are observed within the 5' region of several genes over-expressed in cycling cells. In synergy with various cis-acting elements, these motifs participate in the activation of expression. Here, we have analysed the distribution of telo boxes within Arabidopsis thaliana and Oryza sativa genomes and their association with genes involved in the biogenesis of the translational apparatus. RESULTS: Our analysis showed that the distribution of the telo box (AAACCCTA) in different genomic regions of A. thaliana and O. sativa is not random. As is also the case for plant microsatellites, they are preferentially located in the 5' flanking regions of genes, mainly within the 5' UTR, and distributed as a gradient along the direction of transcription. As previously reported in Arabidopsis, a conserved topological association of telo boxes with site II or TEF cis-acting elements is observed in almost all promoters of genes encoding ribosomal proteins in O. sativa. Such a conserved promoter organization can be found in other genes involved in the biogenesis of the translational machinery including rRNA processing proteins and snoRNAs. Strikingly, the association of telo boxes with site II motifs or TEF boxes is conserved in promoters of genes harbouring snoRNA clusters nested within an intron as well as in the 5' flanking regions of non-intronic snoRNA genes. Thus, the search for associations between telo boxes and site II motifs or TEF box in plant genomes could provide a useful tool for characterizing new cryptic RNA pol II promoters. CONCLUSIONS: The data reported in this work support the model previously proposed for the spreading of telo boxes within plant genomes and provide new insights into a putative process for the acquisition of microsatellites in plants. The association of telo boxes with site II or TEF cis-acting elements appears to be an essential feature of plant genes involved in the biogenesis of ribosomes and clearly indicates that most plant snoRNAs are RNA pol II products.


Assuntos
Arabidopsis/genética , Genoma de Planta/genética , Oryza/genética , Telômero/genética , Regiões 3' não Traduzidas/genética , Região 5'-Flanqueadora/genética , Regiões 5' não Traduzidas/genética , Sequência de Bases , Regulação da Expressão Gênica de Plantas , Genes de Plantas/genética , Dados de Sequência Molecular , Regiões Promotoras Genéticas/genética , Sequências Reguladoras de Ácido Nucleico/genética , Proteínas Ribossômicas/genética
19.
PLoS One ; 15(5): e0231738, 2020.
Artigo em Inglês | MEDLINE | ID: mdl-32463818

RESUMO

High-throughput sequencing makes it possible to provide the genome-wide distribution of small non coding RNAs in a single experiment, and contributed greatly to the identification and understanding of these RNAs in the last decade. Small non coding RNAs gather a wide collection of classes, such as microRNAs, tRNA-derived fragments, small nucleolar RNAs and small nuclear RNAs, to name a few. As usual in RNA-seq studies, the sequencing step is followed by a feature quantification step: when a genome is available, the reads are aligned to the genome, their genomic positions are compared to the already available annotations, and the corresponding features are quantified. However, problem arises when many reads map at several positions and while different strategies exist to circumvent this problem, all of them are biased. In this article, we present a new strategy that compares all the reads that map at several positions, and their annotations when available. In many cases, all the hits co-localize with the same feature annotation (a duplicated miRNA or a duplicated gene, for instance). When different annotations exist for a given read, we propose to merge existing features and provide the counts for the merged features. This new strategy has been implemented in a tool, mmannot, freely available at https://github.com/mzytnicki/mmannot.


Assuntos
Pequeno RNA não Traduzido/genética , Análise de Sequência de RNA/métodos , Software , Genômica , Sequenciamento de Nucleotídeos em Larga Escala , MicroRNAs/genética , Anotação de Sequência Molecular
20.
Front Microbiol ; 10: 2701, 2019.
Artigo em Inglês | MEDLINE | ID: mdl-31824468

RESUMO

CRISPR/Cas systems provide adaptive defense mechanisms against invading nucleic acids in prokaryotes. Because of its interest as a genetic tool, the Type II CRISPR/Cas9 system from Streptococcus pyogenes has been extensively studied. It includes the Cas9 endonuclease that is dependent on a dual-guide RNA made of a tracrRNA and a crRNA. Target recognition relies on crRNA annealing and the presence of a protospacer adjacent motif (PAM). Mollicutes are currently the bacteria with the smallest genome in which CRISPR/Cas systems have been reported. Many of them are pathogenic to humans and animals (mycoplasmas and ureaplasmas) or plants (phytoplasmas and some spiroplasmas). A global survey was conducted to identify and compare CRISPR/Cas systems found in the genome of these minimal bacteria. Complete or degraded systems classified as Type II-A and less frequently as Type II-C were found in the genome of 21 out of 52 representative mollicutes species. Phylogenetic reconstructions predicted a common origin of all CRISPR/Cas systems of mycoplasmas and at least two origins were suggested for spiroplasmas systems. Cas9 in mollicutes were structurally related to the S. aureus Cas9 except the PI domain involved in the interaction with the PAM, suggesting various PAM might be recognized by Cas9 of different mollicutes. Structure of the predicted crRNA/tracrRNA hybrids was conserved and showed typical stem-loop structures pairing the Direct Repeat part of crRNAs with the 5' region of tracrRNAs. Most mollicutes crRNA/tracrRNAs showed G + C% significantly higher than the genome, suggesting a selective pressure for maintaining stability of these secondary structures. Examples of CRISPR spacers matching with mollicutes phages were found, including the textbook case of Mycoplasma cynos strain C142 having no prophage sequence but a CRISPR/Cas system with spacers targeting prophage sequences that were found in the genome of another M. cynos strain that is devoid of a CRISPR system. Despite their small genome size, mollicutes have maintained protective means against invading DNAs, including restriction/modification and CRISPR/Cas systems. The apparent lack of CRISPR/Cas systems in several groups of species including main pathogens of humans, ruminants, and plants suggests different evolutionary routes or a lower risk of phage infection in specific ecological niches.

SELEÇÃO DE REFERÊNCIAS
DETALHE DA PESQUISA