Your browser doesn't support javascript.
loading
Mostrar: 20 | 50 | 100
Resultados 1 - 20 de 97
Filtrar
1.
Bioinformatics ; 2024 Sep 10.
Artigo em Inglês | MEDLINE | ID: mdl-39254573

RESUMO

MOTIVATION: Antibiotic resistance has emerged as a major global health threat, with an increasing number of bacterial infections becoming difficult to treat. Predicting the underlying resistance mechanisms of antibiotic resistance genes (ARGs) is crucial for understanding and combating this problem. However, existing methods struggle to accurately predict resistance mechanisms for ARGs with low similarity to known sequences and lack sufficient interpretability of the prediction models. RESULTS: In this study, we present a novel approach for predicting ARG resistance mechanisms using ProteinBERT, a protein language model based on deep learning. Our method outperforms state-of-the-art techniques on diverse ARG datasets, including those with low homology to the training data, highlighting its potential for predicting the resistance mechanisms of unknown ARGs. Attention analysis of the model reveals that it considers biologically relevant features, such as conserved amino acid residues and antibiotic target binding sites, when making predictions. These findings provide valuable insights into the molecular basis of antibiotic resistance and demonstrate the interpretability of protein language models, offering a new perspective on their application in bioinformatics. AVAILABILITY: The source code is available for free at https://github.com/hmdlab/ARG-BERT. The output results of the model are published at https://waseda.box.com/v/ARG-BERT-suppl. SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online.

2.
J Clin Invest ; 134(17)2024 Sep 03.
Artigo em Inglês | MEDLINE | ID: mdl-38954486

RESUMO

The progression of kidney disease varies among individuals, but a general methodology to quantify disease timelines is lacking. Particularly challenging is the task of determining the potential for recovery from acute kidney injury following various insults. Here, we report that quantitation of post-transcriptional adenosine-to-inosine (A-to-I) RNA editing offers a distinct genome-wide signature, enabling the delineation of disease trajectories in the kidney. A well-defined murine model of endotoxemia permitted the identification of the origin and extent of A-to-I editing, along with temporally discrete signatures of double-stranded RNA stress and adenosine deaminase isoform switching. We found that A-to-I editing of antizyme inhibitor 1 (AZIN1), a positive regulator of polyamine biosynthesis, serves as a particularly useful temporal landmark during endotoxemia. Our data indicate that AZIN1 A-to-I editing, triggered by preceding inflammation, primes the kidney and activates endogenous recovery mechanisms. By comparing genetically modified human cell lines and mice locked in either A-to-I-edited or uneditable states, we uncovered that AZIN1 A-to-I editing not only enhances polyamine biosynthesis but also engages glycolysis and nicotinamide biosynthesis to drive the recovery phenotype. Our findings implicate that quantifying AZIN1 A-to-I editing could potentially identify individuals who have transitioned to an endogenous recovery phase. This phase would reflect their past inflammation and indicate their potential for future recovery.


Assuntos
Adenosina , Inosina , Edição de RNA , Animais , Camundongos , Inosina/metabolismo , Inosina/genética , Adenosina/metabolismo , Adenosina/genética , Humanos , Rim/metabolismo , Rim/patologia , Injúria Renal Aguda/metabolismo , Injúria Renal Aguda/genética , Injúria Renal Aguda/patologia , Endotoxemia/metabolismo , Endotoxemia/genética , Endotoxemia/patologia , Inflamação/metabolismo , Inflamação/genética , Inflamação/patologia , Adenosina Desaminase/metabolismo , Adenosina Desaminase/genética , Proteínas de Transporte/metabolismo , Proteínas de Transporte/genética , Masculino
3.
J Biochem ; 176(3): 205-215, 2024 Sep 03.
Artigo em Inglês | MEDLINE | ID: mdl-38740386

RESUMO

The viral infectivity factor (Vif) of human immunodeficiency virus 1 forms a complex with host proteins, designated as Vif-CBFß-ELOB-ELOC-CUL5 (VßBCC), initiating the ubiquitination and subsequent proteasomal degradation of the human antiviral protein APOBEC3G (A3G), thereby negating its antiviral function. Whilst recent cryo-electron microscopy (cryo-EM) studies have implicated RNA molecules in the Vif-A3G interaction that leads to A3G ubiquitination, our findings indicated that the VßBCC complex can also directly impede A3G-mediated DNA deamination, bypassing the proteasomal degradation pathway. Employing the Systematic Evolution of Ligands by EXponential enrichment (SELEX) method, we have identified RNA aptamers with high affinity for the VßBCC complex. These aptamers not only bind to the VßBCC complex but also reinstate A3G's DNA deamination activity by inhibiting the complex's function. Moreover, we delineated the sequences and secondary structures of these aptamers, providing insights into the mechanistic aspects of A3G inhibition by the VßBCC complex. Analysis using selected aptamers will enhance our understanding of the inhibition of A3G by the VßBCC complex, offering potential avenues for therapeutic intervention.


Assuntos
Aptâmeros de Nucleotídeos , Produtos do Gene vif do Vírus da Imunodeficiência Humana , Aptâmeros de Nucleotídeos/química , Aptâmeros de Nucleotídeos/metabolismo , Humanos , Produtos do Gene vif do Vírus da Imunodeficiência Humana/metabolismo , Produtos do Gene vif do Vírus da Imunodeficiência Humana/química , Desaminase APOBEC-3G/metabolismo , Desaminase APOBEC-3G/genética , Desaminase APOBEC-3G/química , Subunidade beta de Fator de Ligação ao Core/metabolismo , Subunidade beta de Fator de Ligação ao Core/química , Técnica de Seleção de Aptâmeros , HIV-1/metabolismo , Proteínas Culina
4.
Gene ; 917: 148464, 2024 Jul 30.
Artigo em Inglês | MEDLINE | ID: mdl-38615981

RESUMO

Cells sense, respond, and adapt to environmental conditions that cause stress. In a previous study using HeLa cells, we isolated reporter cells responding to the endoplasmic reticulum (ER) stress inducers, thapsigargin and tunicamycin, using a highly sensitive promoter trap vector system. Splinkerette PCR and 5' rapid amplification of cDNA ends (5' RACE) identified a novel transcript that is upregulated by ER stress. Its endogenous expression increased approximately 10-fold in response to thapsigargin and tunicamycin within 1 h, but was down-regulated after 4 h. Because the transcript starts from an intron of a long noncoding RNA known as LINC-PINT, we designated the newly identified transcript TISPL (transcript induced by stressors from LINC-PINTlocus). TISPL was also expressed under several other stress conditions. It was particularly increased > 10-fold upon glucose starvation and 7-fold by arsenite exposure. Furthermore, in silico analyses, including a ChIP-atlas search, revealed that there is an ATF4-binding region with a c/ebp-Atf response element (CARE) downstream of the transcription start site of TISPL. Based on these results, we hypothesized that TISPL may be induced by the phospho-eIF2α and ATF4- axis of the integrated stress response pathway, which is known to be activated by the stress conditions listed above. As expected, knockout of ATF4 abolished the stress-induced upregulation of TISPL. Our results indicate that TISPL may be a useful biomarker for detecting stress conditions that activate ATF4. Our highly sensitive trap vector system proved beneficial in discovering new biomarkers.


Assuntos
Fator 4 Ativador da Transcrição , Estresse do Retículo Endoplasmático , RNA Longo não Codificante , Humanos , Fator 4 Ativador da Transcrição/genética , Fator 4 Ativador da Transcrição/metabolismo , Arsenitos/toxicidade , Arsenitos/farmacologia , Estresse do Retículo Endoplasmático/efeitos dos fármacos , Estresse do Retículo Endoplasmático/genética , Células HeLa , RNA Longo não Codificante/genética , RNA Longo não Codificante/metabolismo , Tapsigargina/farmacologia , Tunicamicina/farmacologia , Regulação para Cima
5.
Biochemistry ; 63(7): 906-912, 2024 Apr 02.
Artigo em Inglês | MEDLINE | ID: mdl-38457656

RESUMO

Optimization of aptamers in length and chemistry is crucial for industrial applications. Here, we developed aptamers against the SARS-CoV-2 spike protein and achieved optimization with a deep-learning-based algorithm, RaptGen. We conducted a primer-less SELEX against the receptor binding domain (RBD) of the spike with an RNA/DNA hybrid library, and the resulting sequences were subjected to RaptGen analysis. Based on the sequence profiling by RaptGen, a short truncation aptamer of 26 nucleotides was obtained and further optimized by a chemical modification of relevant nucleotides. The resulting aptamer is bound to RBD not only of SARS-CoV-2 wildtype but also of its variants, SARS-CoV-1, and Middle East respiratory syndrome coronavirus (MERS-CoV). We concluded that the RaptGen-assisted discovery is efficient for developing optimized aptamers.


Assuntos
Aptâmeros de Nucleotídeos , SARS-CoV-2 , Humanos , COVID-19/prevenção & controle , DNA , SARS-CoV-2/química , SARS-CoV-2/genética , Glicoproteína da Espícula de Coronavírus/genética , Glicoproteína da Espícula de Coronavírus/química
6.
Nat Methods ; 21(3): 435-443, 2024 Mar.
Artigo em Inglês | MEDLINE | ID: mdl-38238559

RESUMO

RNA engineering has immense potential to drive innovation in biotechnology and medicine. Despite its importance, a versatile platform for the automated design of functional RNA is still lacking. Here, we propose RNA family sequence generator (RfamGen), a deep generative model that designs RNA family sequences in a data-efficient manner by explicitly incorporating alignment and consensus secondary structure information. RfamGen can generate novel and functional RNA family sequences by sampling points from a semantically rich and continuous representation. We have experimentally demonstrated the versatility of RfamGen using diverse RNA families. Furthermore, we confirmed the high success rate of RfamGen in designing functional ribozymes through a quantitative massively parallel assay. Notably, RfamGen successfully generates artificial sequences with higher activity than natural sequences. Overall, RfamGen significantly improves our ability to design functional RNA and opens up new potential for generative RNA engineering in synthetic biology.


Assuntos
RNA Catalítico , Humanos , RNA Catalítico/genética , RNA Catalítico/química , RNA/genética , Biotecnologia , Biologia Sintética
7.
Comput Struct Biotechnol J ; 21: 5350-5357, 2023.
Artigo em Inglês | MEDLINE | ID: mdl-37954146

RESUMO

Recent advances in microbiome research have led to the further development of microbial interventions, such as probiotics and prebiotics, which are potential treatments for constipation. However, the effects of probiotics vary from person to person; therefore, the effectiveness of probiotics needs to be verified for each individual. Individuals showing significant effects of the target probiotic are called responders. A statistical model for the evaluation of responders was proposed in a previous study. However, the previous model does not consider the lag between intake and effect periods of the probiotic. It is expected that the lag exists when probiotics are administered and when they are effective. In this study, we propose a Bayesian statistical model to estimate the probability that a subject is a responder, by considering the lag between intake and effect periods. In synthetic dataset experiments, the proposed model was found to outperform the base model, which did not factor in the lag. Further, we found that the proposed model could distinguish responders showing large uncertainty in terms of the lag between intake and effect periods.

8.
bioRxiv ; 2023 Nov 09.
Artigo em Inglês | MEDLINE | ID: mdl-37986799

RESUMO

The progression of kidney disease varies among individuals, but a general methodology to quantify disease timelines is lacking. Particularly challenging is the task of determining the potential for recovery from acute kidney injury following various insults. Here, we report that quantitation of post-transcriptional adenosine-to-inosine (A-to-I) RNA editing offers a distinct genome-wide signature, enabling the delineation of disease trajectories in the kidney. A well-defined murine model of endotoxemia permitted the identification of the origin and extent of A-to-I editing, along with temporally discrete signatures of double-stranded RNA stress and Adenosine Deaminase isoform switching. We found that A-to-I editing of Antizyme Inhibitor 1 (AZIN1), a positive regulator of polyamine biosynthesis, serves as a particularly useful temporal landmark during endotoxemia. Our data indicate that AZIN1 A-to-I editing, triggered by preceding inflammation, primes the kidney and activates endogenous recovery mechanisms. By comparing genetically modified human cell lines and mice locked in either A-to-I edited or uneditable states, we uncovered that AZIN1 A-to-I editing not only enhances polyamine biosynthesis but also engages glycolysis and nicotinamide biosynthesis to drive the recovery phenotype. Our findings implicate that quantifying AZIN1 A-to-I editing could potentially identify individuals who have transitioned to an endogenous recovery phase. This phase would reflect their past inflammation and indicate their potential for future recovery.

9.
Front Bioinform ; 3: 1275787, 2023.
Artigo em Inglês | MEDLINE | ID: mdl-37881622

RESUMO

RNA accessibility is a useful RNA secondary structural feature for predicting RNA-RNA interactions and translation efficiency in prokaryotes. However, conventional accessibility calculation tools, such as Raccess, are computationally expensive and require considerable computational time to perform transcriptome-scale analysis. In this study, we developed DeepRaccess, which predicts RNA accessibility based on deep learning methods. DeepRaccess was trained to take artificial RNA sequences as input and to predict the accessibility of these sequences as calculated by Raccess. Simulation and empirical dataset analyses showed that the accessibility predicted by DeepRaccess was highly correlated with the accessibility calculated by Raccess. In addition, we confirmed that DeepRaccess could predict protein abundance in E.coli with moderate accuracy from the sequences around the start codon. We also demonstrated that DeepRaccess achieved tens to hundreds of times software speed-up in a GPU environment. The source codes and the trained models of DeepRaccess are freely available at https://github.com/hmdlab/DeepRaccess.

10.
Front Immunol ; 14: 1185322, 2023.
Artigo em Inglês | MEDLINE | ID: mdl-37614230

RESUMO

Primary sensory neurons regulate inflammatory processes in innervated regions through neuro-immune communication. However, how their immune-modulating functions are regulated in concert remains largely unknown. Here, we show that Neat1 long non-coding RNA (lncRNA) organizes the proinflammatory gene expressions in the dorsal root ganglion (DRG) in chronic intractable neuropathic pain in rats. Neat1 was abundantly expressed in the DRG and was upregulated after peripheral nerve injury. Neat1 overexpression in primary sensory neurons caused mechanical and thermal hypersensitivity, whereas its knockdown alleviated neuropathic pain. Bioinformatics analysis of comprehensive transcriptome changes indicated the inflammatory response was the most relevant function of genes upregulated through Neat1. Consistent with this, upregulation of proinflammatory genes in the DRG following nerve injury was suppressed by Neat1 knockdown. Expression changes of these proinflammatory genes were regulated through Neat1-mRNA interaction-dependent and -independent mechanisms. Notably, Neat1 increased proinflammatory genes by stabilizing its interacting mRNAs in neuropathic pain. Finally, Neat1 in primary sensory neurons contributed to spinal inflammatory processes that mediated peripheral neuropathic pain. These findings demonstrate that Neat1 lncRNA is a key regulator of neuro-immune communication in neuropathic pain.


Assuntos
Neuralgia , RNA Longo não Codificante , Traumatismos do Sistema Nervoso , Animais , Ratos , RNA Longo não Codificante/genética , Gânglios Espinais , Neuralgia/genética , RNA Mensageiro , Transcriptoma
11.
Nucleic Acids Res ; 51(15): 7820-7831, 2023 08 25.
Artigo em Inglês | MEDLINE | ID: mdl-37463833

RESUMO

Phase-separated membraneless organelles often contain RNAs that exhibit unusual semi-extractability using the conventional RNA extraction method, and can be efficiently retrieved by needle shearing or heating during RNA extraction. Semi-extractable RNAs are promising resources for understanding RNA-centric phase separation. However, limited assessments have been performed to systematically identify and characterize semi-extractable RNAs. In this study, 1074 semi-extractable RNAs, including ASAP1, DANT2, EXT1, FTX, IGF1R, LIMS1, NEAT1, PHF21A, PVT1, SCMH1, STRG.3024.1, TBL1X, TCF7L2, TVP23C-CDRT4, UBE2E2, ZCCHC7, ZFAND3 and ZSWIM6, which exhibited consistent semi-extractability were identified across five human cell lines. By integrating publicly available datasets, we found that semi-extractable RNAs tend to be distributed in the nuclear compartments but are dissociated from the chromatin. Long and repeat-containing semi-extractable RNAs act as hubs to provide global RNA-RNA interactions. Semi-extractable RNAs were divided into four groups based on their k-mer content. The NEAT1 group preferred to interact with paraspeckle proteins, such as FUS and NONO, implying that RNAs in this group are potential candidates of architectural RNAs that constitute nuclear bodies.


Assuntos
RNA Longo não Codificante , RNA , Humanos , Linhagem Celular , Núcleo Celular/metabolismo , Cromatina/metabolismo , Proteínas de Ligação a DNA/genética , RNA/isolamento & purificação , RNA Longo não Codificante/genética , RNA Longo não Codificante/metabolismo
12.
Commun Biol ; 6(1): 631, 2023 06 10.
Artigo em Inglês | MEDLINE | ID: mdl-37301950

RESUMO

Mammalian brains have evolved in stages over a long history to acquire higher functions. Recently, several transposable element (TE) families have been shown to evolve into cis-regulatory elements of brain-specific genes. However, it is not fully understood how TEs are important for gene regulatory networks. Here, we performed a single-cell level analysis using public data of scATAC-seq to discover TE-derived cis-elements that are important for specific cell types. Our results suggest that DNA elements derived from TEs, MER130 and MamRep434, can function as transcription factor-binding sites based on their internal motifs for Neurod2 and Lhx2, respectively, especially in glutamatergic neuronal progenitors. Furthermore, MER130- and MamRep434-derived cis-elements were amplified in the ancestors of Amniota and Eutheria, respectively. These results suggest that the acquisition of cis-elements with TEs occurred in different stages during evolution and may contribute to the acquisition of different functions or morphologies in the brain.


Assuntos
Elementos de DNA Transponíveis , Redes Reguladoras de Genes , Humanos , Animais , Elementos de DNA Transponíveis/genética , Vertebrados/genética , Ligação Proteica , Mamíferos/genética , Encéfalo
13.
Nat Genet ; 55(6): 939-951, 2023 06.
Artigo em Inglês | MEDLINE | ID: mdl-37169872

RESUMO

Mobile genetic elements (MEs) are heritable mutagens that recursively generate structural variants (SVs). ME variants (MEVs) are difficult to genotype and integrate in statistical genetics, obscuring their impact on genome diversification and traits. We developed a tool that accurately genotypes MEVs using short-read whole-genome sequencing (WGS) and applied it to global human populations. We find unexpected population-specific MEV differences, including an Alu insertion distribution distinguishing Japanese from other populations. Integrating MEVs with expression quantitative trait loci (eQTL) maps shows that MEV classes regulate tissue-specific gene expression by shared mechanisms, including creating or attenuating enhancers and recruiting post-transcriptional regulators, supporting class-wide interpretability. MEVs more often associate with gene expression changes than SNVs, thus plausibly impacting traits. Performing genome-wide association study (GWAS) with MEVs pinpoints potential causes of disease risk, including a LINE-1 insertion associated with keloid and fasciitis. This work implicates MEVs as drivers of human divergence and disease risk.


Assuntos
Estudo de Associação Genômica Ampla , Polimorfismo de Nucleotídeo Único , Humanos , Regulação da Expressão Gênica , Locos de Características Quantitativas , Fenótipo
14.
Brief Bioinform ; 24(4)2023 07 20.
Artigo em Inglês | MEDLINE | ID: mdl-37232359

RESUMO

Computational analysis of RNA sequences constitutes a crucial step in the field of RNA biology. As in other domains of the life sciences, the incorporation of artificial intelligence and machine learning techniques into RNA sequence analysis has gained significant traction in recent years. Historically, thermodynamics-based methods were widely employed for the prediction of RNA secondary structures; however, machine learning-based approaches have demonstrated remarkable advancements in recent years, enabling more accurate predictions. Consequently, the precision of sequence analysis pertaining to RNA secondary structures, such as RNA-protein interactions, has also been enhanced, making a substantial contribution to the field of RNA biology. Additionally, artificial intelligence and machine learning are also introducing technical innovations in the analysis of RNA-small molecule interactions for RNA-targeted drug discovery and in the design of RNA aptamers, where RNA serves as its own ligand. This review will highlight recent trends in the prediction of RNA secondary structure, RNA aptamers and RNA drug discovery using machine learning, deep learning and related technologies, and will also discuss potential future avenues in the field of RNA informatics.


Assuntos
Aptâmeros de Nucleotídeos , Aprendizado Profundo , Inteligência Artificial , RNA/genética , Aprendizado de Máquina , Descoberta de Drogas/métodos , Informática
15.
Comput Struct Biotechnol J ; 21: 1774-1784, 2023.
Artigo em Inglês | MEDLINE | ID: mdl-36874163

RESUMO

The coronavirus disease-2019 (COVID-19) pandemic has elucidated major limitations in the capacity of medical and research institutions to appropriately manage emerging infectious diseases. We can improve our understanding of infectious diseases by unveiling virus-host interactions through host range prediction and protein-protein interaction prediction. Although many algorithms have been developed to predict virus-host interactions, numerous issues remain to be solved, and the entire network remains veiled. In this review, we comprehensively surveyed algorithms used to predict virus-host interactions. We also discuss the current challenges, such as dataset biases toward highly pathogenic viruses, and the potential solutions. The complete prediction of virus-host interactions remains difficult; however, bioinformatics can contribute to progress in research on infectious diseases and human health.

16.
Methods Mol Biol ; 2586: 163-173, 2023.
Artigo em Inglês | MEDLINE | ID: mdl-36705904

RESUMO

The computational prediction of RNA-RNA interactions has long been studied in RNA informatics. Most of the existing approaches focused on the interaction prediction of short RNAs in small datasets. However, in recent years, two fast prediction methods, RIsearch2 and RIblast, have been developed to predict transcriptome-scale interactions or long RNA interactions. The key idea of the software acceleration of these tools was the integration of a seed-and-extend method, which is used in fast sequence alignment tools, into RNA-RNA interaction prediction. As a result, the two software programs were ten to a thousand times faster than the existing tools; because of this acceleration, detection of genome-wide microRNA target sites or interaction partners of function-unknown long noncoding RNAs has become possible. In this review, we describe the basic concept of the algorithm, its applications, and the future perspectives of the fast RNA-RNA interaction prediction tools.


Assuntos
MicroRNAs , RNA Longo não Codificante , Transcriptoma , Software , MicroRNAs/genética , Algoritmos , RNA Longo não Codificante/genética , Biologia Computacional/métodos
17.
Methods Mol Biol ; 2586: 175-195, 2023.
Artigo em Inglês | MEDLINE | ID: mdl-36705905

RESUMO

Non-coding RNAs have various biological functions such as translational regulation, and RNA-RNA interactions play essential roles in the mechanisms of action of these RNAs. Therefore, RNA-RNA interaction prediction is an important problem in bioinformatics, and many tools have been developed for the computational prediction of RNA-RNA interactions. In addition to the development of novel algorithms with high accuracy, the development and maintenance of web services is essential for enhancing usability by experimental biologists. In this review, we survey web services for RNA-RNA interaction predictions and introduce how to use primary web services. We present various prediction tools, including general interaction prediction tools, prediction tools for specific RNA classes, and RNA-RNA interaction-based RNA design tools. Additionally, we discuss the future perspectives of the development of RNA-RNA interaction prediction tools and the sustainability of web services.


Assuntos
MicroRNAs , RNA , RNA/genética , Algoritmos , Biologia Computacional , MicroRNAs/genética
18.
NAR Genom Bioinform ; 4(4): lqac092, 2022 Dec.
Artigo em Inglês | MEDLINE | ID: mdl-36465498

RESUMO

Long-read sequencers, such as Pacific Biosciences (PacBio) and Oxford Nanopore Technologies (ONT) sequencers, have improved their read length and accuracy, thereby opening up unprecedented research. Many tools and algorithms have been developed to analyze long reads, and rapid progress in PacBio and ONT has further accelerated their development. Together with the development of high-throughput sequencing technologies and their analysis tools, many read simulators have been developed and effectively utilized. PBSIM is one of the popular long-read simulators. In this study, we developed PBSIM3 with three new functions: error models for long reads, multi-pass sequencing for high-fidelity read simulation and transcriptome sequencing simulation. Therefore, PBSIM3 is now able to meet a wide range of long-read simulation requirements.

19.
Nucleic Acids Res ; 50(19): 11229-11242, 2022 10 28.
Artigo em Inglês | MEDLINE | ID: mdl-36259651

RESUMO

Non-coding RNAs (ncRNAs) ubiquitously exist in normal and cancer cells. Despite their prevalent distribution, the functions of most long ncRNAs remain uncharacterized. The fission yeast Schizosaccharomyces pombe expresses >1800 ncRNAs annotated to date, but most unconventional ncRNAs (excluding tRNA, rRNA, snRNA and snoRNA) remain uncharacterized. To discover the functional ncRNAs, here we performed a combinatory screening of computational and biological tests. First, all S. pombe ncRNAs were screened in silico for those showing conservation in sequence as well as in secondary structure with ncRNAs in closely related species. Almost a half of the 151 selected conserved ncRNA genes were uncharacterized. Twelve ncRNA genes that did not overlap with protein-coding sequences were next chosen for biological screening that examines defects in growth or sexual differentiation, as well as sensitivities to drugs and stresses. Finally, we highlighted an ncRNA transcribed from SPNCRNA.1669, which inhibited untimely initiation of sexual differentiation. A domain that was predicted as conserved secondary structure by the computational operations was essential for the ncRNA to function. Thus, this study demonstrates that in silico selection focusing on conservation of the secondary structure over species is a powerful method to pinpoint novel functional ncRNAs.


Assuntos
Schizosaccharomyces , Schizosaccharomyces/genética , Diferenciação Sexual , RNA não Traduzido/genética , RNA não Traduzido/química , RNA Nucleolar Pequeno/genética , Fases de Leitura Aberta
20.
Methods Mol Biol ; 2509: 315-340, 2022.
Artigo em Inglês | MEDLINE | ID: mdl-35796972

RESUMO

With a large number of annotated non-coding RNAs (ncRNAs), repetitive sequences are found to constitute functional components (termed as repetitive elements) in ncRNAs that perform specific biological functions. Bioinformatics analysis is a powerful tool for improving our understanding of the role of repetitive elements in ncRNAs. This chapter summarizes recent findings that reveal the role of repetitive elements in ncRNAs. Furthermore, relevant bioinformatics approaches are systematically reviewed, which promises to provide valuable resources for studying the functional impact of repetitive elements on ncRNAs.


Assuntos
Biologia Computacional , RNA não Traduzido , RNA não Traduzido/genética , Sequências Repetitivas de Ácido Nucleico/genética
SELEÇÃO DE REFERÊNCIAS
DETALHE DA PESQUISA