Your browser doesn't support javascript.
loading
Mostrar: 20 | 50 | 100
Resultados 1 - 4 de 4
Filtrar
Más filtros












Base de datos
Intervalo de año de publicación
1.
OMICS ; 17(9): 486-93, 2013 Sep.
Artículo en Inglés | MEDLINE | ID: mdl-23808606

RESUMEN

MicroRNAs play important roles in most biological processes, including cell proliferation, tissue differentiation, and embryonic development, among others. They originate from precursor transcripts (pre-miRNAs), which contain phylogenetically conserved stem-loop structures. An important bioinformatics problem is to distinguish the pre-miRNAs from pseudo pre-miRNAs that have similar stem-loop structures. We present here a novel method for tackling this bioinformatics problem. Our method, named MirID, accepts an RNA sequence as input, and classifies the RNA sequence either as positive (i.e., a real pre-miRNA) or as negative (i.e., a pseudo pre-miRNA). MirID employs a feature mining algorithm for finding combinations of features suitable for building pre-miRNA classification models. These models are implemented using support vector machines, which are combined to construct a classifier ensemble. The accuracy of the classifier ensemble is further enhanced by the utilization of an AdaBoost algorithm. When compared with two closely related tools on twelve species analyzed with these tools, MirID outperforms the existing tools on the majority of the twelve species. MirID was also tested on nine additional species, and the results showed high accuracies on the nine species. The MirID web server is fully operational and freely accessible at http://bioinformatics.njit.edu/MirID/ . Potential applications of this software in genomics and medicine are also discussed.


Asunto(s)
Biología Computacional , Minería de Datos , MicroARNs/clasificación , Precursores del ARN/clasificación , Programas Informáticos , Algoritmos , Animales , Biología Computacional/métodos , Minería de Datos/métodos , Bases de Datos de Ácidos Nucleicos , Humanos , Internet , MicroARNs/química , MicroARNs/genética , Precursores del ARN/química , Precursores del ARN/genética , Reproducibilidad de los Resultados , Sensibilidad y Especificidad
2.
Recent Pat DNA Gene Seq ; 7(2): 115-22, 2013 Aug.
Artículo en Inglés | MEDLINE | ID: mdl-22974261

RESUMEN

Motif finding in DNA, RNA and proteins plays an important role in life science research. Recent patents concerning motif finding in biomolecular data are recorded in the DNA Patent Database which serves as a resource for policy makers and members of the general public interested in fields like genomics, genetics and biotechnology. In this paper, we present a computational approach to mining for RNA tertiary motifs in genomic sequences. Specifically, we describe a method, named CSminer, and show, as a case study, the application of CSminer to genome-wide search for coaxial helical stackings in RNA 3-way junctions. A coaxial helical stacking occurs in an RNA 3-way junction where two separate helical elements form a pseudocontiguous helix and provide thermodynamic stability to the RNA molecule as a whole. Experimental results demonstrate the effectiveness of our approach.


Asunto(s)
Biología Computacional , ARN/química , Secuencia de Bases , Cromosomas de Archaea/genética , Haloarcula/genética , Conformación de Ácido Nucleico , Motivos de Nucleótidos , Patentes como Asunto
3.
Nucleic Acids Res ; 40(2): 487-98, 2012 Jan.
Artículo en Inglés | MEDLINE | ID: mdl-21917853

RESUMEN

RNA junctions are important structural elements that form when three or more helices come together in space in the tertiary structures of RNA molecules. Determining their structural configuration is important for predicting RNA 3D structure. We introduce a computational method to predict, at the secondary structure level, the coaxial helical stacking arrangement in junctions, as well as classify the junction topology. Our approach uses a data mining approach known as random forests, which relies on a set of decision trees trained using length, sequence and other variables specified for any given junction. The resulting protocol predicts coaxial stacking within three- and four-way junctions with an accuracy of 81% and 77%, respectively; the accuracy increases to 83% and 87%, respectively, when knowledge from the junction family type is included. Coaxial stacking predictions for the five to ten-way junctions are less accurate (60%) due to sparse data available for training. Additionally, our application predicts the junction family with an accuracy of 85% for three-way junctions and 74% for four-way junctions. Comparisons with other methods, as well applications to unsolved RNAs, are also presented. The web server Junction-Explorer to predict junction topologies is freely available at: http://bioinformatics.njit.edu/junction.


Asunto(s)
Árboles de Decisión , ARN Bicatenario/química , Algoritmos , Biología Computacional/métodos , Minería de Datos , Modelos Moleculares , Conformación de Ácido Nucleico
4.
BMC Genomics ; 9: 189, 2008 Apr 25.
Artículo en Inglés | MEDLINE | ID: mdl-18439287

RESUMEN

BACKGROUND: UnTranslated Regions (UTRs) of mRNAs contain regulatory elements for various aspects of mRNA metabolism, such as mRNA localization, translation, and mRNA stability. Several RNA stem-loop structures in UTRs have been experimentally identified, including the histone 3' UTR stem-loop structure (HSL3) and iron response element (IRE). These stem-loop structures are conserved among mammalian orthologs, and exist in a group of genes encoding proteins involved in the same biological pathways. It is not known to what extent RNA structures like these exist in all mammalian UTRs. RESULTS: In this paper we took a systematic approach, named GLEAN-UTR, to identify small stem-loop RNA structure elements in UTRs that are conserved between human and mouse orthologs and exist in multiple genes with common Gene Ontology terms. This approach resulted in 90 distinct RNA structure groups containing 748 structures, with HSL3 and IRE among the top hits based on conservation of structure. CONCLUSION: Our result indicates that there may exist many conserved stem-loop structures in mammalian UTRs that are involved in coordinate post-transcriptional regulation of biological pathways.


Asunto(s)
ARN Mensajero/química , ARN Mensajero/genética , Regiones no Traducidas , Animales , Secuencia de Bases , Análisis por Conglomerados , Secuencia Conservada , Bases de Datos de Ácidos Nucleicos , Humanos , Ratones , Alineación de Secuencia , Diseño de Software , Especificidad de la Especie
SELECCIÓN DE REFERENCIAS
DETALLE DE LA BÚSQUEDA
...