Your browser doesn't support javascript.
loading
Mostrar: 20 | 50 | 100
Resultados 1 - 4 de 4
Filtrar
Mais filtros








Base de dados
Intervalo de ano de publicação
1.
Bioinformatics ; 30(3): 392-7, 2014 Feb 01.
Artigo em Inglês | MEDLINE | ID: mdl-24273243

RESUMO

MOTIVATION: MicroRNAs (miRNAs) are a highly abundant class of non-coding RNA genes involved in cellular regulation and thus also diseases. Despite miRNAs being important disease factors, miRNA-disease associations remain low in number and of variable reliability. Furthermore, existing databases and prediction methods do not explicitly facilitate forming hypotheses about the possible molecular causes of the association, thereby making the path to experimental follow-up longer. RESULTS: Here we present miRPD in which miRNA-Protein-Disease associations are explicitly inferred. Besides linking miRNAs to diseases, it directly suggests the underlying proteins involved, which can be used to form hypotheses that can be experimentally tested. The inference of miRNAs and diseases is made by coupling known and predicted miRNA-protein associations with protein-disease associations text mined from the literature. We present scoring schemes that allow us to rank miRNA-disease associations inferred from both curated and predicted miRNA targets by reliability and thereby to create high- and medium-confidence sets of associations. Analyzing these, we find statistically significant enrichment for proteins involved in pathways related to cancer and type I diabetes mellitus, suggesting either a literature bias or a genuine biological trend. We show by example how the associations can be used to extract proteins for disease hypothesis. AVAILABILITY AND IMPLEMENTATION: All datasets, software and a searchable Web site are available at http://mirpd.jensenlab.org.


Assuntos
Doença/genética , MicroRNAs/metabolismo , Proteínas/metabolismo , Software , Diabetes Mellitus/genética , Humanos
2.
Sci Rep ; 3: 2468, 2013.
Artigo em Inglês | MEDLINE | ID: mdl-24002378

RESUMO

Molecular detection of viruses has been aided by high-throughput sequencing, permitting the genomic characterization of emerging strains. In this study, we comprehensively screened 500 respiratory secretions from children with upper and/or lower respiratory tract infections for viral pathogens. The viruses detected are described, including a divergent human parainfluenza virus type 4 from GS FLX pyrosequencing of 92 specimens. Complete full-genome characterization of the virus followed, using Single Molecule, Real-Time (SMRT) sequencing. Subsequent "primer walking" combined with Sanger sequencing validated the RS platform's utility in viral sequencing from complex clinical samples. Comparative genomics reveals the divergent strain clusters with the only completely sequenced HPIV4a subtype. However, it also exhibits various structural features present in one of the HPIV4b reference strains, opening questions regarding their lifecycle and evolutionary relationships among these viruses. Clinical data from patients infected with the strain, as well as viral prevalence estimates using real-time PCR, is also described.


Assuntos
Metagenômica , Vírus da Parainfluenza 4 Humana/genética , Infecções Respiratórias/virologia , Sequência de Bases , Variação Genética , Genoma Viral , Sequenciamento de Nucleotídeos em Larga Escala , Humanos , Metagenômica/métodos , Dados de Sequência Molecular , Fases de Leitura Aberta , Vírus da Parainfluenza 4 Humana/classificação , Vírus da Parainfluenza 4 Humana/isolamento & purificação , Filogenia , Prevalência , Infecções Respiratórias/epidemiologia , Alinhamento de Sequência
3.
Bioinformatics ; 28(5): 636-42, 2012 Mar 01.
Artigo em Inglês | MEDLINE | ID: mdl-22215819

RESUMO

MOTIVATION: Probabilistic logic programming offers a powerful way to describe and evaluate structured statistical models. To investigate the practicality of probabilistic logic programming for structure learning in bioinformatics, we undertook a simplified bacterial gene-finding benchmark in PRISM, a probabilistic dialect of Prolog. RESULTS: We evaluate Hidden Markov Model structures for bacterial protein-coding gene potential, including a simple null model structure, three structures based on existing bacterial gene finders and two novel model structures. We test standard versions as well as ADPH length modeling and three-state versions of the five model structures. The models are all represented as probabilistic logic programs and evaluated using the PRISM machine learning system in terms of statistical information criteria and gene-finding prediction accuracy, in two bacterial genomes. Neither of our implementations of the two currently most used model structures are best performing in terms of statistical information criteria or prediction performances, suggesting that better-fitting models might be achievable. AVAILABILITY: The source code of all PRISM models, data and additional scripts are freely available for download at: http://github.com/somork/codonhmm. SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online.


Assuntos
Bacillus subtilis/genética , Escherichia coli/genética , Genes Bacterianos , Cadeias de Markov , Modelos Genéticos , Algoritmos , Inteligência Artificial , Modelos Estatísticos , Linguagens de Programação , Alinhamento de Sequência
4.
Mol Biol Evol ; 24(4): 909-17, 2007 Apr.
Artigo em Inglês | MEDLINE | ID: mdl-17272679

RESUMO

Alternative splicing (AS) is an important contributor to proteome diversity and is regarded as an explanatory factor for the relatively low number of human genes compared with less complex animals. To assess the evolutionary conservation of AS and its developmental regulation, we have investigated the qualitative and quantitative expression of 21 orthologous alternative splice events through the development of 2 nematode species separated by 85-110 Myr of evolutionary time. We demonstrate that most of these alternative splice events present in Caenorhabditis elegans are conserved in Caenorhabditis briggsae. Moreover, we find that relative isoform expression levels vary significantly during development for 78% of the AS events and that this quantitative variation is highly conserved between the 2 species. Our results suggest that AS is generally tightly regulated through development and that the regulatory mechanisms controlling AS are to a large extent conserved during the evolution of Caenorhabditis. This strong conservation indicates that both major and minor splice forms have important functional roles and that the relative quantities in which they are expressed are crucial. Our results therefore suggest that the quantitative regulation of isoform expression levels is an intrinsic part of most AS events. Moreover, our results indicate that AS contributes little to transcript variation in Caenorhabditis genes and that gene duplication may be the major evolutionary mechanism for the origin of novel transcripts in these 2 species.


Assuntos
Processamento Alternativo , Caenorhabditis elegans/genética , Caenorhabditis/genética , Animais , Caenorhabditis/crescimento & desenvolvimento , Caenorhabditis elegans/crescimento & desenvolvimento , Proteínas de Caenorhabditis elegans/genética , DNA Complementar/química , DNA Complementar/genética , Evolução Molecular , Regulação da Expressão Gênica no Desenvolvimento , Genes de Helmintos/genética , Dados de Sequência Molecular , Isoformas de Proteínas/genética , Reação em Cadeia da Polimerase Via Transcriptase Reversa , Análise de Sequência de DNA
SELEÇÃO DE REFERÊNCIAS
DETALHE DA PESQUISA