Your browser doesn't support javascript.
loading
Mostrar: 20 | 50 | 100
Resultados 1 - 3 de 3
Filtrar
1.
J Med Primatol ; 43(5): 317-28, 2014 Oct.
Artigo em Inglês | MEDLINE | ID: mdl-24810475

RESUMO

BACKGROUND: The genome annotations of rhesus (Macaca mulatta) and cynomolgus (Macaca fascicularis) macaques, two of the most common non-human primate animal models, are limited. METHODS: We analyzed large-scale macaque RNA-based next-generation sequencing (RNAseq) data to identify un-annotated macaque transcripts. RESULTS: For both macaque species, we uncovered thousands of novel isoforms for annotated genes and thousands of un-annotated intergenic transcripts enriched with non-coding RNAs. We also identified thousands of transcript sequences which are partially or completely 'missing' from current macaque genome assemblies. We showed that many newly identified transcripts were differentially expressed during SIV infection of rhesus macaques or during Ebola virus infection of cynomolgus macaques. CONCLUSIONS: For two important macaque species, we uncovered thousands of novel isoforms and un-annotated intergenic transcripts including coding and non-coding RNAs, polyadenylated and non-polyadenylated transcripts. This resource will greatly improve future macaque studies, as demonstrated by their applications in infectious disease studies.


Assuntos
Doença pelo Vírus Ebola/genética , Macaca fascicularis , Macaca mulatta , Doenças dos Macacos/genética , Síndrome de Imunodeficiência Adquirida dos Símios/genética , Transcriptoma , Animais , Ebolavirus/fisiologia , Doença pelo Vírus Ebola/virologia , Sequenciamento de Nucleotídeos em Larga Escala , Índia , Maurício , Dados de Sequência Molecular , Doenças dos Macacos/virologia , RNA não Traduzido/genética , RNA não Traduzido/metabolismo , Análise de Sequência de RNA , Síndrome de Imunodeficiência Adquirida dos Símios/virologia , Vírus da Imunodeficiência Símia/fisiologia
2.
Bioinformatics ; 22(1): 35-9, 2006 Jan 01.
Artigo em Inglês | MEDLINE | ID: mdl-16267089

RESUMO

MOTIVATION: Non-coding RNAs (ncRNAs) are functional RNA molecules that do not code for proteins. Covariance Models (CMs) are a useful statistical tool to find new members of an ncRNA gene family in a large genome database, using both sequence and, importantly, RNA secondary structure information. Unfortunately, CM searches are extremely slow. Previously, we created rigorous filters, which provably sacrifice none of a CM's accuracy, while making searches significantly faster for virtually all ncRNA families. However, these rigorous filters make searches slower than heuristics could be. RESULTS: In this paper we introduce profile HMM-based heuristic filters. We show that their accuracy is usually superior to heuristics based on BLAST. Moreover, we compared our heuristics with those used in tRNAscan-SE, whose heuristics incorporate a significant amount of work specific to tRNAs, where our heuristics are generic to any ncRNA. Performance was roughly comparable, so we expect that our heuristics provide a high-quality solution that--unlike family-specific solutions--can scale to hundreds of ncRNA families. AVAILABILITY: The source code is available under GNU Public License at the supplementary web site.


Assuntos
Biologia Computacional/métodos , RNA não Traduzido/química , Alinhamento de Sequência/métodos , Algoritmos , Genoma , Humanos , Cadeias de Markov , Modelos Estatísticos , Conformação de Ácido Nucleico , Estrutura Secundária de Proteína , Proteínas/química , RNA/química , RNA de Transferência/química , Curva ROC , Sensibilidade e Especificidade , Software
3.
Bioinformatics ; 20 Suppl 1: i334-41, 2004 Aug 04.
Artigo em Inglês | MEDLINE | ID: mdl-15262817

RESUMO

MOTIVATION: Non-coding RNAs (ncRNAs)-functional RNA molecules not coding for proteins-are grouped into hundreds of families of homologs. To find new members of an ncRNA gene family in a large genome database, covariance models (CMs) are a useful statistical tool, as they use both sequence and RNA secondary structure information. Unfortunately, CM searches are slow. Previously, we introduced 'rigorous filters', which provably sacrifice none of CMs' accuracy, although often scanning much faster. A rigorous filter, using a profile hidden Markov model (HMM), is built based on the CM, and filters the genome database, eliminating sequences that provably could not be annotated as homologs. The CM is run only on the remainder. Some biologically important ncRNA families could not be scanned efficiently with this technique, largely due to the significance of conserved secondary structure relative to primary sequence in identifying these families. Current heuristic filters are also expected to perform poorly on such families. RESULTS: By augmenting profile HMMs with limited secondary structure information, we obtain rigorous filters that accelerate CM searches for virtually all known ncRNA families from the Rfam Database and tRNA models in tRNAscan-SE. These filters scan an 8 gigabase database in weeks instead of years, and uncover homologs missed by heuristic techniques to speed CM searches. AVAILABILITY: Software in development; contact the authors.


Assuntos
Algoritmos , Mapeamento Cromossômico/métodos , Sequência Conservada/genética , RNA não Traduzido/genética , Análise de Sequência de RNA/métodos , Cadeias de Markov , Reconhecimento Automatizado de Padrão/métodos , Reprodutibilidade dos Testes , Sensibilidade e Especificidade , Homologia de Sequência do Ácido Nucleico
SELEÇÃO DE REFERÊNCIAS
DETALHE DA PESQUISA