Data-driven information retrieval in heterogeneous collections of transcriptomics data links SIM2s to malignant pleural mesothelioma.
Bioinformatics
; 28(2): 246-53, 2012 Jan 15.
Article
em En
| MEDLINE
| ID: mdl-22106335
MOTIVATION: Genome-wide measurement of transcript levels is an ubiquitous tool in biomedical research. As experimental data continues to be deposited in public databases, it is becoming important to develop search engines that enable the retrieval of relevant studies given a query study. While retrieval systems based on meta-data already exist, data-driven approaches that retrieve studies based on similarities in the expression data itself have a greater potential of uncovering novel biological insights. RESULTS: We propose an information retrieval method based on differential expression. Our method deals with arbitrary experimental designs and performs competitively with alternative approaches, while making the search results interpretable in terms of differential expression patterns. We show that our model yields meaningful connections between biological conditions from different studies. Finally, we validate a previously unknown connection between malignant pleural mesothelioma and SIM2s suggested by our method, via real-time polymerase chain reaction in an independent set of mesothelioma samples. AVAILABILITY: Supplementary data and source code are available from http://www.ebi.ac.uk/fg/research/rex.
Texto completo:
1
Coleções:
01-internacional
Base de dados:
MEDLINE
Assunto principal:
Neoplasias Pleurais
/
Armazenamento e Recuperação da Informação
/
Perfilação da Expressão Gênica
/
Mesotelioma
Limite:
Animals
/
Humans
Idioma:
En
Ano de publicação:
2012
Tipo de documento:
Article