Your browser doesn't support javascript.
loading
SNPranker 2.0: a gene-centric data mining tool for diseases associated SNP prioritization in GWAS.
Merelli, Ivan; Calabria, Andrea; Cozzi, Paolo; Viti, Federica; Mosca, Ettore; Milanesi, Luciano.
Affiliation
  • Merelli I; Consiglio Nazionale delle Ricerche - Istituto di Tecnologie Biomediche (CNR-ITB), Via F,lli Cervi 93, 20090 Segrate (MI), Italy.
BMC Bioinformatics ; 14 Suppl 1: S9, 2013.
Article in En | MEDLINE | ID: mdl-23369106
ABSTRACT

BACKGROUND:

The capability of correlating specific genotypes with human diseases is a complex issue in spite of all advantages arisen from high-throughput technologies, such as Genome Wide Association Studies (GWAS). New tools for genetic variants interpretation and for Single Nucleotide Polymorphisms (SNPs) prioritization are actually needed. Given a list of the most relevant SNPs statistically associated to a specific pathology as result of a genotype study, a critical issue is the identification of genes that are effectively related to the disease by re-scoring the importance of the identified genetic variations. Vice versa, given a list of genes, it can be of great importance to predict which SNPs can be involved in the onset of a particular disease, in order to focus the research on their effects.

RESULTS:

We propose a new bioinformatics approach to support biological data mining in the analysis and interpretation of SNPs associated to pathologies. This system can be employed to design custom genotyping chips for disease-oriented studies and to re-score GWAS results. The proposed method relies (1) on the data integration of public resources using a gene-centric database design, (2) on the evaluation of a set of static biomolecular annotations, defined as features, and (3) on the SNP scoring function, which computes SNP scores using parameters and weights set by users. We employed a machine learning classifier to set default feature weights and an ontological annotation layer to enable the enrichment of the input gene set. We implemented our method as a web tool called SNPranker 2.0 (http//www.itb.cnr.it/snpranker), improving our first published release of this system. A user-friendly interface allows the input of a list of genes, SNPs or a biological process, and to customize the features set with relative weights. As result, SNPranker 2.0 returns a list of SNPs, localized within input and ontologically enriched genes, combined with their prioritization scores.

CONCLUSIONS:

Different databases and resources are already available for SNPs annotation, but they do not prioritize or re-score SNPs relying on a-priori biomolecular knowledge. SNPranker 2.0 attempts to fill this gap through a user-friendly integrated web resource. End users, such as researchers in medical genetics and epidemiology, may find in SNPranker 2.0 a new tool for data mining and interpretation able to support SNPs analysis. Possible scenarios are GWAS data re-scoring, SNPs selection for custom genotyping arrays and SNPs/diseases association studies.
Subject(s)

Full text: 1 Collection: 01-internacional Database: MEDLINE Main subject: Software / Disease / Polymorphism, Single Nucleotide / Genome-Wide Association Study / Data Mining Type of study: Prognostic_studies / Risk_factors_studies Limits: Humans Language: En Journal: BMC Bioinformatics Journal subject: INFORMATICA MEDICA Year: 2013 Document type: Article Affiliation country: Italy

Full text: 1 Collection: 01-internacional Database: MEDLINE Main subject: Software / Disease / Polymorphism, Single Nucleotide / Genome-Wide Association Study / Data Mining Type of study: Prognostic_studies / Risk_factors_studies Limits: Humans Language: En Journal: BMC Bioinformatics Journal subject: INFORMATICA MEDICA Year: 2013 Document type: Article Affiliation country: Italy