Your browser doesn't support javascript.
loading
Crohn disease risk prediction-Best practices and pitfalls with exome data.
Giollo, Manuel; Jones, David T; Carraro, Marco; Leonardi, Emanuela; Ferrari, Carlo; Tosatto, Silvio C E.
Affiliation
  • Giollo M; Institute of Structural and Molecular Biology, University College London, London, United Kingdom.
  • Jones DT; Institute of Structural and Molecular Biology, University College London, London, United Kingdom.
  • Carraro M; Department of Biomedical Sciences, University of Padova, Padova, Italy.
  • Leonardi E; Department of Woman and Child Health, University of Padova, Padova, Italy.
  • Ferrari C; Department of Information Engineering, University of Padova, Padova, Italy.
  • Tosatto SCE; Department of Woman and Child Health, University of Padova, Padova, Italy.
Hum Mutat ; 38(9): 1193-1200, 2017 09.
Article in En | MEDLINE | ID: mdl-28087895
The Critical Assessment of Genome Interpretation (CAGI) experiment is the first attempt to evaluate the state-of-the-art in genetic data interpretation. Among the proposed challenges, Crohn disease (CD) risk prediction has become the most classic problem spanning three editions. The scientific question is very hard: can anybody assess the risk to develop CD given the exome data alone? This is one of the ultimate goals of genetic analysis, which motivated most CAGI participants to look for powerful new methods. In the 2016 CD challenge, we implemented all the best methods proposed in the past editions. This resulted in 10 algorithms, which were evaluated fairly by CAGI organizers. We also used all the data available from CAGI 11 and 13 to maximize the amount of training samples. The most effective algorithms used known genes associated with CD from the literature. No method could evaluate effectively the importance of unannotated variants by using heuristics. As a downside, all CD datasets were strongly affected by sample stratification. This affected the performance reported by assessors. Therefore, we expect that future datasets will be normalized in order to remove population effects. This will improve methods comparison and promote algorithms focused on causal variants discovery.
Subject(s)
Key words

Full text: 1 Database: MEDLINE Main subject: Crohn Disease / Computational Biology Type of study: Etiology_studies / Guideline / Prognostic_studies / Risk_factors_studies Limits: Humans Language: En Year: 2017 Type: Article

Full text: 1 Database: MEDLINE Main subject: Crohn Disease / Computational Biology Type of study: Etiology_studies / Guideline / Prognostic_studies / Risk_factors_studies Limits: Humans Language: En Year: 2017 Type: Article