Your browser doesn't support javascript.
loading
Imperfect gold standard gene sets yield inaccurate evaluation of causal gene identification methods.
Wang, Lijia; Wen, Xiaoquan; Morrison, Jean.
Affiliation
  • Wang L; Department of Biostatistics, University of Michigan, Ann Arbor, MI, USA.
  • Wen X; Department of Biostatistics, University of Michigan, Ann Arbor, MI, USA. xwen@umich.edu.
  • Morrison J; Department of Biostatistics, University of Michigan, Ann Arbor, MI, USA. jvmorr@umich.edu.
Commun Biol ; 7(1): 873, 2024 Jul 17.
Article in En | MEDLINE | ID: mdl-39020054
ABSTRACT
Causal gene discovery methods are often evaluated using reference sets of causal genes, which are treated as gold standards (GS) for the purposes of evaluation. However, evaluation methods typically treat genes not in the GS positive set as known negatives rather than unknowns. This leads to inaccurate estimates of sensitivity, specificity, and AUC. Labeling biases in GS gene sets can also lead to inaccurate ordering of alternative causal gene discovery methods. We argue that the evaluation of causal gene discovery methods should rely on statistical techniques like those used for variant discovery rather than on comparison with GS gene sets.
Subject(s)

Full text: 1 Collection: 01-internacional Database: MEDLINE Main subject: Reference Standards Limits: Humans Language: En Journal: Commun Biol Year: 2024 Document type: Article Affiliation country: United States

Full text: 1 Collection: 01-internacional Database: MEDLINE Main subject: Reference Standards Limits: Humans Language: En Journal: Commun Biol Year: 2024 Document type: Article Affiliation country: United States