Your browser doesn't support javascript.
loading
Automated annotation of disease subtypes.
Ofer, Dan; Linial, Michal.
Afiliação
  • Ofer D; Department of Biological Chemistry, The Life Science Institute, The Hebrew University of Jerusalem, Israel. Electronic address: dan.ofer@mail.huji.ac.il.
  • Linial M; Department of Biological Chemistry, The Life Science Institute, The Hebrew University of Jerusalem, Israel. Electronic address: michall@mail.huji.ac.il.
J Biomed Inform ; 154: 104650, 2024 Jun.
Article em En | MEDLINE | ID: mdl-38701887
ABSTRACT

BACKGROUND:

Distinguishing diseases into distinct subtypes is crucial for study and effective treatment strategies. The Open Targets Platform (OT) integrates biomedical, genetic, and biochemical datasets to empower disease ontologies, classifications, and potential gene targets. Nevertheless, many disease annotations are incomplete, requiring laborious expert medical input. This challenge is especially pronounced for rare and orphan diseases, where resources are scarce.

METHODS:

We present a machine learning approach to identifying diseases with potential subtypes, using the approximately 23,000 diseases documented in OT. We derive novel features for predicting diseases with subtypes using direct evidence. Machine learning models were applied to analyze feature importance and evaluate predictive performance for discovering both known and novel disease subtypes.

RESULTS:

Our model achieves a high (89.4%) ROC AUC (Area Under the Receiver Operating Characteristic Curve) in identifying known disease subtypes. We integrated pre-trained deep-learning language models and showed their benefits. Moreover, we identify 515 disease candidates predicted to possess previously unannotated subtypes.

CONCLUSIONS:

Our models can partition diseases into distinct subtypes. This methodology enables a robust, scalable approach for improving knowledge-based annotations and a comprehensive assessment of disease ontology tiers. Our candidates are attractive targets for further study and personalized medicine, potentially aiding in the unveiling of new therapeutic indications for sought-after targets.
Assuntos
Palavras-chave

Texto completo: 1 Coleções: 01-internacional Base de dados: MEDLINE Assunto principal: Aprendizado de Máquina Limite: Humans Idioma: En Ano de publicação: 2024 Tipo de documento: Article

Texto completo: 1 Coleções: 01-internacional Base de dados: MEDLINE Assunto principal: Aprendizado de Máquina Limite: Humans Idioma: En Ano de publicação: 2024 Tipo de documento: Article