Your browser doesn't support javascript.
loading
Semantic Harmonization of Alzheimer's Disease Datasets Using AD-Mapper.
Wegner, Philipp; Balabin, Helena; Ay, Mehmet Can; Bauermeister, Sarah; Killin, Lewis; Gallacher, John; Hofmann-Apitius, Martin; Salimi, Yasamin.
Affiliation
  • Wegner P; Department of Bioinformatics, Fraunhofer Institute for Algorithms and Scientific Computing (SCAI), Sankt Augustin, Germany.
  • Balabin H; Rheinische Friedrich-Wilhelms-Universität Bonn, Bonn, Germany.
  • Ay MC; German Center for Neurodegenerative Diseases (DZNE), Bonn, Germany.
  • Bauermeister S; Department of Neurosciences, Laboratory for Cognitive Neurology, KU Leuven, Leuven, Belgium.
  • Killin L; Department of Computer Science, Language Intelligence and Information Retrieval Lab, KU Leuven, Leuven, Belgium.
  • Gallacher J; Department of Bioinformatics, Fraunhofer Institute for Algorithms and Scientific Computing (SCAI), Sankt Augustin, Germany.
  • Hofmann-Apitius M; Bonn-Aachen International Center for IT, Rheinische Friedrich-Wilhelms-Universität Bonn, Bonn, Germany.
  • Salimi Y; Department of Psychiatry, Warneford Hospital, University of Oxford, Oxford, UK.
J Alzheimers Dis ; 99(4): 1409-1423, 2024.
Article de En | MEDLINE | ID: mdl-38759012
ABSTRACT

Background:

Despite numerous past endeavors for the semantic harmonization of Alzheimer's disease (AD) cohort studies, an automatic tool has yet to be developed.

Objective:

As cohort studies form the basis of data-driven analysis, harmonizing them is crucial for cross-cohort analysis. We aimed to accelerate this task by constructing an automatic harmonization tool.

Methods:

We created a common data model (CDM) through cross-mapping data from 20 cohorts, three CDMs, and ontology terms, which was then used to fine-tune a BioBERT model. Finally, we evaluated the model using three previously unseen cohorts and compared its performance to a string-matching baseline model.

Results:

Here, we present our AD-Mapper interface for automatic harmonization of AD cohort studies, which outperformed a string-matching baseline on previously unseen cohort studies. We showcase our CDM comprising 1218 unique variables.

Conclusion:

AD-Mapper leverages semantic similarities in naming conventions across cohorts to improve mapping performance.
Sujet(s)
Mots clés

Texte intégral: 1 Collection: 01-internacional Base de données: MEDLINE Sujet principal: Sémantique / Maladie d'Alzheimer Limites: Humans Langue: En Journal: J Alzheimers Dis Sujet du journal: GERIATRIA / NEUROLOGIA Année: 2024 Type de document: Article Pays d'affiliation: Allemagne Pays de publication: Pays-Bas

Texte intégral: 1 Collection: 01-internacional Base de données: MEDLINE Sujet principal: Sémantique / Maladie d'Alzheimer Limites: Humans Langue: En Journal: J Alzheimers Dis Sujet du journal: GERIATRIA / NEUROLOGIA Année: 2024 Type de document: Article Pays d'affiliation: Allemagne Pays de publication: Pays-Bas