Your browser doesn't support javascript.
loading
Show: 20 | 50 | 100
Results 1 - 20 de 959
Filter
1.
BMC Genomics ; 25(1): 869, 2024 Sep 16.
Article in English | MEDLINE | ID: mdl-39285315

ABSTRACT

BACKGROUND: Bio-ontologies are keys in structuring complex biological information for effective data integration and knowledge representation. Semantic similarity analysis on bio-ontologies quantitatively assesses the degree of similarity between biological concepts based on the semantics encoded in ontologies. It plays an important role in structured and meaningful interpretations and integration of complex data from multiple biological domains. RESULTS: We present simona, a novel R package for semantic similarity analysis on general bio-ontologies. Simona implements infrastructures for ontology analysis by offering efficient data structures, fast ontology traversal methods, and elegant visualizations. Moreover, it provides a robust toolbox supporting over 70 methods for semantic similarity analysis. With simona, we conducted a benchmark against current semantic similarity methods. The results demonstrate methods are clustered based on their mathematical methodologies, thus guiding researchers in the selection of appropriate methods. Additionally, we explored annotation-based versus topology-based methods, revealing that semantic similarities solely based on ontology topology can efficiently reveal semantic similarity structures, facilitating analysis on less-studied organisms and other ontologies. CONCLUSIONS: Simona offers a versatile interface and efficient implementation for processing, visualization, and semantic similarity analysis on bio-ontologies. We believe that simona will serve as a robust tool for uncovering relationships and enhancing the interoperability of biological knowledge systems.


Subject(s)
Biological Ontologies , Semantics , Software , Computational Biology/methods
2.
Rev Sci Tech ; 43: 69-78, 2024 Aug.
Article in English | MEDLINE | ID: mdl-39222110

ABSTRACT

The Global Burden of Animal Diseases (GBADs) programme aims to assess the impact of animal health on agricultural animals, livestock production systems and associated communities worldwide. As part of the objectives of GBADs'Animal Health Ontology theme, the programme reviewed conceptual frameworks, ontologies and classification systems in biomedical science. The focus was on data requirements in animal health and the connections between animal health and human and environmental health. In May 2023, the team conducted searches of recognised repositories of biomedical ontologies, including BioPortal, Open Biological and Biomedical Ontology Foundry, and Ontology Lookup Service, to identify animal and livestock ontologies and those containing relevant concepts. Sixteen ontologies were found, covering topics such as surveillance, anatomy and genetics. Notable examples include the Animal Trait Ontology for Livestock, the Animal Health Surveillance Ontology, the National Center for Biotechnology Information Taxonomy and the Uberon Multi-Species Anatomy Ontology. However, some ontologies lacked class definitions for a significant portion of their classes. The review highlights the need for domain evidence to support proposed models, critical appraisal of external ontologies before reuse, and external expert reviews along with statistical tests of agreements. The findings from this review informed the structural framework, concepts and rationales of the animal health ontology for GBADs. This animal health ontology aims to increase the interoperability and transparency of GBADs data, thereby enabling estimates of the impacts of animal diseases on agriculture, livestock production systems and associated communities globally.


Le programme " Impact mondial des maladies animales " (GBADs) vise à évaluer l'impact de la santé animale sur les animaux d'élevage, les systèmes de production animale et les communautés liées à ce secteur d'activités dans le monde. Afin de définir une ontologie de la santé animale répondant aux objectifs du GBADs, le programme a procédé à un examen des cadres conceptuels, des ontologies et des systèmes de classification actuellement appliqués en sciences biomédicales. Il s'agissait de définir les besoins en données dans le domaine de la santé animale ainsi que les connexions entre la santé animale, la santé publique et la santé environnementale. En mai 2023, l'équipe a procédé à des recherches dans des référentiels reconnus d'ontologies biomédicales, notamment BioPortal, Open Biological and Biomedical Ontology Foundry et Ontology Lookup Service, afin de recenser les ontologies relatives aux animaux et au bétail ainsi que celles contenant des concepts pertinents. Seize ontologies ont été relevées, couvrant des thèmes tels que la surveillance, l'anatomie et la génétique. Parmi les exemples notables on peut citer : Animal Trait Ontology for Livestock (ontologie dédiée aux caractères phénotypiques des animaux d'élevage), Animal Health Surveillance Ontology (ontologie dédiée à la surveillance de la santé animale), National Center for Biotechnology Information Taxonomy (la base de données Taxonomie du Centre américain pour les informations biotechnologiques), et Uberon Multi-Species Anatomy Ontology (ontologie anatomique représentant diverses espèces animales). Il a cependant été constaté que certaines ontologies ne disposent pas de définitions de classes pour une grande partie des classes qui les composent. L'examen a souligné l'importance d'étayer les modèles proposés par des données issues des spécialités en question, de procéder à une évaluation critique des ontologies externes avant de les réutiliser et de faire effectuer des examens complémentaires par des experts externes ainsi que des tests statistiques de concordance. Les résultats de cette étude ont apporté des éléments permettant de définir le cadre structurel, les concepts et les principes de l'ontologie relative à la santé animale destinée au GBADs. Cette ontologie de la santé animale vise à accroître l'interopérabilité et la transparence des données du GBADs, ce qui permet d'effectuer des estimations de l'impact des maladies animales sur l'agriculture, les systèmes de production animale et les communautés associées à ce secteur d'activités à l'échelle mondiale.


El programa sobre el impacto global de las enfermedades animales (GBADs) tiene como objetivo evaluar el impacto de la sanidad animal en los animales de granja, los sistemas de producción ganadera y las comunidades conexas en todo el mundo. Como parte de los objetivos en torno al tema de la ontología de la sanidad animal del GBADs, el programa revisó marcos conceptuales, ontologías y sistemas de clasificación en el ámbito de la ciencia biomédica. Se hizo hincapié en los requisitos de datos sobre la sanidad animal y en las conexiones entre la sanidad animal y la salud humana y ambiental. En mayo de 2023, el equipo realizó búsquedas en repositorios reconocidos de ontologías biomédicas, como BioPortal, Open Biological and Biomedical Ontology Foundry y Ontology Lookup Service, para identificar no solo ontologías animales y ganaderas, sino también aquellas que incluyeran conceptos relevantes. En este sentido, se encontraron dieciséis ontologías, que abarcan temas como vigilancia, anatomía y genética. Entre los ejemplos más destacados figuran Animal Trait Ontology for Livestock (Ontología de Características Animales para el Ganado), Animal Health Surveillance Ontology (Ontología de Vigilancia de la Sanidad Animal), National Center for Biotechnology Information Taxonomy (la base de datos Taxonomía del Centro Nacional para la Información Biotecnológica) y Uberon Multi-Species Anatomy Ontology (Ontología Anatómica de Especies Múltiples). Sin embargo, algunas ontologías carecían de definiciones para una parte significativa de sus clases. La revisión pone de relieve la necesidad de contar con datos probatorios del ámbito en cuestión que respalden los modelos propuestos, una evaluación crítica de las ontologías externas antes de su reutilización y revisiones de expertos externos junto con pruebas estadísticas de los acuerdos. Los resultados de esta revisión han servido de base para el marco estructural, los conceptos y los fundamentos de la ontología de la sanidad animal para el GBADs. Esta ontología pretende aumentar la interoperabilidad y la transparencia de los datos del GBADs, permitiendo así estimar el impacto de las enfermedades animales en la agricultura, los sistemas de producción ganadera y las comunidades conexas en todo el mundo.


Subject(s)
Animal Diseases , Biological Ontologies , Livestock , Animals , Animal Diseases/epidemiology , Global Health , Humans
3.
J Biomed Semantics ; 15(1): 15, 2024 Aug 20.
Article in English | MEDLINE | ID: mdl-39160586

ABSTRACT

BACKGROUND: Within the Open Biological and Biomedical Ontology (OBO) Foundry, many ontologies represent the execution of a plan specification as a process in which a realizable entity that concretizes the plan specification, a "realizable concretization" (RC), is realized. This representation, which we call the "RC-account", provides a straightforward way to relate a plan specification to the entity that bears the realizable concretization and the process that realizes the realizable concretization. However, the adequacy of the RC-account has not been evaluated in the scientific literature. In this manuscript, we provide this evaluation and, thereby, give ontology developers sound reasons to use or not use the RC-account pattern. RESULTS: Analysis of the RC-account reveals that it is not adequate for representing failed plans. If the realizable concretization is flawed in some way, it is unclear what (if any) relation holds between the realizable entity and the plan specification. If the execution (i.e., realization) of the realizable concretization fails to carry out the actions given in the plan specification, it is unclear under the RC-account how to directly relate the failed execution to the entity carrying out the instructions given in the plan specification. These issues are exacerbated in the presence of changing plans. CONCLUSIONS: We propose two solutions for representing failed plans. The first uses the Common Core Ontologies 'prescribed by' relation to connect a plan specification to the entity or process that utilizes the plan specification as a guide. The second, more complex, solution incorporates the process of creating a plan (in the sense of an intention to execute a plan specification) into the representation of executing plan specifications. We hypothesize that the first solution (i.e., use of 'prescribed by') is adequate for most situations. However, more research is needed to test this hypothesis as well as explore the other solutions presented in this manuscript.


Subject(s)
Biological Ontologies
4.
J Biomed Semantics ; 15(1): 14, 2024 Aug 10.
Article in English | MEDLINE | ID: mdl-39123237

ABSTRACT

BACKGROUND: Vaccines have revolutionized public health by providing protection against infectious diseases. They stimulate the immune system and generate memory cells to defend against targeted diseases. Clinical trials evaluate vaccine performance, including dosage, administration routes, and potential side effects. CLINICALTRIALS: gov is a valuable repository of clinical trial information, but the vaccine data in them lacks standardization, leading to challenges in automatic concept mapping, vaccine-related knowledge development, evidence-based decision-making, and vaccine surveillance. RESULTS: In this study, we developed a cascaded framework that capitalized on multiple domain knowledge sources, including clinical trials, the Unified Medical Language System (UMLS), and the Vaccine Ontology (VO), to enhance the performance of domain-specific language models for automated mapping of VO from clinical trials. The Vaccine Ontology (VO) is a community-based ontology that was developed to promote vaccine data standardization, integration, and computer-assisted reasoning. Our methodology involved extracting and annotating data from various sources. We then performed pre-training on the PubMedBERT model, leading to the development of CTPubMedBERT. Subsequently, we enhanced CTPubMedBERT by incorporating SAPBERT, which was pretrained using the UMLS, resulting in CTPubMedBERT + SAPBERT. Further refinement was accomplished through fine-tuning using the Vaccine Ontology corpus and vaccine data from clinical trials, yielding the CTPubMedBERT + SAPBERT + VO model. Finally, we utilized a collection of pre-trained models, along with the weighted rule-based ensemble approach, to normalize the vaccine corpus and improve the accuracy of the process. The ranking process in concept normalization involves prioritizing and ordering potential concepts to identify the most suitable match for a given context. We conducted a ranking of the Top 10 concepts, and our experimental results demonstrate that our proposed cascaded framework consistently outperformed existing effective baselines on vaccine mapping, achieving 71.8% on top 1 candidate's accuracy and 90.0% on top 10 candidate's accuracy. CONCLUSION: This study provides a detailed insight into a cascaded framework of fine-tuned domain-specific language models improving mapping of VO from clinical trials. By effectively leveraging domain-specific information and applying weighted rule-based ensembles of different pre-trained BERT models, our framework can significantly enhance the mapping of VO from clinical trials.


Subject(s)
Biological Ontologies , Clinical Trials as Topic , Vaccines , Vaccines/immunology , Humans , Natural Language Processing , Unified Medical Language System
5.
Sci Data ; 11(1): 906, 2024 Aug 22.
Article in English | MEDLINE | ID: mdl-39174566

ABSTRACT

The "RNA world" represents a novel frontier for the study of fundamental biological processes and human diseases and is paving the way for the development of new drugs tailored to each patient's biomolecular characteristics. Although scientific data about coding and non-coding RNA molecules are constantly produced and available from public repositories, they are scattered across different databases and a centralized, uniform, and semantically consistent representation of the "RNA world" is still lacking. We propose RNA-KG, a knowledge graph (KG) encompassing biological knowledge about RNAs gathered from more than 60 public databases, integrating functional relationships with genes, proteins, and chemicals and ontologically grounded biomedical concepts. To develop RNA-KG, we first identified, pre-processed, and characterized each data source; next, we built a meta-graph that provides an ontological description of the KG by representing all the bio-molecular entities and medical concepts of interest in this domain, as well as the types of interactions connecting them. Finally, we leveraged an instance-based semantically abstracted knowledge model to specify the ontological alignment according to which RNA-KG was generated. RNA-KG can be downloaded in different formats and also queried by a SPARQL endpoint. A thorough topological analysis of the resulting heterogeneous graph provides further insights into the characteristics of the "RNA world". RNA-KG can be both directly explored and visualized, and/or analyzed by applying computational methods to infer bio-medical knowledge from its heterogeneous nodes and edges. The resource can be easily updated with new experimental data, and specific views of the overall KG can be extracted according to the bio-medical problem to be studied.


Subject(s)
RNA , RNA/genetics , Humans , Biological Ontologies
6.
BMC Med Inform Decis Mak ; 23(Suppl 1): 302, 2024 Aug 30.
Article in English | MEDLINE | ID: mdl-39215285

ABSTRACT

Ontologies and terminologies serve as the backbone of knowledge representation in biomedical domains, facilitating data integration, interoperability, and semantic understanding across diverse applications. However, the quality assurance and enrichment of these resources remain an ongoing challenge due to the dynamic nature of biomedical knowledge. In this editorial, we provide an introductory summary of seven articles included in this special supplement issue for quality assurance and enrichment of biological and biomedical ontologies and terminologies. These articles span a spectrum of topics, such as development of automated quality assessment frameworks for Resource Description Framework (RDF) resources, identification of missing concepts in SNOMED CT through logical definitions, and developing a COVID interface terminology to enable automatic annotations of COVID-19 related Electronic Health Records (EHRs). Collectively, these contributions underscore the ongoing efforts to improve the accuracy, consistency, and interoperability of biomedical ontologies and terminologies, thus advancing their pivotal role in healthcare and biomedical research.


Subject(s)
Biological Ontologies , Humans , COVID-19 , Vocabulary, Controlled , Electronic Health Records/standards
7.
Stud Health Technol Inform ; 316: 1482-1486, 2024 Aug 22.
Article in English | MEDLINE | ID: mdl-39176484

ABSTRACT

Biomedical decision support systems play a crucial role in modern healthcare by assisting clinicians in making informed decisions. Events, such as physiological changes or drug reactions, are integral components of these systems, influencing patient outcomes and treatment strategies. However, effectively modeling events within these systems presents significant challenges due to the complexity and dynamic nature of medical data. Especially the differentiation between events and processes as well as the nature of events is often unclear. This paper explores approaches to modeling events in biomedical decision support systems, considering factors such as ontology-based representation. By addressing these challenges, we strive to provide the means for enhancing the functionality and interpretability of biomedical decision support systems concerning events.


Subject(s)
Biological Ontologies , Decision Support Systems, Clinical , Humans
8.
Stud Health Technol Inform ; 316: 1338-1342, 2024 Aug 22.
Article in English | MEDLINE | ID: mdl-39176629

ABSTRACT

Ontology is essential for achieving health information and information technology application interoperability in the biomedical fields and beyond. Traditionally, ontology construction is carried out manually by human domain experts (HDE). Here, we explore an active learning approach to automatically identify candidate terms from publications, with manual verification later as a part of a deep learning model training and learning process. We introduce the overall architecture of the active learning pipeline and present some preliminary results. This work is a critical and complementary component in addition to manually building the ontology, especially during the long-term maintenance stage.


Subject(s)
Biological Ontologies , Humans , Terminology as Topic , Problem-Based Learning , Supervised Machine Learning , Vocabulary, Controlled
9.
Stud Health Technol Inform ; 316: 1385-1389, 2024 Aug 22.
Article in English | MEDLINE | ID: mdl-39176639

ABSTRACT

Interoperability is crucial to overcoming various challenges of data integration in the healthcare domain. While OMOP and FHIR data standards handle syntactic heterogeneity among heterogeneous data sources, ontologies support semantic interoperability to overcome the complexity and disparity of healthcare data. This study proposes an ontological approach in the context of the EUCAIM project to support semantic interoperability among distributed big data repositories that have applied heterogeneous cancer image data models using a semantically well-founded Hyperontology for the oncology domain.


Subject(s)
Semantics , Humans , Biological Ontologies , Health Information Interoperability , Medical Oncology , Neoplasms , Big Data
10.
Stud Health Technol Inform ; 316: 1427-1431, 2024 Aug 22.
Article in English | MEDLINE | ID: mdl-39176649

ABSTRACT

The task of managing diverse electronic health records requires the consolidation of data from different sources to facilitate clinical research and decision-making support, with the emergence of the Observational Medical Outcomes Partnership - Common Data Model (OMOP-CDM) as a standard relational database schema for structuring health records from different sources. Working with ontologies is strongly associated with reasoners. Implementing them over expansive and intricate Ontologies can pose computational challenges, potentially resulting in slow performance. In this paper, we propose the implementation of a new reasoner based on categorical logic over a translation of OMOP-CDM into an ontology model. This enables enhancements to the efficiency and scalability of implementing such models.


Subject(s)
Electronic Health Records , Humans , Medical Informatics , Biological Ontologies
11.
Stud Health Technol Inform ; 316: 1432-1436, 2024 Aug 22.
Article in English | MEDLINE | ID: mdl-39176650

ABSTRACT

Common Data Models (CDMs) enhance data exchange and integration across diverse sources, preserving semantics and context. Transforming local data into CDMs is typically cumbersome and resource-intensive, with limited reusability. This article compares OntoBridge, an ontology-based tool designed to streamline the conversion of local datasets into CDMs, with traditional ETL methods in adopting the OMOP CDM. We examine flexibility and scalability in the management of new data sources, CDM updates, and the adoption of new CDMs. OntoBridge showed greater flexibility in integrating new data sources and adapting to CDM updates. It was also more scalable, facilitating the adoption of various CDMs like i2b2, unlike traditional methods reliant on OMOP-specific tools developed by OHDSI. In summary, while traditional ETL provides a structured approach to data integration, OntoBridge offers a more flexible, scalable, and maintenance-efficient alternative.


Subject(s)
Biological Ontologies , Electronic Health Records , Semantics , Software , Humans , Information Storage and Retrieval/methods
12.
Stud Health Technol Inform ; 316: 1805-1806, 2024 Aug 22.
Article in English | MEDLINE | ID: mdl-39176841

ABSTRACT

The population of dementia patients is on the rise, as society undergoes rapid aging. This led to an expansion of dementia-related data. This study aims to develop a comprehensive dementia ontology to facilitate the collection and analysis of high-quality dementia data. We followed an ontology building process from Ontology Development 101 and the content of the dementia ontology was validated by experts in dementia care and dementia-related research.


Subject(s)
Biological Ontologies , Dementia , Humans , Electronic Health Records
13.
Stud Health Technol Inform ; 316: 1933-1937, 2024 Aug 22.
Article in English | MEDLINE | ID: mdl-39176870

ABSTRACT

Biomedical data analysis and visualization often demand data experts for each unique health event. There is a clear lack of automatic tools for semantic visualization of the spread of health risks through biomedical data. Illnesses such as coronavirus disease (COVID-19) and Monkeypox spread rampantly around the world before governments could make decisions based on the analysis of such data. We propose the design of a knowledge graph (KG) for spatio-temporal tracking of public health event propagation. To achieve this, we propose the specialization of the Core Propagation Phenomenon Ontology (PropaPhen) into a health-related propagation phenomenon domain ontology. Data from the UMLS and OpenStreetMaps are suggested for instantiating the proposed knowledge graph. Finally, the results of a use case on COVID-19 data from the World Health Organization are analyzed to evaluate the possibilities of our approach.


Subject(s)
COVID-19 , Humans , SARS-CoV-2 , Biological Ontologies , Unified Medical Language System
14.
Stud Health Technol Inform ; 316: 771-775, 2024 Aug 22.
Article in English | MEDLINE | ID: mdl-39176907

ABSTRACT

Ontologies play a key role in representing and structuring domain knowledge. In the biomedical domain, the need for this type of representation is crucial for structuring, coding, and retrieving data. However, available ontologies do not encompass all the relevant concepts and relationships. In this paper, we propose the framework SiMHOMer (Siamese Models for Health Ontologies Merging) to semantically merge and integrate the most relevant ontologies in the healthcare domain, with a first focus on diseases, symptoms, drugs, and adverse events. We propose to rely on the siamese neural models we developed and trained on biomedical data, BioSTransformers, to identify new relevant relations between concepts and to create new semantic relations, the objective being to build a new merging ontology that could be used in applications. To validate the proposed approach and the new relations, we relied on the UMLS Metathesaurus and the Semantic Network. Our first results show promising improvements for future research.


Subject(s)
Biological Ontologies , Semantics , Neural Networks, Computer , Humans , Unified Medical Language System
15.
J Biomed Semantics ; 15(1): 16, 2024 Aug 30.
Article in English | MEDLINE | ID: mdl-39210467

ABSTRACT

Automatic disease progression prediction models require large amounts of training data, which are seldom available, especially when it comes to rare diseases. A possible solution is to integrate data from different medical centres. Nevertheless, various centres often follow diverse data collection procedures and assign different semantics to collected data. Ontologies, used as schemas for interoperable knowledge bases, represent a state-of-the-art solution to homologate the semantics and foster data integration from various sources. This work presents the BrainTeaser Ontology (BTO), an ontology that models the clinical data associated with two brain-related rare diseases (ALS and MS) in a comprehensive and modular manner. BTO assists in organizing and standardizing the data collected during patient follow-up. It was created by harmonizing schemas currently used by multiple medical centers into a common ontology, following a bottom-up approach. As a result, BTO effectively addresses the practical data collection needs of various real-world situations and promotes data portability and interoperability. BTO captures various clinical occurrences, such as disease onset, symptoms, diagnostic and therapeutic procedures, and relapses, using an event-based approach. Developed in collaboration with medical partners and domain experts, BTO offers a holistic view of ALS and MS for supporting the representation of retrospective and prospective data. Furthermore, BTO adheres to Open Science and FAIR (Findable, Accessible, Interoperable, and Reusable) principles, making it a reliable framework for developing predictive tools to aid in medical decision-making and patient care. Although BTO is designed for ALS and MS, its modular structure makes it easily extendable to other brain-related diseases, showcasing its potential for broader applicability.Database URL  https://zenodo.org/records/7886998 .


Subject(s)
Biological Ontologies , Humans , Retrospective Studies , Amyotrophic Lateral Sclerosis , Multiple Sclerosis , Semantics
16.
Stud Health Technol Inform ; 315: 727-728, 2024 Jul 24.
Article in English | MEDLINE | ID: mdl-39049401

ABSTRACT

This poster presents the use of Interpretive Description in ontology development. The methods selected attended to the need for quality and rigour.


Subject(s)
Biological Ontologies , Humans , Vocabulary, Controlled
17.
J Biomed Semantics ; 15(1): 13, 2024 Jul 31.
Article in English | MEDLINE | ID: mdl-39080729

ABSTRACT

BACKGROUND: Identifying chemical mentions within the Alzheimer's and dementia literature can provide a powerful tool to further therapeutic research. Leveraging the Chemical Entities of Biological Interest (ChEBI) ontology, which is rich in hierarchical and other relationship types, for entity normalization can provide an advantage for future downstream applications. We provide a reproducible hybrid approach that combines an ontology-enhanced PubMedBERT model for disambiguation with a dictionary-based method for candidate selection. RESULTS: There were 56,553 chemical mentions in the titles of 44,812 unique PubMed article abstracts. Based on our gold standard, our method of disambiguation improved entity normalization by 25.3 percentage points compared to using only the dictionary-based approach with fuzzy-string matching for disambiguation. For the CRAFT corpus, our method outperformed baselines (maximum 78.4%) with a 91.17% accuracy. For our Alzheimer's and dementia cohort, we were able to add 47.1% more potential mappings between MeSH and ChEBI when compared to BioPortal. CONCLUSION: Use of natural language models like PubMedBERT and resources such as ChEBI and PubChem provide a beneficial way to link entity mentions to ontology terms, while further supporting downstream tasks like filtering ChEBI mentions based on roles and assertions to find beneficial therapies for Alzheimer's and dementia.


Subject(s)
Alzheimer Disease , Dementia , Alzheimer Disease/drug therapy , Alzheimer Disease/metabolism , Humans , Translational Research, Biomedical , Natural Language Processing , Biological Ontologies
18.
BMC Med Inform Decis Mak ; 24(1): 216, 2024 Jul 31.
Article in English | MEDLINE | ID: mdl-39085883

ABSTRACT

BACKGROUND: Intraoperative neurophysiological monitoring (IOM) plays a pivotal role in enhancing patient safety during neurosurgical procedures. This vital technique involves the continuous measurement of evoked potentials to provide early warnings and ensure the preservation of critical neural structures. One of the primary challenges has been the effective documentation of IOM events with semantically enriched characterizations. This study aimed to address this challenge by developing an ontology-based tool. METHODS: We structured the development of the IOM Documentation Ontology (IOMDO) and the associated tool into three distinct phases. The initial phase focused on the ontology's creation, drawing from the OBO (Open Biological and Biomedical Ontology) principles. The subsequent phase involved agile software development, a flexible approach to encapsulate the diverse requirements and swiftly produce a prototype. The last phase entailed practical evaluation within real-world documentation settings. This crucial stage enabled us to gather firsthand insights, assessing the tool's functionality and efficacy. The observations made during this phase formed the basis for essential adjustments to ensure the tool's productive utilization. RESULTS: The core entities of the ontology revolve around central aspects of IOM, including measurements characterized by timestamp, type, values, and location. Concepts and terms of several ontologies were integrated into IOMDO, e.g., the Foundation Model of Anatomy (FMA), the Human Phenotype Ontology (HPO) and the ontology for surgical process models (OntoSPM) related to general surgical terms. The software tool developed for extending the ontology and the associated knowledge base was built with JavaFX for the user-friendly frontend and Apache Jena for the robust backend. The tool's evaluation involved test users who unanimously found the interface accessible and usable, even for those without extensive technical expertise. CONCLUSIONS: Through the establishment of a structured and standardized framework for characterizing IOM events, our ontology-based tool holds the potential to enhance the quality of documentation, benefiting patient care by improving the foundation for informed decision-making. Furthermore, researchers can leverage the semantically enriched data to identify trends, patterns, and areas for surgical practice enhancement. To optimize documentation through ontology-based approaches, it's crucial to address potential modeling issues that are associated with the Ontology of Adverse Events.


Subject(s)
Biological Ontologies , Neurosurgical Procedures , Humans , Neurosurgical Procedures/standards , Documentation/standards , Software
19.
J Am Med Inform Assoc ; 31(9): 2076-2083, 2024 Sep 01.
Article in English | MEDLINE | ID: mdl-38829731

ABSTRACT

OBJECTIVE: We aim to develop a novel method for rare disease concept normalization by fine-tuning Llama 2, an open-source large language model (LLM), using a domain-specific corpus sourced from the Human Phenotype Ontology (HPO). METHODS: We developed an in-house template-based script to generate two corpora for fine-tuning. The first (NAME) contains standardized HPO names, sourced from the HPO vocabularies, along with their corresponding identifiers. The second (NAME+SYN) includes HPO names and half of the concept's synonyms as well as identifiers. Subsequently, we fine-tuned Llama 2 (Llama2-7B) for each sentence set and conducted an evaluation using a range of sentence prompts and various phenotype terms. RESULTS: When the phenotype terms for normalization were included in the fine-tuning corpora, both models demonstrated nearly perfect performance, averaging over 99% accuracy. In comparison, ChatGPT-3.5 has only ∼20% accuracy in identifying HPO IDs for phenotype terms. When single-character typos were introduced in the phenotype terms, the accuracy of NAME and NAME+SYN is 10.2% and 36.1%, respectively, but increases to 61.8% (NAME+SYN) with additional typo-specific fine-tuning. For terms sourced from HPO vocabularies as unseen synonyms, the NAME model achieved 11.2% accuracy, while the NAME+SYN model achieved 92.7% accuracy. CONCLUSION: Our fine-tuned models demonstrate ability to normalize phenotype terms unseen in the fine-tuning corpus, including misspellings, synonyms, terms from other ontologies, and laymen's terms. Our approach provides a solution for the use of LLMs to identify named medical entities from clinical narratives, while successfully normalizing them to standard concepts in a controlled vocabulary.


Subject(s)
Biological Ontologies , Natural Language Processing , Phenotype , Rare Diseases , Vocabulary, Controlled , Humans
20.
Neuroimage ; 297: 120688, 2024 Aug 15.
Article in English | MEDLINE | ID: mdl-38878916

ABSTRACT

The human brain is organized as a complex, hierarchical network. However, the structural covariance patterns among brain regions and the underlying biological substrates of such covariance networks remain to be clarified. The present study proposed a novel individualized structural covariance network termed voxel-based texture similarity networks (vTSNs) based on 76 refined voxel-based textural features derived from structural magnetic resonance images. Validated in three independent longitudinal healthy cohorts (40, 23, and 60 healthy participants, respectively) with two common brain atlases, we found that the vTSN could robustly resolve inter-subject variability with high test-retest reliability. In contrast to the regional-based texture similarity networks (rTSNs) that calculate radiomic features based on region-of-interest information, vTSNs had higher inter- and intra-subject variability ratios and test-retest reliability in connectivity strength and network topological properties. Moreover, the Spearman correlation indicated a stronger association of the gene expression similarity network (GESN) with vTSNs than with rTSNs (vTSN: r = 0.600, rTSN: r = 0.433, z = 39.784, P < 0.001). Hierarchical clustering identified 3 vTSN subnets with differential association patterns with 13 coexpression modules, 16 neurotransmitters, 7 electrophysiology, 4 metabolism, and 2 large-scale structural and 4 functional organization maps. Moreover, these subnets had unique biological hierarchical organization from the subcortex-limbic system to the ventral neocortex and then to the dorsal neocortex. Based on 424 unrelated, qualified healthy subjects from the Human Connectome Project, we found that vTSNs could sensitively represent sex differences, especially for connections in the subcortex-limbic system and between the subcortex-limbic system and the ventral neocortex. Moreover, a multivariate variance component model revealed that vTSNs could explain a significant proportion of inter-subject behavioral variance in cognition (80.0 %) and motor functions (63.4 %). Finally, using 494 healthy adults (aged 19-80 years old) from the Southwest University Adult Lifespan Dataset, the Spearman correlation identified a significant association between aging and vTSN strength, especially within the subcortex-limbic system and between the subcortex-limbic system and the dorsal neocortex. In summary, our proposed vTSN is robust in uncovering individual variability and neurobiological brain processes, which can serve as biologically plausible measures for linking biological processes and human behavior.


Subject(s)
Brain , Magnetic Resonance Imaging , Humans , Male , Female , Magnetic Resonance Imaging/methods , Adult , Brain/diagnostic imaging , Brain/anatomy & histology , Brain/physiology , Young Adult , Biological Ontologies , Nerve Net/diagnostic imaging , Nerve Net/physiology , Nerve Net/anatomy & histology , Middle Aged , Connectome/methods , Reproducibility of Results , Aged
SELECTION OF CITATIONS
SEARCH DETAIL