Your browser doesn't support javascript.
loading
Mostrar: 20 | 50 | 100
Resultados 1 - 20 de 50
Filtrar
1.
J Biomed Semantics ; 15(1): 1, 2024 Mar 04.
Artículo en Inglés | MEDLINE | ID: mdl-38438913

RESUMEN

The increasing number of articles on adverse interactions that may occur when specific foods are consumed with certain drugs makes it difficult to keep up with the latest findings. Conflicting information is available in the scientific literature and specialized knowledge bases because interactions are described in an unstructured or semi-structured format. The FIDEO ontology aims to integrate and represent information about food-drug interactions in a structured way. This article reports on the new version of this ontology in which more than 1700 interactions are integrated from two online resources: DrugBank and Hedrine. These food-drug interactions have been represented in FIDEO in the form of precompiled concepts, each of which specifies both the food and the drug involved. Additionally, competency questions that can be answered are reviewed, and avenues for further enrichment are discussed.


Asunto(s)
Interacciones Alimento-Droga , Bases del Conocimiento
2.
Sci Data ; 10(1): 871, 2023 12 06.
Artículo en Inglés | MEDLINE | ID: mdl-38057380

RESUMEN

Drug repositioning is a faster and more affordable solution than traditional drug discovery approaches. From this perspective, computational drug repositioning using knowledge graphs is a very promising direction. Knowledge graphs constructed from drug data and information can be used to generate hypotheses (molecule/drug - target links) through link prediction using machine learning algorithms. However, it remains rare to have a holistically constructed knowledge graph using the broadest possible features and drug characteristics, which is freely available to the community. The OREGANO knowledge graph aims at filling this gap. The purpose of this paper is to present the OREGANO knowledge graph, which includes natural compounds related data. The graph was developed from scratch by retrieving data directly from the knowledge sources to be integrated. We therefore designed the expected graph model and proposed a method for merging nodes between the different knowledge sources, and finally, the data were cleaned. The knowledge graph, as well as the source codes for the ETL process, are openly available on the GitHub of the OREGANO project ( https://gitub.u-bordeaux.fr/erias/oregano ).


Asunto(s)
Reposicionamiento de Medicamentos , Algoritmos , Descubrimiento de Drogas , Aprendizaje Automático
4.
Yearb Med Inform ; 32(1): 2-6, 2023 Aug.
Artículo en Inglés | MEDLINE | ID: mdl-38575142

RESUMEN

OBJECTIVES: To introduce the 2023 International Medical Informatics Association (IMIA) Yearbook by the editors. METHODS: The editorial provides an introduction and overview to the 2023 IMIA Yearbook where the special topic is "Informatics for One Health". The special topic, survey papers and some best papers are discussed. The section changes in the Yearbook editorial committee are also described. RESULTS: IMIA Yearbook 2023 provides many perspectives on a relatively new topic called "One Digital Health". The subject is vast, and includes the use of digital technologies to promote the well-being of people and animals, but also of the environment in which they evolve. Many sections produced new work in the topic including One Health and all sections included the latest themes in many specialties in medical informatics. CONCLUSIONS: The theme of "Informatics for One Health" is relatively new but the editors of the IMIA Yearbook have presented excellent and thought-provoking work for biomedical informatics in 2023.


Asunto(s)
Informática Médica , Salud Única , Humanos
5.
Yearb Med Inform ; 31(1): 2-6, 2022 Aug.
Artículo en Inglés | MEDLINE | ID: mdl-36463863

RESUMEN

OBJECTIVES: To introduce the 2022 International Medical Informatics Association (IMIA) Yearbook by the editors. METHODS: The editorial provides an introduction and overview to the 2022 IMIA Yearbook whose special topic is "Inclusive Digital Health: Addressing Equity, Literacy, and Bias for Resilient Health Systems". The special topic, survey papers, section editor synopses and some best papers are discussed. The sections' changes in the Yearbook Editorial Committee are also described. RESULTS: As shown in the previous edition, health informatics in the context of a global pandemic has led to the development of ways to collect, standardize, disseminate and reuse data worldwide. The Corona Virus Disease 2019 (COVID-19) pandemic has demonstrated the need for timely, reliable, open, and globally available information to support decision making. It has also highlighted the need to address social inequities and disparities in access to care across communities. This edition of the Yearbook acknowledges the fact that much work has been done to study health equity in recent years in the various fields of health informatics research. CONCLUSION: There is a strong desire to better consider disparities between populations to avoid biases being induced in Artificial Intelligence algorithms in particular. Telemedicine and m-health must be more inclusive for people with disabilities or living in isolated geographical areas.


Asunto(s)
COVID-19 , Informática Médica , Humanos , Inteligencia Artificial , Pandemias , Algoritmos
6.
Stud Health Technol Inform ; 294: 312-316, 2022 May 25.
Artículo en Inglés | MEDLINE | ID: mdl-35612083

RESUMEN

New use cases and the need for quality control and imaging data sharing in health studies require the capacity to align them to reference terminologies. We are interested in mapping the local terminology used at our center to describe imaging procedures to reference terminologies for imaging procedures (RadLex Playbook and LOINC/RSNA Radiology Playbook). We performed a manual mapping of the 200 most frequent imaging report titles at our center (i.e. 73.2% of all imaging exams). The mapping method was based only on information explicitly stated in the titles. The results showed 57.5% and 68.8% of exact mapping to the RadLex and LOINC/RSNA Radiology Playbooks, respectively. We identified the reasons for the mapping failure and analyzed the issues encountered.


Asunto(s)
Difusión de la Información/métodos , Logical Observation Identifiers Names and Codes , Sistemas de Información Radiológica/tendencias , Radiología , Radiografía , Radiología/métodos , Radiología/tendencias , Terminología como Asunto
7.
Stud Health Technol Inform ; 294: 322-326, 2022 May 25.
Artículo en Inglés | MEDLINE | ID: mdl-35612085

RESUMEN

Information about drugs is numerous and varied, and many drugs can share the same information. Grouping drugs that have common characteristics can be useful to avoid redundancy and facilitate interoperability. Our work focused on the evaluation of the relevance of classes allowing this type of grouping: the "Virtual Drug". Thus, in this paper, we describe the process of creating this class from the data of the French Public Drug Database, which is then evaluated against the codes of the Anatomical Therapeutic Chemical classification associated with the drugs. Our evaluation showed that 99.55% of the "Virtual Drug" classes have a good intra-class consistency.

8.
Stud Health Technol Inform ; 294: 332-336, 2022 May 25.
Artículo en Inglés | MEDLINE | ID: mdl-35612087

RESUMEN

Secondary use of health data is made difficult in part because of large semantic heterogeneity. Many efforts are being made to align local terminologies with international standards. With increasing concerns about data privacy, we focused here on the use of machine learning methods to align biological data elements using aggregated features that could be shared as open data. A 3-step methodology (features engineering, blocking strategy and supervised learning) was proposed. The first results, although modest, are encouraging for the future development of this approach.


Asunto(s)
Aprendizaje Automático , Privacidad
9.
Yearb Med Inform ; 30(1): 4-7, 2021 Aug.
Artículo en Inglés | MEDLINE | ID: mdl-34479377

RESUMEN

OBJECTIVES: To introduce the 2021 International Medical Informatics Association (IMIA) Yearbook by the editors. METHODS: The editorial provides an introduction and overview to the 2021 IMIA Yearbook whose special topic is "Managing Pandemics with Health Informatics - Successes and Challenges". The Special Topic, the keynote paper, and survey papers are discussed. The IMIA President's statement and the IMIA dialogue with the World Health Organization are introduced. The sections' changes in the Yearbook Editorial Committee are also described. RESULTS: Health informatics, in the context of a global pandemic, led to the development of ways to collect, standardize, disseminate and reuse data worldwide: public health data but also information from social networks and scientific literature. Fact checking methods were mostly based on artificial intelligence and natural language processing. The pandemic also introduced new challenges for telehealth support in times of critical response. Next generation sequencing in bioinformatics helped in decoding the sequence of the virus and the development of messenger ribonucleic acid (mRNA) vaccines. CONCLUSIONS: The Corona Virus Disease 2019 (COVID-19) pandemic shows the need for timely, reliable, open, and globally available information to support decision making and efficiently control outbreaks. Applying Findable, Accessible, Interoperable, and Reusable (FAIR) requirements for data is a key success factor while challenging ethical issues have to be considered.


Asunto(s)
COVID-19 , Comunicación en Salud , Difusión de la Información , Intercambio de Información en Salud , Humanos , Informática Médica
10.
Nat Commun ; 12(1): 4385, 2021 07 19.
Artículo en Inglés | MEDLINE | ID: mdl-34282143

RESUMEN

As the capacity for generating large-scale molecular profiling data continues to grow, the ability to extract meaningful biological knowledge from it remains a limitation. Here, we describe the development of a new fixed repertoire of transcriptional modules, BloodGen3, that is designed to serve as a stable reusable framework for the analysis and interpretation of blood transcriptome data. The construction of this repertoire is based on co-clustering patterns observed across sixteen immunological and physiological states encompassing 985 blood transcriptome profiles. Interpretation is supported by customized resources, including module-level analysis workflows, fingerprint grid plot visualizations, interactive web applications and an extensive annotation framework comprising functional profiling reports and reference transcriptional profiles. Taken together, this well-characterized and well-supported transcriptional module repertoire can be employed for the interpretation and benchmarking of blood transcriptome profiles within and across patient cohorts. Blood transcriptome fingerprints for the 16 reference cohorts can be accessed interactively via:  https://drinchai.shinyapps.io/BloodGen3Module/ .


Asunto(s)
Análisis Químico de la Sangre , Sangre , Perfilación de la Expresión Génica/métodos , Transcriptoma , Bacterias , Sangre/inmunología , Análisis Químico de la Sangre/métodos , Análisis por Conglomerados , Biología Computacional/métodos , Redes Reguladoras de Genes , Humanos
11.
JAMIA Open ; 4(2): ooab035, 2021 Apr.
Artículo en Inglés | MEDLINE | ID: mdl-34131637

RESUMEN

OBJECTIVE: Our study consists in aligning the interface terminology of the Bordeaux university hospital (TLAB) to the Logical Observation Identifiers Names and Codes (LOINC). The objective was to facilitate the shared and integrated use of biological results with other health information systems. MATERIALS AND METHODS: We used an innovative approach based on a decomposition and re-composition of LOINC concepts according to the transversal relations that may be described between LOINC concepts and their definitional attributes. TLAB entities were first anchored to LOINC attributes and then aligned to LOINC concepts through the appropriate combination of definitional attributes. Finally, using laboratory results of the Bordeaux data-warehouse, an instance-based filtering process has been applied. RESULTS: We found a small overlap between the tokens constituting the labels of TLAB and LOINC. However, the TLAB entities have been easily aligned to LOINC attributes. Thus, 99.8% of TLAB entities have been related to a LOINC analyte and 61.0% to a LOINC system. A total of 55.4% of used TLAB entities in the hospital data-warehouse have been mapped to LOINC concepts. We performed a manual evaluation of all 1-1 mappings between TLAB entities and LOINC concepts and obtained a precision of 0.59. CONCLUSION: We aligned TLAB and LOINC with reasonable performances, given the poor quality of TLAB labels. In terms of interoperability, the alignment of interface terminologies with LOINC could be improved through a more formal LOINC structure. This would allow queries on LOINC attributes rather than on LOINC concepts only.

12.
Stud Health Technol Inform ; 281: 253-257, 2021 May 27.
Artículo en Inglés | MEDLINE | ID: mdl-34042744

RESUMEN

This paper presents a prototype for the visualization of food-drug interactions implemented in the MIAM project, whose objective is to develop methods for the extraction and representation of these interactions and to make them available in the Thériaque database. The prototype provides users with a graphical visualization showing the hierarchies of drugs and foods in front of each other and the links between them representing the existing interactions as well as additional details about them, including the number of articles reporting the interaction. The prototype is interactive in the following ways: hierarchies can be easily folded and unfolded, a filter can be applied to view only certain types of interactions, and details about a given interaction are displayed when the mouse is moved over the corresponding link. Future work includes proposing a version more suitable for non-health professional users and the representation of the food hierarchy based on a reference classification.


Asunto(s)
Interacciones Alimento-Droga , Animales , Bases de Datos Factuales , Ratones
13.
AMIA Annu Symp Proc ; 2020: 933-942, 2020.
Artículo en Inglés | MEDLINE | ID: mdl-33936469

RESUMEN

The aim of our study was to create a graph model for the description of LOINC® concepts. The main objective of the constructed structure is to facilitate the alignment of French local terminologies to LOINC. The process consisted of automatically incorporating the naming rules of LOINC labels, based on punctuation. We implemented these rules and applied them to the French variants of LOINC and then created attributes and concepts described with synonymous labels. When comparing the created attributes to the stated ones, the multiple mappings led to the identification of errors that must be corrected for improving the translation quality. These mappings are consecutive to semantic errors generated during the translation process. They mainly corresponded to misinterpretations of LOINC concepts and/or LOINC attributes.


Asunto(s)
Logical Observation Identifiers Names and Codes , Semántica
14.
NAR Genom Bioinform ; 2(2): lqaa017, 2020 Jun.
Artículo en Inglés | MEDLINE | ID: mdl-33575577

RESUMEN

The revolution in new sequencing technologies is greatly leading to new understandings of the relations between genotype and phenotype. To interpret and analyze data that are grouped according to a phenotype of interest, methods based on statistical enrichment became a standard in biology. However, these methods synthesize the biological information by a priori selecting the over-represented terms and may suffer from focusing on the most studied genes that represent a limited coverage of annotated genes within a gene set. Semantic similarity measures have shown great results within the pairwise gene comparison by making advantage of the underlying structure of the Gene Ontology. We developed GSAn, a novel gene set annotation method that uses semantic similarity measures to synthesize a priori Gene Ontology annotation terms. The originality of our approach is to identify the best compromise between the number of retained annotation terms that has to be drastically reduced and the number of related genes that has to be as large as possible. Moreover, GSAn offers interactive visualization facilities dedicated to the multi-scale analysis of gene set annotations. Compared to enrichment analysis tools, GSAn has shown excellent results in terms of maximizing the gene coverage while minimizing the number of terms.

15.
Stud Health Technol Inform ; 264: 1445-1446, 2019 Aug 21.
Artículo en Inglés | MEDLINE | ID: mdl-31438173

RESUMEN

Clinical information in electronic health records (EHRs) is mostly unstructured. With the ever-increasing amount of information in patients' EHRs, manual extraction of clinical information for data reuse can be tedious and time-consuming without dedicated tools. In this paper, we present SmartCRF, a prototype to visualize, search and ease the extraction and structuration of information from EHRs stored in an i2b2 data warehouse.


Asunto(s)
Data Warehousing , Almacenamiento y Recuperación de la Información , Registros Electrónicos de Salud
16.
Stud Health Technol Inform ; 264: 79-82, 2019 Aug 21.
Artículo en Inglés | MEDLINE | ID: mdl-31437889

RESUMEN

The W3C project, "Linking Open Drug Data" (LODD), linked several publicly available sources of drug data together. So far, French data, like marketed drugs and their summary of product characteristics, were not integrated and remained difficult to query. In this paper, we present Romedi (Référentiel Ouvert du Médicament), an open dataset that links French data on drugs to international resources. The principles and standard recommendations created by the W3C for sharing information were adopted. Romedi was connected to the Unified Medical Language System and DrugBank, two central resources of the LODD project. A SPARQL endpoint is available to query Romedi and services are provided to annotate textual content with Romedi terms. This paper describes its content, its services, its links to external resources, and expected future developments.


Asunto(s)
Preparaciones Farmacéuticas , Web Semántica , Francia , Almacenamiento y Recuperación de la Información , Internet , Lenguaje , Semántica , Unified Medical Language System
17.
PLoS One ; 13(11): e0208037, 2018.
Artículo en Inglés | MEDLINE | ID: mdl-30481204

RESUMEN

MOTIVATION: The recent revolution in new sequencing technologies, as a part of the continuous process of adopting new innovative protocols has strongly impacted the interpretation of relations between phenotype and genotype. Thus, understanding the resulting gene sets has become a bottleneck that needs to be addressed. Automatic methods have been proposed to facilitate the interpretation of gene sets. While statistical functional enrichment analyses are currently well known, they tend to focus on well-known genes and to ignore new information from less-studied genes. To address such issues, applying semantic similarity measures is logical if the knowledge source used to annotate the gene sets is hierarchically structured. In this work, we propose a new method for analyzing the impact of different semantic similarity measures on gene set annotations. RESULTS: We evaluated the impact of each measure by taking into consideration the two following features that correspond to relevant criteria for a "good" synthetic gene set annotation: (i) the number of annotation terms has to be drastically reduced and the representative terms must be retained while annotating the gene set, and (ii) the number of genes described by the selected terms should be as large as possible. Thus, we analyzed nine semantic similarity measures to identify the best possible compromise between both features while maintaining a sufficient level of details. Using Gene Ontology to annotate the gene sets, we obtained better results with node-based measures that use the terms' characteristics than with measures based on edges that link the terms. The annotation of the gene sets achieved with the node-based measures did not exhibit major differences regardless of the characteristics of terms used.


Asunto(s)
Anotación de Secuencia Molecular/métodos , Inmunidad Adaptativa/fisiología , Algoritmos , Análisis por Conglomerados , Biología Computacional/métodos , Humanos , Reconocimiento de Normas Patrones Automatizadas/métodos , Semántica
18.
Methods ; 132: 3-18, 2018 01 01.
Artículo en Inglés | MEDLINE | ID: mdl-28887085

RESUMEN

Life sciences are currently going through a great number of transformations raised by the in-going revolution in high-throughput technologies for the acquisition of data. The integration of their high dimensionality, ranging from omics to clinical data, is becoming one of the most challenging stages. It involves inter-disciplinary developments with the aim to move towards an enhanced understanding of human physiology for caring purposes. Biologists, bioinformaticians, physicians and other experts related to the healthcare domain have to accompany each step of the analysis process in order to investigate and expertise these various data. In this perspective, methods related to information visualization are gaining increasing attention within life sciences. The softwares based on these methods are now well recognized to facilitate expert users' success in carrying out their data analysis tasks. This article aims at reviewing the current methods and techniques dedicated to information visualisation and their current use in software development related to omics or/and clinical data.


Asunto(s)
Biología Computacional , Presentación de Datos , Conjuntos de Datos como Asunto , Humanos , Almacenamiento y Recuperación de la Información , Programas Informáticos
19.
J Biomed Inform ; 74: 46-58, 2017 10.
Artículo en Inglés | MEDLINE | ID: mdl-28844750

RESUMEN

In oncology, the reuse of data is confronted with the heterogeneity of terminologies. It is necessary to semantically integrate these distinct terminologies. The semantic integration by using a third terminology as a support is a conventional approach for the integration of two terminologies that are not very structured. The aim of our study was to use SNOMED CT for integrating ICD-10 and ICD-O3. We used two complementary resources, mapping tables provided by SNOMED CT and the NCI Metathesaurus, in order to find mappings between ICD-10 or ICD-O3 concepts and SNOMED CT concepts. We used the SNOMED CT structure to filter inconsistent mappings, as well as to disambiguate multiple mappings. Based on the remaining mappings, we used semantic relations from SNOMED CT to establish links between ICD-10 and ICD-O3. Overall, the coverage of ICD-O3 and ICD10 codes was over 88%. Finally, we obtained an integration of 24% (203/852) of ICD-10 concepts with 86% (888/1032) of ICD-O3 morphology concepts combined to 39% (127/330) of ICD-O3 topography concepts. Comparing our results with the 23,684 ICD-O3 pairs mapped to ICD-10 concepts in the SEER conversion file, we found 17,447 pairs of ICD-O3 concepts in common among which 11,932 pairs were integrated with the same ICD-10 concept as the SEER conversion file. The automated process leverages logical definitions of SNOMED CT concepts. While the low quality of some of these definitions impacted negatively the integration process, the identification of such situations made it possible to indirectly audit the structure of SNOMED CT.


Asunto(s)
Neoplasias/diagnóstico , Systematized Nomenclature of Medicine , Humanos , Clasificación Internacional de Enfermedades
20.
J Biomed Semantics ; 8(1): 6, 2017 Feb 07.
Artículo en Inglés | MEDLINE | ID: mdl-28173841

RESUMEN

BACKGROUND: Identifying incident cancer cases within a population remains essential for scientific research in oncology. Data produced within electronic health records can be useful for this purpose. Due to the multiplicity of providers, heterogeneous terminologies such as ICD-10 and ICD-O-3 are used for oncology diagnosis recording purpose. To enable disease identification based on these diagnoses, there is a need for integrating disease classifications in oncology. Our aim was to build a model integrating concepts involved in two disease classifications, namely ICD-10 (diagnosis) and ICD-O-3 (topography and morphology), despite their structural heterogeneity. Based on the NCIt, a "derivative" model for linking diagnosis and topography-morphology combinations was defined and built. ICD-O-3 and ICD-10 codes were then used to instantiate classes of the "derivative" model. Links between terminologies obtained through the model were then compared to mappings provided by the Surveillance, Epidemiology, and End Results (SEER) program. RESULTS: The model integrated 42% of neoplasm ICD-10 codes (excluding metastasis), 98% of ICD-O-3 morphology codes (excluding metastasis) and 68% of ICD-O-3 topography codes. For every codes instantiating at least a class in the "derivative" model, comparison with SEER mappings reveals that all mappings were actually available in the model as a link between the corresponding codes. CONCLUSIONS: We have proposed a method to automatically build a model for integrating ICD-10 and ICD-O-3 based on the NCIt. The resulting "derivative" model is a machine understandable resource that enables an integrated view of these heterogeneous terminologies. The NCIt structure and the available relationships can help to bridge disease classifications taking into account their structural and granular heterogeneities. However, (i) inconsistencies exist within the NCIt leading to misclassifications in the "derivative" model, (ii) the "derivative" model only integrates a part of ICD-10 and ICD-O-3. The NCIt is not sufficient for integration purpose and further work based on other termino-ontological resources is needed in order to enrich the model and avoid identified inconsistencies.


Asunto(s)
Clasificación Internacional de Enfermedades , National Cancer Institute (U.S.) , Neoplasias/clasificación , Humanos , Estados Unidos
SELECCIÓN DE REFERENCIAS
DETALLE DE LA BÚSQUEDA
...