Search | VHL Regional Portal

Integration of relational and textual biomedical sources. A pilot experiment using a semi-automated method for logical schema acquisition.

García-Remesal, M; Maojo, V; Billhardt, H; Crespo, J.

Methods Inf Med ; 49(4): 337-48, 2010.

Article in English | MEDLINE | ID: mdl-19936436

ABSTRACT

OBJECTIVES: Bringing together structured and text-based sources is an exciting challenge for biomedical informaticians, since most relevant biomedical sources belong to one of these categories. In this paper we evaluate the feasibility of integrating relational and text-based biomedical sources using: i) an original logical schema acquisition method for textual databases developed by the authors, and ii) OntoFusion, a system originally designed by the authors for the integration of relational sources. METHODS: We conducted an integration experiment involving a test set of seven differently structured sources covering the domain of genetic diseases. We used our logical schema acquisition method to generate schemas for all textual sources. The sources were integrated using the methods and tools provided by OntoFusion. The integration was validated using a test set of 500 queries. RESULTS: A panel of experts answered a questionnaire to evaluate i) the quality of the extracted schemas, ii) the query processing performance of the integrated set of sources, and iii) the relevance of the retrieved results. The results of the survey show that our method extracts coherent and representative logical schemas. Experts' feedback on the performance of the integrated system and the relevance of the retrieved results was also positive. Regarding the validation of the integration, the system successfully provided correct results for all queries in the test set. CONCLUSIONS: The results of the experiment suggest that text-based sources including a logical schema can be regarded as equivalent to structured databases. Using our method, previous research and existing tools designed for the integration of structured databases can be reused - possibly subject to minor modifications - to integrate differently structured sources.

Subject(s)

Data Mining/methods , Databases, Genetic , Medical Informatics/organization & administration , Natural Language Processing , Access to Information , Algorithms , Artificial Intelligence , Classification , Computer Simulation , Data Collection , Expert Testimony , Health Care Surveys , Humans , Medical Informatics/methods , Pilot Projects , Quality Assurance, Health Care , Spain , Surveys and Questionnaires , Vocabulary

An agent- and ontology-based system for integrating public gene, protein, and disease databases.

Alonso-Calvo, R; Maojo, V; Billhardt, H; Martin-Sanchez, F; García-Remesal, M; Pérez-Rey, D.

J Biomed Inform ; 40(1): 17-29, 2007 Feb.

Article in English | MEDLINE | ID: mdl-16621723

ABSTRACT

In this paper, we describe OntoFusion, a database integration system. This system has been designed to provide unified access to multiple, heterogeneous biological and medical data sources that are publicly available over Internet. Many of these databases do not offer a direct connection, and inquiries must be made via Web forms, returning results as HTML pages. A special module in the OntoFusion system is needed to integrate these public 'Web-based' databases. Domain ontologies are used to do this and provide database mapping and unification. We have used the system to integrate seven significant and widely used public biomedical databases: OMIM, PubMed, Enzyme, Prosite and Prosite documentation, PDB, SNP, and InterPro. A case study is detailed in depth, showing system performance. We analyze the system's architecture and methods and discuss its use as a tool for biomedical researchers.

Subject(s)

Biomedical Research/methods , Computational Biology/methods , Database Management Systems , Databases, Genetic , Genetic Predisposition to Disease/genetics , Genomics/methods , Information Storage and Retrieval/methods , Animals , Artificial Intelligence , Biomedical Research/trends , Computational Biology/trends , Genomics/trends , Humans , Information Storage and Retrieval/trends , Oligonucleotide Array Sequence Analysis/methods , Oligonucleotide Array Sequence Analysis/trends , Systems Integration , User-Computer Interface

Designing new methodologies for integrating biomedical information in clinical trials.

Maojo, V; García-Remesal, M; Billhardt, H; Alonso-Calvo, R; Pérez-Rey, D; Martín-Sánchez, F.

Methods Inf Med ; 45(2): 180-5, 2006.

Article in English | MEDLINE | ID: mdl-16538285

ABSTRACT

OBJECTIVES: To propose a modification to current methodologies for clinical trials, improving data collection and cost-efficiency. To describe a system to integrate distributed and heterogeneous medical and genetic databases for improving information access, retrieval and analysis of biomedical information. METHODS: Data for clinical trials can be collected from remote, distributed and heterogeneous data sources. In this distributed scenario, we propose an ontologybased approach, with two basic operations: mapping and unification. Mapping outputs the semantic model of a virtual repository with the information model of a specific database. Unification provides a single schema for two or more previously available virtual repositories. In both processes, domain ontologies can improve other traditional approaches. RESULTS: Private clinical databases and public genomic and disease databases (e.g., OMIM, Prosite and others) were integrated. We successfully tested the system using thirteen databases containing clinical and biological information and biomedical vocabularies. CONCLUSIONS: We present a domain-independent approach to biomedical database integration, used in this paper as a reference for the design of future models of clinico-genomic trials where information will be integrated, retrieved and analyzed. Such an approach to biomedical data integration has been one of the goals of the IST INFOBIOMED Network of Excellence in Biomedical Informatics, funded by the European Commission, and the new ACGT (Advanced Clinico-Genomic Trials on Cancer) project, where the authors will apply these methods to research experiments.

Subject(s)

Clinical Trials as Topic/statistics & numerical data , Computational Biology , Data Collection/methods , Humans , Research Design , Software , Spain

ONTOFUSION: ontology-based integration of genomic and clinical databases.

Pérez-Rey, D; Maojo, V; García-Remesal, M; Alonso-Calvo, R; Billhardt, H; Martin-Sánchez, F; Sousa, A.

Comput Biol Med ; 36(7-8): 712-30, 2006.

Article in English | MEDLINE | ID: mdl-16144697

ABSTRACT

ONTOFUSION is an ontology-based system designed for biomedical database integration. It is based on two processes: mapping and unification. Mapping is a semi-automated process that uses ontologies to link a database schema with a conceptual framework-named virtual schema. There are three methodologies for creating virtual schemas, according to the origin of the domain ontology used: (1) top-down--e.g. using an existing ontology, such as the UMLS or Gene Ontology--, (2) bottom-up--building a new domain ontology-- and (3) a hybrid combination. Unification is an automated process for integrating ontologies and hence the database to which they are linked. Using these methods, we employed ONTOFUSION to integrate a large number of public genomic and clinical databases, as well as biomedical ontologies.

Subject(s)

Databases, Factual , Databases, Genetic , Medical Informatics , Data Collection , Database Management Systems , Humans , User-Computer Interface

Establishing an agenda for biomedical informatics.

Maojo, V; Martin-Sanchez, F; Billhardt, H; Iakovidis, I; Kulikowski, C.

Methods Inf Med ; 42(2): 121-5, 2003.

Article in English | MEDLINE | ID: mdl-12743647

ABSTRACT

OBJECTIVE: To describe potential areas of collaboration between Medical Informatics (BI) and Bioinformatics (BI) and their effects on planning future work in both disciplines. METHODS: Some reflections on the objectives and rationale underpinning MI and BI are given, and preliminary results from the BIOINFOMED workgroup, supported by the European Commission, are introduced. RESULTS: Applications from both subfields suggest topics for sharing and exchange between the subfields within the emerging field of Biomedical Informatics. CONCLUSIONS: We suggest how the nature and degree of collaboration between the sub-disciplines can impact future work in molecular medicine.

Subject(s)

Biomedical Research , Computational Biology , Medical Informatics , Cooperative Behavior , Europe , Genetics, Medical , Humans , Molecular Biology

Theory, abstraction and design in medical informatics.

Maojo, V; Martín, F; Crespo, J; Billhardt, H.

Methods Inf Med ; 41(1): 44-50, 2002.

Article in English | MEDLINE | ID: mdl-11933763

ABSTRACT

OBJECTIVE: To analyze the scientific and engineering components of Medical Informatics. A clear characterization of these components should be undertaken to categorize different areas of Medical Informatics and create a research agenda for the future. METHODS: We have adapted a classical ACM and IEEE report on computing to analyze Medical Informatics from three different viewpoints: Theory, Abstraction, and Design. RESULTS: We suggest that Medical Informatics can be considered from these three perspectives: (1) Theory, from which medical informaticians formally characterize the properties of the objects of study, creating new theories or using and adapting existing theories (e.g., from mathematics), (2) Abstraction, from which medical informaticians deal with all aspects of medical information and create new abstractions, methods, and technology-independent models, which can be experimentally verified, and (3) Design, from which medical informaticians develop systems or act as information brokers or advisors between medical and technology professionals, to improve the quality of computer applications in medicine. CONCLUSION: Based on this framework, we suggest that Medical Informatics has an independent scientific character, different from other applied informatics areas. Finally, we analyze these three perspectives using data mining in medicine.

Subject(s)

Medical Informatics , Medical Informatics Applications , Medical Informatics Computing , Models, Theoretical , Research , Systems Theory

A comparison of machine learning methods for the diagnosis of pigmented skin lesions.

Dreiseitl, S; Ohno-Machado, L; Kittler, H; Vinterbo, S; Billhardt, H; Binder, M.

J Biomed Inform ; 34(1): 28-36, 2001 Feb.

Article in English | MEDLINE | ID: mdl-11376540

ABSTRACT

We analyze the discriminatory power of k-nearest neighbors, logistic regression, artificial neural networks (ANNs), decision tress, and support vector machines (SVMs) on the task of classifying pigmented skin lesions as common nevi, dysplastic nevi, or melanoma. Three different classification tasks were used as benchmarks: the dichotomous problem of distinguishing common nevi from dysplastic nevi and melanoma, the dichotomous problem of distinguishing melanoma from common and dysplastic nevi, and the trichotomous problem of correctly distinguishing all three classes. Using ROC analysis to measure the discriminatory power of the methods shows that excellent results for specific classification problems in the domain of pigmented skin lesions can be achieved with machine-learning methods. On both dichotomous and trichotomous tasks, logistic regression, ANNs, and SVMs performed on about the same level, with k-nearest neighbors and decision trees performing worse.

Subject(s)

Algorithms , Diagnosis, Computer-Assisted , Skin Diseases/diagnosis , Decision Trees , Humans , Logistic Models , Melanoma/diagnosis , Neural Networks, Computer , Nevus/diagnosis , Nevus, Pigmented/diagnosis , Skin Diseases/classification , Skin Neoplasms/diagnosis , Skin Pigmentation

ABSTRACT

Subject(s)

ABSTRACT

Subject(s)

ABSTRACT

Subject(s)

ABSTRACT

Subject(s)

ABSTRACT

Subject(s)

ABSTRACT

Subject(s)

ABSTRACT

Subject(s)

SEND TO:

SELECTION OF CITATIONS

SEARCH DETAIL