RESUMEN
The biomedical research community relies on a diverse set of resources, both within their own institutions and at other research centers. In addition, an increasing number of shared electronic resources have been developed. Without effective means to locate and query these resources, it is challenging, if not impossible, for investigators to be aware of the myriad resources available, or to effectively perform resource discovery when the need arises. In this paper, we describe the development and use of the Biomedical Resource Ontology (BRO) to enable semantic annotation and discovery of biomedical resources. We also describe the Resource Discovery System (RDS) which is a federated, inter-institutional pilot project that uses the BRO to facilitate resource discovery on the Internet. Through the RDS framework and its associated Biositemaps infrastructure, the BRO facilitates semantic search and discovery of biomedical resources, breaking down barriers and streamlining scientific research that will improve human health.
Asunto(s)
Investigación Biomédica , Sistemas de Administración de Bases de Datos , Documentación , Informática Médica , Investigación Biomédica Traslacional , Animales , Biología Computacional , Humanos , Internet , Semántica , Interfaz Usuario-ComputadorRESUMEN
BACKGROUND: Recent advances in genomics, proteomics, and the increasing demands for biomarker validation studies have catalyzed changes in the landscape of cancer research, fueling the development of tissue banks for translational research. A result of this transformation is the need for sufficient quantities of clinically annotated and well-characterized biospecimens to support the growing needs of the cancer research community. Clinical annotation allows samples to be better matched to the research question at hand and ensures that experimental results are better understood and can be verified. To facilitate and standardize such annotation in bio-repositories, we have combined three accepted and complementary sets of data standards: the College of American Pathologists (CAP) Cancer Checklists, the protocols recommended by the Association of Directors of Anatomic and Surgical Pathology (ADASP) for pathology data, and the North American Association of Central Cancer Registry (NAACCR) elements for epidemiology, therapy and follow-up data. Combining these approaches creates a set of International Standards Organization (ISO) - compliant Common Data Elements (CDEs) for the mesothelioma tissue banking initiative supported by the National Institute for Occupational Safety and Health (NIOSH) of the Center for Disease Control and Prevention (CDC). METHODS: The purpose of the project is to develop a core set of data elements for annotating mesothelioma specimens, following standards established by the CAP checklist, ADASP cancer protocols, and the NAACCR elements. We have associated these elements with modeling architecture to enhance both syntactic and semantic interoperability. The system has a Java-based multi-tiered architecture based on Unified Modeling Language (UML). RESULTS: Common Data Elements were developed using controlled vocabulary, ontology and semantic modeling methodology. The CDEs for each case are of different types: demographic, epidemiologic data, clinical history, pathology data including block level annotation, and follow-up data including treatment, recurrence and vital status. The end result of such an effort would eventually provide an increased sample set to the researchers, and makes the system interoperable between institutions. CONCLUSION: The CAP, ADASP and the NAACCR elements represent widely established data elements that are utilized in many cancer centers. Herein, we have shown these representations can be combined and formalized to create a core set of annotations for banked mesothelioma specimens. Because these data elements are collected as part of the normal workflow of a medical center, data sets developed on the basis of these elements can be easily implemented and maintained.
Asunto(s)
Aplicaciones de la Informática Médica , Mesotelioma , Neoplasias Pleurales , Bancos de Tejidos , Biología Computacional , Bases de Datos como Asunto , Humanos , Programas Informáticos , Integración de SistemasRESUMEN
BACKGROUND: Advances in translational research have led to the need for well characterized biospecimens for research. The National Mesothelioma Virtual Bank is an initiative which collects annotated datasets relevant to human mesothelioma to develop an enterprising biospecimen resource to fulfill researchers' need. METHODS: The National Mesothelioma Virtual Bank architecture is based on three major components: (a) common data elements (based on College of American Pathologists protocol and National North American Association of Central Cancer Registries standards), (b) clinical and epidemiologic data annotation, and (c) data query tools. These tools work interoperably to standardize the entire process of annotation. The National Mesothelioma Virtual Bank tool is based upon the caTISSUE Clinical Annotation Engine, developed by the University of Pittsburgh in cooperation with the Cancer Biomedical Informatics Grid (caBIG, see http://cabig.nci.nih.gov). This application provides a web-based system for annotating, importing and searching mesothelioma cases. The underlying information model is constructed utilizing Unified Modeling Language class diagrams, hierarchical relationships and Enterprise Architect software. RESULT: The database provides researchers real-time access to richly annotated specimens and integral information related to mesothelioma. The data disclosed is tightly regulated depending upon users' authorization and depending on the participating institute that is amenable to the local Institutional Review Board and regulation committee reviews. CONCLUSION: The National Mesothelioma Virtual Bank currently has over 600 annotated cases available for researchers that include paraffin embedded tissues, tissue microarrays, serum and genomic DNA. The National Mesothelioma Virtual Bank is a virtual biospecimen registry with robust translational biomedical informatics support to facilitate basic science, clinical, and translational research. Furthermore, it protects patient privacy by disclosing only de-identified datasets to assure that biospecimens can be made accessible to researchers.
Asunto(s)
Mesotelioma/diagnóstico , Mesotelioma/patología , Neoplasias Pleurales/diagnóstico , Neoplasias Pleurales/patología , Bancos de Tejidos , Biología Computacional/métodos , ADN/metabolismo , Bases de Datos como Asunto , Humanos , National Institutes of Health (U.S.) , Análisis de Secuencia por Matrices de Oligonucleótidos , Parafina , Estudios Prospectivos , Estudios Retrospectivos , Programas Informáticos , Estados Unidos , Interfaz Usuario-ComputadorRESUMEN
The National Mesothelioma Virtual Bank (NMVB), developed six years ago, gathers clinically annotated human mesothelioma specimens for basic and clinical science research. During this period, this resource has greatly increased its collection of specimens by expanding the number of contributing academic health centers including New York University, University of Pennsylvania, University of Pittsburgh Medical Center, and Mount Sinai School of Medicine. Marketing efforts at both national and international annual conferences increase awareness and availability of the mesothelioma specimens at no cost to approved investigators, who query the web-based NMVB database for cumulative and appropriate patient clinicopathological information on the specimens. The data disclosure and specimen distribution protocols are tightly regulated to maintain compliance with participating institutions' IRB and regulatory committee reviews. The NMVB currently has over 1120 annotated cases available for researchers, including paraffin embedded tissues, fresh frozen tissue, tissue microarrays (TMA), blood samples, and genomic DNA. In addition, the resource offers expertise and assistance for collaborative research. Furthermore, in the last six years, the resource has provided hundreds of specimens to the research community. The investigators can request specimens and/or data by submitting a Letter of Intent (LOI) that is evaluated by NMVB research evaluation panel (REP).