Your browser doesn't support javascript.
loading
Show: 20 | 50 | 100
Results 1 - 16 de 16
Filter
1.
N Biotechnol ; 77: 12-19, 2023 Nov 25.
Article in English | MEDLINE | ID: mdl-37295722

ABSTRACT

Data quality has recently become a critical topic for the research community. European guidelines recommend that scientific data should be made FAIR: findable, accessible, interoperable and reusable. However, as FAIR guidelines do not specify how the stated principles should be implemented, it might not be straightforward for researchers to know how actually to make their data FAIR. This can prevent life-science researchers from sharing their datasets and pipelines, ultimately hindering the progress of research. To address this difficulty, we developed the BIBBOX, which is a platform that supports researchers publishing their datasets and the associated software in a FAIR manner.


Subject(s)
Mobile Applications
2.
Commun Med (Lond) ; 3(1): 59, 2023 Apr 24.
Article in English | MEDLINE | ID: mdl-37095223

ABSTRACT

BACKGROUND: Presence of lymph node metastasis (LNM) influences prognosis and clinical decision-making in colorectal cancer. However, detection of LNM is variable and depends on a number of external factors. Deep learning has shown success in computational pathology, but has struggled to boost performance when combined with known predictors. METHODS: Machine-learned features are created by clustering deep learning embeddings of small patches of tumor in colorectal cancer via k-means, and then selecting the top clusters that add predictive value to a logistic regression model when combined with known baseline clinicopathological variables. We then analyze performance of logistic regression models trained with and without these machine-learned features in combination with the baseline variables. RESULTS: The machine-learned extracted features provide independent signal for the presence of LNM (AUROC: 0.638, 95% CI: [0.590, 0.683]). Furthermore, the machine-learned features add predictive value to the set of 6 clinicopathologic variables in an external validation set (likelihood ratio test, p < 0.00032; AUROC: 0.740, 95% CI: [0.701, 0.780]). A model incorporating these features can also further risk-stratify patients with and without identified metastasis (p < 0.001 for both stage II and stage III). CONCLUSION: This work demonstrates an effective approach to combine deep learning with established clinicopathologic factors in order to identify independently informative features associated with LNM. Further work building on these specific results may have important impact in prognostication and therapeutic decision making for LNM. Additionally, this general computational approach may prove useful in other contexts.


When colorectal cancers spread to the lymph nodes, it can indicate a poorer prognosis. However, detecting lymph node metastasis (spread) can be difficult and depends on a number of factors such as how samples are taken and processed. Here, we show that machine learning, which involves computer software learning from patterns in data, can predict lymph node metastasis in patients with colorectal cancer from the microscopic appearance of their primary tumor and the clinical characteristics of the patients. We also show that the same approach can predict patient survival. With further work, our approach may help clinicians to inform patients about their prognosis and decide on appropriate treatments.

3.
JAMA Netw Open ; 6(3): e2254891, 2023 03 01.
Article in English | MEDLINE | ID: mdl-36917112

ABSTRACT

Importance: Identifying new prognostic features in colon cancer has the potential to refine histopathologic review and inform patient care. Although prognostic artificial intelligence systems have recently demonstrated significant risk stratification for several cancer types, studies have not yet shown that the machine learning-derived features associated with these prognostic artificial intelligence systems are both interpretable and usable by pathologists. Objective: To evaluate whether pathologist scoring of a histopathologic feature previously identified by machine learning is associated with survival among patients with colon cancer. Design, Setting, and Participants: This prognostic study used deidentified, archived colorectal cancer cases from January 2013 to December 2015 from the University of Milano-Bicocca. All available histologic slides from 258 consecutive colon adenocarcinoma cases were reviewed from December 2021 to February 2022 by 2 pathologists, who conducted semiquantitative scoring for tumor adipose feature (TAF), which was previously identified via a prognostic deep learning model developed with an independent colorectal cancer cohort. Main Outcomes and Measures: Prognostic value of TAF for overall survival and disease-specific survival as measured by univariable and multivariable regression analyses. Interpathologist agreement in TAF scoring was also evaluated. Results: A total of 258 colon adenocarcinoma histopathologic cases from 258 patients (138 men [53%]; median age, 67 years [IQR, 65-81 years]) with stage II (n = 119) or stage III (n = 139) cancer were included. Tumor adipose feature was identified in 120 cases (widespread in 63 cases, multifocal in 31, and unifocal in 26). For overall survival analysis after adjustment for tumor stage, TAF was independently prognostic in 2 ways: TAF as a binary feature (presence vs absence: hazard ratio [HR] for presence of TAF, 1.55 [95% CI, 1.07-2.25]; P = .02) and TAF as a semiquantitative categorical feature (HR for widespread TAF, 1.87 [95% CI, 1.23-2.85]; P = .004). Interpathologist agreement for widespread TAF vs lower categories (absent, unifocal, or multifocal) was 90%, corresponding to a κ metric at this threshold of 0.69 (95% CI, 0.58-0.80). Conclusions and Relevance: In this prognostic study, pathologists were able to learn and reproducibly score for TAF, providing significant risk stratification on this independent data set. Although additional work is warranted to understand the biological significance of this feature and to establish broadly reproducible TAF scoring, this work represents the first validation to date of human expert learning from machine learning in pathology. Specifically, this validation demonstrates that a computationally identified histologic feature can represent a human-identifiable, prognostic feature with the potential for integration into pathology practice.


Subject(s)
Adenocarcinoma , Colonic Neoplasms , Male , Humans , Aged , Colonic Neoplasms/diagnosis , Pathologists , Artificial Intelligence , Machine Learning , Risk Assessment
4.
Mod Pathol ; 35(1): 87-95, 2022 01.
Article in English | MEDLINE | ID: mdl-34645984

ABSTRACT

Focal nodular hyperplasia (FNH) is a polyclonal tumour-like hepatic lesion characterised by parenchymal nodules, connective tissue septa without interlobular bile ducts, pronounced ductular reaction and inflammation. It may represent a response to local arterial hyperperfusion and hyperoxygenation resulting in oxidative stress. We aimed at obtaining closer insight into the pathogenesis of FNH with its characteristic morphologic features. Immunohistochemistry and immunofluorescence microscopy was performed on FNH specimens using antibodies against keratins (K) 7 and 19, neural cell adhesion molecule (NCAM), lamin B1, senescence markers (CDK inhibitor 1/p21Cip1, CDK inhibitor /p16Ink4a, senescence-associated (SA) ß- galactosidase activity), proliferation markers (Ki-67, proliferating-cell nuclear antigen (PCNA)), and the abnormally phosphorylated histone γ-H2AX, indicating DNA double strand breaks; moreover SA ß- galactosidase activity was determined histochemically. Ductular metaplasia of hepatocytes indicated by K7 expression in the absence of K19 plays a major role in the development of ductular reaction in FNH. Moreover, the expression of senescence markers (p21Cip1, p16Ink4a, γ-H2AX, SA ß-galactosidase activity) in hepatocytes and cholangiocytes suggests that stress-induced cellular senescence contributes to fibrosis and inflammation via production of components of the senescence-associated secretory phenotype. Expression of proliferation markers (Ki-67, PCNA) was not enhanced in hepatocytes and biliary cells. Senescence and ductular metaplasia of hepatocytes may thus be involved in inflammation, fibrosis and apoptosis resistance. Hence, fibrosis, inflammation and reduced apoptotic cell death, rather than proliferation (hyperplasia) may be responsible for increased tissue mass and tumour-like appearance of FNH.


Subject(s)
Bile Ducts/pathology , Focal Nodular Hyperplasia/pathology , Liver/pathology , Adult , Cellular Senescence , Female , Frozen Sections , Genes, p16/physiology , Hepatocytes/metabolism , Humans , Immunohistochemistry , Keratin-19/metabolism , Keratin-7/immunology , Keratin-7/metabolism , Ki-67 Antigen/immunology , Male , Middle Aged , Neural Cell Adhesion Molecules/immunology , Young Adult , beta-Galactosidase/metabolism
5.
Biopreserv Biobank ; 19(5): 414-421, 2021 Oct.
Article in English | MEDLINE | ID: mdl-34182766

ABSTRACT

Various biological resources, such as biobanks and disease-specific registries, have become indispensable resources to better understand the epidemiology and biological mechanisms of disease and are fundamental for advancing medical research. Nevertheless, biobanks and similar resources still face significant challenges to become more findable and accessible by users on both national and global scales. One of the main challenges for users is to find relevant resources using cataloging and search services such as the BBMRI-ERIC Directory, operated by European Research Infrastructure on Biobanking and Biomolecular Resources (BBMRI-ERIC), as these often do not contain the information needed by the researchers to decide if the resource has relevant material/data; these resources are only weakly characterized. Hence, the researcher is typically left with too many resources to explore and investigate. In addition, resources often have complex procedures for accessing holdings, particularly for depletable biological materials. This article focuses on designing a system for effective negotiation of access to holdings, in which a researcher can approach many resources simultaneously, while giving each resource team the ability to implement their own mechanisms to check if the material/data are available and to decide if access should be provided. The BBMRI-ERIC has developed and implemented an access and negotiation tool called the BBMRI-ERIC Negotiator. The Negotiator enables access negotiation to more than 600 biobanks from the BBMRI-ERIC Directory and other discovery services such as GBA/BBMRI-ERIC Locator or RD-Connect Finder. This article summarizes the principles that guided the design of the tool, the terminology used and underlying data model, request workflows, authentication and authorization mechanism(s), and the mechanisms and monitoring processes to stimulate the desired behavior of the resources: to effectively deliver access to biological material and data.


Subject(s)
Biological Specimen Banks , Biomedical Research , Information Dissemination
6.
NPJ Digit Med ; 4(1): 71, 2021 Apr 19.
Article in English | MEDLINE | ID: mdl-33875798

ABSTRACT

Deriving interpretable prognostic features from deep-learning-based prognostic histopathology models remains a challenge. In this study, we developed a deep learning system (DLS) for predicting disease-specific survival for stage II and III colorectal cancer using 3652 cases (27,300 slides). When evaluated on two validation datasets containing 1239 cases (9340 slides) and 738 cases (7140 slides), respectively, the DLS achieved a 5-year disease-specific survival AUC of 0.70 (95% CI: 0.66-0.73) and 0.69 (95% CI: 0.64-0.72), and added significant predictive value to a set of nine clinicopathologic features. To interpret the DLS, we explored the ability of different human-interpretable features to explain the variance in DLS scores. We observed that clinicopathologic features such as T-category, N-category, and grade explained a small fraction of the variance in DLS scores (R2 = 18% in both validation sets). Next, we generated human-interpretable histologic features by clustering embeddings from a deep-learning-based image-similarity model and showed that they explained the majority of the variance (R2 of 73-80%). Furthermore, the clustering-derived feature most strongly associated with high DLS scores was also highly prognostic in isolation. With a distinct visual appearance (poorly differentiated tumor cell clusters adjacent to adipose tissue), this feature was identified by annotators with 87.0-95.5% accuracy. Our approach can be used to explain predictions from a prognostic deep learning model and uncover potentially-novel prognostic features that can be reliably identified by people for future validation studies.

7.
Commun Med (Lond) ; 1: 10, 2021.
Article in English | MEDLINE | ID: mdl-35602201

ABSTRACT

Background: Gleason grading of prostate cancer is an important prognostic factor, but suffers from poor reproducibility, particularly among non-subspecialist pathologists. Although artificial intelligence (A.I.) tools have demonstrated Gleason grading on-par with expert pathologists, it remains an open question whether and to what extent A.I. grading translates to better prognostication. Methods: In this study, we developed a system to predict prostate cancer-specific mortality via A.I.-based Gleason grading and subsequently evaluated its ability to risk-stratify patients on an independent retrospective cohort of 2807 prostatectomy cases from a single European center with 5-25 years of follow-up (median: 13, interquartile range 9-17). Results: Here, we show that the A.I.'s risk scores produced a C-index of 0.84 (95% CI 0.80-0.87) for prostate cancer-specific mortality. Upon discretizing these risk scores into risk groups analogous to pathologist Grade Groups (GG), the A.I. has a C-index of 0.82 (95% CI 0.78-0.85). On the subset of cases with a GG provided in the original pathology report (n = 1517), the A.I.'s C-indices are 0.87 and 0.85 for continuous and discrete grading, respectively, compared to 0.79 (95% CI 0.71-0.86) for GG obtained from the reports. These represent improvements of 0.08 (95% CI 0.01-0.15) and 0.07 (95% CI 0.00-0.14), respectively. Conclusions: Our results suggest that A.I.-based Gleason grading can lead to effective risk stratification, and warrants further evaluation for improving disease management.

8.
Eur J Hum Genet ; 28(6): 728-731, 2020 06.
Article in English | MEDLINE | ID: mdl-32444797

ABSTRACT

During the COVID-19 pandemic, the European biobanking infrastructure is in a unique position to preserve valuable biological material complemented with detailed data for future research purposes. Biobanks can be either integrated into healthcare, where preservation of the biological material is a fork in clinical routine diagnostics and medical treatment processes or they can also host prospective cohorts or material related to clinical trials. The paper discussed objectives of BBMRI-ERIC, the European research infrastructure established to facilitate access to quality-defined biological materials and data for research purposes, with respect to the COVID-19 crisis: (a) to collect information on available European as well as non-European COVID-19-relevant biobanking resources in BBMRI-ERIC Directory and to facilitate access to these via BBMRI-ERIC Negotiator platform; (b) to help harmonizing guidelines on how data and biological material is to be collected to maximize utility for future research, including large-scale data processing in artificial intelligence, by participating in activities such as COVID-19 Host Genetics Initiative; (c) to minimize risks for all involved parties dealing with (potentially) infectious material by developing recommendations and guidelines; (d) to provide a European-wide platform of exchange in relation to ethical, legal, and societal issues (ELSI) specific to the collection of biological material and data during the COVID-19 pandemic.


Subject(s)
Betacoronavirus/pathogenicity , Biomedical Research/organization & administration , Coronavirus Infections/epidemiology , Information Dissemination/methods , International Cooperation/legislation & jurisprudence , Pandemics , Pneumonia, Viral/epidemiology , Antiviral Agents/therapeutic use , Artificial Intelligence , Betacoronavirus/drug effects , Betacoronavirus/genetics , Biological Specimen Banks/supply & distribution , COVID-19 , Clinical Trials as Topic , Coronavirus Infections/diagnosis , Coronavirus Infections/drug therapy , Coronavirus Infections/genetics , Datasets as Topic , Europe/epidemiology , Humans , Information Dissemination/ethics , Pneumonia, Viral/diagnosis , Pneumonia, Viral/drug therapy , Pneumonia, Viral/genetics , Practice Guidelines as Topic , Public Health/economics , SARS-CoV-2
9.
Biochim Biophys Acta Mol Basis Dis ; 1865(2): 308-321, 2019 02 01.
Article in English | MEDLINE | ID: mdl-30419338

ABSTRACT

Biliary tract cancer (BTC) represents a malignant tumor of the biliary tract including cholangiocarcinoma (CCA) and the carcinoma of the gallbladder (GBC) with a 5-year survival rate between 5 and 18% due to late diagnosis and rapid disease progression. Chronic inflammation is one of the main risk factors for CCA and GBC in particular. IL-6, as a mediator of inflammation, can act through a membrane-bound receptor alpha-chain (mIL-6R, "IL-6 classic signaling") or via soluble forms (sIL-6R, "IL-6 trans-signaling"). However, little is known about the impact on cellular responses of IL-6 trans-signaling on BTC. We analyzed primary tumors as whole sections and as tissue microarrays, and also searched The Cancer Genome Atlas database. Compared to non-neoplastic, non-inflamed gallbladder tissue, IL-6Rα was downregulated in GBC, and this correlated with the patients' overall survival. Furthermore, different CCA cell lines and compounds for activation (IL-6 and Hyper-IL-6) or inhibition (Tocilizumab and sgp130Fc) of IL-6 classic signaling and trans-signaling were used to determine their effects on cellular processes between the two modes of IL-6 signaling. Inhibition of IL-6 trans-signaling by sgp130Fc reduced CCA cell line viability and apoptosis, whereas migration and proliferation were increased. We conclude that IL-6Rα expression is a good prognostic marker for GBC, and that the blocking of IL-6 trans-signaling and activation of IL-6 classic signaling have tumor promoting activity. These findings warrant the exclusion of patients with GBC or other malignancies associated with bile metabolism from IL-6R inhibitor therapy.


Subject(s)
Bile Duct Neoplasms/metabolism , Bile Duct Neoplasms/pathology , Cholangiocarcinoma/metabolism , Cholangiocarcinoma/pathology , Receptors, Interleukin-6/antagonists & inhibitors , Aged , Antibodies, Monoclonal, Humanized/pharmacology , Apoptosis/drug effects , Cell Line, Tumor , Cell Movement/drug effects , Cell Proliferation/drug effects , Cell Survival/drug effects , Down-Regulation/drug effects , Down-Regulation/genetics , Female , G2 Phase/drug effects , Gallbladder/metabolism , Gallbladder/pathology , Humans , Interleukin-6/metabolism , Male , Middle Aged , Mitosis/drug effects , Models, Biological , Phosphorylation/drug effects , Phosphotyrosine/metabolism , Receptors, Interleukin-6/metabolism , Recombinant Fusion Proteins/pharmacology , STAT3 Transcription Factor/metabolism , Signal Transduction/drug effects , Survival Analysis
10.
Eur J Hum Genet ; 26(5): 631-643, 2018 05.
Article in English | MEDLINE | ID: mdl-29396563

ABSTRACT

In rare disease (RD) research, there is a huge need to systematically collect biomaterials, phenotypic, and genomic data in a standardized way and to make them findable, accessible, interoperable and reusable (FAIR). RD-Connect is a 6 years global infrastructure project initiated in November 2012 that links genomic data with patient registries, biobanks, and clinical bioinformatics tools to create a central research resource for RDs. Here, we present RD-Connect Registry & Biobank Finder, a tool that helps RD researchers to find RD biobanks and registries and provide information on the availability and accessibility of content in each database. The finder concentrates information that is currently sparse on different repositories (inventories, websites, scientific journals, technical reports, etc.), including aggregated data and metadata from participating databases. Aggregated data provided by the finder, if appropriately checked, can be used by researchers who are trying to estimate the prevalence of a RD, to organize a clinical trial on a RD, or to estimate the volume of patients seen by different clinical centers. The finder is also a portal to other RD-Connect tools, providing a link to the RD-Connect Sample Catalogue, a large inventory of RD biological samples available in participating biobanks for RD research. There are several kinds of users and potential uses for the RD-Connect Registry & Biobank Finder, including researchers collaborating with academia and the industry, dealing with the questions of basic, translational, and/or clinical research. As of November 2017, the finder is populated with aggregated data for 222 registries and 21 biobanks.


Subject(s)
Computational Biology , Genomics , Metadata , Rare Diseases/genetics , Biological Specimen Banks , Biomedical Research , Databases, Factual , Humans , Information Dissemination , Patients , Rare Diseases/blood , Rare Diseases/epidemiology , Registries
11.
Oncotarget ; 8(52): 89736-89745, 2017 Oct 27.
Article in English | MEDLINE | ID: mdl-29163784

ABSTRACT

Overexpression of the oncofetal insulin-like growth factor 2 mRNA-binding protein 2 (IMP2/IGF2BP2) has been described in different cancer types. Gallbladder carcinoma (GBC) is a rare but highly aggressive cancer entity with late clinical detection and poor prognosis. The aim of this study was to investigate the role of IMP2 in human GBC. Tissue microarrays (TMAs) of an international multi-center GBC sample collection from n = 483 patients were analyzed by immunohistochemistry. IMP2 immunoreactivity was found in 74.3% of the tumor samples on TMA, of which 14.0% showed strong and 86.0% low staining intensity. 72.4% of the tumor samples were IMP1 positive, but IMP1 showed lower expression in tumor tissue compared to control tissues. IMP3 immunoreactivity was observed in 92.7% of all tumors, of which 53.6% revealed strong IMP3 expression. Kaplan-Meier analysis linked high IMP2 expression to shorter survival time (p = 0.033), whereas neither IMP1 nor IMP3 expression was linked to a decreased survival time. Eight different human biliary tract cancer (BTC) cell lines were evaluated for tumor growth kinetics in mouse xenografts. Cell lines with high IMP2 expression levels showed the fastest increase in tumor volumes in murine xenografts. Furthermore, IMP2 expression in these cells correlated with the generation of reactive oxygen species (ROS) and RAC1 expression in BTC cells, suggesting RAC1-induced ROS generation as a potential mechanism of IMP2-promoted progression of GBC. In conclusion, IMP2 is frequently overexpressed in GBC and significantly associated with poor prognosis and growth rates in vivo. IMP2 might therefore represent a new target for the treatment of advanced GBC.

12.
Biopreserv Biobank ; 15(4): 332-340, 2017 Aug.
Article in English | MEDLINE | ID: mdl-28380303

ABSTRACT

INTRODUCTION: Sample collections and data are hosted within different biobanks at diverse institutions across Europe. Our data integration framework aims at incorporating data about sample collections from different biobanks into a common research infrastructure, facilitating researchers' abilities to obtain high-quality samples to conduct their research. The resulting information must be locally gathered and distributed to searchable higher level information biobank directories to maximize the visibility on the national and European levels. Therefore, biobanks and sample collections must be clearly described and unambiguously identified. We describe how to tackle the challenges of integrating biobank-related data between biobank directories using heterogeneous data schemas and different technical environments. METHODS: To establish a data exchange infrastructure between all biobank directories involved, we propose the following steps: (A) identification of core entities, terminology, and semantic relationships, (B) harmonization of heterogeneous data schemas of different Biobanking and Biomolecular Resources Research Infrastructure (BBMRI) directories, and (C) formulation of technical core principles for biobank data exchange between directories. RESULTS: (A) We identified the major core elements to describe biobanks in biobank directories. Since all directory data models were partially based on Minimum Information About BIobank Data Sharing (MIABIS) 2.0, the MIABIS 2.0 core model was used for compatibility. (B) Different projection scenarios were elaborated in collaboration with all BBMRI.at partners. A minimum set of mandatory and optional core entities and data items was defined for mapping across all directory levels. (C) Major core data exchange principles were formulated and data interfaces implemented by all biobank directories involved. DISCUSSION: We agreed on a MIABIS 2.0-based core set of harmonized biobank attributes and established a list of data exchange core principles for integrating biobank directories on different levels. This generic approach and the data exchange core principles proposed herein can also be applied in related tasks like integration and harmonization of biobank data on the individual sample and patient levels.


Subject(s)
Biological Specimen Banks , Information Dissemination/methods , Specimen Handling/methods , Austria , Humans
13.
Health Technol (Berl) ; 7(1): 81-88, 2017.
Article in English | MEDLINE | ID: mdl-28344914

ABSTRACT

In this paper an automatic classification system for pathological findings is presented. The starting point in our undertaking was a pathologic tissue collection with about 1.4 million tissue samples described by free text records over 23 years. Exploring knowledge out of this "big data" pool is a challenging task, especially when dealing with unstructured data spanning over many years. The classification is based on an ontology-based term extraction and decision tree build with a manually curated classification system. The information extracting system is based on regular expressions and a text substitution system. We describe the generation of the decision trees by medical experts using a visual editor. Also the evaluation of the classification process with a reference data set is described. We achieved an F-Score of 89,7% for ICD-10 and an F-Score of 94,7% for ICD-O classification. For the information extraction of the tumor staging and receptors we achieved am F-Score ranging from 81,8 to 96,8%.

14.
Health Technol (Berl) ; 7(1): 89-95, 2017.
Article in English | MEDLINE | ID: mdl-28344915

ABSTRACT

The domain of biobanking has gone through many stages and as a result there are a wide range of commercial and open source software solutions available. The utilization of these software tools requires different levels of domain and technical skills for installation, configuration and ultimate us of these biobank software tools. To compound this complexity the biobanking community are required to work together in order to share knowledge and jointly build solutions to underpin the research infrastructure. We have evaluated the available tools, described them in a catalogue (BiobankApps) and made a selection of tools available to biobanks in a reference toolbox (BIBBOX) that are use-case driven. In the BiobankApps tool catalogue, both commercial and open source software solutions related to the biobanking domain are included, classified and evaluated. The evaluation covers: 1) "user review" by an authenticated user 2) domain expert: quick analysis by BBMRI members and 3) domain expert: detailed analysis and test installation with real world data. The evaluation is paired with a survey across the more "advanced" (from a technology perspective) biobanks to investigate what tools are currently used and summarises known benefits/drawbacks of the respective packages. In the second step we recommend tools for specific use cases, and install, configure and connect these in the BIBBOX framework. This service also builds on the existing work in the United Kingdom in seeking to establish the motivations for different stakeholders to become involved and therefore assisting in prioritising the use-cases based on the level of need and support within the research community. All tools associated to a use-case are available as BIBBOX applications (technically this is achieved by docker containers), which are integrated in the BIBBOX framework with central identification and user management. In future work we plan to share the acquired knowledge with other networks, develop an Application Programmable Interface (API) for the exchange of metadata with other tool catalogues and work on an ontology for the evaluation of biobank software.

16.
BMC Bioinformatics ; 15 Suppl 6: S5, 2014.
Article in English | MEDLINE | ID: mdl-25079119

ABSTRACT

BACKGROUND: This paper presents multilevel data glyphs optimized for the interactive knowledge discovery and visualization of large biomedical data sets. Data glyphs are three- dimensional objects defined by multiple levels of geometric descriptions (levels of detail) combined with a mapping of data attributes to graphical elements and methods, which specify their spatial position. METHODS: In the data mapping phase, which is done by a biomedical expert, meta information about the data attributes (scale, number of distinct values) are compared with the visual capabilities of the graphical elements in order to give a feedback to the user about the correctness of the variable mapping. The spatial arrangement of glyphs is done in a dimetric view, which leads to high data density, a simplified 3D navigation and avoids perspective distortion. RESULTS: We show the usage of data glyphs in the disease analyser a visual analytics application for personalized medicine and provide an outlook to a biomedical web visualization scenario. CONCLUSIONS: Data glyphs can be successfully applied in the disease analyser for the analysis of big medical data sets. Especially the automatic validation of the data mapping, selection of subgroups within histograms and the visual comparison of the value distributions were seen by experts as an important functionality.


Subject(s)
Medical Informatics/methods , Data Mining , Humans , Internet , Medical Informatics/instrumentation
SELECTION OF CITATIONS
SEARCH DETAIL
...