Your browser doesn't support javascript.
loading
Mostrar: 20 | 50 | 100
Resultados 1 - 20 de 63
Filtrar
1.
Front Bioinform ; 4: 1331043, 2024.
Artigo em Inglês | MEDLINE | ID: mdl-38375239

RESUMO

Current visualizations in microbiome research rely on aggregations in taxonomic classifications or do not show less abundant taxa. We introduce Snowflake: a new visualization method that creates a clear overview of the microbiome composition in collected samples without losing any information due to classification or neglecting less abundant reads. Snowflake displays every observed OTU/ASV in the microbiome abundance table and provides a solution to include the data's hierarchical structure and additional information obtained from downstream analysis (e.g., alpha- and beta-diversity) and metadata. Based on the value-driven ICE-T evaluation methodology, Snowflake was positively received. Experts in microbiome research found the visualizations to be user-friendly and detailed and liked the possibility of including and relating additional information to the microbiome's composition. Exploring the topological structure of the microbiome abundance table allows them to quickly identify which taxa are unique to specific samples and which are shared among multiple samples (i.e., separating sample-specific taxa from the core microbiome), and see the compositional differences between samples. An R package for constructing and visualizing Snowflake microbiome composition graphs is available at https://gitlab.com/vda-lab/snowflake.

2.
IEEE Trans Vis Comput Graph ; 30(1): 649-660, 2024 Jan.
Artigo em Inglês | MEDLINE | ID: mdl-37934634

RESUMO

This paper is a call to action for research and discussion on data visualization education. As visualization evolves and spreads through our professional and personal lives, we need to understand how to support and empower a broad and diverse community of learners in visualization. Data Visualization is a diverse and dynamic discipline that combines knowledge from different fields, is tailored to suit diverse audiences and contexts, and frequently incorporates tacit knowledge. This complex nature leads to a series of interrelated challenges for data visualization education. Driven by a lack of consolidated knowledge, overview, and orientation for visualization education, the 21 authors of this paper-educators and researchers in data visualization-identify and describe 19 challenges informed by our collective practical experience. We organize these challenges around seven themes People, Goals & Assessment, Environment, Motivation, Methods, Materials, and Change. Across these themes, we formulate 43 research questions to address these challenges. As part of our call to action, we then conclude with 5 cross-cutting opportunities and respective action items: embrace DIVERSITY+INCLUSION, build COMMUNITIES, conduct RESEARCH, act AGILE, and relish RESPONSIBILITY. We aim to inspire researchers, educators and learners to drive visualization education forward and discuss why, how, who and where we educate, as we learn to use visualization to address challenges across many scales and many domains in a rapidly changing world: viseducationchallenges.github.io.

3.
PLoS One ; 18(12): e0295361, 2023.
Artigo em Inglês | MEDLINE | ID: mdl-38096184

RESUMO

One of the challenges in multi-omics data analysis for precision medicine is the efficient exploration of undiscovered molecular interactions in disease processes. We present BioMOBS, a workflow consisting of two data visualization tools integrated with an open-source molecular information database to perform clinically relevant analyses (https://github.com/driesheylen123/BioMOBS). We performed exploratory pathway analysis with BioMOBS and demonstrate its ability to generate relevant molecular hypotheses, by reproducing recent findings in type 2 diabetes UK biobank data. The central visualisation tool, where data-driven and literature-based findings can be integrated, is available within the github link as well. BioMOBS is a workflow that leverages information from multiple data-driven interactive analyses and visually integrates it with established pathway knowledge. The demonstrated use cases place trust in the usage of BioMOBS as a procedure to offer clinically relevant insights in disease pathway analyses on various types of omics data.


Assuntos
Diabetes Mellitus Tipo 2 , Software , Humanos , Multiômica , Fluxo de Trabalho
4.
Eur Urol Open Sci ; 56: 39-46, 2023 Oct.
Artigo em Inglês | MEDLINE | ID: mdl-37822515

RESUMO

Background: The Prostate Cancer Radiological Estimation of Change in Sequential Evaluation (PRECISE) score has been developed to standardise prostate magnetic resonance imaging (MRI) reporting in men on active surveillance (AS) for prostate cancer (PCa). Objective: To evaluate the feasibility of PRECISE scoring and assess its diagnostic accuracy. Design setting and participants: All PCa patients on AS with a baseline MRI and at least one follow-up MRI scan between January 2008 and September 2022 at a single tertiary referral centre were included in a database. The follow-up protocol of the Prostate Cancer International Active Surveillance (PRIAS) study was used. All scans were retrospectively re-reported by a dedicated uroradiologist and appointed a Prostate Imaging Reporting and Data System (version 2.1) and PRECISE score. Outcome measurements and statistical analysis: Clinically significant progression was defined by histopathological upgrading (on biopsy or radical prostatectomy) to grade group ≥3 and/or evolution to T3 stage. A survival analysis was performed to assess differential progression-free survival (PFS) according to the PRECISE score. Results and limitations: A total of 188 patients were included for an analysis with a total of 358 repeat MRI scans and 144 repeat biopsies. The median follow-up was 46 mo (interquartile range 21-74). Radiological progression (PRECISE 4-5) had sensitivity, specificity, negative predictive value, and positive predictive value of, respectively, 78%, 70%, 90%, and 49% for clinically significant progression. Four-year PFS was 91% for PRECISE 1-3 versus 66% for PRECISE 4-5 (p < 0.001). In total, 137 patients underwent a confirmation MRI scan within 18 mo after diagnosis. Four-year PFS in this group was 81% for PRECISE 1-3 versus 43% for PRECISE 4-5 (p < 0.001). Limitations include retrospective design and no strict adherence to AS protocol. Conclusions: Implementation of PRECISE scoring for PCa patients on AS is feasible and offers a prognostic value. Patients with PRECISE score 4-5 on confirmation MRI within 18 mo after diagnosis have a three-fold higher risk of clinically significant progression after 4 yr. Patient summary: Patients with low-risk prostate cancer can be followed up carefully. In this study, we evaluate the standardised reporting of repeat magnetic resonance imaging scans (using the Prostate Cancer Radiological Estimation of Change in Sequential Evaluation [PRECISE] recommendations). PRECISE scoring is feasible and helps identify patients in need of further treatment.

5.
Biom J ; 65(1): e2100186, 2023 01.
Artigo em Inglês | MEDLINE | ID: mdl-35818698

RESUMO

This work presents a joint spatial modeling framework to improve estimation of the spatial distribution of the latent COVID-19 incidence in Belgium, based on test-confirmed COVID-19 cases and crowd-sourced symptoms data as reported in a large-scale online survey. Correction is envisioned for stochastic dependence between the survey's response rate and spatial COVID-19 incidence, commonly known as preferential sampling, but not found significant. Results show that an online survey can provide valuable auxiliary data to optimize spatial COVID-19 incidence estimation based on confirmed cases in situations with limited testing capacity. Furthermore, it is shown that an online survey on COVID-19 symptoms with a sufficiently large sample size per spatial entity is capable of pinpointing the same locations that appear as test-confirmed clusters, approximately 1 week earlier. We conclude that a large-scale online study provides an inexpensive and flexible method to collect timely information of an epidemic during its early phase, which can be used by policy makers in an early phase of an epidemic and in conjunction with other monitoring systems.


Assuntos
COVID-19 , Humanos , COVID-19/epidemiologia , SARS-CoV-2 , Autorrelato , Incidência
6.
Euro Surveill ; 27(7)2022 02.
Artigo em Inglês | MEDLINE | ID: mdl-35177167

RESUMO

BackgroundCOVID-19 mortality, excess mortality, deaths per million population (DPM), infection fatality ratio (IFR) and case fatality ratio (CFR) are reported and compared for many countries globally. These measures may appear objective, however, they should be interpreted with caution.AimWe examined reported COVID-19-related mortality in Belgium from 9 March 2020 to 28 June 2020, placing it against the background of excess mortality and compared the DPM and IFR between countries and within subgroups.MethodsThe relation between COVID-19-related mortality and excess mortality was evaluated by comparing COVID-19 mortality and the difference between observed and weekly average predictions of all-cause mortality. DPM were evaluated using demographic data of the Belgian population. The number of infections was estimated by a stochastic compartmental model. The IFR was estimated using a delay distribution between infection and death.ResultsIn the study period, 9,621 COVID-19-related deaths were reported, which is close to the excess mortality estimated using weekly averages (8,985 deaths). This translates to 837 DPM and an IFR of 1.5% in the general population. Both DPM and IFR increase with age and are substantially larger in the nursing home population.DiscussionDuring the first pandemic wave, Belgium had no discrepancy between COVID-19-related mortality and excess mortality. In light of this close agreement, it is useful to consider the DPM and IFR, which are both age, sex, and nursing home population-dependent. Comparison of COVID-19 mortality between countries should rather be based on excess mortality than on COVID-19-related mortality.


Assuntos
COVID-19 , Bélgica/epidemiologia , Humanos , Mortalidade , Casas de Saúde , Pandemias , SARS-CoV-2
7.
Front Physiol ; 12: 723510, 2021.
Artigo em Inglês | MEDLINE | ID: mdl-34512391

RESUMO

Precision medicine as a framework for disease diagnosis, treatment, and prevention at the molecular level has entered clinical practice. From the start, genetics has been an indispensable tool to understand and stratify the biology of chronic and complex diseases in precision medicine. However, with the advances in biomedical and omics technologies, quantitative proteomics is emerging as a powerful technology complementing genetics. Quantitative proteomics provide insight about the dynamic behaviour of proteins as they represent intermediate phenotypes. They provide direct biological insights into physiological patterns, while genetics accounting for baseline characteristics. Additionally, it opens a wide range of applications in clinical diagnostics, treatment stratification, and drug discovery. In this mini-review, we discuss the current status of quantitative proteomics in precision medicine including the available technologies and common methods to analyze quantitative proteomics data. Furthermore, we highlight the current challenges to put quantitative proteomics into clinical settings and provide a perspective to integrate proteomics data with genomics data for future applications in precision medicine.

8.
IEEE Comput Graph Appl ; 41(6): 15-24, 2021.
Artigo em Inglês | MEDLINE | ID: mdl-34582348

RESUMO

In this article, we report on our experiences of running visual design workshops within the context of a master's level data visualization course, in a remote setting. These workshops aim to teach students to explore visual design space for data by creating and discussing hand-drawn sketches. We describe the technical setup employed, the different parts of the workshop, how the actual sessions were run, and to what extent the remote version can substitute for in-person sessions. In general, the visual designs created by the students as well as the feedback provided by them indicate that the setup described here can be a feasible replacement for in-person visual design workshops.


Assuntos
Visualização de Dados , Estudantes , Humanos
9.
PeerJ Comput Sci ; 7: e430, 2021.
Artigo em Inglês | MEDLINE | ID: mdl-33954230

RESUMO

A large number of clinical concepts are categorized under standardized formats that ease the manipulation, understanding, analysis, and exchange of information. One of the most extended codifications is the International Classification of Diseases (ICD) used for characterizing diagnoses and clinical procedures. With formatted ICD concepts, a patient profile can be described through a set of standardized and sorted attributes according to the relevance or chronology of events. This structured data is fundamental to quantify the similarity between patients and detect relevant clinical characteristics. Data visualization tools allow the representation and comprehension of data patterns, usually of a high dimensional nature, where only a partial picture can be projected. In this paper, we provide a visual analytics approach for the identification of homogeneous patient cohorts by combining custom distance metrics with a flexible dimensionality reduction technique. First we define a new metric to measure the similarity between diagnosis profiles through the concordance and relevance of events. Second we describe a variation of the Simplified Topological Abstraction of Data (STAD) dimensionality reduction technique to enhance the projection of signals preserving the global structure of data. The MIMIC-III clinical database is used for implementing the analysis into an interactive dashboard, providing a highly expressive environment for the exploration and comparison of patients groups with at least one identical diagnostic ICD code. The combination of the distance metric and STAD not only allows the identification of patterns but also provides a new layer of information to establish additional relationships between patient cohorts. The method and tool presented here add a valuable new approach for exploring heterogeneous patient populations. In addition, the distance metric described can be applied in other domains that employ ordered lists of categorical data.

10.
Front Bioinform ; 1: 774631, 2021.
Artigo em Inglês | MEDLINE | ID: mdl-36303773

RESUMO

Research on the microbiome has boomed recently, which resulted in a wide range of tools, packages, and algorithms to analyze microbiome data. Here we investigate and map currently existing tools that can be used to perform visual analysis on the microbiome, and associate the including methods, visual representations and data features to the research objectives currently of interest in microbiome research. The analysis is based on a combination of a literature review and workshops including a group of domain experts. Both the reviewing process and workshops are based on domain characterization methods to facilitate communication and collaboration between researchers from different disciplines. We identify several research questions related to microbiomes, and describe how different analysis methods and visualizations help in tackling them.

11.
IEEE Trans Vis Comput Graph ; 27(10): 3994-4008, 2021 Oct.
Artigo em Inglês | MEDLINE | ID: mdl-32746253

RESUMO

The connections in a graph generate a structure that is independent of a coordinate system. This visual metaphor allows creating a more flexible representation of data than a two-dimensional scatterplot. In this article, we present STAD (Simplified Topological Abstraction of Data), a parameter-free dimensionality reduction method that projects high-dimensional data into a graph. STAD generates an abstract representation of high-dimensional data by giving each data point a location in a graph which preserves the approximate distances in the original high-dimensional space. The STAD graph is built upon the Minimum Spanning Tree (MST) to which new edges are added until the correlation between the distances from the graph and the original dataset is maximized. Additionally, STAD supports the inclusion of additional functions to focus the exploration and allow the analysis of data from new perspectives, emphasizing traits in data which otherwise would remain hidden. We demonstrate the effectiveness of our method by applying it to two real-world datasets: traffic density in Barcelona and temporal measurements of air quality in Castile and León in Spain.

12.
Spat Spatiotemporal Epidemiol ; 35: 100379, 2020 11.
Artigo em Inglês | MEDLINE | ID: mdl-33138946

RESUMO

Although COVID-19 has been spreading throughout Belgium since February, 2020, its spatial dynamics in Belgium remain poorly understood, partly due to the limited testing of suspected cases during the epidemic's early phase. We analyse data of COVID-19 symptoms, as self-reported in a weekly online survey, which is open to all Belgian citizens. We predict symptoms' incidence using binomial models for spatially discrete data, and we introduce these as a covariate in the spatial analysis of COVID-19 incidence, as reported by the Belgian government during the days following a survey round. The symptoms' incidence is moderately predictive of the variation in the relative risks based on the confirmed cases; exceedance probability maps of the symptoms' incidence and confirmed cases' relative risks overlap partly. We conclude that this framework can be used to detect COVID-19 clusters of substantial sizes, but it necessitates spatial information on finer scales to locate small clusters.


Assuntos
Infecções por Coronavirus/epidemiologia , Inquéritos Epidemiológicos/estatística & dados numéricos , Pneumonia Viral/epidemiologia , Análise Espacial , Adulto , Idoso , Bélgica/epidemiologia , Betacoronavirus , COVID-19 , Feminino , Inquéritos Epidemiológicos/métodos , Humanos , Incidência , Masculino , Pessoa de Meia-Idade , Pandemias , Medição de Risco , SARS-CoV-2
13.
Proc Natl Acad Sci U S A ; 116(24): 11878-11887, 2019 06 11.
Artigo em Inglês | MEDLINE | ID: mdl-31127050

RESUMO

Notwithstanding important advances in the context of single-variant pathogenicity identification, novel breakthroughs in discerning the origins of many rare diseases require methods able to identify more complex genetic models. We present here the Variant Combinations Pathogenicity Predictor (VarCoPP), a machine-learning approach that identifies pathogenic variant combinations in gene pairs (called digenic or bilocus variant combinations). We show that the results produced by this method are highly accurate and precise, an efficacy that is endorsed when validating the method on recently published independent disease-causing data. Confidence labels of 95% and 99% are identified, representing the probability of a bilocus combination being a true pathogenic result, providing geneticists with rational markers to evaluate the most relevant pathogenic combinations and limit the search space and time. Finally, the VarCoPP has been designed to act as an interpretable method that can provide explanations on why a bilocus combination is predicted as pathogenic and which biological information is important for that prediction. This work provides an important step toward the genetic understanding of rare diseases, paving the way to clinical knowledge and improved patient care.


Assuntos
Predisposição Genética para Doença/genética , Variação Genética/genética , Doenças Raras/genética , Marcadores Genéticos/genética , Humanos
14.
Bioinformatics ; 35(12): 2159-2161, 2019 06 01.
Artigo em Inglês | MEDLINE | ID: mdl-30445495

RESUMO

SUMMARY: Inferring a Gene Regulatory Network (GRN) from gene expression data is a computationally expensive task, exacerbated by increasing data sizes due to advances in high-throughput gene profiling technology, such as single-cell RNA-seq. To equip researchers with a toolset to infer GRNs from large expression datasets, we propose GRNBoost2 and the Arboreto framework. GRNBoost2 is an efficient algorithm for regulatory network inference using gradient boosting, based on the GENIE3 architecture. Arboreto is a computational framework that scales up GRN inference algorithms complying with this architecture. Arboreto includes both GRNBoost2 and an improved implementation of GENIE3, as a user-friendly open source Python package. AVAILABILITY AND IMPLEMENTATION: Arboreto is available under the 3-Clause BSD license at http://arboreto.readthedocs.io. SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online.


Assuntos
Algoritmos , Redes Reguladoras de Genes , Biologia Computacional , Expressão Gênica , Software
15.
PeerJ Comput Sci ; 4: e145, 2018.
Artigo em Inglês | MEDLINE | ID: mdl-33816801

RESUMO

Finding useful patterns in datasets has attracted considerable interest in the field of visual analytics. One of the most common tasks is the identification and representation of clusters. However, this is non-trivial in heterogeneous datasets since the data needs to be analyzed from different perspectives. Indeed, highly variable patterns may mask underlying trends in the dataset. Dendrograms are graphical representations resulting from agglomerative hierarchical clustering and provide a framework for viewing the clustering at different levels of detail. However, dendrograms become cluttered when the dataset gets large, and the single cut of the dendrogram to demarcate different clusters can be insufficient in heterogeneous datasets. In this work, we propose a visual analytics methodology called MCLEAN that offers a general approach for guiding the user through the exploration and detection of clusters. Powered by a graph-based transformation of the relational data, it supports a scalable environment for representation of heterogeneous datasets by changing the spatialization. We thereby combine multilevel representations of the clustered dataset with community finding algorithms. Our approach entails displaying the results of the heuristics to users, providing a setting from which to start the exploration and data analysis. To evaluate our proposed approach, we conduct a qualitative user study, where participants are asked to explore a heterogeneous dataset, comparing the results obtained by MCLEAN with the dendrogram. These qualitative results reveal that MCLEAN is an effective way of aiding users in the detection of clusters in heterogeneous datasets. The proposed methodology is implemented in an R package available at https://bitbucket.org/vda-lab/mclean.

16.
Nat Methods ; 14(11): 1083-1086, 2017 Nov.
Artigo em Inglês | MEDLINE | ID: mdl-28991892

RESUMO

We present SCENIC, a computational method for simultaneous gene regulatory network reconstruction and cell-state identification from single-cell RNA-seq data (http://scenic.aertslab.org). On a compendium of single-cell data from tumors and brain, we demonstrate that cis-regulatory analysis can be exploited to guide the identification of transcription factors and cell states. SCENIC provides critical biological insights into the mechanisms driving cellular heterogeneity.


Assuntos
Redes Reguladoras de Genes , Análise de Célula Única , Algoritmos , Animais , Encéfalo/metabolismo , Análise por Conglomerados , Perfilação da Expressão Gênica , Humanos , Camundongos
17.
Nucleic Acids Res ; 44(W1): W117-21, 2016 07 08.
Artigo em Inglês | MEDLINE | ID: mdl-27131783

RESUMO

Genomic studies and high-throughput experiments often produce large lists of candidate genes among which only a small fraction are truly relevant to the disease, phenotype or biological process of interest. Gene prioritization tackles this problem by ranking candidate genes by profiling candidates across multiple genomic data sources and integrating this heterogeneous information into a global ranking. We describe an extended version of our gene prioritization method, Endeavour, now available for six species and integrating 75 data sources. The performance (Area Under the Curve) of Endeavour on cross-validation benchmarks using 'gold standard' gene sets varies from 88% (for human phenotypes) to 95% (for worm gene function). In addition, we have also validated our approach using a time-stamped benchmark derived from the Human Phenotype Ontology, which provides a setting close to prospective validation. With this benchmark, using 3854 novel gene-phenotype associations, we observe a performance of 82%. Altogether, our results indicate that this extended version of Endeavour efficiently prioritizes candidate genes. The Endeavour web server is freely available at https://endeavour.esat.kuleuven.be/.


Assuntos
Algoritmos , Predisposição Genética para Doença , Genótipo , Software , Animais , Benchmarking , Estudos de Associação Genética , Humanos , Internet , Fenótipo
18.
BMC Proc ; 9(Suppl 6 Proceedings of the 5th Symposium on Biological Data): S1, 2015.
Artigo em Inglês | MEDLINE | ID: mdl-26361497
19.
AIDS ; 29(15): 2045-52, 2015 Sep 24.
Artigo em Inglês | MEDLINE | ID: mdl-26355575

RESUMO

OBJECTIVES: Surveillance drug resistance mutations (SDRMs) in drug-naive patients are typically used to survey HIV-1-transmitted drug resistance (TDR). We test here how SDRMs in patients failing treatment, the original source of TDR, contribute to assessing TDR, transmissibility and transmission source of SDRMs. DESIGN: This is a retrospective observational study analyzing a Portuguese cohort of HIV-1-infected patients. METHODS: The prevalence of SDRMs to protease inhibitors, nucleoside reverse transcriptase inhibitors (NRTIs) and nonnucleoside reverse transcriptase inhibitors (NNRTIs) in drug-naive and treatment-failing patients was measured for 3554 HIV-1 subtype B patients. Transmission ratio (prevalence in drug-naive/prevalence in treatment-failing patients), average viral load and robust linear regression with outlier detection (prevalence in drug-naive versus in treatment-failing patients) were analyzed and used to interpret transmissibility. RESULTS: Prevalence of SDRMs in drug-naive and treatment-failing patients were linearly correlated, but some SDRMs were classified as outliers - above (PRO: D30N, N88D/S, L90 M, RT: G190A/S/E) or below (RT: M184I/V) expectations. The normalized regression slope was 0.073 for protease inhibitors, 0.084 for NRTIs and 0.116 for NNRTIs. Differences between SDRMs transmission ratios were not associated with differences in viral loads. CONCLUSION: The significant linear correlation between prevalence of SDRMs in drug-naive and in treatment-failing patients indicates that the prevalence in treatment-failing patients can be useful to predict levels of TDR. The slope is a cohort-dependent estimate of rate of TDR per drug class and outlier detection reveals comparative persistence of SDRMs. Outlier SDRMs with higher transmissibility are more persistent and more likely to have been acquired from drug-naive patients. Those with lower transmissibility have faster reversion dynamics after transmission and are associated with acquisition from treatment-failing patients.


Assuntos
Transmissão de Doença Infecciosa , Farmacorresistência Viral , Infecções por HIV/transmissão , Infecções por HIV/virologia , HIV-1/efeitos dos fármacos , Adulto , Monitoramento Epidemiológico , Feminino , Técnicas de Genotipagem , Infecções por HIV/epidemiologia , HIV-1/genética , HIV-1/isolamento & purificação , Proteínas do Vírus da Imunodeficiência Humana/genética , Humanos , Incidência , Masculino , Pessoa de Meia-Idade , Mutação de Sentido Incorreto , Portugal/epidemiologia , Prevalência , Estudos Retrospectivos
SELEÇÃO DE REFERÊNCIAS
DETALHE DA PESQUISA
...