Your browser doesn't support javascript.
loading
Mostrar: 20 | 50 | 100
Resultados 1 - 20 de 36
Filtrar
Más filtros

Banco de datos
País/Región como asunto
Tipo del documento
Intervalo de año de publicación
1.
Bioinformatics ; 35(6): 1070-1072, 2019 03 15.
Artículo en Inglés | MEDLINE | ID: mdl-30875428

RESUMEN

SUMMARY: Adjutant is an open-source, interactive and R-based application to support mining PubMed for literature reviews. Given a PubMed-compatible search query, Adjutant downloads the relevant articles and allows the user to perform an unsupervised clustering analysis to identify data-driven topic clusters. Following clustering, users can also sample documents using different strategies to obtain a more manageable dataset for further analysis. Adjutant makes explicit trade-offs between speed and accuracy, which are modifiable by the user, such that a complete analysis of several thousand documents can take a few minutes. All analytic datasets generated by Adjutant are saved, allowing users to easily conduct other downstream analyses that Adjutant does not explicitly support. AVAILABILITY AND IMPLEMENTATION: Adjutant is implemented in R, using Shiny, and is available at https://github.com/amcrisan/Adjutant. SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online.


Asunto(s)
Programas Informáticos , Análisis por Conglomerados , PubMed
2.
Bioinformatics ; 35(10): 1668-1676, 2019 05 15.
Artículo en Inglés | MEDLINE | ID: mdl-30256887

RESUMEN

MOTIVATION: Data visualization is an important tool for exploring and communicating findings from genomic and healthcare datasets. Yet, without a systematic way of organizing and describing the design space of data visualizations, researchers may not be aware of the breadth of possible visualization design choices or how to distinguish between good and bad options. RESULTS: We have developed a method that systematically surveys data visualizations using the analysis of both text and images. Our method supports the construction of a visualization design space that is explorable along two axes: why the visualization was created and how it was constructed. We applied our method to a corpus of scientific research articles from infectious disease genomic epidemiology and derived a Genomic Epidemiology Visualization Typology (GEViT) that describes how visualizations were created from a series of chart types, combinations and enhancements. We have also implemented an online gallery that allows others to explore our resulting design space of visualizations. Our results have important implications for visualization design and for researchers intending to develop or use data visualization tools. Finally, the method that we introduce is extensible to constructing visualizations design spaces across other research areas. AVAILABILITY AND IMPLEMENTATION: Our browsable gallery is available at http://gevit.net and all project code can be found at https://github.com/amcrisan/gevitAnalysisRelease. SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online.


Asunto(s)
Visualización de Datos , Programas Informáticos , Genoma , Genómica , Encuestas y Cuestionarios
3.
Children (Basel) ; 10(8)2023 Aug 07.
Artículo en Inglés | MEDLINE | ID: mdl-37628354

RESUMEN

Data tracking is a common feature of pain e-health applications, however, viewing visualizations of this data has not been investigated for its potential as an intervention itself. We conducted a pilot feasibility parallel randomized cross-over trial, 1:1 allocation ratio. Participants were youth age 12-18 years recruited from a tertiary-level pediatric chronic pain clinic in Western Canada. Participants completed two weeks of Ecological Momentary Assessment (EMA) data collection, one of which also included access to a data visualization platform to view their results. Order of weeks was randomized, participants were not masked to group assignment. Objectives were to establish feasibility related to recruitment, retention, and participant experience. Of 146 youth approached, 48 were eligible and consented to participation, two actively withdrew prior to the EMA. Most participants reported satisfaction with the process and provided feedback on additional variables of interest. Technical issues with the data collection platform impacted participant experience and data analysis, and only 48% viewed the visualizations. Four youth reported adverse events not related to visualizations. Data visualization offers a promising clinical tool, and patient experience feedback is critical to modifying the platform and addressing technical issues to prepare for deployment in a larger trial.

4.
IEEE Trans Vis Comput Graph ; 28(1): 747-757, 2022 01.
Artículo en Inglés | MEDLINE | ID: mdl-34596545

RESUMEN

Visualization collections, accessed by platforms such as Tableau Online or Power Bl, are used by millions of people to share and access diverse analytical knowledge in the form of interactive visualization bundles. Result snippets, compact previews of these bundles, are presented to users to help them identify relevant content when browsing collections. Our engagement with Tableau product teams and review of existing snippet designs on five platforms showed us that current practices fail to help people judge the relevance of bundles because they include only the title and one image. Users frequently need to undertake the time-consuming endeavour of opening a bundle within its visualization system to examine its many views and dashboards. In response, we contribute the first systematic approach to visualization snippet design. We propose a framework for snippet design that addresses eight key challenges that we identify. We present a computational pipeline to compress the visual and textual content of bundles into representative previews that is adaptive to a provided pixel budget and provides high information density with multiple images and carefully chosen keywords. We also reflect on the method of visual inspection through random sampling to gain confidence in model and parameter choices.

5.
IEEE Trans Vis Comput Graph ; 28(6): 2500-2516, 2022 06.
Artículo en Inglés | MEDLINE | ID: mdl-35120005

RESUMEN

Graph neural networks (GNNs) are a class of powerful machine learning tools that model node relations for making predictions of nodes or links. GNN developers rely on quantitative metrics of the predictions to evaluate a GNN, but similar to many other neural networks, it is difficult for them to understand if the GNN truly learns characteristics of a graph as expected. We propose an approach to corresponding an input graph to its node embedding (aka latent space), a common component of GNNs that is later used for prediction. We abstract the data and tasks, and develop an interactive multi-view interface called CorGIE to instantiate the abstraction. As the key function in CorGIE, we propose the K-hop graph layout to show topological neighbors in hops and their clustering structure. To evaluate the functionality and usability of CorGIE, we present how to use CorGIE in two usage scenarios, and conduct a case study with five GNN experts. Availability: Open-source code at https://github.com/zipengliu/corgie-ui/, supplemental materials & video at https://osf.io/tr3sb/.


Asunto(s)
Gráficos por Computador , Redes Neurales de la Computación , Análisis por Conglomerados , Aprendizaje Automático , Programas Informáticos
6.
IEEE Trans Vis Comput Graph ; 28(12): 4855-4872, 2022 12.
Artículo en Inglés | MEDLINE | ID: mdl-34449391

RESUMEN

Genomic Epidemiology (genEpi) is a branch of public health that uses many different data types including tabular, network, genomic, and geographic, to identify and contain outbreaks of deadly diseases. Due to the volume and variety of data, it is challenging for genEpi domain experts to conduct data reconnaissance; that is, have an overview of the data they have and make assessments toward its quality, completeness, and suitability. We present an algorithm for data reconnaissance through automatic visualization recommendation, GEViTRec. Our approach handles a broad variety of dataset types and automatically generates visually coherent combinations of charts, in contrast to existing systems that primarily focus on singleton visual encodings of tabular datasets. We automatically detect linkages across multiple input datasets by analyzing non-numeric attribute fields, creating a data source graph within which we analyze and rank paths. For each high-ranking path, we specify chart combinations with positional and color alignments between shared fields, using a gradual binding approach to transform initial partial specifications of singleton charts to complete specifications that are aligned and oriented consistently. A novel aspect of our approach is its combination of domain-agnostic elements with domain-specific information that is captured through a domain-specific visualization prevalence design space. Our implementation is applied to both synthetic data and real Ebola outbreak data. We compare GEViTRec's output to what previous visualization recommendation systems would generate, and to manually crafted visualizations used by practitioners. We conducted formative evaluations with ten genEpi experts to assess the relevance and interpretability of our results. Code, Data, and Study Materials Availability: https://github.com/amcrisan/GEVitRec.


Asunto(s)
Gráficos por Computador , Programas Informáticos , Prevalencia , Genómica , Genoma
7.
Pilot Feasibility Stud ; 8(1): 223, 2022 Oct 03.
Artículo en Inglés | MEDLINE | ID: mdl-36192779

RESUMEN

BACKGROUND: Chronic pain is a common and costly condition in youth, associated with negative implications that reach far beyond the pain experience itself (e.g., interference with recreational, social, and academic activities, mental health sequelae). As a self-appraised condition, pain experience is influenced by patient's biases and meaning-making in relation to their symptoms and triggers. We propose that interacting with self-reported data will impact the experience of pain by altering understanding and expectations of symptom experience and how pain interacts with other factors (e.g., sleep, emotions, social interactions). In this study, we aim to establish the feasibility and acceptability of using a data visualization platform to track and monitor symptoms and their relationship with other factors, versus simply daily reporting of symptoms using a smartphone-based Ecological Momentary Assessment (EMA). METHODS: This protocol is for a randomized, single-center, open-label crossover trial. We aim to recruit 50 typically developing youth aged 12-18 years with chronic pain to take part in two phases of data collection. The trial will utilize an A-B counterbalanced design in which participants will be randomly assigned to receive either Part A (EMA alone for 7 days) or Part B (EMA plus visualization platform for 7 days) first and then receive the opposite phase after a 7-day break (washout period). Key outcomes will be participant reports of acceptability and feasibility, EMA completion rates, barriers, and perceptions of the benefits or risks of participation. Secondary exploratory analyses will examine the relationship between EMA-reported symptoms over time and in relation to baseline measures, as well as pilot data on any improvements in symptoms related to engaging with the data visualization platform. DISCUSSION: This protocol describes the feasibility and pilot testing of a novel approach to promoting self-management and facilitating symptom appraisal using visualized data. We aim to determine whether there is a sufficient rationale, both from the perspective of feasibility and patient satisfaction/acceptability, to conduct a larger randomized controlled trial of this intervention. This intervention has the potential to support clinical care for youth with chronic pain and other conditions where self-appraisal and understanding of symptom patterns are a critical component of functional recovery. TRIAL REGISTRATION: Open Science Framework doi: https://doi.org/10.17605/OSF.IO/HQX7C . Registered on October 25, 2021, osf.io/hqx7c.

8.
IEEE Trans Vis Comput Graph ; 27(2): 957-966, 2021 02.
Artículo en Inglés | MEDLINE | ID: mdl-33074823

RESUMEN

For the many journalists who use data and computation to report the news, data wrangling is an integral part of their work. Despite an abundance of literature on data wrangling in the context of enterprise data analysis, little is known about the specific operations, processes, and pain points journalists encounter while performing this tedious, time-consuming task. To better understand the needs of this user group, we conduct a technical observation study of 50 public repositories of data and analysis code authored by 33 professional journalists at 26 news organizations. We develop two detailed and cross-cutting taxonomies of data wrangling in computational journalism, for actions and for processes. We observe the extensive use of multiple tables, a notable gap in previous wrangling analyses. We develop a concise, actionable framework for general multi-table data wrangling that includes wrangling operations documented in our taxonomy that are without clear parallels in other work. This framework, the first to incorporate tables as first-class objects, will support future interactive wrangling tools for both computational journalism and general-purpose use. We assess the generative and descriptive power of our framework through discussion of its relationship to our set of taxonomies.

9.
IEEE Trans Vis Comput Graph ; 27(2): 495-505, 2021 Feb.
Artículo en Inglés | MEDLINE | ID: mdl-33048709

RESUMEN

Cloud-based visualization services have made visual analytics accessible to a much wider audience than ever before. Systems such as Tableau have started to amass increasingly large repositories of analytical knowledge in the form of interactive visualization workbooks. When shared, these collections can form a visual analytic knowledge base. However, as the size of a collection increases, so does the difficulty in finding relevant information. Content-based recommendation (CBR) systems could help analysts in finding and managing workbooks relevant to their interests. Toward this goal, we focus on text-based content that is representative of the subject matter of visualizations rather than the visual encodings and style. We discuss the challenges associated with creating a CBR based on visualization specifications and explore more concretely how to implement the relevance measures required using Tableau workbook specifications as the source of content data. We also demonstrate what information can be extracted from these visualization specifications and how various natural language processing techniques can be used to compute similarity between workbooks as one way to measure relevance. We report on a crowd-sourced user study to determine if our similarity measure mimics human judgement. Finally, we choose latent Dirichl et al.ocation (LDA) as a specific model and instantiate it in a proof-of-concept recommender tool to demonstrate the basic function of our similarity measure.

10.
IEEE Trans Vis Comput Graph ; 16(6): 908-17, 2010.
Artículo en Inglés | MEDLINE | ID: mdl-20975127

RESUMEN

Cells in an organism share the same genetic information in their DNA, but have very different forms and behavior because of the selective expression of subsets of their genes. The widely used approach of measuring gene expression over time from a tissue sample using techniques such as microarrays or sequencing do not provide information about the spatial position within the tissue where these genes are expressed. In contrast, we are working with biologists who use techniques that measure gene expression in every individual cell of entire fruitfly embryos over an hour of their development, and do so for multiple closely-related subspecies of Drosophila. These scientists are faced with the challenge of integrating temporal gene expression data with the spatial location of cells and, moreover, comparing this data across multiple related species. We have worked with these biologists over the past two years to develop MulteeSum, a visualization system that supports inspection and curation of data sets showing gene expression over time, in conjunction with the spatial location of the cells where the genes are expressed--it is the first tool to support comparisons across multiple such data sets. MulteeSum is part of a general and flexible framework we developed with our collaborators that is built around multiple summaries for each cell, allowing the biologists to explore the results of computations that mix spatial information, gene expression measurements over time, and data from multiple related species or organisms. We justify our design decisions based on specific descriptions of the analysis needs of our collaborators, and provide anecdotal evidence of the efficacy of MulteeSum through a series of case studies.


Asunto(s)
Gráficos por Computador , Perfilación de la Expresión Génica/estadística & datos numéricos , Animales , Drosophila/embriología , Drosophila/genética , Regulación del Desarrollo de la Expresión Génica , Genes de Insecto
11.
IEEE Trans Vis Comput Graph ; 26(9): 2732-2747, 2020 09.
Artículo en Inglés | MEDLINE | ID: mdl-30736000

RESUMEN

We address the visual comparison of multiple phylogenetic trees that arises in evolutionary biology, specifically between one reference tree and a collection of dozens to hundreds of other trees. We abstract the domain questions of phylogenetic tree comparison as tasks to look for supporting or conflicting evidence for hypotheses that requires inspection of both topological structure and attribute values at different levels of detail in the tree collection. We introduce the new visual encoding idiom of aggregated dendrograms to concisely summarize the topological relationships between interactively chosen focal subtrees according to biologically meaningful criteria, and provide a layout algorithm that automatically adapts to the available screen space. We design and implement the ADView system, which represents trees at multiple levels of detail across multiple views: the entire collection, a subset of trees, an individual tree, specific subtrees of interest, and the individual branch level. We benchmark the algorithms developed for ADView, compare its information density to previous work, and demonstrate its utility for quickly gathering evidence about biological hypotheses through usage scenarios with data from recently published phylogenetic analysis and case studies of expert use with real-world data, drawn from a summative interview study.


Asunto(s)
Análisis por Conglomerados , Gráficos por Computador , Filogenia , Humanos , Programas Informáticos
12.
IEEE Trans Vis Comput Graph ; 26(6): 2180-2191, 2020 Jun.
Artículo en Inglés | MEDLINE | ID: mdl-32012018

RESUMEN

Graph drawing readability metrics are routinely used to assess and create node-link layouts of network data. Existing readability metrics fall short in three ways. The many count-based metrics such as edge-edge or node-edge crossings simply provide integer counts, missing the opportunity to quantify the amount of overlap between items, which may vary in size, at a more fine-grained level. Current metrics focus solely on single-level topological structure, ignoring the possibility of multi-level structure such as large and thus highly salient metanodes. Most current metrics focus on the measurement of clutter in the form of crossings and overlaps, and do not take into account the trade-off between the clutter and the information sparsity of the drawing, which we refer to as sprawl. We propose an area-aware approach to clutter metrics that tracks the extent of geometric overlaps between node-node, node-edge, and edge-edge pairs in detail. It handles variable-size nodes and explicitly treats metanodes and leaf nodes uniformly. We call the combination of a sprawl metric and an area-aware clutter metric a sprawlter metric. We present an instantiation of the sprawlter metrics featuring a formal and thorough discussion of the crucial component, the penalty mapping function. We implement and validate our proposed metrics with extensive computational analysis of graph layouts, considering four layout algorithms and 56 layouts encompassing both real-world data and synthetic examples illustrating specific configurations of interest.

13.
Mol Syst Biol ; 4: 218, 2008.
Artículo en Inglés | MEDLINE | ID: mdl-18766178

RESUMEN

Although considerable progress has been made in dissecting the signaling pathways involved in the innate immune response, it is now apparent that this response can no longer be productively thought of in terms of simple linear pathways. InnateDB (www.innatedb.ca) has been developed to facilitate systems-level analyses that will provide better insight into the complex networks of pathways and interactions that govern the innate immune response. InnateDB is a publicly available, manually curated, integrative biology database of the human and mouse molecules, experimentally verified interactions and pathways involved in innate immunity, along with centralized annotation on the broader human and mouse interactomes. To date, more than 3500 innate immunity-relevant interactions have been contextually annotated through the review of 1000 plus publications. Integrated into InnateDB are novel bioinformatics resources, including network visualization software, pathway analysis, orthologous interaction network construction and the ability to overlay user-supplied gene expression data in an intuitively displayed molecular interaction network and pathway context, which will enable biologists without a computational background to explore their data in a more systems-oriented manner.


Asunto(s)
Bases de Datos Factuales , Inmunidad Innata , Transducción de Señal/inmunología , Programas Informáticos , Animales , Biología Computacional/métodos , Humanos , Internet , Biología de Sistemas
14.
IEEE Trans Vis Comput Graph ; 15(6): 921-8, 2009.
Artículo en Inglés | MEDLINE | ID: mdl-19834155

RESUMEN

We present a nested model for the visualization design process with four layers: characterize the problem domain, abstract into operations on data types, design visual encoding and interaction techniques, and create algorithms to execute techniques efficiently. The output from a level above is input to the level below, bringing attention to the design challenge that an upstream error inevitably cascades to all downstream levels. This model provides prescriptive guidance for determining appropriate evaluation approaches by identifying threats to validity unique to each level. We call attention to specific steps in the design and evaluation process that are often given short shrift. We also provide three recommendations motivated by this model:authors should distinguish between these levels when claiming contributions at more than one of them, authors should explicitly state upstream assumptions at levels above the focus of a paper, and visualization venues should accept more papers on domain characterization.

15.
IEEE Trans Vis Comput Graph ; 15(2): 249-61, 2009.
Artículo en Inglés | MEDLINE | ID: mdl-19147889

RESUMEN

We present Glimmer, a new multilevel algorithm for multidimensional scaling designed to exploit modern graphics processing unit (GPU) hardware. We also present GPU-SF, a parallel, force-based subsystem used by Glimmer. Glimmer organizes input into a hierarchy of levels and recursively applies GPU-SF to combine and refine the levels. The multilevel nature of the algorithm makes local minima less likely while the GPU parallelism improves speed of computation. We propose a robust termination condition for GPU-SF based on a filtered approximation of the normalized stress function. We demonstrate the benefits of Glimmer in terms of speed, normalized stress, and visual quality against several previous algorithms for a range of synthetic and real benchmark datasets. We also show that the performance of Glimmer on GPUs is substantially faster than a CPU implementation of the same algorithm.

16.
IEEE Trans Vis Comput Graph ; 15(6): 897-904, 2009.
Artículo en Inglés | MEDLINE | ID: mdl-19834152

RESUMEN

In the field of comparative genomics, scientists seek to answer questions about evolution and genomic function by comparing the genomes of species to find regions of shared sequences. Conserved syntenic blocks are an important biological data abstraction for indicating regions of shared sequences. The goal of this work is to show multiple types of relationships at multiple scales in a way that is visually comprehensible in accordance with known perceptual principles. We present a task analysis for this domain where the fundamental questions asked by biologists can be understood by a characterization of relationships into the four types of proximity/location, size, orientation, and similarity/strength, and the four scales of genome, chromosome, block, and genomic feature. We also propose a new taxonomy of the design space for visually encoding conservation data. We present MizBee, a multiscale synteny browser with the unique property of providing interactive side-by-side views of the data across the range of scales supporting exploration of all of these relationship types. We conclude with case studies from two biologists who used MizBee to augment their previous automatic analysis work flow, providing anecdotal evidence about the efficacy oft he system for the visualization of syntenic data, the analysis of conservation relationships, and the communication of scientific insights.


Asunto(s)
Algoritmos , Biología Computacional/métodos , Gráficos por Computador , Genes , Genoma , Animales , Peces , Rhizopus
17.
Bioinformatics ; 23(8): 1040-2, 2007 Apr 15.
Artículo en Inglés | MEDLINE | ID: mdl-17309895

RESUMEN

UNLABELLED: Cerebral (Cell Region-Based Rendering And Layout) is an open-source Java plugin for the Cytoscape biomolecular interaction viewer. Given an interaction network and subcellular localization annotation, Cerebral automatically generates a view of the network in the style of traditional pathway diagrams, providing an intuitive interface for the exploration of a biological pathway or system. The molecules are separated into layers according to their subcellular localization. Potential products or outcomes of the pathway can be shown at the bottom of the view, clustered according to any molecular attribute data-protein function-for example. Cerebral scales well to networks containing thousands of nodes. AVAILABILITY: http://www.pathogenomics.ca/cerebral


Asunto(s)
Fenómenos Fisiológicos Celulares , Modelos Biológicos , Proteoma/metabolismo , Transducción de Señal/fisiología , Programas Informáticos , Fracciones Subcelulares/metabolismo , Interfaz Usuario-Computador , Algoritmos , Gráficos por Computador , Simulación por Computador , Lenguajes de Programación
18.
IEEE Trans Vis Comput Graph ; 14(4): 900-13, 2008.
Artículo en Inglés | MEDLINE | ID: mdl-18467763

RESUMEN

Several previous systems allow users to interactively explore a large input graph through cuts of a superimposed hierarchy. This hierarchy is often created using clustering algorithms or topological features present in the graph. However, many graphs have domain-specific attributes associated with the nodes and edges, which could be used to create many possible hierarchies providing unique views of the input graph. GrouseFlocks is a system for the exploration of this graph hierarchy space. By allowing users to see several different possible hierarchies on the same graph, the system helps users investigate graph hierarchy space instead of a single fixed hierarchy. GrouseFlocks provides a simple set of operations so that users can create and modify their graph hierarchies based on selections. These selections can be made manually or based on patterns in the attribute data provided with the graph. It provides feedback to the user within seconds, allowing interactive exploration of this space.


Asunto(s)
Algoritmos , Gráficos por Computador , Análisis Numérico Asistido por Computador , Interfaz Usuario-Computador , Movimiento (Física)
19.
IEEE Trans Vis Comput Graph ; 14(6): 1253-60, 2008.
Artículo en Inglés | MEDLINE | ID: mdl-18988971

RESUMEN

Systems biologists use interaction graphs to model the behavior of biological systems at the molecular level. In an iterative process, such biologists observe the reactions of living cells under various experimental conditions, view the results in the context of the interaction graph, and then propose changes to the graph model. These graphs ser ve as a form of dynamic knowledge representation of the biological system being studied and evolve as new insight is gained from the experimental data. While numerous graph layout and drawing packages are available, these tools did not fully meet the needs of our immunologist collaborators. In this paper, we describe the data information display needs of these immunologists and translate them into design decisions. These decisions led us to create Cerebral, a system that uses a biologically guided graph layout and incorporates experimental data directly into the graph display. Small multiple views of different experimental conditions and a data-driven parallel coordinates view enable correlations between experimental conditions to be analyzed at the same time that the data is viewed in the graph context. This combination of coordinated views allows the biologist to view the data from many different perspectives simultaneously. To illustrate the typical analysis tasks performed, we analyze two datasets using Cerebral. Based on feedback from our collaborators we conclude that Cerebral is a valuable tool for analyzing experimental data in the context of an interaction graph model.


Asunto(s)
Gráficos por Computador , Modelos Biológicos , Proteoma/metabolismo , Transducción de Señal/fisiología , Programas Informáticos , Interfaz Usuario-Computador , Biología/métodos , Simulación por Computador
20.
IEEE Trans Vis Comput Graph ; 24(1): 435-445, 2018 01.
Artículo en Inglés | MEDLINE | ID: mdl-28880179

RESUMEN

Visualization researchers and practitioners engaged in generating or evaluating designs are faced with the difficult problem of transforming the questions asked and actions taken by target users from domain-specific language and context into more abstract forms. Existing abstract task classifications aim to provide support for this endeavour by providing a carefully delineated suite of actions. Our experience is that this bottom-up approach is part of the challenge: low-level actions are difficult to interpret without a higher-level context of analysis goals and the analysis process. To bridge this gap, we propose a framework based on analysis reports derived from open-coding 20 design study papers published at IEEE InfoVis 2009-2015, to build on the previous work of abstractions that collectively encompass a broad variety of domains. The framework is organized in two axes illustrated by nine analysis goals. It helps situate the analysis goals by placing each goal under axes of specificity (Explore, Describe, Explain, Confirm) and number of data populations (Single, Multiple). The single-population types are Discover Observation, Describe Observation, Identify Main Cause, and Collect Evidence. The multiple-population types are Compare Entities, Explain Differences, and Evaluate Hypothesis. Each analysis goal is scoped by an input and an output and is characterized by analysis steps reported in the design study papers. We provide examples of how we and others have used the framework in a top-down approach to abstracting domain problems: visualization designers or researchers first identify the analysis goals of each unit of analysis in an analysis stream, and then encode the individual steps using existing task classifications with the context of the goal, the level of specificity, and the number of populations involved in the analysis.

SELECCIÓN DE REFERENCIAS
DETALLE DE LA BÚSQUEDA