Your browser doesn't support javascript.
loading
Mostrar: 20 | 50 | 100
Resultados 1 - 20 de 25
Filtrar
Más filtros













Base de datos
Intervalo de año de publicación
1.
Artículo en Inglés | MEDLINE | ID: mdl-38861444

RESUMEN

The integration of visualizations and text is commonly found in data news, analytical reports, and interactive documents. For example, financial articles are presented along with interactive charts to show the changes in stock prices on Yahoo Finance. Visualizations enhance the perception of facts in the text while the text reveals insights of visual representation. However, effectively combining text and visualizations is challenging and tedious, which usually involves advanced programming skills. This paper proposes a semi-automatic pipeline that builds links between text and visualization. To resolve the relationship between text and visualizations, we present a method which structures a visualization and the underlying data as a contextual knowledge graph, based on which key phrases in the text are extracted, grouped, and mapped with visual elements. To support flexible customization of text-visualization links, our pipeline incorporates user knowledge to revise the links in a mixed-initiative manner. To demonstrate the usefulness and the versatility of our method, we replicate prior studies or cases in crafting interactive word-sized visualizations, annotating visualizations, and creating text-chart interactions based on a prototype system. We carry out two preliminary model tests and a user study and the results and user feedbacks suggest our method is effective.

2.
Artículo en Inglés | MEDLINE | ID: mdl-38713569

RESUMEN

Querying time series based on their relations is a crucial part of multiple time series analysis. By retrieving and understanding time series relations, analysts can easily detect anomalies and validate hypotheses in complex time series datasets. However, current relation extraction approaches, including knowledge- and data-driven ones, tend to be laborious and do not support heterogeneous relations. By conducting a formative study with 11 experts, we concluded six time series relations, including correlation, causality, similarity, lag, arithmetic, and meta, and summarized three pain points in querying time series involving these relations. We proposed RelaQ, an interactive system that supports the time series query via relation specifications. RelaQ allows users to intuitively specify heterogeneous relations when querying multiple time series, understand the query results based on a scalable, multi-level visualization, and explore possible relations beyond the existing queries. RelaQ is evaluated with two cases and a user study with 12 participants, showing promising effectiveness and usability.

3.
IEEE Trans Vis Comput Graph ; 30(6): 3008-3021, 2024 Jun.
Artículo en Inglés | MEDLINE | ID: mdl-38625779

RESUMEN

High-quality data is critical to deriving useful and reliable information. However, real-world data often contains quality issues undermining the value of the derived information. Most existing research on data quality management focuses on tabular data, leaving semi-structured data under-exploited. Due to the schema-less and hierarchical features of semi-structured data, discovering and fixing quality issues is challenging and time-consuming. To address the challenge, this paper presents JsonCurer, an interactive visualization system to assist with data quality management in the context of JSON data. To have an overview of quality issues, we first construct a taxonomy based on interviews with data practitioners and a review of 119 real-world JSON files. Then we highlight a schema visualization that presents structural information, statistical features, and quality issues of JSON data. Based on a similarity-based aggregation technique, the visualization depicts the entire JSON data with a concise tree, where summary visualizations are given above each node, and quality issues are illustrated using Bubble Sets across nodes. We evaluate the effectiveness and usability of JsonCurer with two case studies. One is in the domain of data analysis while the other concerns quality assurance in MongoDB documents.

4.
IEEE Trans Vis Comput Graph ; 30(1): 1194-1204, 2024 Jan.
Artículo en Inglés | MEDLINE | ID: mdl-37883274

RESUMEN

In geo-related fields such as urban informatics, atmospheric science, and geography, large-scale spatial time (ST) series (i.e., geo-referred time series) are collected for monitoring and understanding important spatiotemporal phenomena. ST series visualization is an effective means of understanding the data and reviewing spatiotemporal phenomena, which is a prerequisite for in-depth data analysis. However, visualizing these series is challenging due to their large scales, inherent dynamics, and spatiotemporal nature. In this study, we introduce the notion of patterns of evolution in ST series. Each evolution pattern is characterized by 1) a set of ST series that are close in space and 2) a time period when the trends of these ST series are correlated. We then leverage Storyline techniques by considering an analogy between evolution patterns and sessions, and finally design a novel visualization called GeoChron, which is capable of visualizing large-scale ST series in an evolution pattern-aware and narrative-preserving manner. GeoChron includes a mining framework to extract evolution patterns and two-level visualizations to enhance its visual scalability. We evaluate GeoChron with two case studies, an informal user study, an ablation study, parameter analysis, and running time analysis.

5.
Artículo en Inglés | MEDLINE | ID: mdl-37910408

RESUMEN

Tables are a ubiquitous data format for insight communication. However, transforming data into consumable tabular views remains a challenging and time-consuming task. To lower the barrier of such a task, research efforts have been devoted to developing interactive approaches for data transformation, but many approaches still presume that their users have considerable knowledge of various data transformation concepts and functions. In this study, we leverage natural language (NL) as the primary interaction modality to improve the accessibility of average users to performing complex data transformation and facilitate intuitive table generation and editing. Designing an NL-driven data transformation approach introduces two challenges: a) NL-driven synthesis of interpretable pipelines and b) incremental refinement of synthesized tables. To address these challenges, we present NL2Rigel, an interactive tool that assists users in synthesizing and improving tables from semi-structured text with NL instructions. Based on a large language model and prompting techniques, NL2Rigel can interpret the given NL instructions into a table synthesis pipeline corresponding to Rigel specifications, a declarative language for tabular data transformation. An intuitive interface is designed to visualize the synthesis pipeline and the generated tables, helping users understand the transformation process and refine the results efficiently with targeted NL instructions. The comprehensiveness of NL2Rigel is demonstrated with an example gallery, and we further confirmed NL2Rigel's usability with a comparative user study by showing that the task completion time with NL2Rigel is significantly shorter than that with the original version of Rigel with comparable completion rates.

6.
IEEE Trans Vis Comput Graph ; 29(1): 128-138, 2023 Jan.
Artículo en Inglés | MEDLINE | ID: mdl-36191098

RESUMEN

We present Rigel, an interactive system for rapid transformation of tabular data. Rigel implements a new declarative mapping approach that formulates the data transformation procedure as direct mappings from data to the row, column, and cell channels of the target table. To construct such mappings, Rigel allows users to directly drag data attributes from input data to these three channels and indirectly drag or type data values in a spreadsheet, and possible mappings that do not contradict these interactions are recommended to achieve efficient and straightforward data transformation. The recommended mappings are generated by enumerating and composing data variables based on the row, column, and cell channels, thereby revealing the possibility of alternative tabular forms and facilitating open-ended exploration in many data transformation scenarios, such as designing tables for presentation. In contrast to existing systems that transform data by composing operations (like transposing and pivoting), Rigel requires less prior knowledge on these operations, and constructing tables from the channels is more efficient and results in less ambiguity than generating operation sequences as done by the traditional by-example approaches. User study results demonstrated that Rigel is significantly less demanding in terms of time and interactions and suits more scenarios compared to the state-of-the-art by-example approach. A gallery of diverse transformation cases is also presented to show the potential of Rigel's expressiveness.

7.
IEEE Trans Vis Comput Graph ; 29(1): 1091-1101, 2023 Jan.
Artículo en Inglés | MEDLINE | ID: mdl-36191102

RESUMEN

Improving the efficiency of coal-fired power plants has numerous benefits. The control strategy is one of the major factors affecting such efficiency. However, due to the complex and dynamic environment inside the power plants, it is hard to extract and evaluate control strategies and their cascading impact across massive sensors. Existing manual and data-driven approaches cannot well support the analysis of control strategies because these approaches are time-consuming and do not scale with the complexity of the power plant systems. Three challenges were identified: a) interactive extraction of control strategies from large-scale dynamic sensor data, b) intuitive visual representation of cascading impact among the sensors in a complex power plant system, and c) time-lag-aware analysis of the impact of control strategies on electricity generation efficiency. By collaborating with energy domain experts, we addressed these challenges with ECoalVis, a novel interactive system for experts to visually analyze the control strategies of coal-fired power plants extracted from historical sensor data. The effectiveness of the proposed system is evaluated with two usage scenarios on a real-world historical dataset and received positive feedback from experts.

8.
Comput Vis Media (Beijing) ; 9(1): 3-39, 2023.
Artículo en Inglés | MEDLINE | ID: mdl-36277276

RESUMEN

Developing effective visual analytics systems demands care in characterization of domain problems and integration of visualization techniques and computational models. Urban visual analytics has already achieved remarkable success in tackling urban problems and providing fundamental services for smart cities. To promote further academic research and assist the development of industrial urban analytics systems, we comprehensively review urban visual analytics studies from four perspectives. In particular, we identify 8 urban domains and 22 types of popular visualization, analyze 7 types of computational method, and categorize existing systems into 4 types based on their integration of visualization techniques and computational models. We conclude with potential research directions and opportunities.

9.
Artículo en Inglés | MEDLINE | ID: mdl-37015452

RESUMEN

Numerous patterns found in urban phenomena, such as air pollution and human mobility, can be characterized as many directed geospatial networks (geo-networks) that represent spreading processes in urban space. These geo-networks can be analyzed from multiple levels, ranging from the macro-level of summarizing all geo-networks, meso-level of comparing or summarizing parts of geo-networks, and micro-level of inspecting individual geo-networks. Most of the existing visualizations cannot support multilevel analysis well. These techniques work by: 1) showing geo-networks separately with multiple maps leads to heavy context switching costs between different maps; 2) summarizing all geo-networks into a single network can lead to the loss of individual information; 3) drawing all geo-networks onto one map might suffer from the visual scalability issue in distinguishing individual geo-networks. In this study, we propose GeoNetverse, a novel visualization technique for analyzing aggregate geo-networks from multiple levels. Inspired by metro maps, GeoNetverse balances the overview and details of the geo-networks by placing the edges shared between geo-networks in a stacked manner. To enhance the visual scalability, GeoNetverse incorporates a level-of-detail rendering, a progressive crossing minimization, and a coloring technique. A set of evaluations was conducted to evaluate GeoNetverse from multiple perspectives.

10.
IEEE Trans Vis Comput Graph ; 28(6): 2486-2499, 2022 Jun.
Artículo en Inglés | MEDLINE | ID: mdl-33822726

RESUMEN

Many spatiotemporal events can be viewed as contagions. These events implicitly propagate across space and time by following cascading patterns, expanding their influence, and generating event cascades that involve multiple locations. Analyzing such cascading processes presents valuable implications in various urban applications, such as traffic planning and pollution diagnostics. Motivated by the limited capability of the existing approaches in mining and interpreting cascading patterns, we propose a visual analytics system called VisCas. VisCas combines an inference model with interactive visualizations and empowers analysts to infer and interpret the latent cascading patterns in the spatiotemporal context. To develop VisCas, we address three major challenges 1) generalized pattern inference; 2) implicit influence visualization; and 3) multifaceted cascade analysis. For the first challenge, we adapt the state-of-the-art cascading network inference technique to general urban scenarios, where cascading patterns can be reliably inferred from large-scale spatiotemporal data. For the second and third challenges, we assemble a set of effective visualizations to support location navigation, influence inspection, and cascading exploration, and facilitate the in-depth cascade analysis. We design a novel influence view based on a three-fold optimization strategy for analyzing the implicit influences of the inferred patterns. We demonstrate the capability and effectiveness of VisCas with two case studies conducted on real-world traffic congestion and air pollution datasets with domain experts.

11.
IEEE Trans Vis Comput Graph ; 28(12): 4127-4140, 2022 Dec.
Artículo en Inglés | MEDLINE | ID: mdl-33909565

RESUMEN

In multiple coordinated views (MCVs), visualizations across views update their content in response to users' interactions in other views. Interactive systems provide direct manipulation to create coordination between views, but are restricted to limited types of predefined templates. By contrast, textual specification languages enable flexible coordination but expose technical burden. To bridge the gap, we contribute Nebula, a grammar based on natural language for coordinating visualizations in MCVs. The grammar design is informed by a novel framework based on a systematic review of 176 coordinations from existing theories and applications, which describes coordination by demonstration, i.e., how coordination is performed by users. With the framework, Nebula specification formalizes coordination as a composition of user- and coordination-triggered interactions in origin and destination views, respectively, along with potential data transformation between the interactions. We evaluate Nebula by demonstrating its expressiveness with a gallery of diverse examples and analyzing its usability on cognitive dimensions.

12.
IEEE Trans Vis Comput Graph ; 28(1): 1051-1061, 2022 01.
Artículo en Inglés | MEDLINE | ID: mdl-34596550

RESUMEN

The spatial time series generated by city sensors allow us to observe urban phenomena like environmental pollution and traffic congestion at an unprecedented scale. However, recovering causal relations from these observations to explain the sources of urban phenomena remains a challenging task because these causal relations tend to be time-varying and demand proper time series partitioning for effective analyses. The prior approaches extract one causal graph given long-time observations, which cannot be directly applied to capturing, interpreting, and validating dynamic urban causality. This paper presents Compass, a novel visual analytics approach for in-depth analyses of the dynamic causality in urban time series. To develop Compass, we identify and address three challenges: detecting urban causality, interpreting dynamic causal relations, and unveiling suspicious causal relations. First, multiple causal graphs over time among urban time series are obtained with a causal detection framework extended from the Granger causality test. Then, a dynamic causal graph visualization is designed to reveal the time-varying causal relations across these causal graphs and facilitate the exploration of the graphs along the time. Finally, a tailored multi-dimensional visualization is developed to support the identification of spurious causal relations, thereby improving the reliability of causal analyses. The effectiveness of Compass is evaluated with two case studies conducted on the real-world urban datasets, including the air pollution and traffic speed datasets, and positive feedback was received from domain experts.

13.
IEEE Trans Vis Comput Graph ; 28(1): 857-867, 2022 Jan.
Artículo en Inglés | MEDLINE | ID: mdl-34596553

RESUMEN

The efficiency of warehouses is vital to e-commerce. Fast order processing at the warehouses ensures timely deliveries and improves customer satisfaction. However, monitoring, analyzing, and manipulating order processing in the warehouses in real time are challenging for traditional methods due to the sheer volume of incoming orders, the fuzzy definition of delayed order patterns, and the complex decision-making of order handling priorities. In this paper, we adopt a data-driven approach and propose OrderMonitor, a visual analytics system that assists warehouse managers in analyzing and improving order processing efficiency in real time based on streaming warehouse event data. Specifically, the order processing pipeline is visualized with a novel pipeline design based on the sedimentation metaphor to facilitate real-time order monitoring and suggest potentially abnormal orders. We also design a novel visualization that depicts order timelines based on the Gantt charts and Marey's graphs. Such a visualization helps the managers gain insights into the performance of order processing and find major blockers for delayed orders. Furthermore, an evaluating view is provided to assist users in inspecting order details and assigning priorities to improve the processing performance. The effectiveness of OrderMonitor is evaluated with two case studies on a real-world warehouse dataset.

14.
Front Cell Dev Biol ; 9: 669145, 2021.
Artículo en Inglés | MEDLINE | ID: mdl-34422799

RESUMEN

Background: Hepatocellular carcinoma (HCC) is the sixth most common malignancy with a high mortality worldwide. N6-methyladenosine (m6A) may participate extensively in tumor progression. Methods: To reveal the landscape of tumor immune microenvironment (TIME), ESTIMATE analysis, ssGSEA algorithm, and the CIBERSORT method were used. Taking advantage of consensus clustering, two different HCC categories were screened. We analyzed the correlation of clustering results with TIME and immunotherapy. Then, we yielded a risk signature by systematical bioinformatics analyses. Immunophenoscore (IPS) was implemented to estimate the immunotherapeutic significance of risk signature. Results: The m6A-based clusters were significantly correlated with overall survival (OS), immune score, immunological signature, immune infiltrating, and ICB-associated genes. Risk signature possessed robust prognostic validity and significantly correlated with TIME context. IPS was employed as a surrogate of immunotherapeutic outcome, and patients with low-risk scores showed significantly higher immunophenoscores. Conclusion: Collectively, m6A-based clustering subtype and signature was a robust prognostic indicator and correlated with TIME and immunotherapy, providing novel insight into antitumor management and prognostic prediction in HCC.

16.
Cancer Cell Int ; 21(1): 190, 2021 Apr 01.
Artículo en Inglés | MEDLINE | ID: mdl-33794886

RESUMEN

BACKGROUND: Hepatocellular carcinoma (HCC) ranks the sixth prevalent tumors with high mortality globally. Alternative splicing (AS) drives protein diversity, the imbalance of which might act an important factor in tumorigenesis. This study aimed to construct of AS-based prognostic signature and elucidate the role in tumor immune microenvironment (TIME) and immunotherapy in HCC. METHODS: Univariate Cox regression analysis was performed to determine the prognosis-related AS events and gene set enrichment analysis (GSEA) was employed for functional annotation, followed by the development of prognostic signatures using univariate Cox, LASSO and multivariate Cox regression. K-M survival analysis, proportional hazards model, and ROC curves were conducted to validate prognostic value. ESTIMATE R package, ssGSEA algorithm and CIBERSORT method and TIMER database exploration were performed to uncover the context of TIME in HCC. Quantitative real-time polymerase chain reaction was implemented to detect ZDHHC16 mRNA expression. Cytoscape software 3.8.0 were employed to visualize AS-splicing factors (SFs) regulatory networks. RESULTS: A total of 3294 AS events associated with survival of HCC patients were screened. Based on splicing subtypes, eight AS prognostic signature with robust prognostic predictive accuracy were constructed. Furthermore, quantitative prognostic nomogram was developed and exhibited robust validity in prognostic prediction. Besides, the consolidated signature was significantly correlated with TIME diversity and ICB-related genes. ZDHHC16 presented promising prospect as prognostic factor in HCC. Finally, the splicing regulatory network uncovered the potential functions of splicing factors (SFs). CONCLUSION: Herein, exploration of AS patterns may provide novel and robust indicators (i.e., risk signature, prognostic nomogram, etc.,) for prognostic prediction of HCC. The AS-SF networks could open up new approach for investigation of potential regulatory mechanisms. And pivotal players of AS events in context of TIME and immunotherapy efficiency were revealed, contributing to clinical decision-making and personalized prognosis monitoring of HCC.

17.
IEEE Trans Vis Comput Graph ; 27(2): 817-827, 2021 Feb.
Artículo en Inglés | MEDLINE | ID: mdl-33048743

RESUMEN

Bus routes are typically updated every 3-5 years to meet constantly changing travel demands. However, identifying deficient bus routes and finding their optimal replacements remain challenging due to the difficulties in analyzing a complex bus network and the large solution space comprising alternative routes. Most of the automated approaches cannot produce satisfactory results in real-world settings without laborious inspection and evaluation of the candidates. The limitations observed in these approaches motivate us to collaborate with domain experts and propose a visual analytics solution for the performance analysis and incremental planning of bus routes based on an existing bus network. Developing such a solution involves three major challenges, namely, a) the in-depth analysis of complex bus route networks, b) the interactive generation of improved route candidates, and c) the effective evaluation of alternative bus routes. For challenge a, we employ an overview-to-detail approach by dividing the analysis of a complex bus network into three levels to facilitate the efficient identification of deficient routes. For challenge b, we improve a route generation model and interpret the performance of the generation with tailored visualizations. For challenge c, we incorporate a conflict resolution strategy in the progressive decision-making process to assist users in evaluating the alternative routes and finding the most optimal one. The proposed system is evaluated with two usage scenarios based on real-world data and received positive feedback from the experts. Index Terms-Bus route planning, spatial decision-making, urban data visual analytics.

18.
IEEE Trans Vis Comput Graph ; 26(1): 800-810, 2020 01.
Artículo en Inglés | MEDLINE | ID: mdl-31443012

RESUMEN

Air pollution has become a serious public health problem for many cities around the world. To find the causes of air pollution, the propagation processes of air pollutants must be studied at a large spatial scale. However, the complex and dynamic wind fields lead to highly uncertain pollutant transportation. The state-of-the-art data mining approaches cannot fully support the extensive analysis of such uncertain spatiotemporal propagation processes across multiple districts without the integration of domain knowledge. The limitation of these automated approaches motivates us to design and develop AirVis, a novel visual analytics system that assists domain experts in efficiently capturing and interpreting the uncertain propagation patterns of air pollution based on graph visualizations. Designing such a system poses three challenges: a) the extraction of propagation patterns; b) the scalability of pattern presentations; and c) the analysis of propagation processes. To address these challenges, we develop a novel pattern mining framework to model pollutant transportation and extract frequent propagation patterns efficiently from large-scale atmospheric data. Furthermore, we organize the extracted patterns hierarchically based on the minimum description length (MDL) principle and empower expert users to explore and analyze these patterns effectively on the basis of pattern topologies. We demonstrated the effectiveness of our approach through two case studies conducted with a real-world dataset and positive feedback from domain experts.


Asunto(s)
Contaminación del Aire/análisis , Gráficos por Computador , Minería de Datos/métodos , Monitoreo del Ambiente/métodos , Ciudades
19.
Lipids Health Dis ; 18(1): 14, 2019 Jan 15.
Artículo en Inglés | MEDLINE | ID: mdl-30646922

RESUMEN

BACKGROUND: Polymorphisms have been identified to predispose to NAFLD. Here, we accessed the seven polymorphisms of rs1260326, rs780094 in GCKR, rs2954021 near TRIB1, rs2228603 in NCAN, rs58542926 in TM6SF2, rs12137855 near LYPLAL1, and rs10883437 near CPN1 on NAFLD susceptibility in the Uygur population. MATERIAL AND METHODS: We collected 620 samples (317 NAFLD and 303 controls) for this case-control study. Meta-analysis was performed using Stata Software. RESULTS: Our data detected that the rs1260326 (T vs. C: OR = 1.27, 95% CI = 1.01-1.59) and rs780094 (T vs. C: OR = 1.30, 95% CI = 1.04-1.63) were significantly associated with the susceptibility to NAFLD in Uygur population. The rs1260326 and rs780094 T/T genotype are significantly associated with soda, egg, and soybean intakes in the consumption group with twice or more in a week. Furthermore, a significant haplotype effect of rs1260326/T- rs780094/T was found (OR = 1.29, 95% CI: 1.03-1.62) compared with CC haplotype. An additional meta-analysis using 4352 cases and 10,168 controls established that rs780094 (OR = 1.21, 95%CI: 1.14-1.28) is significantly associated with NAFLD. Finally, among the 4 case-control studies on rs1260326, including 712 NAFLD and 658 controls, significant associations were found in Asian, liver biopsy, adult and pediatric groups. CONCLUSION: Collectively, both our case-control study and meta-analysis confirm a significant association between rs780094 and NAFLD. Additionally, our results suggest Asian-specific, liver biopsy-specific, adult-specific and pediatric-specific associations between the rs1260326 and NAFLD. Moreover, the rs1260326 and rs780094 T/T genotype are significantly associated with food habits, such as soda, egg, and soybean.


Asunto(s)
Etnicidad/genética , Estudios de Asociación Genética , Predisposición Genética a la Enfermedad , Enfermedad del Hígado Graso no Alcohólico/genética , Polimorfismo de Nucleótido Simple/genética , Adulto , Estudios de Casos y Controles , Femenino , Haplotipos/genética , Humanos , Desequilibrio de Ligamiento/genética , Masculino , Sesgo de Publicación , Factores de Riesgo
20.
Artículo en Inglés | MEDLINE | ID: mdl-30188825

RESUMEN

Interactive ranking techniques have substantially promoted analysts' ability in making judicious and informed decisions effectively based on multiple criteria. However, the existing techniques cannot satisfactorily support the analysis tasks involved in ranking large-scale spatial alternatives, such as selecting optimal locations for chain stores, where the complex spatial contexts involved are essential to the decision-making process. Limitations observed in the prior attempts of integrating rankings with spatial contexts motivate us to develop a context-integrated visual ranking technique. Based on a set of generic design requirements we summarized by collaborating with domain experts, we propose SRVis, a novel spatial ranking visualization technique that supports efficient spatial multi-criteria decision-making processes by addressing three major challenges in the aforementioned context integration, namely, a) the presentation of spatial rankings and contexts, b) the scalability of rankings' visual representations, and c) the analysis of context-integrated spatial rankings. Specifically, we encode massive rankings and their cause with scalable matrix-based visualizations and stacked bar charts based on a novel two-phase optimization framework that minimizes the information loss, and the flexible spatial filtering and intuitive comparative analysis are adopted to enable the in-depth evaluation of the rankings and assist users in selecting the best spatial alternative. The effectiveness of the proposed technique has been evaluated and demonstrated with an empirical study of optimization methods, two case studies, and expert interviews.

SELECCIÓN DE REFERENCIAS
DETALLE DE LA BÚSQUEDA