Next Article in Journal
Understanding the EMR-Related Experiences of Pregnant Japanese Women to Redesign Antenatal Care EMR Systems
Previous Article in Journal
Creating a Multimodal Translation Tool and Testing Machine Translation Integration Using Touch and Voice
Open AccessArticle

Selective Wander Join: Fast Progressive Visualizations for Data Joins

1
Department of Computer Science, Tufts University, Medford, MA 02155, USA
2
Department of Computer Science, University of Arizona, Tucson, AZ 85721, USA
3
Department of Computer Science, Columbia University, New York, NY 10027, USA
*
Author to whom correspondence should be addressed.
Informatics 2019, 6(1), 14; https://doi.org/10.3390/informatics6010014
Received: 29 January 2019 / Revised: 9 March 2019 / Accepted: 14 March 2019 / Published: 25 March 2019
(This article belongs to the Special Issue Progressive Visual Analytics)
Progressive visualization offers a great deal of promise for big data visualization; however, current progressive visualization systems do not allow for continuous interaction. What if users want to see more confident results on a subset of the visualization? This can happen when users are in exploratory analysis mode but want to ask some directed questions of the data as well. In a progressive visualization system, the online aggregation algorithm determines the database sampling rate and resulting convergence rate, not the user. In this paper, we extend a recent method in online aggregation, called Wander Join, that is optimized for queries that join tables, one of the most computationally expensive operations. This extension leverages importance sampling to enable user-driven sampling when data joins are in the query. We applied user interaction techniques that allow the user to view and adjust the convergence rate, providing more transparency and control over the online aggregation process. By leveraging importance sampling, our extension of Wander Join also allows for stratified sampling of groups when there is data distribution skew. We also improve the convergence rate of filtering queries, but with additional overhead costs not needed in the original Wander Join algorithm. View Full-Text
Keywords: progressive visualization; online aggregation; interaction; information visualization progressive visualization; online aggregation; interaction; information visualization
Show Figures

Figure 1

MDPI and ACS Style

Procopio, M.; Scheidegger, C.; Wu, E.; Chang, R. Selective Wander Join: Fast Progressive Visualizations for Data Joins. Informatics 2019, 6, 14.

Show more citation formats Show less citations formats
Note that from the first issue of 2016, MDPI journals use article numbers instead of page numbers. See further details here.

Article Access Map by Country/Region

1
Back to TopTop