Journal article
ArXiv: 1703.09218, 2017
APA
Click to copy
Alborzi, F., Chaudhuri, S., Chirkova, R. Y., Deo, P., Healey, C. G., Pingale, G., … Selvakani, V. (2017). DataSlicer: Task-based data selection for visual data exploration. ArXiv: 1703.09218.
Chicago/Turabian
Click to copy
Alborzi, F., S. Chaudhuri, R. Y. Chirkova, P. Deo, C. G. Healey, G. Pingale, J. L. Reutter, and V. Selvakani. “DataSlicer: Task-Based Data Selection for Visual Data Exploration.” ArXiv: 1703.09218 (2017).
MLA
Click to copy
Alborzi, F., et al. “DataSlicer: Task-Based Data Selection for Visual Data Exploration.” ArXiv: 1703.09218, 2017.
BibTeX Click to copy
@article{f2017a,
title = {DataSlicer: Task-based data selection for visual data exploration},
year = {2017},
journal = {ArXiv: 1703.09218},
author = {Alborzi, F. and Chaudhuri, S. and Chirkova, R. Y. and Deo, P. and Healey, C. G. and Pingale, G. and Reutter, J. L. and Selvakani, V.}
}
In visual exploration and analysis of data, determining how to select and transform the data for visualization is a challenge for data-unfamiliar or inexperienced users. Our main hypothesis is that for many data sets and common analysis tasks, there are relatively few "data slices" that result in effective visualizations. By focusing human users on appropriate and suitably transformed parts of the underlying data sets, these data slices can help the users carry their task to correct completion. To verify this hypothesis, we develop a framework that permits us to capture exemplary data slices for a user task, and to explore and parse visual-exploration sequences into a format that makes them distinct and easy to compare. We develop a recommendation system, DataSlicer, that matches a "currently viewed" data slice with the most promising "next effective" data slices for the given exploration task. We report the results of controlled experiments with an implementation of the DataSlicer system, using four common analytical task types. The experiments demonstrate statistically significant improvements in accuracy and exploration speed versus users without access to our system.