Tech report
TR-2015-08, North Carolina State University, 2015
APA
Click to copy
Sacerdote, L., Reutter, J. L., Healey, C. G., Deo, P., Chirkova, R. Y., & Alborzi, F. (2015). DataSlicer: Enabling data selection for visual data exploration. North Carolina State University.
Chicago/Turabian
Click to copy
Sacerdote, L., J. L. Reutter, C. G. Healey, P. Deo, R. Y. Chirkova, and F. Alborzi. DataSlicer: Enabling Data Selection for Visual Data Exploration. North Carolina State University, 2015.
MLA
Click to copy
Sacerdote, L., et al. DataSlicer: Enabling Data Selection for Visual Data Exploration. no. TR-2015-08, North Carolina State University, 2015.
BibTeX Click to copy
@techreport{l2015a,
title = {DataSlicer: Enabling data selection for visual data exploration},
year = {2015},
institution = {North Carolina State University},
issue = {TR-2015-08},
author = {Sacerdote, L. and Reutter, J. L. and Healey, C. G. and Deo, P. and Chirkova, R. Y. and Alborzi, F.},
howpublished = {ftp://ftp.ncsu.edu/pub/unity/lockers/ftp/csc_anon/tech/2015/TR-2015-8.pdf http://www.lib.ncsu.edu/resolver/1840.4/8674}
}
Determining how to select and transform the data for visualization is one of the hardest problems faced by dataunfamiliar or inexperienced users when performing a visual exploration to solve an analytical task. Our main hypothesis is that for many data sets and common analytical tasks, such as finding outliers or general trends in data, there are relatively few “data slices” that are key to providing effective visualizations for the task. By focusing human users on appropriate and suitably transformed parts of the underlying data sets, these data slices can help the users carry their task to correct completion. To verify this hypothesis, we develop a framework that permits us to capture exemplary data slices in an exploration task, and to explore and parse visual-exploration sequences into a format that makes them distinct and easy to compare. We develop a recommendation system, DataSlicer, that matches a “currently viewed” data slice with the most promising “next effective” data slices for the given exploration task. We report the results of controlled experiments with an implementation of the DataSlicer system, using four common analytical task types. The experiments demonstrate statistically significant improvements in accuracy and exploration speed versus users without access to our system.