Publications

Data Vocalization: Optimizing Voice Output of Relational Data

Paper Abstract

Research on data visualization aims at finding the best way to present data via visual interfaces. We introduce the complementary problem of "data vocalization". Our goal is to present relational data in the most efficient way via voice output. This problem setting is motivated by emerging tools and devices (e.g., Google Home, Amazon Echo, Apple's Siri, or voice-based SQL interfaces) that communicate data primarily via audio output to their users.


Vocalizing Large Time Series Efficiently

Paper Abstract

We vocalize query results for time series data. We describe a holistic approach that integrates query evaluation and vocalization. In particular, we generate only those parts of the query result that are relevant for voice output. We exploit the fact that voice output has to be concise and simple to be understandable for listeners. Hence, the problem of generating voice output reduces to choosing between several coarse-grained alternatives. To make that choice, it is sufficient to evaluate the time series at a few carefully chosen locations. We use techniques from the area of optimal experimental design to choose optimal sampling points. Our algorithm is iterative and generates in each iteration a set of promising voice description candidates. We consider multiple metrics when generating voice descriptions, including the accuracy of description as well as its complexity and length. Then, we choose a near-optimal batch of sampling points to refine our choice between promising candidates. We compare this algorithm experimentally against several baselines, demonstrating superior performance in terms of execution time and output quality. We also conducted a user study, showing that it enables users to execute simple exploratory data analysis via voice descriptions alone. We also compare against visual interfaces and sonification (i.e., non-speech sound) interfaces in terms of user performance.


Presentations

VLDB Conference

Munich, Germany, September 2017

Presented paper at annual research conference, 'Data Vocalization: Optimizing Voice Output of Relational Data.'

Awards

Outstanding Undergraduate Researcher Award (Honorable Mention), 2018

Received honorable mention for CRA's Outstanding Undergraduate Researcher Award, which recognizes undergraduate students in North American colleges and universities who show outstanding research potential in an area of computing research.

JP Morgan Award, Apr 2018

Received award at BOOM 2018, Cornell's annual showcase of student research and creativity in digital technology and applications. Presented project 'Voice-Based Analysis of Time Series Data'.

Lockheed Martin Award, Sept 2017

Received award at BOOM 2017, Cornell's annual showcase of student research and creativity in digital technology and applications. Presented project 'Optimizing Voice Output of Relational Data'.