As image datasets become ubiquitous, the problem of ad-hoc searches over image data is increasingly important. Many high-level data tasks in machine learning, such as constructing datasets for training and testing object detectors, include finding ad-hoc objects or scenes within large image datasets as a key sub-problem. New foundational visual-semantic embeddings trained on massive web datasets, such as Contrastive Language-Image Pre-Training (CLIP), can help users start searches on their own data, but we find there is a long tail of queries where these models fall short in practice. Seesaw is a system for interactive ad-hoc searches on image datasets that integrates state-of-the-art embeddings like CLIP with user feedback in the form of box annotations to help users quickly locate images of interest in their data, even in the long tail of harder queries. One key challenge for Seesaw is that, in practice, many sensible approaches to incorporating feedback into future results, including state-of-the-art active-learning algorithms, can worsen results compared to introducing no feedback, partly due to CLIP’s high average performance. Therefore, Seesaw includes several algorithms that empirically yield larger and more consistent improvements. We compare Seesaw’s accuracy both to using CLIP alone and to a state-of-the-art active-learning baseline, and find Seesaw consistently helps improve results for users across four datasets and more than a thousand queries. Seesaw increases Average Precision (AP) on search tasks by an average of .08 on a wide benchmark (from a base of .72), and by .27 on a subset of more difficult queries where CLIP alone performs poorly.
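A minimal sketch of the embedding-plus-feedback idea described above, using random stand-ins for CLIP vectors and a Rocchio-style query update; the names and the update rule are illustrative assumptions, not Seesaw’s actual feedback algorithm:

```python
# A toy relevance-feedback loop over precomputed embeddings. The Rocchio-style
# update and the random stand-ins for CLIP vectors are illustrative assumptions,
# not Seesaw's actual feedback algorithm.
import numpy as np

def rank(query_vec, image_vecs):
    """Order images by cosine similarity to the query vector."""
    q = query_vec / np.linalg.norm(query_vec)
    X = image_vecs / np.linalg.norm(image_vecs, axis=1, keepdims=True)
    scores = X @ q
    return np.argsort(-scores), scores

def refine(query_vec, image_vecs, positives, negatives,
           alpha=1.0, beta=0.75, gamma=0.25):
    """Pull the query toward images marked relevant and away from rejected ones."""
    q = alpha * query_vec
    if positives:
        q = q + beta * image_vecs[positives].mean(axis=0)
    if negatives:
        q = q - gamma * image_vecs[negatives].mean(axis=0)
    return q

# Usage with random 512-d stand-ins; query_vec would normally come from a text encoder.
rng = np.random.default_rng(0)
image_vecs = rng.normal(size=(1000, 512))
query_vec = rng.normal(size=512)
order, _ = rank(query_vec, image_vecs)
query_vec = refine(query_vec, image_vecs,
                   positives=[int(order[0])], negatives=[int(order[5])])
order, _ = rank(query_vec, image_vecs)  # re-rank after one round of feedback
```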
Capturing and processing video is increasingly common as cameras become cheaper to deploy. At the same time, rich video-understanding methods have progressed greatly in the last decade. As a result, many organizations now have massive repositories of video data, with applications in mapping, navigation, autonomous driving, and other areas. Because state-of-the-art object-detection methods are slow and expensive, our ability to process even simple ad-hoc object search queries (“find 100 traffic lights in dashcam video”) over this accumulated data lags far behind our ability to collect the data. Processing video at reduced sampling rates is a reasonable default strategy for these types of queries; however, the ideal sampling rate is both data- and query-dependent. We introduce ExSample, a low-cost framework for object search over un-indexed video that quickly processes search queries by adapting the amount and location of sampled frames to the particular data and query being processed. ExSample prioritizes the processing of frames in a video repository so that processing is focused on the portions of video most likely to contain objects of interest. It frames search as a multi-armed bandit problem where each arm corresponds to a portion of a video. On large, real-world datasets, ExSample reduces processing time by 1.9x on average and up to 6x over an efficient random sampling baseline. Moreover, we show ExSample finds many results long before sophisticated, state-of-the-art baselines based on proxy scores can begin producing their first results.
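To illustrate the bandit framing, here is a hedged sketch in which each chunk of video is an arm and a Beta-Bernoulli Thompson sampler decides where to sample the next frame; this simplification stands in for ExSample’s actual scoring rule, and `detector` / `random_frame` are hypothetical helpers:

```python
# Each chunk of video is treated as a bandit arm; a Beta-Bernoulli Thompson
# sampler picks which chunk to sample the next frame from. This is a
# simplification for illustration, not ExSample's actual scoring rule.
import random

class ChunkSampler:
    def __init__(self, num_chunks):
        self.hits = [0] * num_chunks    # sampled frames that yielded a new object
        self.misses = [0] * num_chunks  # sampled frames that did not

    def choose_chunk(self):
        # Draw a plausible hit rate for each chunk and pick the most promising one.
        draws = [random.betavariate(self.hits[i] + 1, self.misses[i] + 1)
                 for i in range(len(self.hits))]
        return max(range(len(draws)), key=draws.__getitem__)

    def update(self, chunk, found_new_object):
        if found_new_object:
            self.hits[chunk] += 1
        else:
            self.misses[chunk] += 1

# Hypothetical usage, assuming detector(frame) and random_frame(chunk) exist:
# sampler = ChunkSampler(num_chunks=100)
# found = 0
# while found < 100:
#     c = sampler.choose_chunk()
#     hit = detector(random_frame(c))
#     sampler.update(c, hit)
#     found += int(hit)
```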
Ad-hoc Searches on Image Databases
Oscar Moll, Sam Madden, and Vijay Gadepally
In Heterogeneous Data Management, Polystores, and Analytics for Healthcare, Dec 2022
Searching for ad-hoc objects in image and video datasets can be expensive and time-consuming. As image data becomes more common, we increasingly need systems to help us query it. In this talk, we describe related work and then explore two approaches, ExSample and Seesaw, which target different scenarios of image and video search.
We demonstrate Vaas, a video analytics system for large-scale datasets. Vaas provides an interactive interface to rapidly develop and experiment with different workflows for solving a video analytics task. Users express these workflows as Vaas queries, which specify data flow graphs where nodes may be implemented by machine learning models, custom code, or basic built-in operations (e.g., cropping, selecting detections by class, filtering tracks by bounding boxes). For example, the problem of detecting lane change events in dashboard camera video could be solved directly as an activity recognition task, by training a model to classify whether a segment of video contains a lane change, or decomposed into a set of simpler tasks, such as detecting lane markers and then identifying shifts in the detected lanes. Our system interface incorporates a query composition tool, where users can rapidly compose operations to implement a workflow, and an exploration tool, where users can experiment with a query by applying it over samples from the dataset to fix bugs and tune parameters. Vaas incorporates recent work in approximate video query processing to support the fast, interactive execution of queries, and accelerates the annotation process of hand-labeling examples to train models by allowing users to annotate over the outputs of previously expressed queries rather than the entire video dataset.
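As a rough illustration of the decomposed workflow described above (detect lane markers, then flag shifts), the following toy dataflow graph uses made-up node implementations; it is not Vaas’s query language or operator set:

```python
# A toy dataflow graph mirroring the decomposition above: one node "detects"
# lane markers, a downstream node flags large shifts. Node implementations and
# names are made up; this is not Vaas's query language or operator set.
from dataclasses import dataclass, field
from typing import Callable, List

@dataclass
class Node:
    name: str
    fn: Callable
    inputs: List["Node"] = field(default_factory=list)

    def run(self, frame):
        args = [inp.run(frame) for inp in self.inputs] or [frame]
        return self.fn(*args)

# Frames are stand-ins: lists of detected lane-marker x-positions.
detect_lanes = Node("detect_lane_markers", fn=lambda frame: frame)
lane_shift = Node("lane_shift",
                  fn=lambda lanes: max(lanes) - min(lanes) > 2.0,
                  inputs=[detect_lanes])

frames = [[0.0, 0.5], [0.0, 3.1], [0.1, 0.4]]
lane_changes = [f for f in frames if lane_shift.run(f)]  # -> [[0.0, 3.1]]
```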
Individual write quorums for a log-structured distributed storage system
Samuel James McKelvie, Benjamin Tobler, James Mcclellan Corey, Pradeep Jnana Madhavarapu, Oscar Ricardo Moll Thomae, Christopher Richard Newcombe, Yan Valerie Leshinsky, and Anurag Windlass Gupta
Mar 2019
US Patent and Trademark Office
2017
Exploring big volume sensor data with Vroom
Oscar Moll, Aaron Zalewski, Sudeep Pillai, Samuel Madden, Michael Stonebraker, and Vijay Gadepally
In-memory databases require careful tuning and many engineering tricks to achieve good performance. Such database performance engineering is hard: a plethora of data- and hardware-dependent optimization techniques form a design space that is difficult to navigate for a skilled engineer, and even more so for a query compiler. To facilitate performance-oriented design exploration and query plan compilation, we present Voodoo, a declarative intermediate algebra that abstracts the detailed architectural properties of the hardware, such as multi- or many-core architectures, caches, and SIMD registers, without losing the ability to generate highly tuned code. Because it consists of a collection of declarative, vector-oriented operations, Voodoo is easier to reason about and tune than low-level C and related hardware-focused extensions (Intrinsics, OpenCL, CUDA, etc.). This enables our Voodoo compiler to produce (OpenCL) code that rivals and even outperforms the fastest state-of-the-art in-memory databases for both GPUs and CPUs. In addition, Voodoo makes it possible to express techniques as diverse as cache-conscious processing, predication, and vectorization (again on both GPUs and CPUs) with just a few lines of code. Central to our approach is a novel idea we termed control vectors, which allows a code-generating frontend to expose parallelism to the Voodoo compiler in an abstract manner, enabling portable performance across hardware platforms. We used Voodoo to build an alternative backend for MonetDB, a popular open-source in-memory database. Our backend allows MonetDB to perform at the same level as highly tuned in-memory databases, including HyPer and Ocelot. We also demonstrate Voodoo’s usefulness when investigating hardware-conscious tuning techniques, assessing their performance on different queries, devices, and data.
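To give a flavor of what “declarative, vector-oriented operations” can look like, here is a small numpy sketch expressing a predicated, grouped sum purely as whole-vector operations; the operators shown are our own illustration and do not reproduce Voodoo’s actual algebra or its control vectors:

```python
# A predicated, grouped sum written purely as whole-vector operations in numpy,
# to give a flavor of the vector-oriented style; the operators shown are ours,
# not Voodoo's algebra or its control vectors.
import numpy as np

price = np.array([5.0, 9.0, 3.0, 7.0])
group = np.array([0, 1, 0, 1])

mask = price > 4.0                        # vectorized predicate (predication)
out = np.zeros(2)
np.add.at(out, group[mask], price[mask])  # scatter-add grouped sum -> [5.0, 16.0]
```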
What Makes a Good Physical Plan? Experiencing Hardware-Conscious Query Optimization with Candomblé
Holger Pirk, Oscar Moll, and Sam Madden
In Proceedings of the 2016 International Conference on Management of Data, SIGMOD Conference 2016, San Francisco, CA, USA, June 26 - July 01, 2016, Jun 2016
2015
Amalgamated Lock-Elision
Yehuda Afek, Alexander Matveev, Oscar R. Moll, and Nir Shavit
In Distributed Computing - 29th International Symposium, DISC 2015, Tokyo, Japan, October 7-9, 2015, Proceedings, Oct 2015