Approximating Streaming Window Joins Under CPU Limitations
Loading...
Files
Date
Authors
Ayad, Ahmed
Naughton, Jeffrey
Wright, Stephen
Srivastava, Utkarsh
Advisors
License
DOI
Type
Technical Report
Journal Title
Journal ISSN
Volume Title
Publisher
University of Wisconsin-Madison Department of Computer Sciences
Grantor
Abstract
Data streaming systems face the possibility of having to shed load in the case of CPU or memory resource limitations. In this paper, we study the CPU limited
scenario in detail. First, we propose a new model for the CPU cost. Then we formally state the problem of shedding load for the goal of obtaining the maximum possible subset of the complete answer, assuming an offline scenario. We then present an online strategy for semantic load shedding, assuming a frequency model. Moving on to random load shedding, we prove that having an open memory budget does not help the CPU constrained case. We then present a random load
shedding strategy ? Probe-No-Insert ? based on decoupling the window maintenance and tuple production operations of the symmetric hash join, and prove that it
always dominates the previously proposed coin flipping strategy. Finally, we investigate the problem in case the goal is to obtain a random sample of the join.
Description
Keywords
Related Material and Data
Citation
TR1542