Interacting with Large Distributed Datasets Using Sketch
| dc.contributor.author | Budiu, Mihai | |
| dc.contributor.author | Isaacs, Rebecca | |
| dc.contributor.author | Murray, Derek | |
| dc.contributor.author | Plotkin, Gordon | |
| dc.contributor.author | Barham, Paul | |
| dc.contributor.author | Al-Kiswany, Samer | |
| dc.contributor.author | Boshmaf, Yazan | |
| dc.contributor.author | Luo, Qingzhou | |
| dc.contributor.author | Andoni, Alexandr | |
| dc.date.accessioned | 2015-02-03T18:18:49Z | |
| dc.date.available | 2015-02-03T18:18:49Z | |
| dc.date.issued | 2015-01-29 | |
| dc.description.abstract | We present Sketch, a distributed software infrastructure for building interactive tools for exploring large datasets, distributed across multiple machines. We have built three sophisticated applications using this framework: a billion-row spreadsheet, a distributed log browser, and a distributed- systems performance debugging tool. Sketch applications allow interactive and responsive exploration of complex distributed datasets, scaling gracefully to large system sizes. The conflicting constraints of large-scale data and small timescales required by human interaction are difficult to satisfy simultaneously. Sketch exploits a sweet spot in this trade-off by exploiting the observation that the precision of a data view is limited by the resolution of the user?s screen. The system pushes data reduction operations to the data sources. The core Sketch abstraction provides a narrow programming interface; Sketch clients construct a distributed application by stacking modular components with identical interfaces, each providing a useful feature: network transparency, concurrency, fault-tolerance, straggler avoidance, round-trip reduction, distributed aggregation. | en |
| dc.identifier.citation | TR1817 | en |
| dc.identifier.uri | http://digital.library.wisc.edu/1793/70467 | |
| dc.subject | distributed systems | en |
| dc.subject | sketching | en |
| dc.subject | interactive visualization | en |
| dc.subject | big data visualization | en |
| dc.title | Interacting with Large Distributed Datasets Using Sketch | en |
| dc.type | Technical Report | en |