A Comparison of C-Store and Row-Store in a Common Framework

Loading...
Thumbnail Image

Date

Authors

Halverson, Alan
Beckmann, Jennifer L.
Naughton, Jeffrey F.
Dewitt, David J.

Advisors

License

DOI

Type

Technical Report

Journal Title

Journal ISSN

Volume Title

Publisher

University of Wisconsin-Madison Department of Computer Sciences

Grantor

Abstract

Recently, a ?column store? system called CStore has shown significant performance benefits by utilizing storage optimizations for a read-mostly query workload. The authors of the C-Store paper compared their optimized column store to a commercial row store RDBMS that is optimized for a mixture of reads and writes, which obscures the relative benefits of row and column stores. In this paper, we describe two storage optimizations for a row store architecture given a read-mostly query workload ? ?super tuples? and ?column abstraction.? We implement both our optimized row store and C-Store in a common framework in order to perform an ?apples-to-apples? comparison of the optimizations in isolation and combination. We also develop a detailed cost model for sequential scans tobreak down time spent into three categories ? disk I/O, iteration cost, and local tuple reconstruction cost. We conclude that, while the C-Store system offers tremendous performance benefits for scanning a small fraction of columns from a table, our optimized row store provides disk storage savings, reduced sequential scan times, and low additional CPU overheads while requiring only evolutionary changes to a standard row store.

Description

Keywords

Related Material and Data

Citation

TR1570

Sponsorship

Endorsement

Review

Supplemented By

Referenced By