Integrating Compression and Execution in Column-Oriented Database Systems. Abadi, D. J., Madden, S. R., & Ferreira, M. In SIGMOD, pages 671-682, Chicago, IL, USA, 2006.
Column-oriented database system architectures invite a re-evaluation of how and when data in databases is compressed. Storing data in a column-oriented fashion greatly increases the similarity of adjacent records on disk and thus opportunities for compression. The ability to compress many adjacent tuples at once lowers the per-tuple cost of compression, both in terms of CPU and space overheads. In this paper, we discuss how we extended C-Store (a column-oriented DBMS) with a compression sub-system. We show how compression schemes not traditionally used in row-oriented DBMSs can be applied to column-oriented systems. We then evaluate a set of compression schemes and show that the best scheme depends not only on the properties of the data but also on the nature of the query workload.
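The abstract's point that column-oriented storage increases the similarity of adjacent values can be illustrated with a minimal sketch. Run-length encoding is used here only as a familiar example of a scheme that benefits from adjacent repeats; the toy table, the names `run_length_encode`, `rows`, `row_layout`, and `region_column`, and the choice of scheme are assumptions made for illustration and are not taken from the paper or from C-Store.

```python
from itertools import groupby


def run_length_encode(values):
    """Collapse consecutive equal values into (value, run_length) pairs."""
    return [(value, sum(1 for _ in group)) for value, group in groupby(values)]


# Hypothetical toy table of (region, quantity) tuples, sorted by region.
rows = [
    ("east", 5), ("east", 7), ("east", 9),
    ("west", 5), ("west", 7), ("west", 9),
]

# Row-oriented layout: the fields of each tuple are interleaved,
# so adjacent stored values rarely repeat.
row_layout = [field for row in rows for field in row]

# Column-oriented layout: all values of one attribute are adjacent.
region_column = [region for region, _ in rows]

row_runs = run_length_encode(row_layout)
column_runs = run_length_encode(region_column)

print(len(row_layout), "values ->", len(row_runs), "runs (row layout)")
print(len(region_column), "values ->", len(column_runs), "runs (region column)")
```

In this toy case the interleaved layout yields one run per stored value, while the region column collapses to two runs. Real data and real schemes will behave differently, which is consistent with the abstract's remark that the best scheme depends on both the data and the query workload.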
@inproceedings{cstore-comp,
author = {Daniel J. Abadi and Samuel R. Madden and Miguel Ferreira},
title = {Integrating Compression and Execution in Column-Oriented Database Systems},
booktitle = {SIGMOD},
year = {2006},
address = {Chicago, IL, USA},
pages = {671--682},
venue = "SIGMOD",
url_Paper = "http://www.cs.umd.edu/~abadi/papers/abadisigmod06.pdf",
abstract = "Column-oriented database system architectures invite a re-evaluation of how and when data in databases is compressed. Storing data in a column-oriented fashion greatly increases the similarity of adjacent records on disk and thus opportunities for compression. The ability to compress many adjacent tuples at once lowers the per-tuple cost of compression, both in terms of CPU and space overheads. In this paper, we discuss how we extended C-Store (a column-oriented DBMS) with a compression sub-system. We show how compression schemes not traditionally used in row-oriented DBMSs can be applied to column-oriented systems. We then evaluate a set of compression schemes and show that the best scheme depends not only on the properties of the data but also on the nature of the query workload.",
pdfKB = "265",
publicationtype = "Conference Paper",
displayCategory = "Conference or Journal Publication",
keywords = "Scalable queries,Analytical database systems,Column-stores,C-Store",
project:multiple = "C-Store",
}