Footprint Reduction and Uniqueness Enforcement with Hash Indices in SAP HANA

Faust, Martin; Boissier, Martin; Keller, Marvin; Schwalb, David; Bischoff, Holger; Eisenreich, Katrin; Färber, Franz; Plattner, Hasso

doi:10.1007/978-3-319-44406-2_11

Part of the book series: Lecture Notes in Computer Science (LNISA, volume 9828)

Included in the following conference series:

International Conference on Database and Expert Systems Applications

Abstract

Databases commonly use multi-column indices for composite keys that concatenate attribute values for fast entity retrieval. For real-world applications, such concatenated composite keys contribute significantly to the overall space consumption, which is particularly expensive for main memory-resident databases. We present an integer-based hash representation of the actual values for the purpose of reducing the overall memory footprint of a system while maintaining the level of performance. We analyzed the performance impact as well as the memory footprint reduction of hash-based indices in SAP HANA in a real-world enterprise database setting. For a live production SAP ERP system, the introduction of hash-based primary key indices alone reduces the entire memory footprint by 10 % with comparable performance.
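The idea of replacing concatenated key values with an integer hash can be illustrated with a minimal sketch. It is not the SAP HANA implementation: the class `HashKeyIndex`, the function `composite_hash`, the CRC32 hash, and the separator byte are all assumptions chosen for illustration. The index stores only integer hashes and row positions, and it enforces uniqueness by comparing candidate rows against the actual column values, because distinct keys may collide on the same hash.

```python
import zlib
from collections import defaultdict


def composite_hash(values):
    """Map a tuple of key attribute values to a 32-bit integer.

    CRC32 and the unit-separator byte are illustrative assumptions.
    """
    encoded = "\x1f".join(str(v) for v in values).encode("utf-8")
    return zlib.crc32(encoded)


class HashKeyIndex:
    """Toy primary-key index holding integer hashes instead of concatenated keys."""

    def __init__(self, num_key_columns):
        # One plain list per key attribute stands in for a table column.
        self.key_columns = [[] for _ in range(num_key_columns)]
        # hash value -> row positions whose key produced that hash
        self.buckets = defaultdict(list)

    def _key_at(self, row):
        return tuple(col[row] for col in self.key_columns)

    def lookup(self, key):
        """Return the row position for key, or None if it is absent."""
        key = tuple(key)
        for row in self.buckets.get(composite_hash(key), ()):
            # Verify against the stored values to rule out hash collisions.
            if self._key_at(row) == key:
                return row
        return None

    def insert(self, key):
        """Append a row with this key; reject duplicates to keep keys unique."""
        key = tuple(key)
        if self.lookup(key) is not None:
            raise ValueError(f"duplicate primary key: {key!r}")
        for col, value in zip(self.key_columns, key):
            col.append(value)
        row = len(self.key_columns[0]) - 1
        self.buckets[composite_hash(key)].append(row)
        return row


index = HashKeyIndex(num_key_columns=2)
first = index.insert(("1000", 1))
second = index.insert(("1000", 2))
```

In this sketch the memory held by the index depends on the number of rows rather than on the length of the key attributes, which is the footprint effect the abstract describes. The price is the extra verification step against the column values on every lookup and insert.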

Author information

Authors and Affiliations

Hasso Plattner Institute, Potsdam, Germany
Martin Faust, Martin Boissier, Marvin Keller, David Schwalb & Hasso Plattner
SAP SE, Walldorf, Germany
Holger Bischoff, Katrin Eisenreich & Franz Färber

Corresponding author

Correspondence to Martin Boissier.

Editor information

Editors and Affiliations

Clausthal University of Technology, Clausthal-Zellerfeld, Germany
Sven Hartmann
Victoria University of Wellington, Wellington, New Zealand
Hui Ma

About this paper

Cite this paper

Faust, M. et al. (2016). Footprint Reduction and Uniqueness Enforcement with Hash Indices in SAP HANA. In: Hartmann, S., Ma, H. (eds) Database and Expert Systems Applications. DEXA 2016. Lecture Notes in Computer Science, vol 9828. Springer, Cham. https://doi.org/10.1007/978-3-319-44406-2_11

DOI: https://doi.org/10.1007/978-3-319-44406-2_11
Published: 06 August 2016
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-44405-5
Online ISBN: 978-3-319-44406-2
eBook Packages: Computer Science, Computer Science (R0)
