Resiliency-aware Data Compression for In-memory Database Systems

Till Kolditz, Dirk Habich, Patrick Damme, Wolfgang Lehner, Dmitrii Kuvaiskii, Oleksii Oleksenko, Christof Fetzer

2015

Abstract

Nowadays, database systems pursuit a main memory-centric architecture, where the entire business-related data is stored and processed in a compressed form in main memory. In this case, the performance gain is massive because database operations can benefit from its higher bandwidth and lower latency. However, current main memory-centric database systems utilize general-purpose error detection and correction solutions to address the emerging problem of increasing dynamic error rate of main memory. The costs of these generalpurpose methods dramatically increases with increasing error rates. To reduce these costs, we have to exploit context knowledge of database systems for resiliency. Therefore, we introduce our vision of resiliency-aware data compression in this paper, where we want to exploit the benefits of both fields in an integrated approach with low performance and memory overhead. In detail, we present and evaluate a first approach using AN encoding and two different compression schemes to show the potentials and challenges of our vision.

References

  1. Abadi, D., Madden, S., and Ferreira, M. Integrating compression and execution in column-oriented database systems. In SIGMOD, pages 671-682.
  2. Chen, Z., Gehrke, J., and Korn, F. (2001). Query optimization in compressed database systems. SIGMOD Rec., 30(2):271-282.
  3. Garcia-Molina, H. and Salem, K. (1992). Main Memory Database Systems: An Overview. Knowledge and Data Engineering, 4(6).
  4. Graefe, G., Kuno, H., and Seeger, B. (2012). Selfdiagnosing and self-healing indexes. In DBTest, pages 8:1-8:8.
  5. Graefe, G. and Stonecipher, R. (2009). Efficient verification of b-tree integrity. In BTW, pages 27-46.
  6. Hoffmann, M., Ulbrich, P., Dietrich, C., Schirmeier, H., Lohmann, D., and Schröder-Preikschat, W. (2014). A Practitioner's Guide to Software-based Soft-Error Mitigation Using AN-Codes. In HASE 7814, pages 33- 40.
  7. Hwang, A. A., Stefanovici, I. A., and Schroeder, B. (2012). Cosmic Rays Don't Strike Twice: Understanding the Nature of DRAM Errors and the Implications for System Design. SIGARCH Comput. Archit. News, 40(1).
  8. Italiano, G. F. (2010). Resilient algorithms and data structures. In CIAC 2010, pages 13-24.
  9. Kolditz, T., Kissinger, T., Schlegel, B., Habich, D., and Lehner, W. (2014). Online bit flip detection for inmemory b-trees on unreliable hardware. In DaMoN, pages 5:1-5:9.
  10. Lemire, D. and Boytsov, L. (2012). Decoding billions of integers per second through vectorization. CoRR, abs/1209.2137.
  11. Moon, T. K. (2005). Error Correction Coding: Mathematical Methods and Algorithms. Wiley.
  12. Roth, M. A. and Van Horn, S. J. (1993). Database compression. SIGMOD Rec., 22(3):31-39.
  13. Schiffel, U. (2011). Hardware Error Detection Using ANCodes. PhD thesis, Technische Universität Dresden.
  14. Schlegel, B., Gemulla, R., and Lehner, W. (2010). Fast integer compression using simd instructions. In DaMoN, pages 34-40.
  15. Sullivan, M. and Stonebraker, M. (1991). Using write protected data structures to improve software fault tolerance in highly available database management systems. In VLDB, pages 171-180.
  16. Warren, H. S. (2002). Hacker's Delight. Addison-Wesley Longman Publishing Co., Inc., Boston, MA, USA.
Download


Paper Citation


in Harvard Style

Kolditz T., Habich D., Damme P., Lehner W., Kuvaiskii D., Oleksenko O. and Fetzer C. (2015). Resiliency-aware Data Compression for In-memory Database Systems . In Proceedings of 4th International Conference on Data Management Technologies and Applications - Volume 1: DATA, ISBN 978-989-758-103-8, pages 326-331. DOI: 10.5220/0005557303260331


in Bibtex Style

@conference{data15,
author={Till Kolditz and Dirk Habich and Patrick Damme and Wolfgang Lehner and Dmitrii Kuvaiskii and Oleksii Oleksenko and Christof Fetzer},
title={Resiliency-aware Data Compression for In-memory Database Systems},
booktitle={Proceedings of 4th International Conference on Data Management Technologies and Applications - Volume 1: DATA,},
year={2015},
pages={326-331},
publisher={SciTePress},
organization={INSTICC},
doi={10.5220/0005557303260331},
isbn={978-989-758-103-8},
}


in EndNote Style

TY - CONF
JO - Proceedings of 4th International Conference on Data Management Technologies and Applications - Volume 1: DATA,
TI - Resiliency-aware Data Compression for In-memory Database Systems
SN - 978-989-758-103-8
AU - Kolditz T.
AU - Habich D.
AU - Damme P.
AU - Lehner W.
AU - Kuvaiskii D.
AU - Oleksenko O.
AU - Fetzer C.
PY - 2015
SP - 326
EP - 331
DO - 10.5220/0005557303260331