Authors:
Nusrat Jahan Lisa
1
;
Annett Ungethüm
1
;
Dirk Habich
1
;
Nguyen Duy Anh Tuan
2
;
Akash Kumar
2
and
Wolfgang Lehner
1
Affiliations:
1
Database Systems Group, Technische Universität Dresden, Dresden and Germany
;
2
Processor Design Group, Technische Universität Dresden, Dresden and Germany
Keyword(s):
Column Stores, Scan Operation, Vectorization, FPGA, Pipeline.
Related
Ontology
Subjects/Areas/Topics:
Data Engineering
;
Database Architecture and Performance
;
Databases and Data Security
;
Query Processing and Optimization
Abstract:
The key objective of database systems is to reliably manage data, whereby high query throughput and low query latency are core requirements. To satisfy these requirements for analytical query workloads, in-memory column store database systems are state-of-the-art. In these systems, relational tables are organized by column rather than by row, so that a full column scan is a fundamental key operation and thus, the optimization of the key operation is very crucial. For this reason, we investigated the optimization of a well-known scan technique using SIMD (Single Instruction Multiple Data) vectorization as well as using Field Programmable Gate Arrays (FPGA). In this paper, we present both optimization approaches with the goal to increase the intra-instruction execution parallelism to process more columns values in a single instruction simultaneously. For both, we present selective results of our exhaustive evaluation. Based on this evaluation, we draw some lessons learned for our ongoi
ng research activities.
(More)