Authors:
Alexander Krause
1
;
Frank Ebner
2
;
Dirk Habich
1
and
Wolfgang Lehner
1
Affiliations:
1
Technische Universität Dresden, Database Systems Group, Dresden and Germany
;
2
University of Applied Sciences Würzburg-Schweinfurt, Faculty of Computer Science and Business Information Systems, Würzburg and Germany
Keyword(s):
Graph Processing, In-memory, Bloom Filter, Multiprocessor System, NUMA.
Related
Ontology
Subjects/Areas/Topics:
Data Engineering
;
Database Architecture and Performance
;
Databases and Data Security
;
Databases and Information Systems Integration
;
Enterprise Information Systems
;
Large Scale Databases
;
Nosql Databases
;
Query Processing and Optimization
Abstract:
Graph pattern matching (GPM) is a core primitive in graph analysis with many applications. Efficient processing of GPM on modern NUMA systems poses several challenges, such as an intelligent storage of the graph itself or keeping track of vertex locality information. During query processing, intermediate results need to be communicated, but target partitions are not always directly identifiable, which requires all workers to scan for requested vertices. To optimize this performance bottleneck, we introduce a Bloom filter based workload reduction approach and discuss the benefits and drawbacks of different implementations. Furthermore, we show the trade-offs between invested memory and performance gain, compared to fully redundant storage.