loading
Papers Papers/2022 Papers Papers/2022

Research.Publish.Connect.

Paper

Authors: Manuel Weißbach and Thomas Springer

Affiliation: Faculty of Computer Science, Technische Universität Dresden, Germany

Keyword(s): Stream Processing, Benchmarking, Database Benchmark, Big Data, Performance.

Abstract: Data stream processing (DSP) is being used in more and more fields to process large amounts of data with minimal latency and high throughput. In typical setups, a stream processing engine is combined with additional components, especially database systems to implement complex use cases, which might cause a significant decrease of processing performance. In this paper we examine the specific data access patterns caused by data stream processing and benchmark database systems with typical use cases derived from a real-world application. Our tests involve popular databases in combination with Apache Flink to identify the system combinations with the highest processing performance. Our results show that the choice of a database is highly dependent on the data access pattern of the particular use case. In one of our benchmarks, we found a throughput difference of a factor of 46.2 between the best and the worst performing database. From our experience in implementing a complex real-world a pplication, we have derived a set of performance optimization recommendations to help system developers to select an appropriate database for their use case and to find a high-performing system configuration. (More)

CC BY-NC-ND 4.0

Sign In Guest: Register as new SciTePress user now for free.

Sign In SciTePress user: please login.

PDF ImageMy Papers

You are not signed in, therefore limits apply to your IP address 3.15.228.171

In the current month:
Recent papers: 100 available of 100 total
2+ years older papers: 200 available of 200 total

Paper citation in several formats:
Weißbach, M. and Springer, T. (2022). Performance of Databases Used in Data Stream Processing Environments. In Proceedings of the 12th International Conference on Cloud Computing and Services Science - CLOSER; ISBN 978-989-758-570-8; ISSN 2184-5042, SciTePress, pages 15-26. DOI: 10.5220/0011018300003200

@conference{closer22,
author={Manuel Weißbach. and Thomas Springer.},
title={Performance of Databases Used in Data Stream Processing Environments},
booktitle={Proceedings of the 12th International Conference on Cloud Computing and Services Science - CLOSER},
year={2022},
pages={15-26},
publisher={SciTePress},
organization={INSTICC},
doi={10.5220/0011018300003200},
isbn={978-989-758-570-8},
issn={2184-5042},
}

TY - CONF

JO - Proceedings of the 12th International Conference on Cloud Computing and Services Science - CLOSER
TI - Performance of Databases Used in Data Stream Processing Environments
SN - 978-989-758-570-8
IS - 2184-5042
AU - Weißbach, M.
AU - Springer, T.
PY - 2022
SP - 15
EP - 26
DO - 10.5220/0011018300003200
PB - SciTePress