applications, and with hardware exclusively dedicated
to the tests. Cassandra has data compression features,
involving data compression algorithms to save disk
space. All tests performed in this work were per-
formed with the compression disabled in order not to
degrade the performance. Complimentary tests also
must be performed using compression algorithms to
verify the impact of this in the used disk space and re-
sponse time. The compaction and SSTable generation
operations involve many disk operations, which must
be affected by the SSTable fragmentation reflected
in the disk as a file. As future work, C*DynaConf
could check the node fragmentation level – i.e., the
number of file fragments – and, in case of high frag-
mentation, emit system calls to defragment the files.
Another possible improvement would be the use of a
real IoT dataset. The simulated data used in this work
was meant to reflect a natural environment. However,
the use of data generated in a production environment
should be used to validate the efficacy and efficiency
of C*DynaConf.
REFERENCES
Apache Software Foundation (2016). Apache Cassan-
dra Monitoring. Available in http://cassandra.
apache.org/doc/latest/operating/metrics.html.
Acessed in 01/07/2021.
Apache Software Foundation (2020). Apache Cassan-
dra Compaction. Available in https://cassandra
.apache.org/doc/latest/operating/compaction/index.h-
tml?highlight=compaction%20strategies. Accessed
in 01/07/2021.
Carpenter, J. and Hewitt, E. (2016). Cassandra: The Defini-
tive Guide: Distributed Data at Web Scale. ”O’Reilly
Media, Inc.”.
Chang, F., Dean, J., Ghemawat, S., Hsieh, W. C., Wallach,
D. A., Burrows, M., Chandra, T., Fikes, A., and Gru-
ber, R. E. (2008). Bigtable: A distributed storage sys-
tem for structured data. ACM Transactions on Com-
puter Systems (TOCS), 26(2):4.
Chebotko, A., Kashlev, A., and Lu, S. (2015). A big data
modeling methodology for apache cassandra. In 2015
IEEE International Congress on Big Data, pages 238–
245. IEEE.
Criteo (2018). cassandra exporter: Apache cassan-
dra® metrics exporter for Prometheus. Available
in https://github.com/criteo/cassandra exporter. Ac-
cessed in 01/07/2021.
Datastax (2018). Configuring Compaction in Apache Cas-
sandra 3.0. Available in https://docs.datastax.com/
en/cassandra/3.0/cassandra/operations/opsConfigure
Compaction.html. Accessed in 01/07/2021.
DataStax (2018). Java Driver for Apache Cassandra -
Home. Available in https://docs.datastax.com/en/
developer/java-driver/3.5/. Accessed in 01/07/2021.
Dias, L. B. (2018). Github Repository: Experimental
data. Available in https://github.com/lucasbenevides/
mestrado. Acessado em 01/07/2021.
Dias, L. B., Holanda, M., Huacarpuma, R. C., and Jr, R.
T. d. S. (2018). NoSQL Database Performance Tun-
ing for IoT Data - Cassandra Case Study. In Proceed-
ings of the 3rd International Conference on Internet of
Things, Big Data and Security, pages 277–284, Fun-
chal, Portugal.
Eriksson, M. (2014). DateTieredCompactionStrategy :
Compaction for Time Series Data. Available in
http://www.datastax.com/dev/blog/datetieredcompac-
tionstrategy. Accessed in 01/07/2021.
Ghosh, M., Gupta, I., Gupta, S., and Kumar, N. (2015).
Fast Compaction Algorithms for NoSQL Databases.
In 2015 IEEE 35th International Conference on Dis-
tributed Computing Systems, pages 452–461.
Gujral, H., Sharma, A., and Kaur, P. (2018). Empir-
ical investigation of trends in nosql-based big-data
solutions in the last decade. In 2018 Eleventh In-
ternational Conference on Contemporary Computing
(IC3), pages 1–3. IEEE.
Hegerfors, B. (2014). Date-Tiered Com-
paction in Apache Cassandra. Available in
https://labs.spotify.com/2014/12/18/date-tiered-
compac- tion/. Acessed in 01/07/2021.
Jirsa, J. and Eriksson, M. (2016). Provide
an alternative to DTCS - ASF JIRA.
[CASSANDRA-9666]. Available in
https://issues.apache.org/jira/browse/CASSANDRA-9
666. Acessed in 01/07/2021.
Katiki Reddy, R. R. (2020). Improving Efficiency of
Data Compaction by Creating & Evaluating a Random
Compaction Strategy in Apache Cassandra. Master’s
thesis, Department of Software Engineering.
Kiraz, G. and To
˘
gay, C. (2017). Iot Data Storage: Rela-
tional & Non-Relational Database Management Sys-
tems Performance Comparison. A. Yazici & C. Turhan
(Eds.), 34:48–52.
Kona, S. (2016). Compactions in Apache Cassandra :
Performance Analysis of Compaction Strategies in
Apache Cassandra. Masters, Blekinge Institute of
Technology, Karlskrona, Sweden.
Li, T., Liu, Y., Tian, Y., Shen, S., and Mao, W. (2012). A
storage solution for massive iot data based on nosql. In
2012 IEEE International Conference on Green Com-
puting and Communications, pages 50–57.
Lu, B. and Xiaohui, Y. (2016). Research on Cassandra Data
Compaction Strategies for Time-Series Data. Journal
of Computers, 11(6):504–513.
Oliveira, M. I. S., L
´
oscio, B. F., da Gama, K. S., and
Saco, F. (2015). An
´
alise de Desempenho de Cat
´
alogo
de Produtores de Dados para Internet das Coisas
baseado em SensorML e NoSQL. In XIV Workshop
em Desempenho de Sistemas Computacionais e de
Comunicac¸
˜
ao.
Ravu, V. S. S. J. S. (2016). Compaction Strategies in
Apache Cassandra : Analysis of Default Cassandra
stress model. Masters, Blekinge Institute of Technol-
ogy, Karlskrona, Sweden.
C*DynaConf: An Apache Cassandra Auto-tuning Tool for Internet of Things Data
101