tenance performance. The same applies for replica
placement reconfiguration as shown in Fig. 6. The
increase in the importance of replica maintenance
and replica reconfiguration make the load imbalance
more difficult to minimize. Figures 5 and 6 shown
the trade-offs among load imbalance, replica place-
ment maintenance and replica placement reconfigura-
tion objectives.
Figure 6: Total data movement relative to the total previous
replica placement scheme.
7 CONCLUSION AND FUTURE
WORK
In this work, we analyzed the problem of replica pla-
cement on KVS systems based on consistent hashing
with virtual nodes for workloads with data access
skew. We formally defined our problem as a multi-
objective optimization and presented the PopRing ap-
proach based on genetic algorithm to solve the multi-
objective optimization.
Finally, we compared PopRing against the
OpenStack-Swift replica placement under different
configurations. In most configurations, PopRing
could balance workloads with data access skew
while reducing unnecessary data redundancy and mo-
vement. A moderate PopRing configuration reduced
in 52% the load imbalance and in 32% the replica pla-
cement maintenance while requiring the reconfigura-
tion (data movement) of only 6% of total system data.
As future work, we intend to evaluate PopRing not
only on simulated environment, but on real deploy-
ments as well while extending it to consider dyna-
mic workloads with restrictive agreements for service
quality.
ACKNOWLEDGEMENTS
This work was partially funded by Lenovo, as part of
its R&D investment under Brazil’s Informatics Law,
and also by LSBD/UFC.
REFERENCES
Chekam, T. T., Zhai, E., Li, Z., Cui, Y., and Ren, K. (2016).
On the synchronization bottleneck of openstack swift-
like cloud storage systems. In Computer Communica-
tions, IEEE INFOCOM 2016-The 35th Annual IEEE
International Conference on, pages 1–9. IEEE.
Cho, J.-H., Wang, Y., Chen, R., Chan, K. S., and Swami, A.
(2017). A survey on modeling and optimizing multi-
objective systems. IEEE Communications Surveys &
Tutorials.
DeCandia, G., Hastorun, D., Jampani, M., Kakulapati,
G., Lakshman, A., Pilchin, A., Sivasubramanian, S.,
Vosshall, P., and Vogels, W. (2007). Dynamo: ama-
zon’s highly available key-value store. ACM SIGOPS
operating systems review, 41(6):205–220.
Grossman, M., Thiele, C., Araya-Polo, M., Frank, F., Al-
pak, F. O., and Sarkar, V. (2016). A survey of
sparse matrix-vector multiplication performance on
large matrices. arXiv preprint arXiv:1608.00636.
Li, T., Shao, G., Zuo, W., and Huang, S. (2017). Genetic
algorithm for building optimization: State-of-the-art
survey. In Proceedings of the 9th International Con-
ference on Machine Learning and Computing, pages
205–210. ACM.
Liu, J. and Shen, H. (2016). A low-cost multi-failure re-
silient replication scheme for high data availability
in cloud storage. In High Performance Computing
(HiPC), 2016 IEEE 23rd International Conference on,
pages 242–251. IEEE.
Liu, S., Huang, X., Fu, H., and Yang, G. (2013). Under-
standing data characteristics and access patterns in a
cloud storage system. In Cluster, Cloud and Grid
Computing (CCGrid), 2013 13th IEEE/ACM Interna-
tional Symposium on, pages 327–334. IEEE.
Long, S.-Q., Zhao, Y.-L., and Chen, W. (2014). Morm:
A multi-objective optimized replication management
strategy for cloud storage cluster. Journal of Systems
Architecture, 60(2):234–244.
Makris, A., Tserpes, K., Anagnostopoulos, D., and Alt-
mann, J. (2017). Load balancing for minimizing the
average response time of get operations in distributed
key-value stores. In Networking, Sensing and Control
(ICNSC), 2017 IEEE 14th International Conference
on, pages 263–269. IEEE.
Mansouri, Y., Toosi, A. N., and Buyya, R. (2017). Cost
optimization for dynamic replication and migration
of data in cloud data centers. IEEE Transactions on
Cloud Computing.
Mseddi, A., Salahuddin, M. A., Zhani, M. F., Elbiaze, H.,
and Glitho, R. H. (2015). On optimizing replica mi-
gration in distributed cloud storage systems. In Cloud
Networking (CloudNet), 2015 IEEE 4th International
Conference on, pages 191–197. IEEE.
Zhuo, L., Wang, C.-L., and Lau, F. C. (2002). Load balan-
cing in distributed web server systems with partial do-
cument replication. In Parallel Processing, 2002. Pro-
ceedings. International Conference on, pages 305–
312. IEEE.
PopRing: A Popularity-aware Replica Placement for Distributed Key-Value Store
447