However, in conventional research on service selec-
tion, the QoS values to be considered for service se-
lection are calculated by convertingeither a single cri-
terion or multiple criteria into a single criterion. In
this study, we believe that using distributed reinforce-
ment learning (IMPALA), each element of QoS can
be learned separately, enabling service selection that
is more accurate and tailored to the user’s needs. To
demonstrate the usefulness of our method, we con-
ducted an experiment to compare the method of learn-
ing to a single task by DQN with the method of learn-
ing to multiple tasks by our method. As a result of
the experiment, it was confirmed that for all the ele-
ments of QoS, the best service was selected by learn-
ing to multi-task with our method rather than learn-
ing to single-task with DQN. Therefore, our method
is more accurate and can select services that meet the
individual needs of users.
This made it possible to select services more flex-
ibly according to users’ needs. However, although it
is now possible to select one criterion from multiple
QoS factors to suit the user’s needs, it would be desir-
able to be able to select services considering multiple
criteria from multiple QoS factors when considering
real-world applications. For example, it would be de-
sirable to select a service with high throughput and
reliability. In the future, we would like to make this
system applicable to the real world. Specifically, we
believe that by adapting multi-objective genetic algo-
rithms, we will be able to optimize for multiple crite-
ria.
ACKNOWLEDGEMENT
This work has been supported by Grant-in-Aid for
Scientific Research [KAKENHI Young Researcher]
Grant No. 20K19931.
REFERENCES
Xiongnan Jin, Sejin Chun, Jooik Jung, and Kyong-Ho
Lee. IoT Service Selection based on Physical Ser-
vice Model and Absolute Dominance Relationship.
2014 IEEE 7th International Conference on Service-
Oriented Computing and Applications.
Singh, D., Tripathi, G., and Jara, A. J. A survey of Internet-
of-Things: Future vision, architecture, challenges and
services. in Proc. of the IEEE World Forum on Inter-
net of Things (WF-IoT), pp. 287-292, IEEE, 2014.
Hongbing Wang , Jiajie Li , Qi Yu , Tianjing Hong , Jia Yan
, Wei Zhao. Integrating recurrent neural networks and
reinforcement learning for dynamic service composi-
tion. Future Generation Computer Systems Volume
107, June 2020, Pages 551-563.
H. Wang, X. Zhou, X. Zhou, W. Liu, W. Li, A. Bouguettaya.
Adaptive Service composition based on reinforce-
ment learning. International Conference on Service-
Oriented Computing, Springer, 2010, pp. 92–107.
H. Wang, G. Huang, Q. Yu. Automatic hierarchical re-
inforcement learning for efficient large-scale service
composition. 2016 IEEE International Conference on
Web Services, ICWS, 2016, pp. 57–64.
H. Wang, X. Chen, Q. Wu, Q. Yu, Z. Zheng,
A.Bouguettaya. Integrating on-policy reinforcement
learning with multi-agent techniques for adaptive
service composition. Service-Oriented Computing,
Springer, 2014, pp. 154–168.
Y. Lei, P.S. Yu. Multi-agent reinforcement learning for
service composition. 2016 IEEE International Con-
ference on Services Computing, SCC, 2016, pp.
790–793.
P. Kendrick, T. Baker, Z. Maamar, A. Hussain, R. Buyya, D.
Al-Jumeily. An efficient multi-cloud service compo-
sition using a distributed multiagent- based, memory-
driven approach. IEEE Trans. Sustain. Comput.
(2019) 1–13.
Volodymyr Mnih,Adri`a Puigdom`enech Badia, Mehdi Mirza
, Alex Graves, Tim Harley, Timothy P. Lillicrap,
David Silver, Koray Kavukcuoglu Asynchronous
Methods for Deep Reinforcement Learning. Proceed-
ings of The 33rd International Conference on Machine
Learning, PMLR 48:1928-1937, 2016.
Volodymyr Mnih, Koray Kavukcuoglu, David Silver, Alex
Graves, Ioannis Antonoglou, Daan Wierstra, Martin
Riedmiller. Playing Atari with Deep Reinforcement
Learning. NIPS Deep Learning Workshop 2013
Lasse Espeholt, Hubert Soyer, Remi Munos, Karen Si-
monyan, Volodymir Mnih, Tom Ward, Yotam Doron,
Vlad Firoiu, Tim Harley, Iain Dunning, Shane Legg,
Koray Kavukcuoglu. IMPALA: Scalable Distributed
Deep-RL with Importance Weighted Actor-Learner
Architectures. Proceedings of the International Con-
ference on Machine Learning (ICML) 2018.
Al-Masri, E., and Mahmoud Q. H. Investigating web ser-
vices on the world wide web. 17th international con-
ference on World Wide Web (WWW ’08), pp.795-
804.