Fachantidis, A., Taylor, M., and Vlahavas, I. (2017). Learn-
ing to teach reinforcement learning agents. Machine
Learning and Knowledge Extraction, 1(1):21–42.
Fagnant, D. and Kockelman, K. (2016). Dynamic ride-
sharing and fleet sizing for a system of shared au-
tonomous vehicles in Austin, texas. Transportation,
45.
Gu
´
eriau, M., Cugurullo, F., Acheampong, R. A., and Dus-
paric, I. (2020). Shared autonomous mobility on de-
mand: A learning-based approach and its performance
in the presence of traffic congestion. IEEE Intelligent
Transportation Systems Magazine, 12(4):208–218.
Gueriau, M. and Dusparic, I. (2018). Samod: Shared
autonomous mobility-on-demand using decentralized
reinforcement learning. In The 21st IEEE Interna-
tional Conference on Intelligent Transportation Sys-
tems.
Ilhan, E., Gow, J., and Perez Liebana, D. (2019). Teaching
on a budget in multi-agent deep reinforcement learn-
ing. pages 1–8.
Lazaric, A., Restelli, M., and Bonarini, A. (2008). Transfer
of samples in batch reinforcement learning. In Pro-
ceedings of the 25th International Conference on Ma-
chine Learning, ICML ’08, page 544–551, New York,
NY, USA. Association for Computing Machinery.
Liu, Y. and Samaranayake, S. (2019). Proactive rebalancing
and speed-up techniques for on-demand high capacity
vehicle pooling. CoRR.
Lopez, P. A., Behrisch, M., Bieker-Walz, L., Erdmann, J.,
Fl
¨
otter
¨
od, Y., Hilbrich, R., L
¨
ucken, L., Rummel, J.,
Wagner, P., and Wiessner, E. (2018). Microscopic
traffic simulation using sumo. In 2018 21st Interna-
tional Conference on Intelligent Transportation Sys-
tems (ITSC), pages 2575–2582.
Norouzi, M., Abdoos, M., and Bazzan, A. (2020). Experi-
ence classification for transfer learning in traffic signal
control. The Journal of Supercomputing.
NYC Taxi and Limousine Commission (2020). Tlc trip
record data.
Schulman, J., Wolski, F., Dhariwal, P., Radford, A., and
Klimov, O. (2017). Proximal policy optimization al-
gorithms. CoRR, abs/1707.06347.
Silva, F., Hernandez-Leal, P., Kartal, B., and Taylor, M.
(2020a). Uncertainty-aware action advising for deep
reinforcement learning agents.
Silva, F. L. D., Hernandez-Leal, P., Kartal, B., and Taylor,
M. D. (2020b). Uncertainty-aware action advising for
deep reinforcement learning agents. In AAAI.
Taylor, A., Dusparic, I., Gu
´
eriau, M., and Clarke, S.
(2019). Parallel transfer learning in multi-agent sys-
tems: What, when and how to transfer? In 2019
International Joint Conference on Neural Networks
(IJCNN), pages 1–8.
Taylor, M. E. (2018). Improving reinforcement learning
with human input. In Proceedings of the Twenty-
Seventh International Joint Conference on Artificial
Intelligence, IJCAI-18, pages 5724–5728. Interna-
tional Joint Conferences on Artificial Intelligence Or-
ganization.
Torrey, L. and Taylor, M. (2013). Teaching on a budget:
Agents advising agents in reinforcement learning. In
Proceedings of the 2013 international conference on
Autonomous agents and multi-agent systems, pages
1053–1060.
Wan, R., Zhang, S., Shi, C., Luo, S., and Song, R. (2021).
Pattern transfer learning for reinforcement learning in
order dispatching. arXiv preprint arXiv:2105.13218.
Wang, L., Guo, B., and Yang, Q. (2018a). Smart city de-
velopment with urban transfer learning. Computer,
51:32–41.
Wang, Z., Qin, Z., Tang, X., Ye, J., and Zhu, H. (2018b).
Deep reinforcement learning with knowledge transfer
for online rides order dispatching. pages 617–626.
Wen, J., Zhao, J., and Jaillet, P. (2017). Rebalancing shared
mobility-on-demand systems: A reinforcement learn-
ing approach. In 2017 IEEE 20th International Con-
ference on Intelligent Transportation Systems (ITSC),
pages 220–225. IEEE.
Zhu, C. (2020). Learning by reusing previous advice in
teacher-student paradigm. In Proceedings of the 2020
International Conference on Autonomous Agents and
Multi-Agent Systems, Underline Science Inc.
Zhuang, F., Qi, Z., Duan, K., Xi, D., Zhu, Y., Zhu, H.,
Xiong, H., and He, Q. (2020). A comprehensive sur-
vey on transfer learning. Proceedings of the IEEE,
109(1):43–76.
ICAART 2022 - 14th International Conference on Agents and Artificial Intelligence
130