Exploration Versus Exploitation Trade-off in Infinite Horizon Pareto Multi-armed Bandits Algorithms
Topics: Evolutionary Computing; Machine Learning
In Proceedings of the International Conference on Agents and Artificial Intelligence - Volume 1: ICAART, 66-77, 2015, Lisbon, Portugal