Tuning bandit algorithms in stochastic environments. Audibert, J., Munos, R., & Szepesv́ari, C. In Proc. Int. Conf. Algor. Learn. Theory, pages 150--165, Sendai, Japan, 2007.
Tuning bandit algorithms in stochastic environments [pdf]Paper  bibtex   

Downloads: 0