Pessimism for Offline Linear Contextual Bandits using ℓp Confidence Sets. Li, G., Ma, C., & Srebro, N. Advances in Neural Information Processing Systems, 2022.
