DIRECT: Learning from Sparse and Shifting Rewards using Discriminative Reward Co-Training. Altmann, P., Phan, T., Ritz, F., Linnhoff-Popien, C., & Gabor, T. In Proc. of the Adaptive and Learning Agents Workshop (ALA@AAMAS), 5, 2023.
DIRECT: Learning from Sparse and Shifting Rewards using Discriminative Reward Co-Training [pdf]Paper  bibtex   

Downloads: 0