Policy Shaping: Integrating Human Feedback with Reinforcement Learning. Griffith, S., Subramanian, K., Scholz, J., Jr., C. L. I., & Thomaz, A. L. In Burges, C. J. C., Bottou, L., Ghahramani, Z., & Weinberger, K. Q., editors, Advances in Neural Information Processing Systems 26: 27th Annual Conference on Neural Information Processing Systems 2013. Proceedings of a meeting held December 5-8, 2013, Lake Tahoe, Nevada, United States, pages 2625–2633, 2013.
Policy Shaping: Integrating Human Feedback with Reinforcement Learning [link]Paper  bibtex   
@inproceedings{DBLP:conf/nips/GriffithSSIT13,
  author    = {Shane Griffith and
               Kaushik Subramanian and
               Jonathan Scholz and
               Charles L. Isbell Jr. and
               Andrea Lockerd Thomaz},
  editor    = {Christopher J. C. Burges and
               L{\'{e}}on Bottou and
               Zoubin Ghahramani and
               Kilian Q. Weinberger},
  title     = {Policy Shaping: Integrating Human Feedback with Reinforcement Learning},
  booktitle = {Advances in Neural Information Processing Systems 26: 27th Annual
               Conference on Neural Information Processing Systems 2013. Proceedings
               of a meeting held December 5-8, 2013, Lake Tahoe, Nevada, United States},
  pages     = {2625--2633},
  year      = {2013},
  url       = {https://proceedings.neurips.cc/paper/2013/hash/e034fb6b66aacc1d48f445ddfb08da98-Abstract.html},
  timestamp = {Thu, 21 Jan 2021 00:00:00 +0100},
  biburl    = {https://dblp.org/rec/conf/nips/GriffithSSIT13.bib},
  bibsource = {dblp computer science bibliography, https://dblp.org}
}

Downloads: 0