Learning to Mix n-Step Returns: Generalizing lambda-Returns for Deep Reinforcement Learning. Sharma, S., J, G. R., Ramesh, S., & Ravindran, B. CoRR, 2017.
@article{DBLP:journals/corr/SharmaRJR17,
  author    = {Sahil Sharma and
               Girish Raguvir J and
               Srivatsan Ramesh and
               Balaraman Ravindran},
  title     = {Learning to Mix n-Step Returns: Generalizing lambda-Returns for Deep
               Reinforcement Learning},
  journal   = {CoRR},
  volume    = {abs/1705.07445},
  year      = {2017},
  url       = {http://arxiv.org/abs/1705.07445},
  archivePrefix = {arXiv},
  eprint    = {1705.07445},
  timestamp = {Fri, 10 Nov 2017 00:00:00 +0100},
  biburl    = {https://dblp.org/rec/bib/journals/corr/SharmaRJR17},
  bibsource = {dblp computer science bibliography, https://dblp.org}
}
