Optimistic Reinforcement Learning by Forward Kullback-Leibler Divergence Optimization. Kobayashi, T. Neural Networks, 152:169–180, 2022.
@article{kobayashiNN2022,
  author = {Taisuke Kobayashi},
  title = {Optimistic Reinforcement Learning by Forward Kullback-Leibler Divergence Optimization},
  journal = {Neural Networks},
  year = {2022},
  volume = {152},
  pages = {169--180},
  url = {https://arxiv.org/abs/2105.12991},
  doi = {10.1016/j.neunet.2022.04.021},
}

%RL for washing machine
