Conservative Offline Distributional Reinforcement Learning. Ma, Y. J., Jayaraman, D., & Bastani, O. NeurIPS, 2021.
bibtex   
@article{ma2021conservative, title= {Conservative Offline Distributional Reinforcement Learning}, author= {Ma, Yecheng Jason and {Jayaraman}, {Dinesh} and Bastani, Osbert}, journal= {NeurIPS}, year= {2021}}

Downloads: 0