Sample-Efficient Iterative Lower Bound Optimization of Deep Reactive Policies for Planning in Continuous MDPs

Sample-Efficient Iterative Lower Bound Optimization of Deep Reactive Policies for Planning in Continuous MDPs. Low, S., Kumar, A., & Sanner, S. In Proceedings of the 36th AAAI Conference on Artificial Intelligence (AAAI-22), Online, 2022.

@inproceedings{sanner:aaai22b,
	author = {{Siow Meng} Low and Akshat Kumar and Scott Sanner},
	title = {Sample-Efficient Iterative Lower Bound Optimization of Deep Reactive Policies for Planning in Continuous MDPs},
	year = {2022},
	booktitle = {Proceedings of the 36th {AAAI} Conference on Artificial Intelligence ({AAAI-22})},
	address = {Online},
	url_paper = {https://ssanner.github.io/papers/aaai22_ilbo.pdf}
}
