Online learning for Markov decision processes applied to multi-agent systems. El Chamie, M., Açıkmeşe, B., & Mesbahi, M. In 2017 IEEE 56th Annual Conference on Decision and Control (CDC), pages 1596–1601, 2017. IEEE.
@inproceedings{el2017online,
  title={Online learning for Markov decision processes applied to multi-agent systems},
  author={El Chamie, Mahmoud and A\c{c}\i kme\c{s}e, Beh\c{c}et and Mesbahi, Mehran},
  booktitle={2017 IEEE 56th Annual Conference on Decision and Control (CDC)},
  pages={1596--1601},
  year={2017},
  organization={IEEE}
}
