Online learning for Markov decision processes applied to multi-agent systems. Chamie, M. E., Açikmese, B., & Mesbahi, M. In 56th IEEE Annual Conference on Decision and Control, CDC 2017, Melbourne, Australia, December 12-15, 2017, pages 1596–1601, 2017. IEEE.
Online learning for Markov decision processes applied to multi-agent systems [link]Paper  doi  bibtex   
@inproceedings{DBLP:conf/cdc/ChamieAM17,
  author       = {Mahmoud El Chamie and
                  Beh{\c{c}}et A{\c{c}}ikmese and
                  Mehran Mesbahi},
  title        = {Online learning for Markov decision processes applied to multi-agent
                  systems},
  booktitle    = {56th {IEEE} Annual Conference on Decision and Control, {CDC} 2017,
                  Melbourne, Australia, December 12-15, 2017},
  pages        = {1596--1601},
  publisher    = {{IEEE}},
  year         = {2017},
  url          = {https://doi.org/10.1109/CDC.2017.8263879},
  doi          = {10.1109/CDC.2017.8263879},
  timestamp    = {Wed, 24 Jan 2018 00:00:00 +0100},
  biburl       = {https://dblp.org/rec/conf/cdc/ChamieAM17.bib},
  bibsource    = {dblp computer science bibliography, https://dblp.org}
}

Downloads: 0