Learning to Communicate with Deep Multi-Agent Reinforcement Learning

Learning to Communicate with Deep Multi-Agent Reinforcement Learning. Foerster, J. N., Assael, Y. M., de Freitas, N., & Whiteson, S. May, 2016. 1123 citations (Semantic Scholar/arXiv) [2023-03-18] arXiv:1605.06676 [cs]

Paper abstract bibtex

We consider the problem of multiple agents sensing and acting in environments with the goal of maximising their shared utility. In these environments, agents must learn communication protocols in order to share information that is needed to solve the tasks. By embracing deep neural networks, we are able to demonstrate endto-end learning of protocols in complex environments inspired by communication riddles and multi-agent computer vision problems with partial observability. We propose two approaches for learning in these domains: Reinforced Inter-Agent Learning (RIAL) and Differentiable Inter-Agent Learning (DIAL). The former uses deep Q-learning, while the latter exploits the fact that, during learning, agents can backpropagate error derivatives through (noisy) communication channels. Hence, this approach uses centralised learning but decentralised execution. Our experiments introduce new environments for studying the learning of communication protocols and present a set of engineering innovations that are essential for success in these domains.

@misc{foerster_learning_2016-1,
	title = {Learning to {Communicate} with {Deep} {Multi}-{Agent} {Reinforcement} {Learning}},
	url = {http://arxiv.org/abs/1605.06676},
	abstract = {We consider the problem of multiple agents sensing and acting in environments with the goal of maximising their shared utility. In these environments, agents must learn communication protocols in order to share information that is needed to solve the tasks. By embracing deep neural networks, we are able to demonstrate endto-end learning of protocols in complex environments inspired by communication riddles and multi-agent computer vision problems with partial observability. We propose two approaches for learning in these domains: Reinforced Inter-Agent Learning (RIAL) and Differentiable Inter-Agent Learning (DIAL). The former uses deep Q-learning, while the latter exploits the fact that, during learning, agents can backpropagate error derivatives through (noisy) communication channels. Hence, this approach uses centralised learning but decentralised execution. Our experiments introduce new environments for studying the learning of communication protocols and present a set of engineering innovations that are essential for success in these domains.},
	language = {en},
	urldate = {2023-03-18},
	publisher = {arXiv},
	author = {Foerster, Jakob N. and Assael, Yannis M. and de Freitas, Nando and Whiteson, Shimon},
	month = may,
	year = {2016},
	note = {1123 citations (Semantic Scholar/arXiv) [2023-03-18]
arXiv:1605.06676 [cs]},
	keywords = {Computer Science - Artificial Intelligence, Computer Science - Machine Learning, Computer Science - Multiagent Systems},
}

Downloads: 0

{"_id":"NwJxJbBnF7detuCAR","bibbaseid":"foerster-assael-defreitas-whiteson-learningtocommunicatewithdeepmultiagentreinforcementlearning-2016","authorIDs":[],"author_short":["Foerster, J. N.","Assael, Y. M.","de Freitas, N.","Whiteson, S."],"bibdata":{"bibtype":"misc","type":"misc","title":"Learning to Communicate with Deep Multi-Agent Reinforcement Learning","url":"http://arxiv.org/abs/1605.06676","abstract":"We consider the problem of multiple agents sensing and acting in environments with the goal of maximising their shared utility. In these environments, agents must learn communication protocols in order to share information that is needed to solve the tasks. By embracing deep neural networks, we are able to demonstrate endto-end learning of protocols in complex environments inspired by communication riddles and multi-agent computer vision problems with partial observability. We propose two approaches for learning in these domains: Reinforced Inter-Agent Learning (RIAL) and Differentiable Inter-Agent Learning (DIAL). The former uses deep Q-learning, while the latter exploits the fact that, during learning, agents can backpropagate error derivatives through (noisy) communication channels. Hence, this approach uses centralised learning but decentralised execution. Our experiments introduce new environments for studying the learning of communication protocols and present a set of engineering innovations that are essential for success in these domains.","language":"en","urldate":"2023-03-18","publisher":"arXiv","author":[{"propositions":[],"lastnames":["Foerster"],"firstnames":["Jakob","N."],"suffixes":[]},{"propositions":[],"lastnames":["Assael"],"firstnames":["Yannis","M."],"suffixes":[]},{"propositions":["de"],"lastnames":["Freitas"],"firstnames":["Nando"],"suffixes":[]},{"propositions":[],"lastnames":["Whiteson"],"firstnames":["Shimon"],"suffixes":[]}],"month":"May","year":"2016","note":"1123 citations (Semantic Scholar/arXiv) [2023-03-18] arXiv:1605.06676 [cs]","keywords":"Computer Science - Artificial Intelligence, Computer Science - Machine Learning, Computer Science - Multiagent Systems","bibtex":"@misc{foerster_learning_2016-1,\n\ttitle = {Learning to {Communicate} with {Deep} {Multi}-{Agent} {Reinforcement} {Learning}},\n\turl = {http://arxiv.org/abs/1605.06676},\n\tabstract = {We consider the problem of multiple agents sensing and acting in environments with the goal of maximising their shared utility. In these environments, agents must learn communication protocols in order to share information that is needed to solve the tasks. By embracing deep neural networks, we are able to demonstrate endto-end learning of protocols in complex environments inspired by communication riddles and multi-agent computer vision problems with partial observability. We propose two approaches for learning in these domains: Reinforced Inter-Agent Learning (RIAL) and Differentiable Inter-Agent Learning (DIAL). The former uses deep Q-learning, while the latter exploits the fact that, during learning, agents can backpropagate error derivatives through (noisy) communication channels. Hence, this approach uses centralised learning but decentralised execution. Our experiments introduce new environments for studying the learning of communication protocols and present a set of engineering innovations that are essential for success in these domains.},\n\tlanguage = {en},\n\turldate = {2023-03-18},\n\tpublisher = {arXiv},\n\tauthor = {Foerster, Jakob N. and Assael, Yannis M. and de Freitas, Nando and Whiteson, Shimon},\n\tmonth = may,\n\tyear = {2016},\n\tnote = {1123 citations (Semantic Scholar/arXiv) [2023-03-18]\narXiv:1605.06676 [cs]},\n\tkeywords = {Computer Science - Artificial Intelligence, Computer Science - Machine Learning, Computer Science - Multiagent Systems},\n}\n\n","author_short":["Foerster, J. N.","Assael, Y. M.","de Freitas, N.","Whiteson, S."],"key":"foerster_learning_2016-1","id":"foerster_learning_2016-1","bibbaseid":"foerster-assael-defreitas-whiteson-learningtocommunicatewithdeepmultiagentreinforcementlearning-2016","role":"author","urls":{"Paper":"http://arxiv.org/abs/1605.06676"},"keyword":["Computer Science - Artificial Intelligence","Computer Science - Machine Learning","Computer Science - Multiagent Systems"],"metadata":{"authorlinks":{}},"html":""},"bibtype":"misc","biburl":"https://bibbase.org/zotero/ifromm","creationDate":"2020-01-27T02:13:34.166Z","downloads":0,"keywords":["computer science - artificial intelligence","computer science - machine learning","computer science - multiagent systems"],"search_terms":["learning","communicate","deep","multi","agent","reinforcement","learning","foerster","assael","de freitas","whiteson"],"title":"Learning to Communicate with Deep Multi-Agent Reinforcement Learning","year":2016,"dataSources":["hEoKh4ygEAWbAZ5iy","KhfhF8P52iu5Szymq","N4kJAiLiJ7kxfNsoh"]}