Designing Neural Network Architectures using Reinforcement Learning. Baker, B., Gupta, O., Naik, N., & Raskar, R. arXiv:1611.02167 [cs], March, 2017. arXiv: 1611.02167
Designing Neural Network Architectures using Reinforcement Learning [link]Paper  abstract   bibtex   
At present, designing convolutional neural network (CNN) architectures requires both human expertise and labor. New architectures are handcrafted by careful experimentation or modified from a handful of existing networks. We introduce MetaQNN, a meta-modeling algorithm based on reinforcement learning to automatically generate high-performing CNN architectures for a given learning task. The learning agent is trained to sequentially choose CNN layers using Qlearning with an -greedy exploration strategy and experience replay. The agent explores a large but finite space of possible architectures and iteratively discovers designs with improved performance on the learning task. On image classification benchmarks, the agent-designed networks (consisting of only standard convolution, pooling, and fully-connected layers) beat existing networks designed with the same layer types and are competitive against the state-of-the-art methods that use more complex layer types. We also outperform existing meta-modeling approaches for network design on image classification tasks.
@article{baker_designing_2017,
	title = {Designing {Neural} {Network} {Architectures} using {Reinforcement} {Learning}},
	url = {http://arxiv.org/abs/1611.02167},
	abstract = {At present, designing convolutional neural network (CNN) architectures requires both human expertise and labor. New architectures are handcrafted by careful experimentation or modified from a handful of existing networks. We introduce MetaQNN, a meta-modeling algorithm based on reinforcement learning to automatically generate high-performing CNN architectures for a given learning task. The learning agent is trained to sequentially choose CNN layers using Qlearning with an -greedy exploration strategy and experience replay. The agent explores a large but finite space of possible architectures and iteratively discovers designs with improved performance on the learning task. On image classification benchmarks, the agent-designed networks (consisting of only standard convolution, pooling, and fully-connected layers) beat existing networks designed with the same layer types and are competitive against the state-of-the-art methods that use more complex layer types. We also outperform existing meta-modeling approaches for network design on image classification tasks.},
	language = {en},
	urldate = {2019-11-07},
	journal = {arXiv:1611.02167 [cs]},
	author = {Baker, Bowen and Gupta, Otkrist and Naik, Nikhil and Raskar, Ramesh},
	month = mar,
	year = {2017},
	note = {arXiv: 1611.02167},
	keywords = {Computer Science - Machine Learning}
}

Downloads: 0