Learning structured and non-redundant representations with deep neural networks. Yang, J., Xiong, W., Li, S., & Xu, C. Pattern Recognition, 86:224–235, February, 2019.
Learning structured and non-redundant representations with deep neural networks [link]Paper  doi  abstract   bibtex   
This paper proposes a novel regularizer named Structured Decorrelation Constraint, to address both the generalization and optimization of deep neural networks, including multiple-layer perceptrons and convolutional neural networks. Our proposed regularizer reduces overfitting by breaking the co-adaptions between the neurons with an explicit penalty. As a result, the network is capable of learning non-redundant representations. Meanwhile, the proposed regularizer encourages the networks to learn structured high-level features to aid the networks’ optimization during training. To this end, neurons are constrained to behave obeying a group prior. Our regularizer applies to various types of layers, including fully connected layers, convolutional layers and normalization layers. The loss of our regularizer can be directly minimized along with the network’s classification loss by stochastic gradient descent. Experiments show that the proposed regularizer obviously relieves the overfitting problem of the existing deep networks. It yields much better performance on extensive datasets than the conventional regularizers like Dropout.
@article{yang_learning_2019,
	title = {Learning structured and non-redundant representations with deep neural networks},
	volume = {86},
	issn = {0031-3203},
	url = {https://www.sciencedirect.com/science/article/pii/S0031320318303169},
	doi = {10.1016/j.patcog.2018.08.017},
	abstract = {This paper proposes a novel regularizer named Structured Decorrelation Constraint, to address both the generalization and optimization of deep neural networks, including multiple-layer perceptrons and convolutional neural networks. Our proposed regularizer reduces overfitting by breaking the co-adaptions between the neurons with an explicit penalty. As a result, the network is capable of learning non-redundant representations. Meanwhile, the proposed regularizer encourages the networks to learn structured high-level features to aid the networks’ optimization during training. To this end, neurons are constrained to behave obeying a group prior. Our regularizer applies to various types of layers, including fully connected layers, convolutional layers and normalization layers. The loss of our regularizer can be directly minimized along with the network’s classification loss by stochastic gradient descent. Experiments show that the proposed regularizer obviously relieves the overfitting problem of the existing deep networks. It yields much better performance on extensive datasets than the conventional regularizers like Dropout.},
	language = {en},
	urldate = {2021-10-18},
	journal = {Pattern Recognition},
	author = {Yang, Jihai and Xiong, Wei and Li, Shijun and Xu, Chang},
	month = feb,
	year = {2019},
	keywords = {Decorrelation, Deep networks, Overfitting},
	pages = {224--235},
}

Downloads: 0