Unsupervised feature learning for audio classification using convolutional deep belief networks. Lee, H., Pham, P., Largman, Y, & Ng, A. In Advances in neural information processing systems, pages 1–9, 2009. arXiv: 1301.3605v3 ISSN: 02643294
doi  abstract   bibtex   
In recent years, deep learning approaches have gained significant interest as a way of building hierarchical representations from unlabeled data. However, to our knowledge, these deep learning approaches have not been extensively studied for auditory data. In this paper, we apply convolutional deep belief networks to audio data and empirically evaluate them on various audio classification tasks. In the case of speech data, we show that the learned features correspond to phones/phonemes. In addition, our feature representations learned from unlabeled audio data show very good performance for multiple audio classification tasks. We hope that this paper will inspire more research on deep learning approaches applied to a wide range of audio recognition tasks.
@inproceedings{Lee2009a,
	title = {Unsupervised feature learning for audio classification using convolutional deep belief networks},
	isbn = {978-1-60558-516-1},
	doi = {10.1145/1553374.1553453},
	abstract = {In recent years, deep learning approaches have gained significant interest as a way of building hierarchical representations from unlabeled data. However, to our knowledge, these deep learning approaches have not been extensively studied for auditory data. In this paper, we apply convolutional deep belief networks to audio data and empirically evaluate them on various audio classification tasks. In the case of speech data, we show that the learned features correspond to phones/phonemes. In addition, our feature representations learned from unlabeled audio data show very good performance for multiple audio classification tasks. We hope that this paper will inspire more research on deep learning approaches applied to a wide range of audio recognition tasks.},
	booktitle = {Advances in neural information processing systems},
	author = {Lee, Honglak and Pham, Pt and Largman, Y and Ng, Ay},
	year = {2009},
	pmid = {20957573},
	note = {arXiv: 1301.3605v3
ISSN: 02643294},
	pages = {1--9},
}

Downloads: 0