Deep Salience Representations for f0 Estimation in Polyphonic Music

Deep Salience Representations for f0 Estimation in Polyphonic Music. Bittner, R., McFee, B., Salamon, J., Li, P., & Bello, J. October 2017.

Estimating fundamental frequencies in polyphonic music remains a notoriously difficult task in Music Information Retrieval. While other tasks, such as beat tracking and chord recognition, have seen improvement with the application of deep learning models, little work has been done to apply deep learning methods to fundamental-frequency-related tasks, including multi-f0 and melody tracking, primarily due to the scarce availability of labeled data. In this work, we describe a fully convolutional neural network for learning salience representations for estimating fundamental frequencies, trained using a large, semi-automatically generated f0 dataset. We demonstrate the effectiveness of our model for learning salience representations for both multi-f0 and melody tracking in polyphonic audio, and show that our models achieve state-of-the-art performance on several multi-f0 and melody datasets. We conclude with directions for future research.

@inproceedings{bittner_deep_2017,
	title = {Deep {Salience} {Representations} for f0 {Estimation} in {Polyphonic} {Music}},
	abstract = {Estimating fundamental frequencies in polyphonic music remains a notoriously difficult task in Music Information Retrieval. While other tasks, such as beat tracking and chord recognition have seen improvement with the application of deep learning models, little work has been done to apply deep learning methods to fundamental frequency related tasks including multi-f0 and melody tracking, primarily due to the scarce availability of labeled data. In this work, we describe a fully convolutional neural network for learning salience representations for estimating fundamental frequencies, trained using a large, semi-automatically generated f0 dataset. We demonstrate the effectiveness of our model for learning salience representations for both multi-f0 and melody tracking in polyphonic audio, and show that our models achieve state-of-the-art performance on several multi-f0 and melody datasets. We conclude with directions for future research.},
	author = {Bittner, Rachel and McFee, Brian and Salamon, Justin and Li, Peter and Bello, Juan},
	month = oct,
	year = {2017},
}
