The chamber ensemble generator: limitless high-quality MIR data via generative modeling

The chamber ensemble generator: limitless high-quality MIR data via generative modeling. Wu, Y., Gardner, J., Manilow, E., Simon, I., Hawthorne, C., & Engel, J. September, 2022. arXiv:2209.14458 [cs, eess]


Data is the lifeblood of modern machine learning systems, including for those in Music Information Retrieval (MIR). However, MIR has long been mired by small datasets and unreliable labels. In this work, we propose to break this bottleneck using generative modeling. By pipelining a generative model of notes (Coconet trained on Bach Chorales) with a structured synthesis model of chamber ensembles (MIDI-DDSP trained on URMP), we demonstrate a system capable of producing unlimited amounts of realistic chorale music with rich annotations including mixes, stems, MIDI, note-level performance attributes (staccato, vibrato, etc.), and even fine-grained synthesis parameters (pitch, amplitude, etc.). We call this system the Chamber Ensemble Generator (CEG), and use it to generate a large dataset of chorales from four different chamber ensembles (CocoChorales). We demonstrate that data generated using our approach improves state-of-the-art models for music transcription and source separation, and we release both the system and the dataset as an open-source foundation for future work in the MIR community.

@misc{wu_chamber_2022,
	title = {The chamber ensemble generator: limitless high-quality {MIR} data via generative modeling},
	shorttitle = {The chamber ensemble generator},
	url = {http://arxiv.org/abs/2209.14458},
	abstract = {Data is the lifeblood of modern machine learning systems, including for those in Music Information Retrieval (MIR). However, MIR has long been mired by small datasets and unreliable labels. In this work, we propose to break this bottleneck using generative modeling. By pipelining a generative model of notes (Coconet trained on Bach Chorales) with a structured synthesis model of chamber ensembles (MIDI-DDSP trained on URMP), we demonstrate a system capable of producing unlimited amounts of realistic chorale music with rich annotations including mixes, stems, MIDI, note-level performance attributes (staccato, vibrato, etc.), and even fine-grained synthesis parameters (pitch, amplitude, etc.). We call this system the Chamber Ensemble Generator (CEG), and use it to generate a large dataset of chorales from four different chamber ensembles (CocoChorales). We demonstrate that data generated using our approach improves state-of-the-art models for music transcription and source separation, and we release both the system and the dataset as an open-source foundation for future work in the MIR community.},
	language = {en},
	urldate = {2022-11-04},
	publisher = {arXiv},
	author = {Wu, Yusong and Gardner, Josh and Manilow, Ethan and Simon, Ian and Hawthorne, Curtis and Engel, Jesse},
	month = sep,
	year = {2022},
	note = {arXiv:2209.14458 [cs, eess]},
	keywords = {Computer Science - Information Retrieval, Computer Science - Machine Learning, Computer Science - Sound, Electrical Engineering and Systems Science - Audio and Speech Processing, ReadList},
}
