Corpus Analysis Tools For Computational Hook Discovery. Balen, J. V., Burgoyne, J. A., Bountouridis, D., Müllensiefen, D., & Veltkamp, R. C. October, 2015. Publisher: Zenodo
Corpus Analysis Tools For Computational Hook Discovery. [link]Paper  doi  abstract   bibtex   
Compared to studies with symbolic music data, advances in music description from audio have overwhelmingly focused on ground truth reconstruction and maximizing prediction accuracy, with only a small fraction of studies using audio description to gain insight into musical data. We present a strategy for the corpus analysis of audio data that is optimized for interpretable results. The approach brings two previously unexplored concepts to the audio domain: audio bigram distributions, and the use of corpus-relative or “second-order” descriptors. To test the real-world applicability of our method, we present an experiment in which we model song recognition data collected in a widely-played music game. By using the proposed corpus analysis pipeline we are able to present a cognitively adequate analysis that allows a model interpretation in terms of the listening history and experience of our participants. We find that our corpus-based audio features are able to explain a comparable amount of variance to symbolic features for this task when used alone and that they can supplement symbolic features profitably when the two types of features are used in tandem. Finally, we highlight new insights into what makes music recognizable.
@Article{          balen.ea2015-corpus,
    author       = {Balen, Jan Van and Burgoyne, John Ashley and
                   Bountouridis, Dimitrios and Müllensiefen, Daniel and
                   Veltkamp, Remco C.},
    year         = {2015},
    title        = {Corpus {Analysis} {Tools} {For} {Computational} {Hook}
                   {Discovery}.},
    copyright    = {Creative Commons Attribution 4.0, Open Access},
    url          = {https://zenodo.org/record/1415038},
    doi          = {10.5281/ZENODO.1415038},
    abstract     = {Compared to studies with symbolic music data, advances in
                   music description from audio have overwhelmingly focused
                   on ground truth reconstruction and maximizing prediction
                   accuracy, with only a small fraction of studies using
                   audio description to gain insight into musical data. We
                   present a strategy for the corpus analysis of audio data
                   that is optimized for interpretable results. The approach
                   brings two previously unexplored concepts to the audio
                   domain: audio bigram distributions, and the use of
                   corpus-relative or “second-order” descriptors. To test
                   the real-world applicability of our method, we present an
                   experiment in which we model song recognition data
                   collected in a widely-played music game. By using the
                   proposed corpus analysis pipeline we are able to present a
                   cognitively adequate analysis that allows a model
                   interpretation in terms of the listening history and
                   experience of our participants. We find that our
                   corpus-based audio features are able to explain a
                   comparable amount of variance to symbolic features for
                   this task when used alone and that they can supplement
                   symbolic features profitably when the two types of
                   features are used in tandem. Finally, we highlight new
                   insights into what makes music recognizable.},
    language     = {en},
    urldate      = {2023-02-23},
    month        = oct,
    note         = {Publisher: Zenodo}
}

Downloads: 0