Modelling long- and short-term structure in symbolic music with attention and recurrence. de Berardinis, J., Barrett, S., Cangelosi, A., & Coutinho, E. In CSMC + MuMe 2020: 2020 Joint Conference on AI Music Creativity, 2020.
The automatic composition of music with long-term structure is a central problem in music generation. Neural network-based models have been shown to perform relatively well in melody generation, but generating music with long-term structure is still a major challenge. This paper introduces a new approach for music modelling that combines recent advancements of transformer models with recurrent networks, the long-short term universal transformer (LSTUT), and compares its ability to predict music against current state-of-the-art music models. Our experiments are designed to push the boundaries of music models on considerably long music sequences, a crucial requirement for learning long-term structure effectively. Results show that the LSTUT outperforms all the other models and can potentially learn features related to music structure at different time scales. Overall, we show the importance of integrating both recurrence and attention in the architecture of music models, and their potential use in future automatic composition systems.
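The abstract only sketches the architecture at a high level. As a rough, hypothetical illustration of the general idea of wrapping a weight-shared (universal-style) attention block between recurrent layers, the following PyTorch sketch may help. All class names, layer sizes, and the exact layer ordering are assumptions made for illustration only; this is not the authors' implementation, and the causal masking needed for genuine next-token prediction is omitted for brevity.

import torch
import torch.nn as nn

class LSTUTSketch(nn.Module):
    """Hypothetical sketch: LSTM layers for short-term structure and a single
    self-attention block reused several times (weight sharing across depth,
    as in the Universal Transformer) for long-term structure."""
    def __init__(self, vocab_size=512, d_model=256, n_heads=4, n_recurrences=4):
        super().__init__()
        self.embed = nn.Embedding(vocab_size, d_model)
        self.lstm_in = nn.LSTM(d_model, d_model, batch_first=True)
        self.shared_attn = nn.TransformerEncoderLayer(
            d_model, n_heads, dim_feedforward=4 * d_model, batch_first=True)
        self.n_recurrences = n_recurrences
        self.lstm_out = nn.LSTM(d_model, d_model, batch_first=True)
        self.proj = nn.Linear(d_model, vocab_size)

    def forward(self, tokens):
        x = self.embed(tokens)                # (batch, seq, d_model)
        x, _ = self.lstm_in(x)                # local, short-term dependencies
        for _ in range(self.n_recurrences):   # the same block applied repeatedly
            x = self.shared_attn(x)           # global, long-range dependencies
        x, _ = self.lstm_out(x)
        return self.proj(x)                   # logits over the symbolic vocabulary

# Toy usage: a batch of 2 sequences of 128 symbolic-music tokens.
model = LSTUTSketch()
logits = model(torch.randint(0, 512, (2, 128)))   # shape: (2, 128, 512)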
@inproceedings{DeBerardinis2020,
 title = {Modelling long- and short-term structure in symbolic music with attention and recurrence},
 year = {2020},
 url = {https://boblsturm.github.io/aimusic2020/},
 address = {Stockholm, Sweden},
 abstract = {The automatic composition of music with long-term structure is a central problem in music generation. Neural network-based models have been shown to perform relatively well in melody generation, but generating music with long-term structure is still a major challenge. This paper introduces a new approach for music modelling that combines recent advancements of transformer models with recurrent networks, the long-short term universal transformer (LSTUT), and compares its ability to predict music against current state-of-the-art music models. Our experiments are designed to push the boundaries of music models on considerably long music sequences, a crucial requirement for learning long-term structure effectively. Results show that the LSTUT outperforms all the other models and can potentially learn features related to music structure at different time scales. Overall, we show the importance of integrating both recurrence and attention in the architecture of music models, and their potential use in future automatic composition systems.},
 author = {de Berardinis, J. and Barrett, S. and Cangelosi, A. and Coutinho, E.},
 booktitle = {CSMC + MuMe 2020: 2020 Joint Conference on AI Music Creativity}
}
