Scaper: A library for soundscape synthesis and augmentation. Salamon, J., MacConnell, D., Cartwright, M., Li, P., & Bello, J. P. 2017 IEEE Workshop on Applications of Signal Processing to Audio and Acoustics (WASPAA), New Paltz, NY, October 2017. IEEE. ISBN: 9781538616321. 165 citations (Semantic Scholar/DOI) [2023-12-02].
Sound event detection (SED) in environmental recordings is a key topic of research in machine listening, with applications in noise monitoring for smart cities, self-driving cars, surveillance, bioacoustic monitoring, and indexing of large multimedia collections. Developing new solutions for SED often relies on the availability of strongly labeled audio recordings, where the annotation includes the onset, offset and source of every event. Generating such precise annotations manually is very time consuming, and as a result existing datasets for SED with strong labels are scarce and limited in size. To address this issue, we present Scaper, an open-source library for soundscape synthesis and augmentation. Given a collection of isolated sound events, Scaper acts as a high-level sequencer that can generate multiple soundscapes from a single, probabilistically defined, “specification”. To increase the variability of the output, Scaper supports the application of audio transformations such as pitch shifting and time stretching individually to every event. To illustrate the potential of the library, we generate a dataset of 10,000 soundscapes and use it to compare the performance of two state-of-the-art algorithms, including a breakdown by soundscape characteristics. We also describe how Scaper was used to generate audio stimuli for an audio labeling crowdsourcing experiment, and conclude with a discussion of Scaper's limitations and potential applications.
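The idea of a single probabilistic specification yielding many strongly labeled soundscapes can be sketched in a few lines of plain Python. This is an illustration only and not Scaper's API: the `Event` class, the `SPEC` dictionary, the distribution tuples, the event labels, and every function name below are hypothetical, and no audio is read or written. Each sampled event carries an onset, offset, and source label, plus per-event pitch-shift and time-stretch values.

```python
import random
from dataclasses import dataclass


@dataclass
class Event:
    """One annotated sound event (hypothetical structure, not Scaper's)."""
    label: str           # source of the event
    onset: float         # start time in seconds
    duration: float      # length in seconds
    pitch_shift: float   # semitones, sampled per event
    time_stretch: float  # stretch factor, sampled per event

    @property
    def offset(self) -> float:
        # Simplification: offset is onset plus duration.
        return self.onset + self.duration


def sample(dist, rng):
    """Draw one value from a distribution tuple such as ("uniform", 0, 8)."""
    kind = dist[0]
    if kind == "const":
        return dist[1]
    if kind == "choose":
        return rng.choice(dist[1])
    if kind == "uniform":
        return rng.uniform(dist[1], dist[2])
    raise ValueError(f"unknown distribution: {kind}")


# Assumed example specification: each field is a distribution, not a value.
SPEC = {
    "label": ("choose", ["siren", "dog_bark", "car_horn"]),
    "onset": ("uniform", 0.0, 8.0),
    "duration": ("uniform", 0.5, 2.0),
    "pitch_shift": ("uniform", -2.0, 2.0),
    "time_stretch": ("uniform", 0.8, 1.2),
}


def instantiate(spec, n_events, rng):
    """Sample one concrete soundscape annotation from the specification."""
    events = [
        Event(**{name: sample(dist, rng) for name, dist in spec.items()})
        for _ in range(n_events)
    ]
    return sorted(events, key=lambda e: e.onset)


def generate_soundscapes(spec, n_soundscapes, n_events, seed=0):
    """Generate several different soundscapes from the same specification."""
    rng = random.Random(seed)
    return [instantiate(spec, n_events, rng) for _ in range(n_soundscapes)]


if __name__ == "__main__":
    for i, scape in enumerate(generate_soundscapes(SPEC, 2, 3)):
        for ev in scape:
            print(i, ev.label, round(ev.onset, 2), round(ev.offset, 2))
```

Because the annotation is produced by the sampler rather than by a human listener, every event's onset, offset, and label is known exactly, which is the property the abstract relies on for building strongly labeled datasets.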
@inproceedings{salamon_scaper_2017,
	title = {Scaper: {A} library for soundscape synthesis and augmentation},
	shorttitle = {Scaper},
	url = {http://ieeexplore.ieee.org/document/8170052/},
	doi = {10.1109/WASPAA.2017.8170052},
	abstract = {Sound event detection (SED) in environmental recordings is a key topic of research in machine listening, with applications in noise monitoring for smart cities, self-driving cars, surveillance, bioacoustic monitoring, and indexing of large multimedia collections. Developing new solutions for SED often relies on the availability of strongly labeled audio recordings, where the annotation includes the onset, offset and source of every event. Generating such precise annotations manually is very time consuming, and as a result existing datasets for SED with strong labels are scarce and limited in size. To address this issue, we present Scaper, an open-source library for soundscape synthesis and augmentation. Given a collection of isolated sound events, Scaper acts as a high-level sequencer that can generate multiple soundscapes from a single, probabilistically defined, “specification”. To increase the variability of the output, Scaper supports the application of audio transformations such as pitch shifting and time stretching individually to every event. To illustrate the potential of the library, we generate a dataset of 10,000 soundscapes and use it to compare the performance of two state-of-the-art algorithms, including a breakdown by soundscape characteristics. We also describe how Scaper was used to generate audio stimuli for an audio labeling crowdsourcing experiment, and conclude with a discussion of Scaper's limitations and potential applications.},
	urldate = {2023-12-02},
	booktitle = {2017 IEEE Workshop on Applications of Signal Processing to Audio and Acoustics (WASPAA)},
	author = {Salamon, Justin and MacConnell, Duncan and Cartwright, Mark and Li, Peter and Bello, Juan Pablo},
	month = oct,
	year = {2017},
	publisher = {IEEE},
	address = {New Paltz, NY},
	isbn = {9781538616321},
	note = {165 citations (Semantic Scholar/DOI) [2023-12-02]},
	pages = {344--348},
}
