Crowdsourced Multimodal Corpora Collection Tool. Jonell, P., Oertel, C., Kontogiorgos, D., Beskow, J., & Gustafson, J. In Proceedings of the Eleventh International Conference on Language Resources and Evaluation (LREC 2018) :, pages 728-734, 2018.
abstract   bibtex   
In recent years, more and more multimodal corpora have been created. To our knowledge there is no publicly available tool which allows for acquiring controlled multimodal data of people in a rapid and scalable fashion. We therefore are proposing (1) a novel tool which will enable researchers to rapidly gather large amounts of multimodal data spanning a wide demographic range, and (2) an example of how we used this tool for corpus collection of our "Attentive listener'' multimodal corpus. The code is released under an Apache License 2.0 and available as an open-source repository, which can be found at https://github.com/kth-social-robotics/multimodal-crowdsourcing-tool. This tool will allow researchers to set-up their own multimodal data collection system quickly and create their own multimodal corpora. Finally, this paper provides a discussion about the advantages and disadvantages with a crowd-sourced data collection tool, especially in comparison to a lab recorded corpora.
@inproceedings{
 title = {Crowdsourced Multimodal Corpora Collection Tool},
 type = {inproceedings},
 year = {2018},
 identifiers = {[object Object]},
 pages = {728-734},
 institution = {KTH, Speech, Music and Hearing, TMH},
 id = {d4b5d513-4bbc-38aa-a85a-6960becdcec1},
 created = {2020-01-06T19:15:02.031Z},
 file_attached = {false},
 profile_id = {fb8d345a-1d79-3791-a6c6-00233ea44521},
 last_modified = {2020-01-06T19:15:02.031Z},
 read = {false},
 starred = {false},
 authored = {true},
 confirmed = {true},
 hidden = {false},
 citation_key = {Jonell1217275},
 source_type = {inproceedings},
 notes = {QC 20180618},
 private_publication = {false},
 abstract = {In recent years, more and more multimodal corpora have been created. To our knowledge there is no publicly available tool which allows for acquiring controlled multimodal data of people in a rapid and scalable fashion. We therefore are proposing (1) a novel tool which will enable researchers to rapidly gather large amounts of multimodal data spanning a wide demographic range, and (2) an example of how we used this tool for corpus collection of our "Attentive listener'' multimodal corpus. The code is released under an Apache License 2.0 and available as an open-source repository, which can be found at https://github.com/kth-social-robotics/multimodal-crowdsourcing-tool. This tool will allow researchers to set-up their own multimodal data collection system quickly and create their own multimodal corpora. Finally, this paper provides a discussion about the advantages and disadvantages with a crowd-sourced data collection tool, especially in comparison to a lab recorded corpora. },
 bibtype = {inproceedings},
 author = {Jonell, Patrik and Oertel, Catharine and Kontogiorgos, Dimosthenis and Beskow, Jonas and Gustafson, Joakim},
 booktitle = {Proceedings of the Eleventh International Conference on Language Resources and Evaluation (LREC 2018) :}
}

Downloads: 0