Does my Speech Rock? Automatic Assessment of Public Speaking Skills. Payan, A., Sun, T., Vidal, G., Zhang, T., Coutinho, E., & Eyben, F. In Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH, volume 1, pages 2519-2523, 2015. ISCA.
Website abstract bibtex 1 download In this paper, we introduce results for the task of Automatic Public Speech Assessment (APSA). Given the comparably sparse work carried out on this task up to this point, a novel database was required for training and evaluation of machine learning models. As a basis, the freely available oral presentations of the ICASSP conference in 2011 were selected due to their transcription including non-verbal vocalisations. The data was specifically labelled in terms of the perceived oratory ability of the speakers by five raters according to a 5-point Public Speaking Skill Rating Likert scale. We investigate the feasibility of speaker-independent APSA using different standardised acoustic feature sets computed per fixed chunk of an oral presentation in a series of ternary classification and continuous regression experiments. Further, we compare the relevance of different feature groups related to fluency (speech/hesitation rate), prosody, voice quality and a variety of spectral features. Our results demonstrate that oratory speaking skills can be reliably assessed using supra-segmental audio features, with prosodic ones being particularly suited.
@inproceedings{
title = {Does my Speech Rock? Automatic Assessment of Public Speaking Skills},
type = {inproceedings},
year = {2015},
keywords = {article,conference},
pages = {2519-2523},
volume = {1},
websites = {http://www.isca-speech.org/archive/interspeech_2015/papers/i15_2519.pdf},
publisher = {ISCA},
city = {Dresden, Germany},
id = {c074b917-846e-3d0c-872d-00c4df3d3ca4},
created = {2020-05-29T11:51:37.001Z},
file_attached = {true},
profile_id = {ffa9027c-806a-3827-93a1-02c42eb146a1},
last_modified = {2020-06-08T17:31:31.714Z},
read = {false},
starred = {false},
authored = {true},
confirmed = {true},
hidden = {false},
citation_key = {azais2015doesskills},
source_type = {inproceedings},
folder_uuids = {aac08d0d-38e7-4f4e-a381-5271c5c099ce},
private_publication = {false},
abstract = {In this paper, we introduce results for the task of Automatic Public Speech Assessment (APSA). Given the comparably sparse work carried out on this task up to this point, a novel database was required for training and evaluation of machine learning models. As a basis, the freely available oral presentations of the ICASSP conference in 2011 were selected due to their transcription including non-verbal vocalisations. The data was specifically labelled in terms of the perceived oratory ability of the speakers by five raters according to a 5-point Public Speaking Skill Rating Likert scale. We investigate the feasibility of speaker-independent APSA using different standardised acoustic feature sets computed per fixed chunk of an oral presentation in a series of ternary classification and continuous regression experiments. Further, we compare the relevance of different feature groups related to fluency (speech/hesitation rate), prosody, voice quality and a variety of spectral features. Our results demonstrate that oratory speaking skills can be reliably assessed using supra-segmental audio features, with prosodic ones being particularly suited.},
bibtype = {inproceedings},
author = {Payan, Adrien and Sun, Tianjiao and Vidal, Guillaume and Zhang, Tina and Coutinho, Eduardo and Eyben, Florian},
booktitle = {Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH}
}
Downloads: 1
{"_id":"JbRMBswydk3PGXxFN","bibbaseid":"payan-sun-vidal-zhang-coutinho-eyben-doesmyspeechrockautomaticassessmentofpublicspeakingskills-2015","downloads":1,"creationDate":"2018-12-10T14:43:05.612Z","title":"Does my Speech Rock? Automatic Assessment of Public Speaking Skills","author_short":["Payan, A.","Sun, T.","Vidal, G.","Zhang, T.","Coutinho, E.","Eyben, F."],"year":2015,"bibtype":"inproceedings","biburl":"https://bibbase.org/service/mendeley/ffa9027c-806a-3827-93a1-02c42eb146a1","bibdata":{"title":"Does my Speech Rock? Automatic Assessment of Public Speaking Skills","type":"inproceedings","year":"2015","keywords":"article,conference","pages":"2519-2523","volume":"1","websites":"http://www.isca-speech.org/archive/interspeech_2015/papers/i15_2519.pdf","publisher":"ISCA","city":"Dresden, Germany","id":"c074b917-846e-3d0c-872d-00c4df3d3ca4","created":"2020-05-29T11:51:37.001Z","file_attached":"true","profile_id":"ffa9027c-806a-3827-93a1-02c42eb146a1","last_modified":"2020-06-08T17:31:31.714Z","read":false,"starred":false,"authored":"true","confirmed":"true","hidden":false,"citation_key":"azais2015doesskills","source_type":"inproceedings","folder_uuids":"aac08d0d-38e7-4f4e-a381-5271c5c099ce","private_publication":false,"abstract":"In this paper, we introduce results for the task of Automatic Public Speech Assessment (APSA). Given the comparably sparse work carried out on this task up to this point, a novel database was required for training and evaluation of machine learning models. As a basis, the freely available oral presentations of the ICASSP conference in 2011 were selected due to their transcription including non-verbal vocalisations. The data was specifically labelled in terms of the perceived oratory ability of the speakers by five raters according to a 5-point Public Speaking Skill Rating Likert scale. We investigate the feasibility of speaker-independent APSA using different standardised acoustic feature sets computed per fixed chunk of an oral presentation in a series of ternary classification and continuous regression experiments. Further, we compare the relevance of different feature groups related to fluency (speech/hesitation rate), prosody, voice quality and a variety of spectral features. Our results demonstrate that oratory speaking skills can be reliably assessed using supra-segmental audio features, with prosodic ones being particularly suited.","bibtype":"inproceedings","author":"Payan, Adrien and Sun, Tianjiao and Vidal, Guillaume and Zhang, Tina and Coutinho, Eduardo and Eyben, Florian","booktitle":"Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH","bibtex":"@inproceedings{\n title = {Does my Speech Rock? Automatic Assessment of Public Speaking Skills},\n type = {inproceedings},\n year = {2015},\n keywords = {article,conference},\n pages = {2519-2523},\n volume = {1},\n websites = {http://www.isca-speech.org/archive/interspeech_2015/papers/i15_2519.pdf},\n publisher = {ISCA},\n city = {Dresden, Germany},\n id = {c074b917-846e-3d0c-872d-00c4df3d3ca4},\n created = {2020-05-29T11:51:37.001Z},\n file_attached = {true},\n profile_id = {ffa9027c-806a-3827-93a1-02c42eb146a1},\n last_modified = {2020-06-08T17:31:31.714Z},\n read = {false},\n starred = {false},\n authored = {true},\n confirmed = {true},\n hidden = {false},\n citation_key = {azais2015doesskills},\n source_type = {inproceedings},\n folder_uuids = {aac08d0d-38e7-4f4e-a381-5271c5c099ce},\n private_publication = {false},\n abstract = {In this paper, we introduce results for the task of Automatic Public Speech Assessment (APSA). Given the comparably sparse work carried out on this task up to this point, a novel database was required for training and evaluation of machine learning models. As a basis, the freely available oral presentations of the ICASSP conference in 2011 were selected due to their transcription including non-verbal vocalisations. The data was specifically labelled in terms of the perceived oratory ability of the speakers by five raters according to a 5-point Public Speaking Skill Rating Likert scale. We investigate the feasibility of speaker-independent APSA using different standardised acoustic feature sets computed per fixed chunk of an oral presentation in a series of ternary classification and continuous regression experiments. Further, we compare the relevance of different feature groups related to fluency (speech/hesitation rate), prosody, voice quality and a variety of spectral features. Our results demonstrate that oratory speaking skills can be reliably assessed using supra-segmental audio features, with prosodic ones being particularly suited.},\n bibtype = {inproceedings},\n author = {Payan, Adrien and Sun, Tianjiao and Vidal, Guillaume and Zhang, Tina and Coutinho, Eduardo and Eyben, Florian},\n booktitle = {Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH}\n}","author_short":["Payan, A.","Sun, T.","Vidal, G.","Zhang, T.","Coutinho, E.","Eyben, F."],"urls":{"Website":"http://www.isca-speech.org/archive/interspeech_2015/papers/i15_2519.pdf"},"biburl":"https://bibbase.org/service/mendeley/ffa9027c-806a-3827-93a1-02c42eb146a1","bibbaseid":"payan-sun-vidal-zhang-coutinho-eyben-doesmyspeechrockautomaticassessmentofpublicspeakingskills-2015","role":"author","keyword":["article","conference"],"metadata":{"authorlinks":{"coutinho, e":"https://bibbase.org/service/mendeley/ffa9027c-806a-3827-93a1-02c42eb146a1"}},"downloads":1},"search_terms":["speech","rock","automatic","assessment","public","speaking","skills","payan","sun","vidal","zhang","coutinho","eyben"],"keywords":["article","conference"],"authorIDs":["mo4CFXJ7ukAMT9nho"],"dataSources":["Tcd3cXtdQsiKHPZsW","ya2CyA73rpZseyrZ8","YqW8pMoihb7JazZcx","2252seNhipfTmjEBQ"]}