Does my Speech Rock? Automatic Assessment of Public Speaking Skills. Payan, A., Sun, T., Vidal, G., Zhang, T., Coutinho, E., & Eyben, F. In Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH, volume 1, pages 2519-2523, 2015. ISCA.
Paper
Website abstract bibtex 1 download In this paper, we introduce results for the task of Automatic Public Speech Assessment (APSA). Given the comparably sparse work carried out on this task up to this point, a novel database was required for training and evaluation of machine learning models. As a basis, the freely available oral presentations of the ICASSP conference in 2011 were selected due to their transcription including non-verbal vocalisations. The data was specifically labelled in terms of the perceived oratory ability of the speakers by five raters according to a 5-point Public Speaking Skill Rating Likert scale. We investigate the feasibility of speaker-independent APSA using different standardised acoustic feature sets computed per fixed chunk of an oral presentation in a series of ternary classification and continuous regression experiments. Further, we compare the relevance of different feature groups related to fluency (speech/hesitation rate), prosody, voice quality and a variety of spectral features. Our results demonstrate that oratory speaking skills can be reliably assessed using supra-segmental audio features, with prosodic ones being particularly suited.
@inproceedings{
title = {Does my Speech Rock? Automatic Assessment of Public Speaking Skills},
type = {inproceedings},
year = {2015},
keywords = {article,conference},
pages = {2519-2523},
volume = {1},
websites = {http://www.isca-speech.org/archive/interspeech_2015/papers/i15_2519.pdf},
publisher = {ISCA},
city = {Dresden, Germany},
id = {c93b3b7a-a288-3dfd-a1c4-0e21ed9354d1},
created = {2024-08-09T12:19:58.997Z},
file_attached = {true},
profile_id = {ffa9027c-806a-3827-93a1-02c42eb146a1},
group_id = {da2a8249-fdf4-3036-ba56-7358198a1600},
last_modified = {2024-08-09T12:20:41.598Z},
read = {false},
starred = {false},
authored = {false},
confirmed = {true},
hidden = {false},
citation_key = {azais2015doesskills},
source_type = {inproceedings},
private_publication = {false},
abstract = {In this paper, we introduce results for the task of Automatic Public Speech Assessment (APSA). Given the comparably sparse work carried out on this task up to this point, a novel database was required for training and evaluation of machine learning models. As a basis, the freely available oral presentations of the ICASSP conference in 2011 were selected due to their transcription including non-verbal vocalisations. The data was specifically labelled in terms of the perceived oratory ability of the speakers by five raters according to a 5-point Public Speaking Skill Rating Likert scale. We investigate the feasibility of speaker-independent APSA using different standardised acoustic feature sets computed per fixed chunk of an oral presentation in a series of ternary classification and continuous regression experiments. Further, we compare the relevance of different feature groups related to fluency (speech/hesitation rate), prosody, voice quality and a variety of spectral features. Our results demonstrate that oratory speaking skills can be reliably assessed using supra-segmental audio features, with prosodic ones being particularly suited.},
bibtype = {inproceedings},
author = {Payan, Adrien and Sun, Tianjiao and Vidal, Guillaume and Zhang, Tina and Coutinho, Eduardo and Eyben, Florian},
booktitle = {Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH}
}
Downloads: 1
{"_id":"JbRMBswydk3PGXxFN","bibbaseid":"payan-sun-vidal-zhang-coutinho-eyben-doesmyspeechrockautomaticassessmentofpublicspeakingskills-2015","downloads":1,"creationDate":"2018-12-10T14:43:05.612Z","title":"Does my Speech Rock? Automatic Assessment of Public Speaking Skills","author_short":["Payan, A.","Sun, T.","Vidal, G.","Zhang, T.","Coutinho, E.","Eyben, F."],"year":2015,"bibtype":"inproceedings","biburl":"https://bibbase.org/service/mendeley/ffa9027c-806a-3827-93a1-02c42eb146a1","bibdata":{"title":"Does my Speech Rock? Automatic Assessment of Public Speaking Skills","type":"inproceedings","year":"2015","keywords":"article,conference","pages":"2519-2523","volume":"1","websites":"http://www.isca-speech.org/archive/interspeech_2015/papers/i15_2519.pdf","publisher":"ISCA","city":"Dresden, Germany","id":"c93b3b7a-a288-3dfd-a1c4-0e21ed9354d1","created":"2024-08-09T12:19:58.997Z","file_attached":"true","profile_id":"ffa9027c-806a-3827-93a1-02c42eb146a1","group_id":"da2a8249-fdf4-3036-ba56-7358198a1600","last_modified":"2024-08-09T12:20:41.598Z","read":false,"starred":false,"authored":false,"confirmed":"true","hidden":false,"citation_key":"azais2015doesskills","source_type":"inproceedings","private_publication":false,"abstract":"In this paper, we introduce results for the task of Automatic Public Speech Assessment (APSA). Given the comparably sparse work carried out on this task up to this point, a novel database was required for training and evaluation of machine learning models. As a basis, the freely available oral presentations of the ICASSP conference in 2011 were selected due to their transcription including non-verbal vocalisations. The data was specifically labelled in terms of the perceived oratory ability of the speakers by five raters according to a 5-point Public Speaking Skill Rating Likert scale. We investigate the feasibility of speaker-independent APSA using different standardised acoustic feature sets computed per fixed chunk of an oral presentation in a series of ternary classification and continuous regression experiments. Further, we compare the relevance of different feature groups related to fluency (speech/hesitation rate), prosody, voice quality and a variety of spectral features. Our results demonstrate that oratory speaking skills can be reliably assessed using supra-segmental audio features, with prosodic ones being particularly suited.","bibtype":"inproceedings","author":"Payan, Adrien and Sun, Tianjiao and Vidal, Guillaume and Zhang, Tina and Coutinho, Eduardo and Eyben, Florian","booktitle":"Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH","bibtex":"@inproceedings{\n title = {Does my Speech Rock? Automatic Assessment of Public Speaking Skills},\n type = {inproceedings},\n year = {2015},\n keywords = {article,conference},\n pages = {2519-2523},\n volume = {1},\n websites = {http://www.isca-speech.org/archive/interspeech_2015/papers/i15_2519.pdf},\n publisher = {ISCA},\n city = {Dresden, Germany},\n id = {c93b3b7a-a288-3dfd-a1c4-0e21ed9354d1},\n created = {2024-08-09T12:19:58.997Z},\n file_attached = {true},\n profile_id = {ffa9027c-806a-3827-93a1-02c42eb146a1},\n group_id = {da2a8249-fdf4-3036-ba56-7358198a1600},\n last_modified = {2024-08-09T12:20:41.598Z},\n read = {false},\n starred = {false},\n authored = {false},\n confirmed = {true},\n hidden = {false},\n citation_key = {azais2015doesskills},\n source_type = {inproceedings},\n private_publication = {false},\n abstract = {In this paper, we introduce results for the task of Automatic Public Speech Assessment (APSA). Given the comparably sparse work carried out on this task up to this point, a novel database was required for training and evaluation of machine learning models. As a basis, the freely available oral presentations of the ICASSP conference in 2011 were selected due to their transcription including non-verbal vocalisations. The data was specifically labelled in terms of the perceived oratory ability of the speakers by five raters according to a 5-point Public Speaking Skill Rating Likert scale. We investigate the feasibility of speaker-independent APSA using different standardised acoustic feature sets computed per fixed chunk of an oral presentation in a series of ternary classification and continuous regression experiments. Further, we compare the relevance of different feature groups related to fluency (speech/hesitation rate), prosody, voice quality and a variety of spectral features. Our results demonstrate that oratory speaking skills can be reliably assessed using supra-segmental audio features, with prosodic ones being particularly suited.},\n bibtype = {inproceedings},\n author = {Payan, Adrien and Sun, Tianjiao and Vidal, Guillaume and Zhang, Tina and Coutinho, Eduardo and Eyben, Florian},\n booktitle = {Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH}\n}","author_short":["Payan, A.","Sun, T.","Vidal, G.","Zhang, T.","Coutinho, E.","Eyben, F."],"urls":{"Paper":"https://bibbase.org/service/mendeley/ffa9027c-806a-3827-93a1-02c42eb146a1/file/34461d45-fc35-f097-25b2-22e3bb7da86f/2015___Payan_et_al___Does_my_Speech_Rock_Automatic_Assessment_of_Public_Speaking_Skills.pdf.pdf","Website":"http://www.isca-speech.org/archive/interspeech_2015/papers/i15_2519.pdf"},"biburl":"https://bibbase.org/service/mendeley/ffa9027c-806a-3827-93a1-02c42eb146a1","bibbaseid":"payan-sun-vidal-zhang-coutinho-eyben-doesmyspeechrockautomaticassessmentofpublicspeakingskills-2015","role":"author","keyword":["article","conference"],"metadata":{"authorlinks":{"coutinho, e":"https://bibbase.org/service/mendeley/ffa9027c-806a-3827-93a1-02c42eb146a1"}},"downloads":1},"search_terms":["speech","rock","automatic","assessment","public","speaking","skills","payan","sun","vidal","zhang","coutinho","eyben"],"keywords":["article","conference"],"authorIDs":["mo4CFXJ7ukAMT9nho"],"dataSources":["Tcd3cXtdQsiKHPZsW","ya2CyA73rpZseyrZ8","YqW8pMoihb7JazZcx","2252seNhipfTmjEBQ"]}