A quantitative study of disfluencies in French broadcast interviews. Boula de Mareüil, P.; Habert, B.; Bénard, F.; Adda-Decker, M.; Barras, C.; Adda, G.; and Paroubek, P. In Véronis, J. and Campione, E., editors, DISS 2005. Proceedings of the ISCA Tutorial and Research Workshop Disfluency in Spontaneous Speech, pages 27-32, Aix-en-Provence, France. 10-12 September, 2005.
A quantitative study of disfluencies in French broadcast interviews [link]Paper  abstract   bibtex   
The reported study aims at increasing our understanding of spontaneous speech-related phenomena from sibling corpora of speech and orthographic transcriptions at various levels of elaboration. It makes use of 9 hours of French broadcast interview archives, involving 10 journalists and 10 personalities from political or civil society. First we considered press-oriented transcripts, where most of the so-called disfluencies are discarded. They were then aligned with automatic transcripts, by using the LIMSI speech recogniser. This facilitated the production of exact transcripts, where all audible phenomena in non-overlapping speech segments were transcribed manually. Four types of disfluencies were distinguished: discourse markers, filled pauses, repetitions and revisions, each of which accounts for about 2% of the corpus (8% in total). They were analysed by utterance, speaker and disfluency pattern types. Four question were raised. Where do disfluencies occur in the utterance? What is the influence of the speakers' status? And what are the most frequent disfuency patterns?
@inproceedings{boula_de_mareuil_quantitative_2005,
	Address = {Aix-en-Provence, France. 10-12 September, 2005},
	Author = {Boula de Mareüil, Philippe and Habert, Benoît and Bénard, Frédérique and Adda-Decker, Martine and Barras, Claude and Adda, Gilles and Paroubek, Patrick},
	Booktitle = {DISS 2005. Proceedings of the ISCA Tutorial and Research Workshop Disfluency in Spontaneous Speech},
	Date = {2005},
	Date-Modified = {2016-09-24 18:55:59 +0000},
	Editor = {Véronis, Jean and Campione, Estelle},
	Keywords = {conversation, descriptive, disfluencies, ESTIVOZ, filled pauses, French, mass media, pauses, phonetics, prosody, radio, repairs, repetitions, speaking styles, spontaneous speech, temporal factors},
	Pages = {27-32},
	Title = {A quantitative study of disfluencies in French broadcast interviews},
	Url = {http://www.isca-speech.org/archive_open/diss_05/dis5_027.html},
	Abstract = {The reported study aims at increasing our understanding of spontaneous speech-related phenomena from sibling corpora of speech and orthographic transcriptions at various levels of elaboration. It makes use of 9 hours of French broadcast interview archives, involving 10 journalists and 10 personalities from political or civil society. First we considered press-oriented transcripts, where most of the so-called disfluencies are discarded. They were then aligned with automatic transcripts, by using the LIMSI speech recogniser. This facilitated the production of exact transcripts, where all audible phenomena in non-overlapping speech segments were transcribed manually. Four types of disfluencies were distinguished: discourse markers, filled pauses, repetitions and revisions, each of which accounts for about 2\% of the corpus (8\% in total). They were analysed by utterance, speaker and disfluency pattern types. Four question were raised. Where do disfluencies occur in the utterance? What is the influence of the speakers' status? And what are the most frequent disfuency patterns?},
	Bdsk-File-1 = {YnBsaXN0MDDUAQIDBAUGJCVYJHZlcnNpb25YJG9iamVjdHNZJGFyY2hpdmVyVCR0b3ASAAGGoKgHCBMUFRYaIVUkbnVsbNMJCgsMDxJXTlMua2V5c1pOUy5vYmplY3RzViRjbGFzc6INDoACgAOiEBGABIAFgAdccmVsYXRpdmVQYXRoWWFsaWFzRGF0YW8QawAuAC4ALwAuAC4ALwAuAC4ALwBCAGkAYgBsAGkAbwBnAHIAYQBmAGkAYQAvAFAAYQBwAGUAcgBzAC8AQgBvAHUAbABhACAAZABlACAATQBhAHIAZQB1AwgAaQBsAC8AQQAgAHEAdQBhAG4AdABpAHQAYQB0AGkAdgBlACAAcwB0AHUAZAB5ACAAbwBmACAAZABpAHMAZgBsAHUAZQBuAGMAaQBlAHMAIABpAG4AIABGAHIAZQBuAGMAaAAgAGIAcgBvAGEAZABjAGEAcwB0AC4AcABkAGbSFwsYGVdOUy5kYXRhTxECeAAAAAACeAACAAAMTWFjaW50b3NoIEhEAAAAAAAAAAAAAAAAAAAAy/YfzkgrAAAQhmfmH0EgcXVhbnRpdGF0aXZlIHN0dSMxMDg2NjdFNy5wZGYAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAABCGZ+fT6keZAAAAAAAAAAAAAwAEAAAJIAAAAAAAAAAAAAAAAAAAABBCb3VsYSBkZSBNYXJln2lsABAACAAAy/YDrgAAABEACAAA0+oreQAAAAEAFBCGZ+YQhmWOAAX8RwAF+5gAAMBGAAIAbk1hY2ludG9zaCBIRDpVc2VyczoAam9hcXVpbV9sbGlzdGVycmk6AEJpYmxpb2dyYWZpYToAUGFwZXJzOgBCb3VsYSBkZSBNYXJln2lsOgBBIHF1YW50aXRhdGl2ZSBzdHUjMTA4NjY3RTcucGRmAA4AegA8AEEAIABxAHUAYQBuAHQAaQB0AGEAdABpAHYAZQAgAHMAdAB1AGQAeQAgAG8AZgAgAGQAaQBzAGYAbAB1AGUAbgBjAGkAZQBzACAAaQBuACAARgByAGUAbgBjAGgAIABiAHIAbwBhAGQAYwBhAHMAdAAuAHAAZABmAA8AGgAMAE0AYQBjAGkAbgB0AG8AcwBoACAASABEABIAe1VzZXJzL2pvYXF1aW1fbGxpc3RlcnJpL0JpYmxpb2dyYWZpYS9QYXBlcnMvQm91bGEgZGUgTWFyZXXMiGlsL0EgcXVhbnRpdGF0aXZlIHN0dWR5IG9mIGRpc2ZsdWVuY2llcyBpbiBGcmVuY2ggYnJvYWRjYXN0LnBkZgAAEwABLwAAFQACABj//wAAgAbSGxwdHlokY2xhc3NuYW1lWCRjbGFzc2VzXU5TTXV0YWJsZURhdGGjHR8gVk5TRGF0YVhOU09iamVjdNIbHCIjXE5TRGljdGlvbmFyeaIiIF8QD05TS2V5ZWRBcmNoaXZlctEmJ1Ryb290gAEACAARABoAIwAtADIANwBAAEYATQBVAGAAZwBqAGwAbgBxAHMAdQB3AIQAjgFnAWwBdAPwA/ID9wQCBAsEGQQdBCQELQQyBD8EQgRUBFcEXAAAAAAAAAIBAAAAAAAAACgAAAAAAAAAAAAAAAAAAARe},
	Bdsk-Url-1 = {http://www.isca-speech.org/archive_open/diss_05/dis5_027.html}}
Downloads: 0