Prosody takes over: Towards a prosodically guided dialog system. Kompe, R.; Nöth, E.; Kieβling, A; Kuhn, T; Mast, M; Niemann, H; Ott, K; and Batliner, A. Speech Communication, 15(1-2):155-167, October.
Prosody takes over: Towards a prosodically guided dialog system [link]Paper  doi  abstract   bibtex   
The domain of the speech recognition and dialog system EVAR is train time table inquiry. We observed that in real human-human dialogs when the officer transmits the information, the customer very often interrupts. Many of these interruptions are just repetitions of the time of day given by the officer. The functional role of these interruptions is often determined by prosodic cues only. An important result of experiments where naive persons used the EVAR system is that it is hard to follow the train connection given via speech synthesis. In this case it is even more important than in human-human dialogs that the user has the opportunity to interact during the answer phase. Therefore we extended the dialog module to allow the user to repeat the time of day and we added a prosody module guiding the continuation of the dialog by analyzing the intonation contour of this utterance.
@article{kompe_prosody_1994,
	Author = {Kompe, Ralf and Nöth, Elmar and Kieβling, A and Kuhn, T and Mast, M and Niemann, H and Ott, K and Batliner, Anton},
	Date = {1994},
	Date-Modified = {2017-04-19 08:04:07 +0000},
	Doi = {10.1016/0167-6393(94)90049-3},
	Issn = {01676393},
	Journal = {Speech Communication},
	Keywords = {dialogue systems, phonetics, prosody, speech technology},
	Month = oct,
	Number = {1-2},
	Pages = {155-167},
	Title = {Prosody takes over: Towards a prosodically guided dialog system},
	Url = {http://linkinghub.elsevier.com/retrieve/pii/0167639394900493},
	Volume = {15},
	Abstract = {The domain of the speech recognition and dialog system EVAR is train time table inquiry. We observed that in real human-human dialogs when the officer transmits the information, the customer very often interrupts. Many of these interruptions are just repetitions of the time of day given by the officer. The functional role of these interruptions is often determined by prosodic cues only. An important result of experiments where naive persons used the EVAR system is that it is hard to follow the train connection given via speech synthesis. In this case it is even more important than in human-human dialogs that the user has the opportunity to interact during the answer phase. Therefore we extended the dialog module to allow the user to repeat the time of day and we added a prosody module guiding the continuation of the dialog by analyzing the intonation contour of this utterance.},
	Bdsk-Url-1 = {http://linkinghub.elsevier.com/retrieve/pii/0167639394900493},
	Bdsk-Url-2 = {http://dx.doi.org/10.1016/0167-6393(94)90049-3}}
Downloads: 0