The analysis of voice quality in speech processing. Keller, E. In Chollet, G.; Esposito, A.; Faúndez Zanuy, M.; and Marinaro, M., editors, Nonlinear speech modeling and applications: advanced lectures and revised selected papers, pages 54-73. Springer, Berlin - Heidelberg.
The analysis of voice quality in speech processing [link]Paper  abstract   bibtex   
Voice quality has been defined as the characteristic auditory colouring of an individual's voice, derived from a variety of laryngeal and supralaryngeal features and running continuously through the individual's speech. The dis- tinctive tone of speech sounds produced by a particular person yields a particular voice. Voice quality is at the centre of several speech processing issues. In speech recognition, voice differences, particularly extreme divergences from the norm, are responsible for known performance degradations. In speech synthesis on the other hand, voice quality is a desirable modelling parameter, with millions of voice types that can be distinguished theoretically. This article reviews the experimental derivation of voice quality markers. Specifically, the use of perceptual judgements, the long-term averaged spectrum (LTAS) and prosodic markers is examined, as well as inverse filtering for the extraction of the glottal source waveform. This review suggests that voice quality is best investigated as a multi-dimensional parameter space involving a combination of factors involving individual prosody, temporally structured speech characteristics, spectral divergence and voice source features, and that it could profitably complement simple linguistic prosodic model processing in speech synthesis.
@incollection{keller_analysis_2005,
	Address = {Berlin - Heidelberg},
	Author = {Keller, Eric},
	Booktitle = {Nonlinear speech modeling and applications: advanced lectures and revised selected papers},
	Date = {2005},
	Date-Modified = {2016-09-24 18:56:06 +0000},
	Editor = {Chollet, Gérard and Esposito, Anna and Faúndez Zanuy, Marcos and Marinaro, María},
	File = {Attachment:files/5864/Keller - 2005 - The analysis of voice quality in speech processing.pdf:application/pdf},
	Keywords = {acoustic phonetics, LTAS, methods, phonation, phonetics, prosody, speech production, voice quality},
	Pages = {54-73},
	Publisher = {Springer},
	Title = {The analysis of voice quality in speech processing},
	Url = {http://citeseerx.ist.psu.edu/viewdoc/download?doi=10.1.1.91.9949&rep=rep1&type=pdf},
	Abstract = {Voice quality has been defined as the characteristic auditory colouring of an individual's voice, derived from a variety of laryngeal and supralaryngeal features and running continuously through the individual's speech. The dis- tinctive tone of speech sounds produced by a particular person yields a particular voice. Voice quality is at the centre of several speech processing issues. In speech recognition, voice differences, particularly extreme divergences from the norm, are responsible for known performance degradations. In speech synthesis on the other hand, voice quality is a desirable modelling parameter, with millions of voice types that can be distinguished theoretically. This article reviews the experimental derivation of voice quality markers. Specifically, the use of perceptual judgements, the long-term averaged spectrum (LTAS) and prosodic markers is examined, as well as inverse filtering for the extraction of the glottal source waveform. This review suggests that voice quality is best investigated as a multi-dimensional parameter space involving a combination of factors involving individual prosody, temporally structured speech characteristics, spectral divergence and voice source features, and that it could profitably complement simple linguistic prosodic model processing in speech synthesis.},
	Bdsk-File-1 = {YnBsaXN0MDDUAQIDBAUGJCVYJHZlcnNpb25YJG9iamVjdHNZJGFyY2hpdmVyVCR0b3ASAAGGoKgHCBMUFRYaIVUkbnVsbNMJCgsMDxJXTlMua2V5c1pOUy5vYmplY3RzViRjbGFzc6INDoACgAOiEBGABIAFgAdccmVsYXRpdmVQYXRoWWFsaWFzRGF0YV8QWi4uLy4uLy4uL0JpYmxpb2dyYWZpYS9QYXBlcnMvS2VsbGVyL1RoZSBhbmFseXNpcyBvZiB2b2ljZSBxdWFsaXR5IGluIHNwZWVjaCBwcm9jZXNzaW5nLnBkZtIXCxgZV05TLmRhdGFPEQJGAAAAAAJGAAIAAAxNYWNpbnRvc2ggSEQAAAAAAAAAAAAAAAAAAADL9h/OSCsAABCGbssfVGhlIGFuYWx5c2lzIG9mIHZvIzEwODY2RUNELnBkZgAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAEIZuzdQJ03gAAAAAAAAAAAADAAQAAAkgAAAAAAAAAAAAAAAAAAAABktlbGxlcgAQAAgAAMv2A64AAAARAAgAANQJt1gAAAABABQQhm7LEIZljgAF/EcABfuYAADARgACAGRNYWNpbnRvc2ggSEQ6VXNlcnM6AGpvYXF1aW1fbGxpc3RlcnJpOgBCaWJsaW9ncmFmaWE6AFBhcGVyczoAS2VsbGVyOgBUaGUgYW5hbHlzaXMgb2Ygdm8jMTA4NjZFQ0QucGRmAA4AbgA2AFQAaABlACAAYQBuAGEAbAB5AHMAaQBzACAAbwBmACAAdgBvAGkAYwBlACAAcQB1AGEAbABpAHQAeQAgAGkAbgAgAHMAcABlAGUAYwBoACAAcAByAG8AYwBlAHMAcwBpAG4AZwAuAHAAZABmAA8AGgAMAE0AYQBjAGkAbgB0AG8AcwBoACAASABEABIAaVVzZXJzL2pvYXF1aW1fbGxpc3RlcnJpL0JpYmxpb2dyYWZpYS9QYXBlcnMvS2VsbGVyL1RoZSBhbmFseXNpcyBvZiB2b2ljZSBxdWFsaXR5IGluIHNwZWVjaCBwcm9jZXNzaW5nLnBkZgAAEwABLwAAFQACABj//wAAgAbSGxwdHlokY2xhc3NuYW1lWCRjbGFzc2VzXU5TTXV0YWJsZURhdGGjHR8gVk5TRGF0YVhOU09iamVjdNIbHCIjXE5TRGljdGlvbmFyeaIiIF8QD05TS2V5ZWRBcmNoaXZlctEmJ1Ryb290gAEACAARABoAIwAtADIANwBAAEYATQBVAGAAZwBqAGwAbgBxAHMAdQB3AIQAjgDrAPAA+ANCA0QDSQNUA10DawNvA3YDfwOEA5EDlAOmA6kDrgAAAAAAAAIBAAAAAAAAACgAAAAAAAAAAAAAAAAAAAOw},
	Bdsk-Url-1 = {http://citeseerx.ist.psu.edu/viewdoc/download?doi=10.1.1.91.9949&rep=rep1&type=pdf}}
Downloads: 0