Review of text-to-speech conversion for English. Klatt, D. H In Atal, B S; Miller, L J; and Kent, R D, editors, Papers in speech communication: Speech processing, pages 57-114. The Acoustical Society of America, New York.
Review of text-to-speech conversion for English [link]Paper  abstract   bibtex   
The automatic conversion of English text to synthetic speech is presently being performed, remarkably well, by a number of laboratory systems and commercial devices. Progress in this area has been made possible by advances in linguistic theory, acoustic-phonetic characterization of English sound patterns, perceptual psychology, mathematical modeling of speech production, structured programming, and computer hardware design. This review traces the early work on the development of speech synthesizers, discovery of minimal acoustic cues for phonetic contrasts, evolution of phonemic rule programs, incorporation of prosodic rules, and formulation of techniques for text analysis. Examples of rules are used liberally to illustrate the state of the art. Many of the examples are taken from Klattalk, a text-to-speech system developed by the author. A number of scientific problems are identified that prevent current systems from achieving the goal of completely human-sounding speech. While the emphasis is on rule programs that drive a formant synthesizer, alternatives such as articulatory synthesis and waveform concatenation are also reviewed. An extensive bibliography has been assembled to show both the breadth of synthesis activity and the wealth of phenomena covered by rules in the best of these programs. A recording of selected examples of the historical development of synthetic speech, enclosed as a 33 1/3-rpm record, is described in the Appendix.
@incollection{klatt_review_1991,
	Address = {New York},
	Author = {Klatt, Dennis H},
	Booktitle = {Papers in speech communication: Speech processing},
	Date = {1991},
	Date-Modified = {2016-09-24 18:56:07 +0000},
	Editor = {Atal, B S and Miller, L J and Kent, R D},
	Keywords = {speech synthesis, speech technology},
	Number = {3},
	Pages = {57-114},
	Publisher = {The Acoustical Society of America},
	Title = {Review of text-to-speech conversion for English},
	Url = {http://www.cs.indiana.edu/rhythmsp/ASA/Contents.html},
	Abstract = {The automatic conversion of English text to synthetic speech is presently being performed, remarkably well, by a number of laboratory systems and commercial devices. Progress in this area has been made possible by advances in linguistic theory, acoustic-phonetic characterization of English sound patterns, perceptual psychology, mathematical modeling of speech production, structured programming, and computer hardware design. This review traces the early work on the development of speech synthesizers, discovery of minimal acoustic cues for phonetic contrasts, evolution of phonemic rule programs, incorporation of prosodic rules, and formulation of techniques for text analysis. Examples of rules are used liberally to illustrate the state of the art. Many of the examples are taken from Klattalk, a text-to-speech system developed by the author. A number of scientific problems are identified that prevent current systems from achieving the goal of completely human-sounding speech. While the emphasis is on rule programs that drive a formant synthesizer, alternatives such as articulatory synthesis and waveform concatenation are also reviewed. An extensive bibliography has been assembled to show both the breadth of synthesis activity and the wealth of phenomena covered by rules in the best of these programs. A recording of selected examples of the historical development of synthetic speech, enclosed as a 33 1/3-rpm record, is described in the Appendix.},
	Bdsk-Url-1 = {http://www.cs.indiana.edu/rhythmsp/ASA/Contents.html}}
Downloads: 0