Temporal-based acoustic-phonetic patterns in read speech: some evidence for speaker sex differences. Whiteside, S. P Journal of the International Phonetic Association, 26(1):23–40, 1996.
doi  abstract   bibtex   
The perception of speaker sex depends on the listener's integration of a complex range of factors. These may relate, for example, to the style of delivery, the use of particular language, pronunciation (Trudgill, 1983; Smith, 1979), the use of particular intonation patterns (McConnell-Ginet, 1983) and the perceived pitch of the speaker (Aronovitch, 1976, Elyan, 1978; Lass et al., 1976). Some acoustic-phonetic investigations have explored through instrumental analysis how speaker sex differences are perceived. These have shown that acoustic phonetic differences exist between the read speech of men and women speakers. It has been demonstrated that fundamental frequency differences exist between men and women, with men having on average, lower fundamental frequencies (Aronovitch, 1976; Coleman, 1973a). This can be explained in part by their larger larynges. However it is also acknowledged that it is not a low overall average fundamental frequency alone that contributes to the perception of an adult male voice. Some evidence shows for example that use of a wider pitch range will contribute to the perception of femininity, even where the overall pitch is low (Terrango, 1966). In addition women have been found to have on average higher formant frequencies (Coleman, 1976; Henton, 1986; Peterson & Barney, 1952; Childers & Wu, 1991; Wu & Childers, 1991) as a result of the smaller vocal tract. Women have different glottal source characteristics (Karlsson, 1989) which are reflected in the filter characteristics of the speech signal (Klatt & Klatt, 1990). There is also some evidence to suggest that other speaker sex differences exist in the temporal domain. Byrd (1992) found differences between men and women speakers in speaking rate in read speech in American English in the TIMIT database. Byrd states that under the recording conditions used for the TIMIT database, women spoke appreciably more slowly than the men and that men tended to reduce vowels to schwa ([xs0259]) more often than the women. Byrd also found that female speakers in the TIMIT database released stops in sentence-final position more frequently and produced more glottal stops than male speakers. All these findings were statistically significant.
@article{whiteside_temporal-based_1996,
	Author = {Whiteside, Sandra P},
	Date = {1996},
	Date-Modified = {2018-05-14 09:39:52 +0000},
	Doi = {10.1017/S0025100300005302},
	Journal = {Journal of the International Phonetic Association},
	Keywords = {descriptive, duration, English, gender, interspeaker variation, phonetics, prosody, read speech, speaking styles, temporal factors},
	Number = {1},
	Pages = {23--40},
	Title = {Temporal-based acoustic-phonetic patterns in read speech: some evidence for speaker sex differences},
	Volume = {26},
	Year = {1996},
	Abstract = {The perception of speaker sex depends on the listener's integration of a complex range of factors. These may relate, for example, to the style of delivery, the use of particular language, pronunciation (Trudgill, 1983; Smith, 1979), the use of particular intonation patterns (McConnell-Ginet, 1983) and the perceived pitch of the speaker (Aronovitch, 1976, Elyan, 1978; Lass et al., 1976). Some acoustic-phonetic investigations have explored through instrumental analysis how speaker sex differences are perceived. These have shown that acoustic phonetic differences exist between the read speech of men and women speakers. It has been demonstrated that fundamental frequency differences exist between men and women, with men having on average, lower fundamental frequencies (Aronovitch, 1976; Coleman, 1973a). This can be explained in part by their larger larynges. However it is also acknowledged that it is not a low overall average fundamental frequency alone that contributes to the perception of an adult male voice. Some evidence shows for example that use of a wider pitch range will contribute to the perception of femininity, even where the overall pitch is low (Terrango, 1966). In addition women have been found to have on average higher formant frequencies (Coleman, 1976; Henton, 1986; Peterson \& Barney, 1952; Childers \& Wu, 1991; Wu \& Childers, 1991) as a result of the smaller vocal tract. Women have different glottal source characteristics (Karlsson, 1989) which are reflected in the filter characteristics of the speech signal (Klatt \& Klatt, 1990). There is also some evidence to suggest that other speaker sex differences exist in the temporal domain. Byrd (1992) found differences between men and women speakers in speaking rate in read speech in American English in the TIMIT database. Byrd states that under the recording conditions used for the TIMIT database, women spoke appreciably more slowly than the men and that men tended to reduce vowels to schwa ([xs0259]) more often than the women. Byrd also found that female speakers in the TIMIT database released stops in sentence-final position more frequently and produced more glottal stops than male speakers. All these findings were statistically significant.},
	Bdsk-File-1 = {YnBsaXN0MDDUAQIDBAUGJCVYJHZlcnNpb25YJG9iamVjdHNZJGFyY2hpdmVyVCR0b3ASAAGGoKgHCBMUFRYaIVUkbnVsbNMJCgsMDxJXTlMua2V5c1pOUy5vYmplY3RzViRjbGFzc6INDoACgAOiEBGABIAFgAdccmVsYXRpdmVQYXRoWWFsaWFzRGF0YV8QYy4uLy4uLy4uL0JpYmxpb2dyYWZpYS9QYXBlcnMvV2hpdGVzaWRlL1RlbXBvcmFsLWJhc2VkIGFjb3VzdGljLXBob25ldGljIHBhdHRlcm5zIGluIHJlYWQgc3BlZWNoLnBkZtIXCxgZV05TLmRhdGFPEQJiAAAAAAJiAAIAAAxNYWNpbnRvc2ggSEQAAAAAAAAAAAAAAAAAAADL9h/OSCsAABCGd/QfVGVtcG9yYWwtYmFzZWQgYWNvIzEwODY3N0Y2LnBkZgAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAEIZ39tQJ1BIAAAAAAAAAAAADAAQAAAkgAAAAAAAAAAAAAAAAAAAACVdoaXRlc2lkZQAAEAAIAADL9gOuAAAAEQAIAADUCbfyAAAAAQAUEIZ39BCGZY4ABfxHAAX7mAAAwEYAAgBnTWFjaW50b3NoIEhEOlVzZXJzOgBqb2FxdWltX2xsaXN0ZXJyaToAQmlibGlvZ3JhZmlhOgBQYXBlcnM6AFdoaXRlc2lkZToAVGVtcG9yYWwtYmFzZWQgYWNvIzEwODY3N0Y2LnBkZgAADgB6ADwAVABlAG0AcABvAHIAYQBsAC0AYgBhAHMAZQBkACAAYQBjAG8AdQBzAHQAaQBjAC0AcABoAG8AbgBlAHQAaQBjACAAcABhAHQAdABlAHIAbgBzACAAaQBuACAAcgBlAGEAZAAgAHMAcABlAGUAYwBoAC4AcABkAGYADwAaAAwATQBhAGMAaQBuAHQAbwBzAGgAIABIAEQAEgByVXNlcnMvam9hcXVpbV9sbGlzdGVycmkvQmlibGlvZ3JhZmlhL1BhcGVycy9XaGl0ZXNpZGUvVGVtcG9yYWwtYmFzZWQgYWNvdXN0aWMtcGhvbmV0aWMgcGF0dGVybnMgaW4gcmVhZCBzcGVlY2gucGRmABMAAS8AABUAAgAY//8AAIAG0hscHR5aJGNsYXNzbmFtZVgkY2xhc3Nlc11OU011dGFibGVEYXRhox0fIFZOU0RhdGFYTlNPYmplY3TSGxwiI1xOU0RpY3Rpb25hcnmiIiBfEA9OU0tleWVkQXJjaGl2ZXLRJidUcm9vdIABAAgAEQAaACMALQAyADcAQABGAE0AVQBgAGcAagBsAG4AcQBzAHUAdwCEAI4A9AD5AQEDZwNpA24DeQOCA5ADlAObA6QDqQO2A7kDywPOA9MAAAAAAAACAQAAAAAAAAAoAAAAAAAAAAAAAAAAAAAD1Q==},
	Bdsk-Url-1 = {http://dx.doi.org/10.1017/S0025100300005302}}

Downloads: 0