Improved language recognition using better phonetic decoders and fusion with MFCC and SDC features. Torre, D.; González Domínguez, J.; Abejón, A.; Spada, D.; Mateos García, I.; and González Rodríguez, J. In Interspeech 2007. Proceedings of the 8th Annual Conference of the International Speech Communication Association, pages 194-197. Antwerp, Belgium, August 27-31, 2007.
Improved language recognition using better phonetic decoders and fusion with MFCC and SDC features [link]Paper  abstract   bibtex   
One of the most popular and better performing approaches to language recognition (LR) is Parallel Phonetic Recognition followed by Language Modeling (PPRLM). In this paper we report several improvements in our PPRLM system that allowed us to move from an Equal Error Rate (EER) of over 15% to less than 8% on NIST LR Evaluation 2005 data still using a standard PPRLM system. The most successful improvement was the retraining of the phonetic decoders on larger and more appropriate corpora. We have also developed a new system based on Support Vector Machines (SVMs) that uses as features both Mel Frequency Cepstral Coefficients (MFCCs) and Shifted Delta Cepstra (SDC). This new SVM system alone gives an EER of 10.5% on NIST LRE 2005 data. Fusing our PPRLM system and the new SVM system we achieve an EER of 5.43% on NIST LRE 2005 data, a relative reduction of almost 66% from our baseline system.
@incollection{torre_improved_2007,
	Address = {Antwerp, Belgium, August 27-31, 2007},
	Author = {Torre, Doroteo and González Domínguez, Javier and Abejón, Alejandro and Spada, Danilo and Mateos García, Ismael and González Rodríguez, Joaquín},
	Booktitle = {Interspeech 2007. Proceedings of the 8th Annual Conference of the International Speech Communication Association},
	Date = {2007},
	Date-Modified = {2016-09-24 18:56:16 +0000},
	File = {Attachment:files/11096/Torre et al. - 2007 - Improved language recognition using better phonetic decoders and fusion with MFCC and SDC features.pdf:application/pdf},
	Keywords = {CV citació, language identification, Spanish, speech technology},
	Pages = {194-197},
	Title = {Improved language recognition using better phonetic decoders and fusion with MFCC and SDC features},
	Url = {http://www.isca-speech.org/archive/interspeech_2007/i07_0194.html},
	Abstract = {One of the most popular and better performing approaches to language recognition (LR) is Parallel Phonetic Recognition followed by Language Modeling (PPRLM). In this paper we report several improvements in our PPRLM system that allowed us to move from an Equal Error Rate (EER) of over 15\% to less than 8\% on NIST LR Evaluation 2005 data still using a standard PPRLM system. The most successful improvement was the retraining of the phonetic decoders on larger and more appropriate corpora. We have also developed a new system based on Support Vector Machines (SVMs) that uses as features both Mel Frequency Cepstral Coefficients (MFCCs) and Shifted Delta Cepstra (SDC). This new SVM system alone gives an EER of 10.5\% on NIST LRE 2005 data. Fusing our PPRLM system and the new SVM system we achieve an EER of 5.43\% on NIST LRE 2005 data, a relative reduction of almost 66\% from our baseline system.},
	Bdsk-File-1 = {YnBsaXN0MDDUAQIDBAUGJCVYJHZlcnNpb25YJG9iamVjdHNZJGFyY2hpdmVyVCR0b3ASAAGGoKgHCBMUFRYaIVUkbnVsbNMJCgsMDxJXTlMua2V5c1pOUy5vYmplY3RzViRjbGFzc6INDoACgAOiEBGABIAFgAdccmVsYXRpdmVQYXRoWWFsaWFzRGF0YV8QUS4uLy4uLy4uL0JpYmxpb2dyYWZpYS9QYXBlcnMvVG9ycmUvSW1wcm92ZWQgbGFuZ3VhZ2UgcmVjb2duaXRpb24gdXNpbmcgYmV0dGVyLnBkZtIXCxgZV05TLmRhdGFPEQIsAAAAAAIsAAIAAAxNYWNpbnRvc2ggSEQAAAAAAAAAAAAAAAAAAADL9h/OSCsAABCGdxcfSW1wcm92ZWQgbGFuZ3VhZ2UgIzEwODY3NzE4LnBkZgAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAEIZ3GNQJ0/8AAAAAAAAAAAADAAQAAAkgAAAAAAAAAAAAAAAAAAAABVRvcnJlAAAQAAgAAMv2A64AAAARAAgAANQJt98AAAABABQQhncXEIZljgAF/EcABfuYAADARgACAGNNYWNpbnRvc2ggSEQ6VXNlcnM6AGpvYXF1aW1fbGxpc3RlcnJpOgBCaWJsaW9ncmFmaWE6AFBhcGVyczoAVG9ycmU6AEltcHJvdmVkIGxhbmd1YWdlICMxMDg2NzcxOC5wZGYAAA4AXgAuAEkAbQBwAHIAbwB2AGUAZAAgAGwAYQBuAGcAdQBhAGcAZQAgAHIAZQBjAG8AZwBuAGkAdABpAG8AbgAgAHUAcwBpAG4AZwAgAGIAZQB0AHQAZQByAC4AcABkAGYADwAaAAwATQBhAGMAaQBuAHQAbwBzAGgAIABIAEQAEgBgVXNlcnMvam9hcXVpbV9sbGlzdGVycmkvQmlibGlvZ3JhZmlhL1BhcGVycy9Ub3JyZS9JbXByb3ZlZCBsYW5ndWFnZSByZWNvZ25pdGlvbiB1c2luZyBiZXR0ZXIucGRmABMAAS8AABUAAgAY//8AAIAG0hscHR5aJGNsYXNzbmFtZVgkY2xhc3Nlc11OU011dGFibGVEYXRhox0fIFZOU0RhdGFYTlNPYmplY3TSGxwiI1xOU0RpY3Rpb25hcnmiIiBfEA9OU0tleWVkQXJjaGl2ZXLRJidUcm9vdIABAAgAEQAaACMALQAyADcAQABGAE0AVQBgAGcAagBsAG4AcQBzAHUAdwCEAI4A4gDnAO8DHwMhAyYDMQM6A0gDTANTA1wDYQNuA3EDgwOGA4sAAAAAAAACAQAAAAAAAAAoAAAAAAAAAAAAAAAAAAADjQ==},
	Bdsk-Url-1 = {http://www.isca-speech.org/archive/interspeech_2007/i07_0194.html}}
Downloads: 0