Glottal mixture model (GLOMM) for speaker identification on telephone channels. Baggenstoss, P. M., Wilkinghoff, K., & Kurth, F. In 2017 25th European Signal Processing Conference (EUSIPCO), pages 2734-2738, Aug, 2017.
Glottal mixture model (GLOMM) for speaker identification on telephone channels [pdf]Paper  doi  abstract   bibtex   
The Glottal Mixture Model (GLOMM) extracts speaker-dependent voice source information from speech data. It has previously been shown to provide speaker identification performance on clean speech comparable to universal background model (UBM), a state of the art method based on MFCC. And, when combined with UBM, the error rate was reduced by a factor of three, showing that the voice source information is largely independent of the information contained in the MFCC, yet holds as much speaker-related information. We now describe how GLOMM can be adapted for telephone quality audio and provide significant error reduction when combined with UBM and I-vector approaches. We demonstrate a factor of two error reduction on the NTIMIT data set with respect to the best published results.

Downloads: 0