Spectral Subband Centroids as Complementary Features for Speaker Authentication. Poh, N., Sanderson, C., & Bengio, S. In International Conference on Biometric Authentication, ICBA, Lecture Notes in Computer Science, volume LNCS 3072, pages 631–639, 2004. Springer-Verlag.
Most conventional features used in speaker authentication are based on estimation of spectral envelopes in one way or another, e.g., Mel-scale Filterbank Cepstrum Coefficients (MFCCs), Linear-scale Filterbank Cepstrum Coefficients (LFCCs) and Relative Spectral Perceptual Linear Prediction (RASTA-PLP). In this study, Spectral Subband Centroids (SSCs) are examined. These features are the centroid frequency in each subband. They have properties similar to formant frequencies but are limited to a given subband. Empirical experiments carried out on the NIST2001 database using SSCs, MFCCs, LFCCs and their combinations by concatenation suggest that SSCs are somewhat more robust compared to conventional MFCC and LFCC features as well as being partially complementary.
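The abstract describes each SSC as the centroid frequency within one subband. As a rough illustration only, the sketch below computes a power-weighted mean frequency per subband for a single frame. It is not the authors' implementation: the function name `subband_centroids`, the equal-width rectangular subbands, the Hamming window, and the exponent `gamma` are all assumptions made for this example, since the abstract does not specify the filterbank or any other settings.

```python
import numpy as np

def subband_centroids(frame, sample_rate, n_subbands=4, gamma=1.0):
    """Return one centroid frequency (Hz) per subband for a single frame.

    Hypothetical helper: the subband layout and gamma are illustrative
    assumptions, not values taken from the paper.
    """
    # Power spectrum of the windowed frame and the frequency of each bin.
    windowed = frame * np.hamming(len(frame))
    power = np.abs(np.fft.rfft(windowed)) ** 2
    freqs = np.fft.rfftfreq(len(frame), d=1.0 / sample_rate)

    # Assumption: equal-width, non-overlapping rectangular subbands
    # covering 0 Hz up to the Nyquist frequency.
    edges = np.linspace(0.0, sample_rate / 2.0, n_subbands + 1)
    centroids = np.empty(n_subbands)
    for m in range(n_subbands):
        lo, hi = edges[m], edges[m + 1]
        if m == n_subbands - 1:
            mask = (freqs >= lo) & (freqs <= hi)  # include the Nyquist bin
        else:
            mask = (freqs >= lo) & (freqs < hi)
        weights = power[mask] ** gamma
        total = weights.sum()
        if total > 0:
            # Power-weighted mean frequency inside the subband.
            centroids[m] = (freqs[mask] * weights).sum() / total
        else:
            # No energy in this subband: fall back to its midpoint.
            centroids[m] = 0.5 * (lo + hi)
    return centroids
```

By construction each centroid stays inside its own subband, which matches the abstract's remark that SSCs resemble formant frequencies but are limited to a given subband. Concatenating such a vector with MFCC or LFCC vectors per frame would correspond to the "combinations by concatenation" mentioned above, under the same assumptions.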
@inproceedings{poh:2004:icba,
  author =   {N. Poh and C. Sanderson and S. Bengio},
  title =    {Spectral Subband Centroids as Complementary Features for Speaker Authentication},
  booktitle =  {International Conference on Biometric Authentication, {ICBA}, Lecture Notes in Computer Science},
  volume = {LNCS 3072},
  year =   2004,
  pages = {631--639},
  publisher = {Springer-Verlag},
  url = {publications/ps/poh_2004_icba.ps.gz},
  pdf = {publications/pdf/poh_2004_icba.pdf},
  djvu = {publications/djvu/poh_2004_icba.djvu},
  original = {2004/spectral_icba},
  idiap = {pdf/rr03-62.ps.gz},
  topics = {biometric_authentication},
  web = {http://www.springerlink.com/link.asp?id=6cl951tfug6gbjfb},
  abstract = {Most conventional features used in speaker authentication are based on estimation of spectral envelopes in one way or another, e.g., Mel-scale Filterbank Cepstrum Coefficients (MFCCs), Linear-scale Filterbank Cepstrum Coefficients (LFCCs) and Relative Spectral Perceptual Linear Prediction (RASTA-PLP).  In this study, Spectral Subband Centroids (SSCs) are examined. These features are the centroid frequency in each subband.  They have properties similar to formant frequencies but are limited to a given subband. Empirical experiments carried out on the NIST2001 database using SSCs, MFCCs, LFCCs and their combinations by concatenation suggest that SSCs are somewhat more robust compared to conventional MFCC and LFCC features as well as being partially complementary.},
  categorie = {C},
}