Artificial bandwidth extension of spectral envelope along a Viterbi path. Yagli, C., Turan, M. A. T., & Erzin, E. SPEECH COMMUNICATION, 55(1):111-118, JAN, 2013.
doi  abstract   bibtex   
In this paper, we propose a hidden Markov model (HMM)-based wideband spectral envelope estimation method for the artificial bandwidth extension problem. The proposed HMM-based estimator decodes an optimal Viterbi path based on the temporal contour of the narrowband spectral envelope and then performs the minimum mean square error (MMSE) estimation of the wideband spectral envelope on this path. Experimental evaluations are performed to compare the proposed estimator to the state-of-the-art HMM and Gaussian mixture model based estimators using both objective and subjective evaluations. Objective evaluations are performed with the log-spectral distortion (LSD) and the wideband perceptual evaluation of speech quality (PESQ) metrics. Subjective evaluations are performed with the A/B pair comparison listening test. Both objective and subjective evaluations yield that the proposed wideband spectral envelope estimator consistently improves performances over the state-of-the-art estimators. (C) 2012 Elsevier B.V. All rights reserved.
@article{ ISI:000312422900009,
Author = {Yagli, Can and Turan, M. A. Tugtekin and Erzin, Engin},
Title = {{Artificial bandwidth extension of spectral envelope along a Viterbi path}},
Journal = {{SPEECH COMMUNICATION}},
Year = {{2013}},
Volume = {{55}},
Number = {{1}},
Pages = {{111-118}},
Month = {{JAN}},
Abstract = {{In this paper, we propose a hidden Markov model (HMM)-based wideband
   spectral envelope estimation method for the artificial bandwidth
   extension problem. The proposed HMM-based estimator decodes an optimal
   Viterbi path based on the temporal contour of the narrowband spectral
   envelope and then performs the minimum mean square error (MMSE)
   estimation of the wideband spectral envelope on this path. Experimental
   evaluations are performed to compare the proposed estimator to the
   state-of-the-art HMM and Gaussian mixture model based estimators using
   both objective and subjective evaluations. Objective evaluations are
   performed with the log-spectral distortion (LSD) and the wideband
   perceptual evaluation of speech quality (PESQ) metrics. Subjective
   evaluations are performed with the A/B pair comparison listening test.
   Both objective and subjective evaluations yield that the proposed
   wideband spectral envelope estimator consistently improves performances
   over the state-of-the-art estimators. (C) 2012 Elsevier B.V. All rights
   reserved.}},
DOI = {{10.1016/j.specom.2012.07.003}},
ISSN = {{0167-6393}},
EISSN = {{1872-7182}},
Unique-ID = {{ISI:000312422900009}},
}

Downloads: 0