Bayesian on-line spectral change point detection: a soft computing approach for on-line ASR. Chowdhury, M. F. R., Selouani, S., & O’Shaughnessy, D. International Journal of Speech Technology, 15(1):5–23, March, 2012.
Bayesian on-line spectral change point detection: a soft computing approach for on-line ASR [link]Paper  doi  abstract   bibtex   
Current automatic speech recognition (ASR) works in off-line mode and needs prior knowledge of the stationary or quasi-stationary test conditions for expected word recognition accuracy. These requirements limit the application of ASR for real-world applications where test conditions are highly non-stationary and are not known a priori. This paper presents an innovative frame dynamic rapid adaptation and noise compensation technique for tracking highly non-stationary noises and its application for on-line ASR. The proposed algorithm is based on a soft computing model using Bayesian on-line inference for spectral change point detection (BOSCPD) in unknown non-stationary noises. BOSCPD is tested with the MCRA noise tracking technique for on-line rapid environmental change learning in different non-stationary noise scenarios. The test results show that the proposed BOSCPD technique reduces the delay in spectral change point detection significantly compared to the baseline MCRA and its derivatives. The proposed BOSCPD soft computing model is tested for joint additive and channel distortions compensation (JAC)-based on-line ASR in unknown test conditions using non-stationary noisy speech samples from the Aurora 2 speech database. The simulation results for the on-line AR show significant improvement in recognition accuracy compared to the baseline Aurora 2 distributed speech recognition (DSR) in batch-mode.
@article{chowdhury_bayesian_2012,
	title = {Bayesian on-line spectral change point detection: a soft computing approach for on-line {ASR}},
	volume = {15},
	issn = {1572-8110},
	shorttitle = {Bayesian on-line spectral change point detection},
	url = {https://doi.org/10.1007/s10772-011-9116-2},
	doi = {10.1007/s10772-011-9116-2},
	abstract = {Current automatic speech recognition (ASR) works in off-line mode and needs prior knowledge of the stationary or quasi-stationary test conditions for expected word recognition accuracy. These requirements limit the application of ASR for real-world applications where test conditions are highly non-stationary and are not known a priori. This paper presents an innovative frame dynamic rapid adaptation and noise compensation technique for tracking highly non-stationary noises and its application for on-line ASR. The proposed algorithm is based on a soft computing model using Bayesian on-line inference for spectral change point detection (BOSCPD) in unknown non-stationary noises. BOSCPD is tested with the MCRA noise tracking technique for on-line rapid environmental change learning in different non-stationary noise scenarios. The test results show that the proposed BOSCPD technique reduces the delay in spectral change point detection significantly compared to the baseline MCRA and its derivatives. The proposed BOSCPD soft computing model is tested for joint additive and channel distortions compensation (JAC)-based on-line ASR in unknown test conditions using non-stationary noisy speech samples from the Aurora 2 speech database. The simulation results for the on-line AR show significant improvement in recognition accuracy compared to the baseline Aurora 2 distributed speech recognition (DSR) in batch-mode.},
	language = {en},
	number = {1},
	urldate = {2020-10-01},
	journal = {International Journal of Speech Technology},
	author = {Chowdhury, M. F. R. and Selouani, S.-A. and O’Shaughnessy, D.},
	month = mar,
	year = {2012},
	pages = {5--23},
}

Downloads: 0