Reverberation-robust underdetermined source separation with non-negative tensor double deconvolution. Murata, N., Kameoka, H., Kinoshita, K., Araki, S., Nakatani, T., Koyama, S., & Saruwatari, H. In 2016 24th European Signal Processing Conference (EUSIPCO), pages 1648-1652, Aug, 2016.
Reverberation-robust underdetermined source separation with non-negative tensor double deconvolution [pdf]Paper  doi  abstract   bibtex   
Source separation using an ad hoc microphone array can be useful for enhancing speech in such applications as teleconference systems without the need to prepare special devices. However, the positions of the sources (and the microphones when using an ad hoc microphone array) can change during recording, thus violating the commonly made assumption in many source separation algorithms that the mixing system is time-invariant. This paper proposes an extension of the multichannel nonnegative matrix factorization (NMF) approach to deal with the problem of underdetermined source separation in time-variant reverberant environments. The proposed method models the mixing system as a non-negative convolutive mixture based on the concept of a “semi-time-variant system” to handle the reverberation in a room as well allowing for relatively small changes in the source/microphone positions. It also models the power spectrogram of each sound source using the convolutive NMF model to consider the local dynamics of speech.

Downloads: 0