Independent Low-Rank Tensor Analysis for Audio Source Separation. Yoshii, K., Kitamura, K., Bando, Y., Nakamura, E., & Kawahara, T. In 2018 26th European Signal Processing Conference (EUSIPCO), pages 1657-1661, Sep., 2018.
Independent Low-Rank Tensor Analysis for Audio Source Separation [pdf]Paper  doi  abstract   bibtex   
This paper describes a versatile tensor factorization technique called independent low-rank tensor analysis (ILRTA) and its application to single-channel audio source separation. In general, audio source separation has been conducted in the short-time Fourier transform (STFT) domain under an unrealistic but conventional assumption of the independence of time-frequency (TF) bins. Nonnegative matrix factorization (NMF) is a typical technique of single-channel source separation based on the low-rankness of source spectrograms. In a multichannel setting, independent component analysis (ICA) and its multivariate extension called independent vector analysis (IVA) have often been used for blind source separation based on the independence of source spectrograms. Integrating NMF and IVA, independent low-rank matrix analysis (ILRMA) was recently proposed. To deal with the covariance of TF bins, in this paper we propose ILRTA as a new extension of NMF. Both ILRMA and ILRTA aim to find independent and low-rank sources. A key difference is that while ILRMA estimates demixing filters that decorrelate the channels for multichannel source separation, ILRTA finds optimal transforms that decorrelate the time frames and frequency bins of a STFT representation for single-channel source separation in a way that the bin-wise independence assumed by NMF holds true as much as possible. We report evaluation results of ILRTA and discuss extension of ILRTA to multichannel source separation.

Downloads: 0