Masked Non-negative Matrix Factorization for Bird Detection Using Weakly Labeled Data. Sobieraj, I., Kong, Q., & Plumbley, M. D. In 2017 25th European Signal Processing Conference (EUSIPCO), pages 1769-1773, Aug, 2017.
Masked Non-negative Matrix Factorization for Bird Detection Using Weakly Labeled Data [pdf]Paper  doi  abstract   bibtex   
Acoustic monitoring of bird species is an increasingly important field in signal processing. Many available bird sound datasets do not contain exact timestamp of the bird call but have a coarse weak label instead. Traditional Non-negative Matrix Factorization (NMF) models are not well designed to deal with weakly labeled data. In this paper we propose a novel Masked Non-negative Matrix Factorization (Masked NMF) approach for bird detection using weakly labeled data. During dictionary extraction we introduce a binary mask on the activation matrix. In that way we are able to control which parts of dictionary are used to reconstruct the training data. We compare our method with conventional NMF approaches and current state of the art methods. The proposed method outperforms the NMF baseline and offers a parsimonious model for bird detection on weakly labeled data. Moreover, to our knowledge, the proposed Masked NMF achieved the best result among non-deep learning methods on a test dataset used for the recent Bird Audio Detection Challenge.

Downloads: 0