Audio/video supervised independent vector analysis through multimodal pilot dependent components. Nesta, F., Mosayyebpour, S., Koldovský, Z., & Paleček, K. In 2017 25th European Signal Processing Conference (EUSIPCO), pages 1150-1164, Aug, 2017.
Independent Vector Analysis (IVA) is a powerful tool for estimating the broadband acoustic transfer function between multiple sources and the microphones in the frequency domain. In this work, we consider an extended IVA model which adopts the concept of pilot dependent signals. Without imposing any constraint on the de-mixing system, pilot signals depending on the target source are injected into the model, enforcing the permutation of outputs to be consistent over time. A neural network trained on acoustic data and a lip motion detector are jointly used to produce a multimodal pilot signal dependent on the target source. Experimental results show that this structure allows the enhancement of a predefined target source in very difficult and ambiguous scenarios.
