DeepBinaryMask: Learning a binary mask for video compressive sensing. Iliadis, M., Spinoulas, L., & Katsaggelos, A. K. Digital Signal Processing, 96:102591, jan, 2020.
DeepBinaryMask: Learning a binary mask for video compressive sensing [link]Paper  doi  abstract   bibtex   
In this paper, we propose an encoder-decoder neural network model referred to as DeepBinaryMask for video compressive sensing. In video compressive sensing one frame is acquired using a set of coded masks (sensing matrix) from which a number of video frames, equal to the number of coded masks, is reconstructed. The proposed framework is an end-to-end model where the sensing matrix is trained along with the video reconstruction. The encoder maps a video block to compressive measurements by learning the binary elements of the sensing matrix. The decoder is trained to map the measurements from a video patch back to a video block via several hidden layers of a Multi-Layer Perceptron network. The predicted video blocks are stacked together to recover the unknown video sequence. The reconstruction performance is found to improve when using the trained sensing mask from the network as compared to other mask designs such as random, across a wide variety of compressive sensing reconstruction algorithms. Finally, our analysis and discussion offers insights into understanding the characteristics of the trained mask designs that lead to the improved reconstruction quality.
@article{iliadis2020deepbinarymask,
abstract = {In this paper, we propose an encoder-decoder neural network model referred to as DeepBinaryMask for video compressive sensing. In video compressive sensing one frame is acquired using a set of coded masks (sensing matrix) from which a number of video frames, equal to the number of coded masks, is reconstructed. The proposed framework is an end-to-end model where the sensing matrix is trained along with the video reconstruction. The encoder maps a video block to compressive measurements by learning the binary elements of the sensing matrix. The decoder is trained to map the measurements from a video patch back to a video block via several hidden layers of a Multi-Layer Perceptron network. The predicted video blocks are stacked together to recover the unknown video sequence. The reconstruction performance is found to improve when using the trained sensing mask from the network as compared to other mask designs such as random, across a wide variety of compressive sensing reconstruction algorithms. Finally, our analysis and discussion offers insights into understanding the characteristics of the trained mask designs that lead to the improved reconstruction quality.},
archivePrefix = {arXiv},
arxivId = {1607.03343},
author = {Iliadis, Michael and Spinoulas, Leonidas and Katsaggelos, Aggelos K.},
doi = {10.1016/j.dsp.2019.102591},
eprint = {1607.03343},
issn = {10512004},
journal = {Digital Signal Processing},
keywords = {Binary mask,Compressive sensing,Deep learning,Mask optimization,Video reconstruction},
month = {jan},
pages = {102591},
title = {{DeepBinaryMask: Learning a binary mask for video compressive sensing}},
url = {https://linkinghub.elsevier.com/retrieve/pii/S1051200419301459},
volume = {96},
year = {2020}
}

Downloads: 0