Generative Adversarial Networks and Perceptual Losses for Video Super-Resolution. Lucas, A., Lopez-Tapia, S., Molina, R., & Katsaggelos, A. K. IEEE Transactions on Image Processing, 28(7):3312–3327, July 2019.
@article{Alice2019c,
abstract = {Video super-resolution (VSR) has become one of the most critical problems in video processing. In the deep learning literature, recent works have shown the benefits of using adversarial-based and perceptual losses to improve the performance on various image restoration tasks; however, these have yet to be applied for video super-resolution. In this paper, we propose a generative adversarial network (GAN)-based formulation for VSR. We introduce a new generator network optimized for the VSR problem, named VSRResNet, along with new discriminator architecture to properly guide VSRResNet during the GAN training. We further enhance our VSR GAN formulation with two regularizers, a distance loss in feature-space and pixel-space, to obtain our final VSRResFeatGAN model. We show that pre-training our generator with the mean-squared-error loss only quantitatively surpasses the current state-of-the-art VSR models. Finally, we employ the PercepDist metric to compare the state-of-the-art VSR models. We show that this metric more accurately evaluates the perceptual quality of SR solutions obtained from neural networks, compared with the commonly used PSNR/SSIM metrics. Finally, we show that our proposed model, the VSRResFeatGAN model, outperforms the current state-of-the-art SR models, both quantitatively and qualitatively.},
archivePrefix = {arXiv},
arxivId = {1806.05764},
author = {Lucas, Alice and Lopez-Tapia, Santiago and Molina, Rafael and Katsaggelos, Aggelos K.},
doi = {10.1109/TIP.2019.2895768},
eprint = {1806.05764},
issn = {1057-7149},
journal = {IEEE Transactions on Image Processing},
keywords = {Artificial neural networks,image generation,image resolution,video signal processing},
month = jul,
number = {7},
pages = {3312--3327},
pmid = {30714918},
title = {{Generative Adversarial Networks and Perceptual Losses for Video Super-Resolution}},
url = {https://ieeexplore.ieee.org/document/8629024/},
volume = {28},
year = {2019}
}
