Learning Face Similarity for Re-identification from Real Surveillance Video: A Deep Metric Solution. Li, P., Flynn, P., Mery, D., & Prieto, M. In International Joint Conference on Biometrics (IJCB2017), 2017.
abstract   bibtex   
Person re-identification (ReID) is the task of automatically matching persons across surveillance cameras with location or time differences. Nearly all proposed ReID approaches exploit body features. Even if successfully captured in the scene, faces are often assumed to be unhelpful to the ReID process. As cameras and surveillance systems improve, `Facial ReID' approaches deserve attention. The following contributions are made in this work: 1) We describe a high-quality dataset for person re-identification featuring faces. This dataset was collected from a real surveillance network in a municipal rapid transit system, and includes the same people appearing in multiple sites at multiple dimes wearing different attire. 2) We employ new DNN architectures and patch matching techniques to handle face misalignment in quality regimes where landmarking fails. We further boost the performance by adopting the fully convolutional structure and spatial pyramid pooling (SPP).
@INPROCEEDINGS{Mery2017:IJCB, 
author={Li, P. and Flynn, P. and Mery, D. and Prieto, M.L.}, 
booktitle={International Joint Conference on Biometrics (IJCB2017)},
title={Learning Face Similarity for Re-identification from Real Surveillance Video: A Deep Metric Solution}, 
year={2017},
abstract = {Person re-identification (ReID) is the task of automatically matching persons across surveillance cameras with location or time differences. Nearly all proposed ReID approaches exploit body features. Even if successfully captured in the scene, faces are often assumed to be unhelpful to the ReID process. As cameras and surveillance systems improve, `Facial ReID' approaches deserve attention. The following contributions are made in this work: 1) We describe a high-quality dataset for person re-identification featuring faces. This dataset was collected from a real surveillance network in a municipal rapid transit system, and includes the same people appearing in multiple sites at multiple dimes wearing different attire. 2) We employ new DNN architectures and patch matching techniques to handle face misalignment in quality regimes where landmarking fails. We further boost the performance by adopting the fully convolutional structure and spatial pyramid pooling (SPP).
}
}

Downloads: 0