Contextual object category recognition for RGB-D scene labeling. Ali, H., Shafait, F., Giannakidou, E., Vakali, A., Figueroa, N., Varvadoukas, T., & Mavridis, N. Robotics and Autonomous Systems, 62(2):241 - 256, 2014.
Contextual object category recognition for RGB-D scene labeling [link]Paper  doi  abstract   bibtex   1 download  
Recent advances in computer vision on the one hand, and imaging technologies on the other hand, have opened up a number of interesting possibilities for robust 3D scene labeling. This paper presents contributions in several directions to improve the state-of-the-art in RGB-D scene labeling. First, we present a novel combination of depth and color features to recognize different object categories in isolation. Then, we use a context model that exploits detection results of other objects in the scene to jointly optimize labels of co-occurring objects in the scene. Finally, we investigate the use of social media mining to develop the context model, and provide an investigation of its convergence. We perform thorough experimentation on both the publicly available RGB-D Dataset from the University of Washington as well as on the NYU scene dataset. An analysis of the results shows interesting insights about contextual object category recognition, and its benefits.
@article{ALI2014241,
title = "Contextual object category recognition for RGB-D scene labeling",
journal = "Robotics and Autonomous Systems",
volume = "62",
number = "2",
pages = "241 - 256",
year = "2014",
issn = "0921-8890",
doi = "https://doi.org/10.1016/j.robot.2013.10.001",
url = "http://www.sciencedirect.com/science/article/pii/S0921889013001929",
author = "Haider Ali and Faisal Shafait and Eirini Giannakidou and Athena Vakali and Nadia Figueroa and Theodoros Varvadoukas and Nikolaos Mavridis",
keywords = "Object recognition, Contextual modeling, RGB-D scenes, Social media, 3D scene labeling",
abstract = "Recent advances in computer vision on the one hand, and imaging technologies on the other hand, have opened up a number of interesting possibilities for robust 3D scene labeling. This paper presents contributions in several directions to improve the state-of-the-art in RGB-D scene labeling. First, we present a novel combination of depth and color features to recognize different object categories in isolation. Then, we use a context model that exploits detection results of other objects in the scene to jointly optimize labels of co-occurring objects in the scene. Finally, we investigate the use of social media mining to develop the context model, and provide an investigation of its convergence. We perform thorough experimentation on both the publicly available RGB-D Dataset from the University of Washington as well as on the NYU scene dataset. An analysis of the results shows interesting insights about contextual object category recognition, and its benefits."
}

Downloads: 1