Multi-modal Descriptors for Multi-class Hand Pose Recognition in Human Computer Interaction Systems. Abella, J., Alcaide, R., Sabaté, A., Mas, J., Escalera, S., Gonzàlez, J., & Antens, C. In Proceedings of the 15th ACM on International Conference on Multimodal Interaction, of ICMI '13, pages 503--508, New York, NY, USA, 2013. ACM.
Multi-modal Descriptors for Multi-class Hand Pose Recognition in Human Computer Interaction Systems [link]Paper  doi  abstract   bibtex   
Hand pose recognition in advanced Human Computer Interaction systems (HCI) is becoming more feasible thanks to the use of affordable multi-modal RGB-Depth cameras. Depth data generated by these sensors is a very valuable input information, although the representation of 3D descriptors is still a critical step to obtain robust object representations. This paper presents an overview of different multi-modal descriptors, and provides a comparative study of two feature descriptors called Multi-modal Hand Shape (MHS) and Fourier-based Hand Shape (FHS), which compute local and global 2D-3D hand shape statistics to robustly describe hand poses. A new dataset of 38K hand poses has been created for real-time hand pose and gesture recognition, corresponding to five hand shape categories recorded from eight users. Experimental results show good performance of the fused MHS and FHS descriptors, improving recognition accuracy while assuring real-time computation in HCI scenarios.
@inproceedings{abella_multi-modal_2013,
	address = {New York, NY, USA},
	series = {{ICMI} '13},
	title = {Multi-modal {Descriptors} for {Multi}-class {Hand} {Pose} {Recognition} in {Human} {Computer} {Interaction} {Systems}},
	isbn = {978-1-4503-2129-7},
	url = {http://doi.acm.org/10.1145/2522848.2532596},
	doi = {10.1145/2522848.2532596},
	abstract = {Hand pose recognition in advanced Human Computer Interaction systems (HCI) is becoming more feasible thanks to the use of affordable multi-modal RGB-Depth cameras. Depth data generated by these sensors is a very valuable input information, although the representation of 3D descriptors is still a critical step to obtain robust object representations. This paper presents an overview of different multi-modal descriptors, and provides a comparative study of two feature descriptors called Multi-modal Hand Shape (MHS) and Fourier-based Hand Shape (FHS), which compute local and global 2D-3D hand shape statistics to robustly describe hand poses. A new dataset of 38K hand poses has been created for real-time hand pose and gesture recognition, corresponding to five hand shape categories recorded from eight users. Experimental results show good performance of the fused MHS and FHS descriptors, improving recognition accuracy while assuring real-time computation in HCI scenarios.},
	urldate = {2014-06-05TZ},
	booktitle = {Proceedings of the 15th {ACM} on {International} {Conference} on {Multimodal} {Interaction}},
	publisher = {ACM},
	author = {Abella, Jordi and Alcaide, Raúl and Sabaté, Anna and Mas, Joan and Escalera, Sergio and Gonzàlez, Jordi and Antens, Coen},
	year = {2013},
	pages = {503--508}
}

Downloads: 0