Pantomime: Mid-Air Gesture Recognition with Sparse Millimeter-Wave Radar Point Clouds. Palipana, S., Salami, D., Leiva, L., & Sigg, S. Proceedings of the ACM on Interactive, Mobile, Wearable and Ubiquitous Technologies (IMWUT), 5(1):1-27, ACM New York, NY, USA, 2021.
We introduce Pantomime, a novel mid-air gesture recognition system exploiting spatio-temporal properties of millimeter-wave radio frequency (RF) signals. Pantomime is positioned in a unique region of the RF landscape: mid-resolution mid-range high-frequency sensing, which makes it ideal for motion gesture interaction. We configure a commercial frequency-modulated continuous-wave radar device to promote spatial information over temporal resolution by means of sparse 3D point clouds, and contribute a deep learning architecture that directly consumes the point cloud, enabling real-time performance with low computational demands. Pantomime achieves 95% accuracy and 99% AUC in a challenging set of 21 gestures articulated by 45 participants in two indoor environments, outperforming four state-of-the-art 3D point cloud recognizers. We also analyze the effect of environment, articulation speed, angle, and distance to the sensor. We conclude that Pantomime is resilient to various input conditions and that it may enable novel applications in industrial, vehicular, and smart home scenarios.
@article{Sameera_2021_IMWUT,
author={Sameera Palipana and Dariush Salami and Luis Leiva and Stephan Sigg},
journal={Proceedings of the ACM on Interactive, Mobile, Wearable and Ubiquitous Technologies (IMWUT)},
title={Pantomime: Mid-Air Gesture Recognition with Sparse Millimeter-Wave Radar Point Clouds},
year={2021},
abstract={We introduce Pantomime, a novel mid-air gesture recognition system exploiting spatio-temporal properties of millimeter-wave radio frequency (RF) signals. Pantomime is positioned in a unique region of the RF landscape: mid-resolution mid-range high-frequency sensing, which makes it ideal for motion gesture interaction. We configure a commercial frequency-modulated continuous-wave radar device to promote spatial information over temporal resolution by means of sparse 3D point clouds, and contribute a deep learning architecture that directly consumes the point cloud, enabling real-time performance with low computational demands. Pantomime achieves 95\% accuracy and 99\% AUC in a challenging set of 21 gestures articulated by 45 participants in two indoor environments, outperforming four state-of-the-art 3D point cloud recognizers. We also analyze the effect of environment, articulation speed, angle, and distance to the sensor. We conclude that Pantomime is resilient to various input conditions and that it may enable novel applications in industrial, vehicular, and smart home scenarios.
},
issue_date = {March 2021},
publisher = {ACM New York, NY, USA},
volume = {5},
number = {1},
pages = {1--27},
group = {ambience},
project = {radiosense,windmill}
}