Hand Gesture Recognition Using an IR-UWB Radar with an Inception Module-Based Classifier
Abstract
:1. Introduction
- We present a novel implementation of the deep-learning algorithm for recognizing hand gestures using IR-UWB radars. To the best of our knowledge, for hand gesture recognition, deep-learning algorithms (based on 3D-CNN architectures), such as inception modules and GoogLeNet, have never been implemented with IR-UWB radars;
- We present an intuitive scheme to demonstrate hand motions as three-dimensional intensity images for IR-UWB radars;
- Finally, the integrated framework is tested for a diverse set of gesture vocabulary with two radars. Validation is performed for several samples of each individual gesture. The formulated methodology can further be extended to build any radar signal classifier, regardless of the nature of the application.
2. Methodology
2.1. Signal Preprocessing
2.2. Conversion of the Radar Signal into a 3D Image
Algorithm 1 Transformation of the Radar Signal into a 3D Image |
|
2.3. Feature Extraction and Classification Using an Inception Module-Based CNN
2.3.1. CNN Architecture
- Input layer: Represents the raw input (pixels) in the form of a 2D or 3D matrix;
- Convolutional layer: The main objective function of a convolutional layer is to generate a feature map by convolving the input layer with a kernel of a 2D filter with the size ‘’. This kernel is moved throughout the image to generate the output of the convolutional layer. The process is further demonstrated in Figure 7, where the input pattern is convolved with a 3 × 3 kernel and the resulting output is fed to the forthcoming layer in the architecture;
- Batch normalization layer: Deployed after the convolutional layer to further accelerate the training process;
- Rectified linear unit and max pooling layer: These layers perform the operation of the activation function and linear down sampling, respectively. Max pooling layers pull the regional maxima from the input, which further reduces the complexity of the network.
2.3.2. Motivation to Use Inception Modules
2.3.3. Proposed Classifier for the IR-UWB Radar
3. Experimental Setup
3.1. Hardware and Software Setup
3.2. Gesture Vocabulary
4. Experimental Results
4.1. Single Dimensional Clutter Removal Results
4.2. 2D Clutter Removal Results
4.3. Image Pattern Analysis of Recorded Hand Gestures
4.4. Analysis of Variations in Patterns of the Same Gestures
4.5. Classification Accuracy
4.6. Comparison with Existing Techniques
5. Discussion and Conclusions
Author Contributions
Funding
Acknowledgments
Conflicts of Interest
References
- Yeo, H.S.; Lee, B.G.; Lim, H. Hand tracking and gesture recognition system for human-computer interaction using low-cost hardware. Multimed. Tools Appl. 2015, 74, 2687–2715. [Google Scholar] [CrossRef]
- Wachs, J.P.; Stern, H.I.; Edan, Y.; Gillam, M.; Handler, J.; Feied, C.; Smith, M. A gesture-based tool for sterile browsing of radiology images. J. Am. Med. Inform. Assoc. 2008, 15, 321–323. [Google Scholar] [CrossRef] [PubMed]
- Kumar, P.; Gauba, H.; Roy, P.P.; Dogra, D.P. Coupled HMM-based multi-sensor data fusion for sign language recognition. Pattern Recognit. Lett. 2017, 86, 1–8. [Google Scholar] [CrossRef]
- Kumar, P.; Jaiswal, A.; Deepak, B.; Reddy, G.R.M. Hand Gesture-Based Stable PowerPoint Presentation Using Kinect. In Progress in Intelligent Computing Techniques: Theory, Practice, and Applications; Springer: Singapore, 2018; pp. 81–94. [Google Scholar]
- Li, K.; Jin, Y.; Akram, M.W.; Han, R.; Chen, J. Facial expression recognition with convolutional neural networks via a new face cropping and rotation strategy. Vis. Comput. 2019, 1–14. [Google Scholar] [CrossRef]
- Schiff, J.; Meingast, M.; Mulligan, D.K.; Sastry, S.; Goldberg, K. Respectful cameras: Detecting visual markers in real-time to address privacy concerns. In Protecting Privacy in Video Surveillance; Springer: London, UK, 2009; pp. 65–89. [Google Scholar]
- Gifford, R.H.; Noble, J.H.; Camarata, S.M.; Sunderhaus, L.W.; Dwyer, R.T.; Dawant, B.M.; Labadie, R.F. The relationship between spectral modulation detection and speech recognition: Adult versus pediatric cochlear implant recipients. Trends Hear. 2018, 22, 1176. [Google Scholar] [CrossRef] [PubMed] [Green Version]
- Singh, G.; Nelson, A.; Robucci, R.; Patel, C.; Banerjee, N. Inviz: Low-power personalized gesture recognition using wearable textile capacitive sensor arrays. In Proceedings of the 2015 IEEE International Conference on Pervasive Computing and Communications (PerCom), St. Louis, MO, USA, 23–27 March 2015; pp. 198–206. [Google Scholar]
- Rautaray, S.S.; Agrawal, A. Vision based hand gesture recognition for human computer interaction: A survey. Artif. Intell. Rev. 2015, 43, 1–54. [Google Scholar] [CrossRef]
- Yarovoy, A.G.; Ligthart, L.P.; Matuzas, J.; Levitas, B. UWB radar for human being detection. IEEE Aerosp. Electron. Syst. Mag. 2006, 21, 10–14. [Google Scholar] [CrossRef] [Green Version]
- Kim, Y.; Toomajian, B. Hand gesture recognition using micro-Doppler signatures with convolutional neural network. IEEE Access 2016, 4, 7125–7130. [Google Scholar] [CrossRef]
- Wang, Y.; Wang, S.; Zhou, M.; Jiang, Q.; Tian, Z. TS-I3D based Hand Gesture Recognition Method with Radar Sensor. IEEE Access 2019, 7, 22902–22913. [Google Scholar] [CrossRef]
- Skaria, S.; Al-Hourani, A.; Lech, M.; Evans, R.J. Hand-Gesture Recognition Using Two-Antenna Doppler Radar with Deep Convolutional Neural Networks. IEEE Sens. J. 2019, 19, 3041–3048. [Google Scholar] [CrossRef]
- Choi, J.W.; Quan, X.; Cho, S.H. Bi-directional passing people counting system based on IR-UWB radar sensors. IEEE Internet Things J. 2017, 5, 512–522. [Google Scholar] [CrossRef]
- Lee, Y.; Choi, J.W.; Cho, S.H. Vital sign quality assessment based on IR-UWB radar sensor. In Proceedings of the 2017 International Conference on Information and Communication Technology Convergence (ICTC), Jeju, Korea, 18–20 October 2017; pp. 896–900. [Google Scholar]
- Khan, F.; Leem, S.; Cho, S. Hand-based gesture recognition for vehicular applications using IR-UWB radar. Sensors 2017, 17, 833. [Google Scholar] [CrossRef] [PubMed]
- Ren, N.; Quan, X.; Cho, S.H. Algorithm for gesture recognition using an IR-UWB radar sensor. J. Comput. Commun. 2016, 4, 95. [Google Scholar] [CrossRef] [Green Version]
- Khan, F.; Cho, S.H. Hand based Gesture Recognition inside a car through IR-UWB Radar. In Proceedings of the 2017 International Conference on Electronics, Information, and Communication, At Phuket, Thailand, 19–20 May 2017. [Google Scholar]
- Ahmed, S.; Khan, F.; Ghaffar, A.; Hussain, F.; Cho, S.H. Finger-Counting-Based Gesture Recognition within Cars Using Impulse Radar with Convolutional Neural Network. Sensors 2019, 19, 1429. [Google Scholar] [CrossRef] [PubMed] [Green Version]
- Krizhevsky, A.; Sutskever, I.; Hinton, G.E. Imagenet classification with deep convolutional neural networks. Adv. Neural Inf. Process. Syst. 2012, 60, 1097–1105. [Google Scholar] [CrossRef]
- Serre, T.; Wolf, L.; Bileschi, S.; Riesenhuber, M.; Poggio, T. Robust object recognition with cortex-like mechanisms. IEEE Trans. Pattern Anal. Mach. Intell. 2007, 29, 411–426. [Google Scholar] [CrossRef] [PubMed]
- Girshick, R.; Donahue, J.; Darrell, T.; Malik, J. Rich feature hierarchies for accurate object detection and semantic segmentation. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, Columbus, OH, USA, 23–28 June 2014; pp. 580–587. [Google Scholar]
- Noori, F.M.; Wallace, B.; Uddin, M.Z.; Torresen, J. A Robust Human Activity Recognition Approach Using OpenPose, Motion Features, and Deep Recurrent Neural Network. In Scandinavian Conference on Image Analysis; Springer: Cham, Switzerland, 2019; pp. 299–310. [Google Scholar]
- Szegedy, C.; Liu, W.; Jia, Y.; Sermanet, P.; Reed, S.; Anguelov, D.; Rabinovich, A. Going deeper with convolutions. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, Boston, MA, USA, 7–12 June 2015; pp. 1–9. [Google Scholar]
- Bai, J.; Jiang, H.; Li, S.; Ma, X. NHL Pathological Image Classification Based on Hierarchical Local Information and GoogLeNet-Based Representations. BioMed Res. Int. 2019, 2019, 1–13. [Google Scholar] [CrossRef] [PubMed] [Green Version]
- Fang, T. A Novel Computer-Aided Lung Cancer Detection Method Based on Transfer Learning from GoogLeNet and Median Intensity Projections. In Proceedings of the 2018 IEEE International Conference on Computer and Communication Engineering Technology (CCET), Beijing, China, 18–20 August 2018; pp. 286–290. [Google Scholar]
- Zhong, Z.; Jin, L.; Xie, Z. High performance offline handwritten chinese character recognition using googlenet and directional feature maps. In Proceedings of the 2015 13th International Conference on Document Analysis and Recognition (ICDAR), Tunis, Tunisia, 23–26 August 2015; pp. 846–850. [Google Scholar]
- Khan, R.U.; Zhang, X.; Kumar, R. Analysis of ResNet and GoogleNet models for malware detection. J. Comput. Virol. Hacking Tech. 2019, 15, 29–37. [Google Scholar] [CrossRef]
- Chernyak, V.S. Fundamentals of Multisite Radar Systems: Multistatic Radars and Multistatic Radar Systems; Routledge: London, UK, 2018. [Google Scholar]
Technical Parameter | Specification |
---|---|
Accuracy | ~1 mm |
Center frequency | 8.748 GHz |
Frame rate (slow-time sampling rate) | 20 frames/s |
Bandwidth (–10 dB) | 2 GHz |
Pulse repetition frequency | 40.5 MHz |
Antenna beam width | 65° |
Number of antennas | 2 pairs of transmitters and receivers |
Parameter | Value |
---|---|
Clutter removal filter coefficient (alpha) | 0.9 |
Single radar data size (height × width) | 112 × 224 pixels |
Size of image with two radars (height × width) | 224 × 224 pixels |
Learning rate for GoogLeNet | 0.001 |
Optimizer | Stochastic gradient descent. |
Learning iterations | 950 |
Original Gesture Class | |||||||||
---|---|---|---|---|---|---|---|---|---|
LR Swipe | RL Swipe | UD Swipe | DU Swipe | Diag-LR-UD Swipe | Diag-LR-DU SWIPE | Cw Rotation | CCW Rotation | ||
Predicted gestures class | LR-swipe | 100 | 0 | 0 | 0 | 5 | 0 | 0 | 0 |
RL-swipe | 0 | 95 | 0 | 0 | 0 | 0 | 0 | 0 | |
UD-swipe | 0 | 0 | 95 | 0 | 0 | 0 | 0 | 0 | |
DU-swipe | 0 | 0 | 0 | 100 | 0 | 0 | 0 | 0 | |
Diag-LR-UD-swipe | 0 | 0 | 5 | 0 | 90 | 10 | 0 | 0 | |
Diag-LR-DU-swipe | 0 | 0 | 0 | 0 | 5 | 90 | 0 | 0 | |
CW-rotation | 0 | 5 | 0 | 0 | 0 | 0 | 95 | 5 | |
CCW rotation | 0 | 0 | 0 | 0 | 0 | 0 | 5 | 90 |
© 2020 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (http://creativecommons.org/licenses/by/4.0/).
Share and Cite
Ahmed, S.; Cho, S.H. Hand Gesture Recognition Using an IR-UWB Radar with an Inception Module-Based Classifier. Sensors 2020, 20, 564. https://doi.org/10.3390/s20020564
Ahmed S, Cho SH. Hand Gesture Recognition Using an IR-UWB Radar with an Inception Module-Based Classifier. Sensors. 2020; 20(2):564. https://doi.org/10.3390/s20020564
Chicago/Turabian StyleAhmed, Shahzad, and Sung Ho Cho. 2020. "Hand Gesture Recognition Using an IR-UWB Radar with an Inception Module-Based Classifier" Sensors 20, no. 2: 564. https://doi.org/10.3390/s20020564