Gesture Recognition with Residual LSTM Attention Using Millimeter-Wave Radar †
Abstract
:1. Introduction
- In the signal preprocessing stage, this paper adopts a simple and straightforward background subtraction method to filter out noise, including static clutter, multipath reflections, and DC components after Fourier transformation. This approach is intentionally uncomplicated, ensuring the preservation of gesture signals without adding processing complexity. Furthermore, range–Doppler maps (RDM) are created based on gesture velocity and range data, serving as clear and efficient inputs to the network model.
- In terms of deep learning architecture, this paper combines the strengths of multiple network types to maximize performance. To fully utilize network capabilities while avoiding issues like gradient vanishing, a Residual Long Short-Term Memory (ResNet-LSTM) network is employed for processing input features. Additionally, a residual attention module is integrated into the network that directs attention to critical variations in gesture features, significantly boosting gesture recognition accuracy. This module sequentially generates attention maps across both channel and spatial dimensions, then multiplies these maps with the network’s input feature maps to create new feature maps. These enhanced maps are further processed for feature extraction, resulting in a notable performance gain.
2. Related Work
3. The Proposed RLA Algorithm
3.1. Gesture Feature Extraction and Clutter Removal
3.1.1. Acquisition of IF Signals
3.1.2. Measurement of Gesture Range and Velocity
3.1.3. Clutter Removal and Heatmap Optimization
3.2. Network Model Based on Residual Attention Mechanism
3.2.1. Construction of Residual Networks
3.2.2. Introduction of Attention Mechanisms
3.2.3. Network Architecture
4. Experiment and Evaluation
4.1. Experimental Setup and Parameters Setting
4.2. Gesture Dataset
4.2.1. Self-Constructed Datasets
- 1.
- Finger Tap, a click in the air with the index finger to determine the opening of the software on the selected multimedia device.
- 2.
- Finger Around, the index finger makes a circling motion in space to adjust the volume level of a multimedia device.
- 3.
- Palm Left, five fingers together and move them from right to left to swipe the interface left or select software on a multimedia device.
- 4.
- Palm Right, five fingers together and move from left to right to swipe right on a multimedia device or select software.
- 5.
- Palm Forward, five fingers together and pushing the palm in a direction away from the driver’s body, used to wake up the multimedia device.
- 6.
- Palm Back, five fingers together and push the palm in the direction close to the driver’s body, used to lock the multimedia device to prevent accidental touch from occurring.
- 7.
- Palm Up, five fingers together to slide the palm upwards to exit the software currently open on your multimedia device and return to the software selection screen.
4.2.2. Soli Gesture Dataset
- 1.
- Gesture Design: The gestures primarily involve fine movements of the fingers and wrist joints, driven by small muscle groups. This design minimizes user fatigue during prolonged gesture interactions, making the dataset suitable for practical applications.
- 2.
- Dynamic Gesture: Most gestures in the dataset are dynamic in order to align with the signal characteristics of millimeter-wave radar. The dataset includes 11 distinct gestures, recorded by 10 participants. The original range–Doppler maps were captured at a frequency of 40 Hz. Preprocessing steps include normalization and background removal using a per-pixel Gaussian model. Each participant performed each gesture 25 times, resulting in a total of 11 × 25 × 10 = 2750 gesture sequences.
4.2.3. Experimental Results and Comparison
- (1)
- Performance evaluation on self-constructed gesture dataset
- (2)
- Experimental results on the soil public dataset
5. Conclusions
Author Contributions
Funding
Institutional Review Board Statement
Informed Consent Statement
Data Availability Statement
Acknowledgments
Conflicts of Interest
References
- Patra, A.; Geuer, P.; Munari, A.; Mähönen, P. Mm-Wave Radar Based Gesture Recognition: Development and Evaluation of a Low-Power, Low-Complexity System. In Proceedings of the 2nd ACM Workshop on Millimeter Wave Networks and Sensing Systems, New York, NY, USA, 29 October 2018; pp. 51–56. [Google Scholar] [CrossRef]
- Wang, X.; Li, X. The study of MovingTarget tracking based on Kalman-CamShift in the video. In Proceedings of the 2nd International Conference on Information Science and Engineering, Hangzhou, China, 4–6 December 2010; pp. 1–4. [Google Scholar] [CrossRef]
- Sun, S.; Yang, K.Y.; Wang, C.M.; Juan, T.K.; Chen, W.T.; Liao, C.Y.; He, Q.; Xiao, S.; Kung, W.T.; Guo, G.Y.; et al. High-Efficiency Broadband Anomalous Reflection by Gradient Meta-Surfaces. Nano Lett. 2012, 12, 6223–6229. [Google Scholar] [CrossRef] [PubMed]
- Xu, M.; Chen, X.; Ruan, Y.; Zhang, X. Cross-user electromyography pattern recognition based on a novel spatial-temporal graph convolutional network. IEEE Trans. Neural Syst. Rehabil. Eng. 2023, 32, 72–82. [Google Scholar] [CrossRef] [PubMed]
- Kern, N.; Paulus, L.; Grebner, T.; Janoudi, V.; Waldschmidt, C. Radar-based gesture recognition under ego-motion for automotive applications. IEEE Trans. Radar Syst. 2023, 1, 542–552. [Google Scholar] [CrossRef]
- Sluÿters, A.; Lambot, S.; Vanderdonckt, J.; Vatavu, R.D. RadarSense: Accurate recognition of mid-air hand gestures with radar sensing and few training examples. ACM Trans. Interact. Intell. Syst. 2023, 13, 1–45. [Google Scholar] [CrossRef]
- Schuessler, C.; Zhang, W.; Bräunig, J.; Hoffmann, M.; Stelzig, M.; Vossiek, M. Radar-Based Recognition of Static Hand Gestures in American Sign Language. In Proceedings of the 2024 IEEE Radar Conference (RadarConf24), Denver, CO, USA, 6–10 May 2024; IEEE: Piscataway, NJ, USA, 2024; pp. 1–6. [Google Scholar]
- Lien, J.; Gillian, N.; Karagozler, M.E.; Amihood, P.; Schwesig, C.; Olson, E.; Raja, H.; Poupyrev, I. Soli: Ubiquitous Gesture Sensing with Millimeter Wave Radar. ACM Trans. Graph. 2016, 35, 1–19. [Google Scholar] [CrossRef]
- Smith, K.A.; Csech, C.; Murdoch, D.; Shaker, G. Gesture Recognition Using mm-Wave Sensor for Human-Car Interface. IEEE Sens. Lett. 2018, 2, 1–4. [Google Scholar] [CrossRef]
- Yan, B.; Wang, P.; Du, L.; Chen, X.; Fang, Z.; Wu, Y. mmGesture: Semi-supervised gesture recognition system using mmWave radar. Expert Syst. Appl. 2023, 213, 119042. [Google Scholar] [CrossRef]
- Ehsanfar, S.; Bazzi, A.; Mößner, K.; Chafii, M. Hypothesis Testing on FMCW and OFDM for Joint Communication and Radar in IEEE 802.11bd. In Proceedings of the 2023 IEEE International Conference on Communications Workshops (ICC Workshops), Rome, Italy, 28 May–1 June 2023; pp. 464–469. [Google Scholar] [CrossRef]
- Reddy, V.A.; Bazzi, A.; Stüber, G.L.; Al-Dharrab, S.; Mesbah, W.; Muqaibel, A.H. Fundamental Performance Limits of mm-Wave Cooperative Localization in Linear Topologies. IEEE Wirel. Commun. Lett. 2020, 9, 1899–1903. [Google Scholar] [CrossRef]
- Antes, T.; De Oliveira, L.G.; Diewald, A.; Bekker, E.; Bhutani, A.; Zwick, T. Velocities in Human Hand Gestures for Radar-based Gesture Recognition Applications. In Proceedings of the 2023 IEEE Radar Conference (RadarConf23), San Antonio, TX, USA, 1–5 May 2023; pp. 1–5. [Google Scholar] [CrossRef]
- Molchanov, P.; Gupta, S.; Kim, K.; Pulli, K. Multi-sensor system for driver’s hand-gesture recognition. In Proceedings of the 2015 11th IEEE International Conference and Workshops on Automatic Face and Gesture Recognition (FG), Ljubljana, Slovenia, 4–8 May 2015; Volume 1, pp. 1–8. [Google Scholar] [CrossRef]
- Mao, Y.; Zhao, L.; Liu, C.; Ling, M. A Low-Complexity Hand Gesture Recognition Framework via Dual mmWave FMCW Radar System. Sensors 2023, 23, 8551. [Google Scholar] [CrossRef] [PubMed]
- Si-Jung, R.; Jun-Seuk, S.; Seung-Hwan, B.; Songcheol, H.; Jong-Hwan, K. Feature-Based Hand Gesture Recognition Using an FMCW Radar and its Temporal Feature Analysis. IEEE Sens. J. 2018, 18, 7593–7602. [Google Scholar]
- Gan, L.; Liu, Y.; Li, Y.; Zhang, R.; Huang, L.; Shi, C. Gesture Recognition System Using 24 GHz FMCW Radar Sensor Realized on Real-Time Edge Computing Platform. IEEE Sens. J. 2022, 22, 8904–8914. [Google Scholar] [CrossRef]
- Wang, S.; Li, Z.; Huang, R.; Wang, R.; Li, J.; Xu, Z. Hand gesture recognition scheme based on millimeter-wave radar with convolutional neural network. In Proceedings of the IET International Radar Conference (IET IRC 2020), Chongqing, China, 4–6 November 2020; Volume 2020, pp. 488–492. [Google Scholar] [CrossRef]
- Cardillo, E.; Li, C.; Caddemi, A. Radar-Based Monitoring of the Worker Activities by Exploiting Range-Doppler and Micro-Doppler Signatures. In Proceedings of the 2021 IEEE International Workshop on Metrology for Industry 4.0 & IoT (MetroInd4.0&IoT), Rome, Italy, 7–9 June 2021; pp. 412–416. [Google Scholar] [CrossRef]
- Zhang, S.; Li, G.; Ritchie, M.; Fioranelli, F.; Griffiths, H. Dynamic hand gesture classification based on radar micro-Doppler signatures. In Proceedings of the 2016 CIE International Conference on Radar (RADAR), Guangzhou, China, 10–13 October 2016; pp. 1–4. [Google Scholar] [CrossRef]
- Bao, Z.; Liu, T. Radar Micro Moving Gesture Recognition Method Based on Multi-Scale Fusion Deep Network. In Proceedings of the 2022 5th International Conference on Artificial Intelligence and Pattern Recognition, New York, NY, USA, 23–25 September 2023; AIPR ’22. pp. 657–663. [Google Scholar] [CrossRef]
- He, K.; Zhang, X.; Ren, S.; Sun, J. Deep Residual Learning for Image Recognition. In Proceedings of the 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR), Las Vegas, NV, USA, 27–30 June 2016; pp. 770–778. [Google Scholar] [CrossRef]
- Mehedi, S.T.; Anwar, A.; Rahman, Z.; Ahmed, K.; Islam, R. Dependable Intrusion Detection System for IoT: A Deep Transfer Learning Based Approach. IEEE Trans. Ind. Inform. 2023, 19, 1006–1017. [Google Scholar] [CrossRef]
- Cao, Y.; Xu, J.; Lin, S.; Wei, F.; Hu, H. GCNet: Non-local Networks Meet Squeeze-Excitation Networks and Beyond. arXiv 2019, arXiv:1904.11492. [Google Scholar] [CrossRef]
- Vaswani, A.; Shazeer, N.; Parmar, N.; Uszkoreit, J.; Jones, L.; Gomez, A.N.; Kaiser, L.; Polosukhin, I. Attention Is All You Need. arXiv 2017, arXiv:1706.03762. [Google Scholar] [CrossRef]
- Wang, S.; Song, J.; Lien, J.; Poupyrev, I.; Hilliges, O. Interacting with Soli: Exploring Fine-Grained Dynamic Gesture Recognition in the Radio-Frequency Spectrum. In Proceedings of the 29th Annual Symposium on User Interface Software and Technology, Tokyo, Japan, 16–19 October 2016; UIST’16. pp. 851–860. [Google Scholar]
- Yao, D.; Wang, Y.; Nie, W.; Xie, L.; Zhou, M.; Yang, X. A Multi-feature Fusion Temporal Neural Network for Multi-hand Gesture Recognition using Millimeter-wave Radar Sensor. In Proceedings of the 2021 IEEE Asia-Pacific Microwave Conference (APMC), Brisbane, Australia, 28 November–1 December 2021; pp. 226–228. [Google Scholar] [CrossRef]
- Chen, S.; Bai, W.; Han, C.; Ma, J.; Wang, Y. Gesture Recognition with Residual Attention Using Commodity Millimeter Wave Radar. In Proceedings of the 21st IEEE International Conference on Ubiquitous Intelligence and Computing (IEEE UIC 2024), Denaran Island, Fiji, 2–7 December 2024; IEEE: Piscataway, NJ, USA, 2024; pp. 1105–1111. [Google Scholar]
Parameters | Value | Parameters | Value |
---|---|---|---|
Sweep range | 60∼64 GHz | Bandwidth | 4 GHz |
Range resolution | 3.75 cm | Angular resolution | 29° |
Sampling point | 256 | Sampling rate | 1 Msps |
Frame rate | 64 | Frame rate | 25 frames/s |
TX (Transmit) | 3 | RX (Receive) | 4 |
Model | Training Acc (%) | Test Acc (%) |
---|---|---|
ResNet18 | 88.32 | 87.12 |
ResNet34 | 88.47 | 87.23 |
ResNet50 | 89.74 | 89.11 |
ResNet101 | 87.93 | 88.47 |
Recognition Acc. (%) | ||||||||
---|---|---|---|---|---|---|---|---|
1 | 2 | 3 | 4 | 5 | 6 | 7 | Avg. | |
RD-T [18] | 90 | 92.5 | 92.5 | 95 | 95 | 92.5 | 95 | 93.2 |
RDMF [17] | 90 | 92.5 | 95 | 92.5 | 100 | 97.5 | 92.5 | 94.3 |
MF-FTN [27] | 95 | 97.5 | 92.5 | 92.5 | 95 | 97.5 | 95 | 95 |
RLA (Ours) | 92.5 | 92.5 | 95 | 97.5 | 100 | 100 | 95 | 96.1 |
Model | Recall Rate | Accuracy | F1 Value |
---|---|---|---|
RD-T [18] | 92.80% | 94.40% | 93.60% |
RDMF [17] | 93.80% | 94.30% | 94.10% |
MF-FTN [27] | 93.50% | 95.30% | 94.40% |
RLA (Ours) | 96.00% | 96.10% | 96.10% |
Recognition Acc. (%) | ||||||||
---|---|---|---|---|---|---|---|---|
1 | 2 | 3 | 4 | 5 | 6 | 7 | Avg. | |
Resnet50 only | 87.5 | 85 | 82.5 | 85 | 95.5 | 92.5 | 85 | 87.6 |
Resnet50 + LSTM | 92.5 | 90 | 92.5 | 92.5 | 97.5 | 100 | 92.5 | 93.9 |
RLA (Ours) | 92.5 | 92.5 | 95 | 97.5 | 100 | 100 | 95 | 96.1 |
Recognition Acc. (%) | ||||||||||||
---|---|---|---|---|---|---|---|---|---|---|---|---|
(a) | (b) | (c) | (d) | (e) | (f) | (g) | (h) | (i) | (j) | (k) | (avg) | |
ETE [26] | 67.7 | 71.1 | 77.8 | 94.5 | 84.8 | 98.5 | 98.6 | 88.9 | 94.9 | 89.6 | 92.6 | 87.2 |
RLA (Ours) | 86.8 | 87.9 | 93.4 | 97.6 | 98.2 | 96.9 | 100 | 100 | 96.8 | 94.1 | 95.6 | 95.2 |
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content. |
© 2025 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).
Share and Cite
Bai, W.; Chen, S.; Ma, J.; Wang, Y.; Han, C. Gesture Recognition with Residual LSTM Attention Using Millimeter-Wave Radar. Sensors 2025, 25, 469. https://doi.org/10.3390/s25020469
Bai W, Chen S, Ma J, Wang Y, Han C. Gesture Recognition with Residual LSTM Attention Using Millimeter-Wave Radar. Sensors. 2025; 25(2):469. https://doi.org/10.3390/s25020469
Chicago/Turabian StyleBai, Weiqing, Siyu Chen, Jialiang Ma, Ying Wang, and Chong Han. 2025. "Gesture Recognition with Residual LSTM Attention Using Millimeter-Wave Radar" Sensors 25, no. 2: 469. https://doi.org/10.3390/s25020469
APA StyleBai, W., Chen, S., Ma, J., Wang, Y., & Han, C. (2025). Gesture Recognition with Residual LSTM Attention Using Millimeter-Wave Radar. Sensors, 25(2), 469. https://doi.org/10.3390/s25020469