Micro Gesture Recognition with Multi-Dimensional Feature Fusion and CQ-MobileNetV3 Using FMCW Radar
Abstract
1. Introduction
- (1)
- A feature extraction algorithm that integrates multi-dimensional features is proposed. First, the range–time map (RTM), velocity–time map (VTM), and angle–time map (ATM) are constructed from the raw data. Then, the three maps are refined using normalization and adaptive filtering. Finally, the refined RTM, VTM, and ATM are fused to construct the range–velocity–angle–time map (RVATM). RVATM can fully describe the motion information of micro gestures, with low feature dimensionality and less information redundancy.
- (2)
- A lightweight micro gesture recognition network CQ-MobileNetV3 is proposed based on the MobileNetV3 network. First, several convolution layers and bottleneck blocks are deleted, and the expansion size of bottleneck blocks is reduced, which can reduce computational and storage requirements. Then, an improved convolutional block attention module (CBAM) is constructed, in which the convolution operations of some modules are simplified, and the simpler activation function is adopted to reduce computational complexity. Meanwhile, an improved self-attention (SA) module is designed, in which grouped query technology is employed to reduce computational burden. Finally, the two improved attention modules are integrated into different bottleneck blocks to enhance feature extraction capabilities with little addition in computational complexity. The combination of three improvements enables the network to remain lightweight while maintaining high recognition accuracy.
- (3)
- An experimental platform is designed, and a micro gesture dataset is constructed to evaluate the proposed method. A dataset containing 14 micro gestures in three scenarios is constructed using TI’s IWR1443 FMCW radar sensor. The experimental results verify the robustness and superiority of the proposed method.
2. Principle of FMCW Radar
2.1. System Architecture and Signal Model
2.2. Range Measurement
2.3. Velocity Measurement
2.4. Angle Measurement
3. Multi-Dimensional Feature Fusion
3.1. RTM and VTM Extraction
3.2. ATM Extraction
3.3. Feature Preprocessing
3.4. Feature Fusion
4. CQ-MobileNetV3 Network
4.1. MobileNetV3 Network
4.2. Improved CBAM
4.3. Improved SA Module
4.4. Optimization of Network Structure
4.5. Overall Network Structure
5. Experiments and Analysis
5.1. Experimental Setup and Parameter Configuration
5.2. Dataset
5.2.1. Data Collection
5.2.2. Feature Extraction and Preprocessing Results
5.2.3. Feature Fusion Results
5.3. Recognition Results and Analysis
5.3.1. Evaluation Metrics
5.3.2. Network Training
5.3.3. Ablation Experiments
5.3.4. Comparison with Other Networks
6. Discussion
7. Conclusions
Author Contributions
Funding
Institutional Review Board Statement
Informed Consent Statement
Data Availability Statement
Acknowledgments
Conflicts of Interest
References
- Rastgoo, R.; Kiani, K.; Escalera, S. Sign language recognition: A deep survey. Expert Syst. Appl. 2021, 164, 113794. [Google Scholar] [CrossRef]
- Li, A.; Bodanese, E.; Poslad, S.; Hou, T.; Wu, K.; Luo, F. A Trajectory-Based Gesture Recognition in Smart Homes Based on the Ultrawideband Communication System. IEEE Internet Things J. 2022, 9, 22861–22873. [Google Scholar] [CrossRef]
- Zeng, M. Gesture Recognition Technology of VR Piano Playing Teaching Game based on Hidden Markov Model. Int. Arab J. Inf. Technol. 2024, 21, 760–772. [Google Scholar] [CrossRef]
- Zheng, L.; Bai, J.; Zhu, X.; Huang, L.; Shan, C.; Wu, Q.; Zhang, L. Dynamic Hand Gesture Recognition in In-Vehicle Environment Based on FMCW Radar and Transformer. Sensors 2021, 21, 6368. [Google Scholar] [CrossRef]
- Mahmoud, N.M.; Fouad, H.; Soliman, A.M. Smart healthcare solutions using the internet of medical things for hand gesture recognition system. Complex Intell. Syst. 2021, 7, 1253–1264. [Google Scholar] [CrossRef]
- Wang, H.; Zhang, M.; Zhang, L.; Zhu, X.; Cao, Q. Real-Time Hand Gesture Recognition in Clinical Settings: A Low-Power FMCW Radar Integrated Sensor System with Multiple Feature Fusion. Sensors 2025, 25, 4169. [Google Scholar] [CrossRef]
- Li, B.; Yang, J.; Yang, Y.; Li, C.; Zhang, Y. Sign Language/Gesture Recognition Based on Cumulative Distribution Density Features Using UWB Radar. IEEE Trans. Instrum. Meas. 2021, 70, 2511113. [Google Scholar] [CrossRef]
- Zhang, Z.; Cai, J.; Dai, X.; Xiao, H. Multi-Scale Attention Fusion Gesture-Recognition Algorithm Based on Strain Sensors. Sensors 2025, 25, 4200. [Google Scholar] [CrossRef]
- Sharma, S.; Singh, S. Vision-based hand gesture recognition using deep learning for the interpretation of sign language. Expert Syst. Appl. 2021, 182, 115657. [Google Scholar] [CrossRef]
- Leon, D.G.; Gröli, J.; Yeduri, S.R.; Mosqueron, R.; Pandey, O.J.; Cenkeramaddi, L.R. Video Hand Gestures Recognition Using Depth Camera and Lightweight CNN. IEEE Sens. J. 2022, 22, 14610–14619. [Google Scholar] [CrossRef]
- Kong, F.; Deng, J.; Fan, Z. Gesture recognition system based on ultrasonic FMCW and ConvLSTM model. Measurement 2022, 190, 110743. [Google Scholar] [CrossRef]
- Ding, X.; Yu, X.; Zhong, Y.; Xie, W.; Cai, B.; You, M.; Jiang, T. Robust gesture recognition method toward intelligent environment using Wi-Fi signals. Measurement 2024, 231, 114525. [Google Scholar] [CrossRef]
- Ahmed, S.; Kallu, K.D.; Ahmed, S.; Cho, S.H. Hand Gestures Recognition Using Radar Sensors For Human-Computer-Interaction: A review. Remote Sens. 2021, 13, 527. [Google Scholar] [CrossRef]
- Mao, Y.; Zhao, L.; Liu, C.; Ling, M. A Low-Complexity Hand Gesture Recognition Framework via Dual mmWave FMCW Radar System. Sensors 2023, 23, 8551. [Google Scholar] [CrossRef]
- Li, Q.; Liu, L.; Hao, S.; Wan, G. Dynamic Gesture Recognition Method Based on Millimeter-Wave Radar. In Proceedings of the 2022 5th International Conference on Pattern Recognition and Artificial Intelligence (PRAI), Chengdu, China, 19–21 August 2022; pp. 63–67. [Google Scholar]
- Zhang, S.; Li, G.; Ritchie, M.; Fioranelli, F.; Griffiths, H. Dynamic Hand Gesture Classification Based on Radar Micro-Doppler Signatures. In Proceedings of the 2016 CIE International Conference on Radar (RADAR), Guangzhou, China, 10–13 October 2016; pp. 1–4. [Google Scholar]
- Rashid, N.E.A.; Nor, Y.A.I.M.; Sharif, K.K.M.; Khan, Z.I.; Zakaria, N.A. Hand Gesture Recognition using Continuous Wave (CW) Radar based on Hybrid PCA-KNN. In Proceedings of the 2021 IEEE Symposium on Wireless Technology & Applications (ISWTA), Shah Alam, Malaysia, 17 August 2021; pp. 88–92. [Google Scholar]
- Wang, S.; Song, J.; Lien, J.; Poupyrev, I.; Hilliges, O. Interacting with Soli: Exploring Fine-Grained Dynamic Gesture Recognition in the Radio-Frequency Spectrum. In Proceedings of the 29th Radio-frequency Spectrum Symposium on User Interface Software and Technology, Tokyo, Japan, 16–19 October 2016; pp. 851–860. [Google Scholar]
- Wang, X.; Min, R.; Cui, Z.; Cao, Z. Micro Gesture Recognition with Terahertz Radar Based on Diagonal Profile of Range-Doppler Map. In Proceedings of the 2020 IEEE International Geoscience and Remote Sensing Symposium, Waikoloa, HI, USA, 26 September–2 October 2020; pp. 770–773. [Google Scholar]
- Ali, A.; Parida, P.; Va, V.; Ni, S.; Nguyen, K.N.; Ng, B.L.; Zhang, J.C. End-to-End Dynamic Gesture Recognition Using MmWave Radar. IEEE Access 2022, 10, 88692–88706. [Google Scholar] [CrossRef]
- Wang, Y.; Wang, D.; Fu, Y.; Yao, D.; Xie, L.; Zhou, M. Multi-Hand Gesture Recognition Using Automotive FMCW Radar Sensor. Remote Sens. 2022, 14, 2374. [Google Scholar] [CrossRef]
- Yu, J.T.; Tseng, Y.H.; Tseng, P.H. A mmWave MIMO Radar-Based Gesture Recognition Using Fusion of Range, Velocity, and Angular Information. IEEE Sens. J. 2024, 24, 9124–9134. [Google Scholar] [CrossRef]
- Wu, Y.; Wang, X.; Guo, S.; Zhang, B.; Cui, G. A Lightweight Network With Multifeature Fusion for mmWave Radar-Based Hand Gesture Recognition. IEEE Sens. J. 2024, 24, 19553–19561. [Google Scholar] [CrossRef]
- Yang, Z.; Zhuang, L.; Chu, P.; Zhou, J. A Low-Complexity Air-Digit-Writing Recognition Method Based on Adaptive Trajectory Learning Using MIMO Radar. IEEE Sens. J. 2024, 24, 4992–5003. [Google Scholar] [CrossRef]
- Song, Y.; Wu, L.; Zhao, Y.; Liu, P.; Lv, R.; Ullah, H. High-Accuracy Gesture Recognition using Mm-Wave Radar Based on Convolutional Block Attention Module. In Proceedings of the 2023 IEEE International Conference on Image Processing (ICIP), Kuala Lumpur, Malaysia, 8–11 October 2023; pp. 1485–1489. [Google Scholar]
- Li, Y.; Li, B.; Zhang, A.; Xue, P. Research on Gesture Recognition Based on Millimeter Wave Radar. In Proceedings of the 2024 10th International Conference on Computer and Communications (ICCC), Chengdu, China, 13–16 December 2024; pp. 157–162. [Google Scholar]
- Li, W.; Jiang, J.; Liu, D.; Gao, Y.; Li, Q. Digital Gesture Recognition Based on Millimeter Wave Radar. In Proceedings of the 2021 IEEE International Conference on Signal Processing, Communications and Computing (ICSPCC), Xi’an, China, 17–19 August; 2021; pp. 1–6. [Google Scholar]
- Jin, B.; Wu, H.; Zhang, Z.; Lian, Z.; Zhang, X.; Du, G. SRDST: Effective Dynamic Gesture Recognition With Sparse Representation and Dual-Stream Transformers in mmWave Radar. IEEE Trans. Ind. Inform. 2025, 21, 604–612. [Google Scholar] [CrossRef]
- Lien, J.; Gillian, N.; Karagozler, M.E.; Amihood, P.; Schwesig, C.; Olson, E.; Raja, H.; Poupyrev, I. Soli: Ubiquitous Gesture Sensing with Millimeter Wave Radar. ACM Trans. Graph. 2016, 35, 1–9. [Google Scholar] [CrossRef]
- Tang, G.; Wu, T.; Li, C. Dynamic Gesture Recognition Based on FMCW Millimeter Wave Radar: Review of Methodologies and Results. Sensors 2023, 23, 7478. [Google Scholar] [CrossRef]
- Ascione, M.; Buonanno, A.; Urso, M.D.; Angrisani, L.; Lo Moriello, R.S. A New Measurement Method Based on Music Algorithm for Through-the-Wall Detection of Life Signs. IEEE Trans. Instrum. Meas. 2013, 62, 13–26. [Google Scholar] [CrossRef]
- Howard, A.; Sandler, M.; Chu, G.; Chen, L.; Chen, B.; Tan, M.; Wang, W.; Zhu, Y.; Pang, R.; Vasudevan, V.; et al. Searching for MobileNetV3. In Proceedings of the 2019 IEEE/CVF International Conference on Computer Vision (ICCV), Seoul, Republic of Korea, 27 October–2 November 2019; pp. 1314–1324. [Google Scholar]
- Huang, X.; Yang, R.; Wang, Q.; Yu, F.; He, B. A novel method for real-time ATR system of AUV based on Attention-MobileNetV3 network and pixel correction algorithm. Ocean Eng. 2023, 270, 113403. [Google Scholar] [CrossRef]
- Woo, S.; Park, J.; Lee, J.; Kweon, I.S. CBAM: Convolutional Block Attention Module. In Proceedings of the 2018 European Conference on Computer Vision (ECCV), Munich, Germany, 8–14 September 2018; pp. 3–19. [Google Scholar]
- Dubey, S.R.; Singh, S.K.; Chaudhuri, B.B. Activation functions in deep learning: A comprehensive survey and benchmark. Neurocomputing 2022, 503, 92–108. [Google Scholar] [CrossRef]
- Vaswani, A.; Shazeer, N.M.; Parmar, N.; Uszkoreit, J.; Jones, L.; Gomez, A.N.; Kaiser, L.; Polosukhin, I. Attention is all you need. arXiv 2017, arXiv:1706.03762. [Google Scholar]
- Zhang, Y.; Tang, H.; Wu, Y.; Wang, B.; Yang, D. FMCW Radar Human Action Recognition Based on Asymmetric Convolutional Residual Blocks. Sensors 2024, 24, 4570. [Google Scholar] [CrossRef]
- Liu, Z.; Hao, Z.; Han, K.; Tang, Y.; Wang, Y. GhostNetV3: Exploring the Training Strategies for Compact Models. arXiv 2024, arXiv:2404.11202. [Google Scholar]
- Qin, D.; Leichner, C.; Delakis, M.; Fornoni, M.; Luo, S.; Yang, F.; Wang, W.; Banbury, C.; Ye, C.; Akin, B.; et al. MobileNetV4: Universal Models for the Mobile Ecosystem. arXiv 2024, arXiv:2404.10518. [Google Scholar]
- Gu, M.; Chen, Z.; Chen, K.; Pan, H. IR-ST: A Lightweight Transformer Network for Human Fall Detection Based on FMCW Radar. IEEE Sens. J. 2023, 23, 25128–25135. [Google Scholar] [CrossRef]


























| Module | Expansion Size (Original/Optimized) | Output Size |
|---|---|---|
| Bottleneck_SE | 16/16 | 64 × 64 × 16 |
| Bottleneck | 72/16 | 32 × 32 × 16 |
| Bottleneck_SE | 88/72 | 16 × 16 × 24 |
| Bottleneck_SE | 144/96 | 8 × 8 × 48 |
| Bottleneck_SE | 288/144 | 8 × 8 × 48 |
| Bottleneck_SE | 576/96 | 4 × 4 × 96 |
| Bottleneck_SE | 576/192 | 4 × 4 × 96 |
| Conv2d, 1 × 1 | 4 × 4 × 576 | |
| Pool | 1 × 1 × 576 | |
| Conv2d, 1 × 1 | 1 × 1 × 14 |
| Parameter | Value |
|---|---|
| Number of transmitting antennas | 2 |
| Number of receiving antennas | 4 |
| Starting frequency | 77 GHz |
| Modulation slope | 100 MHz/us |
| Chirp period | 380 us |
| Chirp duration | 40 us |
| Number of chirps per frame | 128 |
| Frame period | 50 ms |
| Sampling rate | 2 MHz |
| Samples per chirp | 64 |
| Number of frames | 50 |
| Network | Structure Optimization | Improved CBAM | Improved SA | Accuracy (%) | Params (K) | FLOPs (M) | Size (KB) | FPS |
|---|---|---|---|---|---|---|---|---|
| MobileNetV3 | 96.75 | 932.96 | 42.99 | 3786 | 270 | |||
| A | √ | 94.04 | 192.49 | 24.36 | 814 | 331 | ||
| B | √ | √ | 96.32 | 194.63 | 25.31 | 823 | 320 | |
| C | √ | √ | 96.13 | 206.64 | 26.59 | 864 | 315 | |
| Proposed | √ | √ | √ | 97.16 | 207.24 | 26.81 | 895 | 309 |
| Network | Accuracy (%) | Params (M) | FLOPs (G) | Size (MB) | FPS |
|---|---|---|---|---|---|
| DenseNet + CBAM | 96.98 | 7.012 | 1.924 | 28.025 | 182 |
| ResNet50 | 97.44 | 23.537 | 2.698 | 89.785 | 199 |
| Xception | 97.54 | 20.836 | 2.982 | 78.482 | 281 |
| ResNet18 + CBAM | 97.50 | 11.346 | 1.193 | 44.496 | 124 |
| GhostNetV3 | 97.20 | 8.129 | 0.300 | 31.010 | 158 |
| MobileNetV4 | 95.59 | 2.511 | 0.124 | 9.579 | 381 |
| Swin-Transformer-Small | 91.70 | 49.800 | 17.089 | 191.399 | 35 |
| CQ-MobileNetV3 | 97.16 | 0.207 | 0.027 | 0.895 | 309 |
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content. |
© 2025 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).
Share and Cite
Xue, W.; Wang, R.; Wei, J.; Liu, L. Micro Gesture Recognition with Multi-Dimensional Feature Fusion and CQ-MobileNetV3 Using FMCW Radar. Sensors 2025, 25, 6949. https://doi.org/10.3390/s25226949
Xue W, Wang R, Wei J, Liu L. Micro Gesture Recognition with Multi-Dimensional Feature Fusion and CQ-MobileNetV3 Using FMCW Radar. Sensors. 2025; 25(22):6949. https://doi.org/10.3390/s25226949
Chicago/Turabian StyleXue, Wei, Rui Wang, Jianyun Wei, and Li Liu. 2025. "Micro Gesture Recognition with Multi-Dimensional Feature Fusion and CQ-MobileNetV3 Using FMCW Radar" Sensors 25, no. 22: 6949. https://doi.org/10.3390/s25226949
APA StyleXue, W., Wang, R., Wei, J., & Liu, L. (2025). Micro Gesture Recognition with Multi-Dimensional Feature Fusion and CQ-MobileNetV3 Using FMCW Radar. Sensors, 25(22), 6949. https://doi.org/10.3390/s25226949

