Human Activity Recognition by the Image Type Encoding Method of 3-Axial Sensor Data
Abstract
1. Introduction
2. Related Research
3. Image Type Encoding Method of the 3-Axial Sensor Data
3.1. Three-Axial Acceleration and Gyroscope Data Analysis of the WISDM Dataset
3.2. Walking-Activity-Based Data Argumentation
3.3. Brightness Intensity Distribution Pattern Transformation
3.4. 2 Step SBIDP Enhancement Method
4. Three-Dimensional Visualization Method of BIDP
5. Proposed CNN Architecture for Learning Activity Data
6. Performance Evaluation
6.1. Training Result
6.2. Performance Evaluation Result
6.3. Performance Evaluation Comparison by Using the WISDM Dataset
7. Conclusions
Author Contributions
Funding
Conflicts of Interest
References
- Keusch, F.; Wenz, A.; Conrad, F. Do You Have Your Smartphone with You? Behavioral Barriers for Measuring Everyday Activities with Smartphone Sensors. Comput. Hum. Behav. 2022, 127, 107054. [Google Scholar] [CrossRef]
- Yang, P.; Yang, C.; Lanfranchi, V.; Ciravegna, F. Activity Graph based Convolutional Neural Network for Physical Activity Recognition using Acceleration and Gyroscope Data. IEEE Trans. Ind. Inform. 2022, 18, 6619–6630. [Google Scholar] [CrossRef]
- Alrazzak, U.; Alhalabi, B. A survey on human activity recognition using accelerometer sensor. In Proceedings of the Joint 8th International Conference on Informatics, Electronics & Vision (ICIEV) and 2019 3rd International Conference on Imaging, Vision & Pattern Recognition (icIVPR), Spokane, WA, USA, 30 May–2 June 2019; pp. 152–159. [Google Scholar]
- Huang, J.; Kaewunruen, S.; Ning, J. AI-based quantification of fitness activities using smartphones. Sustainability 2022, 14, 1–19. [Google Scholar] [CrossRef]
- Ehatisham-ul-Haq, M.; Murtaza, F.; Azam, M.A.; Amin, Y. Daily Living Activity Recognition In-The-Wild: Modeling and Inferring Activity-Aware Human Contexts. Electronics 2022, 11, 1–24. [Google Scholar] [CrossRef]
- Wang, J.; Chen, Y.; Hao, S.; Peng, X.; Lisha, H. Deep Learning for Sensor-based Activity Recognition: A Survey. Pattern Recognit. Lett. 2017, 119, 3–11. [Google Scholar] [CrossRef]
- Tian, Y.; Zhang, J.; Wang, J.; Geng, Y.; Wang, X. Robust human activity recognition using single accelerometer via wavelet energy spectrum features and ensemble feature selection. Syst. Sci. Control. Eng. 2020, 8, 83–96. [Google Scholar] [CrossRef]
- Kang, J.; Shin, J.; Shin, J.; Lee, D.; Choi, A. Robust Human Activity Recognition by Integrating Image and Accelerometer Sensor Data Using Deep Fusion Network. Sensors 2022, 22, 174. [Google Scholar] [CrossRef]
- Anguita, D.; Ghio, A.; Oneto, L.; Parra, X.; Reyes-Ortiz, J.L. Human Activity Recognition on Smartphones Using a Multiclass Hardware-Friendly Support Vector Machine. Adv. Nonlinear Speech Process. 2012, 7657, 216–223. [Google Scholar]
- Sengül, G.; Karakaya, M.; Misra, S.; Abayomi-Alli, O.O.; Damaševiˇcius, R. Deep learning based fall detection using smartwatches for healthcare applications. Biomed. Signal Process. Control 2022, 71, 103242. [Google Scholar] [CrossRef]
- Ignatov, A.D.; Strijov, V.V. Human activity recognition using quasiperiodic time series collected from a single tri-axial accelerometer. Multimed. Tools Appl. 2016, 75, 7257–7270. [Google Scholar] [CrossRef]
- Gupta, A.; Semwal, V.B. Multiple task human gait analysis and identification: Ensemble learning approach. In Emotion and Information Processing; A Practical Approach; Springer: Cham, Switzerland, 2020; pp. 185–197. [Google Scholar] [CrossRef]
- Barra, S.; Carta, S.M.; Corriga, A.; Podda, A.S.; Reforgiato Recupero, D. Deep Learning and Time Series-to-Image Encoding for Financial Forecasting. IEEE/CAA J. Autom. Sin. 2020, 7, 683. [Google Scholar] [CrossRef]
- Ahmad, Z.; Khan, N. Inertial Sensor Data to Image Encoding for Human Action Recognition. IEEE Sens. J. 2021, 9, 10978–10988. [Google Scholar] [CrossRef]
- Wang, D.; Wang, T.; Florescu, I. Is Image Encoding Beneficial for Deep Learning in Finance? IEEE Internet Things J. 2020, 9, 5617–5628. [Google Scholar] [CrossRef]
- Estebsari, A.; Rajabi, R. Single residential load forecasting using deep learning and image encoding techniques. Electronics 2020, 9, 68. [Google Scholar] [CrossRef]
- Bulling, A.; Blanke, U.; Schiele, B. A tutorial on human activity recognition using body-worn inertial sensors. ACM Comput. Surv. 2014, 46, 1–33. [Google Scholar] [CrossRef]
- Sadouk, L. CNN approaches for time series classification. In Time Series Analysis-Data, Methods, and Applications; IntechOpen: London, UK, 2019; pp. 1–23. [Google Scholar]
- Vishwakarma, D.K.; Dhiman, C. A unified model for human activity recognition using spatial distribution of gradients and difference of Gaussian kernel. Vis. Comput. 2019, 35, 1595–1613. [Google Scholar] [CrossRef]
- Semwal, V.B.; Nandi, G.C. Generation of joint trajectories using hybrid automate-based model: A rocking block-based approach. IEEE Sens. J. 2016, 16, 5805–5816. [Google Scholar] [CrossRef]
- Teng, Q.; Wang, K.; Zhang, L.; He, J. The layer-wise training convolutional neural networks using local loss for sensor based human activity recognition. IEEE Sens. J. 2020, 20, 7265–7274. [Google Scholar] [CrossRef]
- Agarwal, P.; Alam, M. A Lightweight Deep Learning Model for Human Activity Recognition on Edge Devices. arXiv 2019, arXiv:1909.12917. Available online: https://arxiv.org/abs/1909.12917 (accessed on 8 July 2020).
- Ignatov, A. Real-time human activity recognition from accelerometer data using Convolutional Neural Networks. Appl. Soft Comput. 2018, 62, 915–922. [Google Scholar] [CrossRef]
- Xiao, Z.; Xu, X.; Xing, H.; Song, F.; Wang, X.; Zhao, B. A federated learning system with enhanced feature extraction for human activity recognition. Knowl. Based Syst. 2021, 229, 107338. [Google Scholar] [CrossRef]
- Weiss, G.M. Wisdm smartphone and smartwatch activity and biometrics dataset. In UCI Machine Learning Repository: WISDM Smartphone and Smartwatch Activity and Biometrics Dataset Data Set; 2019; Available online: https://archive.ics.uci.edu/ml/machine-learning-databases/00507/WISDM-dataset-description.pdf (accessed on 8 July 2021).
- Chen, L.J.; Stubbs, B.; Chien, I.C.; Lan, T.H.; Chung, M.S.; Lee, H.L.; Ku, P.W. Associations between daily steps and cognitive function among inpatients with schizophrenia. BMC Psychiatry 2022, 22, 87. [Google Scholar] [CrossRef] [PubMed]
- Yuenyongchaiwat, K.; Pipatsitipong, D.; Sangprasert, P. Increasing walking steps daily can reduce blood pressure and diabetes in overweight participants. Diabetol. Int. 2018, 9, 75–79. [Google Scholar] [CrossRef] [PubMed]
- Nagovitsyn, R.S.; Osipov, A.Y.; Ratmanskaya, T.I.; Loginov, D.V.; Prikhodov, D.S. The Program for Monitoring Students’ Walking and Running according to the System “10,000 Steps a Day” During the Spread of COVID-19. In Proceedings of the Winter Conferences of Sports Science, Costa Blanca Sports Science Events Alicante, Alicante, Spain, 22–23 March 2021. [Google Scholar]
- Willis, W.T.; Ganley, K.J.; Herman, R.M. Fuel oxidation during human walking. Metabolism 2005, 54, 793–799. [Google Scholar] [CrossRef]
- Hallam, K.T.; Bilsborough, S.; De Courten, M. “Happy feet”: Evaluating the benefits of a 100-day 10,000 step challenge on mental health and wellbeing. BMC Psychiatry 2018, 18, 19. [Google Scholar] [CrossRef]
- Li, X.; Wang, Y.; Zhang, B.; Ma, J. PSDRNN: An efficient and effective HAR scheme based on feature extraction and deep learning. IEEE Trans. Ind. Inform. 2020, 16, 6703–6713. [Google Scholar] [CrossRef]
- Xia, K.; Huang, J.; Wang, H. LSTM-CNN architecture for human activity recognition. IEEE Access 2020, 8, 56855–56866. [Google Scholar] [CrossRef]
- Pienaar, S.W.; Malekian, R. Human Activity Recognition using LSTM-RNN Deep Neural Network Architecture. In Proceedings of the 2019 IEEE 2nd Wireless Africa Conference (WAC), Pretoria, South Africa, 18–20 August 2019; pp. 1–5. [Google Scholar]
- Dua, N.; Singh, S.N.; Semwal, V.B. Multi-input CNN-GRU based human activity recognition using wearable sensors. Computing 2021, 103, 1461–1478. [Google Scholar] [CrossRef]
- Verma, U.; Tyagi, P.; Kaur, M. Single Input Single Head CNN-GRU-LSTM Architecture for Recognition of Human Activities. Indones. J. Electr. Eng. Inform 2022, 10, 410–420. [Google Scholar] [CrossRef]
- Li, F.; Shirahama, K.; Nisar, M.; Köping, L.; Grzegorzek, M. Comparison of Feature Learning Methods for Human Activity Recognition Using Wearable Sensors. Sensors 2018, 18, 679. [Google Scholar] [CrossRef]
- Simonyan, K.; Zisserman, A. Very Deep Convolutional Networks for Large-Scale Image Recognition. arXiv arXiv:1409.1556, 2014.
- Szegedy, C.; Vanhoucke, V.; Ioffe, S.; Shlens, J.; Wojna, Z. Rethinking the Inception Architecture for Computer Vision. In Proceedings of the IEEE Computer Society Conference on Computer Vision and Pattern Recognition, Las Vegas, NV, USA, 27 June–28 July 2016; Volume 1, pp. 2818–2826. [Google Scholar]
- Chollet, F. Xception: Deep Learning with Depthwise Separable Convolutions. In Proceedings of the 2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR), Honolulu, HI, USA, 21–26 July 2017; pp. 1800–1807. [Google Scholar]
- Tan, M.; Le, Q.V. EfficientNet: Rethinking Model Scaling for Convolutional Neural Networks. Proc. Mach. Learn. Res. 2019, 97, 6105–6114. [Google Scholar]
- Lu, L.; Zhang, C.; Cao, K.; Deng, T.; Yang, Q. A multichannel CNN-GRU model for human activity recognition. IEEE Access 2022, 10, 66797–66810. [Google Scholar] [CrossRef]
- Zhang, Y.; Zhang, Z.; Zhang, Y.; Bao, J.; Zhang, Y.; Deng, H. Human Activity Recognition Based on Motion Sensor Using U-Net. IEEE Access 2019, 7, 75213–75226. [Google Scholar] [CrossRef]
- Zhang, H.; Xiao, Z.; Wang, J.; Li, F.; Szczerbicki, E. A novel IoT-perceptive human activity recognition (HAR) approach using multihead convolutional attention. IEEE Internet Things J. 2019, 7, 1072–1080. [Google Scholar] [CrossRef]














| Label | Activity | No. of Columns | No. of Merged Columns | No. of Data | Grouping Type | |
|---|---|---|---|---|---|---|
| Accel | Gyro | |||||
| 0 | Walking | 279,817 | 203,919 | 152,114 | 51 | A | 
| 1 | Jogging | 268,409 | 200,252 | 154,020 | 49 | A | 
| 2 | Stairs | 255,645 | 197,857 | 160,430 | 50 | A | 
| 3 | Sitting | 264,592 | 202,370 | 180,315 | 51 | A | 
| 4 | Standing | 269,604 | 202,351 | 165,068 | 51 | A | 
| 5 | Typing | 246,356 | 194,540 | 166,646 | 49 | B | 
| 6 | Brushing teeth | 269,609 | 202,622 | 168,771 | 51 | B | 
| 7 | Eating soup | 270,756 | 202,408 | 164,177 | 51 | C | 
| 8 | Eating chips | 261,360 | 197,905 | 160,237 | 50 | C | 
| 9 | Eating pasta | 249,793 | 197,844 | 170,598 | 50 | C | 
| 10 | Drinking | 285,190 | 202,395 | 149,138 | 51 | C | 
| 11 | Eating sandwich | 265,781 | 197,915 | 164,635 | 51 | C | 
| 12 | Kicking | 278,766 | 202,625 | 150,651 | 51 | A | 
| 13 | Catching | 272,219 | 198,756 | 146,675 | 50 | B | 
| 14 | Dribbling | 272,730 | 202,331 | 150,333 | 51 | B | 
| 15 | Writing | 260,497 | 197,894 | 175,638 | 51 | B | 
| 16 | Clapping | 268,065 | 202,330 | 165,304 | 51 | B | 
| 17 | Folding clothes | 265,214 | 202,321 | 164,006 | 51 | B | 
| Our Model | |
|---|---|
| Accuracy | 99.52% | 
| Loss | 0.0134 | 
| Class | Precision | Recall | F1 Score | 
|---|---|---|---|
| 0 | 99.0% | 99.0% | 99.0% | 
| 1 | 99.0% | 99.0% | 99.0% | 
| 2 | 97.0% | 97.0% | 97.0% | 
| 3 | 99.0% | 99.0% | 99.0% | 
| 4 | 99.0% | 99.0% | 99.0% | 
| 5 | 100.0% | 100.0% | 100.0% | 
| 6 | 100.0% | 100.0% | 100.0% | 
| 7 | 99.0% | 99.0% | 99.0% | 
| 8 | 99.0% | 99.0% | 99.0% | 
| 9 | 99.0% | 99.0% | 99.0% | 
| 10 | 99.0% | 99.0% | 99.0% | 
| 11 | 99.0% | 99.0% | 99.0% | 
| 12 | 91.0% | 93.0% | 92.0% | 
| 13 | 94.0% | 93.0% | 94.0% | 
| 14 | 96.0% | 96.0% | 96.0% | 
| 15 | 100.0% | 100.0% | 100.0% | 
| 16 | 99.0% | 99.0% | 99.0% | 
| 17 | 97.0% | 98.0% | 97.0% | 
| Ref. | Model | F1 Score (%) | Accuracy (%) | 
|---|---|---|---|
| [31] | Tri-PSRNN | 96.62 | 94.76 | 
| [31] | PSDRNN | 94.01 | 93.06 | 
| [32] | LSTM-CNN | - | 95.85 | 
| [33] | LSTM-RNN | 95.40 | 96.40 | 
| [34] | Single-input CNN-GRU model A | 92.42 | 92.03 | 
| [34] | Single-input CNN-GRU model B | 94.50 | 94.71 | 
| [34] | Single-input CNN-GRU model C | 92.55 | 92.37 | 
| [34] | Multi-input CNN-LSTM | 95.55 | 95.45 | 
| [34] | Multi-input CNN-GRU | 97.22 | 97.21 | 
| [35] | CNN-GRU-LSTM | 98.52 | 98.51 | 
| - | Proposed model | 98.00 | 98.15 | 
| Ref. | Model | No. of Activities | Layer | F1 Score (%) | Accuracy (%) | 
|---|---|---|---|---|---|
| [36] | Baseline | 6 | 10 | - | 89.55 | 
| [37] | VGG16 | 6 | 23 | - | 89.32 | 
| [38] | Inception-V3 | 6 | 313 | - | 91.54 | 
| [39] | Xception | 6 | 126 | - | 90.17 | 
| [40] | EfficientNet B0 | 6 | 233 | - | 89.11 | 
| [23] | CNN | 6 | 6 | - | 93.32 | 
| [41] | Multichannel CNN-GRU | 6 | 9 | 96.39 | 96.41 | 
| [42] | U-Net | 6 | 11 | 96.50 | 96.40 | 
| [43] | CNN with an attention mechanism | 6 | 6 | - | 96.40 | 
| - | Proposed Model | 18 | 8 | 98.00 | 98.15 | 
| Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content. | 
© 2023 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).
Share and Cite
Kim, C.; Lee, W. Human Activity Recognition by the Image Type Encoding Method of 3-Axial Sensor Data. Appl. Sci. 2023, 13, 4961. https://doi.org/10.3390/app13084961
Kim C, Lee W. Human Activity Recognition by the Image Type Encoding Method of 3-Axial Sensor Data. Applied Sciences. 2023; 13(8):4961. https://doi.org/10.3390/app13084961
Chicago/Turabian StyleKim, Changmin, and Woobeom Lee. 2023. "Human Activity Recognition by the Image Type Encoding Method of 3-Axial Sensor Data" Applied Sciences 13, no. 8: 4961. https://doi.org/10.3390/app13084961
APA StyleKim, C., & Lee, W. (2023). Human Activity Recognition by the Image Type Encoding Method of 3-Axial Sensor Data. Applied Sciences, 13(8), 4961. https://doi.org/10.3390/app13084961
 
        




 
       